Re: [R] Multiple count if style "queries"

From: David Winsemius <dwinsemius_at_comcast.net>
Date: Fri, 20 May 2011 17:57:50 -0400

On May 20, 2011, at 5:50 PM, Chris Mcowen wrote:

> Sorry for not including the data, i did intend to.

You included the data, just not in a format that anyone could reasonably be expected to edit so it can be used. Please read the Posting Guide.

-- 
David.


>
> I tried your code but got this error:
>
>>> answer <- sqldf("
>> + select ECO_NAME
>> + , count(distinct Order) as Order
>> + , count(distinct Family) as Family
>> + , count(distinct Genus) as Genus
>> + from data2
>> + group by ECO_NAME
>> + ")
>> Loading required package: tcltk
>> Loading Tcl/Tk interface ... done
>> Error in sqliteExecStatement(con, statement, bind.data) :
>> RS-DBI driver: (error in statement: near "Order": syntax error)
>
> For the second question i would want:
>
> The number of times each Order/Family and Genus was represented in
> each eco_region.
>
> I.E
>
> Biak-Numfoor rain forests
> Order: Alismatales-1, Araceae-1
> Family: Homalomena -1, Asparagales-1
> Genus: Orchidaceae-1, Bromheadia-1
>
> So the number after each name is the count.
>
> Thanks for the help.
>
> Chris
>
>
> data2 <- structure(list(ECO_NAME = structure(c(1L, 1L, 2L, 2L, 2L, 2L,
> 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 3L,
> 3L, 3L, 3L, 3L, 3L), .Label = c("Biak-Numfoor rain forests",
> "Central Range montane rain forests", "Huon Peninsula montane rain
> forests"
> ), class = "factor"), Order = structure(c(1L, 2L, 1L, 2L, 2L,
> 2L, 2L, 2L, 2L, 2L, 2L, 3L, 4L, 4L, 4L, 4L, 4L, 4L, 5L, 5L, 5L,
> 2L, 2L, 2L, 2L, 4L, 4L), .Label = c("Alismatales", "Asparagales",
> "Commelinales", "Poales", "Zingiberales"), class = "factor"),
> Family = structure(c(1L, 7L, 1L, 6L, 7L, 7L, 7L, 7L, 7L,
> 7L, 7L, 3L, 2L, 4L, 4L, 5L, 8L, 8L, 9L, 9L, 9L, 7L, 7L, 7L,
> 7L, 2L, 8L), .Label = c("Araceae", "Centrolepidaceae",
> "Commelinaceae",
> "Cyperaceae", "Eriocaulaceae", "Lomandraceae", "Orchidaceae",
> "Poaceae", "Zingiberaceae"), class = "factor"), Genus =
> structure(c(13L,
> 3L, 13L, 6L, 19L, 9L, 20L, 4L, 4L, 9L, 9L, 15L, 5L, 14L,
> 10L, 11L, 17L, 16L, 1L, 8L, 2L, 18L, 7L, 19L, 12L, 5L, 16L
> ), .Label = c("Alpinia", "Amomum", "Bromheadia", "Bulbophyllum",
> "Centrolepis", "Cordyline", "Corybas", "Curcuma", "Dendrobium",
> "Eleocharis", "Eriocaulon", "Glomera", "Homalomena", "Machaerina",
> "Murdannia", "Poa", "Schizostachyum", "Taeniophyllum",
> "Thelymitra",
> "Vanda"), class = "factor")), .Names = c("ECO_NAME", "Order",
> "Family", "Genus"), class = "data.frame", row.names = c(NA, -27L
> ))
>
>
> On 20 May 2011, at 22:33, jim holtman wrote:
>
> use the 'sqldf' package. Also use 'dput' to include sample data since
> it was impossible to use the data in the format you provided, so my
> guess at a solution would be:
>
> answer <- sqldf("
> select ECO_NAME
> , count(distinct Order) as Order
> , count(distinct Family) as Family
> , count(distinct Genus) as Genus
> from yourData
> group by ECO_NAME
> ")
>
> Not sure exactly what you wanted in the second question; it would be
> nice to include what some sample output would be so we have an idea of
> what you are after.
>
> On Fri, May 20, 2011 at 5:18 PM, Chris Mcowen
> <chrismcowen_at_gmail.com> wrote:
>> Dear List,
>>
>> I am looking to calculate two things from my data frame and was
>> after some advice. For the example below i want to know.
>>
>> 1. How many unique Orders/Families and Genera there are per eco-name
>>
>> 2. How many incidences are there for each Order/Family and Genus
>> there are per eco-region
>>
>> I have 650 econame.
>>
>> I.e for Biak-Numfoor rain forests there are 2 orders, 2 families
>> and two genera. Also, Alismatales are represented once, Asparagales
>> once etc etc.
>>
>> Thanks for any advice.
>>
>> Chris
>>
>>
>> ECO_NAME
>> Order Family Genus
>> Biak-Numfoor rain forests
>> Alismatales Araceae Homalomena
>> Biak-Numfoor rain forests
>> Asparagales Orchidaceae Bromheadia
>> Central Range montane rain forests Alismatales
>> Araceae Homalomena
>> Central Range montane rain forests Asparagales
>> Lomandraceae Cordyline
>> Central Range montane rain forests Asparagales
>> Orchidaceae Thelymitra
>> Central Range montane rain forests Asparagales
>> Orchidaceae Dendrobium
>> Central Range montane rain forests Asparagales
>> Orchidaceae Vanda
>> Central Range montane rain forests Asparagales
>> Orchidaceae Bulbophyllum
>> Central Range montane rain forests Asparagales
>> Orchidaceae Bulbophyllum
>> Central Range montane rain forests Asparagales
>> Orchidaceae Dendrobium
>> Central Range montane rain forests Asparagales
>> Orchidaceae Dendrobium
>> Central Range montane rain forests Commelinales
>> Commelinaceae Murdannia
>> Central Range montane rain forests Poales
>> Centrolepidaceae Centrolepis
>> Central Range montane rain forests Poales
>> Cyperaceae Machaerina
>> Central Range montane rain forests Poales
>> Cyperaceae Eleocharis
>> Central Range montane rain forests Poales
>> Eriocaulaceae Eriocaulon
>> Central Range montane rain forests Poales
>> Poaceae Schizostachyum
>> Central Range montane rain forests Poales
>> Poaceae Poa
>> Central Range montane rain forests Zingiberales
>> Zingiberaceae Alpinia
>> Central Range montane rain forests Zingiberales
>> Zingiberaceae Curcuma
>> Central Range montane rain forests Zingiberales
>> Zingiberaceae Amomum
>> Huon Peninsula montane rain forests Asparagales
>> Orchidaceae Taeniophyllum
>> Huon Peninsula montane rain forests Asparagales
>> Orchidaceae Corybas
>> Huon Peninsula montane rain forests Asparagales
>> Orchidaceae Thelymitra
>> Huon Peninsula montane rain forests Asparagales
>> Orchidaceae Glomera
>> Huon Peninsula montane rain forests Poales
>> Centrolepidaceae Centrolepis
>> Huon Peninsula montane rain forests Poales
>> Poaceae Poa
>> ______________________________________________
>> R-help_at_r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>
>
>
> --
> Jim Holtman
> Data Munger Guru
>
> What is the problem that you are trying to solve?
>
> ______________________________________________
> R-help_at_r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD West Hartford, CT ______________________________________________ R-help_at_r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Received on Fri 20 May 2011 - 22:00:14 GMT

This quarter's messages: by month, or sorted: [ by date ] [ by thread ] [ by subject ] [ by author ]

All messages

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Fri 20 May 2011 - 22:20:08 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive