[R] Summarising by group

From: Martyn Byng <Martyn.Byng_at_nag.co.uk>
Date: Mon, 09 May 2011 10:12:57 +0100

I wonder if someone with more experience than me on using R to summarise by group wants to post a reply to this

http://www.analyticbridge.com/group/sasandstatisticalprogramming/forum/t opics/why-still-use-sas-with-a-lot

To save everyone having to follow the link, the text is copied below

"SAS has some nice features, such as the SQL procedure or simple "group by" features. Try to compute correlations "by group" in R: say you have 2,000 groups, 2 variables e.g. salary and education level, and 2 million observations - you want to compute correlation between salary and education within each group.

It is not obvious, your best bet is to use some R package (see sample code on Analyticbridge to do it), and the solution is painful, you can not return both correlation and stdev "by group", as the function can return only one argument, not a vector. So if you want to return not just two, but say 100 metrics, it becomes a nightmare."

The Numerical Algorithms Group Ltd is a company registered in England and Wales with company number 1249803. The registered office is: Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.

This e-mail has been scanned for all viruses by Star. Th...{{dropped:4}}

R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Mon 09 May 2011 - 09:17:47 GMT

This quarter's messages: by month, or sorted: [ by date ] [ by thread ] [ by subject ] [ by author ]

All messages

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Mon 09 May 2011 - 14:40:06 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive