Re: [R] median of binned values

From: Chuck Cleland <ccleland_at_optonline.net>
Date: Wed, 19 Dec 2007 04:33:11 -0500

Martin Tomko wrote:
> Dear list,
> I have a vector (array, table row, whatever is best) of frequency values
> for categories (or bins), and I need to find the median category.
> Trivial to do by hand, but I was wondering if there is a means to do it
> in R in an elegant way.
>
> The obvious medioan(vector) returns the median frequency for the binns,
> and that is not what I want. i.e,:
> freq
> cat1 1
> cat2 10
> cat3 100
> cat4 1000
> cat5 10000
>
> I want it to return cat5, instead of cat3.

df <- data.frame(binname = as.factor(paste("cat", 1:5, sep="")),

                 freq = c(1,10,100,1000,10000))

df
  binname freq
1 cat1 1
2 cat2 10
3 cat3 100
4 cat4 1000
5 cat5 10000

with(df, levels(binname)[median(rep(as.numeric(binname), freq))]) [1] "cat5"

> Thanks a lot
> Martin
>
> ______________________________________________
> R-help_at_r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.

-- 
Chuck Cleland, Ph.D.
NDRI, Inc.
71 West 23rd Street, 8th floor
New York, NY 10010
tel: (212) 845-4495 (Tu, Th)
tel: (732) 512-0171 (M, W, F)
fax: (917) 438-0894

______________________________________________
R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Received on Wed 19 Dec 2007 - 09:36:51 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Wed 19 Dec 2007 - 23:30:19 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.