Re: [R] randomly select duplicated entries

From: Marc Schwartz <marc_schwartz_at_comcast.net>
Date: Wed, 09 Jul 2008 15:52:35 -0500

on 07/09/2008 02:17 PM Juliet Hannah wrote:
> Using this data as an example
>
> dat <- read.table(textConnection("Id myvar
> 12 1
> 12 2
> 12 6
> 34 9
> 34 4
> 34 8
> 65 15
> 65 23"), header = TRUE)
> closeAllConnections()
>
> how can I create another data set that does not have duplicate entries
> for 'Id', but the included values
> are randomly selected from the available ones.
>
> Thanks!
>
> Juliet

 > aggregate(dat$myvar, list(dat$Id), sample, 1)    Group.1 x

1      12  6
2      34  4
3      65 15

 > aggregate(dat$myvar, list(dat$Id), sample, 1)    Group.1 x

1      12  2
2      34  9
3      65 15

 > aggregate(dat$myvar, list(dat$Id), sample, 1)    Group.1 x

1      12  1
2      34  8
3      65 23


HTH, Marc Schwartz



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Wed 09 Jul 2008 - 21:03:14 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Wed 09 Jul 2008 - 22:31:21 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive