RE: [R] extract rows in dataframe with duplicated column values

From: Marc Schwartz <MSchwartz_at_medanalytics.com>
Date: Fri 18 Mar 2005 - 14:46:18 EST

Here's one more possibility:

> subset(x, a %in% a[duplicated(a)])

  a b
2 2 10
3 2 10
4 3 10
5 3 10
6 3 10
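
For anyone reading this later, the one-liner can be unpacked into its steps. This is only a sketch on the sample data from the original question; the names dup_flag, dup_vals, res and res2 are made up for illustration. The last line assumes a version of R recent enough that duplicated() accepts a fromLast argument, which may not be true of every installation.

```r
# Sample data from the original question
x <- data.frame(a = c(1, 2, 2, 3, 3, 3), b = 10)

# Step 1: duplicated() flags the second and later occurrences of each value,
# so the first member of each duplicated group is NOT flagged
dup_flag <- duplicated(x$a)

# Step 2: collect the values that occur more than once
dup_vals <- unique(x$a[dup_flag])

# Step 3: keep every row whose value is in that set, which also picks up
# the first occurrence that duplicated() left out
res <- x[x$a %in% dup_vals, ]

# Assumed alternative: scan from both ends and combine the flags.
# Requires duplicated() to support fromLast.
res2 <- x[duplicated(x$a) | duplicated(x$a, fromLast = TRUE), ]

print(res)
```

Both res and res2 should hold rows 2 through 6. The %in% form makes no assumption about column type, so it should behave the same way for character or factor columns.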

HTH, Marc Schwartz
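
A note on the first attempt quoted below, which returns rows 2, 3, 5 and 6 and misses row 4. The likely reason is that table(x[,1]) > 1 has one element per distinct value (three here), not one per row, so R recycles it across the six rows when it is used as a row index. A minimal sketch of that effect, where idx and recycled are illustrative names only:

```r
# Sample data from the original question
x <- data.frame(a = c(1, 2, 2, 3, 3, 3), b = 10)

# One logical per distinct value of a (values 1, 2, 3), not per row
idx <- table(x$a) > 1
length(idx)

# Used as a row index, a short logical vector is recycled to nrow(x).
# rep() reproduces that recycling explicitly.
recycled <- rep(as.vector(idx), length.out = nrow(x))

# Rows that end up selected
which(recycled)
```

The selected positions line up with the quoted output only by accident of the recycling pattern, which is why the second attempt maps the counts back to values with %in%.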

On Thu, 2005-03-17 at 22:25 -0500, Liaw, Andy wrote:
> OK, strike one...
>
> Here's my second try:
>
> > cnt <- table(x[,1])
> > v <- as.numeric(names(cnt[cnt > 1]))
> > v
> [1] 2 3
> > x[x[,1] %in% v, ]
>   a b
> 2 2 10
> 3 2 10
> 4 3 10
> 5 3 10
> 6 3 10
>
> Andy
>
> > From: Liaw, Andy
> >
> > Does this work for you?
> >
> > > x[table(x[,1]) > 1,]
> >   a b
> > 2 2 10
> > 3 2 10
> > 5 3 10
> > 6 3 10
> >
> > Andy
> >
> > > From: Tiago R Magalhaes
> > >
> > > Hi
> > >
> > > I want to extract all the rows in a data frame that have duplicates
> > > for a given column.
> > > I would expect this question to come up pretty often, but I have
> > > searched the archives and, surprisingly, couldn't find anything.
> > > The best I can come up with is:
> > >
> > > x <- data.frame(a=c(1,2,2,3,3,3), b=10)
> > > xdup1 <- duplicated(x[,1])
> > > xdup2 <- duplicated(x[,1][nrow(x):1])[nrow(x):1]
> > > xAllDups <- x[(xdup1+xdup2)!=0,]
> > >
> > > This seems to work, but it's so convoluted that I'm sure there's a
> > > better method.
> > > Thanks for any help and enlightenment
> > >
> > > ______________________________________________
> > > R-help@stat.math.ethz.ch mailing list
> > > https://stat.ethz.ch/mailman/listinfo/r-help
> > > PLEASE do read the posting guide!
> > > http://www.R-project.org/posting-guide.html
> > >
> > >
> > >
> >
>



Received on Fri Mar 18 14:51:23 2005

This archive was generated by hypermail 2.1.8 : Fri 03 Mar 2006 - 03:30:51 EST