[R] which duplicated rows to delete

From: Søren Merser <merser_at_image.dk>
Date: Mon 30 Oct 2006 - 10:11:01 GMT


Hi
Say I have this vector with several duplicates:
> x <- c(1,2,3,4,2,6,2,8,2,3)

> which(duplicated(x))
[1]  5  7  9 10

But what I really want is something like:
List({2,5,7,9}, {3,10}, ...)
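
One possible way to build such a list, as a minimal sketch in base R: `split()` groups the positions of `x` by value, and `Filter()` keeps only the groups with more than one member. The names `groups` and `dupGroups` are made up here for illustration.

```r
x <- c(1, 2, 3, 4, 2, 6, 2, 8, 2, 3)

# Group the positions of x by the value found at each position
groups <- split(seq_along(x), x)

# Keep only the values that occur more than once
dupGroups <- Filter(function(idx) length(idx) > 1, groups)

print(dupGroups)
```

For the `x` above, this gives positions 2, 5, 7, 9 for the value 2 and positions 3, 10 for the value 3. The list elements are named after the duplicated values.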

Then from each sublist I can specify which of the duplicate items to drop

res <- NULL
for (vec in myDuplicateList) {
        # from each group of duplicate rows, keep those that satisfy myCrit
        res <- rbind(res, subset(data[vec, ], myCrit))
}
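
A self-contained sketch of that per-group selection might look as follows. The columns `id` and `score`, the example values, and the criterion "keep the row with the highest score" are all assumptions made up for illustration; only `data`, `myDuplicateList` and `res` come from the loop above.

```r
# Hypothetical data: 'id' is the duplicated key, 'score' decides which row to keep
data <- data.frame(id    = c(1, 2, 3, 4, 2, 6, 2, 8, 2, 3),
                   score = c(5, 1, 7, 2, 9, 4, 3, 8, 6, 2))

# Row indices grouped by key, restricted to keys that occur more than once
myDuplicateList <- Filter(function(idx) length(idx) > 1,
                          split(seq_len(nrow(data)), data$id))

# From each group keep the row with the highest score (an assumed criterion)
kept <- lapply(myDuplicateList, function(vec) {
  grp <- data[vec, ]
  grp[which.max(grp$score), ]
})

# Bind the selected rows once, instead of growing 'res' inside a loop
res <- do.call(rbind, kept)
print(res)
```

Collecting the pieces with `lapply()` and binding them once at the end avoids repeatedly copying `res`, which can matter when there are many groups.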

I can get part of the way by sorting my original data appropriately, since it is the second and following rows that are 'marked' as duplicates, but that's not quite enough.
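
For reference, the sorting approach could be sketched like this. It only works when the choice of row can be expressed as a sort order; the columns `id` and `score` and the rule "highest score wins" are assumptions for illustration.

```r
# Hypothetical data: 'id' is the duplicated key, 'score' ranks the rows
data <- data.frame(id    = c(1, 2, 3, 4, 2, 6, 2, 8, 2, 3),
                   score = c(5, 1, 7, 2, 9, 4, 3, 8, 6, 2))

# Sort so that, within each id, the preferred row (highest score) comes first
sorted <- data[order(data$id, -data$score), ]

# duplicated() marks the second and later rows per id, so negating it
# keeps exactly the first (preferred) row of every id
deduped <- sorted[!duplicated(sorted$id), ]
print(deduped)
```

Unlike the per-group list, this keeps one row for every `id`, including those that were never duplicated.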

Hoping for some hints
Kind regards Søren



R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Received on Tue Oct 31 03:13:37 2006

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.1.8, at Mon 30 Oct 2006 - 16:30:14 GMT.