Re: [Rd] duplicates() function

From: Kasper Daniel Hansen <kasperdanielhansen_at_gmail.com>
Date: Fri, 08 Apr 2011 11:21:00 -0400

On Fri, Apr 8, 2011 at 11:08 AM, Joshua Ulrich <josh.m.ulrich_at_gmail.com> wrote:
> How about:
>
> y <- rep(NA,length(x))
> y[duplicated(x)] <- match(x[duplicated(x)] ,x)
>

I use Joshua's trick all the time. But it might still be nice with a C implementation.

While we are discussing duplication, I would also like to see something like duplicated() but which returns TRUE whenever a value is later duplicated, so I can easily select the values of a vector which has are never duplicated. Right now I need to do something like   y [ ! y %in% y[duplicated(y)] ]
I am only bringing this up because of Duncan's request.

Kasper

> --
> Joshua Ulrich  |  FOSS Trading: www.fosstrading.com
>
>
>
> On Fri, Apr 8, 2011 at 9:59 AM, Duncan Murdoch <murdoch.duncan_at_gmail.com> wrote:
>> I need a function which is similar to duplicated(), but instead of returning
>> TRUE/FALSE, returns indices of which element was duplicated.  That is,
>>
>>> x <- c(9,7,9,3,7)
>>> duplicated(x)
>> [1] FALSE FALSE  TRUE FALSE TRUE
>>
>>> duplicates(x)
>> [1] NA NA  1 NA  2
>>
>> (so that I know that element 3 is a duplicate of element 1, and element 5 is
>> a duplicate of element 2, whereas the others were not duplicated according
>> to our definition.)
>>
>> Is there a simple way to write this function?  I have  an ugly
>> implementation in R that loops over all the values; it would make more sense
>> to redo it in C, if there isn't a simple implementation I missed.
>>
>> Duncan Murdoch
>>
>> ______________________________________________
>> R-devel_at_r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-devel
>>
>
> ______________________________________________
> R-devel_at_r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
>



R-devel_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel Received on Fri 08 Apr 2011 - 15:35:13 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Fri 08 Apr 2011 - 15:40:43 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-devel. Please read the posting guide before posting to the list.

list of date sections of archive