[R] join tables in R

From: Roccato Alfredo (UniCredit) <Alfredo.Roccato_at_unicredit.eu>
Date: Wed, 04 May 2011 16:32:14 +0200


I'd to match-merge 2 tables in such a manner that I keep all the rows in table 1, but not the rows that are in both table 1 and 2. Thank you for your help,
Alfredo

> master <- data.frame(ID=2001:2011)
> train <- data.frame(ID=2004:2006)
> valid <- ???

in this example table valid should have the following

> str(valid)

 Year: int 2001 2002 2003 2007 2008 2009 2010 2011

in SAS I'd do the following:
data master; do id=2001 to 2011; output; end; run; data train; do id=2004 to 2006; output; end; run; data valid; merge master(in=a) train(in=b); by id; if a and not b; run;

and in SQL:
create table valid as
  select a.* from master where ID not in (select ID from train)

        [[alternative HTML version deleted]]



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Thu 05 May 2011 - 06:25:02 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Thu 05 May 2011 - 07:00:05 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive