[R] RJDBC vs RMySQL vs ???

From: Ralf B <ralf.bierig_at_gmail.com>
Date: Wed, 23 Jun 2010 15:40:56 -0400


I am running a simple SQL SELECT statement that involves 50k+ data points, using R and the RJDBC interface. Response times are very slow in both the RGui and the R console. When I run the same statement directly in a SQL client, it completes much faster, which means that the SQL statement itself is not the problem.

Has anyone compared RJDBC with RMySQL, or is there a better, more efficient way to extract large amounts of data from databases using R? Would you recommend dumping the data completely into flat files and working with those instead? I did not expect this to be such a problem, given that businesses maintain their data in databases and R is supposed to be good at moving data around. Am I doing something wrong?

Ralf



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Received on Wed 23 Jun 2010 - 19:43:34 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Wed 23 Jun 2010 - 20:50:34 GMT.
