Re: [R] Very Large Data Sets

About this list Date view Thread view Subject view Author view

Bill Venables (William.Venables@cmis.CSIRO.AU)
Thu, 23 Dec 1999 16:05:23 +1000



Tony Fagan asks:

> List,

Sir,

> Can R handle very large data sets (say, 100 million records) for data
> mining applications?

The question assumes that the data handling capacity is a
property of the software alone, which is nonsense. It is partly
a property of the software, partly of what you want to do with
the records, but mostly of the system on which it is run.

> My understanding is that Splus can not, but SAS can easily.

Try handling 100 million records with SAS (or anything else) on a
486 and see how easily it does it.

More seriously, the consensus is that on the same modern system
SAS is usually better able to handle large, dumb calculations
than S-PLUS, which is (generally) better than R. Horses for
courses.

Bill Venables.

-- 
-----------------------------------------------------------------
Bill Venables, Statistician, CMIS Environmetrics Project.

Physical address: Postal address: CSIRO Marine Laboratories, PO Box 120, 233 Middle St, Cleveland, Queensland Cleveland, Qld, 4163 AUSTRALIA AUSTRALIA

Telephone: +61 7 3826 7251 Email: Bill.Venables@cmis.csiro.au

Fax: +61 7 3826 7304

-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request@stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._


About this list Date view Thread view Subject view Author view

This archive was generated by hypermail 2.0b3 on Tue 04 Jan 2000 - 13:34:03 EST