Re: [R] "Large" data set: performance issue

About this list Date view Thread view Subject view Author view Attachment view

From: Peter Dalgaard BSA (p.dalgaard@biostat.ku.dk)
Date: Wed 03 Apr 2002 - 00:16:41 EST


Message-id: <x2wuvqgp86.fsf@blueberry.kubism.ku.dk>

Till Baumgaertel <till.baumgaertel@epost.de> writes:

> hi all,
>
> I've got to import CSV-datasets (with variable-names in the first line)
> into data.frames. each is about 12MB (or more!) with 1823 columns and about
> 500 rows. the first 22 columns are in "character"-mode, the rest is "numeric".
>
> I run R 1.4.1 on a Windows 2000 system.

What happens if you try this?:

datfull <- read.csv("foo", colClasses=rep(c("character","numeric"),c(22,1801)))

-- 
   O__  ---- Peter Dalgaard             Blegdamsvej 3  
  c/ /'_ --- Dept. of Biostatistics     2200 Cph. N   
 (*) \(*) -- University of Copenhagen   Denmark      Ph: (+45) 35327918
~~~~~~~~~~ - (p.dalgaard@biostat.ku.dk)             FAX: (+45) 35327907
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request@stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._


About this list Date view Thread view Subject view Author view Attachment view

This archive was generated by hypermail 2.1.3 : Wed 16 Oct 2002 - 11:57:10 EST