Re: [R] read.table

From: Gabor Grothendieck <ggrothendieck_at_gmail.com>
Date: Thu 14 Jul 2005 - 07:08:51 EST

[I had some email problems and am sending this again. Sorry if you get it twice.]

You could use the nlines= argument of scan() to read the file in a portion at a time, which also shows how far the read gets before it stalls.
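
As a rough sketch of that idea (not code from this thread): the helper below, with the made-up name read_in_chunks and an arbitrary default block size of 10000 lines, keeps one connection open and calls scan() with nlines= repeatedly, printing a running line count. It assumes an all-numeric file with a fixed number of fields per line, as in the question.

```r
# Hypothetical helper (illustration only): read a delimited numeric file
# in blocks of `chunk_lines` lines and report progress after each block.
read_in_chunks <- function(path, ncol, sep = "|", na.strings = ".",
                           chunk_lines = 10000) {
  con <- file(path, open = "r")
  on.exit(close(con))
  chunks <- list()
  lines_read <- 0
  repeat {
    # scan() on an already-open connection continues from the current position
    x <- scan(con, what = numeric(), sep = sep, na.strings = na.strings,
              nlines = chunk_lines, quiet = TRUE)
    if (length(x) == 0) break
    # fails if this block does not consist of whole rows of `ncol` fields
    stopifnot(length(x) %% ncol == 0)
    # scan() returns values line by line, so fill the matrix by row
    chunks[[length(chunks) + 1]] <- matrix(x, ncol = ncol, byrow = TRUE)
    lines_read <- lines_read + length(x) / ncol
    message("read ", lines_read, " lines")
  }
  do.call(rbind, chunks)
}
```

For the file in the question this might be called as trn <- read_in_chunks('train1.dat', ncol = 195). If scan() stops with an error, the last progress message narrows the problem down to the following block of lines. count.fields('train1.dat', sep = '|') is another way to look for a line with the wrong number of fields.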

> On 7/13/05, Weiwei Shi <helprhelp@gmail.com> wrote:
> > add:
> > I used
> > trn<-matrix(scan('train1.dat', sep='|', na.string='.'), nrow=273529, ncol=195)
> >
> > it is done.
> > so it seems that I just have no patience to wait for half an hour :)
> >
> > but i still have that question:
> > is there a way to track the process if it takes too long. Could we
> > stop in the middle to see at which line it "hesitates" to move on?
> >
> > regards,
> >
> > weiwei
> >
> >
> > On 7/13/05, Weiwei Shi <helprhelp@gmail.com> wrote:
> > > Hi,
> > > I have a question on read.table.
> > >
> > > I have a dataset with 273,000 lines and 195 columns. I used the
> > > read.table to load the data into R:
> > > trn<-read.table('train1.dat', header=F, sep='|', na.strings='.')
> > > I found it takes forever.
> > >
> > > then I run 1/10 of the data (test) using read.table again. And this
> > > time it finished quickly. So, there might be something wrong in my
> > > data format causing that problem.
> > >
> > > then, my question is, is there a way in R to track at which line,
> > > something wrong occurs?
> > >
> > > Thanks,
> > >
> > > Weiwei
> > >
> > >
> >
> >
> > --
> > Weiwei Shi, Ph.D
> >
> > "Did you always know?"
> > "No, I did not. But I believed..."
> > ---Matrix III
> >
> > ______________________________________________
> > R-help@stat.math.ethz.ch mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
> >
>
>



Received on Thu Jul 14 07:21:28 2005
