[R] Pulling strings from a Flat file

From: Kalicin, Sarah <sarah.kalicin_at_intel.com>
Date: Tue, 05 Apr 2011 16:48:52 -0700


I have a flat file that contains a bunch of strings that look like this. The file was originally created on Unix and brought over to Windows:

E123456E234567E345678E456789E567891E678910E. . . . Basically, each string starts with E and is followed by 6 digits. One string = E123456, length = 7 characters. The file contains tens of thousands of these strings. I want to separate them into one vector whose length is the number of strings in the flat file, where each string is its own unique value.

> aa<- file("Master","r", raw=TRUE)
> readChar(aa, cc, useBytes = FALSE)

[1] "E123456" "\nE23456" "7\nE3456" "78\nE456" "789\nE56" "7891\nE6" "78910\nE"
> close(aa)
> unlink("Master")

The biggest issue is that \n is being added into the strings. I am not sure where it is coming from, and it splits the strings in the wrong places. Any suggestions on getting rid of the \n and on building the cc vector as a repeated sequence of 7s for the string lengths? Is there a better way to do this?
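
One possible approach, sketched under assumptions rather than tested against the real file: the "\n" in the output above suggests that the file contains line breaks between records, which readChar() counts as ordinary characters. Reading the file with readLines(), gluing the lines together, and then extracting every "E" followed by six digits would sidestep the fixed-width bookkeeping entirely. The temporary file and its sample contents below are hypothetical stand-ins for "Master".

```r
# Hypothetical stand-in for the "Master" file: a small sample in which
# line breaks fall between (some of) the 7-character records.
tmp <- tempfile()
writeLines(c("E123456", "E234567E345678", "E456789"), tmp)

# readLines() drops the line terminators; paste() glues the lines into one
# long string, and gsub() removes any stray carriage returns or newlines.
txt <- paste(readLines(tmp, warn = FALSE), collapse = "")
txt <- gsub("[\r\n]", "", txt)

# Extract every "E" followed by exactly six digits.
ids <- regmatches(txt, gregexpr("E[0-9]{6}", txt))[[1]]
print(ids)

# Remove the temporary sample file.
unlink(tmp)
```

If every record is guaranteed to be exactly 7 characters once the line breaks are gone, a fixed-width split such as substring(txt, starts, starts + 6) with starts <- seq(1, nchar(txt), by = 7) should give the same vector. The regular expression has the advantage of skipping anything that does not match the pattern.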



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Wed 06 Apr 2011 - 02:49:15 GMT

