Re: [Rd] read.csv confused by newline characters in header (PR#14103)

From: Peter Dalgaard <P.Dalgaard_at_biostat.ku.dk>
Date: Wed, 02 Dec 2009 13:55:40 +0100

g.russell_at_eos-solutions.com wrote:
> Full_Name: George Russell
> Version: 2.10.0
> OS: Microsoft Windows XP Service Pack 2
> Submission from: (NULL) (217.111.3.131)
>
>
> The following code (typed into R --vanilla)
>
> testString <- '"B1\nB2"\n1\n'
> con <- textConnection(testString)
> tab <- read.csv(con,stringsAsFactors = FALSE)
>
> produces a data frame with with one row and one column; the name of the column
> is "B1.B2" (alright so far). However according to
> print(tab[[1,1]])
>
> the value of the entry in the first row and first column is
>
> "B2\n1\n"
>
> So B2 has somehow got into both the names of the data frame and its entry.
> Either R is confused or I am. What is going on?

Presumably, read.table is not obeying quotes when removing what it thinks is the header line. Another variation is this:

> tab <- read.table(stdin(), head=T)

0: "B1
0: B2"
1: 1
2:

> tab

  B1.B2
1 B2"
2 1

It's somehow connected to the

pushBack(c(lines, lines), file)

bits in readtable.R, but I don't quite get it.

-- 
   O__  ---- Peter Dalgaard             Ă˜ster Farimagsgade 5, Entr.B
  c/ /'_ --- Dept. of Biostatistics     PO Box 2099, 1014 Cph. K
 (*) \(*) -- University of Copenhagen   Denmark      Ph:  (+45) 35327918
~~~~~~~~~~ - (p.dalgaard_at_biostat.ku.dk)              FAX: (+45) 35327907

______________________________________________
R-devel_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel
Received on Wed 02 Dec 2009 - 13:04:33 GMT

This archive was generated by hypermail 2.2.0 : Fri 04 Dec 2009 - 13:40:59 GMT