[R] text mining

From: rgui <wa7.mej_at_gmail.com>
Date: Mon, 30 May 2011 03:17:41 -0700 (PDT)


Hi,

I have a problem when indexing the corpus. I used the following syntax:

> Setwd ("c :/....")
> Library (tm)
> Txt = Corpus (DirSource ("."); readerControl = list (language = "frensh"))

an error message comes:

>>> Messages d'avis :
1: In readLines(y, encoding = x$Encoding) :   ligne finale incompl√®te trouv√©e dans './n3.txt' 2: In readLines(y, encoding = x$Encoding) :
  ligne finale incompl√®te trouv√©e dans './n32.

another question:
 how can I read different document types (. pdf,. "...) html using the
package "tm"?

Thanks very well for help

--
View this message in context: http://r.789695.n4.nabble.com/text-mining-tp3560367p3560367.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Received on Mon 30 May 2011 - 12:24:12 GMT

This quarter's messages: by month, or sorted: [ by date ] [ by thread ] [ by subject ] [ by author ]

All messages

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Mon 30 May 2011 - 12:40:11 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive