I tried to create a DocumentTermMatrix with the tm package, but I get this error:
Error in tolower(txt) :
invalid input 'PROD Z LAHKO GNETNO MELJNO GLINO, ... in 'utf8towcs'
I followed the approach shown in
http://www.r-project.org/doc/Rnews/Rnews_2008-2.pdf (An Introduction to Text Mining),
with this R code:
tekst <- Corpus(DirSource("."))
meta(tekst, "Heading", "local") <- c("test")
DateTimeStamp: 2011-05-21 11:25:21
Heading : test
ID : test.txt
Language : en
test <- TermDocumentMatrix(tekst)
> Error in tolower(txt) :
> invalid input 'PROD Z LAHKO GNETNO MELJNO GLINO, ... in 'utf8towcs'
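The 'utf8towcs' message suggests that tolower() was handed bytes that are not valid UTF-8. A minimal sketch for checking this in base R follows. It makes two assumptions that are not confirmed anywhere above: that the text is stored in a single-byte encoding such as CP1250 (only a guess), and that the local iconv build recognises the name "CP1250". The sample string is made up for illustration and stands in for a line read from test.txt.

```r
# Hypothetical sample: the bytes for "KO" plus 0x8A, which is "S with caron"
# in CP1250 but is not a valid byte sequence in UTF-8.
x <- rawToChar(as.raw(c(0x4b, 0x4f, 0x8a)))

# validUTF8() (base R >= 3.3.0) reports whether each string is valid UTF-8.
is_valid_before <- validUTF8(x)

# Assumption: the source encoding is CP1250. iconv() returns NA for input
# it cannot convert, so a wrong guess shows up as NA here.
fixed <- iconv(x, from = "CP1250", to = "UTF-8")
is_valid_after <- validUTF8(fixed)

print(is_valid_before)
print(is_valid_after)
```

For the real file, the same check could be run on readLines("test.txt") to see which lines fail before building the corpus.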
Attached is a small sample (test.txt) that I worked on.
Any help would be appreciated.
Archive generated by hypermail 2.2.0, at Sat 21 May 2011 - 12:00:08 GMT.
Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.