[R] scanning a pdf scan

From: roger koenker <rkoenker_at_uiuc.edu>
Date: Fri 27 Oct 2006 - 16:34:48 GMT


I have a pdf scan of several pages of data from a quite famous old paper by
C.S. Pierce (1873). I would like (what else?) to convert it into an R dataframe.
Somewhat to my surprise the pdf seems to already be in a character recognized
form, since I can search for numerical strings and they are nicely found. Of
course, as is usual with such tables there are also headings and column lines, etc
etc. that are less interesting than the numbers themselves. I've tried saving the
pdf in various formats, some of which look vaguely tractable, but I'm hoping
that there is something that is more automatic.

Does anyone have experience that they could share toward this objective?

url:    www.econ.uiuc.edu/~roger            Roger Koenker
email    rkoenker@uiuc.edu            Department of Economics
vox:     217-333-4558                University of Illinois
fax:       217-244-6678                Champaign, IL 61820

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Sat Oct 28 02:50:50 2006

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.1.8, at Fri 27 Oct 2006 - 21:30:16 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.