Re: [R] URL Scan

From: Barry Rowlingson <b.rowlingson_at_lancaster.ac.uk>
Date: Sun, 17 Apr 2011 23:32:19 +0100

On Sun, Apr 17, 2011 at 9:40 PM, jmsc <michaelfpage_at_gmail.com> wrote:
> I am wondering why when I try to input data from the first site listed below
> into R using the scan() function, a different page is read in instead (the
> second site listed):
>
> http://data.visionappraisal.com/CanterburyCT/parcel.asp?pid=1242
>
> http://www.visionappraisal.com/databases/
>
> I am wondering if this is an issue with R or something in the source code of
> the web page that I am not familiar with. Since I can access the first site
> directly, I assume it is not within the source code. Any help would be
> appreciated.

 I can't access the first URL directly - even from my web browser without R being involved at all. Is that "pid" a parcel ID that you need to be logged in to see? Or not a valid parcel id anymore?

 If you want to access a web site from R that needs a login/password then you need to send the appropriate login form info from R and keep the cookie session info that gets returned. Web sessions from R and from a web browser are independent.

Barry



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Sun 17 Apr 2011 - 22:35:03 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Mon 18 Apr 2011 - 00:40:31 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive