Re: [R] Datasets in R

From: Antony Unwin <>
Date: Fri, 30 May 2008 14:31:42 +0200


There are many sources of real datasets (in R itself, on the web), you just need to look a little. For teaching purposes, I think it is always better to use real datasets than to use simulated ones.

One thing bothers me, though. You imply that in all the examples you have the data are well fit with linear models, the residuals are normal and there is no sign of heteroscedacity. That sounds a very unusual set of examples!



> From: Roland Rau <>
> Date: 30 May 2008 12:23:17 AM GMT+02:00
> To: Carlos López <>
> Cc:
> Subject: Re: [R] Datasets in R
> Carlos López wrote:
>> I´m trying to find datasets that will give me residuals, after
>> applying the lm function, with no normality, non linearity, and
>> heteroscedacity so I can try to exemplify
>> those cases in the linear regression model. Can you give any advice
>> on what datasets would be appropiate? I can´t use the ones in the
>> alr3 package because those have
>> already been seen in class.
>> Thank you very much :-)
>> natorro
> if you know what you are looking for (or not looking for), wouldn't
> it be the easiest and fastest thing to do to simulate such a dataset
> yourself?
> Best,
> Roland

        [[alternative HTML version deleted]] mailing list PLEASE do read the posting guide and provide commented, minimal, self-contained, reproducible code. Received on Mon 02 Jun 2008 - 05:27:37 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Mon 02 Jun 2008 - 05:30:36 GMT.

Mailing list information is available at Please read the posting guide before posting to the list.

list of date sections of archive