Re: [R] Logistic regression with multiple imputation

From: Simon Blomberg <>
Date: Wed, 30 Jun 2010 15:36:30 +1000

mitools is useful too, and I can vouch for mice. mice is easy to use, and easy to write new imputation methods too. So it is also very flexible.


On 30/06/10 15:31, Jeremy Miles wrote:
> Hi Daniel
> First, newer versions of SPSS have dramatically improved their ability
> to do stuff with missing data - I believe it's an additional module,
> and in SPSS-world, each additional module = $$$.
> Analyzing missing data is a 3 step process. First, you impute,
> creating multiple datasets, then you analyze each dataset in the
> conventional way, then you combine the results. There are two (that
> I know of) packages for imputaton - these are mi and mice.
> will find them for you.
> Hope that helps,
> Jeremy
> On 29 June 2010 22:14, Daniel Chen<> wrote:
>> Hi,
>> I am a long time SPSS user but new to R, so please bear with me if my
>> questions seem to be too basic for you guys.
>> I am trying to figure out how to analyze survey data using logistic
>> regression with multiple imputation.
>> I have a survey data of about 200,000 cases and I am trying to predict the
>> odds ratio of a dependent variable using 6 categorical independent variables
>> (dummy-coded). Approximatively 10% of the cases (~20,000) have missing data
>> in one or more of the independent variables. The percentage of missing
>> ranges from 0.01% to 10% for the independent variables.
>> My current thinking is to conduct a logistic regression with multiple
>> imputation, but I don't know how to do it in R. I searched the web but
>> couldn't find instructions or examples on how to do this. Since SPSS is
>> hopeless with missing data, I have to learn to do this in R. I am new to R,
>> so I would really appreciate if someone can show me some examples or tell me
>> where to find resources.

>> Thank you!
>> Daniel
>> [[alternative HTML version deleted]]
>> ______________________________________________
>> mailing list
>> PLEASE do read the posting guide
>> and provide commented, minimal, self-contained, reproducible code.

Simon Blomberg, BSc (Hons), PhD, MAppStat.
Lecturer and Consultant Statistician
School of Biological Sciences
The University of Queensland
St. Lucia Queensland 4072
T: +61 7 3365 2506

1.  I will NOT analyse your data for you.
2.  Your deadline is your problem

Statistics is the grammar of science - Karl Pearson.

______________________________________________ mailing list
PLEASE do read the posting guide
and provide commented, minimal, self-contained, reproducible code.
Received on Wed 30 Jun 2010 - 05:38:26 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Wed 30 Jun 2010 - 06:10:43 GMT.

Mailing list information is available at Please read the posting guide before posting to the list.

list of date sections of archive