Re: [R] [handling] Missing [values in randomForest]

From: Jim Porzak <jporzak_at_gmail.com>
Date: Mon 12 Sep 2005 - 02:50:07 EST

On 9/11/05, Uwe Ligges <ligges@statistik.uni-dortmund.de> wrote:
> Jan-Paul Roodbol wrote:
>
> > Does anyone know if randomForest in R can handle
> > dataset with missings?
>
> See ?randomForest, you can omit observations including NAs by specifying
> na.action=na.omit

Uwe,
While strictly true, this tells randomForest to ignore any rows with one or more NAs in the predictor variables.

Since, randomForest is often used for problems with a lot of (canidate) predictors, na.omit can result in a lot of rows being discarded. Hence, my reply to Jan-Paul's original posting suggesting the impute functions in randomForest.

JIm Porzak

> Please do not cross-post!
> Please specify a sensible subject!
>
> Uwe Ligges
>
>
> > Thank you
> >
> > Kind regards
> >
> > Jan-Paul
> >
> > ______________________________________________
> > R-help@stat.math.ethz.ch mailing list
> >
https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
>
> ______________________________________________
> R-help@stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
>



R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html Received on Mon Sep 12 02:57:22 2005

This archive was generated by hypermail 2.1.8 : Sun 23 Oct 2005 - 16:50:07 EST