[R] RandomForest question

From: <Arne.Muller_at_sanofi-aventis.com>
Date: Thu 21 Jul 2005 - 23:20:50 EST


I'm trying to find out the optimal number of splits (mtry parameter) for a randomForest classification. The classification is binary and there are 32 explanatory variables (mostly factors with each up to 4 levels but also some numeric variables) and 575 cases.

I've seen that although there are only 32 explanatory variables the best classification performance is reached when choosing mtry=80. How is it possible that more variables can used than there are in columns the data frame?

	thanks for your help
	+ kind regards,


	[[alternative HTML version deleted]]

R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html Received on Thu Jul 21 23:36:56 2005

This archive was generated by hypermail 2.1.8 : Fri 03 Mar 2006 - 03:33:53 EST