[R] imbalanced data set

From: Weiwei Shi <helprhelp_at_gmail.com>
Date: Sun 24 Jul 2005 - 15:46:08 EST


Hi,
I have a question about classification on an imbalanced data set. Is there a package that addresses this problem through a sampling approach, such as one-sided selection?

A follow-up question: how can I select 'representative' samples and remove noisy, borderline, and redundant cases in order to improve classification accuracy? Has any such method been implemented in R or in other GNU software?
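
To make concrete what is meant by one-sided selection, here is a rough base-R sketch of the procedure as usually described (Kubat and Matwin, 1997): keep every minority case, keep only the majority cases that a 1-NN rule needs in order to classify the data consistently (this drops redundant cases), then drop majority cases that take part in Tomek links (this drops borderline or noisy cases). The function name, its arguments, the Euclidean distance, and the choice to look for Tomek links only within the retained subset are assumptions made for illustration, not an existing package API.

```r
# Illustrative sketch of one-sided selection for a two-class problem.
# x: numeric matrix or data frame (rows = cases); y: class labels;
# minority: label of the rare class. All names here are hypothetical.
one_sided_selection <- function(x, y, minority) {
  d <- as.matrix(dist(as.matrix(x)))   # pairwise Euclidean distances
  diag(d) <- Inf                       # a case is never its own neighbour

  pos <- which(y == minority)
  neg <- which(y != minority)

  # Step 1: start with all minority cases plus one random majority case.
  keep <- c(pos, neg[sample.int(length(neg), 1)])

  # Step 2: classify every case by 1-NN against the kept cases and
  # add the misclassified ones; the rest are treated as redundant.
  nn_in_keep <- keep[unname(apply(d[, keep, drop = FALSE], 1, which.min))]
  keep <- union(keep, which(y[nn_in_keep] != y))

  # Step 3: a Tomek link is a pair of mutual nearest neighbours with
  # different labels; drop the majority member of each such pair.
  dk <- d[keep, keep, drop = FALSE]
  nn <- unname(apply(dk, 1, which.min))        # positions within 'keep'
  mutual <- nn[nn] == seq_along(keep)
  tomek <- mutual & (y[keep] != y[keep[nn]])
  drop <- tomek & (y[keep] != minority)

  sort(keep[!drop])                            # row indices to retain
}

# Toy usage on simulated data (100 majority cases, 10 minority cases).
set.seed(1)
x <- rbind(matrix(rnorm(200, mean = 0), ncol = 2),
           matrix(rnorm(20, mean = 2), ncol = 2))
y <- factor(c(rep("neg", 100), rep("pos", 10)))
kept <- one_sided_selection(x, y, minority = "pos")
table(y[kept])
```

The returned indices can be used to subset the training data before fitting any classifier. The result depends on the random majority case chosen in step 1, so it varies with the seed.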

Thanks,

weiwei

-- 
Weiwei Shi, Ph.D


"Did you always know?"
"No, I did not. But I believed..."
---Matrix III

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
Received on Sun Jul 24 15:53:50 2005

This archive was generated by hypermail 2.1.8 : Fri 03 Mar 2006 - 03:33:58 EST