[R] How to do multi-factor stratified sampling in R

From: Robert A. LaBudde <ral_at_lcfltd.com>
Date: Sat, 08 Mar 2008 11:48:42 -0500


Given a set of data with a number of variables plus a response, I'd like to obtain a randomized subset of the rows such that the marginal proportions of each variable are maintained closely in the subset to that of the dataset, and possibly maintaining as well the two-factor interaction marginal proportions as well for some pairs.

This must be a common problem in data mining, but I don't seem to be able to locate the proper library or function for doing this in R.

Thanks for any help.



Robert A. LaBudde, PhD, PAS, Dpl. ACAFS e-mail: ral_at_lcfltd.com
Least Cost Formulations, Ltd.            URL: http://lcfltd.com/
824 Timberlake Drive                     Tel: 757-467-0954
Virginia Beach, VA 23464-3239            Fax: 757-467-2947

"Vere scire est per causas scire"



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Sat 08 Mar 2008 - 16:56:04 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Sat 08 Mar 2008 - 21:30:21 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive