[R] Refactor all factors in a data frame

From: Hilmar Berger <hilmar.berger_at_imise.uni-leipzig.de>
Date: Tue, 05 Jun 2007 14:20:24 +0200

Hi all,

Assume I have a data frame with numerical and factor variables that I got through merging various other data frames and subsetting the resulting data frame afterwards. The number levels of the factors seem to be the same as in the original data frames, probably because subset() calls [.factor without drop = TRUE (that's what I gather from scanning the mailing lists).

I wonder if there is a easy way to refactor all factors in the data frame at once. I noted that fix(data_frame) does the trick, however, this needs user interaction, which I'd like to avoid. Subsequent write.table / read.table would be another option but I'm not sure if R can guess the factor/char/numeric-type correctly when reading the table.

So, is there any way in drop the unused factor levels from *all* factors of a data frame without import/export ?

Thanks in advance,


Hilmar Berger
Institut für medizinische Informatik, Statistik und Epidemiologie
Universität Leipzig
Härtelstr. 16-18
D-04107 Leipzig

Tel. +49 341 97 16 101
Fax. +49 341 97 16 109
email: hilmar.berger_at_imise.uni-leipzig.de

R-help_at_stat.math.ethz.ch mailing list
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Received on Tue 05 Jun 2007 - 12:39:23 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Wed 06 Jun 2007 - 12:31:44 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.