Re: [R] recoding data with loops

From: Erik Iverson <iverson_at_biostat.wisc.edu>
Date: Mon, 19 May 2008 15:07:58 -0500

Hello -

Donald Braman wrote:
> # I'm new to R and am trying to get the hang of how it handles
> # dataframes & loops. If anyone can help me with some simple tasks,
> # I'd be much obliged.
>
> # First, i'd like to generate some random data in a dataframe
> # to efficiently illustrate what I'm up to.
> # let's say I have six variables as listed below (I really
> # have hundreds, but a few will illustrate the point).
> # I want to generate my dataframe (mdf)
> # with the 6 variables X 100 values with rnorm(7).
> # How do I do this? I tried many variations on the following:
>
> var_list <- c("HEQUAL", "EWEALTH", "ERADEQ", "HREVDIS1", "EDISCRIM",
> "HREVDIS2")
> for(i in 1:length(var_list)) {var_list[1] <- rnorm(100)}
> mdf <- data.frame(cbind(varlist[1:length(var_list)])
> mdf
>
There are many ways to do this. Do you mean that you want 6 columns, 100 observations in each column, each a sample from a normal distribution with mean = 7 and sd = 1? You can do this without looping in one of several ways. If you are coming from a SAS environment (my guess since you talk of looping over data.frames), you may be used to looping through a data object. In R, you can usually avoid this since many functions are vectorized, or take a 'whole object' approach.

var_list <- c("HEQUAL", "EWEALTH", "ERADEQ", "HREVDIS1", "EDISCRIM", "HREVDIS2") mdf <- data.frame(replicate(6, rnorm(100, 7))) ## generate random data names(mdf) ## default names
names(mdf) <- var_list ## use our names

>
> # Then, I'd like to recode the variables that begin with the letter "H".
> # I've tried many variations of the following, but to no avail:
>
> reverse_list <- c("HEQUAL", "HREVDIS1", "HREVDIS2")
> reversed_list <- c("RHEQUAL", "RHREVDIS1", "RHREVDIS2")
> for(i in 1:length(reverse_list))
> {mdf[ ,e_reversed_list][[i]] <- recode(mdf[ ,e_reverse_list][[i]],
> '5:99=NA; 1=4; 2=3; 3=2; 4=1; ', as.factor.result=FALSE)
>

I'm not quite sure what you are after here. What do you mean by recode? What package is your 'recode' function located in?

It appears that you may be under the impression that the data.frame contains integers, but certainly it will not since it was generated with rnorm? sample can generate a samples of the type you may be after, for example,

 > sample(7, 100, replace = TRUE)

Best,
Erik Iverson



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Mon 19 May 2008 - 20:40:47 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Mon 19 May 2008 - 21:30:46 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive