Re: [R] converting a data set to a format for time series analysis

From: jim holtman <jholtman_at_gmail.com>
Date: Mon, 09 Jun 2008 20:54:21 -0400

Here is one way of doing it:

> x <- read.table(textConnection("subject hospital date_enrollment hospital_beds
+  1       hospitalA       1/3/2002        300
+  2       hospitalA       1/6/2002        300
+  3       hospitalB       2/4/2002        150
+  4       hospitalC       3/2/2002        200"), header=TRUE)

> closeAllConnections()
> y <- as.Date(x$date_enrollment, "%m/%d/%Y")
> z <- cbind(x, year=format(y, "%Y"), month=format(y, "%m"))
> # partition the data
> z.s <- split(z, list(z$year, z$month, z$hospital), drop=TRUE)
> # now aggregate
> do.call(rbind, lapply(z.s, function(a) data.frame(hospital=a$hospital[1],
+     cases=nrow(a), year=a$year[1], month=a$month[1],
+     beds=a$hospital_beds[1])))
                   hospital cases year month beds
2002.01.hospitalA hospitalA     2 2002    01  300
2002.02.hospitalB hospitalB     1 2002    02  150
2002.03.hospitalC hospitalC     1 2002    03  200
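
The same counts can also be obtained without the explicit split/lapply step. The following is only a sketch, assuming that hospital_beds is constant within each hospital (so it can serve as a grouping column) and that your R is recent enough for read.table() to accept a text= argument; the names `d` and `res` are made up for the illustration.

```r
# Same four-row example as in the original question.
x <- read.table(text = "subject hospital date_enrollment hospital_beds
1 hospitalA 1/3/2002 300
2 hospitalA 1/6/2002 300
3 hospitalB 2/4/2002 150
4 hospitalC 3/2/2002 200", header = TRUE, stringsAsFactors = FALSE)

# Parse the month/day/year dates, then extract year and month as strings.
d <- as.Date(x$date_enrollment, format = "%m/%d/%Y")
x$year  <- format(d, "%Y")
x$month <- format(d, "%m")

# Count subjects per hospital, year and month. hospital_beds is included as a
# grouping column on the ASSUMPTION that it never varies within a hospital.
res <- aggregate(subject ~ hospital + hospital_beds + year + month,
                 data = x, FUN = length)
names(res)[names(res) == "subject"] <- "cases"
res
```

Note that months in which a hospital enrolled nobody do not appear in the result at all; if the time series needs explicit zero rows, they would have to be added separately.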

On Mon, Jun 9, 2008 at 12:45 PM, Ricardo Pietrobon <pietr007_at_gmail.com> wrote:

> I currently have a data set describing human subjects enrolled into an
> international clinical trial, the name of the hospital enrolling this
> human subject, the date when the subject was enrolled, and a vector
> with variables representing characteristics of the site (e.g., number
> of beds in a hospital). My data set looks like this:
>
> subject hospital date_enrollment hospital_beds
> 1 hospitalA 1/3/2002 300
> 2 hospitalA 1/6/2002 300
> 3 hospitalB 2/4/2002 150
> 4 hospitalC 3/2/2002 200
>
> to perform a time series analysis I am now trying to get to a format
> that would give me the following variables:
>
> month year site number_enrolled_subjects hospital_beds
>
> the data would be displayed on one-month intervals, and number of
> subjects clustered around sites.
>
> any help would be greatly appreciated
>
> thanks
>
>
> Ricardo
>
> ______________________________________________
> R-help_at_r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

-- 
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem you are trying to solve?

Received on Tue 10 Jun 2008 - 00:57:32 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Tue 10 Jun 2008 - 01:30:43 GMT.
