Re: [R] survival analysis and censoring

From: Terry Therneau <>
Date: Wed, 12 Mar 2008 13:40:13 -0500 (CDT)

  In your particular case I don't think that censoring is an issue, at least not for the reason that you discuss. The basic censoring assumption in the Cox model is that subjects who are censored have the same future risk as those who were a. not censored and b. have the same covariates.

   The real problem with informative censoring are the covaraites that are not in the model; ones that I likely don't even know exist. Assume for instance that some unknown exposure X, Perth sunlight say, makes people much more likely to get both of the outcomes. Assume further that it matters, i.e., the study includes a reasonable number of people with and without this exposure. Then someone who has an early heart attack actually has a higher risk of colorectal cancer than a colleague of the same age/sex/followup who did not have a heart attack, the reason being that the HA guy is more likely to be from Perth.    

   Your simulation went wrong by not actually accounting for time. You created an outcome table for CC & HD and added a random time vector to it. If someone would have had CC at 2 years and now has HD at 1 year, you can't just change the status to make them censored at 2. The gambling analogy would be kicking someone out of the casino just before they win -- it does odd things to the odds.

       Terry Therneau mailing list PLEASE do read the posting guide and provide commented, minimal, self-contained, reproducible code. Received on Wed 12 Mar 2008 - 18:49:16 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Thu 13 Mar 2008 - 06:30:21 GMT.

Mailing list information is available at Please read the posting guide before posting to the list.

list of date sections of archive