Re: [R] Dequantizing

From: <Richard.Cotton_at_hsl.gov.uk>
Date: Thu, 20 Nov 2008 17:23:45 +0000


> I have some data measured with a coarsely-quantized clock. Let's say
> the real data are
>
> q<- sort(rexp(100,.5))
>
> The quantized form is floor(q), so a simple quantile plot of one
> against the other can be calculated using:
>
> plot(q,type="l"); points(floor(q),col="red")
>
> which of course shows the characteristic stair-step. I would like to
> smooth the quantized form back into an approximation of the underlying
> data.
>
> The simplest approach I can think of adds a uniform random variable of
> the size of the quantization:
>
> plot(q,type="l"); points(floor(q),col="red");
> points(floor(q)+runif(100,0,1),col="blue")
>
> This gives pretty good results for uniform distributions, but less
> good for others (like exponential). Is there a better
> interpolation/smoothing function

I'm not convinced that adding a random amount to the floor values to 'make up' the underlying data is very meaningful.

If you know what the underlying distribution is, then you are best off using this distribution to generate plots and extra pretend data.

If you know the distribution is exponential, then you can estimate the rate by treating the true values as interval censored data, somewhere between floor and floor+1.

library(survival)
q <- sort(rexp(100,.5))
qq <- floor(q)
qq[qq==0] <- 0.00001 #survreg doesn't like values that are exactly zero
ss <- Surv(qq, qq+1, type="interval2")
model <- survreg(ss ~ 1, dist="exponential") summary(model)
rate <- 1/exp(model$coefficients["(Intercept)"]); rate #hopefully something close to 0.5

If you don't know the underlying distribution either, then things get trickier. Try a histogram/kernel density plot/boxplot to see what it looks like.

Regards,
Richie.

Mathematical Sciences Unit
HSL



ATTENTION: This message contains privileged and confidential inform...{{dropped:20}}

R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Thu 20 Nov 2008 - 17:37:40 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Thu 20 Nov 2008 - 18:30:26 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive