Re: [R] slowness of plot(x, type='l')

From: Prof Brian Ripley <ripley_at_stats.ox.ac.uk>
Date: Sat 07 May 2005 - 15:36:35 EST

Plotting times depend on the graphics device. That is nowhere mentioned here, which is unhelpful, and we have already seen a post saying it does not happen on another unmentioned device (presumably X11).

Let us assume the unmentioned device was windows(), as that is the only one I see any slowdown for. (Others like win.metafile are windows() under the skin.)

On Fri, 6 May 2005 apjaworski@mmm.com wrote:

> A couple of days ago a few messages indicated that something changed in the
> basic plot routine that made plot(*, type='l') slow for large data sets.
> Some people even reported crashes for very large data sets. As far as I
> remember, this was not reported as a formal bug.

Well, _is_ there a bug in R (as distinct from in Windows graphics internals)? I am almost certain there is not in R and this is a bug in Windows.

> I am still not sure if this is a bug, so I report my findings here. First
> of all, I think I see a slowdown of the plot function, although I do not
> have older versions of R installed, so I cannot do side-by-side
> comparisons. Secondly, I noticed that the behavior of plot(*, type='l')
> differs. Before R-2.1, the plotted lines would appear on the plot
> gradually. Now, after the wait, the whole plot appears at once.
>
> Here are my timing results. I am on Windows2000, IBM Intellistation with
> Xenon 2.8MHz with 1Gb of memory. I checked May-06 versions of R-patched
> and R-devel built from sources. I ran the following simple test:
>
> x <- rnorm(n)
> date(); plot(x, type='l'); date()

Oh, PLEASE, use system.time() to time things. Had you done so you might have seen things like

> windows()
> n <- 10000
> system.time(plot(rnorm(n), type="l"))
[1] 0.03 13.11 13.21 NA NA

> postscript()
> system.time(plot(rnorm(n), type="l"))
[1] 0.07 0.00 0.08 NA NA
> dev.off()
> system.time(plot(rnorm(n), type="p"))

[1] 0.07 0.93 1.00 NA NA

so the time is not being taken by R but by Windows.

I can tell you the reason: it is the support for mitred etc line ends introduced in R 2.0.0 and only supported in windows() from 2.1.0. This has slowed solid lines down to the sort of times taken for dashed lines previously.

Now, the best we can do to work around this is to follow what we did for dashed lines, and not attempt to be accurate for very large numbers of line segments. By plotting in bunches of 1000 lines I get

> system.time(plot(rnorm(n), type="l"))
[1] 0.03 0.36 0.42 NA NA
> system.time(plot(rnorm(n), type="l", lty=3))
[1] 0.22 2.89 3.11 NA NA

We have been here before, and as I recall this slowdown happens only in NT-based versions of Windows which seem _de facto_ restricted to about 1000 line elements in a path: what we were not aware of was that it happened for solid lines as well as dashed ones.

I've put the bunching into R-patched.

It is very regretable that this sort of thing was not tested for during beta-testing.

-- 
Brian D. Ripley,                  ripley@stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
Received on Sat May 07 15:42:21 2005

This archive was generated by hypermail 2.1.8 : Fri 03 Mar 2006 - 03:31:38 EST