[R] Gap statistic

From: Nestor Fernandez <nestor.fernandez_at_ufz.de>
Date: Fri 11 Mar 2005 - 00:00:44 EST


Dear All,

I need to calculate the optimal number of clusters for a classification based on a large number of observations (tens of thousands). Thibshirani et al. proposed the gap statistic for this purpose. I tried the R-code developed by R. Jörnsten but R hangs with such amount of data (). Is it available any other (optimised) code? Any help would be appreciated, including suggestions about other alternatives for the selection of an optimal number of cluster from large datasets.

Thanks,

Néstor Fernández, PhD.

Department of Ecological Modelling
UFZ - Centre for Environmental Research
PF 500136, DE-04301, Leipzig, Germany.
Tel: +49 341-2352034
E-mail: nestor.fernandez@ufz.de



R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html Received on Fri Mar 11 00:08:28 2005

This archive was generated by hypermail 2.1.8 : Fri 03 Mar 2006 - 03:30:42 EST