From: TEMPL Matthias (Matthias.Templ@statistik.gv.at)
Date: Tue 25 May 2004 - 00:39:33 EST
I think the "cmeans" method in library(e1071) works better for large data sets as "fanny".
(note,not for this data: fanny has also problems with standardized large data sets - here produce fanny the same memberships for all observations; cmeans works "correctly")
> -----Ursprüngliche Nachricht-----
> Von: Bhaskar S. Manda [mailto:firstname.lastname@example.org]
> Gesendet: Montag, 24. Mai 2004 17:12
> An: email@example.com
> Betreff: Re: [R] non-hierarchical non-exclusive clustering of
> large data sets
> On Mon, 24 May 2004 15:58:57 +0200, Murk Wuite wrote:
> > I'm trying to use R to cluster words with related meanings. Does
> > anyone know of a non-hierarchical clustering method in R
> that produces
> > non-exclusive clusters? With non-exclusive, I mean that words should
> The "fanny" method in library(cluster) outputs probabilities
> of membership in each cluster.
> > the one above, its dimensions would be in the order of (100000,
> > 100000). Does anyone know if this would cause practical problems,
> > perhaps very slow clustering?
> I had a much smaller matrix, 4000x3, fanny took about 4
> minutes wall clock time on a lightly loaded (there were many
> other processes, but none
> computational) 1.4 GHz Athlon, It was completely CPU-bound.
> Rfirstname.lastname@example.org mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo> /r-help
> do read the posting guide!
Remail@example.com mailing list
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
This archive was generated by hypermail 2.1.3 : Mon 31 May 2004 - 23:05:12 EST