Re: [R] Cluster analysis using numeric and factor variables

From: Moshe Olshansky <m_olshansky_at_yahoo.com>
Date: Tue, 10 Jun 2008 19:13:13 -0700 (PDT)


If you can define a distance between two vectors (where each one has some numerical and some categorical coordinates) then you can proceed with any clustering algorithm.

One possibility to get such a distance is to use RandomForest which can produce a proximity matrix which can be turned into distance matrix.

Regards,

Moshe.

> From: Nagu <thogiti_at_gmail.com>
> Subject: [R] Cluster analysis using numeric and factor variables
> To: r-help@r-project.org
> Received: Wednesday, 11 June, 2008, 11:49 AM
> Hi,
>
> Are there any algorithms that handle numeric and factor
> variables
> together in a cluster analysis?
>
> Thank you,
> Nagu
>
> ______________________________________________
> R-help_at_r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained,
> reproducible code.



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Wed 11 Jun 2008 - 03:33:28 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Wed 11 Jun 2008 - 05:30:46 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive