[R] cauculating dissimilarities in R

From: virgin <virgin_at_seychelles.sc>
Date: Tue 26 Sep 2006 - 05:40:42 GMT


Dear All,
I’ve got a statistical question on calculating dissimilarities in R.
I want to calculate the different types of dissimilarities on the ‘flower’ dataset found in the package ‘cluster’. Flower is a data frame with 18 observations on 8 variables. Variable 1 and 2 are binary, variable 3 is asymmetric binary, variable 4 is nominal, variable 5 and 6 are ordered and variable 7 and 8 are interval scaled.

Commands to load the dataset in R.
library(cluster)
data(flower)
flower

What are the different types of dissimilarities that can be calculated on such a dataset?
Do I need to group the types of variables first i.e. all binary together then run the calculation? Do I use dissimilarity indices such as Jaccard or should it be classification function such as ‘daisy’ which should be used?

Many thanks,

Elvina Payet (MSc)
University of La Reunion



R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Tue Sep 26 16:38:50 2006

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.1.8, at Tue 26 Sep 2006 - 07:30:07 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.