[R] correlation matrix - large dataset

From: suman Duvvuru <duvvuru.suman_at_gmail.com>
Date: Tue, 8 Jan 2008 01:34:02 -0500


I have a dataset with 20,000 variables.and I would like to compute a pearson correlation matrix which will be 20000*20000. The cor() function doesnt work in this case due to memory problem. If you have any ideas regarding a feasible way to compute correlations on such a huge dataset, please help me out.

Please feel free to share your memory handling techniques in R.


        [[alternative HTML version deleted]]

R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Tue 08 Jan 2008 - 09:20:40 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Tue 08 Jan 2008 - 15:30:05 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive