[R] Again on Data Mining

From: Lorenzo Isella <lorenzo.isella_at_gmail.com>
Date: Sun, 15 May 2011 18:41:30 +0200


Dear All,
I have already posted before on the list about data mining and it has proved very useful.
I have now a training dataset consisting of N objects of M<<N different kinds (actually, M is usually 3 to 5, whereas N is of the order of 1000). Every object has its own label L_i, i=1...N, that is known. For each of these objects I measure some property in time (let's say I measure it Q times in a given time interval), i.e. the i-th object has an associated file {t, y}, where t=(t_1,t_2....t_Q) and y=(y_1,y_2,...y_Q). My problem is then to come up with an algorithm that after learning on the training dataset, can guess the labels of a testing dataset. The difference with respect to the datamining I have done so far is that I do not have a set of properties for every object (e.g. age, sex, income, etc...) but rather an associated function y=f(t). Any suggestion (either conceptual or about which R package I should turn to) is greatly appreciated.
Many thanks

Lorenzo



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Sun 15 May 2011 - 16:52:33 GMT

This quarter's messages: by month, or sorted: [ by date ] [ by thread ] [ by subject ] [ by author ]

All messages

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Sun 15 May 2011 - 18:20:09 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive