Re: [R] Popularity of R, SAS, SPSS, Stata...

From: Keo Ormsby <keo.ormsby2_at_gmail.com>
Date: Tue, 22 Jun 2010 15:26:43 -0500

>> -----Original Message-----
>> From: r-help-bounces_at_r-project.org [mailto:r-help-bounces_at_r-project.org]
>> On Behalf Of Ivan Calandra
>> Sent: Sunday, June 20, 2010 3:47 PM
>> To: r-help_at_r-project.org
>> Subject: Re: [R] Popularity of R, SAS, SPSS, Stata...
>>
>> Bob,
>>
>> I have no idea whether it is realistic, but if you look for the papers

>> that used R or SAS (or anything), you might get better results by
>> searching for the way R and SAS are cited.
>>
>
> Hi Ivan, that was what I tried when more generic keywords failed. However, almost no one seems to use that citation. For example, in 2009, only 28 papers contain "R Foundation" and 61 contain Bioconductor, which uses R. One single paper contains both. I appreciate the idea though!
>
> Thanks,
> Bob
>
>
Hi Bob,
Great work. I just did a quick-and-dirty search on Google Scholar, but trying to be less stringent about what to consider a "hit". I used for R: "R statistical" OR "R package", for SPSS; just "SPSS", for S-plus; ditto, for SAS; "SAS statistical" OR "SAS institute", for STATA; "STATA" AND "statistical". For estimating a "background" of scholarly statistical publications I just searched for the word "statistical" and did this and searching for each year. Of course this "methodology" is far from acceptable, and includes many things that were not necessarily works that used the software, and may have only mentioned it, but it does throw some interesting things. I am using *cited* in the sense of *mentioned in the article as to be indexed by Google*, not as an academic citation.
It appears that SPSS is going down from having the lion's-share in mid 2000's, but remains the most cited. SAS had a big dip from being the most cited, but is having a come-back. STATA, and R are both rising steadly, with R having a more than 16-fold increase from 2000, but still being a small fraction of total citations. There is a dip in 2010 for STATA, R, and S-Plus, and a big peak for SAS, but I am not sure these are artifacts form Goggle's algorithm.

If anyone wants to give it a look, here's the data: Search
term,2010,2009,2008,2007,2006,2005,2004,2003,2002,2001,2000,1999,1998,1997,1996,1995 SPSS,28900,46700,64600,84600,97600,104000,105000,91800,67700,53600,40600,22600,15300,10100,7530,6330 SAS,15300,15000,14700,15200,14900,15900,16800,16800,16200,15500,14600,13100,11900,10700,9270,8950

STATA,9800,19300,16900,14600,12800,10700,8930,6780,4900,4050,3280,2120,1560,959,642,501
S-Plus,1900,4820,4900,5050,4950,4780,5100,4110,3610,2970,2540,2150,1780,1500,1200,990
R,3470,6140,4660,3510,2450,1700,1090,642,416,294,247,188,203,178,133,158
statistical,181000,347000,709000,885000,1010000,1030000,1060000,1110000,1090000,1020000,968000,887000,786000,718000,670000,580000

Best wishes,
Keo.



R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Tue 22 Jun 2010 - 20:56:29 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Tue 22 Jun 2010 - 21:20:34 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive