http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html
Excellent publicity for one of the best, and yet, most underappreciated, applications in the OSS world.
The article glosses over one thing that makes <R> really well suited to the statistics world... A SAS representative makes this gibe:
But one of the hidden but major problems with programs like SAS, SPSS, LISREL, MPlus, etc, is that, for complicated statistics, for which there are no completely well-established numerical methodologies, these programs, being closed source, do not publish their source code and are therefore not open to analysis by the community to determine whether their assumptions in conducting these analyses are actually properly warranted. Even in relatively simple analyses like basic SEM, it's fairly common that results from different packages are slightly different, with little clarity as to why.
R has a huge advantage, in principle, that even for its most complex statistics, how it arrives at its results is completely available for analysis.
To some people R is just the 18th letter of the alphabet. To others, it’s the rating on racy movies, a measure of an attic’s insulation or what pirates in movies say.
R is also the name of a popular programming language used by a growing number of data analysts inside corporations and academia. It is becoming their lingua franca partly because data mining has entered a golden age, whether being used to set ad prices, find new drugs more quickly or fine-tune financial models. Companies as diverse as Google, Pfizer, Merck, Bank of America, the InterContinental Hotels Group and Shell use it.
But R has also quickly found a following because statisticians, engineers and scientists without computer programming skills find it easy to use.
“R is really important to the point that it’s hard to overvalue it,” said Daryl Pregibon, a research scientist at Google, which uses the software widely. “It allows statisticians to do very intricate and complicated analyses without knowing the blood and guts of computing systems.”
Excellent publicity for one of the best, and yet, most underappreciated, applications in the OSS world.
The article glosses over one thing that makes <R> really well suited to the statistics world... A SAS representative makes this gibe:
“I think it addresses a niche market for high-end data analysts that want free, readily available code," said Anne H. Milley, director of technology product marketing at SAS. She adds, “We have customers who build engines for aircraft. I am happy they are not using freeware when I get on a jet.”
But one of the hidden but major problems with programs like SAS, SPSS, LISREL, MPlus, etc, is that, for complicated statistics, for which there are no completely well-established numerical methodologies, these programs, being closed source, do not publish their source code and are therefore not open to analysis by the community to determine whether their assumptions in conducting these analyses are actually properly warranted. Even in relatively simple analyses like basic SEM, it's fairly common that results from different packages are slightly different, with little clarity as to why.
R has a huge advantage, in principle, that even for its most complex statistics, how it arrives at its results is completely available for analysis.