Why R?

R (http://www.r-project.org/) is an environment that lets users perform statistical analyses and create professional-quality graphics.

It is also a full programming language.

Moreover, it is released under an open-source licence (the GNU GPL), and professional use is free of charge.

The R environment has many features for data mining and graphical display, which can be used in many domains: customer data, marketing studies, biostatistics/bioinformatics, surveys, genetics, quality control (Six Sigma), etc.

Examples of topics where R can be used:

  1. Applied Statistics & Biostatistics
  2. Bayesian Statistics
  3. Chemometrics and Computational Physics
  4. Econometrics & Finance
  5. Environmetrics & Ecological Modeling
  6. Machine Learning
  7. Marketing & Business Analytics
  8. Psychometrics
  9. Robust Statistics
  10. Social Network Analysis
  11. Spatial Statistics
  12. Statistics in the Social and Political Sciences
  13. Teaching
  14. Visualization & Graphics

It includes:

- data manipulation and transformation features (OLAP for instance)

- extensive professional graphical capabilities

- interfaces to the existing databases and software in your company

- hundreds, if not thousands, of statistical functions

- finally, if you have a lot of data (terabytes), R can run parallel computations on a high-performance computing cluster. This is useful in domains such as big data, OLAP, and business analytics.

A short comparison:

Excel?
It is ubiquitous software, found in almost every company. Excel is easy to use, but limited in terms of features and the amount of data it can process. It can be slow, or even almost unusable, if your data set is too big. It includes few statistical functions and its graphs are limited.

... or MATLAB?
It depends.
MATLAB licenses are typically expensive (thousands of euros per year per user). If you have the budget for it, MATLAB is very good commercial software. Its documentation is excellent and its features are more standardized than R's.

R is open source and therefore free of charge. It is more complete than MATLAB and offers more possibilities, but it takes a bit longer to learn.

... or SAS?
SAS can be used to aggregate the data of all the divisions of your company. It takes some time to configure properly for your firm, and you will need experts to install it. Its programming language is old (designed in the 1970s). It is oriented towards "Business Intelligence" for big firms.