In the early 2000s, a new software emerged and progressively established itself as an equal to the three major softwares that corner the market in statistical analysis
We would anticipate users of these softwares may be interested in R
SPSS license Base edition
Software/System | Windows | MacOS | Linux | BSD | other Unix |
---|---|---|---|---|---|
SAS | yes | terminated | yes | no | yes |
SPAD | yes | no | no | no | no |
SPSS | yes | yes | no | no | no |
Stata | yes | yes | yes | no | no |
https://en.wikipedia.org/wiki/Comparison_of_statistical_packages
They provide zero or few network analysis, sequence data analysis, lexicometry (except for SPAD), and few features dedicated to valorisation.
Centralized management limits the following:
That is why we use R
Two languages often used in data management and data analysis
and compared against each other because of their similar features…
Choosing R or Python depends on
who I am and what I want to do
R is as brilliant…
For users less advanced in programming
specialized in data analysis
…as Python is powerful!
For experts in programming
specialized in data science
R is based on programming language S, created in 1988
https://blog.revolutionanalytics.com/2017/10/updated-history-of-r.html
The result of 30 years of research and development
Major financiers support the development of R: Microsoft, Google, Oracle, Esri…
Software/System | Windows | MacOS | Linux | BSD | Other Unix |
---|---|---|---|---|---|
R | yes | yes | yes | yes | yes |
R offers 2292 standard statistical analysis and graphics functions (core-based)
Many packages are available to enrich this core base, they are listed on the
Comprehensive R Archive Network (CRAN). Ex :
R has a modular structure that offers a multitude of applications
Its development is only limited by contributions
Number of available packages on the CRAN
Available packages allow a huge range of operations. From data collection to the final results’ valorisation (chart, gaphic design, document, website…)
its versatility makes R a complement and even a competition to many existing softwares
The information quickly runs through open software communities
Reproducibility means sharing and transparency!
RStudio is a company developing and releasing softwares and services based on R.
It is the major private actor in R community
RStudio (its employees) developed several reference packages. Ex :
Rstudio also released an integrated development environment (IDE),
making R easier to work with
R interface on Windows
no interface on Linux (terminal)
The RStudio IDE makes it easier to learn and use R
it is simple, complete and constantly evolving…
Use the RStudio environment!
Installing R and the Rstudio IDE is as smooth as any other software. Download R through the CRAN
Download the Desktop version from Rstudio website
Launch RStudio (not R) to begin
consultation:
source code:
Numerous referenced documentary resources (EN, FR and SP) on…
Natacha Bohin (Barts Cancer Institute)
Violaine Jurie (Université de Paris)
REVEAL.JS