The History and Use of R
Joseph Kambourakis
Ground Rules
• Interrupt me
• These are all my opinions, not those
of EMC or the Big Data
Analytics, Discovery &
Visualization Meetup
• Slides will be available
Joseph Kambourakis
@mouthorjoe
Taught Around the World
WPI
Bentley University
Big Data School
Source: Data Analytics Master's Degrees: 20 Top Programs
Sam Woolford & Dominique Haughton
First Got Exposed to R
What is R?
R is a free software environment for statistical
computing and graphics
A language plus a run-time environment with
graphics, a debugger, access to certain system
functions, and the ability to run programs stored
in script files
What is R’s Hashtag?
Robert Gentleman & Ross Ihaka
• R: A Language for Data Analysis and Graphics
Starts with S
1976 1988 1991
Scheme
• Lexical scoping
Lexical scoping
• Looks up names by searching through environments
– First, the global environment
• The global environment is your workspace
– Second, the namespaces of packages
• More on packages later
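A minimal sketch of lexical scoping, using base R only. The names make_counter, show_x, and call_with_local_x are hypothetical and chosen for illustration. It shows that a function resolves its free variables where it was defined, and that search() lists the global environment ahead of the attached packages.

```r
# Hypothetical example: a closure remembers the environment it was defined in.
make_counter <- function() {
  count <- 0                 # lives in the environment created by this call
  function() {
    count <<- count + 1      # <<- assigns in the enclosing environment
    count
  }
}

counter <- make_counter()
first  <- counter()          # 1
second <- counter()          # 2

# A free variable is resolved where the function was defined,
# not where the function is called.
x <- 10
show_x <- function() x
call_with_local_x <- function() {
  x <- 99                    # local to this call; show_x() does not see it
  show_x()
}
result <- call_with_local_x()   # 10

# The search path starts at the global environment, then attached packages.
path <- search()
```

Each call to make_counter() creates a fresh environment, so two counters built this way would not share state.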
Fortran
• source: Wikipedia
Under the Hood
Open Source
• GNU General Public License
• Freedom 0: The freedom to run the program for any
purpose.
• Freedom 1: The freedom to study how the program
works, and change it to make it do what you wish.
• Freedom 2: The freedom to redistribute copies so you
can help your neighbor.
• Freedom 3: The freedom to improve the program, and
release your improvements (and modified versions in
general) to the public, so that the whole community
benefits.
• source: GNU.org
R Project
• The R Foundation is a not for profit organization working in
the public interest. It has been founded by the members of
the R Development Core Team in order to
– Provide support for the R project and other innovations in
statistical computing. We believe that R has become a mature
and valuable tool and we would like to ensure its continued
development and the development of future innovations in
software for statistical and computational research.
– Provide a reference point for individuals, institutions or
commercial enterprises that want to support or interact with
the R development community.
– Hold and administer the copyright of R software and
documentation.
• source: R Project
Contributors
How it Works: Design
• Functional
– mean()
– plot()
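As a rough illustration of the functional style, the sketch below treats functions such as mean() as ordinary values that can be passed to other functions. The data and the compose helper are hypothetical, not taken from the talk.

```r
# Functions are ordinary values: they can be stored, passed, and returned.
scores <- list(a = c(1, 2, 3), b = c(10, 20, 30, 40))

# Pass mean() itself as an argument; sapply() applies it to each element.
averages <- sapply(scores, mean)

# An anonymous function works the same way.
ranges <- sapply(scores, function(v) max(v) - min(v))

# Hypothetical helper: build a new function out of two existing ones.
compose <- function(f, g) function(x) g(f(x))
mean_then_round <- compose(mean, round)
rounded <- mean_then_round(c(1.2, 2.9, 3.1))
```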
How it Works: Design
• Interpreted language
How it Works: Install
• Hosted on the Comprehensive R Archive Network
(CRAN)
• 54 megabytes
http://cran.rstudio.com/
• Download and Install R
• Precompiled binary distributions of the base system
and contributed packages, Windows and Mac users
most likely want one of these versions of R:
• Download R for Linux
• Download R for (Mac) OS X
• Download R for Windows
• R is part of many Linux distributions, you should check
with your Linux package management system in
addition to the link above.
How it Works: Command Line
How it Works: Packages
• Base
– mean()
• Utils
– read.csv()
• Stats
– lm()
– sd()
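The four functions named above can be tried together in a short sketch. The package:: prefix is optional for these packages, since they are attached by default; it is used here only to show where each function lives. The data values are made up for illustration.

```r
# Namespace-qualified calls make the owning package explicit.
x <- c(2, 4, 4, 4, 5, 5, 7, 9)

avg    <- base::mean(x)     # 5
spread <- stats::sd(x)      # sample standard deviation

# read.csv() lives in utils; text= reads from a string instead of a file.
csv_text <- "height,weight\n150,52\n160,58\n170,66\n180,72"
people <- utils::read.csv(text = csv_text)

# lm() lives in stats; fit weight as a linear function of height.
fit <- stats::lm(weight ~ height, data = people)
slope <- unname(stats::coef(fit)["height"])
```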
Packages
• Mostly hosted on CRAN
• Many others hosted elsewhere
– GitHub
– RStudio
– Bioconductor
– RevolutionR
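For packages hosted on CRAN, installation usually goes through install.packages(). The sketch below wraps it in a hypothetical helper, ensure_package, that installs only when the package is missing. The mirror URL is an assumption; any CRAN mirror should work.

```r
# Hypothetical helper: install a package from CRAN only if it is missing.
ensure_package <- function(pkg, repos = "https://cran.r-project.org") {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    install.packages(pkg, repos = repos)
  }
  # Report whether the package can be loaded now.
  requireNamespace(pkg, quietly = TRUE)
}

# "stats" ships with R, so this call never touches the network.
have_stats <- ensure_package("stats")
```

Packages hosted outside CRAN generally need their own installation route, which this sketch does not cover.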
Packages
• source: http://r4stats.com/articles/popularity/
Top 10 Most Popular Packages
• source: Revolution Analytics Blog
Data Frame
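A small sketch of what a data frame looks like in practice, using made-up data. Each column is a vector, and all columns have the same length.

```r
# Hypothetical data: one row per player, one column per variable.
players <- data.frame(
  name  = c("Ana", "Ben", "Cy"),
  goals = c(12, 7, 3),
  stringsAsFactors = FALSE
)

goal_column <- players$goals               # one column as a numeric vector
top <- players[players$goals > 5, ]        # rows filtered by a logical condition
players$per_game <- players$goals / 10     # add a derived column
dims <- dim(players)                       # rows, then columns
```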
Capabilities
• ANALYTICS
– Basic Mathematics
– Basic Statistics
– Probability Distributions
– Machine Learning
– Optimization and Mathematical Programming
– Signal Processing
– Simulation and Random Number Generation
– Statistical Modeling
– Statistical Tests
• GRAPHICS AND VISUALIZATION
– Static Graphics
– Dynamic Graphics
– Devices and Formats
Model & Plot
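One way to try modeling and plotting together is sketched below. It assumes the built-in cars data set (50 rows, columns speed and dist); the model and the plot on the original slide may have been different.

```r
# cars ships with R in the datasets package.
str(cars)

# Fit stopping distance as a linear function of speed.
fit <- lm(dist ~ speed, data = cars)
coefs <- coef(fit)

# Draw the scatter plot and the fitted line only in an interactive session.
if (interactive()) {
  plot(dist ~ speed, data = cars, main = "Stopping distance vs. speed")
  abline(fit, col = "red")
}
```

Calling summary(fit) afterwards prints the coefficient table and fit statistics.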
GUI: RStudio
How Does it Compare?
                | R | SAS | SPSS Professional | MATLAB
Cost            | Free! | Very VERY High | High - $9,975 | High
Documentation   | Yes | Very comprehensive | OK | Some examples
Training Course | NA | Yes | Yes | Yes
User interface  | Low | Medium | Best | Medium
Output          | Separate commands | Automatically produce diagnosis graph and forecast | Totally automated | Some automated via GUI, some specific command
Models*: Does not have STL moving average | Does not have ARCH/GARCH + and other moving average models | Does not have MA & decomposition models
Certification Program: Yes | Yes | Yes
Commercial Support
• Version 3.1.1
7/10/2014
• source
Where Is It Now?
Where it’s Going
Source: Revolution Analytics Blog
Where it’s Going: Extensions and
Interactions
• Rcpp
– Transfer from R to C++, and from C++ to R
• Rllvm
– Creates compiled code via LLVM
• H2O
– Big data package
The best thing about R is
that it was developed by
statisticians. The worst
thing about R is that...it
was developed by
statisticians.
Bo Cowgill
Good: Open Source
• So many contributors
• Free!
• Community
Bad: Open Source
• No customer support
• Features
Good: Frequent Updates
• Always new packages
• New updates and bug fixes
Bad: Frequent Updates
• Package updates
• R updates
Bad: Documentation
Bad: Speed
• 40-year-old code
• Interpreted
• Single threaded
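The cost of interpretation shows up most clearly in explicit loops. The sketch below, with hypothetical function names, computes the same sum of squares twice: once with an R-level loop and once with vectorized calls that do the looping in compiled code. Timings vary by machine, so none are quoted here.

```r
n <- 1e5
x <- seq_len(n)

# Interpreted loop: one R-level operation per element.
loop_sum_sq <- function(v) {
  total <- 0
  for (value in v) total <- total + value^2
  total
}

# Vectorized form: the looping happens inside compiled code.
vec_sum_sq <- function(v) sum(as.numeric(v)^2)

loop_time <- system.time(loop_result <- loop_sum_sq(x))[["elapsed"]]
vec_time  <- system.time(vec_result  <- vec_sum_sq(x))[["elapsed"]]
```

Comparing loop_time with vec_time on your own machine gives a feel for the gap; both functions return the same value.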
Bad: Memory
• All stored in memory
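Because objects live in memory, it can help to check how large they are. A minimal sketch using object.size() from the utils package; the vector length is arbitrary.

```r
x <- numeric(1e6)                          # one million doubles, 8 bytes each
size_bytes <- as.numeric(object.size(x))   # a little over 8 million bytes
size_label <- format(object.size(x), units = "MB")

rm(x)                                      # drop the reference ...
invisible(gc())                            # ... and let the garbage collector reclaim it
```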
Soccer Example
@11tegen11
Congressional Approval Rating
@adamramey
Source: fun with R
Use Cases 4
How to Learn: RStudio
How to Learn: DataCamp
How to Learn: Springer Series
How to Learn: The Art of R Programming
How to Learn: Boot Camp
Boston Predictive Analytics Meetup
How to Learn: Online Videos
Web Resources:
UseR Groups & Conferences
Closing Thoughts
Thank You
Questions?
