SlideShare una empresa de Scribd logo
1 de 22
Descargar para leer sin conexión
Why Data Science is
Something You Should
Care About

Presented @ South Dakota Code Camp 2012


Ryan Swanstrom @swgoof
About Ryan Swanstrom
Find me on the web


       http://twitter.com/swgoof

       http://linkedin.com/in/ryanswanstrom

       http://datascience101.wordpress.com/
Data Science

"[ability to] obtain, scrub, explore, model and
interpret data, blending hacking, statistics, and
machine learning."
                  definition by Hilary Mason, Chief Scientist @ Bit.ly
Data Science




http://www.drewconway.com/zia/?p=2378
Who is a data scientist?




http://onforb.es/WNLnRu
Big Data

Any dataset where the size or speed of
incoming data causes difficulties in processing

  ● Volume
  ● Velocity
  ● Variety
Hadoop
"[...] a framework that allows for the distributed
processing of large data sets across clusters of
computers using simple programming models."
                                  Apache Hadoop Website



  ● HDFS - Hadoop Distributed File System
  ● MapReduce
Lots of Data


      18 Months
 the amount of time for digital data to double
Data Products
Why Do You Care?
McKinsey Global Big Data Report

● 140k - 190k Unfilled Jobs by 2018

● 1.5M Managers & Analysts
Indeed Data Science Job Listings




http://www.indeed.com/jobtrends?q=Data-science&relative=1
Now That You Care, What Skills?

   1.   Machine Learning
   2.   Statistics
   3.   Story Telling (Communication)
   4.   Big Data
   5.   Algorithms
   6.   Curiosity
College and University




 http://datascience101.wordpress.com/2012/04/09/colleges-with-data-science-degrees/
 http://whatsthebigdata.com/2012/08/09/graduate-programs-in-big-data-and-data-science/
College and University
         Pros                  Cons

●   Credentials       ●   Expensive
●   Experts           ●   Not Individualized
●   Familiar          ●   School
●   Widely Accepted   ●   Lengthy
●   Structured        ●   Inflexible
                      ●   Not Real World
Corporate Training




               General Assembly - Not really Corp Training,
                                   but it looks really good
Corporate Training
         Pros                  Cons

●   Short Timeframe   ●   Expensive
●   Experts           ●   Not Individualized
●   Certificates      ●   Product Focused
●   Business-Savy     ●   Sales Pitch
●   Real World
●   Structured
MOOCs (Massive Open Online
Courses)
MOOCs (Massive Open Online
Courses)
        Pros           Cons

● Free          ● No Credentials
● Experts       ● Single Course
● Flexible      ● No Programs (Yet)
Blogs/Wikis/Other
Blogs/Wikis/Other
          Pros               Cons

●   Free            ●   Quality?
●   Very Specific   ●   No Credentials
●   Short           ●   No Structure
●   Lots of them    ●   Too many!
Blogs/Wikis/Other
              The Problem

 ● What content is good?

 ● What order should I cover the content?

 ● Where do I find new content?

 ● Who can help me understand?
Data Science 201 - coming soon

     http://www.datascience201.com
          Helping you find the best
        data science learning content!




                Thank You

Más contenido relacionado

Destacado

What is a Data Scientist
What is a Data Scientist What is a Data Scientist
What is a Data Scientist Experian_US
 
Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps! Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps! PromptCloud
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientistPoo Kuan Hoong
 
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Standardizing +113 million Merchant Names in Financial Services with Greenplu...Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Standardizing +113 million Merchant Names in Financial Services with Greenplu...Data Science London
 
How to become a data scientist in 6 months
How to become a data scientist in 6 monthsHow to become a data scientist in 6 months
How to become a data scientist in 6 monthsTetiana Ivanova
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data ScienceEdureka!
 
10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle Competitions10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle CompetitionsDataRobot
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientistryanorban
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networkDEEPASHRI HK
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentationlpaviglianiti
 
Tips for data science competitions
Tips for data science competitionsTips for data science competitions
Tips for data science competitionsOwen Zhang
 
Tutorial on Deep learning and Applications
Tutorial on Deep learning and ApplicationsTutorial on Deep learning and Applications
Tutorial on Deep learning and ApplicationsNhatHai Phan
 
Hadoop and Machine Learning
Hadoop and Machine LearningHadoop and Machine Learning
Hadoop and Machine Learningjoshwills
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDevashish Shanker
 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The PeopleDaniel Tunkelang
 
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...Sebastian Raschka
 
A tutorial on deep learning at icml 2013
A tutorial on deep learning at icml 2013A tutorial on deep learning at icml 2013
A tutorial on deep learning at icml 2013Philip Zheng
 
Hands-on Deep Learning in Python
Hands-on Deep Learning in PythonHands-on Deep Learning in Python
Hands-on Deep Learning in PythonImry Kissos
 

Destacado (20)

What is a Data Scientist
What is a Data Scientist What is a Data Scientist
What is a Data Scientist
 
Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps! Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps!
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientist
 
Data Science Thailand Meetup#11
Data Science Thailand Meetup#11Data Science Thailand Meetup#11
Data Science Thailand Meetup#11
 
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Standardizing +113 million Merchant Names in Financial Services with Greenplu...Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
 
Ibm data science experience
Ibm data science experienceIbm data science experience
Ibm data science experience
 
How to become a data scientist in 6 months
How to become a data scientist in 6 monthsHow to become a data scientist in 6 months
How to become a data scientist in 6 months
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
 
10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle Competitions10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle Competitions
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientist
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
Tips for data science competitions
Tips for data science competitionsTips for data science competitions
Tips for data science competitions
 
Tutorial on Deep learning and Applications
Tutorial on Deep learning and ApplicationsTutorial on Deep learning and Applications
Tutorial on Deep learning and Applications
 
Hadoop and Machine Learning
Hadoop and Machine LearningHadoop and Machine Learning
Hadoop and Machine Learning
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language Processing
 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The People
 
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...An Introduction to Supervised Machine Learning and Pattern Classification: Th...
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
 
A tutorial on deep learning at icml 2013
A tutorial on deep learning at icml 2013A tutorial on deep learning at icml 2013
A tutorial on deep learning at icml 2013
 
Hands-on Deep Learning in Python
Hands-on Deep Learning in PythonHands-on Deep Learning in Python
Hands-on Deep Learning in Python
 

Similar a Learn Data Science

Educate 2017 (Learnosity Developer Conference) Opening Keynote
Educate 2017 (Learnosity Developer Conference) Opening KeynoteEducate 2017 (Learnosity Developer Conference) Opening Keynote
Educate 2017 (Learnosity Developer Conference) Opening KeynoteLearnosity
 
How to Implement Educational Technology
How to Implement Educational Technology How to Implement Educational Technology
How to Implement Educational Technology Circulus Education
 
Large drupal site builds a workshop for sxsw interactive - march 17, 2015
Large drupal site builds   a workshop for sxsw interactive - march 17, 2015Large drupal site builds   a workshop for sxsw interactive - march 17, 2015
Large drupal site builds a workshop for sxsw interactive - march 17, 2015rgristroph
 
NACDEP 2015 - Are we entrepreneurs?
NACDEP 2015  - Are we entrepreneurs?NACDEP 2015  - Are we entrepreneurs?
NACDEP 2015 - Are we entrepreneurs?Glenn Muske
 
Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903Richard Hackathorn
 
Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)Benjamin Bengfort
 
General intro to agile tools - Forecast.it
General intro to agile tools - Forecast.itGeneral intro to agile tools - Forecast.it
General intro to agile tools - Forecast.itDennis Kayser
 
How to Digitally Transform Higher Ed with Drupal
How to Digitally Transform Higher Ed with DrupalHow to Digitally Transform Higher Ed with Drupal
How to Digitally Transform Higher Ed with DrupalMediacurrent
 
Ton stegeman donaldhessing_knowledge_management_using_social_software_nonotes
Ton stegeman donaldhessing_knowledge_management_using_social_software_nonotesTon stegeman donaldhessing_knowledge_management_using_social_software_nonotes
Ton stegeman donaldhessing_knowledge_management_using_social_software_nonotesJeong-woo Choi
 
Workshop Wednesdays - Growth Tactics For Technical Buyers with Weights & Biases
Workshop Wednesdays - Growth Tactics For Technical Buyers with Weights & BiasesWorkshop Wednesdays - Growth Tactics For Technical Buyers with Weights & Biases
Workshop Wednesdays - Growth Tactics For Technical Buyers with Weights & Biasessaastr
 
InfoVision_PM101_RPadaki
InfoVision_PM101_RPadakiInfoVision_PM101_RPadaki
InfoVision_PM101_RPadakiRavi Padaki
 
Tales of Integration and Implementation
Tales of Integration and ImplementationTales of Integration and Implementation
Tales of Integration and ImplementationTargetX
 
Lean Product Development
Lean Product DevelopmentLean Product Development
Lean Product DevelopmentHaris Aghadi
 
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.
Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.Box
 
TYE Oregon Overview
TYE Oregon OverviewTYE Oregon Overview
TYE Oregon OverviewShashi Jain
 
How to crack Big Data and Data Science roles
How to crack Big Data and Data Science rolesHow to crack Big Data and Data Science roles
How to crack Big Data and Data Science rolesUpXAcademy
 
Building SharePoint Enterprise Platforms - Off the beaten path
Building SharePoint Enterprise Platforms - Off the beaten pathBuilding SharePoint Enterprise Platforms - Off the beaten path
Building SharePoint Enterprise Platforms - Off the beaten pathAndy Talbot
 
Drupal and Devops , the Survey Results
Drupal and Devops , the Survey ResultsDrupal and Devops , the Survey Results
Drupal and Devops , the Survey ResultsKris Buytaert
 
[CXL Live 16] Opening Keynote by Peep Laja
[CXL Live 16] Opening Keynote by Peep Laja[CXL Live 16] Opening Keynote by Peep Laja
[CXL Live 16] Opening Keynote by Peep LajaCXL
 
Kalamazoo Valley Community College’s Application Integration Using Axiom Elite
Kalamazoo Valley Community College’s Application Integration Using Axiom EliteKalamazoo Valley Community College’s Application Integration Using Axiom Elite
Kalamazoo Valley Community College’s Application Integration Using Axiom EliteTargetX
 

Similar a Learn Data Science (20)

Educate 2017 (Learnosity Developer Conference) Opening Keynote
Educate 2017 (Learnosity Developer Conference) Opening KeynoteEducate 2017 (Learnosity Developer Conference) Opening Keynote
Educate 2017 (Learnosity Developer Conference) Opening Keynote
 
How to Implement Educational Technology
How to Implement Educational Technology How to Implement Educational Technology
How to Implement Educational Technology
 
Large drupal site builds a workshop for sxsw interactive - march 17, 2015
Large drupal site builds   a workshop for sxsw interactive - march 17, 2015Large drupal site builds   a workshop for sxsw interactive - march 17, 2015
Large drupal site builds a workshop for sxsw interactive - march 17, 2015
 
NACDEP 2015 - Are we entrepreneurs?
NACDEP 2015  - Are we entrepreneurs?NACDEP 2015  - Are we entrepreneurs?
NACDEP 2015 - Are we entrepreneurs?
 
Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903Staying Competitive in Data Analytics: Analyze Boulder 20140903
Staying Competitive in Data Analytics: Analyze Boulder 20140903
 
Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)
 
General intro to agile tools - Forecast.it
General intro to agile tools - Forecast.itGeneral intro to agile tools - Forecast.it
General intro to agile tools - Forecast.it
 
How to Digitally Transform Higher Ed with Drupal
How to Digitally Transform Higher Ed with DrupalHow to Digitally Transform Higher Ed with Drupal
How to Digitally Transform Higher Ed with Drupal
 
Ton stegeman donaldhessing_knowledge_management_using_social_software_nonotes
Ton stegeman donaldhessing_knowledge_management_using_social_software_nonotesTon stegeman donaldhessing_knowledge_management_using_social_software_nonotes
Ton stegeman donaldhessing_knowledge_management_using_social_software_nonotes
 
Workshop Wednesdays - Growth Tactics For Technical Buyers with Weights & Biases
Workshop Wednesdays - Growth Tactics For Technical Buyers with Weights & BiasesWorkshop Wednesdays - Growth Tactics For Technical Buyers with Weights & Biases
Workshop Wednesdays - Growth Tactics For Technical Buyers with Weights & Biases
 
InfoVision_PM101_RPadaki
InfoVision_PM101_RPadakiInfoVision_PM101_RPadaki
InfoVision_PM101_RPadaki
 
Tales of Integration and Implementation
Tales of Integration and ImplementationTales of Integration and Implementation
Tales of Integration and Implementation
 
Lean Product Development
Lean Product DevelopmentLean Product Development
Lean Product Development
 
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.
Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.
 
TYE Oregon Overview
TYE Oregon OverviewTYE Oregon Overview
TYE Oregon Overview
 
How to crack Big Data and Data Science roles
How to crack Big Data and Data Science rolesHow to crack Big Data and Data Science roles
How to crack Big Data and Data Science roles
 
Building SharePoint Enterprise Platforms - Off the beaten path
Building SharePoint Enterprise Platforms - Off the beaten pathBuilding SharePoint Enterprise Platforms - Off the beaten path
Building SharePoint Enterprise Platforms - Off the beaten path
 
Drupal and Devops , the Survey Results
Drupal and Devops , the Survey ResultsDrupal and Devops , the Survey Results
Drupal and Devops , the Survey Results
 
[CXL Live 16] Opening Keynote by Peep Laja
[CXL Live 16] Opening Keynote by Peep Laja[CXL Live 16] Opening Keynote by Peep Laja
[CXL Live 16] Opening Keynote by Peep Laja
 
Kalamazoo Valley Community College’s Application Integration Using Axiom Elite
Kalamazoo Valley Community College’s Application Integration Using Axiom EliteKalamazoo Valley Community College’s Application Integration Using Axiom Elite
Kalamazoo Valley Community College’s Application Integration Using Axiom Elite
 

Learn Data Science

  • 1. Why Data Science is Something You Should Care About Presented @ South Dakota Code Camp 2012 Ryan Swanstrom @swgoof
  • 2. About Ryan Swanstrom Find me on the web http://twitter.com/swgoof http://linkedin.com/in/ryanswanstrom http://datascience101.wordpress.com/
  • 3. Data Science "[ability to] obtain, scrub, explore, model and interpret data, blending hacking, statistics, and machine learning." definition by Hilary Mason, Chief Scientist @ Bit.ly
  • 5. Who is a data scientist? http://onforb.es/WNLnRu
  • 6. Big Data Any dataset where the size or speed of incoming data causes difficulties in processing ● Volume ● Velocity ● Variety
  • 7. Hadoop "[...] a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models." Apache Hadoop Website ● HDFS - Hadoop Distributed File System ● MapReduce
  • 8. Lots of Data 18 Months the amount of time for digital data to double
  • 10. Why Do You Care? McKinsey Global Big Data Report ● 140k - 190k Unfilled Jobs by 2018 ● 1.5M Managers & Analysts
  • 11. Indeed Data Science Job Listings http://www.indeed.com/jobtrends?q=Data-science&relative=1
  • 12. Now That You Care, What Skills? 1. Machine Learning 2. Statistics 3. Story Telling (Communication) 4. Big Data 5. Algorithms 6. Curiosity
  • 13. College and University http://datascience101.wordpress.com/2012/04/09/colleges-with-data-science-degrees/ http://whatsthebigdata.com/2012/08/09/graduate-programs-in-big-data-and-data-science/
  • 14. College and University Pros Cons ● Credentials ● Expensive ● Experts ● Not Individualized ● Familiar ● School ● Widely Accepted ● Lengthy ● Structured ● Inflexible ● Not Real World
  • 15. Corporate Training General Assembly - Not really Corp Training, but it looks really good
  • 16. Corporate Training Pros Cons ● Short Timeframe ● Expensive ● Experts ● Not Individualized ● Certificates ● Product Focused ● Business-Savy ● Sales Pitch ● Real World ● Structured
  • 17. MOOCs (Massive Open Online Courses)
  • 18. MOOCs (Massive Open Online Courses) Pros Cons ● Free ● No Credentials ● Experts ● Single Course ● Flexible ● No Programs (Yet)
  • 20. Blogs/Wikis/Other Pros Cons ● Free ● Quality? ● Very Specific ● No Credentials ● Short ● No Structure ● Lots of them ● Too many!
  • 21. Blogs/Wikis/Other The Problem ● What content is good? ● What order should I cover the content? ● Where do I find new content? ● Who can help me understand?
  • 22. Data Science 201 - coming soon http://www.datascience201.com Helping you find the best data science learning content! Thank You