SlideShare una empresa de Scribd logo
1 de 31
Descargar para leer sin conexión
Data Science for Developers
Quick & Dirty Introduction to Data Science tool’s Ecosystem
@pdelboca @pcelayes
Patricio Del Boca Pablo Celayes
Agenda
¿Por qué esta charla?
¿Por qué Data Science?
Data Science
Herramientas
Preguntas
¿Por qué esta charla?
Objetivo
Recorrido por la definición de la disciplina y las principales herramientas que hay
para trabajar.
¿Por qué Data Science?
2015
Bajos costos de procesamiento,
muchos datos,
y algoritmos.
Raise of Data Science
“Data Science is a team sport.”
DJ Patil - Chief Data Scientist @ White House
”
Visualizing Nepal’s Earthquake
The Human Size of Data Science
Diabetic Retinopathy Detection
Identify signs of diabetic retinopathy in eye
images
Click-Through Rate Prediction
Predict whether a mobile ad will be clicked.
Improve Healthcare
Identify patients who will be admitted to a
hospital within the next year using historical
claims data.
A taxonomy for Data Science (V 2.0)
Methodology Data
Manipulation
Data
Modeling
Data
Visualization
Define
Obtain
Scrub
Explore
Model
Interpret
Communicate
http://www.dataists.com/2010/09/a-taxonomy-of-data-science/
A taxonomy for Data Science (V 2.0)
Methodology Data
Manipulation
Data
Modeling
Data
Visualization
Define
Obtain
Scrub
Explore
Model
Interpret
Communicate
http://www.dataists.com/2010/09/a-taxonomy-of-data-science/
A taxonomy for Data Science (V 2.0)
Methodology Data
Manipulation
Data
Modeling
Data
Visualization
Define X X
Obtain X
Scrub X
Explore X X X
Model X X
Interpret X X X
Communicate X X
http://www.dataists.com/2010/09/a-taxonomy-of-data-science/
Data Scientist Main Toolkit
Define what are you trying to solve
Data Manipulation
dplyr
http://pandas.pydata.org/https://github.com/hadley/dplyr
Data Modeling in R
randomForest
lm
nnet
gbm
e1071
Data Modeling in Python
sklearn.tree.DecisionTreeClassifier
sklearn.linear_model.LinearRegression
sklearn.svm.SVC
sklearn.svm.SVR
sklearn.ensemble.RandomForestClassifier
sklearn.ensemble.GradientBoostingClassifier
Data Modeling
CRAN
https://cran.r-project.org/web/views/MachineLearning.html http://scikit-learn.org/
Data Visualization
R Base Graphics
Lattice
Bokeh
seaborn
pandas.plot()
Data Visualization
+
Power up!
“The best minds of my generation are thinking
about how to make people click ads.
… that sucks.”
Jeff Hamerbacher - Former Data Scientist @ Facebook
”
Preguntas?
?
Muchas Gracias!
http://www.meetup.com/es/Encuentros-Data-Science-Cordoba/

Más contenido relacionado

La actualidad más candente

Transforming Business with Intelligent Data
Transforming Business with Intelligent DataTransforming Business with Intelligent Data
Transforming Business with Intelligent Data
ashbhatia
 
Skytree Partner Program 2-15
Skytree Partner Program 2-15Skytree Partner Program 2-15
Skytree Partner Program 2-15
Dylan Steeg
 

La actualidad más candente (18)

Agile Data Science
Agile Data ScienceAgile Data Science
Agile Data Science
 
1645 dyskant using our laptop
1645 dyskant using our laptop1645 dyskant using our laptop
1645 dyskant using our laptop
 
Transforming Business with Intelligent Data
Transforming Business with Intelligent DataTransforming Business with Intelligent Data
Transforming Business with Intelligent Data
 
J mc callumbig datapsp2013
J mc callumbig datapsp2013J mc callumbig datapsp2013
J mc callumbig datapsp2013
 
R&D Search 081013 Search Solutions Conference
R&D Search 081013 Search Solutions ConferenceR&D Search 081013 Search Solutions Conference
R&D Search 081013 Search Solutions Conference
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia article
 
Enterprise Data World Webinar: Make BIG DATA Work for You
Enterprise Data World Webinar: Make BIG DATA Work for YouEnterprise Data World Webinar: Make BIG DATA Work for You
Enterprise Data World Webinar: Make BIG DATA Work for You
 
Skytree Partner Program 2-15
Skytree Partner Program 2-15Skytree Partner Program 2-15
Skytree Partner Program 2-15
 
A Modern Data Strategy for Precision Medicine
A Modern Data Strategy for Precision MedicineA Modern Data Strategy for Precision Medicine
A Modern Data Strategy for Precision Medicine
 
Big data Competitions by Komes Chandavimol
Big data Competitions by Komes ChandavimolBig data Competitions by Komes Chandavimol
Big data Competitions by Komes Chandavimol
 
Bigdata
BigdataBigdata
Bigdata
 
Insight into AstraZeneca's Technology Services.
Insight into AstraZeneca's Technology Services.Insight into AstraZeneca's Technology Services.
Insight into AstraZeneca's Technology Services.
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Understand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.SUnderstand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.S
 
Automating Data Curation with AI and NLP for Biomedical Graph Applications
Automating Data Curation with AI and NLP for Biomedical Graph ApplicationsAutomating Data Curation with AI and NLP for Biomedical Graph Applications
Automating Data Curation with AI and NLP for Biomedical Graph Applications
 
Data quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeData quality - The True Big Data Challenge
Data quality - The True Big Data Challenge
 
Big Data Analytics: Challenge or Opportunity?
Big Data Analytics: Challenge or Opportunity?Big Data Analytics: Challenge or Opportunity?
Big Data Analytics: Challenge or Opportunity?
 
Data Science: Philosopher's Stone
Data Science: Philosopher's StoneData Science: Philosopher's Stone
Data Science: Philosopher's Stone
 

Similar a Data science for developers

Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
Jordan Engbers
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
MuhammadTahiriqbal13
 

Similar a Data science for developers (20)

Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
 
Data science
Data scienceData science
Data science
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal Ball
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
 
Data science
Data scienceData science
Data science
 
David Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scanDavid Cocker big data MDCPartners ta-scan
David Cocker big data MDCPartners ta-scan
 
Fair by design
Fair by designFair by design
Fair by design
 
365 Data Science
365 Data Science365 Data Science
365 Data Science
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Frankie Rybicki slide set for Deep Learning in Radiology / Medicine
Frankie Rybicki slide set for Deep Learning in Radiology / MedicineFrankie Rybicki slide set for Deep Learning in Radiology / Medicine
Frankie Rybicki slide set for Deep Learning in Radiology / Medicine
 
Insight white paper_2014
Insight white paper_2014Insight white paper_2014
Insight white paper_2014
 
Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"
 
ODI Overview 2013-04-09
ODI Overview 2013-04-09ODI Overview 2013-04-09
ODI Overview 2013-04-09
 
Accretive Health - Quality Management in Health Care
Accretive Health - Quality Management in Health CareAccretive Health - Quality Management in Health Care
Accretive Health - Quality Management in Health Care
 
How Your Data Can Predict The Future
How Your Data Can Predict The FutureHow Your Data Can Predict The Future
How Your Data Can Predict The Future
 
Big Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical DevicesBig Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical Devices
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 

Data science for developers