DATA SCIENCE PROJECT
KEVIN BLUER
DSRUPTION
http://www.dsruption.com/
TRENDS
TECHNOLOGY
http://www.dsruption.com/trend/wearable-computing
GOALS
Derive insight from Dsruption (www.dsruption.com)
Focus on establishing company (startup) momentum & insights
#1 Article popularity (FB / Twitter shares)
#2 Auto generation of article tags
FEATURES
dsruption.activity, 691 documents (744 KB)
dsruption.articles, 14022 documents (125.61 MB)
dsruption.comment, 43 documents (40 KB)
dsruption.companies, 524 documents (3.65 MB)
dsruption.tags, 329 documents (40 KB)
dsruption.trends, 32 documents (140 KB)
dsruption.users, 39 documents (632 KB)
TECHNOLOGIES
MongoDB
JavaScript and Node.js
D3.js
Hadoop
Python
Facebook and Twitter APIs
ARTICLE POPULARITY
IMPORTING TWEETS & SHARES
http://www.dsruption.com/dwolla/json-social
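The popularity goal combines Facebook and Twitter share counts per article. The slides do not show the shape of the imported data, so the following is only a minimal sketch: the field names `title`, `facebook_shares`, and `tweets`, the sample records, and the choice of a plain sum as the score are all assumptions made for illustration.

```python
from typing import Dict, List

# Hypothetical records: field names and values are illustrative only,
# not the actual output of the json-social endpoint.
articles: List[Dict] = [
    {"title": "Article A", "facebook_shares": 12, "tweets": 30},
    {"title": "Article B", "facebook_shares": 40, "tweets": 5},
    {"title": "Article C", "facebook_shares": 0, "tweets": 2},
]


def popularity(article: Dict) -> int:
    """Score an article as the sum of its Facebook shares and tweets."""
    return article.get("facebook_shares", 0) + article.get("tweets", 0)


def rank_by_popularity(items: List[Dict]) -> List[Dict]:
    """Return the articles sorted from most to least shared."""
    return sorted(items, key=popularity, reverse=True)


ranked_titles = [a["title"] for a in rank_by_popularity(articles)]
```

A weighted sum, or separate rankings per network, would work equally well here; the plain sum is just the simplest option.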
SIMPLE D3.JS VISUALIZATION
http://www.dsruption.com/dwolla/visualize
COMPANY TAGS FROM ARTICLES
HADOOP -> MONGO
http://www.dsruption.com/dwolla/articles
http://www.dsruption.com/data/dwolla.json
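The slides name Hadoop as the step that derives company tags from articles but do not show the job itself. The following is a rough sketch of the map and reduce phases in plain Python, not the author's Hadoop code: the company list, article texts, and function names are hypothetical, and loading the result into MongoDB is left out.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical inputs: company names and article texts are made up.
COMPANIES = ["Dwolla", "Square"]
ARTICLES = {
    "a1": "Dwolla lowers fees for small business payments.",
    "a2": "Square and Dwolla compete on mobile payments.",
    "a3": "A look at wearable computing trends.",
}


def map_article(article_id: str, text: str) -> Iterator[Tuple[str, str]]:
    """Map step: emit (company, article_id) for each company named in the text."""
    lowered = text.lower()
    for company in COMPANIES:
        if company.lower() in lowered:
            yield company, article_id


def reduce_tags(pairs: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Reduce step: group article ids under each company tag."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for company, article_id in pairs:
        grouped[company].append(article_id)
    return {company: sorted(ids) for company, ids in grouped.items()}


pairs = [p for aid, text in ARTICLES.items() for p in map_article(aid, text)]
company_tags = reduce_tags(pairs)
```

Substring matching is deliberately naive: a company name that is also a common word would produce false tags, so a real job would likely need word-boundary matching or a curated alias list.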
BEAUTIFUL SOUP
<p><ul><li><span style="font-family: arial;"><i>100,000 refrigerators
and freezers have now made their way through the revolutionary
UNTHA Recycling Technology system</i></span></li><li><span
style="font-family: arial;"><i>Innovative recycling system reduces
landfill waste and greenhouse gas and ozone-depleting substance
emissions</i></span></li><li><span style="font-family:
arial;"><i>Initiative has diverted 5.5 million pounds of material from
U.S. landfills<b><a href="#_ftn1"
name="_ftnref1">[1]</a></b></i></span></li></ul> </p><p
style="text-indent: -0.25in;"><i><b><br/><br/></b></i></p><p><div><br/><div id="ftn1">
</div> </div></p>
100,000 refrigerators and freezers have now made their way through
the revolutionary UNTHA Recycling Technology systemInnovative
recycling system reduces landfill waste and greenhouse gas and
ozone-depleting substance emissionsInitiative has diverted 5.5
million pounds of material from U.S. landfills.
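The slide above shows article HTML before and after tag stripping. The sketch below illustrates the same idea with Python's standard-library `html.parser`, used here as a stand-in so the example has no third-party dependency; with Beautiful Soup the equivalent call would be `BeautifulSoup(html, "html.parser").get_text()`. The class name, function name, and sample markup are assumptions for illustration.

```python
from html.parser import HTMLParser
from typing import List


class TextExtractor(HTMLParser):
    """Collect the text nodes of an HTML fragment and ignore the tags."""

    def __init__(self) -> None:
        super().__init__()
        self.chunks: List[str] = []

    def handle_data(self, data: str) -> None:
        # Keep only non-empty text; whitespace-only nodes are dropped.
        if data.strip():
            self.chunks.append(data.strip())


def strip_html(html: str) -> str:
    """Return the visible text of an HTML fragment, one space between nodes."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Shortened, made-up markup in the style of the article body shown above.
sample = (
    '<p><ul><li><span style="font-family: arial;"><i>First point</i></span></li>'
    '<li><i>Second point<b><a href="#_ftn1">[1]</a></b></i></li></ul></p>'
)
clean = strip_html(sample)
```

Joining the text nodes with a space keeps adjacent list items from running together, which matters for the word counts that follow.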
LOTS OF NOISE
http://www.dsruption.com/dwolla/words
EXCLUDE NOISE
count: 252, word: "Dwolla"
count: 73, word: "money"
count: 45, word: "photo"
count: 44, word: "people"
count: 42, word: "pay"
count: 39, word: "payment"
count: 35, word: "payments"
count: 34, word: "business"
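Counts like the ones above can be produced by tallying words after filtering out common "noise" words. This is a minimal sketch under stated assumptions: the stop-word list, the tokenizing regex, and the sample sentence are illustrative and are not taken from the project.

```python
import re
from collections import Counter
from typing import List, Set, Tuple

# Illustrative stop-word list (an assumption); a real one would be much longer.
STOP_WORDS: Set[str] = {"the", "a", "and", "to", "of", "with", "for", "on"}


def top_words(text: str, n: int = 3) -> List[Tuple[str, int]]:
    """Count lowercased words, skipping stop words, and return the n most common."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(w for w in words if w not in STOP_WORDS)
    return counts.most_common(n)


# Made-up sample text, not an actual article.
sample = "Dwolla moves money. Pay with Dwolla and send money to people on Dwolla."
result = top_words(sample)
```

This version lowercases every word, so "Dwolla" is counted as "dwolla". It also treats "payment" and "payments" as separate words, as the list above does; merging them would need a stemming step.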
WHAT’S NEXT?
Sentiment analysis (on both articles and comments)
Integration of additional datasets (Crunchbase, etc.)
Broader Visualization
THANK YOU 

Data science project presentation