Data Science Applications & Use Cases
Instructor: Ekpe Okorafor
1. Accenture – Big Data Academy
2. Computer Science, African University of Science & Technology
Objectives
• Understand Big Data challenges
• What exactly Data Science is and what Data Scientists do
• Data Science contrasted with other disciplines
• Case studies & use cases
Outline
• Big Data & Challenges
• What is Data Science
• Data Science & Academia
• Data Science & Others
• Case Studies
• Essential points
• Conclusion
Data All Around
• Lots of data is being collected and warehoused
– Scientific experiments
– Internet of Things
– Web data, e-commerce
– Financial transactions, bank/credit transactions
– Online trading and purchasing
– Social networks
– … and many more!
Big Data
• Big Data are data sets so large or so complex that traditional methods of storing, accessing, and analyzing them break down or become too expensive. However, there is a lot of potential value hidden in this data, so organizations are eager to harness it to drive innovation and competitive advantage.
• Big Data technologies and approaches are used to derive value from data-rich environments in ways that traditional analytics tools and methods cannot.
What To Do With These Data?
• Aggregation and Statistics
– Data warehousing and OLAP
• Indexing, Searching, and Querying
– Keyword based search
– Pattern matching (XML/RDF)
• Knowledge discovery
– Data Mining
– Statistical Modeling
• Data Driven
– Predictive Analytics
– Deep Learning
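The first two items in the list above (aggregation and keyword-based search) can be sketched in a few lines of Python. This is a minimal illustration only: the `transactions` records and the helper names `total_by` and `keyword_search` are hypothetical and do not come from the slides, and a real data warehouse or search engine would do the same work at far larger scale.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical transaction records (illustrative data, not from the slides).
transactions = [
    {"region": "north", "product": "laptop", "amount": 1200.0},
    {"region": "north", "product": "phone", "amount": 800.0},
    {"region": "south", "product": "laptop", "amount": 1000.0},
    {"region": "south", "product": "laptop charger", "amount": 50.0},
]


def total_by(records, key):
    """Roll up the 'amount' field along one dimension (an OLAP-style aggregate)."""
    totals = defaultdict(float)
    for record in records:
        totals[record[key]] += record["amount"]
    return dict(totals)


def keyword_search(records, keyword):
    """Return the records whose product description contains the keyword."""
    keyword = keyword.lower()
    return [r for r in records if keyword in r["product"].lower()]


totals_by_region = total_by(transactions, "region")
average_amount = mean(r["amount"] for r in transactions)
laptop_hits = keyword_search(transactions, "laptop")

print(totals_by_region)
print(average_amount)
print([r["product"] for r in laptop_hits])
```

Aggregation answers "how much, per group" questions, while search retrieves individual records; knowledge discovery and predictive analytics build on both.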
Big Data & Data Science
• “… the sexy job in the next 10 years will be statisticians,” Hal Varian, Google Chief Economist
• The U.S. will need 140,000-190,000 predictive analysts and 1.5 million managers/analysts by 2018 (McKinsey Global Institute, June 2011).
• New Data Science institutes being created or repurposed – NYU, Columbia, Washington, UCB, ...
• New degree programs, courses, boot-camps:
– e.g., at Berkeley: Stats, I-School, CS, Astronomy…
– One proposal (elsewhere) for an MS in “Big Data Science”
– Plans for Data Science Stream at AUST
– RDA-CODATA School of Research Data Science
What is Data Science?
• Some definitions link computational, statistical, and substantive expertise.
• Other definitions focus more on technical skills alone.
• An area that manages, manipulates, extracts, and interprets knowledge from tremendous amounts of data
• Data science (DS) is a multidisciplinary field of study whose goal is to address the challenges in big data
• Data science principles apply to all data – big and small
• Theories and techniques from many fields and disciplines are used to investigate and analyze large amounts of data to help decision makers in many industries such as science, engineering, economics, politics, finance, and education
– Computer Science
• Pattern recognition, visualization, data warehousing, high-performance computing, databases, AI
– Mathematics
• Mathematical modeling
– Statistics
• Statistical and stochastic modeling, probability
Data Science Vs Analysis Vs Software Delivery
• Tools
– Traditional Analysis: SAS, R, Excel, SQL, in-house tools
– Traditional Software Delivery: Java, source control, Linux, continuous integration, unit testing, bug reports and project management
– Data Science: R, Java, scientific Python libraries, Excel, SQL, Hadoop, Hive, Pig, Mahout and other machine learning libraries, GitHub for source control and issue management
• Analytical Methods
– Traditional Analysis: Regressions, classifications, measuring prediction accuracy and coverage/error, sampling
– Traditional Software Delivery: N/A
– Data Science: Classification, clustering, similarity detection, recommenders, unsupervised and supervised learning, small- and large-scale computations, measuring prediction accuracy and coverage/error
• Team Structure
– Traditional Analysis: Statisticians, Mathematicians, Scientists
– Traditional Software Delivery: Developers, Project Managers, Systems Engineers
– Data Science: Mathematicians, Statisticians, Scientists, Developers, Systems Engineers
• Time Frame
– Traditional Analysis: Either usually on-going research and discovery within a team in the organization, or a specific project to determine answers
– Traditional Software Delivery: Regular software release cycle, continuous delivery, etc.
– Data Science: Either a discovery/learning phase leading to product development, or on-going research and product invention/improvement
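The comparison above lists "measuring prediction accuracy and coverage/error" under both traditional analysis and data science. The sketch below shows one plausible way to compute those three numbers for a classifier that is allowed to abstain. The definitions are assumptions made for illustration (coverage as the share of cases that received any prediction, accuracy and error measured only on covered cases), and the `evaluate` helper and the sample labels are hypothetical.

```python
def evaluate(predictions, actuals):
    """Compute coverage, accuracy, and error for a list of predictions.

    A prediction of None means the model abstained on that case.
    Coverage is the fraction of cases with a prediction; accuracy and
    error are measured only on the covered cases (an assumed convention).
    """
    covered = [(p, a) for p, a in zip(predictions, actuals) if p is not None]
    coverage = len(covered) / len(actuals)
    correct = sum(1 for p, a in covered if p == a)
    accuracy = correct / len(covered) if covered else 0.0
    return {"coverage": coverage, "accuracy": accuracy, "error": 1.0 - accuracy}


# Hypothetical labels: the model abstains on the third message.
predicted = ["spam", "ham", None, "spam", "ham"]
actual = ["spam", "ham", "ham", "ham", "ham"]

metrics = evaluate(predicted, actual)
print(metrics)
```

Reporting coverage next to accuracy matters because a model can look very accurate simply by declining to predict on the hard cases.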
Contrast: Scientific Computing
Scientific Modeling
• Physics-based models
• Problem-structured
• Mostly deterministic, precise
• Run on a supercomputer or high-end computing cluster
Data-Driven Approach
• General inference engine replaces model
• Structure not related to problem
• Statistical models handle true randomness and un-modeled complexity
• Run on cheaper computer clusters (EC2)
[Figure: an image is fed to a general-purpose classifier that labels it "Supernova" or "Not". Credit: Nugent group / C3 LBL]
Contrast: Machine Learning
Machine Learning
• Develop new (individual) models
• Prove mathematical properties of models
• Improve/validate on a few, relatively clean, small datasets
• Publish a paper
Data Science
• Explore many models, build and tune hybrids
• Understand empirical properties of models
• Develop/use tools that can handle massive datasets
• Take action!
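The data science column above ("explore many models", "understand empirical properties of models") can be illustrated with a small sketch that fits several candidate models and compares them on held-out data. Everything here is an assumption made for illustration: the synthetic data, the two candidate models (a constant baseline and a least-squares line), and the even/odd train/test split are not from the slides.

```python
from statistics import mean


def fit_mean(xs, ys):
    """Baseline model: always predict the mean of the training targets."""
    target_mean = mean(ys)
    return lambda x: target_mean


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar
    return lambda x: slope * x + intercept


def mse(model, xs, ys):
    """Mean squared error of a fitted model on the given data."""
    return mean((model(x) - y) ** 2 for x, y in zip(xs, ys))


# Synthetic data: roughly y = 2x + 1 plus small fixed perturbations.
noise = [0.3, -0.2, 0.1, -0.4, 0.2, -0.1, 0.4, -0.3, 0.0, 0.2]
xs = list(range(10))
ys = [2 * x + 1 + n for x, n in zip(xs, noise)]

# Hold out the odd-indexed points for testing; train on the even-indexed ones.
train_x, train_y = xs[0::2], ys[0::2]
test_x, test_y = xs[1::2], ys[1::2]

candidates = {"mean": fit_mean, "line": fit_line}
scores = {
    name: mse(fit(train_x, train_y), test_x, test_y)
    for name, fit in candidates.items()
}
best = min(scores, key=scores.get)
print(scores, best)
```

The point is the workflow rather than the models: each candidate is judged by its measured error on data it has not seen, not by its mathematical properties.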
Contrast: Data Engineering
Aspect | Data Science | Data Engineering
Approach | Scientific (Exploration) | Engineering (Development)
Problems | Unbounded | Bounded
Path to Solution | Iterative, exploratory, nonlinear | Mostly linear
Education | More is better (PhDs common) | BS and/or self-trained
Presentation Skills | Important | Not as important
Research Experience | Important | Not as important
Programming Skills | Not as important | Important
Data Skills | Important | Important
Data Science & Academia
• In the words of Alex Szalay, these sorts of researchers must be "Pi-shaped" as opposed to the more traditional "T-shaped" researcher. In Szalay's view, a classic PhD program generates T-shaped researchers: scientists with wide-but-shallow general knowledge, but deep skill and expertise in one particular area. The new breed of scientific researchers, the data scientists, must be Pi-shaped: that is, they maintain the same wide breadth, but push deeper both in their own subject area and in the statistical or computational methods that help drive modern research.
Data Science & Academia
• In a 2014 post related to a SciFoo discussion on academia and data science, Jake Vanderplas discussed the following questions.
• I encourage you to develop your own thoughts on them and come up with your own assessment:
– Where does Data Science fit within the current structure of the university & research institutions?
– What is it that academic data scientists want from their career? How can academia offer that?
– What drivers might shift academia toward recognizing & rewarding data scientists in domain fields?
– Recognizing that graduates will go on to work in both academia and industry, how do we best prepare them for success in both worlds?
Data Science Applications
• Business
– Summary: From car design to insurance to pizza delivery, businesses are using data science to optimize their operations and better meet their customers’ expectations.
– What is happening? Two-Way Street for the Ford Focus Electric Car; Better Fraud Detection Boosts Customer Satisfaction; E-Commerce Insights: Domino’s Secret Sauce
– What is possible: Using Social Data to Select Successful Retail Locations
• Health Care
– Summary: Tomorrow’s healthcare may look more efficient thanks to things like electronic health records. It also may look a lot more effective. Reduced readmissions, better care, and earlier detection are on the horizon.
– What is happening? Reducing Hospital Readmissions; Better Point-of-Care Decisions
– What is possible: Medical Exams by Bathroom Mirrors
• Urban Living
– Summary: For the first time in human history, more people live in cities than in suburban or rural areas. An emerging field called “urban informatics” combines data science with the unique challenges facing the world’s growing cities.
– What is happening? Taking on Megacity Traffic; Fighting Crime with Data ("predictive policing")
– What is possible: Instrumenting cities
Contrast: Computational Sciences
• Is there a contrast between Data Science and Computational Science?
Data Science: Case Study
Cancer Research
• Cancer is an incredibly complex disease; a single tumor can have more than 100 billion cells, and each cell can acquire mutations individually. The disease is always changing, evolving, and adapting.
• Employ the power of big data analytics and high-performance computing.
• Leverage sophisticated pattern recognition and machine learning algorithms to identify patterns that are potentially linked to cancer.
• Huge amount of data processing and recognition
Data Science: Case Study
Health Care
• Stanford Medicine, Google team up to harness power of data science for health care
• Stanford Medicine will use the power, security and scale of Google Cloud Platform to support precision health and more efficient patient care.
• Analyzing genetic data
• Focusing on precision health
• Data as the engine that drives research
http://med.stanford.edu/news/all-news/2016/08/stanford-medicine-google-team-up-to-harness-power-of-data-science.html
Data Science: Case Study
Elections
• The Obama campaigns in 2008 and 2012 are credited with successful use of social media and data mining.
• Micro-targeting in 2012
– http://www.theatlantic.com/politics/archive/2012/04/the-creepiness-factor-how-obama-and-romney-are-getting-to-know-you/255499/
– http://www.mediabizbloggers.com/group-m/How-Data-and-Micro-Targeting-Won-the-2012-Election-for-Obama---Antony-Young-Mindshare-North-America.html
• Micro-profiles built from multiple sources and accessed by apps; data updated in real time based on door-to-door visits; focused media buys; highly targeted e-mails and Facebook messages.
• 1 million people installed the Obama Facebook app that gave access to info on “friends”.
Data Science: Case Study
Internet of Things (IoT)
• The Internet of Things is rapidly growing. It is predicted that more than 25 billion devices will be connected by 2020.
• The Internet of Things (IoT) will soon produce a massive volume and variety of data at unprecedented velocity. If "Big Data" is the product of the IoT, "Data Science" is its soul.
Data Science: Case Study
Customer Analytics
Essential Points
• Big Data has given rise to Data Science
• Data science is rooted in solid foundations of mathematics and statistics, computer science, and domain knowledge
• Sexy profession – Data Scientists
• Not everything with data or science is Data Science!
• The use cases for Data Science are compelling
Conclusion
In this section you have learned
• What Big Data challenges are
• What exactly Data Science is and what Data Scientists do
• How Data Science contrasts with other disciplines
• Case studies & use cases
Questions?
Thank You!

Más contenido relacionado

La actualidad más candente

A Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesA Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesDr. Amarjeet Singh
 
COM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data MiningCOM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data Miningbutest
 
Griffiths lace workshop-eden-2016
Griffiths lace workshop-eden-2016Griffiths lace workshop-eden-2016
Griffiths lace workshop-eden-2016Dai Griffiths
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data scienceJordan Engbers
 
IBM Watson in Healthcare
IBM Watson in HealthcareIBM Watson in Healthcare
IBM Watson in HealthcareAnders Quitzau
 
Using socioeconomic data in teaching and research
Using socioeconomic data in teaching and researchUsing socioeconomic data in teaching and research
Using socioeconomic data in teaching and researchJackie Carter
 
The NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoTThe NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoTPrasant Misra
 
Sdal air education workforce analytics workshop jan. 7 , 2014.pptx
Sdal air education workforce analytics workshop jan. 7 , 2014.pptxSdal air education workforce analytics workshop jan. 7 , 2014.pptx
Sdal air education workforce analytics workshop jan. 7 , 2014.pptxkimlyman
 
Sdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalSdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalkimlyman
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Joanne Luciano
 
IBM Watson for Healthcare
IBM Watson for HealthcareIBM Watson for Healthcare
IBM Watson for HealthcareRomeo Kienzler
 
University of Virginia School of Data Science
University of Virginia School of Data ScienceUniversity of Virginia School of Data Science
University of Virginia School of Data SciencePhilip Bourne
 
Social Networks and Collaborative Platforms for Data Sharing in Radiology
Social Networks and Collaborative Platforms for Data Sharing in RadiologySocial Networks and Collaborative Platforms for Data Sharing in Radiology
Social Networks and Collaborative Platforms for Data Sharing in RadiologyErik R. Ranschaert, MD, PhD
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data IJCERT JOURNAL
 
Building the Data Science Profession in Europe
Building the Data Science Profession in EuropeBuilding the Data Science Profession in Europe
Building the Data Science Profession in EuropeSteven Miller
 

La actualidad más candente (20)

A Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesA Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: Challenges
 
COM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data MiningCOM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data Mining
 
Griffiths lace workshop-eden-2016
Griffiths lace workshop-eden-2016Griffiths lace workshop-eden-2016
Griffiths lace workshop-eden-2016
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
 
IBM Watson in Healthcare
IBM Watson in HealthcareIBM Watson in Healthcare
IBM Watson in Healthcare
 
Data mining
Data mining Data mining
Data mining
 
Using socioeconomic data in teaching and research
Using socioeconomic data in teaching and researchUsing socioeconomic data in teaching and research
Using socioeconomic data in teaching and research
 
The NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoTThe NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoT
 
Sdal air education workforce analytics workshop jan. 7 , 2014.pptx
Sdal air education workforce analytics workshop jan. 7 , 2014.pptxSdal air education workforce analytics workshop jan. 7 , 2014.pptx
Sdal air education workforce analytics workshop jan. 7 , 2014.pptx
 
Sdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalSdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) final
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020
 
IBM Watson for Healthcare
IBM Watson for HealthcareIBM Watson for Healthcare
IBM Watson for Healthcare
 
Hands-on Introduction to Machine Learning
Hands-on Introduction to Machine LearningHands-on Introduction to Machine Learning
Hands-on Introduction to Machine Learning
 
University of Virginia School of Data Science
University of Virginia School of Data ScienceUniversity of Virginia School of Data Science
University of Virginia School of Data Science
 
Social Networks and Collaborative Platforms for Data Sharing in Radiology
Social Networks and Collaborative Platforms for Data Sharing in RadiologySocial Networks and Collaborative Platforms for Data Sharing in Radiology
Social Networks and Collaborative Platforms for Data Sharing in Radiology
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data
 
50 Years of Data Science
50 Years of Data Science50 Years of Data Science
50 Years of Data Science
 
Building the Data Science Profession in Europe
Building the Data Science Profession in EuropeBuilding the Data Science Profession in Europe
Building the Data Science Profession in Europe
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Big data trends in 2020
Big data trends in 2020Big data trends in 2020
Big data trends in 2020
 

Similar a Data_Science_Applications_&_Use_Cases.pdf

Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptxshalini s
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science LandscapePhilip Bourne
 
Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1sasi
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
dissertation proposal writing service
dissertation proposal writing servicedissertation proposal writing service
dissertation proposal writing servicePhd Assistance
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in EducationPhilip Piety
 
University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design maria chiara pettenati
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
dataminingppt-170616163835.pdf jejwwkwnwnn
dataminingppt-170616163835.pdf jejwwkwnwnndataminingppt-170616163835.pdf jejwwkwnwnn
dataminingppt-170616163835.pdf jejwwkwnwnnjainutkarsh078
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 

Similar a Data_Science_Applications_&_Use_Cases.pdf (20)

Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptx
 
BIG DATA.ppt
BIG DATA.pptBIG DATA.ppt
BIG DATA.ppt
 
BIG-DATAPPTFINAL.ppt
BIG-DATAPPTFINAL.pptBIG-DATAPPTFINAL.ppt
BIG-DATAPPTFINAL.ppt
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1
 
Big data
Big dataBig data
Big data
 
ppt1.pptx
ppt1.pptxppt1.pptx
ppt1.pptx
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
dissertation proposal writing service
dissertation proposal writing servicedissertation proposal writing service
dissertation proposal writing service
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in Education
 
University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
dataminingppt-170616163835.pdf jejwwkwnwnn
dataminingppt-170616163835.pdf jejwwkwnwnndataminingppt-170616163835.pdf jejwwkwnwnn
dataminingppt-170616163835.pdf jejwwkwnwnn
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
DATAIA & TransAlgo
DATAIA & TransAlgoDATAIA & TransAlgo
DATAIA & TransAlgo
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 

Más de vishal choudhary (20)

SE-Lecture1.ppt
SE-Lecture1.pptSE-Lecture1.ppt
SE-Lecture1.ppt
 
SE-Testing.ppt
SE-Testing.pptSE-Testing.ppt
SE-Testing.ppt
 
SE-CyclomaticComplexityand Testing.ppt
SE-CyclomaticComplexityand Testing.pptSE-CyclomaticComplexityand Testing.ppt
SE-CyclomaticComplexityand Testing.ppt
 
SE-Lecture-7.pptx
SE-Lecture-7.pptxSE-Lecture-7.pptx
SE-Lecture-7.pptx
 
Se-Lecture-6.ppt
Se-Lecture-6.pptSe-Lecture-6.ppt
Se-Lecture-6.ppt
 
SE-Lecture-5.pptx
SE-Lecture-5.pptxSE-Lecture-5.pptx
SE-Lecture-5.pptx
 
XML.pptx
XML.pptxXML.pptx
XML.pptx
 
SE-Lecture-8.pptx
SE-Lecture-8.pptxSE-Lecture-8.pptx
SE-Lecture-8.pptx
 
SE-coupling and cohesion.ppt
SE-coupling and cohesion.pptSE-coupling and cohesion.ppt
SE-coupling and cohesion.ppt
 
SE-Lecture-2.pptx
SE-Lecture-2.pptxSE-Lecture-2.pptx
SE-Lecture-2.pptx
 
SE-software design.ppt
SE-software design.pptSE-software design.ppt
SE-software design.ppt
 
SE1.ppt
SE1.pptSE1.ppt
SE1.ppt
 
SE-Lecture-4.pptx
SE-Lecture-4.pptxSE-Lecture-4.pptx
SE-Lecture-4.pptx
 
SE-Lecture=3.pptx
SE-Lecture=3.pptxSE-Lecture=3.pptx
SE-Lecture=3.pptx
 
Multimedia-Lecture-Animation.pptx
Multimedia-Lecture-Animation.pptxMultimedia-Lecture-Animation.pptx
Multimedia-Lecture-Animation.pptx
 
MultimediaLecture5.pptx
MultimediaLecture5.pptxMultimediaLecture5.pptx
MultimediaLecture5.pptx
 
Multimedia-Lecture-7.pptx
Multimedia-Lecture-7.pptxMultimedia-Lecture-7.pptx
Multimedia-Lecture-7.pptx
 
MultiMedia-Lecture-4.pptx
MultiMedia-Lecture-4.pptxMultiMedia-Lecture-4.pptx
MultiMedia-Lecture-4.pptx
 
Multimedia-Lecture-6.pptx
Multimedia-Lecture-6.pptxMultimedia-Lecture-6.pptx
Multimedia-Lecture-6.pptx
 
Multimedia-Lecture-3.pptx
Multimedia-Lecture-3.pptxMultimedia-Lecture-3.pptx
Multimedia-Lecture-3.pptx
 

Último

Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17Celine George
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 

Último (20)

Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Python Notes for mca i year students osmania university.docx

Data_Science_Applications_&_Use_Cases.pdf

  • 1. Data Science Applications & Use Cases. Instructor: Ekpe Okorafor (1. Accenture – Big Data Academy; 2. Computer Science, African University of Science & Technology)
  • 2. Objectives • Understand Big Data challenges • What exactly Data Science is and what Data Scientists do • Data Science contrasted with other disciplines • Case study and use cases
  • 3. Outline • Big Data & Challenges • What is Data Science • Data Science & Academia • Data Science & Others • Case Studies • Essential points • Conclusion
  • 4. Data All Around • Lots of data is being collected and warehoused – Scientific experiments – Internet of Things – Web data, e-commerce – Financial transactions, bank/credit transactions – Online trading and purchasing – Social networks – … many more!
  • 5. Big Data • Big Data are data sets so large or so complex that traditional methods of storing, accessing, and analyzing them are too expensive. However, there is a lot of potential value hidden in this data, so organizations are eager to harness it to drive innovation and competitive advantage. • Big Data technologies and approaches are used to derive value from data-rich environments in ways that traditional analytics tools and methods cannot.
  • 6. What To Do With These Data? • Aggregation and Statistics – Data warehousing and OLAP • Indexing, Searching, and Querying – Keyword-based search – Pattern matching (XML/RDF) • Knowledge discovery – Data Mining – Statistical Modeling • Data Driven – Predictive Analytics – Deep Learning
  • 7. Big Data & Data Science • “… the sexy job in the next 10 years will be statisticians,” Hal Varian, Google Chief Economist • The U.S. will need 140,000-190,000 predictive analysts and 1.5 million managers/analysts by 2018 (McKinsey Global Institute, June 2011) • New Data Science institutes being created or repurposed – NYU, Columbia, Washington, UCB,... • New degree programs, courses, boot-camps: – e.g., at Berkeley: Stats, I-School, CS, Astronomy… – One proposal (elsewhere) for an MS in “Big Data Science” – Plans for a Data Science Stream at AUST – RDA-CODATA School of Research Data Science
  • 8. What is Data Science? • Some definitions link computational, statistical, and substantive expertise.
  • 9. What is Data Science? • Other definitions focus more on technical skills alone.
  • 10. What is Data Science? • An area that manages, manipulates, extracts, and interprets knowledge from tremendous amounts of data • Data science (DS) is a multidisciplinary field of study with the goal of addressing the challenges in big data • Data science principles apply to all data, big and small
  • 11. What is Data Science? • Theories and techniques from many fields and disciplines are used to investigate and analyze a large amount of data to help decision makers in many industries such as science, engineering, economics, politics, finance, and education – Computer Science: pattern recognition, visualization, data warehousing, high-performance computing, databases, AI – Mathematics: mathematical modeling – Statistics: statistical and stochastic modeling, probability
  • 12. Data Science vs. Analysis vs. Software Delivery
      – Tools
          • Traditional Analysis: SAS, R, Excel, SQL, in-house tools
          • Traditional Software Delivery: Java, source control, Linux, continuous integration, unit testing, bug reports and project management
          • Data Science: R, Java, scientific Python libraries, Excel, SQL, Hadoop, Hive, Pig, Mahout and other machine learning libraries, GitHub for source control and issue management
      – Analytical Methods
          • Traditional Analysis: Regressions, classifications, measuring prediction accuracy and coverage/error, sampling
          • Traditional Software Delivery: N/A
          • Data Science: Classification, clustering, similarity detection, recommenders, unsupervised and supervised learning, small- and large-scale computations, measuring prediction accuracy and coverage/error
      – Team Structure
          • Traditional Analysis: Statisticians, Mathematicians, Scientists
          • Traditional Software Delivery: Developers, Project Managers, Systems Engineers
          • Data Science: Mathematicians, Statisticians, Scientists, Developers, Systems Engineers
      – Time Frame
          • Traditional Analysis: Either usually on-going research and discovery within a team in the organization, or a specific project to determine answers
          • Traditional Software Delivery: Regular software release cycle, continuous delivery, etc.
          • Data Science: Either a discovery/learning phase leading to product development, or on-going research and product invention/improvement
  • 13. Contrast: Scientific Computing
      – Scientific Modeling
          • Physics-based models
          • Problem-structured
          • Mostly deterministic, precise
          • Run on a supercomputer or high-end computing cluster
      – Data-Driven Approach
          • General inference engine replaces model
          • Structure not related to problem
          • Statistical models handle true randomness and un-modeled complexity
          • Run on cheaper computer clusters (EC2)
      – [Figure: general-purpose classifier labeling images as "Supernova" or "Not". Credit: Nugent group / C3 LBL]
  • 14. Contrast: Machine Learning
      – Machine Learning
          • Develop new (individual) models
          • Prove mathematical properties of models
          • Improve/validate on a few, relatively clean, small datasets
          • Publish a paper
      – Data Science
          • Explore many models, build and tune hybrids
          • Understand empirical properties of models
          • Develop/use tools that can handle massive datasets
          • Take action!
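  • 15. Contrast: Data Engineering
      – Approach: Data Science is scientific (exploration); Data Engineering is engineering (development)
      – Problems: Data Science, unbounded; Data Engineering, bounded
      – Path to Solution: Data Science, iterative, exploratory, nonlinear; Data Engineering, mostly linear
      – Education: Data Science, more is better (PhDs common); Data Engineering, BS and/or self-trained
      – Presentation Skills: Data Science, important; Data Engineering, not as important
      – Research Experience: Data Science, important; Data Engineering, not as important
      – Programming Skills: Data Science, not as important; Data Engineering, important
      – Data Skills: Data Science, important; Data Engineering, important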
  • 16. Data Science & Academia • In the words of Alex Szalay, these sorts of researchers must be "Pi-shaped" as opposed to the more traditional "T-shaped" researcher. In Szalay's view, a classic PhD program generates T-shaped researchers: scientists with wide-but-shallow general knowledge, but deep skill and expertise in one particular area. The new breed of scientific researchers, the data scientists, must be Pi-shaped: that is, they maintain the same wide breadth, but push deeper both in their own subject area and in the statistical or computational methods that help drive modern research.
  • 17. Data Science & Academia • A 2014 post by Jake Vanderplas, reporting on a SciFoo discussion about academia and data science, raised the questions below. • I encourage you to develop your own thoughts on them and come up with your own assessment – Where does Data Science fit within the current structure of the university & research institutions? – What is it that academic data scientists want from their career? How can academia offer that? – What drivers might shift academia toward recognizing & rewarding data scientists in domain fields? – Recognizing that graduates will go on to work in both academia and industry, how do we best prepare them for success in both worlds?
  • 18. Data Science Applications
      – Summary
          • Business: From car design to insurance to pizza delivery, businesses are using data science to optimize their operations and better meet their customers’ expectations.
          • Health Care: Tomorrow’s healthcare may look more efficient thanks to things like electronic health records. It also may look a lot more effective. Reduced readmissions, better care, and earlier detection are on the horizon.
          • Urban Living: For the first time in human history, more people live in cities than in suburban or rural areas. An emerging field called “urban informatics” combines data science with the unique challenges facing the world’s growing cities.
      – What is happening?
          • Business: Two-Way Street for the Ford Focus Electric Car; Better Fraud Detection Boosts Customer Satisfaction; E-Commerce Insights: Domino’s Secret Sauce
          • Health Care: Reducing Hospital Readmissions; Better Point-of-Care Decisions
          • Urban Living: Taking on Megacity Traffic; Fighting Crime with Data ("predictive policing")
      – What is possible
          • Business: Using Social Data to Select Successful Retail Locations
          • Health Care: Medical Exams by Bathroom Mirrors
          • Urban Living: Instrumenting cities
  • 19. Contrast: Computational Sciences • Is there a contrast between Data Science and Computational Science?
  • 20. Data Science: Case Study Cancer Research • Cancer is an incredibly complex disease; a single tumor can have more than 100 billion cells, and each cell can acquire mutations individually. The disease is always changing, evolving, and adapting. • Employ the power of big data analytics and high-performance computing. • Leverage sophisticated pattern-recognition and machine learning algorithms to identify patterns that are potentially linked to cancer • Huge amount of data processing and recognition
  • 21. Data Science: Case Study Health Care • Stanford Medicine, Google team up to harness power of data science for health care • Stanford Medicine will use the power, security and scale of Google Cloud Platform to support precision health and more efficient patient care. • Analyzing genetic data • Focusing on precision health • Data as the engine that drives research • Source: http://med.stanford.edu/news/all-news/2016/08/stanford-medicine-google-team-up-to-harness-power-of-data-science.html
  • 22. Data Science: Case Study Elections • The Obama campaigns in 2008 and 2012 are credited with successful use of social media and data mining. • Micro-targeting in 2012 – http://www.theatlantic.com/politics/archive/2012/04/the-creepiness-factor-how-obama-and-romney-are-getting-to-know-you/255499/ – http://www.mediabizbloggers.com/group-m/How-Data-and-Micro-Targeting-Won-the-2012-Election-for-Obama---Antony-Young-Mindshare-North-America.html • Micro-profiles built from multiple sources and accessed by apps; data updated in real time based on door-to-door visits; focused media buys; highly targeted e-mails and Facebook messages. • 1 million people installed the Obama Facebook app that gave access to info on “friends”.
  • 23. Data Science: Case Study Internet of Things (IoT) • The Internet of Things is growing rapidly. It is predicted that more than 25 billion devices will be connected by 2020. • The Internet of Things (IoT) will soon produce a massive volume and variety of data at unprecedented velocity. If "Big Data" is the product of the IoT, "Data Science" is its soul.
  • 24. Data Science: Case Study Customer Analytics
  • 25. Essential Points • Big Data has given rise to Data Science • Data science is rooted in solid foundations of mathematics and statistics, computer science, and domain knowledge • Sexy profession: Data Scientists • Not everything with data or science is Data Science! • The use cases for Data Science are compelling
  • 26. Conclusion. In this section you have learned: • What Big Data challenges are • What exactly Data Science is and what Data Scientists do • How Data Science contrasts with other disciplines • Case study and use cases