SlideShare una empresa de Scribd logo
1 de 28
CAP4770/5771
Introduction to Data Science
Fall 2015
University of Florida, CISE Department
Prof. Daisy Zhe Wang
Based on notes from CS194 at UC Berkeley by Michael
Franklin, John Canny, and Jeff Hammerbacher
Data Science Overview
Why, Where, What,
How, Who
Outline
• Data Science -- Why all the excitement?
– history
– examples
• Where does data come from?
• What is Data Science?
• How to do Data Science?
• Who are Data Scientists?
3
Data Science – Why all the excitement?
4
Data Analysis Has Been Around for a
While…
R.A. Fisher
Howard
Dresner
Peter Luhn
W.E.
Deming
Data Science: Why all the Excitement?
6
Exciting new effective
applications of data analytics
e.g.,
Google Flu Trends:
Detecting outbreaks
two weeks ahead
of CDC data
New models are estimating
which cities are most at risk
for spread of the Ebola virus.
Prediction model is built on
Various data sources,
types and analysis.
Why the all the Excitement?
7
Predicting political
champagne and election
Outcome:
Data and Election 2012 (cont.)
• …that was just one of several ways that Mr. Obama’s
campaign operations, some unnoticed by Mr. Romney’s aides
in Boston, helped save the president’s candidacy. In Chicago,
the campaign recruited a team of behavioral scientists to build
an extraordinarily sophisticated database
• …that allowed the Obama campaign not only to alter the very
nature of the electorate, making it younger and less white,
but also to create a portrait of shifting voter allegiances. The
power of this operation stunned Mr. Romney’s aides on
election night, as they saw voters they never even knew
existed turn out in places like Osceola County, Fla.
-- New York Times, Wed Nov 7, 2012
• The White House Names Dr. DJ Patil as the First U.S. Chief
Data Scientist, Feb. 18th 2015
8
A history of the (Business)
Internet: 1997
PageRank: The web as a behavioral dataset
Sponsored search
Sponsored search
• Google revenue around $50 bn/year from marketing,
97% of the companies revenue.
• Sponsored search uses an auction – a pure competition
for marketers trying to win access to consumers.
• In other words, a competition for models of consumers
– their likelihood of responding to the ad – and of
determining the right bid for the item.
• There are around 30 billion search requests a month.
Perhaps a trillion events of history between search
providers.
• Google Adwords and Adsense
Sponsored search
Other Data Science Applications
• Transaction Databases  Recommender systems
(NetFlix), Fraud Detection (Security and Privacy)
• Wireless Sensor Data  Smart Home, Real-time
Monitoring, Internet of Things
• Text Data, Social Media Data  Product Review and
Consumer Satisfaction (Facebook, Twitter, LinkedIn), E-
discovery
• Software Log Data  Automatic Trouble Shooting
(Splunk)
• Genotype and Phenotype Data  Epic, 23andme, Patient-
Centered Care, Personalized Medicine
Where does data come from?
15
“Big Data” Sources
Every:
Click
Ad impression
Billing event
Fast Forward, pause,…
Server request
Transaction
Network message
Fault
…
User Generated (Web &
Mobile)
…
..
Internet of Things /
M2M
Health/Scientific
Computing
It’s All Happening On-
line
“Data is the New Oil”
– World Economic Forum 2011
5 Vs of Big Data
• Raw Data: Volume
• Change over time: Velocity
• Data types: Variety
• Data Quality: Veracity
• Information for Decision Making: Value
What can you do with the data?
Traffic Prediction and Earthquake Warning
19
Crowdsourcing + physical modeling + sensing + data assimilation
to produce:
From Alex Bayen, UCB, Director, Institute for Transportation Studies
What is Data Science?
20
“Data Science” an Emerging Field
21
O’Reilly Radar report, 2011
Data Science – A Definition
Data Science is the science which uses
computer science, statistics and machine
learning, visualization and human-
computer interactions to collect, clean,
integrate, analyze, visualize, interact with
data to create data products.
22
Goal of Data Science
Turn data into data products.
Some recent ML
Competitions at
https://www.kaggle.c
om/
NIST Pre-Pilot Data
Science Evaluation –
likely to be
incorporated to be
part of Labs/Final
project
Data Science – A Visual Definition
Contrast: Databases
Databases Data Science
Data Value “Precious” “Cheap”
Data Volume Modest Massive
Examples Bank records,
Personnel records,
Census,
Medical records
Online clicks,
GPS logs,
Tweets,
Building sensor readings
Priorities Consistency,
Error recovery,
Auditability
Speed,
Availability,
Query richness
Structured Strongly (Schema) Weakly or none (Text)
Properties Transactions, ACID* CAP* theorem (2/3),
eventual consistency
Realizations SQL NoSQL:
MongoDB, CouchDB,
Hbase, Cassandra, Riak,
Memcached,
Apache River, …
ACID = Atomicity, Consistency, Isolation and Durability
CAP = Consistency, Availability, Partition
Tolerance
Contrast: Business Intelligence
Business
Intelligence
Data Science
Querying the past Querying the past
present and future
Contrast: Machine Learning
Data Science
Explore many models, build and
tune hybrids
Understand empirical properties of
models
Develop/use tools that can handle
massive datasets
Take action!
Machine Learning
Develop new (individual)
models
Prove mathematical properties
of models
Improve/validate on a few,
relatively clean, small datasets
Publish a paper

Más contenido relacionado

Similar a 1. Data Science overview - part1.pptx

Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
Thinkful
 

Similar a 1. Data Science overview - part1.pptx (20)

data, big data, open data
data, big data, open datadata, big data, open data
data, big data, open data
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Data Mining With Big Data
Data Mining With Big DataData Mining With Big Data
Data Mining With Big Data
 
Big data and development
Big data and developmentBig data and development
Big data and development
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Lecture 01-1-IIS.pptx
Lecture 01-1-IIS.pptxLecture 01-1-IIS.pptx
Lecture 01-1-IIS.pptx
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal Ball
 
mineria de datos
mineria de datosmineria de datos
mineria de datos
 
mineria datos
mineria datosmineria datos
mineria datos
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS TaipeiBehavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
 
Data_Mining.ppt
Data_Mining.pptData_Mining.ppt
Data_Mining.ppt
 
Big Data World
Big Data WorldBig Data World
Big Data World
 
The art and science of data-driven journalism
The art and science of data-driven journalism The art and science of data-driven journalism
The art and science of data-driven journalism
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
The REAL Impact of Big Data on Privacy
The REAL Impact of Big Data on PrivacyThe REAL Impact of Big Data on Privacy
The REAL Impact of Big Data on Privacy
 
Big Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical DevicesBig Data in Healthcare and Medical Devices
Big Data in Healthcare and Medical Devices
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptx
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 

Último

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 

Último (20)

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 

1. Data Science overview - part1.pptx

  • 1. CAP4770/5771 Introduction to Data Science Fall 2015 University of Florida, CISE Department Prof. Daisy Zhe Wang Based on notes from CS194 at UC Berkeley by Michael Franklin, John Canny, and Jeff Hammerbacher
  • 2. Data Science Overview Why, Where, What, How, Who
  • 3. Outline • Data Science -- Why all the excitement? – history – examples • Where does data come from? • What is Data Science? • How to do Data Science? • Who are Data Scientists? 3
  • 4. Data Science – Why all the excitement? 4
  • 5. Data Analysis Has Been Around for a While… R.A. Fisher Howard Dresner Peter Luhn W.E. Deming
  • 6. Data Science: Why all the Excitement? 6 Exciting new effective applications of data analytics e.g., Google Flu Trends: Detecting outbreaks two weeks ahead of CDC data New models are estimating which cities are most at risk for spread of the Ebola virus. Prediction model is built on Various data sources, types and analysis.
  • 7. Why the all the Excitement? 7 Predicting political champagne and election Outcome:
  • 8. Data and Election 2012 (cont.) • …that was just one of several ways that Mr. Obama’s campaign operations, some unnoticed by Mr. Romney’s aides in Boston, helped save the president’s candidacy. In Chicago, the campaign recruited a team of behavioral scientists to build an extraordinarily sophisticated database • …that allowed the Obama campaign not only to alter the very nature of the electorate, making it younger and less white, but also to create a portrait of shifting voter allegiances. The power of this operation stunned Mr. Romney’s aides on election night, as they saw voters they never even knew existed turn out in places like Osceola County, Fla. -- New York Times, Wed Nov 7, 2012 • The White House Names Dr. DJ Patil as the First U.S. Chief Data Scientist, Feb. 18th 2015 8
  • 9. A history of the (Business) Internet: 1997
  • 10. PageRank: The web as a behavioral dataset
  • 12. Sponsored search • Google revenue around $50 bn/year from marketing, 97% of the companies revenue. • Sponsored search uses an auction – a pure competition for marketers trying to win access to consumers. • In other words, a competition for models of consumers – their likelihood of responding to the ad – and of determining the right bid for the item. • There are around 30 billion search requests a month. Perhaps a trillion events of history between search providers. • Google Adwords and Adsense
  • 14. Other Data Science Applications • Transaction Databases  Recommender systems (NetFlix), Fraud Detection (Security and Privacy) • Wireless Sensor Data  Smart Home, Real-time Monitoring, Internet of Things • Text Data, Social Media Data  Product Review and Consumer Satisfaction (Facebook, Twitter, LinkedIn), E- discovery • Software Log Data  Automatic Trouble Shooting (Splunk) • Genotype and Phenotype Data  Epic, 23andme, Patient- Centered Care, Personalized Medicine
  • 15. Where does data come from? 15
  • 16. “Big Data” Sources Every: Click Ad impression Billing event Fast Forward, pause,… Server request Transaction Network message Fault … User Generated (Web & Mobile) … .. Internet of Things / M2M Health/Scientific Computing It’s All Happening On- line
  • 17. “Data is the New Oil” – World Economic Forum 2011
  • 18. 5 Vs of Big Data • Raw Data: Volume • Change over time: Velocity • Data types: Variety • Data Quality: Veracity • Information for Decision Making: Value
  • 19. What can you do with the data? Traffic Prediction and Earthquake Warning 19 Crowdsourcing + physical modeling + sensing + data assimilation to produce: From Alex Bayen, UCB, Director, Institute for Transportation Studies
  • 20. What is Data Science? 20
  • 21. “Data Science” an Emerging Field 21 O’Reilly Radar report, 2011
  • 22. Data Science – A Definition Data Science is the science which uses computer science, statistics and machine learning, visualization and human- computer interactions to collect, clean, integrate, analyze, visualize, interact with data to create data products. 22
  • 23. Goal of Data Science Turn data into data products.
  • 24. Some recent ML Competitions at https://www.kaggle.c om/ NIST Pre-Pilot Data Science Evaluation – likely to be incorporated to be part of Labs/Final project
  • 25. Data Science – A Visual Definition
  • 26. Contrast: Databases Databases Data Science Data Value “Precious” “Cheap” Data Volume Modest Massive Examples Bank records, Personnel records, Census, Medical records Online clicks, GPS logs, Tweets, Building sensor readings Priorities Consistency, Error recovery, Auditability Speed, Availability, Query richness Structured Strongly (Schema) Weakly or none (Text) Properties Transactions, ACID* CAP* theorem (2/3), eventual consistency Realizations SQL NoSQL: MongoDB, CouchDB, Hbase, Cassandra, Riak, Memcached, Apache River, … ACID = Atomicity, Consistency, Isolation and Durability CAP = Consistency, Availability, Partition Tolerance
  • 27. Contrast: Business Intelligence Business Intelligence Data Science Querying the past Querying the past present and future
  • 28. Contrast: Machine Learning Data Science Explore many models, build and tune hybrids Understand empirical properties of models Develop/use tools that can handle massive datasets Take action! Machine Learning Develop new (individual) models Prove mathematical properties of models Improve/validate on a few, relatively clean, small datasets Publish a paper