SlideShare una empresa de Scribd logo
1 de 48
Descargar para leer sin conexión
Fickr: Nikos Koutoulas
Big Data & Career Paths
Marcos Colebrook
Univ. de La Laguna
@MColebrook
ETS Ingeniería Informática – 16.06.2014#BigDataCanarias
Contents
Big Data facts
Definition of Big Data
Techs & Tools
Data Science: skills and career
paths
Conclusions
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 2
Big Data everywhere!!
16.06.2014 3#BigDataCanarias: "Big Data & Career Paths"
Data vs. God
“In God we trust, all others
bring data.“
 W.E. Deming
16.06.2014 4#BigDataCanarias: "Big Data & Career Paths"
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 5
Source: M. Deutscher, When Will the World Reach 8 Zetabytes of Stored Data? (2012).
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 6
Source: Intel (2014), What Happens In An Internet Minute?
Big Data in Facebook
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 7
Google trends on Big Data
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 8
Hadoop
Big Data
Data
Analytics
Massive Data
Father to the ‘Big Data’ term
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 9
Source: S. Lohr (2013), The Origins of ‘Big Data’: An Etymological Detective Story.
John R. Mashey
Chief Scientist at Silicon Graphics
Big Data: think-tank Policy Exchange
Big Data: datasets that are too
awkward to work with using traditional,
hands-on database management tools.
Big Data Analytics: the process of
examining and interrogating big data
assets to derive insights of value for
decision making.
16.06.2014 10#BigDataCanarias: "Big Data & Career Paths"
Source: C. Yiu (2012), The Big Data Opportunity.
What is Big Data?
Big Data is a term that describes
large volumes of high velocity,
complex and variable data that
require advanced techniques and
technologies to enable the capture,
storage, distribution, management,
and analysis of the information.
16.06.2014 11#BigDataCanarias: "Big Data & Career Paths"
Source: Demystifying Big Data (2012), TechAmerica Foundation.
Big Data
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 12
Source: J. Bloem et al. (2012), VINT Research Report 1: Creating Clarity with Big Data.
Sources & types of data
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 13
Source: Big Data, BBVA Innovation Edge 2013 (from Booz & Company “Benefitting from Big Data: Leveraging Unstructured
Data Capabilities for Competitive Advantage”)
Big Data sources
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 14
Source: M. Schroeck et al. (2012), Analytics: The Real-World Use of Big Data.
The three Vs of Big Data
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 15
Source: D. Soubra (2012), The 3Vs that define Big Data.
The other “Vs” in Big Data
“ ’Vs’ like veracity,
validity, value,
viability, etc. are
aspirational qualities
of all data, not
definitional qualities of
Big Data.”
 Doug Laney
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 16
Source: D. Laney (2013), Batman on Big Data.
What is really important in Big Data?
“The Big in Big Data relates to
importance not size”
 Rafael Irizarry
16.06.2014 17#BigDataCanarias: "Big Data & Career Paths"
Source: R. Irizarry (2014), The Big in Big Data relates to importance not size.
My best “V”
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 18
Is Big Data a marketing campaign?
“If you’re like me, the mere mention of Big Data now
turns your stomach.
Nearly every business intelligence (BI) vendor,
publication, and event has Big Data flashing in neon
colors in Times Square dimensions.
Never before have I seen an idea in the BI space elicit
this much obsession. Why all the fuss? Why, indeed.
Essentially, Big Data is a marketing campaign, pure
and simple.”
 Stephen Few
16.06.2014 19#BigDataCanarias: "Big Data & Career Paths"
Gartner's 2013 Hype Cycle
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 20
Source: Gartner's 2013 Hype Cycle for Emerging Technologies
Big Data: McKinsey Report
 140.000 – 190.000 more deep analytical talent positions,
and 1.5 million data savvy managers needed to take full
advantage of Big Data in the USA.
 Techniques: data mining (cluster analysis, classification,
regression, etc), (un)supervised learning, ML, neural
networks, optimization, predictive modeling, statistics,
simulation, etc.
 Technologies: BI, Cassandra, DW, ETL, Hadoop, HBase,
Map/Reduce, R, RDBMS, etc.
 Potential of Big Data in five domains:
 Healthcare
 Public Sector
 Retail
 Manufacturing
 Telecommunications.
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 21
Source: J. Manyika, et al. (2012), Big Data: The Next Frontier for Innovation, Competition and Productivity.
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 22
Hadoop-NoSQL Market Forecast
2012-2017
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 23
Source: J. Kelly (2013), Hadoop-NoSQL Software And Services Market Forecast 2012-2017.
Big Data Techs
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 24
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 25
Data Tools
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 26
Source: J. King, R. Magoulas (2013), Data Science Salary Survey.
Salary vs. Data Tools
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 27
Source: J. King, R. Magoulas (2013), Data Science Salary Survey.
Median Salary vs. #Tools
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 28
Source: J. King, R. Magoulas (2013), Data Science Salary Survey.
Data Skills
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 29
Source: H.D. Harris et al. (2013), Analyzing the Analyzers
Data Role vs. Data Skills
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 30
Source: H.D. Harris et al. (2013), Analyzing the Analyzers
Big Data capabilities
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 31
Source: M. Schroeck et al. (2012), Analytics: The Real-World Use of Big Dat.
Market & jobs opportunity
 The demand for Big Data services spending
projected to reach $132,300M in 2015.
 By 2015, Big Data demand will reach 4.4 million
jobs globally, but only one-third of those jobs will
be filled.
 The demand for services will generate 550,000
external services jobs in the next 3 years.
 Another 40,000 jobs will be created at software
vendors in the next 3 years.
16.06.2014 32#BigDataCanarias: "Big Data & Career Paths"
Source: Big Data, BBVA Innovation Edge 2013 (from Gartner’s “Top Technology Predictions for 2013 and Beyond”)
Statiscian: a sexy job
“I keep saying the sexy job in the next ten years will be
statisticians.
People think I’m joking, but who would’ve guessed that
computer engineers would’ve been the sexy job of the
1990s?
The ability to take data—to be able to understand it, to
process it, to extract value from it, to visualize it, to
communicate it—that’s going to be a hugely
important skill in the next decades [...]”
 Hal Varian
Google’s Chief Economist
16.06.2014 33#BigDataCanarias: "Big Data & Career Paths"
Source: Hal Varian on how the Web challenges managers, McKinsey & Co. 2009.
Data Scientist
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 34
Source: Josh Wills (2012).
Data Science Venn Diagram
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 35
Source: Drew Conway (2010).
Data Scientist skill set: ACM
A data scientist requires an integrated
skill set spanning mathematics,
machine learning, artificial
intelligence, statistics, databases, and
optimization, along with a deep
understanding of the craft of problem
formulation to engineer effective
solutions.
16.06.2014 36#BigDataCanarias: "Big Data & Career Paths"
Source: V. Dhar (2013), Data Science and Prediction, Comm. of the ACM.
Intelligence over DIKW
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 37
Source: The Internet of Things 2010 at YouTube (1:40).
Data→Info→Knowledge→Understanding
→Wisdom!!
“There are known knowns.
These are things we know that
we know.
There are known unknowns.
That is to say, there are things
that we know we don't know.
But there are also unknown
unknowns. There are things we
don't know we don't know.”
 Donald Rumsfeld
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 38
Source: C. Somohano (2013), Big Data [sorry] & Data Science: What Does a Data Scientist Do?
BI vs. Data Discovery
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 39
Source: J. Kolb (2010), The New Reality for Business Intelligence and Big Data.
Data Science Teams
Data scientists as having the following qualities:
 Technical expertise: the best data scientists typically have
deep expertise in some scientific discipline.
 Curiosity: a desire to go beneath the surface and discover
and distill a problem down into a very clear set of
hypotheses that can be tested.
 Storytelling: the ability to use data to tell a story and to be
able to communicate it effectively.
 Cleverness: the ability to look at a problem in different,
creative ways.
16.06.2014 40#BigDataCanarias: "Big Data & Career Paths"
Source: D.J. Patil (2011), Building Data Science Team.
Data Science skills: Accenture
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 41
Source: J.G. Harris et al. (2013), The Team Solution to the Data Scientist Shortage.
Insight Data Science Fellow Program
 6 week, full-time, postdoctoral
data science training fellowship
in Silicon Valley or New York City.
 Self-directed, project-based
learning (no classes!).
 Software Engineering Best
Practices: Python, Git, Flask,
Javascript.
 Storing and Retrieving Data:
MySQL, Hadoop, Hive.
 Statistical Analysis & Machine
Learning: NumPy & SciPy,
Pandas, scikit-learn, R.
 Visualizing and
Communicating Results: D3
Javascript library, visualization
and presentation best practices.
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 42
Insight Data Engineering Fellow
Program
 6 week, full-time,
professional data
engineering training
fellowship in Silicon Valley,
California.
 Self-directed, project-based
learning (no classes!).
 Big Data Infrastructure.
 Extracting data.
 Transforming data.
 Loading / Storing data.
 Building visualizations
and dashboards.
16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 43
Conclusions
 Big Data is still an emerging topic that gathers a lot of new
technologies, and needs some time to mature.
 But, on the other hand, it has a true market opportunity.
 Data Science / Engineering skills to acquire:
 Math/Statistics and business knowledge.
 Technical expertise: R, Python, Hadoop, Spark/Storm, D3,
Java/Javascript, ...
 Curiosity and cleverness.
 Storytelling: ability to communicate results.
 Trends:
 Data Visualization
 Predictive Modelling
 Social Analytics
 Data Mining / Machine Learning
 Forensic Computer Science
 Spark / Storm vs. Hadoop MapReduce
16.06.2014 44#BigDataCanarias: "Big Data & Career Paths"
References (1/3)
1. Big Data (2013), BBVA Innovation Edge (31 pp).
2. Demystifying Big Data: A Practical Guide To Transforming The Business
of Government (2012), TechAmerica Foundation (40 pp).
3. Gartner's 2013 Hype Cycle for Emerging Technologies Maps Out
Evolving Relationship Between Humans and Machines (2013), Gartner.
4. Hal Varian on How the Web Challenges Managers (2009), McKinsey &
Co.
5. Insight Data Engineering Fellows Program (2014).
6. Insight Data Science Fellows Program (2014).
7. The Internet of Things (2010), IBM Social Media.
8. What Happens In An Internet Minute? (2014), Intel.
16.06.2014 45#BigDataCanarias: "Big Data & Career Paths"
References (2/3)
9. J. Bloem, M. van Doorn, S. Duivestein, T. van Manen, E. van Ommeren (2012), VINT Research Report
1: Creating Clarity with Big Data, SOGETI.
10. D. Conway (2010), The Data Science Venn Diagram.
11. M. Deutscher, When Will the World Reach 8 Zetabytes of Stored Data? (2012), Silicon Angle (blog).
12. V. Dhar (2013), Data Science and Prediction, Communications of the ACM 56 (12), pp. 64-73.
13. S. Few (2012), Big Data, Big Ruse, Perceptual Edge - Visual Business Intelligence Newsletter (blog,
8 pp).
14. H.D. Harris, S.P. Murphy, M. Vaisman (2013), Analyzing the Analyzers, O’Reilly Media (40 pp).
15. J.G. Harris, N. Shetterley, A.E. Alter, K. Schnell (2013), The Team Solution to the Data Scientist
Shortage, Accenture Institute for High Performance.
16. R. Irizarry (2014), The Big in Big Data Relates to Importance Not Size, Simply Statistics (blog).
17. J. King, R. Magoulas (2013), Data Science Salary Survey, O’Reilly Media (23 pp).
18. J. Kelly (2013), Hadoop-NoSQL Software and Services Market Forecast 2012-2017, Wikibon (blog).
19. J. Kolb (2010), The New Reality for Business Intelligence and Big Data, Applied Data Labs (blog).
20. D. Laney (2013), Batman on Big Data, Gartner.
16.06.2014 46#BigDataCanarias: "Big Data & Career Paths"
References (3/3)
21. D. Laney (2013), Batman on Big Data, Gartner.
22. S. Lohr (2013), The Origins of ‘Big Data’: An Etymological Detective Story, The New York Times.
23. J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, A.H. Byers (2012), Big Data: The Next
Frontier for Innovation, Competition and Productivity, McKinsey Global Institute (156 pp).
24. R. Nair, A. Narayanan (2012), Benefitting from Big Data: Leveraging Unstructured Data Capabilities
for Competitive Advantage, Booz & Company (16 pp).
25. D.J. Patil (2011), Building Data Science Teams, O’Reilly Media (26 pp).
26. G. Piatetsky (2014), Big Data Landscape v3.0 Analyzed, KDnuggets (blog).
27. J. Podesta, P. Pritzker, E.J. Moniz, J. Holdren, J. Zients (2014), Big Data: Seizing Opportunities,
Preserving Values, The White House (79 pp).
28. M. Schroeck, R. Shockley, J. Smart, D. Romero-Morales, P. Tufano (2012), Analytics: The Real-World
Use of Big Data, IBM Global Services.
29. C. Somohano (2013), Big Data [sorry] & Data Science: What Does a Data Scientist Do?, Data Science
London (55 pp).
30. D. Soubra (2012), The 3Vs that define Big Data, Data Science Central (blog).
31. C. Yiu, The Big Data Opportunity (2012), Policy Exchange (36 pp).
32. P. Zikopoulos, C. Eaton, D. deRoos, T. Deutsch, G. Lapis (2012), Understanding Big Data, McGraw-Hill.
16.06.2014 47#BigDataCanarias: "Big Data & Career Paths"
Datos de contacto y cuestiones
¡¡Gracias!!
¿Preguntas?
 Datos de contacto:
 Marcos Colebrook
 Email: mcolesan@ull.edu.es
 Twitter: @MColebrook
 SlideShare: www.slideshare.net/MarcosColebrookSantamaria
16.06.2014 48#BigDataCanarias: "Big Data & Career Paths"

Más contenido relacionado

La actualidad más candente

Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data TrendsIMC Institute
 
Big Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation SlideBig Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation SlideSlideTeam
 
Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewSivashankar Ganapathy
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesBen Siscovick
 
Introduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIntroduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIMC Institute
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache HadoopSuman Saurabh
 
Big Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesBig Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesCRISIL Limited
 
10 Most Effective Big Data Technologies
10 Most Effective Big Data Technologies10 Most Effective Big Data Technologies
10 Most Effective Big Data TechnologiesMahindra Comviva
 
Big data analysis using map/reduce
Big data analysis using map/reduceBig data analysis using map/reduce
Big data analysis using map/reduceRenuSuren
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Sujit Ghosh
 
Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data DATAVERSITY
 
Big Data Career Path | Big Data Learning Path | Hadoop Tutorial | Edureka
Big Data Career Path | Big Data Learning Path | Hadoop Tutorial | EdurekaBig Data Career Path | Big Data Learning Path | Hadoop Tutorial | Edureka
Big Data Career Path | Big Data Learning Path | Hadoop Tutorial | EdurekaEdureka!
 

La actualidad más candente (20)

Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data Trends
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015
 
The promise and challenge of Big Data
The promise and challenge of Big DataThe promise and challenge of Big Data
The promise and challenge of Big Data
 
What is big data?
What is big data?What is big data?
What is big data?
 
Big Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation SlideBig Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation Slide
 
Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies Overview
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA Ventures
 
Research paper on big data and hadoop
Research paper on big data and hadoopResearch paper on big data and hadoop
Research paper on big data and hadoop
 
Introduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIntroduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data Science
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache Hadoop
 
Big Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesBig Data’s Big Impact on Businesses
Big Data’s Big Impact on Businesses
 
10 Most Effective Big Data Technologies
10 Most Effective Big Data Technologies10 Most Effective Big Data Technologies
10 Most Effective Big Data Technologies
 
Big data analysis using map/reduce
Big data analysis using map/reduceBig data analysis using map/reduce
Big data analysis using map/reduce
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013
 
Big data 101
Big data 101Big data 101
Big data 101
 
Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data
 
Big Data Career Path | Big Data Learning Path | Hadoop Tutorial | Edureka
Big Data Career Path | Big Data Learning Path | Hadoop Tutorial | EdurekaBig Data Career Path | Big Data Learning Path | Hadoop Tutorial | Edureka
Big Data Career Path | Big Data Learning Path | Hadoop Tutorial | Edureka
 
Big data
Big dataBig data
Big data
 
Our big data
Our big dataOur big data
Our big data
 

Destacado

Wave Hackathon Intro
Wave Hackathon IntroWave Hackathon Intro
Wave Hackathon IntroJose Quesada
 
R for the semantic web, Quesada useR 2009
R for the semantic web, Quesada useR 2009R for the semantic web, Quesada useR 2009
R for the semantic web, Quesada useR 2009Jose Quesada
 
A quick overview of the available reference managers2010
A quick overview of the available reference managers2010A quick overview of the available reference managers2010
A quick overview of the available reference managers2010Jose Quesada
 
Irmles2010 Random indexing spaces to bridge the human and data webs
Irmles2010 Random indexing spaces to bridge the human and data websIrmles2010 Random indexing spaces to bridge the human and data webs
Irmles2010 Random indexing spaces to bridge the human and data websJose Quesada
 
Data science-retreat-how it works plus advice for upcoming data scientists
Data science-retreat-how it works plus advice for upcoming data scientistsData science-retreat-how it works plus advice for upcoming data scientists
Data science-retreat-how it works plus advice for upcoming data scientistsJose Quesada
 
Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014Jeff Kelly
 
Hortonworks Big Data Career Paths and Training
Hortonworks Big Data Career Paths and Training Hortonworks Big Data Career Paths and Training
Hortonworks Big Data Career Paths and Training Aengus Rooney
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseJeff Kelly
 
Steps towards a Data Value Chain
Steps towards a Data Value ChainSteps towards a Data Value Chain
Steps towards a Data Value ChainPRELIDA Project
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataMohammed Guller
 
Big data & data science challenges and opportunities
Big data & data science   challenges and opportunitiesBig data & data science   challenges and opportunities
Big data & data science challenges and opportunitiesJose Quesada
 
Future of data science as a profession
Future of data science as a professionFuture of data science as a profession
Future of data science as a professionJose Quesada
 
Becoming a Data Driven Organisation
Becoming a Data Driven OrganisationBecoming a Data Driven Organisation
Becoming a Data Driven OrganisationWizdee
 
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)Yoon Sup Choi
 
"Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about...
"Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about..."Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about...
"Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about...Kai Wähner
 
Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Johan Himberg
 
How to reach a Data Driven culture
How to reach a Data Driven cultureHow to reach a Data Driven culture
How to reach a Data Driven cultureMark Beekman
 
Ppt shapes
Ppt shapesPpt shapes
Ppt shapesNag S
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13Rei Lynn Hayashi
 
Big Data Industry Insights 2015
Big Data Industry Insights 2015 Big Data Industry Insights 2015
Big Data Industry Insights 2015 Den Reymer
 

Destacado (20)

Wave Hackathon Intro
Wave Hackathon IntroWave Hackathon Intro
Wave Hackathon Intro
 
R for the semantic web, Quesada useR 2009
R for the semantic web, Quesada useR 2009R for the semantic web, Quesada useR 2009
R for the semantic web, Quesada useR 2009
 
A quick overview of the available reference managers2010
A quick overview of the available reference managers2010A quick overview of the available reference managers2010
A quick overview of the available reference managers2010
 
Irmles2010 Random indexing spaces to bridge the human and data webs
Irmles2010 Random indexing spaces to bridge the human and data websIrmles2010 Random indexing spaces to bridge the human and data webs
Irmles2010 Random indexing spaces to bridge the human and data webs
 
Data science-retreat-how it works plus advice for upcoming data scientists
Data science-retreat-how it works plus advice for upcoming data scientistsData science-retreat-how it works plus advice for upcoming data scientists
Data science-retreat-how it works plus advice for upcoming data scientists
 
Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014
 
Hortonworks Big Data Career Paths and Training
Hortonworks Big Data Career Paths and Training Hortonworks Big Data Career Paths and Training
Hortonworks Big Data Career Paths and Training
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
 
Steps towards a Data Value Chain
Steps towards a Data Value ChainSteps towards a Data Value Chain
Steps towards a Data Value Chain
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data & data science challenges and opportunities
Big data & data science   challenges and opportunitiesBig data & data science   challenges and opportunities
Big data & data science challenges and opportunities
 
Future of data science as a profession
Future of data science as a professionFuture of data science as a profession
Future of data science as a profession
 
Becoming a Data Driven Organisation
Becoming a Data Driven OrganisationBecoming a Data Driven Organisation
Becoming a Data Driven Organisation
 
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)
 
"Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about...
"Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about..."Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about...
"Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about...
 
Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture
 
How to reach a Data Driven culture
How to reach a Data Driven cultureHow to reach a Data Driven culture
How to reach a Data Driven culture
 
Ppt shapes
Ppt shapesPpt shapes
Ppt shapes
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13
 
Big Data Industry Insights 2015
Big Data Industry Insights 2015 Big Data Industry Insights 2015
Big Data Industry Insights 2015
 

Similar a Big Data Career Paths and Skills

Data Scientist - Good Rebels -
Data Scientist - Good Rebels -Data Scientist - Good Rebels -
Data Scientist - Good Rebels -Good Rebels
 
Big Data v. Small data - Rules to thumb for 2015
Big Data v. Small data - Rules to thumb for 2015Big Data v. Small data - Rules to thumb for 2015
Big Data v. Small data - Rules to thumb for 2015Visart
 
BigData & Supply Chain: A "Small" Introduction
BigData & Supply Chain: A "Small" IntroductionBigData & Supply Chain: A "Small" Introduction
BigData & Supply Chain: A "Small" IntroductionIvan Gruer
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
 
CPA ONE 2016 - Big data: big decisions or big fallacy
CPA ONE 2016 - Big data: big decisions or big fallacyCPA ONE 2016 - Big data: big decisions or big fallacy
CPA ONE 2016 - Big data: big decisions or big fallacyLaurie Desautels
 
Data science market insights usa
Data science market insights usaData science market insights usa
Data science market insights usaKaitlin McAndrews
 
Big Data, Republicans and 2016
Big Data, Republicans and 2016Big Data, Republicans and 2016
Big Data, Republicans and 2016steveparkhurst
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
 
Big data Career Opportunuties
Big data  Career OpportunutiesBig data  Career Opportunuties
Big data Career OpportunutiesDevashish Mishra
 
Intro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS ResearchIntro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS ResearchRobert Goodspeed
 
Big data characteristics, value chain and challenges
Big data characteristics, value chain and challengesBig data characteristics, value chain and challenges
Big data characteristics, value chain and challengesMusfiqur Rahman
 
BigData-Challenges.pptx
BigData-Challenges.pptxBigData-Challenges.pptx
BigData-Challenges.pptxamanyosama12
 
Deck 92-146 (3)
Deck 92-146 (3)Deck 92-146 (3)
Deck 92-146 (3)Thinkful
 
Using Graphs to Enable National-Scale Analytics
Using Graphs to Enable National-Scale AnalyticsUsing Graphs to Enable National-Scale Analytics
Using Graphs to Enable National-Scale AnalyticsNeo4j
 
BIG DATA ANALYTICS.pptx
BIG DATA ANALYTICS.pptxBIG DATA ANALYTICS.pptx
BIG DATA ANALYTICS.pptxVikas Bhowate
 
Big Data and HR - Talk @SwissHR Congress
Big Data and HR - Talk @SwissHR CongressBig Data and HR - Talk @SwissHR Congress
Big Data and HR - Talk @SwissHR CongressMarcel Blattner, PhD
 

Similar a Big Data Career Paths and Skills (20)

Data Scientist - Good Rebels -
Data Scientist - Good Rebels -Data Scientist - Good Rebels -
Data Scientist - Good Rebels -
 
Big Data v. Small data - Rules to thumb for 2015
Big Data v. Small data - Rules to thumb for 2015Big Data v. Small data - Rules to thumb for 2015
Big Data v. Small data - Rules to thumb for 2015
 
BigData & Supply Chain: A "Small" Introduction
BigData & Supply Chain: A "Small" IntroductionBigData & Supply Chain: A "Small" Introduction
BigData & Supply Chain: A "Small" Introduction
 
Big Data RF
Big Data RFBig Data RF
Big Data RF
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressed
 
CPA ONE 2016 - Big data: big decisions or big fallacy
CPA ONE 2016 - Big data: big decisions or big fallacyCPA ONE 2016 - Big data: big decisions or big fallacy
CPA ONE 2016 - Big data: big decisions or big fallacy
 
Data science market insights usa
Data science market insights usaData science market insights usa
Data science market insights usa
 
Big Data, Republicans and 2016
Big Data, Republicans and 2016Big Data, Republicans and 2016
Big Data, Republicans and 2016
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressed
 
Data Scientist
Data ScientistData Scientist
Data Scientist
 
The 25 Predictions About The Future Of Big Data
The 25 Predictions About The Future Of Big DataThe 25 Predictions About The Future Of Big Data
The 25 Predictions About The Future Of Big Data
 
Big data Career Opportunuties
Big data  Career OpportunutiesBig data  Career Opportunuties
Big data Career Opportunuties
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Intro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS ResearchIntro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS Research
 
Big data characteristics, value chain and challenges
Big data characteristics, value chain and challengesBig data characteristics, value chain and challenges
Big data characteristics, value chain and challenges
 
BigData-Challenges.pptx
BigData-Challenges.pptxBigData-Challenges.pptx
BigData-Challenges.pptx
 
Deck 92-146 (3)
Deck 92-146 (3)Deck 92-146 (3)
Deck 92-146 (3)
 
Using Graphs to Enable National-Scale Analytics
Using Graphs to Enable National-Scale AnalyticsUsing Graphs to Enable National-Scale Analytics
Using Graphs to Enable National-Scale Analytics
 
BIG DATA ANALYTICS.pptx
BIG DATA ANALYTICS.pptxBIG DATA ANALYTICS.pptx
BIG DATA ANALYTICS.pptx
 
Big Data and HR - Talk @SwissHR Congress
Big Data and HR - Talk @SwissHR CongressBig Data and HR - Talk @SwissHR Congress
Big Data and HR - Talk @SwissHR Congress
 

Último

QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxAna-Maria Mihalceanu
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...BookNet Canada
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFMichael Gough
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...itnewsafrica
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentMahmoud Rabie
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...amber724300
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessWSO2
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 

Último (20)

QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance Toolbox
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDF
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career Development
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with Platformless
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 

Big Data Career Paths and Skills

  • 1. Fickr: Nikos Koutoulas Big Data & Career Paths Marcos Colebrook Univ. de La Laguna @MColebrook ETS Ingeniería Informática – 16.06.2014#BigDataCanarias
  • 2. Contents Big Data facts Definition of Big Data Techs & Tools Data Science: skills and career paths Conclusions 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 2
  • 3. Big Data everywhere!! 16.06.2014 3#BigDataCanarias: "Big Data & Career Paths"
  • 4. Data vs. God “In God we trust, all others bring data.“  W.E. Deming 16.06.2014 4#BigDataCanarias: "Big Data & Career Paths"
  • 5. 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 5 Source: M. Deutscher, When Will the World Reach 8 Zetabytes of Stored Data? (2012).
  • 6. 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 6 Source: Intel (2014), What Happens In An Internet Minute?
  • 7. Big Data in Facebook 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 7
  • 8. Google trends on Big Data 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 8 Hadoop Big Data Data Analytics Massive Data
  • 9. Father to the ‘Big Data’ term 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 9 Source: S. Lohr (2013), The Origins of ‘Big Data’: An Etymological Detective Story. John R. Mashey Chief Scientist at Silicon Graphics
  • 10. Big Data: think-tank Policy Exchange Big Data: datasets that are too awkward to work with using traditional, hands-on database management tools. Big Data Analytics: the process of examining and interrogating big data assets to derive insights of value for decision making. 16.06.2014 10#BigDataCanarias: "Big Data & Career Paths" Source: C. Yiu (2012), The Big Data Opportunity.
  • 11. What is Big Data? Big Data is a term that describes large volumes of high velocity, complex and variable data that require advanced techniques and technologies to enable the capture, storage, distribution, management, and analysis of the information. 16.06.2014 11#BigDataCanarias: "Big Data & Career Paths" Source: Demystifying Big Data (2012), TechAmerica Foundation.
  • 12. Big Data 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 12 Source: J. Bloem et al. (2012), VINT Research Report 1: Creating Clarity with Big Data.
  • 13. Sources & types of data 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 13 Source: Big Data, BBVA Innovation Edge 2013 (from Booz & Company “Benefitting from Big Data: Leveraging Unstructured Data Capabilities for Competitive Advantage”)
  • 14. Big Data sources 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 14 Source: M. Schroeck et al. (2012), Analytics: The Real-World Use of Big Data.
  • 15. The three Vs of Big Data 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 15 Source: D. Soubra (2012), The 3Vs that define Big Data.
  • 16. The other “Vs” in Big Data “ ’Vs’ like veracity, validity, value, viability, etc. are aspirational qualities of all data, not definitional qualities of Big Data.”  Doug Laney 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 16 Source: D. Laney (2013), Batman on Big Data.
  • 17. What is really important in Big Data? “The Big in Big Data relates to importance not size”  Rafael Irizarry 16.06.2014 17#BigDataCanarias: "Big Data & Career Paths" Source: R. Irizarry (2014), The Big in Big Data relates to importance not size.
  • 18. My best “V” 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 18
  • 19. Is Big Data a marketing campaign? “If you’re like me, the mere mention of Big Data now turns your stomach. Nearly every business intelligence (BI) vendor, publication, and event has Big Data flashing in neon colors in Times Square dimensions. Never before have I seen an idea in the BI space elicit this much obsession. Why all the fuss? Why, indeed. Essentially, Big Data is a marketing campaign, pure and simple.”  Stephen Few 16.06.2014 19#BigDataCanarias: "Big Data & Career Paths"
  • 20. Gartner's 2013 Hype Cycle 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 20 Source: Gartner's 2013 Hype Cycle for Emerging Technologies
  • 21. Big Data: McKinsey Report  140.000 – 190.000 more deep analytical talent positions, and 1.5 million data savvy managers needed to take full advantage of Big Data in the USA.  Techniques: data mining (cluster analysis, classification, regression, etc), (un)supervised learning, ML, neural networks, optimization, predictive modeling, statistics, simulation, etc.  Technologies: BI, Cassandra, DW, ETL, Hadoop, HBase, Map/Reduce, R, RDBMS, etc.  Potential of Big Data in five domains:  Healthcare  Public Sector  Retail  Manufacturing  Telecommunications. 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 21 Source: J. Manyika, et al. (2012), Big Data: The Next Frontier for Innovation, Competition and Productivity.
  • 22. 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 22
  • 23. Hadoop-NoSQL Market Forecast 2012-2017 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 23 Source: J. Kelly (2013), Hadoop-NoSQL Software And Services Market Forecast 2012-2017.
  • 24. Big Data Techs 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 24
  • 25. 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 25
  • 26. Data Tools 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 26 Source: J. King, R. Magoulas (2013), Data Science Salary Survey.
  • 27. Salary vs. Data Tools 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 27 Source: J. King, R. Magoulas (2013), Data Science Salary Survey.
  • 28. Median Salary vs. #Tools 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 28 Source: J. King, R. Magoulas (2013), Data Science Salary Survey.
  • 29. Data Skills 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 29 Source: H.D. Harris et al. (2013), Analyzing the Analyzers
  • 30. Data Role vs. Data Skills 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 30 Source: H.D. Harris et al. (2013), Analyzing the Analyzers
  • 31. Big Data capabilities 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 31 Source: M. Schroeck et al. (2012), Analytics: The Real-World Use of Big Dat.
  • 32. Market & jobs opportunity  The demand for Big Data services spending projected to reach $132,300M in 2015.  By 2015, Big Data demand will reach 4.4 million jobs globally, but only one-third of those jobs will be filled.  The demand for services will generate 550,000 external services jobs in the next 3 years.  Another 40,000 jobs will be created at software vendors in the next 3 years. 16.06.2014 32#BigDataCanarias: "Big Data & Career Paths" Source: Big Data, BBVA Innovation Edge 2013 (from Gartner’s “Top Technology Predictions for 2013 and Beyond”)
  • 33. Statiscian: a sexy job “I keep saying the sexy job in the next ten years will be statisticians. People think I’m joking, but who would’ve guessed that computer engineers would’ve been the sexy job of the 1990s? The ability to take data—to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it—that’s going to be a hugely important skill in the next decades [...]”  Hal Varian Google’s Chief Economist 16.06.2014 33#BigDataCanarias: "Big Data & Career Paths" Source: Hal Varian on how the Web challenges managers, McKinsey & Co. 2009.
  • 34. Data Scientist 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 34 Source: Josh Wills (2012).
  • 35. Data Science Venn Diagram 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 35 Source: Drew Conway (2010).
  • 36. Data Scientist skill set: ACM A data scientist requires an integrated skill set spanning mathematics, machine learning, artificial intelligence, statistics, databases, and optimization, along with a deep understanding of the craft of problem formulation to engineer effective solutions. 16.06.2014 36#BigDataCanarias: "Big Data & Career Paths" Source: V. Dhar (2013), Data Science and Prediction, Comm. of the ACM.
  • 37. Intelligence over DIKW 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 37 Source: The Internet of Things 2010 at YouTube (1:40).
  • 38. Data→Info→Knowledge→Understanding →Wisdom!! “There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are also unknown unknowns. There are things we don't know we don't know.”  Donald Rumsfeld 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 38 Source: C. Somohano (2013), Big Data [sorry] & Data Science: What Does a Data Scientist Do?
  • 39. BI vs. Data Discovery 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 39 Source: J. Kolb (2010), The New Reality for Business Intelligence and Big Data.
  • 40. Data Science Teams Data scientists as having the following qualities:  Technical expertise: the best data scientists typically have deep expertise in some scientific discipline.  Curiosity: a desire to go beneath the surface and discover and distill a problem down into a very clear set of hypotheses that can be tested.  Storytelling: the ability to use data to tell a story and to be able to communicate it effectively.  Cleverness: the ability to look at a problem in different, creative ways. 16.06.2014 40#BigDataCanarias: "Big Data & Career Paths" Source: D.J. Patil (2011), Building Data Science Team.
  • 41. Data Science skills: Accenture 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 41 Source: J.G. Harris et al. (2013), The Team Solution to the Data Scientist Shortage.
  • 42. Insight Data Science Fellow Program  6 week, full-time, postdoctoral data science training fellowship in Silicon Valley or New York City.  Self-directed, project-based learning (no classes!).  Software Engineering Best Practices: Python, Git, Flask, Javascript.  Storing and Retrieving Data: MySQL, Hadoop, Hive.  Statistical Analysis & Machine Learning: NumPy & SciPy, Pandas, scikit-learn, R.  Visualizing and Communicating Results: D3 Javascript library, visualization and presentation best practices. 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 42
  • 43. Insight Data Engineering Fellow Program  6 week, full-time, professional data engineering training fellowship in Silicon Valley, California.  Self-directed, project-based learning (no classes!).  Big Data Infrastructure.  Extracting data.  Transforming data.  Loading / Storing data.  Building visualizations and dashboards. 16.06.2014 #BigDataCanarias: "Big Data & Career Paths" 43
  • 44. Conclusions  Big Data is still an emerging topic that gathers a lot of new technologies, and needs some time to mature.  But, on the other hand, it has a true market opportunity.  Data Science / Engineering skills to acquire:  Math/Statistics and business knowledge.  Technical expertise: R, Python, Hadoop, Spark/Storm, D3, Java/Javascript, ...  Curiosity and cleverness.  Storytelling: ability to communicate results.  Trends:  Data Visualization  Predictive Modelling  Social Analytics  Data Mining / Machine Learning  Forensic Computer Science  Spark / Storm vs. Hadoop MapReduce 16.06.2014 44#BigDataCanarias: "Big Data & Career Paths"
  • 45. References (1/3) 1. Big Data (2013), BBVA Innovation Edge (31 pp). 2. Demystifying Big Data: A Practical Guide To Transforming The Business of Government (2012), TechAmerica Foundation (40 pp). 3. Gartner's 2013 Hype Cycle for Emerging Technologies Maps Out Evolving Relationship Between Humans and Machines (2013), Gartner. 4. Hal Varian on How the Web Challenges Managers (2009), McKinsey & Co. 5. Insight Data Engineering Fellows Program (2014). 6. Insight Data Science Fellows Program (2014). 7. The Internet of Things (2010), IBM Social Media. 8. What Happens In An Internet Minute? (2014), Intel. 16.06.2014 45#BigDataCanarias: "Big Data & Career Paths"
  • 46. References (2/3) 9. J. Bloem, M. van Doorn, S. Duivestein, T. van Manen, E. van Ommeren (2012), VINT Research Report 1: Creating Clarity with Big Data, SOGETI. 10. D. Conway (2010), The Data Science Venn Diagram. 11. M. Deutscher, When Will the World Reach 8 Zetabytes of Stored Data? (2012), Silicon Angle (blog). 12. V. Dhar (2013), Data Science and Prediction, Communications of the ACM 56 (12), pp. 64-73. 13. S. Few (2012), Big Data, Big Ruse, Perceptual Edge - Visual Business Intelligence Newsletter (blog, 8 pp). 14. H.D. Harris, S.P. Murphy, M. Vaisman (2013), Analyzing the Analyzers, O’Reilly Media (40 pp). 15. J.G. Harris, N. Shetterley, A.E. Alter, K. Schnell (2013), The Team Solution to the Data Scientist Shortage, Accenture Institute for High Performance. 16. R. Irizarry (2014), The Big in Big Data Relates to Importance Not Size, Simply Statistics (blog). 17. J. King, R. Magoulas (2013), Data Science Salary Survey, O’Reilly Media (23 pp). 18. J. Kelly (2013), Hadoop-NoSQL Software and Services Market Forecast 2012-2017, Wikibon (blog). 19. J. Kolb (2010), The New Reality for Business Intelligence and Big Data, Applied Data Labs (blog). 20. D. Laney (2013), Batman on Big Data, Gartner. 16.06.2014 46#BigDataCanarias: "Big Data & Career Paths"
  • 47. References (3/3) 21. D. Laney (2013), Batman on Big Data, Gartner. 22. S. Lohr (2013), The Origins of ‘Big Data’: An Etymological Detective Story, The New York Times. 23. J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, A.H. Byers (2012), Big Data: The Next Frontier for Innovation, Competition and Productivity, McKinsey Global Institute (156 pp). 24. R. Nair, A. Narayanan (2012), Benefitting from Big Data: Leveraging Unstructured Data Capabilities for Competitive Advantage, Booz & Company (16 pp). 25. D.J. Patil (2011), Building Data Science Teams, O’Reilly Media (26 pp). 26. G. Piatetsky (2014), Big Data Landscape v3.0 Analyzed, KDnuggets (blog). 27. J. Podesta, P. Pritzker, E.J. Moniz, J. Holdren, J. Zients (2014), Big Data: Seizing Opportunities, Preserving Values, The White House (79 pp). 28. M. Schroeck, R. Shockley, J. Smart, D. Romero-Morales, P. Tufano (2012), Analytics: The Real-World Use of Big Data, IBM Global Services. 29. C. Somohano (2013), Big Data [sorry] & Data Science: What Does a Data Scientist Do?, Data Science London (55 pp). 30. D. Soubra (2012), The 3Vs that define Big Data, Data Science Central (blog). 31. C. Yiu, The Big Data Opportunity (2012), Policy Exchange (36 pp). 32. P. Zikopoulos, C. Eaton, D. deRoos, T. Deutsch, G. Lapis (2012), Understanding Big Data, McGraw-Hill. 16.06.2014 47#BigDataCanarias: "Big Data & Career Paths"
  • 48. Datos de contacto y cuestiones ¡¡Gracias!! ¿Preguntas?  Datos de contacto:  Marcos Colebrook  Email: mcolesan@ull.edu.es  Twitter: @MColebrook  SlideShare: www.slideshare.net/MarcosColebrookSantamaria 16.06.2014 48#BigDataCanarias: "Big Data & Career Paths"