SlideShare una empresa de Scribd logo
1 de 44
Investigating Data Scientists
Bob. E. Hayes, PhD
bob@businessoverbroadway.com
@bobehayes
2Copyright © 2015 Business Over Broadway and AnalyticsWeek
Who I am
Scientist /
Blogger
Author
Consultant
3Copyright © 2015 Business Over Broadway and AnalyticsWeek
Business Over Broadway
Solve problems, primarily business problems,
through the use of the scientific method
Founder
Using data and analytics to help make decisions that
are based on fact, not hyperbole.
AnalyticsWeek
Help businesses optimize their data/analytics
Chief Research Officer
Improving talent/technology recruitment, facilitating
deeper community engagement with the power of
online/offline channels and advancing knowledge
through research
What I do
4Copyright © 2015 Business Over Broadway and AnalyticsWeek
What is the role of a data scientist?
Quotes from the article: What is the role of a data scientist in the insight economy?
5Copyright © 2015 Business Over Broadway and AnalyticsWeek
Data Science Study
• Invited data professionals via:
– AnalyticsWeek Newsletter
– Blog post
– Analytics.Club
• 600+ completed surveys
– Self-assessment rating of proficiency of 25 skills across
five skill areas:
• Business, Technology, Programming, Math & Modeling,
Statistics
– 9 additional questions
– Overall satisfaction with work outcome
6Copyright © 2015 Business Over Broadway and AnalyticsWeek
Data Science Skills Assessed
Area Skills*
Business
1.Product design and development
2.Project management
3.Business development
4.Budgeting
5.Governance & Compliance (e.g., security)
Technology
6.Managing unstructured data (e.g., noSQL)
7.Managing structured data (e.g., SQL, JSON, XML)
8.Natural Language Processing (NLP) and text mining
9.Machine Learning (e.g., decision trees, neural nets, Support Vector Machine, clustering)
10.Big and Distributed Data (e.g., Hadoop, Map/Reduce, Spark)
Math &
Modeling
11.Optimization (e.g., linear, integer, convex, global)
12.Math (e.g., linear algebra, real analysis, calculus)
13.Graphical Models (e.g., social networks)
14.Algorithms (e.g., computational complexity, Computer Science theory) and Simulations (e.g., discrete, agent-based, continuous)
15.Bayesian Statistics (e.g., Markov Chain Monte Carlo)
Programming
16.Systems Administration (e.g., UNIX) and Design
17.Database Administration (MySQL, NOSQL)
18.Cloud Management
19.Back-End Programming (e.g., JAVA/Rails/Objective C)
20.Front-End Programming (e.g., JavaScript, HTML, CSS)
Statistics
21.Data Management (e.g., recoding, de-duplicating, Integrating disparate data sources, Web scraping)
22.Data Mining (e.g. R, Python, SPSS, SAS) and Visualization (e.g., graphics, mapping, web-based data visualization) tools
23.Statistics and statistical modeling (e.g., general linear model, ANOVA, MANOVA, Spatio-temporal, Geographical Information System (GIS))
24.Science/Scientific Method (e.g., experimental design, research design)
25.Communication (e.g., sharing results, writing/publishing, presentations, blogging)
* List of skills adapted from Analyzing the Analyzers by Harlan D. Harris, Sean Patrick Murphy and Marck Vaisman
7Copyright © 2015 Business Over Broadway and AnalyticsWeek
Proficiency Ratings*
Proficiency
Level
Scale
Value
Description
Don't know 0 You possess no knowledge
Fundamental
Awareness
20
You have a common knowledge or an understanding of basic
techniques and concepts.
Novice 40
You have the level of experience gained in a classroom and/or
experimental scenarios or as a trainee on-the-job. You are expected
to need help when performing this skill.
Intermediate 60
You are able to successfully complete tasks in this competency as
requested. Help from an expert may be required from time to time, but
you can usually perform the skill independently.
Advanced 80
You can perform the actions associated with this skill without
assistance. You are certainly recognized within your immediate
organization as "a person to ask" when difficult questions arise
regarding this skill.
Expert 100
You are known as an expert in this area. You can provide guidance,
troubleshoot and answer questions related to this area of expertise
and the field where the skill is used.
* Rating scale is based on a proficiency rating scale used by NIH. Respondent instructions: You will be asked about your
proficiency for a variety of skills. Please use the following scale when indicating your level of proficiency for each skill.
8Copyright © 2015 Business Over Broadway and AnalyticsWeek
Sample
9Copyright © 2015 Business Over Broadway and AnalyticsWeek
Proficiency varies across skills
10Copyright © 2016 Business Over Broadway and AnalyticsWeek
Job Roles in Data Science
*Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person,
entrepreneur); Creative (e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
11Copyright © 2015 Business Over Broadway and AnalyticsWeek
Proficiency varies by job role – 1st Look
12Copyright © 2016 Business Over Broadway and AnalyticsWeek
Proficiency varies by job role – 2nd Look
13Copyright © 2016 Business Over Broadway and AnalyticsWeek
Structure of Data Science Skills
* Factor analysis is based on proficiency ratings of 621 data professionals. Reliability (Cronbach’s alpha for each of the
three Skills areas (based on items that loaded on the respective factors) were: .87 (Business); .92 (Tech / Prog); .92
(Math / Stats)
Factor Analysis of Data Skills
• Data reduction technique
• Examines the statistical relationships (e.g.,
correlations) among a large set of
variables and tries to explain these
correlations using a smaller number of
variables (factors)
• Elements (or factor loadings) of the factor
pattern matrix represent the strength of
relationship between the variables and
each of the underlying factors
• Tells us two things:
1. number of underlying factors that
describe the initial set of variables
2. which variables are best
represented by each factor
14Copyright © 2016 Business Over Broadway and AnalyticsWeek
Structure of Data Science Skills
* Factor analysis is based on proficiency ratings of 621 data professionals. Reliability (Cronbach’s alpha for each of the
three Skills areas (based on items that loaded on the respective factors) were: .87 (Business); .92 (Tech / Prog); .92
(Math / Stats)
Plot the factor loadings
for the 25 data skills into
a 3-dimensional space
Three Distinct Skill Sets
• Business
• Technology / Programming
• Math / Statistics
16Copyright © 2016 Business Over Broadway and AnalyticsWeek
17Copyright © 2016 Business Over Broadway and AnalyticsWeek
The Structure of Data Science Skills
18Copyright © 2016 Business Over Broadway and AnalyticsWeek
Proficiency varies by job role – 3rd
Look
19Copyright © 2016 Business Over Broadway and AnalyticsWeek
Proficiency varies by job role – 4th
Look
*Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person,
entrepreneur) n = 86; Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
20Copyright © 2016 Business Over Broadway and AnalyticsWeek
Proficiency varies by job role – 4th
Look
*Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person,
entrepreneur) n = 86; Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
21Copyright © 2016 Business Over Broadway and AnalyticsWeek
Top Data Science Skills by Job Role
22Copyright © 2016 Business Over Broadway and AnalyticsWeek
Top Data Science Skills by Job Role
23Copyright © 2016 Business Over Broadway and AnalyticsWeek
Satisfaction with Work Outcome
*Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person,
entrepreneur); Creative (e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
24Copyright © 2016 Business Over Broadway and AnalyticsWeek
In Search of the Data Scientist
Unicorn
25Copyright © 2016 Business Over Broadway and AnalyticsWeek
Data Science as a Team Sport
Impact of Business Expert
26Copyright © 2016 Business Over Broadway and AnalyticsWeek
Data Science as a Team Sport
Impact of Technology / Programming Expert
27Copyright © 2016 Business Over Broadway and AnalyticsWeek
Data Science as a Team Sport
Impact of Math & Modeling / Statistics Expert
28Copyright © 2016 Business Over Broadway and AnalyticsWeek
Getting Insight from Data: The Scientific Method
29Copyright © 2016 Business Over Broadway and AnalyticsWeek
Scientific Method and Data Science Skills
30Copyright © 2016 Business Over Broadway and AnalyticsWeek
Skill Proficiency Drives Satisfaction with Work
Outcome
31Copyright © 2016 Business Over Broadway and AnalyticsWeek
What skills are linked to project
success?
32Copyright © 2016 Business Over Broadway and AnalyticsWeek
Importance of Data Science Skills by Job
Role
33Copyright © 2016 Business Over Broadway and AnalyticsWeek
Impact of Data Mining and Viz Tools
34Copyright © 2015 Business Over Broadway and AnalyticsWeek
EXPERT
PROFICIENCY
NO
PROFICIENCY
Skill-Based Approach to Improve the Practice of Data Science
IMPROVE /
INVEST
DIVEST
DON’T
OVERINVEST
STAY THE
COURSE
SKILL ESSENTIAL TO
PROJECT OUTCOME
SKILL NOT ESSENTIAL
TO PROJECT OUTCOMECopyright 2015 Business Over Broadway
35Copyright © 2015 Business Over Broadway and AnalyticsWeek-.10
.00
.10
.20
.30
.40
.50
.60
0 20 40 60 80 100
Business Management
EXPERTNONE
Skill-Based Method to Improve Data Science
Data Mining and
Viz Tools
Big and distributed data
Business Management
Science / Scientific Method
Statistics / Statistical Modeling
Machine Learning
Bayesian Statistics
Unstructured data
Optimization
Algorithms
NOT ESSENTIAL
ESSENTIAL
Copyright 2015 Business Over Broadway
36Copyright © 2015 Business Over Broadway and AnalyticsWeek
EXPERTNONE
Skill-Based Method to Improve Data Science
Systems Administration
Developer
Data Mining and Viz Tools
NOT ESSENTIAL
ESSENTIAL
Copyright 2015 Business Over Broadway
37Copyright © 2015 Business Over Broadway and AnalyticsWeek
EXPERTNONE
Skill-Based Method to Improve Data Science
Business Development
Creative
Optimization
Copyright 2015 Business Over Broadway
Math
Data Mining
and Viz Tools
Graphical Models
NOT ESSENTIAL
ESSENTIAL
38Copyright © 2015 Business Over Broadway and AnalyticsWeek
ImpactonSatisfactionwithOutcome
Proficiency
EXPERTNONE
NOT ESSENTIAL
ESSENTIAL
Skill-Based Method to Improve Data Science
Big and distributed data
Bayesian Statistics
Researcher
Data Management
Machine Learning
AlgorithmsProduct Design
Copyright 2015 Business Over Broadway
39Copyright © 2016 Business Over Broadway and AnalyticsWeek
Lack of Gender Diversity
40Copyright © 2016 Business Over Broadway and AnalyticsWeek
Lack of Gender Diversity – Other Science
Roles
41Copyright © 2016 Business Over Broadway and AnalyticsWeek
Job Roles in Data Science by Gender
42Copyright © 2016 Business Over Broadway and AnalyticsWeek
Highest Level of Education Attained
43Copyright © 2016 Business Over Broadway and AnalyticsWeek
Gender Comparison of Proficiency across Skills
44Copyright © 2016 Business Over Broadway and AnalyticsWeek
Advice for Data Scientists
• Take the Data Science Skills Survey at: http://pxl.me/awrds3
• Be specific when talking about “data scientists”
• There are different types – defined by what they do and the skills
they possess
• Work with other data professionals who have
complementary skills. Teamwork is key to success.
• Learn to use data mining and visualization tools
• R, Python, SPSS, SAS, graphics, mapping, web-based data
visualization
• Be an advocate for women in the field of data science
45Copyright © 2015 Business Over Broadway and AnalyticsWeek
Bob E. Hayes, Ph.D.
Email: bob@analyticsweek.com and bob@businessoverbroadway.com
Web: www.analyticsweek.com and www.businessoverbroadway.com
Blog: www.businessoverbroadway.com/blog
Twitter: www.twitter.com/bobehayes

Más contenido relacionado

Similar a Investigating data scientists

Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...DATAVERSITY
 
Corey Sykes' Resume
Corey Sykes' ResumeCorey Sykes' Resume
Corey Sykes' ResumeCorey Sykes
 
Certified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilCertified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilDataMites
 
Certified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilCertified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilDataMites
 
Data Analytics Course in Coimbatore-April
Data Analytics Course in Coimbatore-AprilData Analytics Course in Coimbatore-April
Data Analytics Course in Coimbatore-AprilDataMites
 
Certified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilCertified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilDataMites
 
Certified Data Analyst Course in Chennai-April
Certified Data Analyst Course in Chennai-AprilCertified Data Analyst Course in Chennai-April
Certified Data Analyst Course in Chennai-AprilDataMites
 
VALUENEX Singapore Seminar: Our Services (and Case Study)
VALUENEX Singapore Seminar: Our Services (and Case Study)VALUENEX Singapore Seminar: Our Services (and Case Study)
VALUENEX Singapore Seminar: Our Services (and Case Study)VALUENEX
 
Certified Data Analyst Course in Pune-April
Certified Data Analyst Course in Pune-AprilCertified Data Analyst Course in Pune-April
Certified Data Analyst Course in Pune-AprilDataMites
 
Certified Data Analytics Training in Pune-April
Certified Data Analytics Training in Pune-AprilCertified Data Analytics Training in Pune-April
Certified Data Analytics Training in Pune-AprilDataMites
 
How to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistHow to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistInside Analysis
 
Resume vivek mohan - Data & Analytics Chief Architect
Resume vivek mohan - Data & Analytics Chief ArchitectResume vivek mohan - Data & Analytics Chief Architect
Resume vivek mohan - Data & Analytics Chief ArchitectVivek Mohan
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformVMware Tanzu
 
Business Applications of Predictive Modeling at Scale
Business Applications of Predictive Modeling at ScaleBusiness Applications of Predictive Modeling at Scale
Business Applications of Predictive Modeling at ScaleSongtao Guo
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data ScienceAjay Ohri
 
How Data Science Can Grow Your Business?
How Data Science Can Grow Your Business?How Data Science Can Grow Your Business?
How Data Science Can Grow Your Business?Noam Cohen
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy Hussain Sultan
 
Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16Andy Lathrop
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at ScaleWalid Mehanna
 

Similar a Investigating data scientists (20)

Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
 
Corey Sykes' Resume
Corey Sykes' ResumeCorey Sykes' Resume
Corey Sykes' Resume
 
Certified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilCertified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-April
 
Certified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilCertified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-April
 
Data Analytics Course in Coimbatore-April
Data Analytics Course in Coimbatore-AprilData Analytics Course in Coimbatore-April
Data Analytics Course in Coimbatore-April
 
Certified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-AprilCertified Data Analytics Course in Chennai-April
Certified Data Analytics Course in Chennai-April
 
Certified Data Analyst Course in Chennai-April
Certified Data Analyst Course in Chennai-AprilCertified Data Analyst Course in Chennai-April
Certified Data Analyst Course in Chennai-April
 
VALUENEX Singapore Seminar: Our Services (and Case Study)
VALUENEX Singapore Seminar: Our Services (and Case Study)VALUENEX Singapore Seminar: Our Services (and Case Study)
VALUENEX Singapore Seminar: Our Services (and Case Study)
 
Certified Data Analyst Course in Pune-April
Certified Data Analyst Course in Pune-AprilCertified Data Analyst Course in Pune-April
Certified Data Analyst Course in Pune-April
 
Certified Data Analytics Training in Pune-April
Certified Data Analytics Training in Pune-AprilCertified Data Analytics Training in Pune-April
Certified Data Analytics Training in Pune-April
 
How to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistHow to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data Scientist
 
Resume vivek mohan - Data & Analytics Chief Architect
Resume vivek mohan - Data & Analytics Chief ArchitectResume vivek mohan - Data & Analytics Chief Architect
Resume vivek mohan - Data & Analytics Chief Architect
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Business Applications of Predictive Modeling at Scale
Business Applications of Predictive Modeling at ScaleBusiness Applications of Predictive Modeling at Scale
Business Applications of Predictive Modeling at Scale
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data Science
 
Resume_Vignesh_ThulasiDass
Resume_Vignesh_ThulasiDass Resume_Vignesh_ThulasiDass
Resume_Vignesh_ThulasiDass
 
How Data Science Can Grow Your Business?
How Data Science Can Grow Your Business?How Data Science Can Grow Your Business?
How Data Science Can Grow Your Business?
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy
 
Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16Bluegranite AA Webinar FINAL 28JUN16
Bluegranite AA Webinar FINAL 28JUN16
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at Scale
 

Más de Business Over Broadway

What MBA Students Need to Know about CX, Data Science and Surveys
What MBA Students Need to Know about CX, Data Science and SurveysWhat MBA Students Need to Know about CX, Data Science and Surveys
What MBA Students Need to Know about CX, Data Science and SurveysBusiness Over Broadway
 
In a Word: The Customer Sentiment Index
In a Word: The Customer Sentiment IndexIn a Word: The Customer Sentiment Index
In a Word: The Customer Sentiment IndexBusiness Over Broadway
 
Big Data - What it Really Means for VOC and Customer Experience Professionals
Big Data - What it Really Means for VOC and Customer Experience ProfessionalsBig Data - What it Really Means for VOC and Customer Experience Professionals
Big Data - What it Really Means for VOC and Customer Experience ProfessionalsBusiness Over Broadway
 
Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Business Over Broadway
 
Customer Relationship Diagnostic: Sample Report
Customer Relationship Diagnostic: Sample ReportCustomer Relationship Diagnostic: Sample Report
Customer Relationship Diagnostic: Sample ReportBusiness Over Broadway
 
Customer Experience Management for Startups
Customer Experience Management for StartupsCustomer Experience Management for Startups
Customer Experience Management for StartupsBusiness Over Broadway
 
Big Data has Big Implications for Customer Experience Management
Big Data has Big Implications for Customer Experience ManagementBig Data has Big Implications for Customer Experience Management
Big Data has Big Implications for Customer Experience ManagementBusiness Over Broadway
 
Asking the Right CX Questions: Optimizing your Customer Relationship Survey
Asking the Right CX Questions: Optimizing your Customer Relationship SurveyAsking the Right CX Questions: Optimizing your Customer Relationship Survey
Asking the Right CX Questions: Optimizing your Customer Relationship SurveyBusiness Over Broadway
 
Competitive Analytics that Drive Customer Loyalty
Competitive Analytics that Drive Customer LoyaltyCompetitive Analytics that Drive Customer Loyalty
Competitive Analytics that Drive Customer LoyaltyBusiness Over Broadway
 
Developing a Customer Centric Research Program
Developing a Customer Centric Research ProgramDeveloping a Customer Centric Research Program
Developing a Customer Centric Research ProgramBusiness Over Broadway
 
Managing Customer Loyalty - Micro and Macro Approach
Managing Customer Loyalty - Micro and Macro ApproachManaging Customer Loyalty - Micro and Macro Approach
Managing Customer Loyalty - Micro and Macro ApproachBusiness Over Broadway
 

Más de Business Over Broadway (16)

What MBA Students Need to Know about CX, Data Science and Surveys
What MBA Students Need to Know about CX, Data Science and SurveysWhat MBA Students Need to Know about CX, Data Science and Surveys
What MBA Students Need to Know about CX, Data Science and Surveys
 
In a Word: The Customer Sentiment Index
In a Word: The Customer Sentiment IndexIn a Word: The Customer Sentiment Index
In a Word: The Customer Sentiment Index
 
The Hidden Bias in Customer Metrics
The Hidden Bias in Customer MetricsThe Hidden Bias in Customer Metrics
The Hidden Bias in Customer Metrics
 
Big Data and Customer Experience
Big Data and Customer ExperienceBig Data and Customer Experience
Big Data and Customer Experience
 
Big Data - What it Really Means for VOC and Customer Experience Professionals
Big Data - What it Really Means for VOC and Customer Experience ProfessionalsBig Data - What it Really Means for VOC and Customer Experience Professionals
Big Data - What it Really Means for VOC and Customer Experience Professionals
 
Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...
 
Customer Relationship Diagnostic: Sample Report
Customer Relationship Diagnostic: Sample ReportCustomer Relationship Diagnostic: Sample Report
Customer Relationship Diagnostic: Sample Report
 
Customer Experience Management for Startups
Customer Experience Management for StartupsCustomer Experience Management for Startups
Customer Experience Management for Startups
 
Big Data has Big Implications for Customer Experience Management
Big Data has Big Implications for Customer Experience ManagementBig Data has Big Implications for Customer Experience Management
Big Data has Big Implications for Customer Experience Management
 
Asking the Right CX Questions: Optimizing your Customer Relationship Survey
Asking the Right CX Questions: Optimizing your Customer Relationship SurveyAsking the Right CX Questions: Optimizing your Customer Relationship Survey
Asking the Right CX Questions: Optimizing your Customer Relationship Survey
 
Competitive Analytics that Drive Customer Loyalty
Competitive Analytics that Drive Customer LoyaltyCompetitive Analytics that Drive Customer Loyalty
Competitive Analytics that Drive Customer Loyalty
 
Validation of Customer Survey
Validation of Customer SurveyValidation of Customer Survey
Validation of Customer Survey
 
Developing a Customer Centric Research Program
Developing a Customer Centric Research ProgramDeveloping a Customer Centric Research Program
Developing a Customer Centric Research Program
 
Managing Customer Loyalty - Micro and Macro Approach
Managing Customer Loyalty - Micro and Macro ApproachManaging Customer Loyalty - Micro and Macro Approach
Managing Customer Loyalty - Micro and Macro Approach
 
Building a Customer Feedback Program
Building a Customer Feedback ProgramBuilding a Customer Feedback Program
Building a Customer Feedback Program
 
RAPID Loyalty Measurement
RAPID Loyalty MeasurementRAPID Loyalty Measurement
RAPID Loyalty Measurement
 

Último

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...ttt fff
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 

Último (20)

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 

Investigating data scientists

  • 1. Investigating Data Scientists Bob. E. Hayes, PhD bob@businessoverbroadway.com @bobehayes
  • 2. 2Copyright © 2015 Business Over Broadway and AnalyticsWeek Who I am Scientist / Blogger Author Consultant
  • 3. 3Copyright © 2015 Business Over Broadway and AnalyticsWeek Business Over Broadway Solve problems, primarily business problems, through the use of the scientific method Founder Using data and analytics to help make decisions that are based on fact, not hyperbole. AnalyticsWeek Help businesses optimize their data/analytics Chief Research Officer Improving talent/technology recruitment, facilitating deeper community engagement with the power of online/offline channels and advancing knowledge through research What I do
  • 4. 4Copyright © 2015 Business Over Broadway and AnalyticsWeek What is the role of a data scientist? Quotes from the article: What is the role of a data scientist in the insight economy?
  • 5. 5Copyright © 2015 Business Over Broadway and AnalyticsWeek Data Science Study • Invited data professionals via: – AnalyticsWeek Newsletter – Blog post – Analytics.Club • 600+ completed surveys – Self-assessment rating of proficiency of 25 skills across five skill areas: • Business, Technology, Programming, Math & Modeling, Statistics – 9 additional questions – Overall satisfaction with work outcome
  • 6. 6Copyright © 2015 Business Over Broadway and AnalyticsWeek Data Science Skills Assessed Area Skills* Business 1.Product design and development 2.Project management 3.Business development 4.Budgeting 5.Governance & Compliance (e.g., security) Technology 6.Managing unstructured data (e.g., noSQL) 7.Managing structured data (e.g., SQL, JSON, XML) 8.Natural Language Processing (NLP) and text mining 9.Machine Learning (e.g., decision trees, neural nets, Support Vector Machine, clustering) 10.Big and Distributed Data (e.g., Hadoop, Map/Reduce, Spark) Math & Modeling 11.Optimization (e.g., linear, integer, convex, global) 12.Math (e.g., linear algebra, real analysis, calculus) 13.Graphical Models (e.g., social networks) 14.Algorithms (e.g., computational complexity, Computer Science theory) and Simulations (e.g., discrete, agent-based, continuous) 15.Bayesian Statistics (e.g., Markov Chain Monte Carlo) Programming 16.Systems Administration (e.g., UNIX) and Design 17.Database Administration (MySQL, NOSQL) 18.Cloud Management 19.Back-End Programming (e.g., JAVA/Rails/Objective C) 20.Front-End Programming (e.g., JavaScript, HTML, CSS) Statistics 21.Data Management (e.g., recoding, de-duplicating, Integrating disparate data sources, Web scraping) 22.Data Mining (e.g. R, Python, SPSS, SAS) and Visualization (e.g., graphics, mapping, web-based data visualization) tools 23.Statistics and statistical modeling (e.g., general linear model, ANOVA, MANOVA, Spatio-temporal, Geographical Information System (GIS)) 24.Science/Scientific Method (e.g., experimental design, research design) 25.Communication (e.g., sharing results, writing/publishing, presentations, blogging) * List of skills adapted from Analyzing the Analyzers by Harlan D. Harris, Sean Patrick Murphy and Marck Vaisman
  • 7. 7Copyright © 2015 Business Over Broadway and AnalyticsWeek Proficiency Ratings* Proficiency Level Scale Value Description Don't know 0 You possess no knowledge Fundamental Awareness 20 You have a common knowledge or an understanding of basic techniques and concepts. Novice 40 You have the level of experience gained in a classroom and/or experimental scenarios or as a trainee on-the-job. You are expected to need help when performing this skill. Intermediate 60 You are able to successfully complete tasks in this competency as requested. Help from an expert may be required from time to time, but you can usually perform the skill independently. Advanced 80 You can perform the actions associated with this skill without assistance. You are certainly recognized within your immediate organization as "a person to ask" when difficult questions arise regarding this skill. Expert 100 You are known as an expert in this area. You can provide guidance, troubleshoot and answer questions related to this area of expertise and the field where the skill is used. * Rating scale is based on a proficiency rating scale used by NIH. Respondent instructions: You will be asked about your proficiency for a variety of skills. Please use the following scale when indicating your level of proficiency for each skill.
  • 8. 8Copyright © 2015 Business Over Broadway and AnalyticsWeek Sample
  • 9. 9Copyright © 2015 Business Over Broadway and AnalyticsWeek Proficiency varies across skills
  • 10. 10Copyright © 2016 Business Over Broadway and AnalyticsWeek Job Roles in Data Science *Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person, entrepreneur); Creative (e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
  • 11. 11Copyright © 2015 Business Over Broadway and AnalyticsWeek Proficiency varies by job role – 1st Look
  • 12. 12Copyright © 2016 Business Over Broadway and AnalyticsWeek Proficiency varies by job role – 2nd Look
  • 13. 13Copyright © 2016 Business Over Broadway and AnalyticsWeek Structure of Data Science Skills * Factor analysis is based on proficiency ratings of 621 data professionals. Reliability (Cronbach’s alpha for each of the three Skills areas (based on items that loaded on the respective factors) were: .87 (Business); .92 (Tech / Prog); .92 (Math / Stats) Factor Analysis of Data Skills • Data reduction technique • Examines the statistical relationships (e.g., correlations) among a large set of variables and tries to explain these correlations using a smaller number of variables (factors) • Elements (or factor loadings) of the factor pattern matrix represent the strength of relationship between the variables and each of the underlying factors • Tells us two things: 1. number of underlying factors that describe the initial set of variables 2. which variables are best represented by each factor
  • 14. 14Copyright © 2016 Business Over Broadway and AnalyticsWeek Structure of Data Science Skills * Factor analysis is based on proficiency ratings of 621 data professionals. Reliability (Cronbach’s alpha for each of the three Skills areas (based on items that loaded on the respective factors) were: .87 (Business); .92 (Tech / Prog); .92 (Math / Stats) Plot the factor loadings for the 25 data skills into a 3-dimensional space Three Distinct Skill Sets • Business • Technology / Programming • Math / Statistics
  • 15. 16Copyright © 2016 Business Over Broadway and AnalyticsWeek
  • 16. 17Copyright © 2016 Business Over Broadway and AnalyticsWeek The Structure of Data Science Skills
  • 17. 18Copyright © 2016 Business Over Broadway and AnalyticsWeek Proficiency varies by job role – 3rd Look
  • 18. 19Copyright © 2016 Business Over Broadway and AnalyticsWeek Proficiency varies by job role – 4th Look *Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person, entrepreneur) n = 86; Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
  • 19. 20Copyright © 2016 Business Over Broadway and AnalyticsWeek Proficiency varies by job role – 4th Look *Researcher (e.g., researcher, scientist, statistician) n = 133; Business Management (e.g., leader, business person, entrepreneur) n = 86; Creative (e.g., jack of all trades, artist, hacker) n = 30; Developer (e.g., developer, engineer) n = 54
  • 20. 21Copyright © 2016 Business Over Broadway and AnalyticsWeek Top Data Science Skills by Job Role
  • 21. 22Copyright © 2016 Business Over Broadway and AnalyticsWeek Top Data Science Skills by Job Role
  • 22. 23Copyright © 2016 Business Over Broadway and AnalyticsWeek Satisfaction with Work Outcome *Researcher (e.g., researcher, scientist, statistician); Business Management (e.g., leader, business person, entrepreneur); Creative (e.g., jack of all trades, artist, hacker); Developer (e.g., developer, engineer)
  • 23. 24Copyright © 2016 Business Over Broadway and AnalyticsWeek In Search of the Data Scientist Unicorn
  • 24. 25Copyright © 2016 Business Over Broadway and AnalyticsWeek Data Science as a Team Sport Impact of Business Expert
  • 25. 26Copyright © 2016 Business Over Broadway and AnalyticsWeek Data Science as a Team Sport Impact of Technology / Programming Expert
  • 26. 27Copyright © 2016 Business Over Broadway and AnalyticsWeek Data Science as a Team Sport Impact of Math & Modeling / Statistics Expert
  • 27. 28Copyright © 2016 Business Over Broadway and AnalyticsWeek Getting Insight from Data: The Scientific Method
  • 28. 29Copyright © 2016 Business Over Broadway and AnalyticsWeek Scientific Method and Data Science Skills
  • 29. 30Copyright © 2016 Business Over Broadway and AnalyticsWeek Skill Proficiency Drives Satisfaction with Work Outcome
  • 30. 31Copyright © 2016 Business Over Broadway and AnalyticsWeek What skills are linked to project success?
  • 31. 32Copyright © 2016 Business Over Broadway and AnalyticsWeek Importance of Data Science Skills by Job Role
  • 32. 33Copyright © 2016 Business Over Broadway and AnalyticsWeek Impact of Data Mining and Viz Tools
  • 33. 34Copyright © 2015 Business Over Broadway and AnalyticsWeek EXPERT PROFICIENCY NO PROFICIENCY Skill-Based Approach to Improve the Practice of Data Science IMPROVE / INVEST DIVEST DON’T OVERINVEST STAY THE COURSE SKILL ESSENTIAL TO PROJECT OUTCOME SKILL NOT ESSENTIAL TO PROJECT OUTCOMECopyright 2015 Business Over Broadway
  • 34. 35Copyright © 2015 Business Over Broadway and AnalyticsWeek-.10 .00 .10 .20 .30 .40 .50 .60 0 20 40 60 80 100 Business Management EXPERTNONE Skill-Based Method to Improve Data Science Data Mining and Viz Tools Big and distributed data Business Management Science / Scientific Method Statistics / Statistical Modeling Machine Learning Bayesian Statistics Unstructured data Optimization Algorithms NOT ESSENTIAL ESSENTIAL Copyright 2015 Business Over Broadway
  • 35. 36Copyright © 2015 Business Over Broadway and AnalyticsWeek EXPERTNONE Skill-Based Method to Improve Data Science Systems Administration Developer Data Mining and Viz Tools NOT ESSENTIAL ESSENTIAL Copyright 2015 Business Over Broadway
  • 36. 37Copyright © 2015 Business Over Broadway and AnalyticsWeek EXPERTNONE Skill-Based Method to Improve Data Science Business Development Creative Optimization Copyright 2015 Business Over Broadway Math Data Mining and Viz Tools Graphical Models NOT ESSENTIAL ESSENTIAL
  • 37. 38Copyright © 2015 Business Over Broadway and AnalyticsWeek ImpactonSatisfactionwithOutcome Proficiency EXPERTNONE NOT ESSENTIAL ESSENTIAL Skill-Based Method to Improve Data Science Big and distributed data Bayesian Statistics Researcher Data Management Machine Learning AlgorithmsProduct Design Copyright 2015 Business Over Broadway
  • 38. 39Copyright © 2016 Business Over Broadway and AnalyticsWeek Lack of Gender Diversity
  • 39. 40Copyright © 2016 Business Over Broadway and AnalyticsWeek Lack of Gender Diversity – Other Science Roles
  • 40. 41Copyright © 2016 Business Over Broadway and AnalyticsWeek Job Roles in Data Science by Gender
  • 41. 42Copyright © 2016 Business Over Broadway and AnalyticsWeek Highest Level of Education Attained
  • 42. 43Copyright © 2016 Business Over Broadway and AnalyticsWeek Gender Comparison of Proficiency across Skills
  • 43. 44Copyright © 2016 Business Over Broadway and AnalyticsWeek Advice for Data Scientists • Take the Data Science Skills Survey at: http://pxl.me/awrds3 • Be specific when talking about “data scientists” • There are different types – defined by what they do and the skills they possess • Work with other data professionals who have complementary skills. Teamwork is key to success. • Learn to use data mining and visualization tools • R, Python, SPSS, SAS, graphics, mapping, web-based data visualization • Be an advocate for women in the field of data science
  • 44. 45Copyright © 2015 Business Over Broadway and AnalyticsWeek Bob E. Hayes, Ph.D. Email: bob@analyticsweek.com and bob@businessoverbroadway.com Web: www.analyticsweek.com and www.businessoverbroadway.com Blog: www.businessoverbroadway.com/blog Twitter: www.twitter.com/bobehayes

Notas del editor

  1. A factor analysis is a data reduction technique. It is used when you have a large number of variables in your data set and would like to reduce the number of variables to a manageable size.  In general, factor analysis examines the statistical relationships (e.g., correlations) among a large set of variables and tries to explain these correlations using a smaller number of variables (factors). The results of the factor analysis are presented in tabular format called the factor pattern matrix. The factor matrix is an NxM table (N = number of original variables and M = number of underlying factors). The elements of a factor pattern matrix represent the regression coefficients (like a correlation coefficient) between each of the variables and the factors. These elements (or factor loadings) represent the strength of relationship between the variables and each of the underlying factors. The results of the factor analysis tells us two things: number of underlying factors that describe the initial set of variables which variables are best represented by each factor