SlideShare una empresa de Scribd logo
1 de 47
Lesson 3
AI in the Enterprise
2
Legal Disclaimers
No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by
this document.
Intel disclaims all express and implied warranties, including without limitation, the implied warranties of
merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising
from course of performance, course of dealing, or usage in trade.
Copies of documents which have an order number and are referenced in this document may be obtained
by calling 1-800-548-4725 or by visiting www.intel.com/design/literature.htm.
Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.
*Other names and brands may be claimed as the property of others
Copyright © 2018 Intel Corporation. All rights reserved.
3
Learning Objectives
• Identify the steps in the data science workflow
• Identify the key roles and skill sets within the field of AI
• Describe the different ways to structure an AI team
• Identify common data science misconceptions
• Identify the components of model maintenance after deployment
You will be able to:
5
Data Science Workflow
Problem Statement
Data Collection
Data Exploration
& Preprocessing
Modeling
Validation
Decision Making
& Deployment
What problem are you trying to solve?
What data do you need to solve it?
How should you clean your data so your model can use it?
Build a model to solve your problem?
Did I solve the problem?
Communicate to stakeholders or put into production?
6
Problem Statement
What problem are you trying to solve?
• Data scientists first need to identify the problem to solve.
• Knowledge of the business is needed to identify impactful opportunities.
• Technical knowledge is needed to ask the right questions, and to know
what is possible.
7
Data Collection
What data do you need to solve your problem?
• The data required to solve the problem needs to be
identified and collected.
• Data and engineering skills are needed to collect and
consolidate data from multiple sources.
8
Data Exploration and Preprocessing
How should you clean your data so your model can use it?
• Data needs to be cleaned and processed so that it’s in a
usable format for modeling.
• Exploration is required to identify important elements within
the data and to identify any data quality issues.
• Data, engineering, and statistics skills are needed to
appropriately process the data and make inferences.
9
Modeling
What model needs to be built to solve your problem?
• Several factors go into modeling such as complexity,
required data, speed, and performance.
• This step requires skills in engineering, modeling, and
statistics.
10
Validation
Did the problem get sufficiently solved?
• Validation is required to ensure the original problem was
solved.
• Model performance needs to be accurately measured.
• Statistics and modeling skills, as well as domain
knowledge, are needed to make sure the results align
with the business problem.
11
Decision Making and Deployment
Communicate to stakeholders or put into production.
• A business decision needs to be made, or a product
needs to be put into production, so the business can
see value from the project.
• This requires domain knowledge, as well as
communication and storytelling skills.
• Engineering skills are needed to integrate code into
back-end software systems.
13
Data Science Skill Sets
Data science teams need a variety of skills to be successful.
Data Munging
Statistics
Software
Engineering
Domain
Expertise
Modeling
Storytelling &
Communicatio
n
14
Data Science Team Skills
Communicate
with business
leaders
Integrate code
into software
systems
Communication/
Storytelling
Domain
Expertise
Modeling
Statistics
Data Munging
Software
Engineering
Data science teams need a variety of skills to be successful.
15
Communication, Storytelling, Domain Expertise
Understand the business needs and communicate how to address them.
• Domain expertise to understand the process
and business problem to help their business.
• Persuade decision makers to support their idea.
• Communicate complicated concepts clearly,
and tell stories.
16
Modeling and Statistics
Use data to make predictions via models, and using
statistics to assess the validity of those predictions.
• Work with a variety of modeling techniques,
from regression to DL.
• Use statistics to assess the performance of
one model vs. another.
• Design experiments and perform A/B testing.
17
Data Munging/Software Engineering
Transform messy data into clean, usable data, as well as
building software systems to deploy their models.
• Raw data can be messy and unstructured.
• Data must be manipulated and stored in
databases before it can be used.
• Models need to be deployed.
Roles have evolved that fit on different places on this spectrum.
19
Roles on Data Science Teams
Product
Manager
Statistician
Machine
Learning
Engineer
Research
Scientist
Data
Engineer
Software
Engineerin
g
Business
Analyst
Communicate
with business
leaders
Integrate code
into software
systems
20
Business Analysts
Business analysts interact with decision-makers.
• Create reports and provide insights.
• Create dashboards displaying key product KPIs.
• Perform analysis to determine business impact of
a new product/feature.
• Excel*, PowerBI* and Tableau* are examples of
tools used.
*Other names and brands may be claimed as the property of others.
21
Product Managers
Product Managers get requirements from business.
• Translate business ideas into product ideas.
• Determine feasibility of solving business problems.
• Consider impact of new product or model on key
business metrics.
• Prioritize projects and tasks.
• Examples of tools used: Microsoft Project*,
Trello*, and JIRA*.
*Other names and brands may be claimed as the property of others.
22
Statisticians
Statisticians determine the validity of models.
• Applies statistical concepts to determine amount
of data required.
• Explores outliers and trends.
• Determines if results are statistically significant.
• Examples of tools used: R*, SAS*, Python*.
*Other names and brands may be claimed as the property of others.
23
Machine Learning Engineers
Machine learning engineers solve problems involving large amounts of
high-dimensional data.
• Apply machine learning techniques.
• Focused on the engineering that makes
models accurate and fast.
• Examples of tools used: Python*, R*,
and MATLAB*.
*Other names and brands may be claimed as the property of others.
24
Research Scientists
Research scientists work on problems in bleeding-
edge fields.
• Work on the toughest problems in big data and
machine learning.
• Expert at a particular sub-discipline.
• Understand how algorithms work under the hood.
• Can be part of a separate research team,
interfacing with data science team when
necessary.
• Examples of tools used: Python*, MATLAB*, R*,
C++, and Java*.
*Other names and brands may be claimed as the property of others.
25
Data Engineers
Data engineers build data ingestion, storage, and
infrastructure.
• Databases experts.
• Know the tradeoffs between speed, reliability, and
size.
• Automate data cleaning.
• Build ETL (extract, transform, load) pipelines to
make data available on a regular cadence.
• Examples of tools used: Java*, SQL, and noSQL.
*Other names and brands may be claimed as the property of others.
26
Software Engineers
Software engineers are responsible for
optimizing code and deploying.
• Get code into production.
• Write tests to detect code breaking and bugs.
• Ensure model code is maintainable.
• Examples of tools used: Python*, Ruby*, C++,
and Java*.
*Other names and brands may be claimed as the property of others.
Skills and Roles
Business
Analyst
Product
Manager
Statistician
Machine
Learning
Engineer
Research
Scientist
Data
Engineer
Software
Engineer
Communication/Story X X
Domain Expertise X X
Modeling X X X X
Statistics X X X X
Data Munging X X X X
Software Engineering X
29
Data Science Organizational Structure
There are multiple ways to organize data science teams.
• Centralized teams where are all data scientists report to
the same head.
• Distributed teams where individual data scientists work with
a business team.
• Teams embedded within functional business units.
30
Centralized Teams
All the data scientists report to the same group head.
• Usually within a technology or IT team.
• Pros:
• Standardization of skills and tools
• Reduction of redundant roles
• Closer collaboration amongst scientists
• Cons:
• Further removed from the business units
• Tendency to be more reactive to problems
31
Distributed Teams
Distributed teams are where individual data scientists sit within a particular
team within a business unit.
• Pros:
• Closer access to the business, domain experts, and end users
• More likely to come up with solutions to immediate business
problems
• Cons:
• Destandardization of tools and skills throughout the organization
• Less communication and collaboration between data scientists
32
Additional Team Structures
There are multiple hybrid ways to organize data science teams.
• Some examples include:
• Full data science teams can be embedded
within a business function.
• Data science centers of excellence can be
created for the company.
34
Misconception #1: Data Science “Unicorn”
Data Scientists who are experts in all areas are called “unicorns”.
• Successful teams contain people with a diverse array of
skillsets and backgrounds.
• Some excel at communication, while others may excel at
statistics.
• Successful teams have experts in the three main areas:
business, science, and engineering.
35
Misconception #2: Research and Algorithms Focus
Data science teams cannot just focus on research and algorithms.
• Effective teams have mechanisms to:
• Identify problems
• Communicate findings
• Work with engineering to understand
how to put their models into production
36
Misconception #3: Complex and Advanced Systems
The most complicated solution isn’t always the best.
• Teams tend to be more successful when they start simple
and then move on to more complex modeling techniques.
• Complex models may be more accurate, but are less
interpretable, more likely to fail in unpredictable ways, and
harder to maintain.
• Starting simple also ensures that what the team is building
aligns with business needs.
37
Misconception #4: Industry Differences
The underlying modeling and data analysis techniques are
largely transferable between industries.
• Domain expertise is required to understand which data
is relevant and which problems are most important to
solve.
• The techniques used to clean data, store it, and
extract useful insights and modeling remain very similar.
38
Misconception #5: Projects Begin Well-Defined
Data Science projects are often exploratory and
experimental in nature.
• It may not be clear how hard the problem is to
solve until investing time exploring the data.
• Product Managers must actively work with both
the team and the business stakeholders to
manage expectations.
39
Misconception #6: Best Prediction Models are Best
There are more challenges involved when selecting a
model than its predictive abilities.
• Some models may be too slow or complicated to
include in production.
• Some models may not be interpretable, and would
have a tough sell with decision makers.
41
After Deploying a Model
Once a model is deployed, relevant teams must monitor and manage
the model for it to be useful.
• Business Intelligence teams should build reports/dashboards
displaying model results.
• Business teams (e.g. operations, merchandising) or customers
should use model outputs.
• Data Science teams must update model at appropriate cadence
(for example, monthly).
42
Business Intelligence Team
The business intelligence team monitors how model
predictions are changing over time.
• Monitor model usage and output.
• Develop key performance indicators and
dashboards.
• Visualizations should align with the needs of the
functional teams (for example, operations) that will
be using the model.
43
Functional Business Team
Use dashboards to improve decision making.
• For example: marketing team could use churn
model to decide when to reach out to groups of
customers.
• For example: operations team could use model
to predict shipping/logistics times.
44
Data Science Team
Monitor model to ensure continued validity and usefulness.
• For example: changes in marketing may lead to different types of
customers coming in the door, making the old model of customer
behavior less accurate.
• Depending on how fast the business conditions are changing, the data
science team should update model with appropriate frequency.
• Retraining models with new data.
45
Learning Objectives Recap
In this lesson, we worked to:
• Identify the steps in the data science workflow
• Identify the key roles and skill sets within the field of AI
• Describe the different ways to structure an AI team
• Identify common data science misconceptions
• Identify the components of model maintenance after deployment
Sources for images used in this presentation
(listed by slide number)
20 https://www.pexels.com/photo/adult-brainstorming-business-chair-515167/
22 https://www.pexels.com/photo/blue-screen-of-death-in-silver-black-laptop-177598/
31 https://www.pexels.com/photo/ground-group-growth-hands-461049/
33 https://www.pexels.com/photo/yellow-blue-and-red-plastic-gameboard-toy-209712/
34 https://www.pexels.com/photo/adult-agreement-beard-beverage-618550/
38 https://www.pexels.com/photo/blue-and-green-pie-chart-97080/
39 https://www.pexels.com/photo/arrows-box-business-chalk-533189/

Más contenido relacionado

La actualidad más candente

Understanding and Protecting Artificial Intelligence Technology (Machine Lear...
Understanding and Protecting Artificial Intelligence Technology (Machine Lear...Understanding and Protecting Artificial Intelligence Technology (Machine Lear...
Understanding and Protecting Artificial Intelligence Technology (Machine Lear...Knobbe Martens - Intellectual Property Law
 
01 foundations
01 foundations01 foundations
01 foundationsankit_ppt
 
Fairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine LearningFairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine LearningHJ van Veen
 
Barga DIDC'14 Invited Talk
Barga DIDC'14 Invited TalkBarga DIDC'14 Invited Talk
Barga DIDC'14 Invited TalkRoger Barga
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Edureka!
 
Machine Learning Using Python
Machine Learning Using PythonMachine Learning Using Python
Machine Learning Using PythonSavitaHanchinal
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningPruet Boonma
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Edureka!
 
Building a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to ZBuilding a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to ZCharles Vestur
 
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre..."An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...Edge AI and Vision Alliance
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionGirish Gore
 
Patent Protection of Machine Learning and Artificial Intelligence Inventions ...
Patent Protection of Machine Learning and Artificial Intelligence Inventions ...Patent Protection of Machine Learning and Artificial Intelligence Inventions ...
Patent Protection of Machine Learning and Artificial Intelligence Inventions ...Knobbe Martens - Intellectual Property Law
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)butest
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4Roger Barga
 
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...Edge AI and Vision Alliance
 
Machine learning with Big Data power point presentation
Machine learning with Big Data power point presentationMachine learning with Big Data power point presentation
Machine learning with Big Data power point presentationDavid Raj Kanthi
 
Machine Learning and Applications
Machine Learning and ApplicationsMachine Learning and Applications
Machine Learning and ApplicationsGeeta Arora
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning SystemsXavier Amatriain
 

La actualidad más candente (20)

Understanding and Protecting Artificial Intelligence Technology (Machine Lear...
Understanding and Protecting Artificial Intelligence Technology (Machine Lear...Understanding and Protecting Artificial Intelligence Technology (Machine Lear...
Understanding and Protecting Artificial Intelligence Technology (Machine Lear...
 
01 foundations
01 foundations01 foundations
01 foundations
 
Fairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine LearningFairly Measuring Fairness In Machine Learning
Fairly Measuring Fairness In Machine Learning
 
Barga DIDC'14 Invited Talk
Barga DIDC'14 Invited TalkBarga DIDC'14 Invited Talk
Barga DIDC'14 Invited Talk
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
 
Applications of Machine Learning
Applications of Machine LearningApplications of Machine Learning
Applications of Machine Learning
 
Machine Learning Using Python
Machine Learning Using PythonMachine Learning Using Python
Machine Learning Using Python
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
 
Building a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to ZBuilding a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to Z
 
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre..."An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
 
Patent Protection of Machine Learning and Artificial Intelligence Inventions ...
Patent Protection of Machine Learning and Artificial Intelligence Inventions ...Patent Protection of Machine Learning and Artificial Intelligence Inventions ...
Patent Protection of Machine Learning and Artificial Intelligence Inventions ...
 
Artificial Neural Networks for Data Mining
Artificial Neural Networks for Data MiningArtificial Neural Networks for Data Mining
Artificial Neural Networks for Data Mining
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4
 
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
 
Machine learning with Big Data power point presentation
Machine learning with Big Data power point presentationMachine learning with Big Data power point presentation
Machine learning with Big Data power point presentation
 
Machine Learning and Applications
Machine Learning and ApplicationsMachine Learning and Applications
Machine Learning and Applications
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems
 

Similar a Lesson 3 ai in the enterprise

Data Scientist By: Professor Lili Saghafi
Data Scientist By: Professor Lili SaghafiData Scientist By: Professor Lili Saghafi
Data Scientist By: Professor Lili SaghafiProfessor Lili Saghafi
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxCarolineRebeccaD
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfHendri Karisma
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?DIGITALSAI1
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification courseKumarNaik21
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabadVamsiNihal
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabadsaitejavella
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
data science training and placement
data science training and placementdata science training and placement
data science training and placementSaiprasadVella
 
online data science training
online data science trainingonline data science training
online data science trainingDIGITALSAI1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabadVamsiNihal
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in HyderabadKumarNaik21
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 

Similar a Lesson 3 ai in the enterprise (20)

semana1.pptx
semana1.pptxsemana1.pptx
semana1.pptx
 
Data Scientist By: Professor Lili Saghafi
Data Scientist By: Professor Lili SaghafiData Scientist By: Professor Lili Saghafi
Data Scientist By: Professor Lili Saghafi
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptx
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Data Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdfData Analytics Today - Data, Tech, and Regulation.pdf
Data Analytics Today - Data, Tech, and Regulation.pdf
 
Data Science and Analytics
Data Science and Analytics Data Science and Analytics
Data Science and Analytics
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
 
online data science training
online data science trainingonline data science training
online data science training
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 

Más de ankit_ppt

Deep learning summary
Deep learning summaryDeep learning summary
Deep learning summaryankit_ppt
 
08 neural networks
08 neural networks08 neural networks
08 neural networksankit_ppt
 
06 image features
06 image features06 image features
06 image featuresankit_ppt
 
05 contours seg_matching
05 contours seg_matching05 contours seg_matching
05 contours seg_matchingankit_ppt
 
04 image transformations_ii
04 image transformations_ii04 image transformations_ii
04 image transformations_iiankit_ppt
 
03 image transformations_i
03 image transformations_i03 image transformations_i
03 image transformations_iankit_ppt
 
02 image processing
02 image processing02 image processing
02 image processingankit_ppt
 
Text similarity measures
Text similarity measuresText similarity measures
Text similarity measuresankit_ppt
 
Text generation and_advanced_topics
Text generation and_advanced_topicsText generation and_advanced_topics
Text generation and_advanced_topicsankit_ppt
 
Nlp toolkits and_preprocessing_techniques
Nlp toolkits and_preprocessing_techniquesNlp toolkits and_preprocessing_techniques
Nlp toolkits and_preprocessing_techniquesankit_ppt
 
Matrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlpMatrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlpankit_ppt
 
Machine learning and_nlp
Machine learning and_nlpMachine learning and_nlp
Machine learning and_nlpankit_ppt
 
Latent dirichlet allocation_and_topic_modeling
Latent dirichlet allocation_and_topic_modelingLatent dirichlet allocation_and_topic_modeling
Latent dirichlet allocation_and_topic_modelingankit_ppt
 
Intro to nlp
Intro to nlpIntro to nlp
Intro to nlpankit_ppt
 
Ot regularization and_gradient_descent
Ot regularization and_gradient_descentOt regularization and_gradient_descent
Ot regularization and_gradient_descentankit_ppt
 
Ml10 dimensionality reduction-and_advanced_topics
Ml10 dimensionality reduction-and_advanced_topicsMl10 dimensionality reduction-and_advanced_topics
Ml10 dimensionality reduction-and_advanced_topicsankit_ppt
 
Ml9 introduction to-unsupervised_learning_and_clustering_methods
Ml9 introduction to-unsupervised_learning_and_clustering_methodsMl9 introduction to-unsupervised_learning_and_clustering_methods
Ml9 introduction to-unsupervised_learning_and_clustering_methodsankit_ppt
 
Ml8 boosting and-stacking
Ml8 boosting and-stackingMl8 boosting and-stacking
Ml8 boosting and-stackingankit_ppt
 

Más de ankit_ppt (20)

Deep learning summary
Deep learning summaryDeep learning summary
Deep learning summary
 
08 neural networks
08 neural networks08 neural networks
08 neural networks
 
07 learning
07 learning07 learning
07 learning
 
06 image features
06 image features06 image features
06 image features
 
05 contours seg_matching
05 contours seg_matching05 contours seg_matching
05 contours seg_matching
 
04 image transformations_ii
04 image transformations_ii04 image transformations_ii
04 image transformations_ii
 
03 image transformations_i
03 image transformations_i03 image transformations_i
03 image transformations_i
 
02 image processing
02 image processing02 image processing
02 image processing
 
Word2 vec
Word2 vecWord2 vec
Word2 vec
 
Text similarity measures
Text similarity measuresText similarity measures
Text similarity measures
 
Text generation and_advanced_topics
Text generation and_advanced_topicsText generation and_advanced_topics
Text generation and_advanced_topics
 
Nlp toolkits and_preprocessing_techniques
Nlp toolkits and_preprocessing_techniquesNlp toolkits and_preprocessing_techniques
Nlp toolkits and_preprocessing_techniques
 
Matrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlpMatrix decomposition and_applications_to_nlp
Matrix decomposition and_applications_to_nlp
 
Machine learning and_nlp
Machine learning and_nlpMachine learning and_nlp
Machine learning and_nlp
 
Latent dirichlet allocation_and_topic_modeling
Latent dirichlet allocation_and_topic_modelingLatent dirichlet allocation_and_topic_modeling
Latent dirichlet allocation_and_topic_modeling
 
Intro to nlp
Intro to nlpIntro to nlp
Intro to nlp
 
Ot regularization and_gradient_descent
Ot regularization and_gradient_descentOt regularization and_gradient_descent
Ot regularization and_gradient_descent
 
Ml10 dimensionality reduction-and_advanced_topics
Ml10 dimensionality reduction-and_advanced_topicsMl10 dimensionality reduction-and_advanced_topics
Ml10 dimensionality reduction-and_advanced_topics
 
Ml9 introduction to-unsupervised_learning_and_clustering_methods
Ml9 introduction to-unsupervised_learning_and_clustering_methodsMl9 introduction to-unsupervised_learning_and_clustering_methods
Ml9 introduction to-unsupervised_learning_and_clustering_methods
 
Ml8 boosting and-stacking
Ml8 boosting and-stackingMl8 boosting and-stacking
Ml8 boosting and-stacking
 

Último

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfRagavanV2
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSrknatarajan
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 

Último (20)

(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 

Lesson 3 ai in the enterprise

  • 1. Lesson 3 AI in the Enterprise
  • 2. 2 Legal Disclaimers No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548-4725 or by visiting www.intel.com/design/literature.htm. Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries. *Other names and brands may be claimed as the property of others Copyright © 2018 Intel Corporation. All rights reserved.
  • 3. 3 Learning Objectives • Identify the steps in the data science workflow • Identify the key roles and skill sets within the field of AI • Describe the different ways to structure an AI team • Identify common data science misconceptions • Identify the components of model maintenance after deployment You will be able to:
  • 4.
  • 5. 5 Data Science Workflow Problem Statement Data Collection Data Exploration & Preprocessing Modeling Validation Decision Making & Deployment What problem are you trying to solve? What data do you need to solve it? How should you clean your data so your model can use it? Build a model to solve your problem? Did I solve the problem? Communicate to stakeholders or put into production?
  • 6. 6 Problem Statement What problem are you trying to solve? • Data scientists first need to identify the problem to solve. • Knowledge of the business is needed to identify impactful opportunities. • Technical knowledge is needed to ask the right questions, and to know what is possible.
  • 7. 7 Data Collection What data do you need to solve your problem? • The data required to solve the problem needs to be identified and collected. • Data and engineering skills are needed to collect and consolidate data from multiple sources.
  • 8. 8 Data Exploration and Preprocessing How should you clean your data so your model can use it? • Data needs to be cleaned and processed so that it’s in a usable format for modeling. • Exploration is required to identify important elements within the data and to identify any data quality issues. • Data, engineering, and statistics skills are needed to appropriately process the data and make inferences.
  • 9. 9 Modeling What model needs to be built to solve your problem? • Several factors go into modeling such as complexity, required data, speed, and performance. • This step requires skills in engineering, modeling, and statistics.
  • 10. 10 Validation Did the problem get sufficiently solved? • Validation is required to ensure the original problem was solved. • Model performance needs to be accurately measured. • Statistics and modeling skills, as well as domain knowledge, are needed to make sure the results align with the business problem.
  • 11. 11 Decision Making and Deployment Communicate to stakeholders or put into production. • A business decision needs to be made, or a product needs to be put into production, so the business can see value from the project. • This requires domain knowledge, as well as communication and storytelling skills. • Engineering skills are needed to integrate code into back-end software systems.
  • 12.
  • 13. 13 Data Science Skill Sets Data science teams need a variety of skills to be successful. Data Munging Statistics Software Engineering Domain Expertise Modeling Storytelling & Communicatio n
  • 14. 14 Data Science Team Skills Communicate with business leaders Integrate code into software systems Communication/ Storytelling Domain Expertise Modeling Statistics Data Munging Software Engineering Data science teams need a variety of skills to be successful.
  • 15. 15 Communication, Storytelling, Domain Expertise Understand the business needs and communicate how to address them. • Domain expertise to understand the process and business problem to help their business. • Persuade decision makers to support their idea. • Communicate complicated concepts clearly, and tell stories.
  • 16. 16 Modeling and Statistics Use data to make predictions via models, and using statistics to assess the validity of those predictions. • Work with a variety of modeling techniques, from regression to DL. • Use statistics to assess the performance of one model vs. another. • Design experiments and perform A/B testing.
  • 17. 17 Data Munging/Software Engineering Transform messy data into clean, usable data, as well as building software systems to deploy their models. • Raw data can be messy and unstructured. • Data must be manipulated and stored in databases before it can be used. • Models need to be deployed.
  • 18.
  • 19. Roles have evolved that fit on different places on this spectrum. 19 Roles on Data Science Teams Product Manager Statistician Machine Learning Engineer Research Scientist Data Engineer Software Engineerin g Business Analyst Communicate with business leaders Integrate code into software systems
  • 20. 20 Business Analysts Business analysts interact with decision-makers. • Create reports and provide insights. • Create dashboards displaying key product KPIs. • Perform analysis to determine business impact of a new product/feature. • Excel*, PowerBI* and Tableau* are examples of tools used. *Other names and brands may be claimed as the property of others.
  • 21. 21 Product Managers Product Managers get requirements from business. • Translate business ideas into product ideas. • Determine feasibility of solving business problems. • Consider impact of new product or model on key business metrics. • Prioritize projects and tasks. • Examples of tools used: Microsoft Project*, Trello*, and JIRA*. *Other names and brands may be claimed as the property of others.
  • 22. 22 Statisticians Statisticians determine the validity of models. • Applies statistical concepts to determine amount of data required. • Explores outliers and trends. • Determines if results are statistically significant. • Examples of tools used: R*, SAS*, Python*. *Other names and brands may be claimed as the property of others.
  • 23. 23 Machine Learning Engineers Machine learning engineers solve problems involving large amounts of high-dimensional data. • Apply machine learning techniques. • Focused on the engineering that makes models accurate and fast. • Examples of tools used: Python*, R*, and MATLAB*. *Other names and brands may be claimed as the property of others.
  • 24. 24 Research Scientists Research scientists work on problems in bleeding- edge fields. • Work on the toughest problems in big data and machine learning. • Expert at a particular sub-discipline. • Understand how algorithms work under the hood. • Can be part of a separate research team, interfacing with data science team when necessary. • Examples of tools used: Python*, MATLAB*, R*, C++, and Java*. *Other names and brands may be claimed as the property of others.
  • 25. 25 Data Engineers Data engineers build data ingestion, storage, and infrastructure. • Databases experts. • Know the tradeoffs between speed, reliability, and size. • Automate data cleaning. • Build ETL (extract, transform, load) pipelines to make data available on a regular cadence. • Examples of tools used: Java*, SQL, and noSQL. *Other names and brands may be claimed as the property of others.
  • 26. 26 Software Engineers Software engineers are responsible for optimizing code and deploying. • Get code into production. • Write tests to detect code breaking and bugs. • Ensure model code is maintainable. • Examples of tools used: Python*, Ruby*, C++, and Java*. *Other names and brands may be claimed as the property of others.
  • 27. Skills and Roles Business Analyst Product Manager Statistician Machine Learning Engineer Research Scientist Data Engineer Software Engineer Communication/Story X X Domain Expertise X X Modeling X X X X Statistics X X X X Data Munging X X X X Software Engineering X
  • 28.
  • 29. 29 Data Science Organizational Structure There are multiple ways to organize data science teams. • Centralized teams where are all data scientists report to the same head. • Distributed teams where individual data scientists work with a business team. • Teams embedded within functional business units.
  • 30. 30 Centralized Teams All the data scientists report to the same group head. • Usually within a technology or IT team. • Pros: • Standardization of skills and tools • Reduction of redundant roles • Closer collaboration amongst scientists • Cons: • Further removed from the business units • Tendency to be more reactive to problems
  • 31. 31 Distributed Teams Distributed teams are where individual data scientists sit within a particular team within a business unit. • Pros: • Closer access to the business, domain experts, and end users • More likely to come up with solutions to immediate business problems • Cons: • Destandardization of tools and skills throughout the organization • Less communication and collaboration between data scientists
  • 32. 32 Additional Team Structures There are multiple hybrid ways to organize data science teams. • Some examples include: • Full data science teams can be embedded within a business function. • Data science centers of excellence can be created for the company.
  • 33.
  • 34. 34 Misconception #1: Data Science “Unicorn” Data Scientists who are experts in all areas are called “unicorns”. • Successful teams contain people with a diverse array of skillsets and backgrounds. • Some excel at communication, while others may excel at statistics. • Successful teams have experts in the three main areas: business, science, and engineering.
  • 35. 35 Misconception #2: Research and Algorithms Focus Data science teams cannot just focus on research and algorithms. • Effective teams have mechanisms to: • Identify problems • Communicate findings • Work with engineering to understand how to put their models into production
  • 36. 36 Misconception #3: Complex and Advanced Systems The most complicated solution isn’t always the best. • Teams tend to be more successful when they start simple and then move on to more complex modeling techniques. • Complex models may be more accurate, but are less interpretable, more likely to fail in unpredictable ways, and harder to maintain. • Starting simple also ensures that what the team is building aligns with business needs.
  • 37. 37 Misconception #4: Industry Differences The underlying modeling and data analysis techniques are largely transferable between industries. • Domain expertise is required to understand which data is relevant and which problems are most important to solve. • The techniques used to clean data, store it, and extract useful insights and modeling remain very similar.
  • 38. 38 Misconception #5: Projects Begin Well-Defined Data Science projects are often exploratory and experimental in nature. • It may not be clear how hard the problem is to solve until investing time exploring the data. • Product Managers must actively work with both the team and the business stakeholders to manage expectations.
  • 39. 39 Misconception #6: Best Prediction Models are Best There are more challenges involved when selecting a model than its predictive abilities. • Some models may be too slow or complicated to include in production. • Some models may not be interpretable, and would have a tough sell with decision makers.
  • 40.
  • 41. 41 After Deploying a Model Once a model is deployed, relevant teams must monitor and manage the model for it to be useful. • Business Intelligence teams should build reports/dashboards displaying model results. • Business teams (e.g. operations, merchandising) or customers should use model outputs. • Data Science teams must update model at appropriate cadence (for example, monthly).
  • 42. 42 Business Intelligence Team The business intelligence team monitors how model predictions are changing over time. • Monitor model usage and output. • Develop key performance indicators and dashboards. • Visualizations should align with the needs of the functional teams (for example, operations) that will be using the model.
  • 43. 43 Functional Business Team Use dashboards to improve decision making. • For example: marketing team could use churn model to decide when to reach out to groups of customers. • For example: operations team could use model to predict shipping/logistics times.
  • 44. 44 Data Science Team Monitor model to ensure continued validity and usefulness. • For example: changes in marketing may lead to different types of customers coming in the door, making the old model of customer behavior less accurate. • Depending on how fast the business conditions are changing, the data science team should update model with appropriate frequency. • Retraining models with new data.
  • 45. 45 Learning Objectives Recap In this lesson, we worked to: • Identify the steps in the data science workflow • Identify the key roles and skill sets within the field of AI • Describe the different ways to structure an AI team • Identify common data science misconceptions • Identify the components of model maintenance after deployment
  • 46.
  • 47. Sources for images used in this presentation (listed by slide number) 20 https://www.pexels.com/photo/adult-brainstorming-business-chair-515167/ 22 https://www.pexels.com/photo/blue-screen-of-death-in-silver-black-laptop-177598/ 31 https://www.pexels.com/photo/ground-group-growth-hands-461049/ 33 https://www.pexels.com/photo/yellow-blue-and-red-plastic-gameboard-toy-209712/ 34 https://www.pexels.com/photo/adult-agreement-beard-beverage-618550/ 38 https://www.pexels.com/photo/blue-and-green-pie-chart-97080/ 39 https://www.pexels.com/photo/arrows-box-business-chalk-533189/

Notas del editor

  1. Walk through each step in the workflow. Highlight that we’ll be going into more depth in the various steps throughout the course. Segway into the next slide with the fact that there are a wide range of skills needed throughout this workflow