SlideShare una empresa de Scribd logo
1 de 14
Descargar para leer sin conexión
Predictive Vehicle Inspection
Matous Havlena
matous@havlena.net
Tim Ojo
timmyojo@gmail.com
Akin Alao
alaoraufu@yahoo.co.uk
Project Charter
Evaluate the feasibility of using Big Data analytics solutions for
Manufacturing to solve the problem of Predictive Vehicle
Inspection:
● Analyzing vehicle production history to predict car inspection
failures from the production line.
● Production shifts, specific employee, and other factors
The two Big Data Analytics solutions to be evaluated:
● IBM BigInsights
● Datameer 2.1
Approach & Proposed Solution
● Recognized the problem as a classification problem
similar to credit scoring or fraud detection.
● Classification is the problem of identifying to which of a
set of categories a new observation belongs, on the basis
of a training set of data containing observations whose
category membership is known.
● Build a predictive model based on machine learning
classification (supervised learning) to identify whether a
vehicle can be classified as good (passes quality check
on 1st try) or bad (fails quality check on 1st try)
Proposed Solutions - Tools
● BigInsights + SPSS Modeler
○ Hadoop is used to store big data and execute data
processing jobs in an efficient and distributed
fashion. IBM provides BigInsights as a management
and operational interface to simplify working with
Hadoop without doing much coding.
○ SPSS Modeler is a data analytics workbench that
allows the user to build predictive models by
leveraging built in algorithms and functions without
the need for programming
Proposed Solutions - Tools
● Datameer
○ Like BigInsights, Datameer Analytics Solution presents a
web based spreadsheet interface on top of a Hadoop
cluster and provides analytics functions and
visualizations out of the box without the need for writing
code.
○ DAS also has a Smart Analytics suite. One of the tools
available in that suite is a decision tree model which is a
descriptive model that can identify important factors that
affect quality.
○ Datameer can also be extended to run predictive models
created in R, SAS, SPSS, etc.
IBM Solution Architecture
SPSS Modeler
Client (only
Windows)

SPSS Modeler
Server (multiplatform)

SPSS Analytic Server
● allows analysts to do predictive analytics over big
data
● data centric architecture ensures scalability and
performance
SPSS Analytic Catalyst
● automatically discovers statistically interesting
relationships in data
● close the analytic specialist gap
● good in early discovery dataset stage (helps to
focus on important parts)
● automate some parts of CRISP-DM

SPSS Analytic
Server
(multiplatform)

SPSS Analytic
Catalyst

Hadoop
(BigInsights)
Prediction in SPSS Modeler

425 predictors
85.4% accuracy
(on the training dataset)
Model Outcome
Original value | Predicted value | Confidence
Predictor Importance
c5.0 Algorithm
● C5.o is an algorithm used to generate a decision tree
which can be used for classification therefore it is often
referred to as a statistical classifier
● A C5.0 model works by splitting the sample based on the
field that provides the maximum information gain. Each
subsample defined by the first split is then split again,
usually based on a different field, and the process
repeats until the subsamples cannot be split any further.
Finally, the lowest-level splits are reexamined, and those
that do not contribute significantly to the value of the
model are removed or pruned.
c5.0 Algorithm
● C5.0 models are quite robust in the presence of
problems such as missing data and large numbers of
input fields.
● They usually do not require long training times to
create. Because of the algorithm’s recursive nature it can
benefit from parallel processing.
● C5.0 offers the boosting method to increase accuracy of
classification
Datameer Analysis
● As previously mentioned Datameer has some built in
advanced analytics tools but most of them are in the
descriptive analytics area. The sole predictive analytics
tool they have is a specialized recommendation engine.
● Datameer can be extended to include predictive models
generated in tools like R, SAS, SPSS, etc. These take the
form of functions in DAS similar to the concept of
functions in Excel.
○ The disadvantage of this approach is that the hard work
of building the model is done without the support of big
data
○ Another disadvantage is the lack of tight integration that
is present in the IBM solution however you do get the
freedom to use any tool
Project Challenges & Opportunities
● Data understanding and formatting
● Time constraints
● More interaction with people on the ground
● More predictor data (diverse dataset is a key!)
○ Plant environment (temperature, humidity,
pressure)
○ Specific employees
○ Supplier & parts data
○ Warranty data
Questions?
Matous Havlena
matous@havlena.net
Tim Ojo
timmyojo@gmail.com
Akin Alao
alaoraufu@yahoo.co.uk

Más contenido relacionado

La actualidad más candente

Tesla Motors Presentation 2021
Tesla Motors Presentation 2021Tesla Motors Presentation 2021
Tesla Motors Presentation 2021JonMaker
 
ELON MUSK'S TESLA
ELON MUSK'S TESLA ELON MUSK'S TESLA
ELON MUSK'S TESLA Simran Singh
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Edureka!
 
Bringing AI to Business Intelligence
Bringing AI to Business IntelligenceBringing AI to Business Intelligence
Bringing AI to Business IntelligenceSi Krishan
 
Tesla Strategy
Tesla StrategyTesla Strategy
Tesla StrategyJoe Baker
 
Darden School of Business Tesla Strategic Analysis
Darden School of Business   Tesla Strategic AnalysisDarden School of Business   Tesla Strategic Analysis
Darden School of Business Tesla Strategic AnalysisJosé Ángel Álvarez Fuente
 
LLM presentation final
LLM presentation finalLLM presentation final
LLM presentation finalRuth Griffin
 
Tesla Company Presentation
Tesla Company PresentationTesla Company Presentation
Tesla Company PresentationNicholasNoles
 
AI in Manufacturing - John.pdf
AI in Manufacturing - John.pdfAI in Manufacturing - John.pdf
AI in Manufacturing - John.pdfJohn Chang
 
Automated Machine Learning
Automated Machine LearningAutomated Machine Learning
Automated Machine LearningYuriy Guts
 
Business analytics in the automobile sector
Business analytics in the automobile sectorBusiness analytics in the automobile sector
Business analytics in the automobile sectorTejusN1
 
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...SlideTeam
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science clubData Science Club
 
Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023HyunJoon Jung
 
Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)
Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)
Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)RGupta16
 

La actualidad más candente (20)

Tesla Motors Presentation 2021
Tesla Motors Presentation 2021Tesla Motors Presentation 2021
Tesla Motors Presentation 2021
 
ELON MUSK'S TESLA
ELON MUSK'S TESLA ELON MUSK'S TESLA
ELON MUSK'S TESLA
 
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
 
Bringing AI to Business Intelligence
Bringing AI to Business IntelligenceBringing AI to Business Intelligence
Bringing AI to Business Intelligence
 
Tesla Strategy
Tesla StrategyTesla Strategy
Tesla Strategy
 
Darden School of Business Tesla Strategic Analysis
Darden School of Business   Tesla Strategic AnalysisDarden School of Business   Tesla Strategic Analysis
Darden School of Business Tesla Strategic Analysis
 
Tesla
TeslaTesla
Tesla
 
LLM presentation final
LLM presentation finalLLM presentation final
LLM presentation final
 
Tesla Company Presentation
Tesla Company PresentationTesla Company Presentation
Tesla Company Presentation
 
AI in Manufacturing - John.pdf
AI in Manufacturing - John.pdfAI in Manufacturing - John.pdf
AI in Manufacturing - John.pdf
 
Tesla Motors
Tesla MotorsTesla Motors
Tesla Motors
 
Automated Machine Learning
Automated Machine LearningAutomated Machine Learning
Automated Machine Learning
 
Tesla
TeslaTesla
Tesla
 
Business analytics in the automobile sector
Business analytics in the automobile sectorBusiness analytics in the automobile sector
Business analytics in the automobile sector
 
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
 
Tesla
TeslaTesla
Tesla
 
Tesla cross border strategy11 12_2015_final
Tesla cross border strategy11 12_2015_finalTesla cross border strategy11 12_2015_final
Tesla cross border strategy11 12_2015_final
 
Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023
 
Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)
Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)
Elon Musk & Tesla (7 p's, Gale of creative destruction, Big idea)
 

Destacado

Sample SOP For MS in Business Analytics
Sample SOP For MS in Business AnalyticsSample SOP For MS in Business Analytics
Sample SOP For MS in Business AnalyticsSOP MBA
 
Functional Programming Fundamentals
Functional Programming FundamentalsFunctional Programming Fundamentals
Functional Programming FundamentalsShahriar Hyder
 
Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011Milind Bhandarkar
 
Lambda Calculus by Dustin Mulcahey
Lambda Calculus by Dustin Mulcahey Lambda Calculus by Dustin Mulcahey
Lambda Calculus by Dustin Mulcahey Hakka Labs
 
Interactive Scientific Image Analysis using Spark
Interactive Scientific Image Analysis using SparkInteractive Scientific Image Analysis using Spark
Interactive Scientific Image Analysis using SparkKevin Mader
 
Functional programming
Functional programmingFunctional programming
Functional programmingedusmildo
 
Machine Learning with Apache Mahout
Machine Learning with Apache MahoutMachine Learning with Apache Mahout
Machine Learning with Apache MahoutDaniel Glauser
 
Functional Programming in JavaScript by Luis Atencio
Functional Programming in JavaScript by Luis AtencioFunctional Programming in JavaScript by Luis Atencio
Functional Programming in JavaScript by Luis AtencioLuis Atencio
 
The Lambda Calculus and The JavaScript
The Lambda Calculus and The JavaScriptThe Lambda Calculus and The JavaScript
The Lambda Calculus and The JavaScriptNorman Richards
 
Functional programming
Functional programmingFunctional programming
Functional programmingPrateek Jain
 
Functional programming ii
Functional programming iiFunctional programming ii
Functional programming iiPrashant Kalkar
 
Introduction to Functional Programming in JavaScript
Introduction to Functional Programming in JavaScriptIntroduction to Functional Programming in JavaScript
Introduction to Functional Programming in JavaScripttmont
 

Destacado (12)

Sample SOP For MS in Business Analytics
Sample SOP For MS in Business AnalyticsSample SOP For MS in Business Analytics
Sample SOP For MS in Business Analytics
 
Functional Programming Fundamentals
Functional Programming FundamentalsFunctional Programming Fundamentals
Functional Programming Fundamentals
 
Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011
 
Lambda Calculus by Dustin Mulcahey
Lambda Calculus by Dustin Mulcahey Lambda Calculus by Dustin Mulcahey
Lambda Calculus by Dustin Mulcahey
 
Interactive Scientific Image Analysis using Spark
Interactive Scientific Image Analysis using SparkInteractive Scientific Image Analysis using Spark
Interactive Scientific Image Analysis using Spark
 
Functional programming
Functional programmingFunctional programming
Functional programming
 
Machine Learning with Apache Mahout
Machine Learning with Apache MahoutMachine Learning with Apache Mahout
Machine Learning with Apache Mahout
 
Functional Programming in JavaScript by Luis Atencio
Functional Programming in JavaScript by Luis AtencioFunctional Programming in JavaScript by Luis Atencio
Functional Programming in JavaScript by Luis Atencio
 
The Lambda Calculus and The JavaScript
The Lambda Calculus and The JavaScriptThe Lambda Calculus and The JavaScript
The Lambda Calculus and The JavaScript
 
Functional programming
Functional programmingFunctional programming
Functional programming
 
Functional programming ii
Functional programming iiFunctional programming ii
Functional programming ii
 
Introduction to Functional Programming in JavaScript
Introduction to Functional Programming in JavaScriptIntroduction to Functional Programming in JavaScript
Introduction to Functional Programming in JavaScript
 

Similar a Predictive Analytics Project in Automotive Industry

MOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCMOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCgdgsurrey
 
A Machine learning based framework for Verification and Validation of Massive...
A Machine learning based framework for Verification and Validation of Massive...A Machine learning based framework for Verification and Validation of Massive...
A Machine learning based framework for Verification and Validation of Massive...IRJET Journal
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...Daniel Zivkovic
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxCarolineRebeccaD
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentationNeerajNishad4
 
Microsoft_Databricks Datathon - Submission Deck TEMPLATE.pptx
Microsoft_Databricks Datathon - Submission Deck TEMPLATE.pptxMicrosoft_Databricks Datathon - Submission Deck TEMPLATE.pptx
Microsoft_Databricks Datathon - Submission Deck TEMPLATE.pptxAbdoulaye DOUCOURE
 
Choosing The Right Data Annotation Option: Pros And Cons
Choosing The Right Data Annotation Option: Pros And ConsChoosing The Right Data Annotation Option: Pros And Cons
Choosing The Right Data Annotation Option: Pros And ConsArnav Malhotra
 
Practical data science
Practical data sciencePractical data science
Practical data scienceDing Li
 
Bhadale group of companies data science project methodologies catalogue
Bhadale group of companies data science project methodologies catalogueBhadale group of companies data science project methodologies catalogue
Bhadale group of companies data science project methodologies catalogueVijayananda Mohire
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Ali Alkan
 

Similar a Predictive Analytics Project in Automotive Industry (20)

Demystifying Data Science
Demystifying Data ScienceDemystifying Data Science
Demystifying Data Science
 
MOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCMOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDC
 
A Machine learning based framework for Verification and Validation of Massive...
A Machine learning based framework for Verification and Validation of Massive...A Machine learning based framework for Verification and Validation of Massive...
A Machine learning based framework for Verification and Validation of Massive...
 
Python and data analytics
Python and data analyticsPython and data analytics
Python and data analytics
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
 
MLOps.pptx
MLOps.pptxMLOps.pptx
MLOps.pptx
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptx
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentation
 
Aws autopilot
Aws autopilotAws autopilot
Aws autopilot
 
Microsoft_Databricks Datathon - Submission Deck TEMPLATE.pptx
Microsoft_Databricks Datathon - Submission Deck TEMPLATE.pptxMicrosoft_Databricks Datathon - Submission Deck TEMPLATE.pptx
Microsoft_Databricks Datathon - Submission Deck TEMPLATE.pptx
 
Choosing The Right Data Annotation Option: Pros And Cons
Choosing The Right Data Annotation Option: Pros And ConsChoosing The Right Data Annotation Option: Pros And Cons
Choosing The Right Data Annotation Option: Pros And Cons
 
Practical data science
Practical data sciencePractical data science
Practical data science
 
Bhadale group of companies data science project methodologies catalogue
Bhadale group of companies data science project methodologies catalogueBhadale group of companies data science project methodologies catalogue
Bhadale group of companies data science project methodologies catalogue
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Ibm watson
Ibm watsonIbm watson
Ibm watson
 
DS Life Cycle
DS Life CycleDS Life Cycle
DS Life Cycle
 
DS Life Cycle
DS Life CycleDS Life Cycle
DS Life Cycle
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
 

Más de Matouš Havlena

Más de Matouš Havlena (6)

Data warehousing
Data warehousingData warehousing
Data warehousing
 
Predictive Analytics [UTC]
Predictive Analytics [UTC]Predictive Analytics [UTC]
Predictive Analytics [UTC]
 
Big Data Analytics [UTC]
Big Data Analytics [UTC]Big Data Analytics [UTC]
Big Data Analytics [UTC]
 
Koucink [MUNI]
Koucink [MUNI]Koucink [MUNI]
Koucink [MUNI]
 
Agile requirementspraguefinal
Agile requirementspraguefinalAgile requirementspraguefinal
Agile requirementspraguefinal
 
Presentation IBM Rational AppScan
Presentation IBM Rational AppScanPresentation IBM Rational AppScan
Presentation IBM Rational AppScan
 

Último

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 

Último (20)

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Predictive Analytics Project in Automotive Industry

  • 1. Predictive Vehicle Inspection Matous Havlena matous@havlena.net Tim Ojo timmyojo@gmail.com Akin Alao alaoraufu@yahoo.co.uk
  • 2. Project Charter Evaluate the feasibility of using Big Data analytics solutions for Manufacturing to solve the problem of Predictive Vehicle Inspection: ● Analyzing vehicle production history to predict car inspection failures from the production line. ● Production shifts, specific employee, and other factors The two Big Data Analytics solutions to be evaluated: ● IBM BigInsights ● Datameer 2.1
  • 3. Approach & Proposed Solution ● Recognized the problem as a classification problem similar to credit scoring or fraud detection. ● Classification is the problem of identifying to which of a set of categories a new observation belongs, on the basis of a training set of data containing observations whose category membership is known. ● Build a predictive model based on machine learning classification (supervised learning) to identify whether a vehicle can be classified as good (passes quality check on 1st try) or bad (fails quality check on 1st try)
  • 4. Proposed Solutions - Tools ● BigInsights + SPSS Modeler ○ Hadoop is used to store big data and execute data processing jobs in an efficient and distributed fashion. IBM provides BigInsights as a management and operational interface to simplify working with Hadoop without doing much coding. ○ SPSS Modeler is a data analytics workbench that allows the user to build predictive models by leveraging built in algorithms and functions without the need for programming
  • 5. Proposed Solutions - Tools ● Datameer ○ Like BigInsights, Datameer Analytics Solution presents a web based spreadsheet interface on top of a Hadoop cluster and provides analytics functions and visualizations out of the box without the need for writing code. ○ DAS also has a Smart Analytics suite. One of the tools available in that suite is a decision tree model which is a descriptive model that can identify important factors that affect quality. ○ Datameer can also be extended to run predictive models created in R, SAS, SPSS, etc.
  • 6. IBM Solution Architecture SPSS Modeler Client (only Windows) SPSS Modeler Server (multiplatform) SPSS Analytic Server ● allows analysts to do predictive analytics over big data ● data centric architecture ensures scalability and performance SPSS Analytic Catalyst ● automatically discovers statistically interesting relationships in data ● close the analytic specialist gap ● good in early discovery dataset stage (helps to focus on important parts) ● automate some parts of CRISP-DM SPSS Analytic Server (multiplatform) SPSS Analytic Catalyst Hadoop (BigInsights)
  • 7. Prediction in SPSS Modeler 425 predictors 85.4% accuracy (on the training dataset)
  • 8. Model Outcome Original value | Predicted value | Confidence
  • 10. c5.0 Algorithm ● C5.o is an algorithm used to generate a decision tree which can be used for classification therefore it is often referred to as a statistical classifier ● A C5.0 model works by splitting the sample based on the field that provides the maximum information gain. Each subsample defined by the first split is then split again, usually based on a different field, and the process repeats until the subsamples cannot be split any further. Finally, the lowest-level splits are reexamined, and those that do not contribute significantly to the value of the model are removed or pruned.
  • 11. c5.0 Algorithm ● C5.0 models are quite robust in the presence of problems such as missing data and large numbers of input fields. ● They usually do not require long training times to create. Because of the algorithm’s recursive nature it can benefit from parallel processing. ● C5.0 offers the boosting method to increase accuracy of classification
  • 12. Datameer Analysis ● As previously mentioned Datameer has some built in advanced analytics tools but most of them are in the descriptive analytics area. The sole predictive analytics tool they have is a specialized recommendation engine. ● Datameer can be extended to include predictive models generated in tools like R, SAS, SPSS, etc. These take the form of functions in DAS similar to the concept of functions in Excel. ○ The disadvantage of this approach is that the hard work of building the model is done without the support of big data ○ Another disadvantage is the lack of tight integration that is present in the IBM solution however you do get the freedom to use any tool
  • 13. Project Challenges & Opportunities ● Data understanding and formatting ● Time constraints ● More interaction with people on the ground ● More predictor data (diverse dataset is a key!) ○ Plant environment (temperature, humidity, pressure) ○ Specific employees ○ Supplier & parts data ○ Warranty data