SlideShare una empresa de Scribd logo
1 de 21
QUALITY • ANALYTICS • PERFORMANCE
Machine Learning At Work
QUALITY • ANALYTICS • PERFORMANCE
December 6, 2017
Prepared for Data Science Event
2
Introduction
The Stealth Media – Media Advertisement Startup Agency on
Facebook
Clients – 1800Dentist, FIJI Water, FabFitFun, Wonderful Company,
and etc
Role at the company – Data Analyst & Jack-of-all-trades
Banking & Quantitative Solutions LLC – Founder/Data Scientist of a
Data Analytics Startup
Main Project – Building AI machines and Recommendation
systems
Current Company:
Previous Company:
3
Definitions & The Objective
How to reduce state aids when maximizing clicks that lead to
conversions?
Is there a correlation between clicks and state aids?
If there is a correlation between the two, what can we do to
optimize the situation?
Clicks – a number of times that a user clicks on a specific facebook
advertisement.
State Aids – a number that shows a given conversion received an
aid from the State where the conversion occurred.
Conversion – a number of purchase
The Objective for this client:
Definitions:
4
Collecting & Compiling Data
Each element of data contains year, month, and day information
besides media information so data can easily be organized,
compiled, or downloaded by year, month, or day.
For the purpose of this presentation, a portion of data was
extracted from the database in csv form.
Data is collected from multiple sources: Facebook and 3rd party
pixel recording softwares.
Once Data iscollected from multiple sources, it is uploaded in our
database (MySql).
Collection:
Compilation:
5
Tidying Data
In a simple phrase, data preprocessing means data cleansing and
normalizing so that it can produce an accurate analysis.
Preprocessing:
Example Coding:
6
Tidying Data (Continued)
Example Coding:
7
Exploratory Data Analysis
There is a moderate to high correlation between the two by year.
Visiual Analysis by Year:
8
Exploratory Data Analysis (Continued)
There is a high correlation between the two by gender.
Visiual Analysis by Gender:
9
Exploratory Data Analysis (Continued)
There is a high correlation between the two by location.
Visiual Analysis by Location:
10
Exploratory Data Analysis (Continued)
Linear Regression – As we saw from the visual analyses, variables
such as gender and year did not affect the graphs too much. Now,
we need to find which states are affected by state aids the most.
Linear Regression
Clicks ~ Location
11
Exploratory Data Analysis (Continued)
California, 5-state states, and Standard states seem to be affected by state
aids the most.
State Aids ~ Location
12
Data Partition
Training set is used to train the selected model: LM & XGB.
Normally, 70% of the data are chosen to be a training set and 30%
become a test set. A training set can be used over and over but a
test set can only be used once to avoid over-fitting.
Use the createDataPartition function to partition the data into 70%
training and 30% test sets.
Caret Package:
Training vs. Test Sets
13
Definitions
Regression – Output variable takes continuous values
Classification – Output variable takes class labels
Supervised Learning – All data is labeled and algorithms are used
to predict the output from the input data.
Unsupervised Learning – All data is not labeled and algorithms are
used to learn inherent structure from the input data.
Supervised vs. Unsupervised Learnings
Regression vs. Classification
14
Machine Learning Chart
15
Machine Learning (Part 1 – Speed)
You delete more features as you train the model. The accuracy should
increase when the test set is fed into the trained model.
The last column shows the
predicted values.
16
Machine Learning (Part 1 – Speed Continued)
The linear regression is very quick to calculate however it seems that its
accuracy is not that great.
17
Machine Learning (Part 2 – Accuracy)
One-hot encoding – A method of converting categorical variables
into columns of binary variables so that XGBoost model can
process them.
Extreme Gradient Boosting for Regression (XGB)
18
Machine Learning (Part 2 – Accuracy Continued)
Extreme Gradient Boosting for Regression (XGB)
19
Machine Learning (Part 2 – Accuracy Continued)
Extreme Gradient Boosting for Regression (XGB)
20
Outcome & Conclusion
We shut down some of the high performing ads in each of those 3
regions as soon as we got an alert from our AI machine and
focused on other regions. It greatly limited the state aid reception
by the client and optimized the state aid and click ratio.
What the machine learning did:
This did not necessarily increase our profit but it definitely
prolonged our contract with the company that we worked with as
their pure sales went up.
Thank You!

Más contenido relacionado

Similar a Practical Machine Learning at Work

Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study csandit
 
Going Big : Why Companies Need to Focus on Operational Analytics
Going Big : Why Companies Need to Focus on Operational Analytics Going Big : Why Companies Need to Focus on Operational Analytics
Going Big : Why Companies Need to Focus on Operational Analytics Capgemini
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Simplilearn
 
Stock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised LearningStock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised LearningSharvil Katariya
 
SPC (Statistical Process Control) concepts in forecasting
SPC (Statistical Process Control) concepts in forecastingSPC (Statistical Process Control) concepts in forecasting
SPC (Statistical Process Control) concepts in forecastingCharles Novak
 
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)Nesma
 
The importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and OgilvieThe importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and OgilvieHarold van Heeringen
 
What is Oracle Demantra
What is Oracle Demantra What is Oracle Demantra
What is Oracle Demantra Amit Sharma
 
Yimei chan's data analysis work
Yimei chan's data analysis workYimei chan's data analysis work
Yimei chan's data analysis workyimeichan
 
Earned Value Management Meets Big Data
Earned Value Management Meets Big DataEarned Value Management Meets Big Data
Earned Value Management Meets Big DataGlen Alleman
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - ReportAkanksha Gohil
 
3G War: Who will be the winner?
3G War: Who will be the winner?3G War: Who will be the winner?
3G War: Who will be the winner?Alok Singh
 
DA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solutionDA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solutiongitikasingh2004
 
Applying Machine Learning for Mobile Games by Neil Patrick Del Gallego
Applying Machine Learning for Mobile Games by Neil Patrick Del GallegoApplying Machine Learning for Mobile Games by Neil Patrick Del Gallego
Applying Machine Learning for Mobile Games by Neil Patrick Del GallegoDEVCON
 
#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...
#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...
#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...Agile Testing Alliance
 
6 levels of big data analytics applications
6 levels of big data analytics applications6 levels of big data analytics applications
6 levels of big data analytics applicationspanoratio
 
MIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_BhatiaMIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_BhatiaRahul Bhatia
 
Better Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsBetter Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsProduct School
 
Black_Friday_Sales_Trushita
Black_Friday_Sales_TrushitaBlack_Friday_Sales_Trushita
Black_Friday_Sales_TrushitaTrushita Redij
 

Similar a Practical Machine Learning at Work (20)

Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
 
Going Big : Why Companies Need to Focus on Operational Analytics
Going Big : Why Companies Need to Focus on Operational Analytics Going Big : Why Companies Need to Focus on Operational Analytics
Going Big : Why Companies Need to Focus on Operational Analytics
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
 
Stock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised LearningStock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised Learning
 
SPC (Statistical Process Control) concepts in forecasting
SPC (Statistical Process Control) concepts in forecastingSPC (Statistical Process Control) concepts in forecasting
SPC (Statistical Process Control) concepts in forecasting
 
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)
 
The importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and OgilvieThe importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and Ogilvie
 
What is Oracle Demantra
What is Oracle Demantra What is Oracle Demantra
What is Oracle Demantra
 
Yimei chan's data analysis work
Yimei chan's data analysis workYimei chan's data analysis work
Yimei chan's data analysis work
 
Earned Value Management Meets Big Data
Earned Value Management Meets Big DataEarned Value Management Meets Big Data
Earned Value Management Meets Big Data
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - Report
 
3G War: Who will be the winner?
3G War: Who will be the winner?3G War: Who will be the winner?
3G War: Who will be the winner?
 
DA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solutionDA ST-1 SET-B-Solution.pdf we also provide the many type of solution
DA ST-1 SET-B-Solution.pdf we also provide the many type of solution
 
Applying Machine Learning for Mobile Games by Neil Patrick Del Gallego
Applying Machine Learning for Mobile Games by Neil Patrick Del GallegoApplying Machine Learning for Mobile Games by Neil Patrick Del Gallego
Applying Machine Learning for Mobile Games by Neil Patrick Del Gallego
 
Using a Big Data approach to adapt Market Strategies
Using a Big Data approach to adapt Market StrategiesUsing a Big Data approach to adapt Market Strategies
Using a Big Data approach to adapt Market Strategies
 
#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...
#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...
#ATAGTR2021 Presentation : "Unlocking the Power of Machine Learning in the Mo...
 
6 levels of big data analytics applications
6 levels of big data analytics applications6 levels of big data analytics applications
6 levels of big data analytics applications
 
MIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_BhatiaMIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_Bhatia
 
Better Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsBetter Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data Decisions
 
Black_Friday_Sales_Trushita
Black_Friday_Sales_TrushitaBlack_Friday_Sales_Trushita
Black_Friday_Sales_Trushita
 

Más de IDEAS - Int'l Data Engineering and Science Association

Más de IDEAS - Int'l Data Engineering and Science Association (20)

How to deliver effective data science projects
How to deliver effective data science projectsHow to deliver effective data science projects
How to deliver effective data science projects
 
Digital cracks in banking--Sid Nandi
Digital cracks in banking--Sid NandiDigital cracks in banking--Sid Nandi
Digital cracks in banking--Sid Nandi
 
“Full Stack” Data Science with R for Startups: Production-ready with Open-Sou...
“Full Stack” Data Science with R for Startups: Production-ready with Open-Sou...“Full Stack” Data Science with R for Startups: Production-ready with Open-Sou...
“Full Stack” Data Science with R for Startups: Production-ready with Open-Sou...
 
Battling Skynet: The Role of Humanity in Artificial Intelligence
Battling Skynet: The Role of Humanity in Artificial IntelligenceBattling Skynet: The Role of Humanity in Artificial Intelligence
Battling Skynet: The Role of Humanity in Artificial Intelligence
 
Implementing Artificial Intelligence with Big Data
Implementing Artificial Intelligence with Big DataImplementing Artificial Intelligence with Big Data
Implementing Artificial Intelligence with Big Data
 
Data Architecture (i.e., normalization / relational algebra) and Database Sec...
Data Architecture (i.e., normalization / relational algebra) and Database Sec...Data Architecture (i.e., normalization / relational algebra) and Database Sec...
Data Architecture (i.e., normalization / relational algebra) and Database Sec...
 
Blockchain Application in Real Estate Transactions
Blockchain Application in Real Estate TransactionsBlockchain Application in Real Estate Transactions
Blockchain Application in Real Estate Transactions
 
Learning to learn Model Behavior: How to use "human-in-the-loop" to explain d...
Learning to learn Model Behavior: How to use "human-in-the-loop" to explain d...Learning to learn Model Behavior: How to use "human-in-the-loop" to explain d...
Learning to learn Model Behavior: How to use "human-in-the-loop" to explain d...
 
Artificial Intelligence: Hype, Reality, Vision.
Artificial Intelligence: Hype, Reality, Vision.Artificial Intelligence: Hype, Reality, Vision.
Artificial Intelligence: Hype, Reality, Vision.
 
Operationalizing your Data Lake: Get Ready for Advanced Analytics
Operationalizing your Data Lake: Get Ready for Advanced AnalyticsOperationalizing your Data Lake: Get Ready for Advanced Analytics
Operationalizing your Data Lake: Get Ready for Advanced Analytics
 
Introduction to Deep Reinforcement Learning
Introduction to Deep Reinforcement LearningIntroduction to Deep Reinforcement Learning
Introduction to Deep Reinforcement Learning
 
Best Practices in Data Partnerships Between Mayor's Office and Academia
Best Practices in Data Partnerships Between Mayor's Office and AcademiaBest Practices in Data Partnerships Between Mayor's Office and Academia
Best Practices in Data Partnerships Between Mayor's Office and Academia
 
Everything You Wish You Knew About Search
Everything You Wish You Knew About SearchEverything You Wish You Knew About Search
Everything You Wish You Knew About Search
 
AliMe Bot Platform Technical Practice - Alibaba`s Personal Intelligent Assist...
AliMe Bot Platform Technical Practice - Alibaba`s Personal Intelligent Assist...AliMe Bot Platform Technical Practice - Alibaba`s Personal Intelligent Assist...
AliMe Bot Platform Technical Practice - Alibaba`s Personal Intelligent Assist...
 
Data-Driven AI for Entertainment and Healthcare
Data-Driven AI for Entertainment and HealthcareData-Driven AI for Entertainment and Healthcare
Data-Driven AI for Entertainment and Healthcare
 
Generating Creative Works with AI
Generating Creative Works with AIGenerating Creative Works with AI
Generating Creative Works with AI
 
Using AI to Tackle the Future of Health Care Data
Using AI to Tackle the Future of Health Care DataUsing AI to Tackle the Future of Health Care Data
Using AI to Tackle the Future of Health Care Data
 
State of AI/ML in Real Estate
State of AI/ML in Real EstateState of AI/ML in Real Estate
State of AI/ML in Real Estate
 
Hot Dog, Not Hot Dog! Generate new training data without taking more photos.
Hot Dog, Not Hot Dog! Generate new training data without taking more photos.Hot Dog, Not Hot Dog! Generate new training data without taking more photos.
Hot Dog, Not Hot Dog! Generate new training data without taking more photos.
 
Machine Learning in Healthcare and Life Science
Machine Learning in Healthcare and Life ScienceMachine Learning in Healthcare and Life Science
Machine Learning in Healthcare and Life Science
 

Último

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 

Último (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Practical Machine Learning at Work

  • 1. QUALITY • ANALYTICS • PERFORMANCE Machine Learning At Work QUALITY • ANALYTICS • PERFORMANCE December 6, 2017 Prepared for Data Science Event
  • 2. 2 Introduction The Stealth Media – Media Advertisement Startup Agency on Facebook Clients – 1800Dentist, FIJI Water, FabFitFun, Wonderful Company, and etc Role at the company – Data Analyst & Jack-of-all-trades Banking & Quantitative Solutions LLC – Founder/Data Scientist of a Data Analytics Startup Main Project – Building AI machines and Recommendation systems Current Company: Previous Company:
  • 3. 3 Definitions & The Objective How to reduce state aids when maximizing clicks that lead to conversions? Is there a correlation between clicks and state aids? If there is a correlation between the two, what can we do to optimize the situation? Clicks – a number of times that a user clicks on a specific facebook advertisement. State Aids – a number that shows a given conversion received an aid from the State where the conversion occurred. Conversion – a number of purchase The Objective for this client: Definitions:
  • 4. 4 Collecting & Compiling Data Each element of data contains year, month, and day information besides media information so data can easily be organized, compiled, or downloaded by year, month, or day. For the purpose of this presentation, a portion of data was extracted from the database in csv form. Data is collected from multiple sources: Facebook and 3rd party pixel recording softwares. Once Data iscollected from multiple sources, it is uploaded in our database (MySql). Collection: Compilation:
  • 5. 5 Tidying Data In a simple phrase, data preprocessing means data cleansing and normalizing so that it can produce an accurate analysis. Preprocessing: Example Coding:
  • 7. 7 Exploratory Data Analysis There is a moderate to high correlation between the two by year. Visiual Analysis by Year:
  • 8. 8 Exploratory Data Analysis (Continued) There is a high correlation between the two by gender. Visiual Analysis by Gender:
  • 9. 9 Exploratory Data Analysis (Continued) There is a high correlation between the two by location. Visiual Analysis by Location:
  • 10. 10 Exploratory Data Analysis (Continued) Linear Regression – As we saw from the visual analyses, variables such as gender and year did not affect the graphs too much. Now, we need to find which states are affected by state aids the most. Linear Regression Clicks ~ Location
  • 11. 11 Exploratory Data Analysis (Continued) California, 5-state states, and Standard states seem to be affected by state aids the most. State Aids ~ Location
  • 12. 12 Data Partition Training set is used to train the selected model: LM & XGB. Normally, 70% of the data are chosen to be a training set and 30% become a test set. A training set can be used over and over but a test set can only be used once to avoid over-fitting. Use the createDataPartition function to partition the data into 70% training and 30% test sets. Caret Package: Training vs. Test Sets
  • 13. 13 Definitions Regression – Output variable takes continuous values Classification – Output variable takes class labels Supervised Learning – All data is labeled and algorithms are used to predict the output from the input data. Unsupervised Learning – All data is not labeled and algorithms are used to learn inherent structure from the input data. Supervised vs. Unsupervised Learnings Regression vs. Classification
  • 15. 15 Machine Learning (Part 1 – Speed) You delete more features as you train the model. The accuracy should increase when the test set is fed into the trained model. The last column shows the predicted values.
  • 16. 16 Machine Learning (Part 1 – Speed Continued) The linear regression is very quick to calculate however it seems that its accuracy is not that great.
  • 17. 17 Machine Learning (Part 2 – Accuracy) One-hot encoding – A method of converting categorical variables into columns of binary variables so that XGBoost model can process them. Extreme Gradient Boosting for Regression (XGB)
  • 18. 18 Machine Learning (Part 2 – Accuracy Continued) Extreme Gradient Boosting for Regression (XGB)
  • 19. 19 Machine Learning (Part 2 – Accuracy Continued) Extreme Gradient Boosting for Regression (XGB)
  • 20. 20 Outcome & Conclusion We shut down some of the high performing ads in each of those 3 regions as soon as we got an alert from our AI machine and focused on other regions. It greatly limited the state aid reception by the client and optimized the state aid and click ratio. What the machine learning did: This did not necessarily increase our profit but it definitely prolonged our contract with the company that we worked with as their pure sales went up.