SlideShare una empresa de Scribd logo
1 de 31
Descargar para leer sin conexión
Machine	
  Learning	
  Introduc1on	
  
guodong@hulu.com	
  
	
  
Machine	
  learning	
  introduc0on	
  
Logis1c	
  regression	
  
Feature	
  selec1on	
  
Boos1ng,	
  tree	
  boos1ng	
  
	
  
See	
  more	
  ML	
  posts:	
  h>p://dongguo.me/	
  
	
  
Machine	
  Learning	
  Makes	
  Life	
  Be>er	
  
WHAT	
  IS	
  MACHINE	
  LEARNING?	
  
Learning	
  
•  What	
  is	
  learning	
  
–  Find	
  rules	
  from	
  data/experience	
  

•  Why	
  learning	
  is	
  possible	
  
–  Assume	
  rules	
  exist	
  in	
  this	
  world	
  

•  How	
  to	
  learn	
  
–  Induc1ve	
  
What	
  is	
  machine	
  learning	
  
•  “Machine	
  Learning	
  is	
  a	
  field	
  of	
  study	
  that	
  gives	
  
computers	
  the	
  ability	
  to	
  learn	
  without	
  being	
  
explicitly	
  programmed”	
  -­‐	
  Arthur	
  Samuel	
  (1959)	
  
•  Machine	
  learning	
  is	
  the	
  study	
  of	
  computer	
  
algorithms	
  that	
  improve	
  automa1cally	
  through	
  
experience”	
  –	
  Tom	
  Mitchell	
  (1998)	
  
Overview	
  of	
  machine	
  learning	
  
	
  	
  

Machine	
  Learning	
  

Unsupervised	
  
Learning	
  

Supervised	
  
Learning	
  

Classifica1on	
  

Semi-­‐supervised	
  
Learning	
  

Regression	
  
Outline	
  
•  Supervised	
  Learning	
  
•  Case	
  Study	
  
•  Challenge	
  
•  Resource	
  
Supervised	
  learning	
  
•  Concepts	
  
•  Defini1on	
  
•  Models	
  
•  Metrics	
  
•  Open	
  Ques1ons	
  
Concepts	
  
Problem	
  

	
  	
  

Generate	
  dataset	
  

Dataset	
  
Train	
  

Sample/instance	
  
Feature	
  vector	
  

label	
  
model	
  
Predict	
  

Test	
  

Model	
  Tuning	
  
Feature	
  selec0on	
  
What	
  is	
  Supervised	
  learning	
  
•  Find	
  a	
  func1on	
  (from	
  some	
  func1on	
  space)	
  to	
  
predict	
  for	
  unseen	
  instances,	
  from	
  the	
  labeled	
  
training	
  data	
  
–  Func1on	
  space:	
  determined	
  by	
  the	
  chosen	
  model	
  
–  Find	
  the	
  func1on:	
  minimize	
  error	
  on	
  training	
  data	
  with	
  
some	
  cost	
  func1on	
  

•  2	
  types:	
  Classifica1on	
  and	
  regression	
  
Formal	
  defini1on	
  
•  Given	
  a	
  training	
  dataset	
  

r
N
{xi , yi }i =1

•  And	
  define	
  a	
  loss	
  func1on	
  
∧

∧

L( y, y ), where y = f ( x)

•  Target	
  

∧

f ( x) =arg min G ( f ),
f

1
st. G ( f ) =
N

N

∑ L( y , f ( x ))
i =1

i

i
Models	
  for	
  supervised	
  learning	
  
•  Classifica1on	
  and	
  regression	
  
–  For	
  classifica1on:	
  LR(Logis1c	
  regression),	
  Naïve	
  Bayes	
  
–  For	
  regression:	
  linear	
  regression	
  
–  For	
  Both:	
  Trees,	
  KNN,	
  SVM,	
  ANN	
  

•  Genera1ve	
  and	
  Discrimina1ve	
  
–  Genera1ve:	
  Naïve	
  Bayes,	
  GMM,	
  HMM	
  
–  Discrimina1ve:	
  KNN,	
  LR,	
  SVM,	
  ANN,	
  Trees	
  

•  Parametric	
  and	
  nonparametric	
  
–  Parametric:	
  LR,	
  Naïve	
  Bayes,	
  ANN	
  
–  nonparametric:	
  Trees,	
  KNN,	
  kernel	
  methods	
  
Decision	
  Tree	
  
•  Would	
  you	
  like	
  to	
  date	
  somebody?	
  
Gender	
  
male	
  

female	
  
Good	
  
looking?	
  
Yes!	
  

Pass	
  

No!	
  

umm..	
  

Pass	
  

Others…	
  

Accept	
  

Very	
  good	
  
Accept	
  

else	
  
Pass	
  
K-­‐Nearest	
  Neighbor	
  classifier	
  
K=15	
  

K=1	
  
Naïve	
  Bayes	
  
•  Bayes	
  classifier	
  

•  Condi1onal	
  Independence	
  assump1on	
  
•  With	
  this	
  assump1on	
  
	
  
Logis1c	
  regression	
  
•  Logis1c	
  func1on	
  
	
  
	
  
Ar1ficial	
  neural	
  network	
  
Support	
  vector	
  machine	
  
Model	
  Inference	
  
•  Typical	
  inference	
  methods	
  
–  Gradient	
  descent	
  
–  Expecta1on	
  Maximiza1on	
  
–  Sampling	
  based	
  
Model	
  ensemble	
  
•  Averaging	
  or	
  vo1ng	
  output	
  of	
  mul1ply	
  classifiers	
  
•  Bagging	
  (bootstrap	
  aggrega1ng)	
  
–  Train	
  mul1ple	
  base	
  models	
  
–  Vote	
  mul1ply	
  base	
  classifiers	
  with	
  same	
  weight	
  
–  Improve	
  model	
  stability	
  and	
  avoid	
  overfihng	
  
–  Work	
  well	
  on	
  unstable	
  base	
  classifier	
  
•  Adaboost	
  (adap1ve	
  boos1ng)	
  
–  Sequen1al	
  base	
  classifiers	
  
–  Misclassified	
  instances	
  have	
  higher	
  weight	
  in	
  next	
  base	
  
classifier	
  
–  Weighted	
  vo1ng	
  
Evalua1on	
  metrics	
  
•  Common	
  Metrics	
  for	
  classifica1on	
  
–  Accuracy	
  
–  Precision-­‐Recall	
  
–  AUC	
  

•  For	
  regression	
  
–  Mean	
  absolute	
  error	
  (MAE)	
  
–  Mean	
  square	
  error	
  (MSE),	
  RMSE	
  
Ques1on1:	
  How	
  to	
  choose	
  a	
  suitable	
  model?	
  
Characteris0c	
  

Naïve	
  
Bayes	
  

Trees	
   K	
  Nearest	
  
neighbor	
  

Logis0c	
  
regression	
  

Neural	
  
SVM	
  
Networks	
  

Natural	
  handling	
  
data	
  of	
  “mixed”	
  
type	
  
Robustness	
  to	
  
outliers	
  in	
  input	
  
space	
  
Computa1onal	
  
scalability	
  
Interpretability	
  

1	
  

3	
  

1	
  

1	
  

1	
  

1	
  

	
  3	
  

3	
  

3	
  

3	
  	
  

1	
  

1	
  

3	
  

3	
  

1	
  

3	
  

1	
  

1	
  

	
  2	
  

2	
  	
  

1	
  

	
  2	
  

1	
  

1	
  

Predic1ve	
  power	
  

1	
  

1	
  

	
  3	
  

2	
  

3	
  

3	
  

<Elements	
  of	
  Sta-s-cal	
  Learning>	
  II	
  P351	
  	
  	
  
Ques1on2:	
  Can	
  we	
  find	
  a	
  100%	
  accurate	
  model?	
  	
  	
  
•  Expected	
  risk	
  

•  Empirical	
  risk	
  
•  Choose	
  a	
  family	
  	
  	
  	
  	
  for	
  candidate	
  predic1on	
  func1ons	
  	
  
•  Error	
  
Case	
  study:	
  Predic1ve	
  Demographic	
  
	
  	
  

Feature	
  extrac1on	
  (‘show’,	
  ‘ad	
  vote’,	
  ‘ad	
  
selec1on’)	
  
feature	
  analysis	
  (remove	
  ‘ad	
  selec1on’)	
  
Load	
  login	
  profile	
  

ML	
  problem?	
  What	
  kind?	
  	
  Labels?	
  
Evalua1on	
  metric?	
  
Possible	
  features?	
  (show,	
  ad	
  vote,	
  
ad	
  selec1on,	
  search…)	
  	
  Accessible?	
  	
  
	
  

Problem	
  

Dataset	
  genera1on	
  

Choose	
  a	
  Model	
  

1.  Familiar?	
  (NB,	
  ANN,	
  LR,	
  Tree,	
  SVM)	
  
2.  Computa1onal	
  cost?	
  Interpretability?	
  
Precision?	
  	
  
3.  Data:	
  amount?	
  noise	
  ra1o?	
  
	
  

Train	
  
Try	
  more	
  features(add	
  
‘OS’,	
  ‘browser’,	
  ‘flash’)	
  
Feature	
  selec1on	
  (remove	
  
‘flash’,	
  and	
  non	
  
anonymous	
  features)	
  

Predictor	
  

	
  

Try	
  more	
  models	
  

Tuning	
  
Evalua1on	
  (AUC,	
  
Precision-­‐recall)	
  

Test	
  

Challenges	
  

(Noise,	
  different	
  Join	
  distribu1on,	
  evalua1on)	
  	
  	
  
model	
  ensemble	
  

Predictor	
  on	
  product	
  

Scoring	
  

Online	
  Update	
  
Challenges	
  in	
  Machine	
  learning	
  
•  Data	
  
–  Sparse	
  data	
  in	
  high	
  dimensions	
  
–  Limited	
  labels	
  
	
  

•  Computa1on	
  Cost	
  
–  Speed	
  Up	
  advanced	
  models	
  
–  Paralleliza1on	
  

•  Applica1on	
  
–  Structured	
  predic1on	
  
Resource	
  
• 
• 
• 
• 

Conference	
  
Books	
  
Lectures	
  
Dataset	
  
Top	
  conference	
  
• 
• 
• 
• 
• 

ICML	
  
NIPS	
  
IJCAI/AAAI	
  
KDD	
  
Other	
  related	
  
–  WSDM,	
  WWW,	
  SIGIR,	
  CIKM,	
  ICDE,	
  ICDM	
  
Books	
  
• 
• 
• 
• 

Machine	
  Learning	
  [link]	
  	
  	
  by	
  Mitchell	
  
Pa-ern	
  Recogni0on	
  and	
  Machine	
  Learning	
  [link]	
  by	
  Bishop	
  
The	
  Elements	
  of	
  Sta0s0cal	
  Learning	
  [link]	
  
Scaling	
  Up	
  Machine	
  Learning	
  [link]	
  
Lectures	
  
•  Machine	
  Learning	
  open	
  class	
  –	
  by	
  Andrew	
  Ng	
  
–  Video	
  in	
  YouTube	
  

•  Advanced	
  topics	
  in	
  Machine	
  Learning	
  –	
  Cornell	
  
•  h>p://videolectures.net/	
  
Other	
  research	
  resource	
  
•  Research	
  Organs	
  
–  Yahoo	
  Research	
  [link]	
  
–  Google	
  Research	
  publica1ons	
  [link]	
  

•  Dataset	
  
–  UCI	
  machine	
  learning	
  Repository	
  [link]	
  
–  kaggle.com	
  
THANKS	
  

Más contenido relacionado

La actualidad más candente

Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models ananth
 
Presentazione Tesi Laurea Triennale in Informatica
Presentazione Tesi Laurea Triennale in InformaticaPresentazione Tesi Laurea Triennale in Informatica
Presentazione Tesi Laurea Triennale in InformaticaLuca Marignati
 
Feature Engineering & Selection
Feature Engineering & SelectionFeature Engineering & Selection
Feature Engineering & SelectionEng Teong Cheah
 
Presentation by Lionel Briand
Presentation by Lionel BriandPresentation by Lionel Briand
Presentation by Lionel BriandPtidej Team
 
Policy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionPolicy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionKishor Datta Gupta
 
Adversarial Input Detection Using Image Processing Techniques (IPT)
Adversarial Input Detection Using Image Processing Techniques (IPT)Adversarial Input Detection Using Image Processing Techniques (IPT)
Adversarial Input Detection Using Image Processing Techniques (IPT)Kishor Datta Gupta
 
Generative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variantsGenerative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variantsananth
 
Logistic Regression
Logistic RegressionLogistic Regression
Logistic RegressionDong Guo
 
Introduction To Applied Machine Learning
Introduction To Applied Machine LearningIntroduction To Applied Machine Learning
Introduction To Applied Machine Learningananth
 
Modeling & Simulation Lecture Notes
Modeling & Simulation Lecture NotesModeling & Simulation Lecture Notes
Modeling & Simulation Lecture NotesFellowBuddy.com
 
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshopJRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshopHannes Fassold
 
Automated Machine Learning
Automated Machine LearningAutomated Machine Learning
Automated Machine Learningsafa cimenli
 
Hyperparameter Tuning
Hyperparameter TuningHyperparameter Tuning
Hyperparameter TuningJon Lederman
 
Recommendation algorithm using reinforcement learning
Recommendation algorithm using reinforcement learningRecommendation algorithm using reinforcement learning
Recommendation algorithm using reinforcement learningArithmer Inc.
 
Deep Learning For Practitioners, lecture 2: Selecting the right applications...
Deep Learning For Practitioners,  lecture 2: Selecting the right applications...Deep Learning For Practitioners,  lecture 2: Selecting the right applications...
Deep Learning For Practitioners, lecture 2: Selecting the right applications...ananth
 
Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018HJ van Veen
 
Context-aware preference modeling with factorization
Context-aware preference modeling with factorizationContext-aware preference modeling with factorization
Context-aware preference modeling with factorizationBalázs Hidasi
 
Understanding Basics of Machine Learning
Understanding Basics of Machine LearningUnderstanding Basics of Machine Learning
Understanding Basics of Machine LearningPranav Ainavolu
 

La actualidad más candente (20)

Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models
 
Presentazione Tesi Laurea Triennale in Informatica
Presentazione Tesi Laurea Triennale in InformaticaPresentazione Tesi Laurea Triennale in Informatica
Presentazione Tesi Laurea Triennale in Informatica
 
Feature Engineering & Selection
Feature Engineering & SelectionFeature Engineering & Selection
Feature Engineering & Selection
 
Presentation by Lionel Briand
Presentation by Lionel BriandPresentation by Lionel Briand
Presentation by Lionel Briand
 
Policy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionPolicy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detection
 
Adversarial Input Detection Using Image Processing Techniques (IPT)
Adversarial Input Detection Using Image Processing Techniques (IPT)Adversarial Input Detection Using Image Processing Techniques (IPT)
Adversarial Input Detection Using Image Processing Techniques (IPT)
 
Lesson 33
Lesson 33Lesson 33
Lesson 33
 
Generative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variantsGenerative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variants
 
Logistic Regression
Logistic RegressionLogistic Regression
Logistic Regression
 
Introduction To Applied Machine Learning
Introduction To Applied Machine LearningIntroduction To Applied Machine Learning
Introduction To Applied Machine Learning
 
Modeling & Simulation Lecture Notes
Modeling & Simulation Lecture NotesModeling & Simulation Lecture Notes
Modeling & Simulation Lecture Notes
 
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshopJRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
 
Automated Machine Learning
Automated Machine LearningAutomated Machine Learning
Automated Machine Learning
 
Launching into machine learning
Launching into machine learningLaunching into machine learning
Launching into machine learning
 
Hyperparameter Tuning
Hyperparameter TuningHyperparameter Tuning
Hyperparameter Tuning
 
Recommendation algorithm using reinforcement learning
Recommendation algorithm using reinforcement learningRecommendation algorithm using reinforcement learning
Recommendation algorithm using reinforcement learning
 
Deep Learning For Practitioners, lecture 2: Selecting the right applications...
Deep Learning For Practitioners,  lecture 2: Selecting the right applications...Deep Learning For Practitioners,  lecture 2: Selecting the right applications...
Deep Learning For Practitioners, lecture 2: Selecting the right applications...
 
Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018
 
Context-aware preference modeling with factorization
Context-aware preference modeling with factorizationContext-aware preference modeling with factorization
Context-aware preference modeling with factorization
 
Understanding Basics of Machine Learning
Understanding Basics of Machine LearningUnderstanding Basics of Machine Learning
Understanding Basics of Machine Learning
 

Similar a Machine learning Introduction

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningAI Summary
 
How Machine Learning Helps Organizations to Work More Efficiently?
How Machine Learning Helps Organizations to Work More Efficiently?How Machine Learning Helps Organizations to Work More Efficiently?
How Machine Learning Helps Organizations to Work More Efficiently?Tuan Yang
 
Carolina AI Meetup Nov 2018
Carolina AI Meetup Nov 2018Carolina AI Meetup Nov 2018
Carolina AI Meetup Nov 2018Sergey A. Razin
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learningAkshay Kanchan
 
To bag, or to boost? A question of balance
To bag, or to boost? A question of balanceTo bag, or to boost? A question of balance
To bag, or to boost? A question of balanceAlex Henderson
 
Introduction to Neural Network
Introduction to Neural NetworkIntroduction to Neural Network
Introduction to Neural NetworkYan Xu
 
Using Tree algorithms on machine learning
Using Tree algorithms on machine learningUsing Tree algorithms on machine learning
Using Tree algorithms on machine learningRajasekhar364622
 
in5490-classification (1).pptx
in5490-classification (1).pptxin5490-classification (1).pptx
in5490-classification (1).pptxMonicaTimber
 
Machine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University ChhattisgarhMachine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University ChhattisgarhPoorabpatel
 
Hadoop Summit 2010 Machine Learning Using Hadoop
Hadoop Summit 2010 Machine Learning Using HadoopHadoop Summit 2010 Machine Learning Using Hadoop
Hadoop Summit 2010 Machine Learning Using HadoopYahoo Developer Network
 
Machine Learning Innovations
Machine Learning InnovationsMachine Learning Innovations
Machine Learning InnovationsHPCC Systems
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautzbutest
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautzbutest
 

Similar a Machine learning Introduction (20)

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
How Machine Learning Helps Organizations to Work More Efficiently?
How Machine Learning Helps Organizations to Work More Efficiently?How Machine Learning Helps Organizations to Work More Efficiently?
How Machine Learning Helps Organizations to Work More Efficiently?
 
LR1. Summary Day 1
LR1. Summary Day 1LR1. Summary Day 1
LR1. Summary Day 1
 
Carolina AI Meetup Nov 2018
Carolina AI Meetup Nov 2018Carolina AI Meetup Nov 2018
Carolina AI Meetup Nov 2018
 
Learning from data
Learning from dataLearning from data
Learning from data
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learning
 
AL slides.ppt
AL slides.pptAL slides.ppt
AL slides.ppt
 
To bag, or to boost? A question of balance
To bag, or to boost? A question of balanceTo bag, or to boost? A question of balance
To bag, or to boost? A question of balance
 
Introduction to Neural Network
Introduction to Neural NetworkIntroduction to Neural Network
Introduction to Neural Network
 
c23_ml1.ppt
c23_ml1.pptc23_ml1.ppt
c23_ml1.ppt
 
Using Tree algorithms on machine learning
Using Tree algorithms on machine learningUsing Tree algorithms on machine learning
Using Tree algorithms on machine learning
 
in5490-classification (1).pptx
in5490-classification (1).pptxin5490-classification (1).pptx
in5490-classification (1).pptx
 
Machine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University ChhattisgarhMachine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University Chhattisgarh
 
Deep learning - a primer
Deep learning - a primerDeep learning - a primer
Deep learning - a primer
 
Deep learning - a primer
Deep learning - a primerDeep learning - a primer
Deep learning - a primer
 
Hadoop Summit 2010 Machine Learning Using Hadoop
Hadoop Summit 2010 Machine Learning Using HadoopHadoop Summit 2010 Machine Learning Using Hadoop
Hadoop Summit 2010 Machine Learning Using Hadoop
 
Machine Learning Innovations
Machine Learning InnovationsMachine Learning Innovations
Machine Learning Innovations
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 

Más de Dong Guo

Convex optimization methods
Convex optimization methodsConvex optimization methods
Convex optimization methodsDong Guo
 
AlphaGo zero
AlphaGo zeroAlphaGo zero
AlphaGo zeroDong Guo
 
DQN (Deep Q-Network)
DQN (Deep Q-Network)DQN (Deep Q-Network)
DQN (Deep Q-Network)Dong Guo
 
机器学习概述
机器学习概述机器学习概述
机器学习概述Dong Guo
 
Expectation propagation
Expectation propagationExpectation propagation
Expectation propagationDong Guo
 
Additive model and boosting tree
Additive model and boosting treeAdditive model and boosting tree
Additive model and boosting treeDong Guo
 

Más de Dong Guo (6)

Convex optimization methods
Convex optimization methodsConvex optimization methods
Convex optimization methods
 
AlphaGo zero
AlphaGo zeroAlphaGo zero
AlphaGo zero
 
DQN (Deep Q-Network)
DQN (Deep Q-Network)DQN (Deep Q-Network)
DQN (Deep Q-Network)
 
机器学习概述
机器学习概述机器学习概述
机器学习概述
 
Expectation propagation
Expectation propagationExpectation propagation
Expectation propagation
 
Additive model and boosting tree
Additive model and boosting treeAdditive model and boosting tree
Additive model and boosting tree
 

Último

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Último (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Machine learning Introduction

  • 1. Machine  Learning  Introduc1on   guodong@hulu.com     Machine  learning  introduc0on   Logis1c  regression   Feature  selec1on   Boos1ng,  tree  boos1ng     See  more  ML  posts:  h>p://dongguo.me/    
  • 2. Machine  Learning  Makes  Life  Be>er  
  • 3. WHAT  IS  MACHINE  LEARNING?  
  • 4. Learning   •  What  is  learning   –  Find  rules  from  data/experience   •  Why  learning  is  possible   –  Assume  rules  exist  in  this  world   •  How  to  learn   –  Induc1ve  
  • 5. What  is  machine  learning   •  “Machine  Learning  is  a  field  of  study  that  gives   computers  the  ability  to  learn  without  being   explicitly  programmed”  -­‐  Arthur  Samuel  (1959)   •  Machine  learning  is  the  study  of  computer   algorithms  that  improve  automa1cally  through   experience”  –  Tom  Mitchell  (1998)  
  • 6. Overview  of  machine  learning       Machine  Learning   Unsupervised   Learning   Supervised   Learning   Classifica1on   Semi-­‐supervised   Learning   Regression  
  • 7. Outline   •  Supervised  Learning   •  Case  Study   •  Challenge   •  Resource  
  • 8. Supervised  learning   •  Concepts   •  Defini1on   •  Models   •  Metrics   •  Open  Ques1ons  
  • 9. Concepts   Problem       Generate  dataset   Dataset   Train   Sample/instance   Feature  vector   label   model   Predict   Test   Model  Tuning   Feature  selec0on  
  • 10. What  is  Supervised  learning   •  Find  a  func1on  (from  some  func1on  space)  to   predict  for  unseen  instances,  from  the  labeled   training  data   –  Func1on  space:  determined  by  the  chosen  model   –  Find  the  func1on:  minimize  error  on  training  data  with   some  cost  func1on   •  2  types:  Classifica1on  and  regression  
  • 11. Formal  defini1on   •  Given  a  training  dataset   r N {xi , yi }i =1 •  And  define  a  loss  func1on   ∧ ∧ L( y, y ), where y = f ( x) •  Target   ∧ f ( x) =arg min G ( f ), f 1 st. G ( f ) = N N ∑ L( y , f ( x )) i =1 i i
  • 12. Models  for  supervised  learning   •  Classifica1on  and  regression   –  For  classifica1on:  LR(Logis1c  regression),  Naïve  Bayes   –  For  regression:  linear  regression   –  For  Both:  Trees,  KNN,  SVM,  ANN   •  Genera1ve  and  Discrimina1ve   –  Genera1ve:  Naïve  Bayes,  GMM,  HMM   –  Discrimina1ve:  KNN,  LR,  SVM,  ANN,  Trees   •  Parametric  and  nonparametric   –  Parametric:  LR,  Naïve  Bayes,  ANN   –  nonparametric:  Trees,  KNN,  kernel  methods  
  • 13. Decision  Tree   •  Would  you  like  to  date  somebody?   Gender   male   female   Good   looking?   Yes!   Pass   No!   umm..   Pass   Others…   Accept   Very  good   Accept   else   Pass  
  • 15. Naïve  Bayes   •  Bayes  classifier   •  Condi1onal  Independence  assump1on   •  With  this  assump1on    
  • 16. Logis1c  regression   •  Logis1c  func1on      
  • 19. Model  Inference   •  Typical  inference  methods   –  Gradient  descent   –  Expecta1on  Maximiza1on   –  Sampling  based  
  • 20. Model  ensemble   •  Averaging  or  vo1ng  output  of  mul1ply  classifiers   •  Bagging  (bootstrap  aggrega1ng)   –  Train  mul1ple  base  models   –  Vote  mul1ply  base  classifiers  with  same  weight   –  Improve  model  stability  and  avoid  overfihng   –  Work  well  on  unstable  base  classifier   •  Adaboost  (adap1ve  boos1ng)   –  Sequen1al  base  classifiers   –  Misclassified  instances  have  higher  weight  in  next  base   classifier   –  Weighted  vo1ng  
  • 21. Evalua1on  metrics   •  Common  Metrics  for  classifica1on   –  Accuracy   –  Precision-­‐Recall   –  AUC   •  For  regression   –  Mean  absolute  error  (MAE)   –  Mean  square  error  (MSE),  RMSE  
  • 22. Ques1on1:  How  to  choose  a  suitable  model?   Characteris0c   Naïve   Bayes   Trees   K  Nearest   neighbor   Logis0c   regression   Neural   SVM   Networks   Natural  handling   data  of  “mixed”   type   Robustness  to   outliers  in  input   space   Computa1onal   scalability   Interpretability   1   3   1   1   1   1    3   3   3   3     1   1   3   3   1   3   1   1    2   2     1    2   1   1   Predic1ve  power   1   1    3   2   3   3   <Elements  of  Sta-s-cal  Learning>  II  P351      
  • 23. Ques1on2:  Can  we  find  a  100%  accurate  model?       •  Expected  risk   •  Empirical  risk   •  Choose  a  family          for  candidate  predic1on  func1ons     •  Error  
  • 24. Case  study:  Predic1ve  Demographic       Feature  extrac1on  (‘show’,  ‘ad  vote’,  ‘ad   selec1on’)   feature  analysis  (remove  ‘ad  selec1on’)   Load  login  profile   ML  problem?  What  kind?    Labels?   Evalua1on  metric?   Possible  features?  (show,  ad  vote,   ad  selec1on,  search…)    Accessible?       Problem   Dataset  genera1on   Choose  a  Model   1.  Familiar?  (NB,  ANN,  LR,  Tree,  SVM)   2.  Computa1onal  cost?  Interpretability?   Precision?     3.  Data:  amount?  noise  ra1o?     Train   Try  more  features(add   ‘OS’,  ‘browser’,  ‘flash’)   Feature  selec1on  (remove   ‘flash’,  and  non   anonymous  features)   Predictor     Try  more  models   Tuning   Evalua1on  (AUC,   Precision-­‐recall)   Test   Challenges   (Noise,  different  Join  distribu1on,  evalua1on)       model  ensemble   Predictor  on  product   Scoring   Online  Update  
  • 25. Challenges  in  Machine  learning   •  Data   –  Sparse  data  in  high  dimensions   –  Limited  labels     •  Computa1on  Cost   –  Speed  Up  advanced  models   –  Paralleliza1on   •  Applica1on   –  Structured  predic1on  
  • 27. Top  conference   •  •  •  •  •  ICML   NIPS   IJCAI/AAAI   KDD   Other  related   –  WSDM,  WWW,  SIGIR,  CIKM,  ICDE,  ICDM  
  • 28. Books   •  •  •  •  Machine  Learning  [link]      by  Mitchell   Pa-ern  Recogni0on  and  Machine  Learning  [link]  by  Bishop   The  Elements  of  Sta0s0cal  Learning  [link]   Scaling  Up  Machine  Learning  [link]  
  • 29. Lectures   •  Machine  Learning  open  class  –  by  Andrew  Ng   –  Video  in  YouTube   •  Advanced  topics  in  Machine  Learning  –  Cornell   •  h>p://videolectures.net/  
  • 30. Other  research  resource   •  Research  Organs   –  Yahoo  Research  [link]   –  Google  Research  publica1ons  [link]   •  Dataset   –  UCI  machine  learning  Repository  [link]   –  kaggle.com  

Notas del editor

  1. Unsupervised learning(聚类,降维(topic model)): learn structure from unlabeled data. Closely related with density estimation; summarize the dataSemi-supervised learning: use both labeled and unlabeled samples for training; It’s cost to collect lots of labels sometimes, use both
  2. 除此之外,你对模型的熟悉程度。
  3. Expected risk: 定义好loss function,选择一个预估函数,有一个输入变量和response value的联合分布,在该联合分布上对损失函数求积分,即为期望风险;通过最小化该期望风险,我们找到一个最优的预估函数。但是实际上,我们并不知道该联合分布,我们有的是从该联合分布中有偏或无偏采样得到的有限样本,可能还有一些noise点。我们转为最小化在该有限样本上的最小化loss function寻找预估函数。 即我们转为最小化经验风险另一方面,我们往往给目标函数指定function family,该function family极有可能没有包含最优或者较优的那些目标函数。误差的大小: 第一部分:函数family F中的预估函数有多接近真正最优的预估函数;第二部分:我们选择优化经验而不是经验风险
  4. Logistic regression is one of the most popular classifier.Advantage: 1. easy understand and implement; 2. not bad performance; 3. light weight and less time taken for training and prediction;(can handle large dataset) 4. easy parallelizationValue to attendances:Know about what is logistic regression, what’s the advantages and disadvantage. what kind of problems are suitable apply to.L1 and L2 regularizationHow to inference through maximize likelihood with gradient descent. And know how to implement it
  5. 对于generalized linear model,如果response variable是binomial或者multinomial分布,且选择了logit function作为link function 就是logistic regressionLogistic function 是logit function的反函数
  6. Binary(binomial) logistic regression
  7. 负梯度方向是使得函数值下降最快的方向
  8. 在重新计算likelihood前,我们看一下这2种正则化背后的理论基础
  9. 假设全部的weight服从一致的分布
  10. Laplace 分布一阶倒数不连续假设全部的weight服从一致的分布(均值为0,Laplace参数也一样)W_k在一次更新中不能变换正负号
  11. L1拟合得到的weight通常较稀疏,带来2点好处: 帮我们做特征选择,工程上更有利
  12. 增加了decay ratio:AUC稍有提高(0.845 -&gt; 0.847)在不同step时,适合的decay ratio也不一样Iteration times: 与样本量的大小有关
  13. 例子:今天是高考第一天,高考选专业,每个人有多个候选,但是仅能选择一门专业(计算机,金融,化学,数学,物理,生物)和binomial分布应用的区别多类问题,可以转化为多个两类问题,如果我们的问题是“找出每门课成绩前10%的学生”,我们可以用两类logistic regression来做如果问题是“对于每个学生找出其成绩最好的课,或者最好的几门课”,两类问题就不是很适合 (每一类上的预估概率之和不等于1,无法比较不同类上的概率)Multi-nominal适用于response value为category的情况,不太适合ordinal的情况。我实现了。
  14. Link function: (1) generalized linear model的重要组成部分:将linear regression拓展到generalized linear model;(2)link function的反函数的自变量介于(-无穷,+无穷),若y服从binominal分布,应变量介于【0,1】区间The inverse of any continuous cumulative distribution function (CDF) can be used for the link since the CDF’s range is [0,1]
  15. Generalized linear model 广义上的线性模型,都有一个基本的线性单元W*X(linear regression),通过各种link function建立该线性单元和各种分布的response variable的关系。包含linear regression (normal distribution),logistic regression (binominal/multi-nominal distribution), Poisson regression (Poisson distribution)对于binominal/multi-nominal distribution,我们也可以选择除logit link function之外的link function (广义的logistic regression)