Machine learning Introduction

Machine
Learning
Introduc1on

guodong@hulu.com

Machine
learning
introduc0on

Logis1c
regression

Feature
selec1on

Boos1ng,
tree
boos1ng

See
more
ML
posts:
h>p://dongguo.me/

Machine
Learning
Makes
Life
Be>er

WHAT
IS
MACHINE
LEARNING?

Learning

•  What
is
learning

–  Find
rules
from
data/experience

•  Why
learning
is
possible

–  Assume
rules
exist
in
this
world

•  How
to
learn

–  Induc1ve

What
is
machine
learning

•  “Machine
Learning
is
a
ﬁeld
of
study
that
gives

computers
the
ability
to
learn
without
being

explicitly
programmed”
-‐
Arthur
Samuel
(1959)

•  Machine
learning
is
the
study
of
computer

algorithms
that
improve
automa1cally
through

experience”
–
Tom
Mitchell
(1998)

Overview
of
machine
learning

Machine
Learning

Unsupervised

Learning

Supervised

Learning

Classiﬁca1on

Semi-‐supervised

Learning

Regression

Outline

•  Supervised
Learning

•  Case
Study

•  Challenge

•  Resource

Supervised
learning

•  Concepts

•  Deﬁni1on

•  Models

•  Metrics

•  Open
Ques1ons

Concepts

Problem

Generate
dataset

Dataset

Train

Sample/instance

Feature
vector

label

model

Predict

Test

Model
Tuning

Feature
selec0on

What
is
Supervised
learning

•  Find
a
func1on
(from
some
func1on
space)
to

predict
for
unseen
instances,
from
the
labeled

training
data

–  Func1on
space:
determined
by
the
chosen
model

–  Find
the
func1on:
minimize
error
on
training
data
with

some
cost
func1on

•  2
types:
Classiﬁca1on
and
regression

Formal
deﬁni1on

•  Given
a
training
dataset

r
N
{xi , yi }i =1

•  And
deﬁne
a
loss
func1on

∧

∧

L( y, y ), where y = f ( x)

•  Target

∧

f ( x) =arg min G ( f ),
f

1
st. G ( f ) =
N

N

∑ L( y , f ( x ))
i =1

i

i

Models
for
supervised
learning

•  Classiﬁca1on
and
regression

–  For
classiﬁca1on:
LR(Logis1c
regression),
Naïve
Bayes

–  For
regression:
linear
regression

–  For
Both:
Trees,
KNN,
SVM,
ANN

•  Genera1ve
and
Discrimina1ve

–  Genera1ve:
Naïve
Bayes,
GMM,
HMM

–  Discrimina1ve:
KNN,
LR,
SVM,
ANN,
Trees

•  Parametric
and
nonparametric

–  Parametric:
LR,
Naïve
Bayes,
ANN

–  nonparametric:
Trees,
KNN,
kernel
methods

Decision
Tree

•  Would
you
like
to
date
somebody?

Gender

male

female

Good

looking?

Yes!

Pass

No!

umm..

Pass

Others…

Accept

Very
good

Accept

else

Pass

K-‐Nearest
Neighbor
classiﬁer

K=15

K=1

Naïve
Bayes

•  Bayes
classiﬁer

•  Condi1onal
Independence
assump1on

•  With
this
assump1on

Logis1c
regression

•  Logis1c
func1on

Ar1ﬁcial
neural
network

Support
vector
machine

Model
Inference

•  Typical
inference
methods

–  Gradient
descent

–  Expecta1on
Maximiza1on

–  Sampling
based

Model
ensemble

•  Averaging
or
vo1ng
output
of
mul1ply
classifiers

•  Bagging
(bootstrap
aggrega1ng)

–  Train
mul1ple
base
models

–  Vote
mul1ply
base
classifiers
with
same
weight

–  Improve
model
stability
and
avoid
overfihng

–  Work
well
on
unstable
base
classifier

•  Adaboost
(adap1ve
boos1ng)

–  Sequen1al
base
classifiers

–  Misclassified
instances
have
higher
weight
in
next
base

classifier

–  Weighted
vo1ng

Evalua1on
metrics

•  Common
Metrics
for
classiﬁca1on

–  Accuracy

–  Precision-‐Recall

–  AUC

•  For
regression

–  Mean
absolute
error
(MAE)

–  Mean
square
error
(MSE),
RMSE

Ques1on1:
How
to
choose
a
suitable
model?

Characteris0c

Naïve

Bayes

Trees
K
Nearest

neighbor

Logis0c

regression

Neural

SVM

Networks

Natural
handling

data
of
“mixed”

type

Robustness
to

outliers
in
input

space

Computa1onal

scalability

Interpretability

1

3

1

1

1

1

3

3

3

3

1

1

3

3

1

3

1

1

2

2

1

2

1

1

Predic1ve
power

1

1

3

2

3

3

<Elements
of
Sta-s-cal
Learning>
II
P351

Ques1on2:
Can
we
ﬁnd
a
100%
accurate
model?

•  Expected
risk

•  Empirical
risk

•  Choose
a
family

for
candidate
predic1on
func1ons

•  Error

Case
study:
Predic1ve
Demographic

Feature
extrac1on
(‘show’,
‘ad
vote’,
‘ad

selec1on’)

feature
analysis
(remove
‘ad
selec1on’)

Load
login
profile

ML
problem?
What
kind?

Labels?

Evalua1on
metric?

Possible
features?
(show,
ad
vote,

ad
selec1on,
search…)

Accessible?

Problem

Dataset
genera1on

Choose
a
Model

1.  Familiar?
(NB,
ANN,
LR,
Tree,
SVM)

2.  Computa1onal
cost?
Interpretability?

Precision?

3.  Data:
amount?
noise
ra1o?

Train

Try
more
features(add

‘OS’,
‘browser’,
‘flash’)

Feature
selec1on
(remove

‘flash’,
and
non

anonymous
features)

Predictor

Try
more
models

Tuning

Evalua1on
(AUC,

Precision-‐recall)

Test

Challenges

(Noise,
different
Join
distribu1on,
evalua1on)

model
ensemble

Predictor
on
product

Scoring

Online
Update

Challenges
in
Machine
learning

•  Data

–  Sparse
data
in
high
dimensions

–  Limited
labels

•  Computa1on
Cost

–  Speed
Up
advanced
models

–  Paralleliza1on

•  Applica1on

–  Structured
predic1on

Resource

• 
• 
• 
• 

Conference

Books

Lectures

Dataset

Top
conference

• 
• 
• 
• 
• 

ICML

NIPS

IJCAI/AAAI

KDD

Other
related

–  WSDM,
WWW,
SIGIR,
CIKM,
ICDE,
ICDM

Books

• 
• 
• 
• 

Machine
Learning
[link]

by
Mitchell

Pa-ern
Recogni0on
and
Machine
Learning
[link]
by
Bishop

The
Elements
of
Sta0s0cal
Learning
[link]

Scaling
Up
Machine
Learning
[link]

Lectures

•  Machine
Learning
open
class
–
by
Andrew
Ng

–  Video
in
YouTube

•  Advanced
topics
in
Machine
Learning
–
Cornell

•  h>p://videolectures.net/

Other
research
resource

•  Research
Organs

–  Yahoo
Research
[link]

–  Google
Research
publica1ons
[link]

•  Dataset

–  UCI
machine
learning
Repository
[link]

–  kaggle.com

Machine learning Introduction

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a Machine learning Introduction

Similar a Machine learning Introduction (20)

Más de Dong Guo

Más de Dong Guo (6)

Último

Último (20)

Machine learning Introduction

Notas del editor