There is a deeply symbiotic relationship between machine learning/predictive modeling and Big Data. Machine learning theory asserts that more data is better, and empirical observations suggest that more granular data, a hallmark of Big Data, further improves performance. Predictive modeling is one of the core techniques that measurably delivers value across many industries and thereby demonstrates the value of Big Data. However, there is a surprising paradox of predictive modeling: when you need models most, even all the data is not enough, or simply not suitable. The foundation of predictive modeling is having enough training data with the respective outcomes, preferably IID. But often this data is not available: there are only so many people buying luxury cars online to inform my targeting models. I can never observe what happens BOTH when I treat you AND when I don’t – which is exactly what I would need to make causal claims and measure the impact of strategic decisions. To allocate sales resources, I would love to know a customer’s budget – but perhaps even the customer does not know it. So in this day and age of Big Data, there remains an art to machine learning in situations where the right data is scarce. This talk will present a number of cases where enough of the right data is fundamentally not obtainable and show how creative data science can still solve them.