4. CTR Prediction
CTR: Click-Through Rate
pCTR: predicted CTR
Question
How likely is the user to click on the ad?
Why
Proxy for relevance
5.5%
0.8%
9.2%
?
6. pCTR Model History
(CC) from Flickr: "Wednesday Freedom 11"
by Parker Knight
(CC) from Flickr: "Icelandig sheepdog"
by Thomas Quine(CC) from Flickr: by Craige Moore
French
Brittany
Icelandic
Sheepdog
Jindo Kuvasz
16. Offline Training at Yelp
merge logs sampling
feature
extraction
model
training
evaluation
mrjob
AWS EMR
daily scheduled
pipeline
kicked off manually
mrjob
AWS EMR
Spark
mrjob
AWS EMR
mrjob
AWS EMR
mrjob
AWS EMR
new
features
(CC) from Flickr: "Cloud" by Jason Pratt
22. Focus on a single metric
(but don't trust it blindly)
Create helpful visualizations
Tools: Zeppelin
Evaluation
data model
prediction
verification
evaluation
fast
scalable
32. Lessons Learned
Infrastructure
Log at source of online prediction
Verify predictions
Make offline iterations fast & scalable
Model Comprehension
Evaluate, evaluate, evaluate
Be aware of threshold effects
38. Lessons Learned
Above all, keep it simple.
Infrastructure
Log at source of online prediction
Verify predictions
Make offline iterations fast & scalable
Model Comprehension
Evaluate, evaluate, evaluate
Be aware of threshold effects