Amitpal Tagore, Integral Ad Science - Leveraging Data for Successful Ad Campaigns - H2O World 2019 NYC

Leveraging Data for Successful Ad Campaigns
Amitpal Tagore
Data Scientist
Integral Ad Science

We work with the world’s leading brands
2
Available brand safety categories vary by language

Integral Ad Science
3
Brand Safety
Fraud detection
Viewability
Optimization

Brand safety
4
The Product: IAS Brand Safety
Dynamically score websites and control what
content will appear with your advertisement. Your
Ad
Here
TRAGIC
NEWS
STORY
#&^!!
@#*
$^*%

LocalChron.com -- NATURE
HEADLINE NEWS
Ad Banner
Internal News Link
Internal News Link
Internal News Link
Internal News Link
Local US International Sports Tech Entertainment Nature
External Link
External Link
External Link
External Link
Ad Banner
Ad Banner
Similar layouts across subdomains
Repeated text across subdomains
Large number of subdomains
External links
Text from various fields
Metadata -- Keywords

40+ languages
English
Spanish
Italian
Japanese
French
Chinese
...
7+ categories
Violence
Adult
Alcohol
Gambling
Hate speech
Illegal drugs
Illegal downloads
...
3-5+ risk levels
Low
Medium
High
...
Millions of URLs
Model complexity

der Fünfjahresvertrag
我跟你讲。
Languages
Compound words
Lack of whitespace
Character meanings
Cultural differences
Varying alphabets
Tea vs dinner
ਸੰਸਾਰ Мир ‫العالمية‬
Phonetic transcriptionनमस्ते = namaste

H2O Driverless AI
8
https://www.h2o.ai/products-dai-nlp

Why do ad fraud?
10
Source: Hewlett Packard Enterprises, “The Business of Hacking”, May 2016
PAYOUTPOTENTIAL
EFFORT & RISK ESTIMATION
Cyber
warfare
Identity
theft
Organized
crime
IP theft
Extortion
Ad fraud
Payment system
fraud
Bank fraud
Medical records
fraud
Credential
harvesting
Credit card
fraud
Hacktivism
HIGH LOW
HIGH
LOW

The monetary impact of fraud
11
The IAB and Ernst & Young
estimate that $4.6 billion is lost
due to ad fraud/NHT annually

Detecting & preventing ad fraud: 3 pillars
12
Behavioral & network
analysis
Browser & device analysis Targeted reconnaissance &
malware analysis
• Dissection of malware and
infiltration of hacker communities
• Validate that browser viewing
ad is a real, human web
browser like Chrome or Mobile
Safari
• Validate that device viewing ad
is actually an iPhone or
Windows 10 computer
• Differentiate human from bot
behavior
• Process vast amounts of data

Hidden Ads
13
Why load one ad when you can load many?

Hidden Ads
14

Hidden Ads
15

Hidden Ads
16
Why load one site when you can load many?

Predicted Viewability: Product overview & Implementation
17
The Product: IAS Predicted Viewability
Provides advertisers with a probability that their
ads will be seen by the end user.
Out of view portion of the web site
Ad in-
view
Your Ad Here

Data Description
18
➢ Proprietary Data: Ad impression logs collected from advertising
campaigns of our clients.
➢ Available data points:
○ URL properties
○ User’s device/environment (desktop web, mobile web, mobile app)
○ Impression Type (banner, video)
○ Viewability measurements

Tools Used
19
Data exploration / preparation / cleansing: Apache Hive
AI Hosts: G3.16xlarge with 64 vCPUS and 4 NVIDIA Tesla GPUs
AI-driven feature engineering framework: H2O DriverlessAI
AI-driven workflow for model ML optimization: H2O AutoML
Hierarchical Post modeling Processing: Jupyter/Python

Total number of features:
Preferred Models:
Significant improvement:
Results
20
46 machine engineered features from
Driverless AI.
LightGBM, gradient boosted regression
H2O Driverless AI improved accuracy and
provides insights.

Motivation and Business Value
21
➢ Gain competitive edge with first-in-class ML-based
brand safety and viewability predictions, aided by
H2O Driverless IA
➢ Enable greater flexibility for our clients’ custom
viewability standards
➢ Enable rapid model development, testing, and
deployment
➢ Provide insights in an automated framework

Thank You
Social handles (in: tagoreas)
Email (atagore@integralads.com)

Amitpal Tagore, Integral Ad Science - Leveraging Data for Successful Ad Campaigns - H2O World 2019 NYC

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a Amitpal Tagore, Integral Ad Science - Leveraging Data for Successful Ad Campaigns - H2O World 2019 NYC

Similar a Amitpal Tagore, Integral Ad Science - Leveraging Data for Successful Ad Campaigns - H2O World 2019 NYC (20)

Más de Sri Ambati

Más de Sri Ambati (20)

Último

Último (20)

Amitpal Tagore, Integral Ad Science - Leveraging Data for Successful Ad Campaigns - H2O World 2019 NYC