SlideShare una empresa de Scribd logo
1 de 28
Descargar para leer sin conexión
EMBEDDING FLINK THROUGHOUT
AN OPERATIONALIZED STREAMING
ML LIFECYCLE
Dave Torok, Senior Principal Architect
Sameer Wadkar, Senior Principal Architect
10 April, 2018
2
INTRODUCTION AND BACKGROUND
CUSTOMER EXPERIENCE TEAM
27 MILLION CUSTOMERS (HIGH SPEED DATA, VIDEO,
VOICE, HOME SECURITY, MOBILE)
INGESTING ABOUT 2 BILLION EVENTS / MONTH
SOME HIGH-VOLUME MACHINE-GENERATED EVENTS
TYPICAL STREAMING DATA ARCHITECTURE
DATA ETL, LAND IN A TIME SERIES DATA LAKE
GREW FROM A FEW DOZEN TO 150+ DATA SOURCES
/ FEEDS IN ABOUT A YEAR
Comcast collects, stores, and uses all data in accordance with our privacy disclosures to users and applicable laws.
3
BUSINESS PROBLEM
INCREASE POSITIVE CUSTOMER EXPERIENCES
RESOLVE POTENTIAL ISSUES CORRECTLY AND
QUICKLY
PREDICT AND DIAGNOSE SERVICE TROUBLE
ACROSS MULTIPLE KNOWLEDGE DOMAINS
REDUCE COSTS THROUGH EARLIER RESOLUTION
AND BY REDUCING AVOIDABLE TECHNICIAN VISITS
4
TECHNICAL PROBLEM
MULTIPLE PROGRAMMING AND DATA SCIENCE
ENVIRONMENTS
WIDESPREAD AND DISCORDANT DATA SOURCES
THE “DATA PLANE” PROBLEM: COMBINING DATA AT
REST AND DATA IN MOTION
ML VERSIONING: DATA, CODE, FEATURES, MODELS
5
SOLUTION MOTIVATION
SELF-SERVICE
PLATFORM
ALIGN DATA SCIENTISTS
AND PRODUCTION
MODELS TREATED
AS CODE
HIGH THROUGHPUT
STREAM PLATFORM
6
MACHINE LEARNING LIFECYCLE
USE CASE DEFINITION
FEATURE EXPLORATION / ENGINEERING
MODEL TRAINING
MODEL EVALUATION
MODEL ARTIFACT DELIVERY (POJO/DOCKER)
MODEL SELECTION
MODEL OPERATIONALIZATION
MODEL PERFORMANCE MONITORING ON LIVE DATA
(A/B & MULTIVARIATE TESTING)
PUSH MODEL TO PRODUCTION
RETRAIN MODEL ON NEWER DATA
7
EXAMPLE NEAR REAL TIME
PREDICTION USE CASE
CUSTOMER RUNS A “SPEED TEST”
EVENT TRIGGERS A PREDICTION FLOW
ENRICH WITH NETWORK HEALTH AND OTHER
INDICATORS
EXECUTE ML MODEL
PREDICT WHETHER IT IS A WIFI, MODEM, OR
NETWORK ISSUE
Detect
Enrich
Predict
Gather Data
Event
ML
Model
Engage Customer
Act / Notify
Network Diagnostic Services
Slow
Speed?
Additional Context Services
Run
Prediction
8
ML PIPELINE ARCHITECTURE PRINCIPLES
Metadata Driven
Feature/Model
Definition,
Versioning , Feature
Assembly, Model
Deployment, Model
Monitoring is
metadata driven
Automation
Orchestrated
Deployment for
new features and
models
Rapid
Onboarding
Portal for Model
and Feature
Management as
well Model
Deployment
Data Consistency
Feature store
enforces a
consistent data
pipeline ensuring
that the data
used for training
is functionally
identical to the
data used for
predictions
Monitoring and
Metrics
Ability to execute
& monitor
multiple models
in production to
enable real-time
metrics driven
model selection
Iterative/Consistent
Model
Development
Multiple versions of
the model can be
developed
iteratively while
consuming from a
consistent dataset
(feature store),
enables A/B &
Multivariate Testing
9
ML PIPELINE – ROLES & WORKFLOW
Define
Use
Case
Business User
Data Scientist
ML Operations
Explore
Features
Create and
publish new
features
Create &
Validate
Models
Model
Selection
Go Live with
Selected
Models
• Define Online Feature
Assembly
• Define pipeline to
collect outcomes
• Model Deployment
and Monitoring
Model
Review
Iterate
Evaluate
Live Model
Performance
Inception Exploration
Model
Development
Candidate Model
Selection
Model
Operationalization
Model
Evaluation
Go Live
Phase
Monitor Live
ModelsCollect new data & retrain
Iterate
1 0
WHY APACHE FLINK?
UTILIZED AS ORCHESTRATION & ETL ENGINE
FIRST-CLASS STREAMING MODEL
PERFORMANCE
RICH STATEFUL SEMANTICS
TEAM EXPERIENCE
OPEN SOURCE
GROWING COMMUNITY
Apache®, Apache Flink®, and the squirrel logo are either
registered trademarks or trademarks of the Apache
Software Foundation in the United States and/or other
countries.
1 1
THE “DATA PLANE” PROBLEM
Streaming Compute Pipeline
AWS
S3
HDFS
Data File Abstraction
Databases
MODEL
Streaming
State
Sum
Avg
Time
Buckets
Stream
Data
QUERY
Enterprise Services
Data Sets at Rest
1 2
ML MODEL EXECUTION
MODEL
EXECUTION
TRIGGER
1. Payload only
contains Model
Name & Account
Number
FEATURE
ASSEMBLY
Model
Metadata
Online
Feature
Store
2. Model Metadata
informs which
features are needed
for a model
3. Pull required
features by account
number
MODEL
EXECUTION
4. Pass full set of
assembled features
for model execution
5. Prediction
1 3
SOLUTION
Rest
Service
Inputs to REST
Service:
1.Model Name
2.Account No
SELECT MODEL BASED ON
RULES (ON-
DEMAND/STREAMING)
Request Initiated
asynchronously via
pushing it to a
queue/topic
INITIATE MODEL PREDICTION REQUEST (ASYNCHRONOUSLY)
REQUESTING
APPLICATION
TRIGGER
EVENT
LISTENER
1 4
SOLUTION
ASSEMBLE FEATURES FOR
A GIVEN MODEL
Happy Path for Model Execution – All Features Current
Online
Feature
Store
Model
/Feature
Metadata
Feature
Store
API
Feature
Assembly
Model
Execution
Are All
Features
Current?
Yes
Prediction/Outc
ome Store
Prediction Sink
Store Prediction
Flow
Customer
Context
Listens
PushREQUESTING
APPLICATION
Assemble features
based on Account
Number as model
input
Collect predictions
and outcome to
create datasets for
model refinement
Store current values
of features for
interactive query
access
1 5
SOLUTION (CONT.)
ASSEMBLE FEATURES FOR
A GIVEN MODEL
Exception Path – Some/All Features are not current
Online
Feature
Store
Model
/Feature
Metadata
Feature
Store
API
Feature
Assembly
Feature Creation
Pipeline
Are All
Features
Current?
No
History
Feature Store
Online
Feature Store
Back to
Happy
Path
Feature
Assembly
Append store (Ex. S3, HDFS,
Redshift) for use by Data
Scientist for Model Training
1 6
SOLUTION – DIGGING DEEPER
Global Window,
Pane per
Request Id
Model Execution
Requests
Request
Features
KeyBy
Request Id
Apply
Function
Custom
Evictor
Model
Metadata
Connected
Stream
Periodically check if
Model TTL has expired
(onEventTime)
Arrival of each feature
triggers the model
execution (onElement)
Evict pane if
model
executed
Evict pane if
model request
expired
Execute model
or expire
Side
Outputs
Features
Custom
Trigger
1 7
FEATURE STORE
TWO TYPES OF FEATURE STORES:
• Online Feature Store – Current values by key
(Key/Value Store)
• History Feature Store – Append features as they are
collected (Ex. HDFS, S3)
MULTIPLE ONLINE FEATURE STORES BASED ON
SLA’S
• A feature can be stored in multiple online feature stores
to support model specific SLA’s.
TYPES OF ONLINE FEATURE STORE
• PostgreSQL (AWS RDS, Aurora DB) for low volume
on-demand model execution requests
• HBase, DynamoDB for high volume feature ingest
• Flink Queryable State for high volume ingest, high
velocity model execution requests
Feature Creation
Pipeline
History
Feature Store
Online
Feature Store
Prediction
Phase
Model Training
Phase
AppendOverwrite
1 8
FEATURE CREATION PIPELINES
FLINK AS REAL-TIME DATA STREAM CONSUMER
CUSTOM FLOWS FOR AGGREGATION FEATURES
SAME DATA FLOWS FOR PREDICTION (STREAMING)
& TRAINING (BATCH)
• PRODUCED FEATURES UPDATE ONLINE FEATURE
STORE (PREDICTION PHASE)
• PRODUCED FEATURES APPENDED TO S3 OR
HDFS FOR USE BY DATA SCIENTISTS (TRAINING
PHASE)
Aggregation
Features
On Demand
Feature
Raw Data
On Demand
Feature Request
External
Rest API
Push to Feature
Store
1 9
STREAMING FEATURE EXAMPLE
KAFKA ERROR STREAM (~150 / SECOND)
DETECT ACCOUNTS WITH SIGNAL ERROR WITH
COUNT > 2000 IN TRAILING 24 HOURS
SOLUTION:
AVRO DESERIALIZER WITH KEY = ACCOUNT
“24 HOUR ROLLING” HASH STRUCTURE AS STATE
FILTER FUNCTION WITH SIGNAL THRESHOLD
Flink Features Used:
Kafka Source
Keyed Stream
Value State
Sliding Window
Filter Function
2 0
ON-DEMAND FEATURE EXAMPLE
PREMISE HEATH TEST
• DIAGNOSTIC TELEMETRY INFORMATION FOR
EACH DEVICE FOR A GIVEN CUSTOMER
• EXPENSIVE - ONLY REQUESTED ON DEMAND
• MODELS USING SUCH A FEATURE WILL EXTRACT
SUB-ELEMENTS USING SCRIPTING CAPABILITIES
(MODEL METADATA & FEATURE ENGINEERING)
• MODEL METADATA WILL CONTAIN TTL
ATTRIBUTE FOR SUCH FEATURES INDICATING
THEIR TOLERANCE FOR STALE DATA
SOLUTION:
MAKE AN ON-DEMAND REQUEST FOR PHT
TELEMETRY DATA FOR IF IT IS STALE OR ABSENT
FOR A GIVEN ACCOUNT
Flink Features Used:
Async Operator
2 1
ML PREDICTION COMPONENT
• REST SERVICE
• H2O.ai Model Container (POJO)
• Python based service running specialized ML Models
• Any stateless REST service
• FLINK MAP OPERATOR
• H2O.ai Model Container (POJO) wrapped in a Flink
Map Operator
• Possibly support native calls via Flink Map Operators
running specialized Models (Ex. Tensorflow GPU
based predictions)
• Same Code Base
• Multiple Deployment Models
• REST – Low velocity, on-
demand model invocations
• Map Operators – High
velocity, streaming model
invocations
2 2
VERSIONING AND DEVOPS
EVERYTHING IS VERSIONED
• Feature/Model Metadata
• Feature Data & Model Execution environments
• Training, Validation datasets are versioned
• Feature creation pipelines are versioned
VERSIONING ALLOWS PROVENANCE &
AUDITABILITY & REPEATABILITY OF EVERY
PREDICTION
2 3
FEATURES OF THE ML PIPELINE
CLOUD AGNOSTIC
• Integrates with the AWS Cloud but not
dependent on it
• Framework should be able to work in a
non-AWS distributed environment with
configuration (not code) changes
TRACEABILITY & REPEATABILITY &
AUDITABILITY
• Model to be traced back to business use-
cases
• Full traceability from raw data to feature
engineering to predictions
• “Everything Versioned” enables
repeatability
CI/CD SUPPORT
• Code, Metadata (Hyper-Parameters) and
Data (Training/Validation Data) are
versioned. Deployable artifacts to
integrate with CI/CD Pipeline
2 4
FEATURES OF THE ML PIPELINE (CONT.)
MULTI-DEPLOYMENT OPTIONS
• Supports Throughput vs. Latency
Tradeoffs- Process in stream/batch/on-
demand
• Allows multiple versions of the
same/different models to be compared
with one another on live data
• A/B testing & Multivariate testing
• Live but dark deployments
• Supports integration of outcomes with
predictions to measure production
performance & support continuous model
re-training
PLUGGABLE (DATA AND COMPUTE)
ARCHITECTURE
• De-coupled architecture based on
message driven inter-component
communication.
• Failure of an isolated component does
not fail the entire platform
• Asynchronous behavior
• Micro-Services based design which
supports independent deployment of
components
2 5
NEXT STEPS AND FUTURE WORK
GENERATING “FLINK NATIVE” FEATURE FLOWS
• Evaluating Uber’s “AthenaX” Project / Similar Approaches
UI PORTAL FOR
• MODEL / FEATURE AND METADATA MANAGEMENT
• CONTAINERIZATION SUPPORT FOR MODEL
EXECUTION PHASE
• WORKBENCH FOR DATA SCIENTIST
• CONTINUOUS MODEL MONITORING
QUERYABLE STATE
AUTOMATING THE RETRAINING PROCESS
SUPPORT FOR MULTIPLE/PLUGGABLE FEATURE
STORES (SLA DRIVEN)
2 6
SUMMARY & LESSONS LEARNED
FLINK IS HELPING ACHIEVE OUR BUSINESS GOALS
• Near-real-time streaming context
• Container for ML Prediction Pipeline
• Stateful Feature Generation
• Multiple Solutions to the “Data Plane” Problem
• Natural Asynchronous support
• Rich windowing semantics support various aspects of
our ML Pipeline (Training/Prediction/ETL)
• Connected Streams simplify pushing metadata updates
(reduced querying load with better performance)
• Queryable State is a natural fit for high velocity and high
volume data being pushed to the online feature store
THANK YOU!
Flink Forward San Francisco 2018:  Dave Torok & Sameer Wadkar - "Embedding Flink Throughout an Operationalized Streaming ML Lifecycle"

Más contenido relacionado

La actualidad más candente

Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...
Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...
Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...Till Rohrmann
 
Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...
Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...
Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...Flink Forward
 
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang Wang
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang WangVirtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang Wang
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang WangFlink Forward
 
A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH
A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH
A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH Flink Forward
 
From Zero to Hero with Kafka Connect
From Zero to Hero with Kafka ConnectFrom Zero to Hero with Kafka Connect
From Zero to Hero with Kafka Connectconfluent
 
Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp
Kubernetes + Operator + PaaSTA = Flink @ Yelp -  Antonio Verardi, YelpKubernetes + Operator + PaaSTA = Flink @ Yelp -  Antonio Verardi, Yelp
Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, YelpFlink Forward
 
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...Flink Forward
 
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...confluent
 
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...HostedbyConfluent
 
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Ver...
Towards Flink 2.0:  Unified Batch & Stream Processing - Aljoscha Krettek, Ver...Towards Flink 2.0:  Unified Batch & Stream Processing - Aljoscha Krettek, Ver...
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Ver...Flink Forward
 
Maximilian Michels - Flink and Beam
Maximilian Michels - Flink and BeamMaximilian Michels - Flink and Beam
Maximilian Michels - Flink and BeamFlink Forward
 
Virtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu Qian
Virtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu QianVirtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu Qian
Virtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu QianFlink Forward
 
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...Flink Forward
 
Apache Flink @ Alibaba - Seattle Apache Flink Meetup
Apache Flink @ Alibaba - Seattle Apache Flink MeetupApache Flink @ Alibaba - Seattle Apache Flink Meetup
Apache Flink @ Alibaba - Seattle Apache Flink MeetupBowen Li
 
Flink Forward San Francisco 2018 keynote: Anand Iyer - "Apache Flink + Apach...
Flink Forward San Francisco 2018 keynote:  Anand Iyer - "Apache Flink + Apach...Flink Forward San Francisco 2018 keynote:  Anand Iyer - "Apache Flink + Apach...
Flink Forward San Francisco 2018 keynote: Anand Iyer - "Apache Flink + Apach...Flink Forward
 
Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...
Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...
Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...HostedbyConfluent
 
What's New in Confluent Platform 5.5
What's New in Confluent Platform 5.5What's New in Confluent Platform 5.5
What's New in Confluent Platform 5.5confluent
 
Virtual Flink Forward 2020: A deep dive into Flink SQL - Jark Wu
Virtual Flink Forward 2020: A deep dive into Flink SQL - Jark WuVirtual Flink Forward 2020: A deep dive into Flink SQL - Jark Wu
Virtual Flink Forward 2020: A deep dive into Flink SQL - Jark WuFlink Forward
 

La actualidad más candente (19)

Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...
Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...
Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink F...
 
Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...
Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...
Flink Forward Berlin 2018: Viktor Klang - Keynote "The convergence of stream ...
 
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang Wang
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang WangVirtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang Wang
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang Wang
 
A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH
A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH
A stream: Ad-hoc Shared Stream Processing - Jeyhun Karimov, DFKI GmbH
 
From Zero to Hero with Kafka Connect
From Zero to Hero with Kafka ConnectFrom Zero to Hero with Kafka Connect
From Zero to Hero with Kafka Connect
 
Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp
Kubernetes + Operator + PaaSTA = Flink @ Yelp -  Antonio Verardi, YelpKubernetes + Operator + PaaSTA = Flink @ Yelp -  Antonio Verardi, Yelp
Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp
 
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...
 
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
 
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
Apicurio Registry: Event-driven APIs & Schema governance for Apache Kafka | F...
 
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Ver...
Towards Flink 2.0:  Unified Batch & Stream Processing - Aljoscha Krettek, Ver...Towards Flink 2.0:  Unified Batch & Stream Processing - Aljoscha Krettek, Ver...
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Ver...
 
Maximilian Michels - Flink and Beam
Maximilian Michels - Flink and BeamMaximilian Michels - Flink and Beam
Maximilian Michels - Flink and Beam
 
Virtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu Qian
Virtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu QianVirtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu Qian
Virtual Flink Forward 2020: Machine learning with Flink in Weibo - Yu Qian
 
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
 
Apache Flink @ Alibaba - Seattle Apache Flink Meetup
Apache Flink @ Alibaba - Seattle Apache Flink MeetupApache Flink @ Alibaba - Seattle Apache Flink Meetup
Apache Flink @ Alibaba - Seattle Apache Flink Meetup
 
Flink Forward San Francisco 2018 keynote: Anand Iyer - "Apache Flink + Apach...
Flink Forward San Francisco 2018 keynote:  Anand Iyer - "Apache Flink + Apach...Flink Forward San Francisco 2018 keynote:  Anand Iyer - "Apache Flink + Apach...
Flink Forward San Francisco 2018 keynote: Anand Iyer - "Apache Flink + Apach...
 
Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...
Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...
Failing to Cross the Streams – Lessons Learned the Hard Way | Philip Schmitt,...
 
What's New in Confluent Platform 5.5
What's New in Confluent Platform 5.5What's New in Confluent Platform 5.5
What's New in Confluent Platform 5.5
 
Netflix conductor
Netflix conductorNetflix conductor
Netflix conductor
 
Virtual Flink Forward 2020: A deep dive into Flink SQL - Jark Wu
Virtual Flink Forward 2020: A deep dive into Flink SQL - Jark WuVirtual Flink Forward 2020: A deep dive into Flink SQL - Jark Wu
Virtual Flink Forward 2020: A deep dive into Flink SQL - Jark Wu
 

Similar a Flink Forward San Francisco 2018: Dave Torok & Sameer Wadkar - "Embedding Flink Throughout an Operationalized Streaming ML Lifecycle"

Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...
Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...
Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...Databricks
 
Build and Manage Your APIs with Amazon API Gateway
Build and Manage Your APIs with Amazon API GatewayBuild and Manage Your APIs with Amazon API Gateway
Build and Manage Your APIs with Amazon API GatewayAmazon Web Services
 
What's New in IBM Streams V4.1
What's New in IBM Streams V4.1What's New in IBM Streams V4.1
What's New in IBM Streams V4.1lisanl
 
.NET microservices with Azure Service Fabric
.NET microservices with Azure Service Fabric.NET microservices with Azure Service Fabric
.NET microservices with Azure Service FabricDavide Benvegnù
 
Music city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lakeMusic city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lakeTimothy Spann
 
Reduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based ApplicationsReduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based ApplicationsSL Corporation
 
Modernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-ArchitectModernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-ArchitectDevOps.com
 
Data & Analytics Forum: Moving Telcos to Real Time
Data & Analytics Forum: Moving Telcos to Real TimeData & Analytics Forum: Moving Telcos to Real Time
Data & Analytics Forum: Moving Telcos to Real TimeSingleStore
 
Role of Rest vs. Web Services and EI
Role of Rest vs. Web Services and EIRole of Rest vs. Web Services and EI
Role of Rest vs. Web Services and EIWSO2
 
vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...
vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...
vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...VMware Tanzu
 
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...ssuserd3a367
 
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CAAdobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CAJaemi Bremner
 
(DVO201) Scaling Your Web Applications with AWS Elastic Beanstalk
(DVO201) Scaling Your Web Applications with AWS Elastic Beanstalk(DVO201) Scaling Your Web Applications with AWS Elastic Beanstalk
(DVO201) Scaling Your Web Applications with AWS Elastic BeanstalkAmazon Web Services
 
Cowboy Dating with Big Data or DWH Evolution in Action, Борис Трофимов
Cowboy Dating with Big Data or DWH Evolution in Action, Борис ТрофимовCowboy Dating with Big Data or DWH Evolution in Action, Борис Трофимов
Cowboy Dating with Big Data or DWH Evolution in Action, Борис ТрофимовSigma Software
 
Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?Cask Data
 
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...Spark Summit
 
Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...
Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...
Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...Amazon Web Services
 

Similar a Flink Forward San Francisco 2018: Dave Torok & Sameer Wadkar - "Embedding Flink Throughout an Operationalized Streaming ML Lifecycle" (20)

Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...
Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...
Operationalizing Machine Learning—Managing Provenance from Raw Data to Predic...
 
Build and Manage Your APIs with Amazon API Gateway
Build and Manage Your APIs with Amazon API GatewayBuild and Manage Your APIs with Amazon API Gateway
Build and Manage Your APIs with Amazon API Gateway
 
Serverless SQL
Serverless SQLServerless SQL
Serverless SQL
 
What's New in IBM Streams V4.1
What's New in IBM Streams V4.1What's New in IBM Streams V4.1
What's New in IBM Streams V4.1
 
.NET microservices with Azure Service Fabric
.NET microservices with Azure Service Fabric.NET microservices with Azure Service Fabric
.NET microservices with Azure Service Fabric
 
Music city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lakeMusic city data Hail Hydrate! from stream to lake
Music city data Hail Hydrate! from stream to lake
 
App fabric introduction
App fabric introductionApp fabric introduction
App fabric introduction
 
Reduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based ApplicationsReduce Risk with End to End Monitoring of Middleware-based Applications
Reduce Risk with End to End Monitoring of Middleware-based Applications
 
Modernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-ArchitectModernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-Architect
 
Data & Analytics Forum: Moving Telcos to Real Time
Data & Analytics Forum: Moving Telcos to Real TimeData & Analytics Forum: Moving Telcos to Real Time
Data & Analytics Forum: Moving Telcos to Real Time
 
Managing Your Cloud Assets
Managing Your Cloud AssetsManaging Your Cloud Assets
Managing Your Cloud Assets
 
Role of Rest vs. Web Services and EI
Role of Rest vs. Web Services and EIRole of Rest vs. Web Services and EI
Role of Rest vs. Web Services and EI
 
vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...
vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...
vCloud Automation Center and Pivotal Cloud Foundry – Better PaaS Solution (VM...
 
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
 
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CAAdobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
 
(DVO201) Scaling Your Web Applications with AWS Elastic Beanstalk
(DVO201) Scaling Your Web Applications with AWS Elastic Beanstalk(DVO201) Scaling Your Web Applications with AWS Elastic Beanstalk
(DVO201) Scaling Your Web Applications with AWS Elastic Beanstalk
 
Cowboy Dating with Big Data or DWH Evolution in Action, Борис Трофимов
Cowboy Dating with Big Data or DWH Evolution in Action, Борис ТрофимовCowboy Dating with Big Data or DWH Evolution in Action, Борис Трофимов
Cowboy Dating with Big Data or DWH Evolution in Action, Борис Трофимов
 
Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?
 
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
 
Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...
Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...
Using AWS to Build a Scalable Big Data Management & Processing Service (BDT40...
 

Más de Flink Forward

Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Flink Forward
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkFlink Forward
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...Flink Forward
 
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Flink Forward
 
Introducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorIntroducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorFlink Forward
 
Autoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeFlink Forward
 
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Flink Forward
 
One sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkOne sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkFlink Forward
 
Tuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxTuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxFlink Forward
 
Flink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink Forward
 
Apache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraApache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraFlink Forward
 
Where is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkWhere is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkFlink Forward
 
Using the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentUsing the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentFlink Forward
 
The Current State of Table API in 2022
The Current State of Table API in 2022The Current State of Table API in 2022
The Current State of Table API in 2022Flink Forward
 
Flink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink Forward
 
Dynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsDynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsFlink Forward
 
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotExactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotFlink Forward
 
Processing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesProcessing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesFlink Forward
 
Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Flink Forward
 
Batch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergBatch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergFlink Forward
 

Más de Flink Forward (20)

Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in Flink
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
 
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
 
Introducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorIntroducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes Operator
 
Autoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive Mode
 
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
 
One sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkOne sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async Sink
 
Tuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxTuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptx
 
Flink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at Pinterest
 
Apache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraApache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native Era
 
Where is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkWhere is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in Flink
 
Using the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentUsing the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production Deployment
 
The Current State of Table API in 2022
The Current State of Table API in 2022The Current State of Table API in 2022
The Current State of Table API in 2022
 
Flink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink SQL on Pulsar made easy
Flink SQL on Pulsar made easy
 
Dynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsDynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data Alerts
 
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotExactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
 
Processing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesProcessing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial Services
 
Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...
 
Batch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergBatch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & Iceberg
 

Último

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 

Último (20)

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Flink Forward San Francisco 2018: Dave Torok & Sameer Wadkar - "Embedding Flink Throughout an Operationalized Streaming ML Lifecycle"

  • 1. EMBEDDING FLINK THROUGHOUT AN OPERATIONALIZED STREAMING ML LIFECYCLE Dave Torok, Senior Principal Architect Sameer Wadkar, Senior Principal Architect 10 April, 2018
  • 2. 2 INTRODUCTION AND BACKGROUND CUSTOMER EXPERIENCE TEAM 27 MILLION CUSTOMERS (HIGH SPEED DATA, VIDEO, VOICE, HOME SECURITY, MOBILE) INGESTING ABOUT 2 BILLION EVENTS / MONTH SOME HIGH-VOLUME MACHINE-GENERATED EVENTS TYPICAL STREAMING DATA ARCHITECTURE DATA ETL, LAND IN A TIME SERIES DATA LAKE GREW FROM A FEW DOZEN TO 150+ DATA SOURCES / FEEDS IN ABOUT A YEAR Comcast collects, stores, and uses all data in accordance with our privacy disclosures to users and applicable laws.
  • 3. 3 BUSINESS PROBLEM INCREASE POSITIVE CUSTOMER EXPERIENCES RESOLVE POTENTIAL ISSUES CORRECTLY AND QUICKLY PREDICT AND DIAGNOSE SERVICE TROUBLE ACROSS MULTIPLE KNOWLEDGE DOMAINS REDUCE COSTS THROUGH EARLIER RESOLUTION AND BY REDUCING AVOIDABLE TECHNICIAN VISITS
  • 4. 4 TECHNICAL PROBLEM MULTIPLE PROGRAMMING AND DATA SCIENCE ENVIRONMENTS WIDESPREAD AND DISCORDANT DATA SOURCES THE “DATA PLANE” PROBLEM: COMBINING DATA AT REST AND DATA IN MOTION ML VERSIONING: DATA, CODE, FEATURES, MODELS
  • 5. 5 SOLUTION MOTIVATION SELF-SERVICE PLATFORM ALIGN DATA SCIENTISTS AND PRODUCTION MODELS TREATED AS CODE HIGH THROUGHPUT STREAM PLATFORM
  • 6. 6 MACHINE LEARNING LIFECYCLE USE CASE DEFINITION FEATURE EXPLORATION / ENGINEERING MODEL TRAINING MODEL EVALUATION MODEL ARTIFACT DELIVERY (POJO/DOCKER) MODEL SELECTION MODEL OPERATIONALIZATION MODEL PERFORMANCE MONITORING ON LIVE DATA (A/B & MULTIVARIATE TESTING) PUSH MODEL TO PRODUCTION RETRAIN MODEL ON NEWER DATA
  • 7. 7 EXAMPLE NEAR REAL TIME PREDICTION USE CASE CUSTOMER RUNS A “SPEED TEST” EVENT TRIGGERS A PREDICTION FLOW ENRICH WITH NETWORK HEALTH AND OTHER INDICATORS EXECUTE ML MODEL PREDICT WHETHER IT IS A WIFI, MODEM, OR NETWORK ISSUE Detect Enrich Predict Gather Data Event ML Model Engage Customer Act / Notify Network Diagnostic Services Slow Speed? Additional Context Services Run Prediction
  • 8. 8 ML PIPELINE ARCHITECTURE PRINCIPLES Metadata Driven Feature/Model Definition, Versioning , Feature Assembly, Model Deployment, Model Monitoring is metadata driven Automation Orchestrated Deployment for new features and models Rapid Onboarding Portal for Model and Feature Management as well Model Deployment Data Consistency Feature store enforces a consistent data pipeline ensuring that the data used for training is functionally identical to the data used for predictions Monitoring and Metrics Ability to execute & monitor multiple models in production to enable real-time metrics driven model selection Iterative/Consistent Model Development Multiple versions of the model can be developed iteratively while consuming from a consistent dataset (feature store), enables A/B & Multivariate Testing
  • 9. 9 ML PIPELINE – ROLES & WORKFLOW Define Use Case Business User Data Scientist ML Operations Explore Features Create and publish new features Create & Validate Models Model Selection Go Live with Selected Models • Define Online Feature Assembly • Define pipeline to collect outcomes • Model Deployment and Monitoring Model Review Iterate Evaluate Live Model Performance Inception Exploration Model Development Candidate Model Selection Model Operationalization Model Evaluation Go Live Phase Monitor Live ModelsCollect new data & retrain Iterate
  • 10. 1 0 WHY APACHE FLINK? UTILIZED AS ORCHESTRATION & ETL ENGINE FIRST-CLASS STREAMING MODEL PERFORMANCE RICH STATEFUL SEMANTICS TEAM EXPERIENCE OPEN SOURCE GROWING COMMUNITY Apache®, Apache Flink®, and the squirrel logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.
  • 11. 1 1 THE “DATA PLANE” PROBLEM Streaming Compute Pipeline AWS S3 HDFS Data File Abstraction Databases MODEL Streaming State Sum Avg Time Buckets Stream Data QUERY Enterprise Services Data Sets at Rest
  • 12. 1 2 ML MODEL EXECUTION MODEL EXECUTION TRIGGER 1. Payload only contains Model Name & Account Number FEATURE ASSEMBLY Model Metadata Online Feature Store 2. Model Metadata informs which features are needed for a model 3. Pull required features by account number MODEL EXECUTION 4. Pass full set of assembled features for model execution 5. Prediction
  • 13. 1 3 SOLUTION Rest Service Inputs to REST Service: 1.Model Name 2.Account No SELECT MODEL BASED ON RULES (ON- DEMAND/STREAMING) Request Initiated asynchronously via pushing it to a queue/topic INITIATE MODEL PREDICTION REQUEST (ASYNCHRONOUSLY) REQUESTING APPLICATION TRIGGER EVENT LISTENER
  • 14. 1 4 SOLUTION ASSEMBLE FEATURES FOR A GIVEN MODEL Happy Path for Model Execution – All Features Current Online Feature Store Model /Feature Metadata Feature Store API Feature Assembly Model Execution Are All Features Current? Yes Prediction/Outc ome Store Prediction Sink Store Prediction Flow Customer Context Listens PushREQUESTING APPLICATION Assemble features based on Account Number as model input Collect predictions and outcome to create datasets for model refinement Store current values of features for interactive query access
  • 15. 1 5 SOLUTION (CONT.) ASSEMBLE FEATURES FOR A GIVEN MODEL Exception Path – Some/All Features are not current Online Feature Store Model /Feature Metadata Feature Store API Feature Assembly Feature Creation Pipeline Are All Features Current? No History Feature Store Online Feature Store Back to Happy Path Feature Assembly Append store (Ex. S3, HDFS, Redshift) for use by Data Scientist for Model Training
  • 16. 1 6 SOLUTION – DIGGING DEEPER Global Window, Pane per Request Id Model Execution Requests Request Features KeyBy Request Id Apply Function Custom Evictor Model Metadata Connected Stream Periodically check if Model TTL has expired (onEventTime) Arrival of each feature triggers the model execution (onElement) Evict pane if model executed Evict pane if model request expired Execute model or expire Side Outputs Features Custom Trigger
  • 17. 1 7 FEATURE STORE TWO TYPES OF FEATURE STORES: • Online Feature Store – Current values by key (Key/Value Store) • History Feature Store – Append features as they are collected (Ex. HDFS, S3) MULTIPLE ONLINE FEATURE STORES BASED ON SLA’S • A feature can be stored in multiple online feature stores to support model specific SLA’s. TYPES OF ONLINE FEATURE STORE • PostgreSQL (AWS RDS, Aurora DB) for low volume on-demand model execution requests • HBase, DynamoDB for high volume feature ingest • Flink Queryable State for high volume ingest, high velocity model execution requests Feature Creation Pipeline History Feature Store Online Feature Store Prediction Phase Model Training Phase AppendOverwrite
  • 18. 1 8 FEATURE CREATION PIPELINES FLINK AS REAL-TIME DATA STREAM CONSUMER CUSTOM FLOWS FOR AGGREGATION FEATURES SAME DATA FLOWS FOR PREDICTION (STREAMING) & TRAINING (BATCH) • PRODUCED FEATURES UPDATE ONLINE FEATURE STORE (PREDICTION PHASE) • PRODUCED FEATURES APPENDED TO S3 OR HDFS FOR USE BY DATA SCIENTISTS (TRAINING PHASE) Aggregation Features On Demand Feature Raw Data On Demand Feature Request External Rest API Push to Feature Store
  • 19. 1 9 STREAMING FEATURE EXAMPLE KAFKA ERROR STREAM (~150 / SECOND) DETECT ACCOUNTS WITH SIGNAL ERROR WITH COUNT > 2000 IN TRAILING 24 HOURS SOLUTION: AVRO DESERIALIZER WITH KEY = ACCOUNT “24 HOUR ROLLING” HASH STRUCTURE AS STATE FILTER FUNCTION WITH SIGNAL THRESHOLD Flink Features Used: Kafka Source Keyed Stream Value State Sliding Window Filter Function
  • 20. 2 0 ON-DEMAND FEATURE EXAMPLE PREMISE HEATH TEST • DIAGNOSTIC TELEMETRY INFORMATION FOR EACH DEVICE FOR A GIVEN CUSTOMER • EXPENSIVE - ONLY REQUESTED ON DEMAND • MODELS USING SUCH A FEATURE WILL EXTRACT SUB-ELEMENTS USING SCRIPTING CAPABILITIES (MODEL METADATA & FEATURE ENGINEERING) • MODEL METADATA WILL CONTAIN TTL ATTRIBUTE FOR SUCH FEATURES INDICATING THEIR TOLERANCE FOR STALE DATA SOLUTION: MAKE AN ON-DEMAND REQUEST FOR PHT TELEMETRY DATA FOR IF IT IS STALE OR ABSENT FOR A GIVEN ACCOUNT Flink Features Used: Async Operator
  • 21. 2 1 ML PREDICTION COMPONENT • REST SERVICE • H2O.ai Model Container (POJO) • Python based service running specialized ML Models • Any stateless REST service • FLINK MAP OPERATOR • H2O.ai Model Container (POJO) wrapped in a Flink Map Operator • Possibly support native calls via Flink Map Operators running specialized Models (Ex. Tensorflow GPU based predictions) • Same Code Base • Multiple Deployment Models • REST – Low velocity, on- demand model invocations • Map Operators – High velocity, streaming model invocations
  • 22. 2 2 VERSIONING AND DEVOPS EVERYTHING IS VERSIONED • Feature/Model Metadata • Feature Data & Model Execution environments • Training, Validation datasets are versioned • Feature creation pipelines are versioned VERSIONING ALLOWS PROVENANCE & AUDITABILITY & REPEATABILITY OF EVERY PREDICTION
  • 23. 2 3 FEATURES OF THE ML PIPELINE CLOUD AGNOSTIC • Integrates with the AWS Cloud but not dependent on it • Framework should be able to work in a non-AWS distributed environment with configuration (not code) changes TRACEABILITY & REPEATABILITY & AUDITABILITY • Model to be traced back to business use- cases • Full traceability from raw data to feature engineering to predictions • “Everything Versioned” enables repeatability CI/CD SUPPORT • Code, Metadata (Hyper-Parameters) and Data (Training/Validation Data) are versioned. Deployable artifacts to integrate with CI/CD Pipeline
  • 24. 2 4 FEATURES OF THE ML PIPELINE (CONT.) MULTI-DEPLOYMENT OPTIONS • Supports Throughput vs. Latency Tradeoffs- Process in stream/batch/on- demand • Allows multiple versions of the same/different models to be compared with one another on live data • A/B testing & Multivariate testing • Live but dark deployments • Supports integration of outcomes with predictions to measure production performance & support continuous model re-training PLUGGABLE (DATA AND COMPUTE) ARCHITECTURE • De-coupled architecture based on message driven inter-component communication. • Failure of an isolated component does not fail the entire platform • Asynchronous behavior • Micro-Services based design which supports independent deployment of components
  • 25. 2 5 NEXT STEPS AND FUTURE WORK GENERATING “FLINK NATIVE” FEATURE FLOWS • Evaluating Uber’s “AthenaX” Project / Similar Approaches UI PORTAL FOR • MODEL / FEATURE AND METADATA MANAGEMENT • CONTAINERIZATION SUPPORT FOR MODEL EXECUTION PHASE • WORKBENCH FOR DATA SCIENTIST • CONTINUOUS MODEL MONITORING QUERYABLE STATE AUTOMATING THE RETRAINING PROCESS SUPPORT FOR MULTIPLE/PLUGGABLE FEATURE STORES (SLA DRIVEN)
  • 26. 2 6 SUMMARY & LESSONS LEARNED FLINK IS HELPING ACHIEVE OUR BUSINESS GOALS • Near-real-time streaming context • Container for ML Prediction Pipeline • Stateful Feature Generation • Multiple Solutions to the “Data Plane” Problem • Natural Asynchronous support • Rich windowing semantics support various aspects of our ML Pipeline (Training/Prediction/ETL) • Connected Streams simplify pushing metadata updates (reduced querying load with better performance) • Queryable State is a natural fit for high velocity and high volume data being pushed to the online feature store