SlideShare una empresa de Scribd logo
1 de 49
Descargar para leer sin conexión
WIFI SSID:SparkAISummit | Password: UnifiedAnalytics
Shivnath Babu
Cofounder/CTO, Unravel
Adjunct Professor, Duke University
An AI-powered Chatbot to
Simplify Spark Performance
Management
#UnifiedAnalytics #SparkAISummit
Meet the speaker
• Cofounder/CTO at Unravel
• Adjunct Professor of Computer Science at
Duke University
• Focusing on ease-of-use and manageability of
data-intensive systems
• Recipient of US National Science Foundation
CAREER Award, three IBM Faculty Awards,
HP Labs Innovation Research Award
3#UnifiedAnalytics #SparkAISummit
What is a Chatbot?
4#UnifiedAnalytics #SparkAISummit
A program which conducts a
conversation via text or voice
5#UnifiedAnalytics #SparkAISummit
Chatbots are making a
real difference
6#UnifiedAnalytics #SparkAISummit
7#UnifiedAnalytics #SparkAISummit
Source: https://chatbottle.co/awards/2018
8#UnifiedAnalytics #SparkAISummit
TOBi
generates
2x more
ecommerce
conversions
in ½ the time
for Vodafone
9#UnifiedAnalytics #SparkAISummit
Zara
provides fast
services to
20% of Zurich
Insurance
customers
10#UnifiedAnalytics #SparkAISummit
Woebot, the
therapist chatbot,
talks to more
people in a day
than a human
therapist does in
a lifetime
Chatbots ó Spark Performance
What is the connection?
11#UnifiedAnalytics #SparkAISummit
The happy Spark user
12#UnifiedAnalytics #SparkAISummit
• Spark is fast
• Spark has easy-to-use and
comprehensive APIs
• Wow, I can do SQL, Streaming,
AI/ML, and Graphs in one system!
• Spark has a rich ecosystem
13#UnifiedAnalytics #SparkAISummit
“I have no clue
which cloud
instance type to
pick for my
workload”
“My cloud
costs are
getting out of
control. Help!”
“I have no
idea why
my app is
slow”
“My app
failed and I
don’t know
why!”
The frustrated Spark user
• Many levels of correlated stack traces
• Identifying the root cause is hard and time consuming
14
Typical app failure in Spark
#UnifiedAnalytics #SparkAISummit
15#UnifiedAnalytics #SparkAISummit
“My app
failed and I
don’t know
why!”
Spark User Spark Chatbot
“I know that sucks! Let me take
a look here …”
“I see the problem. Executors
are running out of memory”
“Setting
spark.executor.memory to 12g
fixes the problem. I have
verified it. See this run here”
“Wow.
Thanks.
You are
awesome!”
I will show you a Chatbot that
• Makes you more productive
• Saves you time and money
• Becomes your AI-driven Spark Expert in a Bot!
16#UnifiedAnalytics #SparkAISummit
My app is too slow…
17
DATA ENGINEER
#UnifiedAnalytics #SparkAISummit
I need to make it faster…
18
DATA ENGINEER
#UnifiedAnalytics #SparkAISummit
Current approach
19
1. Review Spark/YARN UI to find the app
2. Review metrics in the UI
3. Review jobs and stages associated with the app
4. Identify all containers associated with the app
6. Identify “problematic” jobs, stages, or containers
7. Guess which parameters to tune for performance
5. Review and debug container logs
9. Rinse & repeat
8. Do trial-and-error by changing a parameter setting
#UnifiedAnalytics #SparkAISummit
There has to be a better way
20#UnifiedAnalytics #SparkAISummit
What is going on here?
21#UnifiedAnalytics #SparkAISummit
22#UnifiedAnalytics #SparkAISummit
Messaging
Platform
Bot’s NLP
Layer
Bot’s Backend
Layer
Chatbot Architecture from 30000 ft
Monitoring
Data
Historic Data
&
Probe Data
Recommendation
Algorithm
Cluster Services On-premises and Cloud
App,Goal
Orchestrator
Algorithm running in bot’s backend
Xnext
Probe Algorithm
23#UnifiedAnalytics #SparkAISummit
spark.driver.cores 2
spark.executor.cores
…
10
spark.sql.shuffle.partitions 300
spark.sql.autoBroadcastJoinThres
hold
20MB
…
SKEW('orders', 'o_custId') true
spark.catalog.cacheTable(“orders") true
…
We represent this setting as vector X X
PERFORMANCE
24
Spark tuning parameters
#UnifiedAnalytics #SparkAISummit
• Find the setting of X that
best meets the goal
• Challenge: Response
surface y = ƒ(X) is
unknown
X
PERFORMANCE
Given: App + Goal
25#UnifiedAnalytics #SparkAISummit
Model the response surface as
The Gaussian Process model captures the
uncertainty in our current knowledge of the
response surface
)()()(ˆ XZXfXy t
+= b
!!
b
!!
)(Xf t
)(XZ
X
PERFORMANCE
Challenge: Response surface
y = ƒ(X) is unknown
Here:
is a regression model
is the residual captured as a
Gaussian Process
#AI7SAIS 26
ò
=
-¥=
-=
)(
)(ˆ
*
*
)())(()(
Xyp
p
Xy dpppdfpXyXEIP
We can now estimate the expected improvement EIP(X) from
doing a probe at any setting X
Gaussian Process model helps estimate EIP(X)
Improvement at any
setting X over the best
performance seen so far
Probability density
function (uncertainty
estimate)
X
Opportunity
27
PERFORMANCE
#UnifiedAnalytics #SparkAISummit
Get initial set of
monitoring data from
history or via
probes: <X1,y1>,
<X2,y2>, …, <Xn,yn>
1
Select next probe
Xnext based on all
history and probe data
available so far to
calculate the setting
with maximum expected
improvement EIP(X)
2
Bootstrap
Probe Algorithm
Until the
stopping
condition
is
reached
#AI7SAIS 28
PERFORMANCE
X
4 6 8 10 12
02468
x1
y
4 6 8 10 12
02468
x1
y
4 6 8 10 12
02468
x1
y
4 6 8 10 12
02468
x1
y
X
Performance
U
EIP(X)
U
Xnext: Do next
probe here
This approach
balances
Exploration Vs.
Exploitation
U
Exploration
U
Exploitation
29#UnifiedAnalytics #SparkAISummit
Credit: https://discovery.rsm.nl/articles/detail/130-how-to-balance-exploration-and-exploitation-in-multinational-enterprises
Data Starved
& High Uncertainty
Data Rich
& Low
Uncertainty
30
App,Goal
Xnext
Probe Algorithm
#UnifiedAnalytics #SparkAISummit
31#UnifiedAnalytics #SparkAISummit
Messaging
Platform
Bot’s NLP
Layer
Bot’s Backend
Layer
Chatbot architecture
• Many levels of correlated stack traces
• Identifying the root cause is hard and time consuming
32
Typical app failure in Spark
#UnifiedAnalytics #SparkAISummit
Let us see a better way
33#UnifiedAnalytics #SparkAISummit
What is going on here?
34#UnifiedAnalytics #SparkAISummit
35#UnifiedAnalytics #SparkAISummit
Predictive
Model
Root cause
of the failure
App failure
App’s
Container
Logs
Error
Template
Extraction
Feature
vector
36#UnifiedAnalytics #SparkAISummit
Predictive
Model
Root cause
of the failure
App failure
App’s
Container
Logs
Error
Template
Extraction
Error
Template
Extraction
Feature
vectors
Model
Learning
Container
Logs
Root cause labels
Logs from
millions of
app failures
Label
Generation
Feature
vector
Two ways to get root-cause labels
• Manual diagnosis by a domain expert
• Automatic injection of the root cause
37#UnifiedAnalytics #SparkAISummit
Unravel’s large-scale lab framework for
automatic root cause analysis
Spark and multi-tenant Workloads:
- Variety of workloads: Batch, ML, SQL, Streaming, etc.
Failures:
- Large set of root causes learned from customers &
partners. Constantly updated
- Continuously inject these root causes to train & test
models for root-cause prediction
Environment:
- Lab created on demand on cloud or on-premises
- Workloads are run and failures are injected
38#UnifiedAnalytics #SparkAISummit
Injecting “labeled” failures
Application
Execution
Application
Monitor
FAILED
Injected
Failure
Label
Labeled
Failures
• Invalid input
• Invalid memory configuration
• OOME: Java heap space
• OOME: GC overhead limit
• Container killed by YARN
• Runtime incompatibility
Injected failure examples:
• No space left on device
• Transformations inside other
transformations
• Runtime error
• Arithmetic error
• Invalid configuration settings
Input Feature
Extraction
39
Training
data
#UnifiedAnalytics #SparkAISummit
We created a Failure Taxonomy for Labels
Configuration
Errors
Data
Errors
Resource
Errors
Deployment
Errors
Root Node
Category of failure
Input Path
Not
Available
Number
Format
Exception
SparkSQL
JsonProcessing
Exception
…
Root cause labels
40#UnifiedAnalytics #SparkAISummit
Extracting input features from logs
java.lang.OutOfMemoryError: Java heap space
at
scala.reflect.ManifestFactory$$anon$9.newArray(Manifest.scala:114)
at
scala.reflect.ManifestFactory$$anon$9.newArray(Manifest.scala:112)
at …
• Extracting stack traces and error messages
• Tokenize by class names and words
Tokens example:
java.lang.OutOfmemoryError Java heap space at
scala.reflect.ManifestFactory$$anon$9.newArray(Manife
st.scala:114)
41#UnifiedAnalytics #SparkAISummit
Input feature extraction
• Bag of Words with TF-IDF
– Computes a vocabulary of words
– Uses TF-IDF to reflect importance of words in a document
• Doc2Vec
– Maps words, paragraphs, or documents to multi-dimensional vectors
– Evaluates the placement of words wrt neighboring words
– Uses a 3-layer neural network
42#Exp8SAIS
43#UnifiedAnalytics #SparkAISummit
Predictive
Model
Root cause
of the failure
App failure
App’s
Container
Logs
Error
Template
Extraction
Error
Template
Extraction
Feature
vectors
Model
Learning
Container
Logs
Root cause labels
Logs from
millions of
app failures
Label
Generation
Feature
vector
Learning the predictive model
• Shallow Learning
– Logistic Regression
– Random forests
• Deep Learning
– Neural networks
44
• Training and testing with injected failures
• Test to train data set ratio 75% to 25%
• Models: logistic regression, random forests
80
85
90
95
100
TF-IDF Doc2Vec
AccuracyScore
[%]
Logistic Regression Random Forests
#UnifiedAnalytics #SparkAISummit
45#UnifiedAnalytics #SparkAISummit
Messaging
Platform
Bot’s NLP
Layer
Bot’s Backend
Layer
The NLP element in the Chatbot
Algorithm
Compute
Storage
46#UnifiedAnalytics #SparkAISummit
Extract
the intent
Intent =
AppAutoTune
Entities: {
AppName =
‘CEO report’,
TuningGoal =
Speedup }
Invoke app
autotuning
algorithm
How can I make
CEO report query
faster Tune an app
Fetch a metric
Generate a report
Set an alert
Diagnose a failure
…
Extract entities
for the intent
Take
action
The NLP element in the Chatbot
Many use cases can be addressed
• Who are the top resource-wasting users on the cluster?
• Which app is causing contention on the cluster?
• Why is my app stuck?
• Alert me if my query fails
• Which part of my query failed?
• Kill the sales report BI app if it uses more than $25
• And many more …
47#UnifiedAnalytics #SparkAISummit
In summary
• AI-driven Spark Expert in a Bot!
– Makes you more productive
– Saves you time and money
48#UnifiedAnalytics #SparkAISummit
Sign up for a free trial, we value your feedback!
http://unraveldata.com/free-trial
And yes, we are hiring @ Unravel
shivnath@unraveldata.com
DON’T FORGET TO RATE
AND REVIEW THE SESSIONS
SEARCH SPARK + AI SUMMIT

Más contenido relacionado

La actualidad más candente

Performance Optimizations in Apache Impala
Performance Optimizations in Apache ImpalaPerformance Optimizations in Apache Impala
Performance Optimizations in Apache Impala
Cloudera, Inc.
 
Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...
Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...
Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...
Shirshanka Das
 

La actualidad más candente (20)

Databricks Delta Lake and Its Benefits
Databricks Delta Lake and Its BenefitsDatabricks Delta Lake and Its Benefits
Databricks Delta Lake and Its Benefits
 
Apache Spark and the Hadoop Ecosystem on AWS
Apache Spark and the Hadoop Ecosystem on AWSApache Spark and the Hadoop Ecosystem on AWS
Apache Spark and the Hadoop Ecosystem on AWS
 
Scaling Apache Spark at Facebook
Scaling Apache Spark at FacebookScaling Apache Spark at Facebook
Scaling Apache Spark at Facebook
 
ORC Column Encryption
ORC Column EncryptionORC Column Encryption
ORC Column Encryption
 
Introduction to Amazon Redshift
Introduction to Amazon RedshiftIntroduction to Amazon Redshift
Introduction to Amazon Redshift
 
Performance Optimizations in Apache Impala
Performance Optimizations in Apache ImpalaPerformance Optimizations in Apache Impala
Performance Optimizations in Apache Impala
 
Building a Virtual Data Lake with Apache Arrow
Building a Virtual Data Lake with Apache ArrowBuilding a Virtual Data Lake with Apache Arrow
Building a Virtual Data Lake with Apache Arrow
 
Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...
Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...
Taming the ever-evolving Compliance Beast : Lessons learnt at LinkedIn [Strat...
 
File Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & ParquetFile Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & Parquet
 
Amazon Redshift Masterclass
Amazon Redshift MasterclassAmazon Redshift Masterclass
Amazon Redshift Masterclass
 
Understanding Memory Management In Spark For Fun And Profit
Understanding Memory Management In Spark For Fun And ProfitUnderstanding Memory Management In Spark For Fun And Profit
Understanding Memory Management In Spark For Fun And Profit
 
The roadmap for sql server 2019
The roadmap for sql server 2019The roadmap for sql server 2019
The roadmap for sql server 2019
 
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
 
Protect your private data with ORC column encryption
Protect your private data with ORC column encryptionProtect your private data with ORC column encryption
Protect your private data with ORC column encryption
 
Protect your Private Data in your Hadoop Clusters with ORC Column Encryption
Protect your Private Data in your Hadoop Clusters with ORC Column EncryptionProtect your Private Data in your Hadoop Clusters with ORC Column Encryption
Protect your Private Data in your Hadoop Clusters with ORC Column Encryption
 
Achieving Lakehouse Models with Spark 3.0
Achieving Lakehouse Models with Spark 3.0Achieving Lakehouse Models with Spark 3.0
Achieving Lakehouse Models with Spark 3.0
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Delta from a Data Engineer's Perspective
Delta from a Data Engineer's PerspectiveDelta from a Data Engineer's Perspective
Delta from a Data Engineer's Perspective
 
Lessons from the Field: Applying Best Practices to Your Apache Spark Applicat...
Lessons from the Field: Applying Best Practices to Your Apache Spark Applicat...Lessons from the Field: Applying Best Practices to Your Apache Spark Applicat...
Lessons from the Field: Applying Best Practices to Your Apache Spark Applicat...
 

Similar a An AI-Powered Chatbot to Simplify Apache Spark Performance Management

Connecting the Dots: Integrating Apache Spark into Production Pipelines
Connecting the Dots: Integrating Apache Spark into Production PipelinesConnecting the Dots: Integrating Apache Spark into Production Pipelines
Connecting the Dots: Integrating Apache Spark into Production Pipelines
Databricks
 
Media_Entertainment_Veriticals
Media_Entertainment_VeriticalsMedia_Entertainment_Veriticals
Media_Entertainment_Veriticals
Peyman Mohajerian
 

Similar a An AI-Powered Chatbot to Simplify Apache Spark Performance Management (20)

Apache Spark Data Validation
Apache Spark Data ValidationApache Spark Data Validation
Apache Spark Data Validation
 
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
 
Internals of Speeding up PySpark with Arrow
 Internals of Speeding up PySpark with Arrow Internals of Speeding up PySpark with Arrow
Internals of Speeding up PySpark with Arrow
 
Scaling ML-Based Threat Detection For Production Cyber Attacks
Scaling ML-Based Threat Detection For Production Cyber AttacksScaling ML-Based Threat Detection For Production Cyber Attacks
Scaling ML-Based Threat Detection For Production Cyber Attacks
 
Connecting the Dots: Integrating Apache Spark into Production Pipelines
Connecting the Dots: Integrating Apache Spark into Production PipelinesConnecting the Dots: Integrating Apache Spark into Production Pipelines
Connecting the Dots: Integrating Apache Spark into Production Pipelines
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 
Databricks Meetup @ Los Angeles Apache Spark User Group
Databricks Meetup @ Los Angeles Apache Spark User GroupDatabricks Meetup @ Los Angeles Apache Spark User Group
Databricks Meetup @ Los Angeles Apache Spark User Group
 
Getting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise Hands-OnGetting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise Hands-On
 
Big Data 2.0 - How Spark technologies are reshaping the world of big data ana...
Big Data 2.0 - How Spark technologies are reshaping the world of big data ana...Big Data 2.0 - How Spark technologies are reshaping the world of big data ana...
Big Data 2.0 - How Spark technologies are reshaping the world of big data ana...
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
The Azure Cognitive Services on Spark: Clusters with Embedded Intelligent Ser...
The Azure Cognitive Services on Spark: Clusters with Embedded Intelligent Ser...The Azure Cognitive Services on Spark: Clusters with Embedded Intelligent Ser...
The Azure Cognitive Services on Spark: Clusters with Embedded Intelligent Ser...
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Tiny Batches, in the wine: Shiny New Bits in Spark StreamingTiny Batches, in the wine: Shiny New Bits in Spark Streaming
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Hybrid Transactional/Analytics Processing with Spark and IMDGs
Hybrid Transactional/Analytics Processing with Spark and IMDGsHybrid Transactional/Analytics Processing with Spark and IMDGs
Hybrid Transactional/Analytics Processing with Spark and IMDGs
 
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache SparkData-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
 
Media_Entertainment_Veriticals
Media_Entertainment_VeriticalsMedia_Entertainment_Veriticals
Media_Entertainment_Veriticals
 
ITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data Lake
ITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data LakeITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data Lake
ITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data Lake
 
Strata EU 2014: Spark Streaming Case Studies
Strata EU 2014: Spark Streaming Case StudiesStrata EU 2014: Spark Streaming Case Studies
Strata EU 2014: Spark Streaming Case Studies
 

Más de Databricks

Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 

Más de Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Último

Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
SayantanBiswas37
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
gajnagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
Health
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
HyderabadDolls
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 

Último (20)

Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 

An AI-Powered Chatbot to Simplify Apache Spark Performance Management

  • 1. WIFI SSID:SparkAISummit | Password: UnifiedAnalytics
  • 2. Shivnath Babu Cofounder/CTO, Unravel Adjunct Professor, Duke University An AI-powered Chatbot to Simplify Spark Performance Management #UnifiedAnalytics #SparkAISummit
  • 3. Meet the speaker • Cofounder/CTO at Unravel • Adjunct Professor of Computer Science at Duke University • Focusing on ease-of-use and manageability of data-intensive systems • Recipient of US National Science Foundation CAREER Award, three IBM Faculty Awards, HP Labs Innovation Research Award 3#UnifiedAnalytics #SparkAISummit
  • 4. What is a Chatbot? 4#UnifiedAnalytics #SparkAISummit
  • 5. A program which conducts a conversation via text or voice 5#UnifiedAnalytics #SparkAISummit
  • 6. Chatbots are making a real difference 6#UnifiedAnalytics #SparkAISummit
  • 10. 10#UnifiedAnalytics #SparkAISummit Woebot, the therapist chatbot, talks to more people in a day than a human therapist does in a lifetime
  • 11. Chatbots ó Spark Performance What is the connection? 11#UnifiedAnalytics #SparkAISummit
  • 12. The happy Spark user 12#UnifiedAnalytics #SparkAISummit • Spark is fast • Spark has easy-to-use and comprehensive APIs • Wow, I can do SQL, Streaming, AI/ML, and Graphs in one system! • Spark has a rich ecosystem
  • 13. 13#UnifiedAnalytics #SparkAISummit “I have no clue which cloud instance type to pick for my workload” “My cloud costs are getting out of control. Help!” “I have no idea why my app is slow” “My app failed and I don’t know why!” The frustrated Spark user
  • 14. • Many levels of correlated stack traces • Identifying the root cause is hard and time consuming 14 Typical app failure in Spark #UnifiedAnalytics #SparkAISummit
  • 15. 15#UnifiedAnalytics #SparkAISummit “My app failed and I don’t know why!” Spark User Spark Chatbot “I know that sucks! Let me take a look here …” “I see the problem. Executors are running out of memory” “Setting spark.executor.memory to 12g fixes the problem. I have verified it. See this run here” “Wow. Thanks. You are awesome!”
  • 16. I will show you a Chatbot that • Makes you more productive • Saves you time and money • Becomes your AI-driven Spark Expert in a Bot! 16#UnifiedAnalytics #SparkAISummit
  • 17. My app is too slow… 17 DATA ENGINEER #UnifiedAnalytics #SparkAISummit
  • 18. I need to make it faster… 18 DATA ENGINEER #UnifiedAnalytics #SparkAISummit
  • 19. Current approach 19 1. Review Spark/YARN UI to find the app 2. Review metrics in the UI 3. Review jobs and stages associated with the app 4. Identify all containers associated with the app 6. Identify “problematic” jobs, stages, or containers 7. Guess which parameters to tune for performance 5. Review and debug container logs 9. Rinse & repeat 8. Do trial-and-error by changing a parameter setting #UnifiedAnalytics #SparkAISummit
  • 20. There has to be a better way 20#UnifiedAnalytics #SparkAISummit
  • 21. What is going on here? 21#UnifiedAnalytics #SparkAISummit
  • 23. Monitoring Data Historic Data & Probe Data Recommendation Algorithm Cluster Services On-premises and Cloud App,Goal Orchestrator Algorithm running in bot’s backend Xnext Probe Algorithm 23#UnifiedAnalytics #SparkAISummit
  • 24. spark.driver.cores 2 spark.executor.cores … 10 spark.sql.shuffle.partitions 300 spark.sql.autoBroadcastJoinThres hold 20MB … SKEW('orders', 'o_custId') true spark.catalog.cacheTable(“orders") true … We represent this setting as vector X X PERFORMANCE 24 Spark tuning parameters #UnifiedAnalytics #SparkAISummit
  • 25. • Find the setting of X that best meets the goal • Challenge: Response surface y = ƒ(X) is unknown X PERFORMANCE Given: App + Goal 25#UnifiedAnalytics #SparkAISummit
  • 26. Model the response surface as The Gaussian Process model captures the uncertainty in our current knowledge of the response surface )()()(ˆ XZXfXy t += b !! b !! )(Xf t )(XZ X PERFORMANCE Challenge: Response surface y = ƒ(X) is unknown Here: is a regression model is the residual captured as a Gaussian Process #AI7SAIS 26
  • 27. ò = -¥= -= )( )(ˆ * * )())(()( Xyp p Xy dpppdfpXyXEIP We can now estimate the expected improvement EIP(X) from doing a probe at any setting X Gaussian Process model helps estimate EIP(X) Improvement at any setting X over the best performance seen so far Probability density function (uncertainty estimate) X Opportunity 27 PERFORMANCE #UnifiedAnalytics #SparkAISummit
  • 28. Get initial set of monitoring data from history or via probes: <X1,y1>, <X2,y2>, …, <Xn,yn> 1 Select next probe Xnext based on all history and probe data available so far to calculate the setting with maximum expected improvement EIP(X) 2 Bootstrap Probe Algorithm Until the stopping condition is reached #AI7SAIS 28 PERFORMANCE X
  • 29. 4 6 8 10 12 02468 x1 y 4 6 8 10 12 02468 x1 y 4 6 8 10 12 02468 x1 y 4 6 8 10 12 02468 x1 y X Performance U EIP(X) U Xnext: Do next probe here This approach balances Exploration Vs. Exploitation U Exploration U Exploitation 29#UnifiedAnalytics #SparkAISummit
  • 30. Credit: https://discovery.rsm.nl/articles/detail/130-how-to-balance-exploration-and-exploitation-in-multinational-enterprises Data Starved & High Uncertainty Data Rich & Low Uncertainty 30 App,Goal Xnext Probe Algorithm #UnifiedAnalytics #SparkAISummit
  • 32. • Many levels of correlated stack traces • Identifying the root cause is hard and time consuming 32 Typical app failure in Spark #UnifiedAnalytics #SparkAISummit
  • 33. Let us see a better way 33#UnifiedAnalytics #SparkAISummit
  • 34. What is going on here? 34#UnifiedAnalytics #SparkAISummit
  • 35. 35#UnifiedAnalytics #SparkAISummit Predictive Model Root cause of the failure App failure App’s Container Logs Error Template Extraction Feature vector
  • 36. 36#UnifiedAnalytics #SparkAISummit Predictive Model Root cause of the failure App failure App’s Container Logs Error Template Extraction Error Template Extraction Feature vectors Model Learning Container Logs Root cause labels Logs from millions of app failures Label Generation Feature vector
  • 37. Two ways to get root-cause labels • Manual diagnosis by a domain expert • Automatic injection of the root cause 37#UnifiedAnalytics #SparkAISummit
  • 38. Unravel’s large-scale lab framework for automatic root cause analysis Spark and multi-tenant Workloads: - Variety of workloads: Batch, ML, SQL, Streaming, etc. Failures: - Large set of root causes learned from customers & partners. Constantly updated - Continuously inject these root causes to train & test models for root-cause prediction Environment: - Lab created on demand on cloud or on-premises - Workloads are run and failures are injected 38#UnifiedAnalytics #SparkAISummit
  • 39. Injecting “labeled” failures Application Execution Application Monitor FAILED Injected Failure Label Labeled Failures • Invalid input • Invalid memory configuration • OOME: Java heap space • OOME: GC overhead limit • Container killed by YARN • Runtime incompatibility Injected failure examples: • No space left on device • Transformations inside other transformations • Runtime error • Arithmetic error • Invalid configuration settings Input Feature Extraction 39 Training data #UnifiedAnalytics #SparkAISummit
  • 40. We created a Failure Taxonomy for Labels Configuration Errors Data Errors Resource Errors Deployment Errors Root Node Category of failure Input Path Not Available Number Format Exception SparkSQL JsonProcessing Exception … Root cause labels 40#UnifiedAnalytics #SparkAISummit
  • 41. Extracting input features from logs java.lang.OutOfMemoryError: Java heap space at scala.reflect.ManifestFactory$$anon$9.newArray(Manifest.scala:114) at scala.reflect.ManifestFactory$$anon$9.newArray(Manifest.scala:112) at … • Extracting stack traces and error messages • Tokenize by class names and words Tokens example: java.lang.OutOfmemoryError Java heap space at scala.reflect.ManifestFactory$$anon$9.newArray(Manife st.scala:114) 41#UnifiedAnalytics #SparkAISummit
  • 42. Input feature extraction • Bag of Words with TF-IDF – Computes a vocabulary of words – Uses TF-IDF to reflect importance of words in a document • Doc2Vec – Maps words, paragraphs, or documents to multi-dimensional vectors – Evaluates the placement of words wrt neighboring words – Uses a 3-layer neural network 42#Exp8SAIS
  • 43. 43#UnifiedAnalytics #SparkAISummit Predictive Model Root cause of the failure App failure App’s Container Logs Error Template Extraction Error Template Extraction Feature vectors Model Learning Container Logs Root cause labels Logs from millions of app failures Label Generation Feature vector
  • 44. Learning the predictive model • Shallow Learning – Logistic Regression – Random forests • Deep Learning – Neural networks 44 • Training and testing with injected failures • Test to train data set ratio 75% to 25% • Models: logistic regression, random forests 80 85 90 95 100 TF-IDF Doc2Vec AccuracyScore [%] Logistic Regression Random Forests #UnifiedAnalytics #SparkAISummit
  • 45. 45#UnifiedAnalytics #SparkAISummit Messaging Platform Bot’s NLP Layer Bot’s Backend Layer The NLP element in the Chatbot Algorithm Compute Storage
  • 46. 46#UnifiedAnalytics #SparkAISummit Extract the intent Intent = AppAutoTune Entities: { AppName = ‘CEO report’, TuningGoal = Speedup } Invoke app autotuning algorithm How can I make CEO report query faster Tune an app Fetch a metric Generate a report Set an alert Diagnose a failure … Extract entities for the intent Take action The NLP element in the Chatbot
  • 47. Many use cases can be addressed • Who are the top resource-wasting users on the cluster? • Which app is causing contention on the cluster? • Why is my app stuck? • Alert me if my query fails • Which part of my query failed? • Kill the sales report BI app if it uses more than $25 • And many more … 47#UnifiedAnalytics #SparkAISummit
  • 48. In summary • AI-driven Spark Expert in a Bot! – Makes you more productive – Saves you time and money 48#UnifiedAnalytics #SparkAISummit Sign up for a free trial, we value your feedback! http://unraveldata.com/free-trial And yes, we are hiring @ Unravel shivnath@unraveldata.com
  • 49. DON’T FORGET TO RATE AND REVIEW THE SESSIONS SEARCH SPARK + AI SUMMIT