SlideShare una empresa de Scribd logo
1 de 15
Apache Zeppelin
The (very) short field trip
by G.Alléon & G.DupontTDS meetup - 2016.06.30
Who are we?
Guillaume Alleon - AIRBUS Group Innovation (corporate research center)
Research leader for more than 30 people from UK to China, tackling problems in massive data processing
and information extraction.
Was already in “big data” when it was still called HPC…
Gerard Dupont - AIRBUS Defence & Space (space systems)
Technical coordinator for R&T studies on distributed processing systems.
Spend way too much time processing web data for intelligence, now looking to the sky (satellite data ;-)
Zeppelin moto
“A web-based notebook that enables interactive data analytics.”
Origins & history
Missing piece in HADOOP landscape: a modern analytic playground.
2012.12 - Data analytics
solution (NFLabs)
2013.10 - Opensourced
2014.12 - ASF incubation
2015 - 3 stable releases
2016.05 - Maturing to Apache
top level project
3000 feet view
What’s cool about Zeppelin
⊕interactive
⊕out-of-the-box spark integration
⊕out-of-the-box visualization options
⊕direct access to DOM for customized visualization
⊕nice UI (bootstrap & angular)
⊕notebook run scheduler
⊕easy to configure
⊕extensibility, extensibility and extensibility...
What’s cool about Zeppelin
⊕interactive
⊕out-of-the-box spark integration
⊕out-of-the-box visualization options
⊕direct access to DOM for customized visualization
⊕nice UI (bootstrap & angular)
⊕notebook run scheduler
⊕easy to configure
⊕extensibility, extensibility and extensibility...
… the dark side
⊝hard to install
⊝need to build from the source
(for customized version)
⊝not (yet) multi-users
Overview/look & feel
Interpreter text
(aka your code)
Interpreter config
Interactive results
DEMO time
credits: https://www.weasyl.com/~uszatyarbuz
Under the hood
○ Interpreter isolation with their
own JVM
○ Dynamic dependencies loading
○ REST & websocket on front
○ Thrift in back
(or whatever you add)
○ Process scheduler (cron-like)
Roadmap
Enterprise Ready
○ Multi-tenancy
○ Job scheduler
○ HA
Usability Improvement
○ UX improvement
○ Table data support
○ Dynamic interpreter integration
○ Reusable analytic application catalog
Thx
Offical website: https://zeppelin.apache.org/
Notebook sample: https://www.zeppelinhub.com/viewer
Source code: https://github.com/apache/incubator-zeppelin
Mailing lists: http://zeppelin.apache.org/community.html
This TDS notebook: http://tinyurl.com/zeppelin-tds
Sources for this presentation:
○ http://www.slideshare.net/FlinkForward/moon-soo-lee-data-science-lifecycle-with-apache-flink-and-apache-zeppelin/23
○ http://www.slideshare.net/HadoopSummit/apache-zeppelin-helium-and-beyond
○ http://www.slideshare.net/felixcss/interactive-data-science-from-scratch-with-apache-zeppelin-and-apache-spark
○ http://www.slideshare.net/BrunoBonnin/explorez-vos-donnes-avec-apache-zeppelin
credits: https://www.weasyl.com/~uszatyarbuz
BACKUP
Origins & history
Active core teams
Descent number of
external contributors
Plenty of interpreters
(official and external)
0.6.0-SNAPSHOT
(pending stabilization)
3000 feet view

Más contenido relacionado

La actualidad más candente

Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...
The HDF-EOS Tools and Information Center
 

La actualidad más candente (20)

Introduction NL-HUG (April)
Introduction NL-HUG (April)Introduction NL-HUG (April)
Introduction NL-HUG (April)
 
Session 09 learning relationships.pptx
Session 09 learning relationships.pptxSession 09 learning relationships.pptx
Session 09 learning relationships.pptx
 
Notes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop MapreduceNotes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop Mapreduce
 
Introduction to TensorFlow
Introduction to TensorFlowIntroduction to TensorFlow
Introduction to TensorFlow
 
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon UniversityText Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University
 
Sociopath presentation
Sociopath presentationSociopath presentation
Sociopath presentation
 
Your data isn't that big @ Big Things Meetup 2016-05-16
Your data isn't that big @ Big Things Meetup 2016-05-16Your data isn't that big @ Big Things Meetup 2016-05-16
Your data isn't that big @ Big Things Meetup 2016-05-16
 
Large-Scale Data Storage and Processing for Scientists with Hadoop
Large-Scale Data Storage and Processing for Scientists with HadoopLarge-Scale Data Storage and Processing for Scientists with Hadoop
Large-Scale Data Storage and Processing for Scientists with Hadoop
 
Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...
 
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
 
Hadoop Jute Record Python
Hadoop Jute Record PythonHadoop Jute Record Python
Hadoop Jute Record Python
 
Nov HUG 2009: Hadoop Record Reader In Python
Nov HUG 2009: Hadoop Record Reader In PythonNov HUG 2009: Hadoop Record Reader In Python
Nov HUG 2009: Hadoop Record Reader In Python
 
Intro to Python
Intro to PythonIntro to Python
Intro to Python
 
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
 
Big data
Big dataBig data
Big data
 
Tech Talk - Underutilized Resources in Distributed System
Tech Talk - Underutilized Resources in Distributed SystemTech Talk - Underutilized Resources in Distributed System
Tech Talk - Underutilized Resources in Distributed System
 
PetaPG
PetaPGPetaPG
PetaPG
 
simple introduction to hadoop
simple introduction to hadoopsimple introduction to hadoop
simple introduction to hadoop
 
Machine learning in python course contents
Machine learning in python course contentsMachine learning in python course contents
Machine learning in python course contents
 
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
 

Destacado

The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T...
The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...
The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T...
SAP Ariba
 
Voorstelling 4Betterresults_voor_KMOs_en_ondernemers
Voorstelling 4Betterresults_voor_KMOs_en_ondernemersVoorstelling 4Betterresults_voor_KMOs_en_ondernemers
Voorstelling 4Betterresults_voor_KMOs_en_ondernemers
Kurt Vandewalle
 
[biurowi 5 - en] basic principles of fire protection
[biurowi 5 - en] basic principles of fire protection[biurowi 5 - en] basic principles of fire protection
[biurowi 5 - en] basic principles of fire protection
AktywBHP
 
Chinese link textbook Lesson 6 vocabulary
Chinese link textbook Lesson 6 vocabulary Chinese link textbook Lesson 6 vocabulary
Chinese link textbook Lesson 6 vocabulary
Joanne Chen
 

Destacado (18)

Ashcraft.edu103.module3
Ashcraft.edu103.module3Ashcraft.edu103.module3
Ashcraft.edu103.module3
 
Jennyresumen
JennyresumenJennyresumen
Jennyresumen
 
Unit 8 - Textbook Lesson 1
Unit 8 - Textbook Lesson 1Unit 8 - Textbook Lesson 1
Unit 8 - Textbook Lesson 1
 
The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T...
The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...
The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T...
 
Pedal to the Metal: Accelerating Spark with Silicon Innovation
Pedal to the Metal: Accelerating Spark with Silicon InnovationPedal to the Metal: Accelerating Spark with Silicon Innovation
Pedal to the Metal: Accelerating Spark with Silicon Innovation
 
Voorstelling 4Betterresults_voor_KMOs_en_ondernemers
Voorstelling 4Betterresults_voor_KMOs_en_ondernemersVoorstelling 4Betterresults_voor_KMOs_en_ondernemers
Voorstelling 4Betterresults_voor_KMOs_en_ondernemers
 
清明節
清明節清明節
清明節
 
Diccionario logístico
Diccionario logísticoDiccionario logístico
Diccionario logístico
 
Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015
Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015
Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015
 
Postgres & Redis Sitting in a Tree- Rimas Silkaitis, Heroku
Postgres & Redis Sitting in a Tree- Rimas Silkaitis, HerokuPostgres & Redis Sitting in a Tree- Rimas Silkaitis, Heroku
Postgres & Redis Sitting in a Tree- Rimas Silkaitis, Heroku
 
我想請你吃飯 (繁体)
我想請你吃飯 (繁体)我想請你吃飯 (繁体)
我想請你吃飯 (繁体)
 
HIgh Performance Redis- Tague Griffith, GoPro
HIgh Performance Redis- Tague Griffith, GoProHIgh Performance Redis- Tague Griffith, GoPro
HIgh Performance Redis- Tague Griffith, GoPro
 
Condicionamiento y aprendizaje
Condicionamiento y aprendizajeCondicionamiento y aprendizaje
Condicionamiento y aprendizaje
 
[biurowi 5 - en] basic principles of fire protection
[biurowi 5 - en] basic principles of fire protection[biurowi 5 - en] basic principles of fire protection
[biurowi 5 - en] basic principles of fire protection
 
Lista de verbos Irregulares - Inglés
Lista de verbos Irregulares - InglésLista de verbos Irregulares - Inglés
Lista de verbos Irregulares - Inglés
 
Chinese Link Lesson 20 worksheet 2016
Chinese Link Lesson 20  worksheet 2016Chinese Link Lesson 20  worksheet 2016
Chinese Link Lesson 20 worksheet 2016
 
Chinese link textbook Lesson 6 vocabulary
Chinese link textbook Lesson 6 vocabulary Chinese link textbook Lesson 6 vocabulary
Chinese link textbook Lesson 6 vocabulary
 
A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak...
A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak...A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak...
A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak...
 

Similar a Toulouse Data Science meetup - Apache zeppelin

Unified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AIUnified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AI
Alluxio, Inc.
 
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Andrey Vykhodtsev
 

Similar a Toulouse Data Science meetup - Apache zeppelin (20)

Unified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AIUnified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AI
 
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
 
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and CloudsArchitecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
 
Working together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURFWorking together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURF
 
Google Cloud Platform for Data Science teams
Google Cloud Platform for Data Science teamsGoogle Cloud Platform for Data Science teams
Google Cloud Platform for Data Science teams
 
Optimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with AlluxioOptimizing Data Access for Analytics And AI with Alluxio
Optimizing Data Access for Analytics And AI with Alluxio
 
Hopsworks in the cloud Berlin Buzzwords 2019
Hopsworks in the cloud Berlin Buzzwords 2019 Hopsworks in the cloud Berlin Buzzwords 2019
Hopsworks in the cloud Berlin Buzzwords 2019
 
Run your code serverlessly on Google's open cloud
Run your code serverlessly on Google's open cloudRun your code serverlessly on Google's open cloud
Run your code serverlessly on Google's open cloud
 
Developing and releasing SOFA Statistics
Developing and releasing SOFA StatisticsDeveloping and releasing SOFA Statistics
Developing and releasing SOFA Statistics
 
Large scale, interactive ad-hoc queries over different datastores with Apache...
Large scale, interactive ad-hoc queries over different datastores with Apache...Large scale, interactive ad-hoc queries over different datastores with Apache...
Large scale, interactive ad-hoc queries over different datastores with Apache...
 
PyData Boston 2013
PyData Boston 2013PyData Boston 2013
PyData Boston 2013
 
Containers for sensor web services, applications and research @ Sensor Web Co...
Containers for sensor web services, applications and research @ Sensor Web Co...Containers for sensor web services, applications and research @ Sensor Web Co...
Containers for sensor web services, applications and research @ Sensor Web Co...
 
Hambug R Meetup - Intro to H2O
Hambug R Meetup - Intro to H2OHambug R Meetup - Intro to H2O
Hambug R Meetup - Intro to H2O
 
Netflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering MeetupNetflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering Meetup
 
Berlin Hadoop Get Together Apache Drill
Berlin Hadoop Get Together Apache Drill Berlin Hadoop Get Together Apache Drill
Berlin Hadoop Get Together Apache Drill
 
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
 
Large Scale Deep Learning with TensorFlow
Large Scale Deep Learning with TensorFlow Large Scale Deep Learning with TensorFlow
Large Scale Deep Learning with TensorFlow
 
Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016
Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016
Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016
 
Data pipelines from zero to solid
Data pipelines from zero to solidData pipelines from zero to solid
Data pipelines from zero to solid
 
Accelerating Spark with Kubernetes
Accelerating Spark with KubernetesAccelerating Spark with Kubernetes
Accelerating Spark with Kubernetes
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 

Toulouse Data Science meetup - Apache zeppelin

  • 1. Apache Zeppelin The (very) short field trip by G.Alléon & G.DupontTDS meetup - 2016.06.30
  • 2. Who are we? Guillaume Alleon - AIRBUS Group Innovation (corporate research center) Research leader for more than 30 people from UK to China, tackling problems in massive data processing and information extraction. Was already in “big data” when it was still called HPC… Gerard Dupont - AIRBUS Defence & Space (space systems) Technical coordinator for R&T studies on distributed processing systems. Spend way too much time processing web data for intelligence, now looking to the sky (satellite data ;-)
  • 3. Zeppelin moto “A web-based notebook that enables interactive data analytics.”
  • 4. Origins & history Missing piece in HADOOP landscape: a modern analytic playground. 2012.12 - Data analytics solution (NFLabs) 2013.10 - Opensourced 2014.12 - ASF incubation 2015 - 3 stable releases 2016.05 - Maturing to Apache top level project
  • 6. What’s cool about Zeppelin ⊕interactive ⊕out-of-the-box spark integration ⊕out-of-the-box visualization options ⊕direct access to DOM for customized visualization ⊕nice UI (bootstrap & angular) ⊕notebook run scheduler ⊕easy to configure ⊕extensibility, extensibility and extensibility...
  • 7. What’s cool about Zeppelin ⊕interactive ⊕out-of-the-box spark integration ⊕out-of-the-box visualization options ⊕direct access to DOM for customized visualization ⊕nice UI (bootstrap & angular) ⊕notebook run scheduler ⊕easy to configure ⊕extensibility, extensibility and extensibility... … the dark side ⊝hard to install ⊝need to build from the source (for customized version) ⊝not (yet) multi-users
  • 8. Overview/look & feel Interpreter text (aka your code) Interpreter config Interactive results
  • 10. Under the hood ○ Interpreter isolation with their own JVM ○ Dynamic dependencies loading ○ REST & websocket on front ○ Thrift in back (or whatever you add) ○ Process scheduler (cron-like)
  • 11. Roadmap Enterprise Ready ○ Multi-tenancy ○ Job scheduler ○ HA Usability Improvement ○ UX improvement ○ Table data support ○ Dynamic interpreter integration ○ Reusable analytic application catalog
  • 12. Thx Offical website: https://zeppelin.apache.org/ Notebook sample: https://www.zeppelinhub.com/viewer Source code: https://github.com/apache/incubator-zeppelin Mailing lists: http://zeppelin.apache.org/community.html This TDS notebook: http://tinyurl.com/zeppelin-tds Sources for this presentation: ○ http://www.slideshare.net/FlinkForward/moon-soo-lee-data-science-lifecycle-with-apache-flink-and-apache-zeppelin/23 ○ http://www.slideshare.net/HadoopSummit/apache-zeppelin-helium-and-beyond ○ http://www.slideshare.net/felixcss/interactive-data-science-from-scratch-with-apache-zeppelin-and-apache-spark ○ http://www.slideshare.net/BrunoBonnin/explorez-vos-donnes-avec-apache-zeppelin credits: https://www.weasyl.com/~uszatyarbuz
  • 14. Origins & history Active core teams Descent number of external contributors Plenty of interpreters (official and external) 0.6.0-SNAPSHOT (pending stabilization)

Notas del editor

  1. Interactive & extensible Ingestion, Discovery, Analytics, Visualization, Collaboration, Data product Toward better capitalization of analytical application (helium)
  2. ~4 years top level apache project after less than 18 months of incubation
  3. Scala & spark integration Direct DOM for super cool visualization