3.
Adoption
All major Hadoop distributions include Spark
Beyond Hadoop
4.
Partnerships
Partner with Spark distributors to provide great
experience to every Spark user
Partners
5.
Certification
Build a strong application ecosystem
Spark API
Spark Distros
…
Distros Cert
Spark Apps
… App Cert
6.
Certification
Free certification process
Scripts for certifying Spark distributions
• Developed by community
• Open-source
Anyone will be able to certify any Spark distribution
7.
Training
We’ve been teaching Spark since 2012
• 400+ people this year through Databricks
Just launched a new training program
• Already hold workshops in 5 cities
300+ people signed up for training on Wednesday
10.
Big Promise
Great successes using Big Data
Every organization collects dataYour company here!
11.
Big Challenge
Great successes using Big Data
Google, Facebook spend billions $ to develop,
implement, and run data analysis tools and products
Your company here!
Every organization collects data
12.
Typical Story
Your company starts a Big Data initiative
You are tasked to…
1) Build a Hadoop cluster
2) Build a data pipeline
3) Get insights &
build data products
Clusters hard to set up
and manage
Need to integrate a zoo
of tools
Tools are hard to use
(IT)
(engineers, data scientists)
(engineers, data scientists, analysts)
13.
Typical Data Pipeline
Data
ETL
Exploration
Dashboards
& Reports
Data
Products
Advanced
Analytics
Integrate disparate, clunky tools
Hard to navigate data, develop and deploy apps
15.
From Challenges to Solutions
Challenges
Solutions
Hosted platform
Apache Spark
Clusters hard to set up
and manage
Need to integrate a zoo
of tools
Tools are hard to useInteractive Workspace
23.
Job Launcher
Run arbitrary Spark jobs, programmatically
24.
Dramatically Simplify Data Pipeline
Data
ETL
Exploration
Advanced Analytics
Dashboards & Reports
Data Products
Cloud
25.
Dramatically Simplify Data Pipeline
Data
ETL
Exploration
Advanced Analytics
Dashboards & Reports
Data Products
Cloud
Free users to focus on
finding answers & building products
27.
Availability
Started closed beta program earlier this year
Limited availability soon
• Gradually ramping up
• Sign up on databricks.com!
28.
3rd Party Apps
Databricks
Workspace
Databricks Platform
29.
3rd Party Apps
Databricks Workspace
Apps
…
Databricks Platform
30.
Databricks Cloud and Spark
Databricks Cloud runs 100% Apache Spark
• No lock in: any Databricks Cloud app runs on any
certified Spark distribution
Databricks Cloud accelerates Spark adoption
• Provide easiest way to learn and use Apache Spark
31.
Databricks Cloud
Databricks Workspace
Databricks Platform
Dramatically simplify
• analyzing big data
• building data products
Fuel growth of Spark ecosystem
Make big data easy
Los recortes son una forma práctica de recopilar diapositivas importantes para volver a ellas más tarde. Ahora puedes personalizar el nombre de un tablero de recortes para guardar tus recortes.
Crear un tablero de recortes
Compartir esta SlideShare
¿Odia los anuncios?
Consiga SlideShare sin anuncios
Acceda a millones de presentaciones, documentos, libros electrónicos, audiolibros, revistas y mucho más. Todos ellos sin anuncios.
Oferta especial para lectores de SlideShare
Solo para ti: Prueba exclusiva de 60 días con acceso a la mayor biblioteca digital del mundo.
La familia SlideShare crece. Disfruta de acceso a millones de libros electrónicos, audiolibros, revistas y mucho más de Scribd.
Parece que tiene un bloqueador de anuncios ejecutándose. Poniendo SlideShare en la lista blanca de su bloqueador de anuncios, está apoyando a nuestra comunidad de creadores de contenidos.
¿Odia los anuncios?
Hemos actualizado nuestra política de privacidad.
Hemos actualizado su política de privacidad para cumplir con las cambiantes normativas de privacidad internacionales y para ofrecerle información sobre las limitadas formas en las que utilizamos sus datos.
Puede leer los detalles a continuación. Al aceptar, usted acepta la política de privacidad actualizada.