SlideShare una empresa de Scribd logo
1 de 60
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Managing Enterprise
Hadoop Clusters with
Apache Ambari
Jayush Luniya @ Hortonworks
Apache Ambari PMC
© Hortonworks Inc. 2011 – 2016. All Rights Reserved May 2016
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Ambari
Overview
Ambari
Features
Demo Q&A
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
What’s Apache Ambari?
100% open-source
platform for simplifying
Hadoop cluster
management and use.
Highly extensible.
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
It’s a wild zoo out there!
Gotta manage this
efficiently.
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache Ambari Themes
• Deliver the core operational capabilities to provision,
manage and monitor Hadoop clusters at scale.
Operate Hadoop at
Scale
• Robust API for integration with existing enterprise systems,
such as Microsoft SCOM and Teradata Viewpoint.
Integrate with the
Enterprise
• Provide extensible platform for Customers, Partners and
the Community (Stacks, Views)
Extend for the
Ecosystem
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache Ambari
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Open Source Activity
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Inception: AMBARI-1 (Sept, 2011)
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Fast forward 5 years to today…
 Latest JIRA: AMBARI-16131
 150+ Contributors
 60+ Committers
 16131 JIRAs filed
 14254 JIRAs fixed
At 1.5 day per JIRA ~ 90 person years!
 Used by hundreds of companies
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari – 3rd Biggest Project* @ Apache
* Based on total JIRAs filed on a project basis as of April 26, 2016
#2: Hadoop at ~32k as it is split across multiple JIRA Projects
#1
#3
#4
#5
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Timeline
Ambari 1.6.*
May 2014
908 JIRAs
Ambari 1.5.*
Apr 2014
1218 JIRAs
Ambari 1.7.*
Dec 2014
1620 JIRAs
Ambari 2.0.*
April 2015
1804 JIRAs
Current GA Version (2.2.2)
Ambari 2.1.*
July 2015
2674 JIRAs
Ambari Stacks
Resolution of 9k+ JIRAs
Ambari Blueprints Ambari Views
Alerts Framework
Metrics System
Rolling Upgrade
Kerberos Automation
Enhanced
Dashboards
Smart Configs
Ambari 2.2.*
Dec 2015
1542 JIRAs
Express Upgrade
AMS Grafana
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Ambari
Overview
Ambari
Features
Demo Q&A
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Extensibility Features
• To add new Services (ISV or otherwise) beyond HDP stack
• To customize a Stack for customer specific environments
Stacks
• To use Ambari for automating cluster installations.
• To share best practices on layout and cluster configuration
Blueprints
• To extend and customize the Ambari Web UI
• Add new capabilities, customize existing capabilities
Views
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Anatomy of Ambari Extension Points
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari Stacks
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Terminology
Term Definition Examples
STACK Defines a set of Services, where to obtain the
software packages and how to manage the
lifecycle.
HDP-2.3, HDP-2.2
SERVICE Defines the Components that make-up the service. HDFS, NAGIOS, YARN
COMPONENT The building-blocks of a Service, that adhere to a
certain lifecycle.
NAMENODE, DATANODE,
OOZIE_SERVER
CATEGORY The category of Component. MASTER, SLAVE, CLIENT
REPO Repository metadata where the artifacts reside http://public-repo-
1.hortonworks.com/HDP/centos6/2
.x/GA/2.3.0.0
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari Stack
 Stacks define Services + Repo
– What is a stack, and where to get the bits
 Each service has a definition
– What components are part of the Service
 Each service has defined lifecycle commands
– start, stop, status, install, configure
 Lifecycle is controlled via command scripts
 Ability to define “custom” commands
Ambari Server
Stack
Service
Definitions
Command
Scripts
xml python
Ambari Agents
Repos
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stacks Support Inheritance
HDP 2.1 Stack
HDP 2.0 Stack
 Overrides any Service definitions, commands and configurations
 Adds new Services specific to this Stack
 Defines a set of Service definitions
 Default service configurations and command scripts
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Blueprints
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Automated Cluster Deployment
 Deploy clusters of any scale with ease
 Two REST API calls is all it takes to provision a cluster
Who uses it?
 HDInsight (Microsoft Azure)
 Hortonworks QA
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Example: Create a 100-node Cluster
{
"configurations" : [
{
”hdfs-site" : {
"dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3"
}
}
],
"host_groups" : [
{
"name" : ”master-host",
"components" : [
{ "name" : "NAMENODE” },
{ "name" : "RESOURCEMANAGER” },
…
],
"cardinality" : "1"
},
{
"name" : ”worker-host",
"components" : [
{ "name" : ”DATANODE” },
{ "name" : ”NODEMANAGER” },
…
],
"cardinality" : "1+"
},
],
"Blueprints" : {
"stack_name" : "HDP",
"stack_version" : "2.0"
}
}
{
"blueprint" : ”my-blueprint",
"host_groups" :[
{
"name" : ”master-host",
"hosts" : [
{
"fqdn" : ”master001.ambari.apache.org”
}
]
},
{
"name" : ”worker-host",
"hosts" : [
{
"fqdn" : ”worker001.ambari.apache.org”
},
{
"fqdn" : ”worker002.ambari.apache.org”
},
…
{
"fqdn" : ”worker099.ambari.apache.org”
}
]
}
]
}
1. POST /api/v1/blueprints/my-blueprint 2. POST /api/v1/clusters/my-cluster
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Cluster Replication
{
"configurations" : [
{
”cluster-env" : {
”user_group" : ”hadoop"
}
”hdfs-site" : {
"dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3"
}
}
],
"host_groups" : [
{
"name" : ”master-host",
"components" : [
{ "name" : "NAMENODE” },
{ "name" : "RESOURCEMANAGER” },
…
],
"cardinality" : "1"
}
],
"Blueprints" : {
"stack_name" : "HDP",
"stack_version" : "2.0"
}
}
GET/api/v1/clusters/my-
cluster?format=blueprint
 Export blueprint from an existing cluster
 Import blueprint to replicate the cluster
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Blueprint Features
Ambari 2.0:
 High availability (HA) cluster deployments
 Adding hosts using blueprints (AMBARI-8458)
Ambari 2.1:
 Advanced cluster creation options (AMBARI-10750)
Ambari 2.2:
 Kerberized cluster deployments (AMBARI-13431)
 Stack advisor recommendations (AMBARI-13487)
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Upgrades
25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Upgrades
 Rolling vs Express Upgrade modes
 Side-by-Side Bits and Configs
Bits:
/usr/hdp/2.2.0.0-2041
/usr/hdp/2.2.4.2-2
/usr/hdp/2.3.0.0-3000
Configs:
/etc/hive/conf/ (initial)
/etc/hive/conf/v0 (HDP 2.2.4.2)
/etc/hive/conf/v1 (HDP 2.3)
2.2.0.0 2.2.4.2 2.3.0.0minor jump major jump
26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Express vs Rolling Upgrade
Rolling Upgrade
 Services are up the entire time
 Upgrade one component at a time
 Robust and fault-tolerant
 Service checks performed frequently during the upgrade
Express Upgrade
 All services are brought down, upgraded and restarted
 Faster upgrade mode
 Planned service downtime
 Relatively service checks performed less frequently during the upgrade.
27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Upgrade – Install Version
 Install new version in parallel on all agents
 No downtime
28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Upgrade – Orchestration
 Not necessarily “one-click” but fully guided
29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Upgrade – Upgrade Catalog
 Upgrades are driven by upgrade catalogs defined in stack definitions.
 Defines upgrade groups and upgrade order
 Provides ability to modify configurations
– Set, move, delete, transform
 Upgrade steps can be marked as skippable and retryable
 Supports executing custom scripts during upgrade
30 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Upgrade – Upgrade Catalog
31 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Downgrade
 Can trigger downgrade at any stage of the stack upgrade
 Cannot downgrade once stack upgrade has been finalized
32 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Smart Configurations
33 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Hadoop Configuration Challenges
 Too many configurations
– Which ones are important?
 Too easy to mess up
– What are valid/reasonable values?
– What are the units?
– Ok, what about dependencies?
 Gets harder with combinations of services, host assignments, enabled features,
CPU/RAM/disks, etc
– Any recommendations? What am I doing wrong?
 Smart Configurations
34 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari Smart Configs UI
Customizable layout
- Tabs
- Sections
- Sub-sections
- Simple grid layout
(Advanced Tab contains
remaining
configurations)
New Widgets
- Sliders
- Recommended
- Minimum
- Maximum
- Increment Step
- Combos
- Enumerated values
- Toggles
- Binary options
- Spinners
- Splits value into multiple
controls. Time in
milliseconds split into days,
hours, minutes.
- Lists
- Enumerated values
- Single select
- Multi select
Implemented
- HDFS
- YARN
- MapReduce
- Hive
- HBase
35 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stack Driven Layouts
Stack has theme.json file
Layout
 Tabs
 Sections
 Sub-sections
Placement
 Configs placement in sub-sections
Widgets
 Widget type
 Optional Units
 Bytes (B, KB, MB, GB, TB, PB)
 Time (Millis, Seconds, Minutes, Hours, Days, Months,
Years)
{
"name": "default",
"description": "Default theme for HBASE service",
"configuration": {
"layouts": [
{
"name": "default",
"tabs": [
{
"name": "settings",
"display-name": "Settings",
"layout": {
"tab-columns": "3",
"tab-rows": "3",
"sections": [
...
]
}
}
]
}
],
"placement": {
"configuration-layout": "default",
"configs": [...]
},
"widgets": [
{
"config": "hbase-env/hbase_master_heapsize",
"widget": {
"type": "slider",
"units": [
{
"unit-name": "GB"
}
]
}
},
...
]
}
}
36 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Config Metadata and Dependencies
Extended Metadata
 Defined in property_value_attributes
 Hold non-UI metadata about value range,
increment, unit, etc
Dependencies
 Models bi-directional relationship between configs
 Depends On (property_depends_on)
 Answers “which configs do I depend on?”
 Depended By (dependencies)
 Answers “which configs are dependent on me?”
 Ambari automatically updates dependencies
{
"StackConfigurations": {
"final": "false",
"property_depends_on": [
{
"type": "yarn-site",
"name": "yarn.nodemanager.resource.memory-mb"
}
],
"property_description": “The minimum allocation for every",
"property_display_name": "Minimum Container Size (Memory)",
"property_name": "yarn.scheduler.minimum-allocation-mb",
"property_type": [],
"property_value": "512",
"property_value_attributes": {
"type": "int",
"maximum": "5120",
"minimum": "0",
"unit": "MB",
"increment_step": "256"
},
"type": "yarn-site.xml"
},
"dependencies": [
{
"StackConfigurationDependency": {
"dependency_name": "hive.tez.container.size",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
},
{
"StackConfigurationDependency": {
"dependency_name": "mapreduce.map.memory.mb",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
},
{
"StackConfigurationDependency": {
"dependency_name": "mapreduce.reduce.memory.mb",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
}…
]
}
37 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Metrics
38 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari Metrics Service (AMS) - Goals
 Ability to collect metrics from Hadoop and other Stack services
 Ability to collect system level metrics
 Ability to retain metrics at a high precision for a configurable time period
 Ability to automatically purge metrics after retention period
 Provide integration point for metrics collection and retention by external system
 Trigger alerts based on metrics in Ambari
39 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari Metrics System - Architecture
40 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
AMS Grafana
Ambari 2.2.2
 Powerful dashboard builder integrated with AMS
 Pre-built Grafana dashboards for host-level and service-level metrics
 User can build and save custom dashboards
41 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
AMS Grafana
42 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Alerts
43 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Alert – Types
Type Description Status
Thresholds
Configurable?
PORT Watches a port based on a configuration property such as the URI. OK, WARN, CRIT Yes (seconds)
WEB
Watches an HTTP or HTTPS endpoint and determines connectivity
and HTTP status code.
OK, WARN, CRIT No
AGGREGATE Aggregate of status for another alert definition. OK, WARN, CRIT Yes (percentage)
METRIC
Watches a metric or series of metrics in JMX and compares a
mathematical result against a threshold.
OK, WARN, CRIT Yes (variable)
SCRIPT Uses a custom script to handle checking. OK or CRIT No
44 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
UI – Current Alerts
Configured by default; managed via the the web client
45 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
UI – Host Alerts
 Automatically refreshes
 Query alert history
46 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
UI– Customization & Instances
 Status text, thresholds, and interval
47 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Views
48 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari Views
View Framework
 Provide various applications accessible from Ambari Web UI – interact with the cluster via a
browser from a single place for all users (cluster operators, data analysis, developers, etc)
Easy to develop
 No need to understand Ambari core code – view development is just like creating any other web
application
Easy to deploy
 Packaged as a single jar file
 Auto create / auto configure
49 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
CS Queue Manager for Cluster Operators
Capacity Scheduler Queue Manager
50 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS File Browser for General Users
HDFS File Browser
51 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Job Analysis for Developers
Troubleshoot Tez JobsTroubleshoot / Improve Hive queries
52 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Query Editors for Data Analysts
Create, edit, execute, and analyze Hive queries Create, edit, and execute Pig scripts
53 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ambari Server in Views-Only mode
Ambari
Server
Cluster managed by Ambari
Ambari
Server “Views-only” mode
(aka “Stand-alone” mode)
Cluster not managed by Ambari
Management
Use Views
Use Views
Use Views
 Use Views on existing clusters not managed by Ambari
 Can use Views against multiple clusters
54 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Kerberos Automation
55 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Kerberos Automation
Ambari 2.0
 Ambari manage Kerberos principals and keytabs
 Works with existing MIT KDC or Active Directory
 Once Kerberized, seamlessly handle:
 Adding new hosts
 Adding new components to existing hosts
 Adding new services
 Moving components to different hosts
56 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Ambari
Overview
Ambari
Features
Demo Q&A
57 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Ambari
Overview
Ambari
Features
Demo Q&A
58 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Thank You!
Try Ambari
 Follow the Ambari Quick Start Guide
https://cwiki.apache.org/confluence/display/AMBARI/Quick+Start+Guide
Learn more
 Visit the project website http://ambari.apache.org/
Get Involved
 User Mailing List: user-subscribe@ambari.apache.org
 Developer Mailing List: dev-subscribe@ambari.apache.org
 Use JIRA to file bugs and improvement requests https://issues.apache.org/jira/browse/AMBARI/
Jayush Luniya @ Hortonworks (Apache Ambari PMC)
59 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future Roadmap
 AMS Grafana Integration
 Ambari Management Packs
 Ambari Logsearch
 Patch Upgrades
 Multi Service Versions
 Multi Service Instances
60 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Q&A
Stats
Largest production clusters managed by Ambari ~1600 nodes,
~800 nodes
Largest test cluster for Ambari scale testing ~400 nodes
Largest test cluster where rolling upgrade was performed ~400 nodes
~40 hours

Más contenido relacionado

La actualidad más candente

Hortonworks Technical Workshop: Apache Ambari
Hortonworks Technical Workshop:   Apache AmbariHortonworks Technical Workshop:   Apache Ambari
Hortonworks Technical Workshop: Apache AmbariHortonworks
 
Apache Ambari - What's New in 2.2
 Apache Ambari - What's New in 2.2 Apache Ambari - What's New in 2.2
Apache Ambari - What's New in 2.2Hortonworks
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Hortonworks
 
ODPi 101: Who we are, What we do
ODPi 101: Who we are, What we doODPi 101: Who we are, What we do
ODPi 101: Who we are, What we doHortonworks
 
Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Hortonworks
 
Webinar Series Part 5 New Features of HDF 5
Webinar Series Part 5 New Features of HDF 5Webinar Series Part 5 New Features of HDF 5
Webinar Series Part 5 New Features of HDF 5Hortonworks
 
Apache Ambari - What's New in 2.0.0
Apache Ambari - What's New in 2.0.0Apache Ambari - What's New in 2.0.0
Apache Ambari - What's New in 2.0.0Hortonworks
 
Deploying and Managing Hadoop Clusters with AMBARI
Deploying and Managing Hadoop Clusters with AMBARIDeploying and Managing Hadoop Clusters with AMBARI
Deploying and Managing Hadoop Clusters with AMBARIDataWorks Summit
 
Hive present-and-feature-shanghai
Hive present-and-feature-shanghaiHive present-and-feature-shanghai
Hive present-and-feature-shanghaiYifeng Jiang
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseHortonworks
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHortonworks
 
Past, Present and Future of Apache Ambari
Past, Present and Future of Apache AmbariPast, Present and Future of Apache Ambari
Past, Present and Future of Apache AmbariArtem Ervits
 
Apache Hadoop 0.23
Apache Hadoop 0.23Apache Hadoop 0.23
Apache Hadoop 0.23Hortonworks
 
Running Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudRunning Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudDataWorks Summit
 
Hortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical ApplicationsHortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical ApplicationsHortonworks
 
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC IsilonScaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC IsilonHortonworks
 
An Overview on Optimization in Apache Hive: Past, Present, Future
An Overview on Optimization in Apache Hive: Past, Present, FutureAn Overview on Optimization in Apache Hive: Past, Present, Future
An Overview on Optimization in Apache Hive: Past, Present, FutureDataWorks Summit
 
Introduction to Hortonworks Data Cloud for AWS
Introduction to Hortonworks Data Cloud for AWSIntroduction to Hortonworks Data Cloud for AWS
Introduction to Hortonworks Data Cloud for AWSYifeng Jiang
 
Ozone- Object store for Apache Hadoop
Ozone- Object store for Apache HadoopOzone- Object store for Apache Hadoop
Ozone- Object store for Apache HadoopHortonworks
 
A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 DataWorks Summit
 

La actualidad más candente (20)

Hortonworks Technical Workshop: Apache Ambari
Hortonworks Technical Workshop:   Apache AmbariHortonworks Technical Workshop:   Apache Ambari
Hortonworks Technical Workshop: Apache Ambari
 
Apache Ambari - What's New in 2.2
 Apache Ambari - What's New in 2.2 Apache Ambari - What's New in 2.2
Apache Ambari - What's New in 2.2
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
 
ODPi 101: Who we are, What we do
ODPi 101: Who we are, What we doODPi 101: Who we are, What we do
ODPi 101: Who we are, What we do
 
Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1
 
Webinar Series Part 5 New Features of HDF 5
Webinar Series Part 5 New Features of HDF 5Webinar Series Part 5 New Features of HDF 5
Webinar Series Part 5 New Features of HDF 5
 
Apache Ambari - What's New in 2.0.0
Apache Ambari - What's New in 2.0.0Apache Ambari - What's New in 2.0.0
Apache Ambari - What's New in 2.0.0
 
Deploying and Managing Hadoop Clusters with AMBARI
Deploying and Managing Hadoop Clusters with AMBARIDeploying and Managing Hadoop Clusters with AMBARI
Deploying and Managing Hadoop Clusters with AMBARI
 
Hive present-and-feature-shanghai
Hive present-and-feature-shanghaiHive present-and-feature-shanghai
Hive present-and-feature-shanghai
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSense
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
 
Past, Present and Future of Apache Ambari
Past, Present and Future of Apache AmbariPast, Present and Future of Apache Ambari
Past, Present and Future of Apache Ambari
 
Apache Hadoop 0.23
Apache Hadoop 0.23Apache Hadoop 0.23
Apache Hadoop 0.23
 
Running Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudRunning Enterprise Workloads in the Cloud
Running Enterprise Workloads in the Cloud
 
Hortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical ApplicationsHortonworks Technical Workshop: HBase For Mission Critical Applications
Hortonworks Technical Workshop: HBase For Mission Critical Applications
 
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC IsilonScaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC Isilon
 
An Overview on Optimization in Apache Hive: Past, Present, Future
An Overview on Optimization in Apache Hive: Past, Present, FutureAn Overview on Optimization in Apache Hive: Past, Present, Future
An Overview on Optimization in Apache Hive: Past, Present, Future
 
Introduction to Hortonworks Data Cloud for AWS
Introduction to Hortonworks Data Cloud for AWSIntroduction to Hortonworks Data Cloud for AWS
Introduction to Hortonworks Data Cloud for AWS
 
Ozone- Object store for Apache Hadoop
Ozone- Object store for Apache HadoopOzone- Object store for Apache Hadoop
Ozone- Object store for Apache Hadoop
 
A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3
 

Destacado

Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNApache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNHortonworks
 
Apache ambari
Apache ambariApache ambari
Apache ambariYuta Imai
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFiHortonworks
 
Real time trade surveillance in financial markets
Real time trade surveillance in financial marketsReal time trade surveillance in financial markets
Real time trade surveillance in financial marketsHortonworks
 
Managing your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache AmbariManaging your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache AmbariDataWorks Summit
 
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio..."Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...Dataconomy Media
 
Ambari Views - Overview
Ambari Views - OverviewAmbari Views - Overview
Ambari Views - OverviewHortonworks
 
Using Big Data to create a data drive organization
Using Big Data to create a data drive organizationUsing Big Data to create a data drive organization
Using Big Data to create a data drive organizationEdward Chenard
 
Hortonworks.Cluster Config Guide
Hortonworks.Cluster Config GuideHortonworks.Cluster Config Guide
Hortonworks.Cluster Config GuideDouglas Bernardini
 
Aurkezpen Aeskozia
Aurkezpen AeskoziaAurkezpen Aeskozia
Aurkezpen Aeskozialechristie
 
Presentation for chansky
Presentation for chanskyPresentation for chansky
Presentation for chanskyJuliette Andrea
 
MobileYouth Report 2012: Generation O
MobileYouth Report 2012: Generation OMobileYouth Report 2012: Generation O
MobileYouth Report 2012: Generation OGraham Brown
 
瑞士白朗峰的美景
瑞士白朗峰的美景瑞士白朗峰的美景
瑞士白朗峰的美景paulkuo
 
Innovate Now
Innovate NowInnovate Now
Innovate NowOgilvy
 
Daily Newsletter: 10th December, 2010
Daily Newsletter: 10th December, 2010Daily Newsletter: 10th December, 2010
Daily Newsletter: 10th December, 2010Fullerton Securities
 

Destacado (18)

Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNApache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARN
 
Apache ambari
Apache ambariApache ambari
Apache ambari
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
 
Real time trade surveillance in financial markets
Real time trade surveillance in financial marketsReal time trade surveillance in financial markets
Real time trade surveillance in financial markets
 
Managing your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache AmbariManaging your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache Ambari
 
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio..."Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
 
Ambari Views - Overview
Ambari Views - OverviewAmbari Views - Overview
Ambari Views - Overview
 
Using Big Data to create a data drive organization
Using Big Data to create a data drive organizationUsing Big Data to create a data drive organization
Using Big Data to create a data drive organization
 
Hortonworks.Cluster Config Guide
Hortonworks.Cluster Config GuideHortonworks.Cluster Config Guide
Hortonworks.Cluster Config Guide
 
An Overview of Ambari
An Overview of AmbariAn Overview of Ambari
An Overview of Ambari
 
Apache Hadoop Crash Course - HS16SJ
Apache Hadoop Crash Course - HS16SJApache Hadoop Crash Course - HS16SJ
Apache Hadoop Crash Course - HS16SJ
 
Aurkezpen Aeskozia
Aurkezpen AeskoziaAurkezpen Aeskozia
Aurkezpen Aeskozia
 
Presentation for chansky
Presentation for chanskyPresentation for chansky
Presentation for chansky
 
MobileYouth Report 2012: Generation O
MobileYouth Report 2012: Generation OMobileYouth Report 2012: Generation O
MobileYouth Report 2012: Generation O
 
瑞士白朗峰的美景
瑞士白朗峰的美景瑞士白朗峰的美景
瑞士白朗峰的美景
 
6º básico b semana 28 al 01 de abril
6º básico b semana 28 al 01 de  abril6º básico b semana 28 al 01 de  abril
6º básico b semana 28 al 01 de abril
 
Innovate Now
Innovate NowInnovate Now
Innovate Now
 
Daily Newsletter: 10th December, 2010
Daily Newsletter: 10th December, 2010Daily Newsletter: 10th December, 2010
Daily Newsletter: 10th December, 2010
 

Similar a Managing Enterprise Hadoop Clusters with Apache Ambari

Manage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariManage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariJayush Luniya
 
Manage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariManage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariDataWorks Summit
 
Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureDataWorks Summit
 
Future of Apache Ambari
Future of Apache AmbariFuture of Apache Ambari
Future of Apache AmbariJayush Luniya
 
The Future of Apache Ambari
The Future of Apache AmbariThe Future of Apache Ambari
The Future of Apache AmbariDataWorks Summit
 
Hello OpenStack, Meet Hadoop
Hello OpenStack, Meet HadoopHello OpenStack, Meet Hadoop
Hello OpenStack, Meet HadoopDataWorks Summit
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storySunil Govindan
 
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingApache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingDataWorks Summit/Hadoop Summit
 
Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureDataWorks Summit
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash CourseDataWorks Summit
 
Hadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and FutureHadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and FutureDataWorks Summit
 
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseStreamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseHortonworks
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash CourseDataWorks Summit
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambariHortonworks
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureDataWorks Summit
 
HDF 3.1 : An Introduction to New Features
HDF 3.1 : An Introduction to New FeaturesHDF 3.1 : An Introduction to New Features
HDF 3.1 : An Introduction to New FeaturesTimothy Spann
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & FutureDataWorks Summit
 

Similar a Managing Enterprise Hadoop Clusters with Apache Ambari (20)

Manage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariManage Add-on Services in Apache Ambari
Manage Add-on Services in Apache Ambari
 
Manage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariManage Add-On Services with Apache Ambari
Manage Add-On Services with Apache Ambari
 
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache AmbariStreamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
 
Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and Future
 
What's new in Ambari
What's new in AmbariWhat's new in Ambari
What's new in Ambari
 
Future of Apache Ambari
Future of Apache AmbariFuture of Apache Ambari
Future of Apache Ambari
 
The Future of Apache Ambari
The Future of Apache AmbariThe Future of Apache Ambari
The Future of Apache Ambari
 
Hello OpenStack, Meet Hadoop
Hello OpenStack, Meet HadoopHello OpenStack, Meet Hadoop
Hello OpenStack, Meet Hadoop
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration story
 
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingApache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
 
Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and Future
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
Hadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and FutureHadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and Future
 
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseStreamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambari
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
HDF 3.1 : An Introduction to New Features
HDF 3.1 : An Introduction to New FeaturesHDF 3.1 : An Introduction to New Features
HDF 3.1 : An Introduction to New Features
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
 

Más de Hortonworks

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyHortonworks
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakHortonworks
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsHortonworks
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysHortonworks
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's NewHortonworks
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerHortonworks
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsHortonworks
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeHortonworks
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidHortonworks
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleHortonworks
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATAHortonworks
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Hortonworks
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseHortonworks
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseHortonworks
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationHortonworks
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementHortonworks
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHortonworks
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCHortonworks
 

Más de Hortonworks (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 

Último

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
The Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central BankingThe Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central BankingSelcen Ozturkcan
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 

Último (20)

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
The Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central BankingThe Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central Banking
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 

Managing Enterprise Hadoop Clusters with Apache Ambari

  • 1. 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Managing Enterprise Hadoop Clusters with Apache Ambari Jayush Luniya @ Hortonworks Apache Ambari PMC © Hortonworks Inc. 2011 – 2016. All Rights Reserved May 2016
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Ambari Overview Ambari Features Demo Q&A
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved What’s Apache Ambari? 100% open-source platform for simplifying Hadoop cluster management and use. Highly extensible.
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved It’s a wild zoo out there! Gotta manage this efficiently.
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Ambari Themes • Deliver the core operational capabilities to provision, manage and monitor Hadoop clusters at scale. Operate Hadoop at Scale • Robust API for integration with existing enterprise systems, such as Microsoft SCOM and Teradata Viewpoint. Integrate with the Enterprise • Provide extensible platform for Customers, Partners and the Community (Stacks, Views) Extend for the Ecosystem
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Ambari
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Open Source Activity
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Inception: AMBARI-1 (Sept, 2011)
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Fast forward 5 years to today…  Latest JIRA: AMBARI-16131  150+ Contributors  60+ Committers  16131 JIRAs filed  14254 JIRAs fixed At 1.5 day per JIRA ~ 90 person years!  Used by hundreds of companies
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari – 3rd Biggest Project* @ Apache * Based on total JIRAs filed on a project basis as of April 26, 2016 #2: Hadoop at ~32k as it is split across multiple JIRA Projects #1 #3 #4 #5
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Timeline Ambari 1.6.* May 2014 908 JIRAs Ambari 1.5.* Apr 2014 1218 JIRAs Ambari 1.7.* Dec 2014 1620 JIRAs Ambari 2.0.* April 2015 1804 JIRAs Current GA Version (2.2.2) Ambari 2.1.* July 2015 2674 JIRAs Ambari Stacks Resolution of 9k+ JIRAs Ambari Blueprints Ambari Views Alerts Framework Metrics System Rolling Upgrade Kerberos Automation Enhanced Dashboards Smart Configs Ambari 2.2.* Dec 2015 1542 JIRAs Express Upgrade AMS Grafana
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Ambari Overview Ambari Features Demo Q&A
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Extensibility Features • To add new Services (ISV or otherwise) beyond HDP stack • To customize a Stack for customer specific environments Stacks • To use Ambari for automating cluster installations. • To share best practices on layout and cluster configuration Blueprints • To extend and customize the Ambari Web UI • Add new capabilities, customize existing capabilities Views
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Anatomy of Ambari Extension Points
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari Stacks
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Terminology Term Definition Examples STACK Defines a set of Services, where to obtain the software packages and how to manage the lifecycle. HDP-2.3, HDP-2.2 SERVICE Defines the Components that make-up the service. HDFS, NAGIOS, YARN COMPONENT The building-blocks of a Service, that adhere to a certain lifecycle. NAMENODE, DATANODE, OOZIE_SERVER CATEGORY The category of Component. MASTER, SLAVE, CLIENT REPO Repository metadata where the artifacts reside http://public-repo- 1.hortonworks.com/HDP/centos6/2 .x/GA/2.3.0.0
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari Stack  Stacks define Services + Repo – What is a stack, and where to get the bits  Each service has a definition – What components are part of the Service  Each service has defined lifecycle commands – start, stop, status, install, configure  Lifecycle is controlled via command scripts  Ability to define “custom” commands Ambari Server Stack Service Definitions Command Scripts xml python Ambari Agents Repos
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stacks Support Inheritance HDP 2.1 Stack HDP 2.0 Stack  Overrides any Service definitions, commands and configurations  Adds new Services specific to this Stack  Defines a set of Service definitions  Default service configurations and command scripts
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Blueprints
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Automated Cluster Deployment  Deploy clusters of any scale with ease  Two REST API calls is all it takes to provision a cluster Who uses it?  HDInsight (Microsoft Azure)  Hortonworks QA
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Example: Create a 100-node Cluster { "configurations" : [ { ”hdfs-site" : { "dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3" } } ], "host_groups" : [ { "name" : ”master-host", "components" : [ { "name" : "NAMENODE” }, { "name" : "RESOURCEMANAGER” }, … ], "cardinality" : "1" }, { "name" : ”worker-host", "components" : [ { "name" : ”DATANODE” }, { "name" : ”NODEMANAGER” }, … ], "cardinality" : "1+" }, ], "Blueprints" : { "stack_name" : "HDP", "stack_version" : "2.0" } } { "blueprint" : ”my-blueprint", "host_groups" :[ { "name" : ”master-host", "hosts" : [ { "fqdn" : ”master001.ambari.apache.org” } ] }, { "name" : ”worker-host", "hosts" : [ { "fqdn" : ”worker001.ambari.apache.org” }, { "fqdn" : ”worker002.ambari.apache.org” }, … { "fqdn" : ”worker099.ambari.apache.org” } ] } ] } 1. POST /api/v1/blueprints/my-blueprint 2. POST /api/v1/clusters/my-cluster
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Cluster Replication { "configurations" : [ { ”cluster-env" : { ”user_group" : ”hadoop" } ”hdfs-site" : { "dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3" } } ], "host_groups" : [ { "name" : ”master-host", "components" : [ { "name" : "NAMENODE” }, { "name" : "RESOURCEMANAGER” }, … ], "cardinality" : "1" } ], "Blueprints" : { "stack_name" : "HDP", "stack_version" : "2.0" } } GET/api/v1/clusters/my- cluster?format=blueprint  Export blueprint from an existing cluster  Import blueprint to replicate the cluster
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Blueprint Features Ambari 2.0:  High availability (HA) cluster deployments  Adding hosts using blueprints (AMBARI-8458) Ambari 2.1:  Advanced cluster creation options (AMBARI-10750) Ambari 2.2:  Kerberized cluster deployments (AMBARI-13431)  Stack advisor recommendations (AMBARI-13487)
  • 24. 24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Upgrades
  • 25. 25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Upgrades  Rolling vs Express Upgrade modes  Side-by-Side Bits and Configs Bits: /usr/hdp/2.2.0.0-2041 /usr/hdp/2.2.4.2-2 /usr/hdp/2.3.0.0-3000 Configs: /etc/hive/conf/ (initial) /etc/hive/conf/v0 (HDP 2.2.4.2) /etc/hive/conf/v1 (HDP 2.3) 2.2.0.0 2.2.4.2 2.3.0.0minor jump major jump
  • 26. 26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Express vs Rolling Upgrade Rolling Upgrade  Services are up the entire time  Upgrade one component at a time  Robust and fault-tolerant  Service checks performed frequently during the upgrade Express Upgrade  All services are brought down, upgraded and restarted  Faster upgrade mode  Planned service downtime  Relatively service checks performed less frequently during the upgrade.
  • 27. 27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Upgrade – Install Version  Install new version in parallel on all agents  No downtime
  • 28. 28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Upgrade – Orchestration  Not necessarily “one-click” but fully guided
  • 29. 29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Upgrade – Upgrade Catalog  Upgrades are driven by upgrade catalogs defined in stack definitions.  Defines upgrade groups and upgrade order  Provides ability to modify configurations – Set, move, delete, transform  Upgrade steps can be marked as skippable and retryable  Supports executing custom scripts during upgrade
  • 30. 30 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Upgrade – Upgrade Catalog
  • 31. 31 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Downgrade  Can trigger downgrade at any stage of the stack upgrade  Cannot downgrade once stack upgrade has been finalized
  • 32. 32 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Smart Configurations
  • 33. 33 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Hadoop Configuration Challenges  Too many configurations – Which ones are important?  Too easy to mess up – What are valid/reasonable values? – What are the units? – Ok, what about dependencies?  Gets harder with combinations of services, host assignments, enabled features, CPU/RAM/disks, etc – Any recommendations? What am I doing wrong?  Smart Configurations
  • 34. 34 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari Smart Configs UI Customizable layout - Tabs - Sections - Sub-sections - Simple grid layout (Advanced Tab contains remaining configurations) New Widgets - Sliders - Recommended - Minimum - Maximum - Increment Step - Combos - Enumerated values - Toggles - Binary options - Spinners - Splits value into multiple controls. Time in milliseconds split into days, hours, minutes. - Lists - Enumerated values - Single select - Multi select Implemented - HDFS - YARN - MapReduce - Hive - HBase
  • 35. 35 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stack Driven Layouts Stack has theme.json file Layout  Tabs  Sections  Sub-sections Placement  Configs placement in sub-sections Widgets  Widget type  Optional Units  Bytes (B, KB, MB, GB, TB, PB)  Time (Millis, Seconds, Minutes, Hours, Days, Months, Years) { "name": "default", "description": "Default theme for HBASE service", "configuration": { "layouts": [ { "name": "default", "tabs": [ { "name": "settings", "display-name": "Settings", "layout": { "tab-columns": "3", "tab-rows": "3", "sections": [ ... ] } } ] } ], "placement": { "configuration-layout": "default", "configs": [...] }, "widgets": [ { "config": "hbase-env/hbase_master_heapsize", "widget": { "type": "slider", "units": [ { "unit-name": "GB" } ] } }, ... ] } }
  • 36. 36 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Config Metadata and Dependencies Extended Metadata  Defined in property_value_attributes  Hold non-UI metadata about value range, increment, unit, etc Dependencies  Models bi-directional relationship between configs  Depends On (property_depends_on)  Answers “which configs do I depend on?”  Depended By (dependencies)  Answers “which configs are dependent on me?”  Ambari automatically updates dependencies { "StackConfigurations": { "final": "false", "property_depends_on": [ { "type": "yarn-site", "name": "yarn.nodemanager.resource.memory-mb" } ], "property_description": “The minimum allocation for every", "property_display_name": "Minimum Container Size (Memory)", "property_name": "yarn.scheduler.minimum-allocation-mb", "property_type": [], "property_value": "512", "property_value_attributes": { "type": "int", "maximum": "5120", "minimum": "0", "unit": "MB", "increment_step": "256" }, "type": "yarn-site.xml" }, "dependencies": [ { "StackConfigurationDependency": { "dependency_name": "hive.tez.container.size", "property_name": "yarn.scheduler.minimum-allocation-mb” } }, { "StackConfigurationDependency": { "dependency_name": "mapreduce.map.memory.mb", "property_name": "yarn.scheduler.minimum-allocation-mb” } }, { "StackConfigurationDependency": { "dependency_name": "mapreduce.reduce.memory.mb", "property_name": "yarn.scheduler.minimum-allocation-mb” } }… ] }
  • 37. 37 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Metrics
  • 38. 38 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari Metrics Service (AMS) - Goals  Ability to collect metrics from Hadoop and other Stack services  Ability to collect system level metrics  Ability to retain metrics at a high precision for a configurable time period  Ability to automatically purge metrics after retention period  Provide integration point for metrics collection and retention by external system  Trigger alerts based on metrics in Ambari
  • 39. 39 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari Metrics System - Architecture
  • 40. 40 © Hortonworks Inc. 2011 – 2016. All Rights Reserved AMS Grafana Ambari 2.2.2  Powerful dashboard builder integrated with AMS  Pre-built Grafana dashboards for host-level and service-level metrics  User can build and save custom dashboards
  • 41. 41 © Hortonworks Inc. 2011 – 2016. All Rights Reserved AMS Grafana
  • 42. 42 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Alerts
  • 43. 43 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Alert – Types Type Description Status Thresholds Configurable? PORT Watches a port based on a configuration property such as the URI. OK, WARN, CRIT Yes (seconds) WEB Watches an HTTP or HTTPS endpoint and determines connectivity and HTTP status code. OK, WARN, CRIT No AGGREGATE Aggregate of status for another alert definition. OK, WARN, CRIT Yes (percentage) METRIC Watches a metric or series of metrics in JMX and compares a mathematical result against a threshold. OK, WARN, CRIT Yes (variable) SCRIPT Uses a custom script to handle checking. OK or CRIT No
  • 44. 44 © Hortonworks Inc. 2011 – 2016. All Rights Reserved UI – Current Alerts Configured by default; managed via the the web client
  • 45. 45 © Hortonworks Inc. 2011 – 2016. All Rights Reserved UI – Host Alerts  Automatically refreshes  Query alert history
  • 46. 46 © Hortonworks Inc. 2011 – 2016. All Rights Reserved UI– Customization & Instances  Status text, thresholds, and interval
  • 47. 47 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Views
  • 48. 48 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari Views View Framework  Provide various applications accessible from Ambari Web UI – interact with the cluster via a browser from a single place for all users (cluster operators, data analysis, developers, etc) Easy to develop  No need to understand Ambari core code – view development is just like creating any other web application Easy to deploy  Packaged as a single jar file  Auto create / auto configure
  • 49. 49 © Hortonworks Inc. 2011 – 2016. All Rights Reserved CS Queue Manager for Cluster Operators Capacity Scheduler Queue Manager
  • 50. 50 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS File Browser for General Users HDFS File Browser
  • 51. 51 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Job Analysis for Developers Troubleshoot Tez JobsTroubleshoot / Improve Hive queries
  • 52. 52 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Query Editors for Data Analysts Create, edit, execute, and analyze Hive queries Create, edit, and execute Pig scripts
  • 53. 53 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ambari Server in Views-Only mode Ambari Server Cluster managed by Ambari Ambari Server “Views-only” mode (aka “Stand-alone” mode) Cluster not managed by Ambari Management Use Views Use Views Use Views  Use Views on existing clusters not managed by Ambari  Can use Views against multiple clusters
  • 54. 54 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Kerberos Automation
  • 55. 55 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Kerberos Automation Ambari 2.0  Ambari manage Kerberos principals and keytabs  Works with existing MIT KDC or Active Directory  Once Kerberized, seamlessly handle:  Adding new hosts  Adding new components to existing hosts  Adding new services  Moving components to different hosts
  • 56. 56 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Ambari Overview Ambari Features Demo Q&A
  • 57. 57 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Ambari Overview Ambari Features Demo Q&A
  • 58. 58 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Thank You! Try Ambari  Follow the Ambari Quick Start Guide https://cwiki.apache.org/confluence/display/AMBARI/Quick+Start+Guide Learn more  Visit the project website http://ambari.apache.org/ Get Involved  User Mailing List: user-subscribe@ambari.apache.org  Developer Mailing List: dev-subscribe@ambari.apache.org  Use JIRA to file bugs and improvement requests https://issues.apache.org/jira/browse/AMBARI/ Jayush Luniya @ Hortonworks (Apache Ambari PMC)
  • 59. 59 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future Roadmap  AMS Grafana Integration  Ambari Management Packs  Ambari Logsearch  Patch Upgrades  Multi Service Versions  Multi Service Instances
  • 60. 60 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Q&A Stats Largest production clusters managed by Ambari ~1600 nodes, ~800 nodes Largest test cluster for Ambari scale testing ~400 nodes Largest test cluster where rolling upgrade was performed ~400 nodes ~40 hours

Notas del editor

  1. Hadoop projects combined is: ~28k HDFS: 8247 MapReduce: 6117 YARN: 3664 Hadoop Common: 10072
  2. Allow cluster creation or scaling to be started via the REST API prior to all/any hosts being available. As hosts register with Ambari server they will be matched to request host groups and provisioned according to the requested topology Allow host predicates to be specified along with host count to provide more flexibility in matching hosts to host groups. This will allow for host flavors where different host groups are matched to different host flavors Break up the current monolithic provisioning request into a request for each host operation. For example, install on host A, start on host A, install on hostB, etc. This will allow hosts to make progress even when another host encounters a failure. Allow a host count to be specified in the cluster creation template instead of host names. This is documented in https://issues.apache.org/jira/browse/AMBARI-6275
  3. Allow cluster creation or scaling to be started via the REST API prior to all/any hosts being available. As hosts register with Ambari server they will be matched to request host groups and provisioned according to the requested topology Allow host predicates to be specified along with host count to provide more flexibility in matching hosts to host groups. This will allow for host flavors where different host groups are matched to different host flavors Break up the current monolithic provisioning request into a request for each host operation. For example, install on host A, start on host A, install on hostB, etc. This will allow hosts to make progress even when another host encounters a failure. Allow a host count to be specified in the cluster creation template instead of host names. This is documented in https://issues.apache.org/jira/browse/AMBARI-6275
  4. Notice that can upgrade in either same stack e.g., 2.2.*, or 2.2 -> 2.3
  5. Too many configurations – which ones are important? 2 Configurations from 1 section and 2 from another section might be most important No easy way to group across sections Majority Text fields Configs almost always shown as text fields Can be shown in more intuitive controls No units help Configs might shown to user in one unit (days, GB), and be saved in a different unit (milliseconds, B) What are acceptable values? Open ended text fields don’t help when values have to been within a minimum/maximum values No support for a enum of values No configuration dependencies After install if you change one config, you have to remember to change others
  6. OpenTSDB is popular solution on top of HBASE. Time Series DB
  7. Introduced in Ambari 1.7