SlideShare una empresa de Scribd logo
1 de 24
Hellmar Becker, ING
Continuous Lifecycle London
Automate Hadoop Cluster Deployment
in a Banking Ecosystem
Lessons from Practice
May 4, 2016
Who am I?
2
Automate Hadoop Cluster Deployment in a Banking Ecosystem
3
The Goal
Prelude: Hadoop Patterns in ING
Chapter 1: First Steps
Chapter 2: Standardizing
Chapter 3: The Cloud
Conclusion
Questions
The Goal
IN WHICH we look at the challenges that a bank has to face in the 21st
century, and how this translates into decisions made in the IT landscape.
Market leaders Benelux
Growth markets
Commercial Banking
Challengers
The world of ING – Data Driven Since 1881
5
Customers
33 Million
Private, Corporate and
Institutional Customers
Countries
41
In Europe, Asia,
Australia, North and
South America
Employees
52,000
6
We accelerate through the Concept of One
7
Provide standardized and easy to use
global capabilities and services
Accelerate strategy and
concentrate on business value
Concept of One
Prelude: Hadoop Patterns in ING
8
IN WHICH we describe the journey of some interesting characters that set out to get Hadoop adopted
within a large, venerable institution, and across the world.
Data Lake and Advanced Analytics within ING
9
External and internal reporting for
own or regulatory purposes
Integrate all data sources within the
bank into one processing platform
• Batch data streams
• Live transactions
• Model building for customer
interaction
Better understand customer
needs in an increasingly digital world
Data can help us offering
tailored products and services
Empower data scientists and analysts
to get the best results with advanced
analytics tools and predictive models
Open source software where possible
– Hadoop as a core component
1. File Storage
2. Deep Data
3. Analytical
Hadoop
4. Real Time
Hadoop Usage Patterns
10
Analytical Hadoop
• Our first use case
• Development and Production environments
• P environment has Production level security but Test level SLA
FileStore and Deep Data
• Completely automated
• Full DTAP street (Development, Test, Acceptance, Production)
Patterns and their maintenance
11
• Vendors give us tools to do a GUI based install
• Maintain several clusters in parallel, DTAP!
• Auditability!
• Not for us, we need to do automated installs
• APIs and scripting facilities do exist, but are often poorly tested and documented
Standard installation doesn’t cut it
• Layers – IaaS, PaaS, Application (we want IaaS not PaaS)
• Organizational divide: Platform team vs. Infra team
• Different privileges
• Different tool choices
• Trust and collaboration need to be actively built
• Convince security audit teams!
Organizational challenge
Chapter 1: First Steps
IN WHICH a first expedition ventures into uncharted territory,
encounters strange monsters and reconsiders their equipment.
• First take by Exploration teams (Analytical Pattern)
• Unusual Ops mode: No Production system (although we use production data)
• Install everything with Ansible
• YAML based, ssh based access
• All text files. Easy to put in git and to document
• The Power of Root
• Great power and flexibility
• Risk people and GUI users do not like it
• You are on your own
• We tried to learn from this!
Tooling part 1
Chapter 2: Standardizing
16
IN WHICH a larger party sets out with better equipment, reaches the
shores of a new world but finds that still, much is to be improved.
• Now we needed a Datalake integrated solution with full support
• Also need a full DTAP street
Infra team has legacy tooling (proprietary tools) but limited flexibility.
• Basically, we roll our specific configuration into homemade rpm packages.
Tool choice for application deployment: CA Lisa aka Nolio
• GUI based
• No version control (tagging added as an afterthought)
• Slow and awkward to use
• Dumbed down by organizational restrictions
Conclusion: Don’t go there!
Implementing the FileStore and DeepData patterns
• By then, we had a lot of structure to help us
• Standardized build server with GitBlit, Artifactory, Jenkings
• Agile Way of Working
• Now deployment is a split approach
• Infra parts use TEM (and Ambari blueprint) to deploy full Hadoop stack
• On top of the stack we deploy our own applications with Nolio
• Handovers CIO-Infra still hurt us
• We do have: Deployment on a given system at the press of a button
• We do have: Automatic propagation of Git changes into Artifactory via Jenkins
• We do not (yet) have: Automated propagation D->T->A->P via Jenkins
Implementing the FileStore and DeepData patterns
Chapter 3: The Cloud
IN WHICH our heroes learn from the cloud experience and from explorers around the world,
and make deployment a safe experience for everyone.
Chapter 3: The Cloud
• ING Private Cloud: is essentially Datacenter v2.0
• However, we get the chance to rethink our tooling
• Puppet integrates nicely with RH Satellite and is used to provision PaaS solutions
• Ansible is gaining ground in the internal discussion
• External Ansible community: Meetup grown a lot over the last year. Now more than
Puppet and Chef combined
• ING has an initiative to come up with a standardized way to deploy packaged software,
based on Ansible
The Cloud
Conclusion
• Be aware: Deployment of mostly prepackaged software is different from developing your
own software
• Full automation might not be needed because we do not change as quickly as e.g.
mobile app
• Use tools that are scriptable
• GUIs suck
• Own your stack
Conclusion
22
Questions
Questions
Questions
• Crane Gears by Kevin Utting is licensed under CC BY 2.0
• Hellmar in Nîmes / With Python in Mindanao, by the author
• Domtoren in het oranje licht by helena_is_here is licensed under CC BY 2.0
Attributions
24

Más contenido relacionado

La actualidad más candente

Lean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataLean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big Data
Stylight
 
Ensuring Cloud Native Success: Organization Transformation
Ensuring Cloud Native Success:  Organization TransformationEnsuring Cloud Native Success:  Organization Transformation
Ensuring Cloud Native Success: Organization Transformation
Chloe Jackson
 
Rightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-cloudsRightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-clouds
RightScale
 

La actualidad más candente (20)

Lean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataLean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big Data
 
Hadoop on Docker
Hadoop on DockerHadoop on Docker
Hadoop on Docker
 
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deploymentsSAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
 
Drone fly - Decoupling Event Listeners from the Hive Metastore
Drone fly - Decoupling Event Listeners from the Hive MetastoreDrone fly - Decoupling Event Listeners from the Hive Metastore
Drone fly - Decoupling Event Listeners from the Hive Metastore
 
CWIN17 london becoming cloud native part 2 - guy martin docker
CWIN17 london   becoming cloud native part 2 - guy martin dockerCWIN17 london   becoming cloud native part 2 - guy martin docker
CWIN17 london becoming cloud native part 2 - guy martin docker
 
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresOPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
 
GDPR- The Buck Stops Here
GDPR-  The Buck Stops HereGDPR-  The Buck Stops Here
GDPR- The Buck Stops Here
 
Micro service architecture
Micro service architecture  Micro service architecture
Micro service architecture
 
DevOps and Decoys How to Build a Successful Microsoft DevOps Including the Data
DevOps and Decoys  How to Build a Successful Microsoft DevOps Including the DataDevOps and Decoys  How to Build a Successful Microsoft DevOps Including the Data
DevOps and Decoys How to Build a Successful Microsoft DevOps Including the Data
 
An Architecture for Autonomy
An Architecture for AutonomyAn Architecture for Autonomy
An Architecture for Autonomy
 
Deploying your apps in the cloud - the options: an overview
Deploying your apps in the cloud - the options: an overviewDeploying your apps in the cloud - the options: an overview
Deploying your apps in the cloud - the options: an overview
 
B3 getting started_with_cloud_native_development
B3 getting started_with_cloud_native_developmentB3 getting started_with_cloud_native_development
B3 getting started_with_cloud_native_development
 
6_OPEN17_SUSE Enterprise Storage 4
6_OPEN17_SUSE Enterprise Storage 4 6_OPEN17_SUSE Enterprise Storage 4
6_OPEN17_SUSE Enterprise Storage 4
 
Open Source Applied - Real World Use Cases
Open Source Applied - Real World Use CasesOpen Source Applied - Real World Use Cases
Open Source Applied - Real World Use Cases
 
Ensuring Cloud Native Success: Organization Transformation
Ensuring Cloud Native Success:  Organization TransformationEnsuring Cloud Native Success:  Organization Transformation
Ensuring Cloud Native Success: Organization Transformation
 
Rightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-cloudsRightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-clouds
 
PouchDB - The Database That Syncs
PouchDB - The Database That SyncsPouchDB - The Database That Syncs
PouchDB - The Database That Syncs
 
Newt global meetup microservices
Newt global meetup microservicesNewt global meetup microservices
Newt global meetup microservices
 
OpenStack 3rd Birthday Presentation
OpenStack 3rd Birthday PresentationOpenStack 3rd Birthday Presentation
OpenStack 3rd Birthday Presentation
 
Containing your microservice sprawl
Containing your microservice sprawlContaining your microservice sprawl
Containing your microservice sprawl
 

Destacado

Destacado (20)

AddReality company overview
AddReality company overviewAddReality company overview
AddReality company overview
 
Best Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopBest Practices for Virtualizing Hadoop
Best Practices for Virtualizing Hadoop
 
The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.
 
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
 
Luxury 3.0- a new Retail Scenario for Product Mass Customization and On Deman...
Luxury 3.0- a new Retail Scenario for Product Mass Customization and On Deman...Luxury 3.0- a new Retail Scenario for Product Mass Customization and On Deman...
Luxury 3.0- a new Retail Scenario for Product Mass Customization and On Deman...
 
Reducing Microservice Complexity with Kafka and Reactive Streams
Reducing Microservice Complexity with Kafka and Reactive StreamsReducing Microservice Complexity with Kafka and Reactive Streams
Reducing Microservice Complexity with Kafka and Reactive Streams
 
Building a Data Lake - An App Dev's Perspective
Building a Data Lake - An App Dev's PerspectiveBuilding a Data Lake - An App Dev's Perspective
Building a Data Lake - An App Dev's Perspective
 
Reducing Latency and Increasing Performance while Cutting Infrastructure Costs
Reducing Latency and Increasing Performance while Cutting Infrastructure CostsReducing Latency and Increasing Performance while Cutting Infrastructure Costs
Reducing Latency and Increasing Performance while Cutting Infrastructure Costs
 
Oracle Retail
Oracle RetailOracle Retail
Oracle Retail
 
Business Data Lake Best Practices
Business Data Lake Best PracticesBusiness Data Lake Best Practices
Business Data Lake Best Practices
 
Retail 2.0 Strategy - Perfect Store PDF
Retail 2.0 Strategy - Perfect Store PDFRetail 2.0 Strategy - Perfect Store PDF
Retail 2.0 Strategy - Perfect Store PDF
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 
Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...
Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...
Best Practices for Building a Data Lake with Amazon S3 - August 2016 Monthly ...
 
Creating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital TransformationCreating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital Transformation
 
The Rise of Microservices
The Rise of MicroservicesThe Rise of Microservices
The Rise of Microservices
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Constant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake JourneyConstant Contact: An Online Marketing Leader’s Data Lake Journey
Constant Contact: An Online Marketing Leader’s Data Lake Journey
 
Southwest Power Pool big data case study
Southwest Power Pool big data case study Southwest Power Pool big data case study
Southwest Power Pool big data case study
 
Best Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWSBest Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWS
 
Retail Industry Enterprise Architecture Review
Retail Industry Enterprise Architecture ReviewRetail Industry Enterprise Architecture Review
Retail Industry Enterprise Architecture Review
 

Similar a Automate Hadoop Cluster Deployment in a Banking Ecosystem

A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...
CollabNet
 
Intalio create and cloudfoudry - short
Intalio create and cloudfoudry - shortIntalio create and cloudfoudry - short
Intalio create and cloudfoudry - short
hmalphettes
 

Similar a Automate Hadoop Cluster Deployment in a Banking Ecosystem (20)

#RADC4L16: An API-First Archives Approach at NPR
#RADC4L16: An API-First Archives Approach at NPR#RADC4L16: An API-First Archives Approach at NPR
#RADC4L16: An API-First Archives Approach at NPR
 
Dev Ops for systems of record - Talk at Agile Australia 2015
Dev Ops for systems of record - Talk at Agile Australia 2015Dev Ops for systems of record - Talk at Agile Australia 2015
Dev Ops for systems of record - Talk at Agile Australia 2015
 
How bigtop leveraged docker for build automation and one click hadoop provis...
How bigtop leveraged docker for build automation and  one click hadoop provis...How bigtop leveraged docker for build automation and  one click hadoop provis...
How bigtop leveraged docker for build automation and one click hadoop provis...
 
OS Accelerate London - 09/16/15
OS Accelerate London - 09/16/15OS Accelerate London - 09/16/15
OS Accelerate London - 09/16/15
 
AD1545 - Extending the XPages Extension Library
AD1545 - Extending the XPages Extension LibraryAD1545 - Extending the XPages Extension Library
AD1545 - Extending the XPages Extension Library
 
2012 RightScale Conference NYC - Jeff Gelb, Director of Technology Strategy, ...
2012 RightScale Conference NYC - Jeff Gelb, Director of Technology Strategy, ...2012 RightScale Conference NYC - Jeff Gelb, Director of Technology Strategy, ...
2012 RightScale Conference NYC - Jeff Gelb, Director of Technology Strategy, ...
 
Drupal 8 introduction
Drupal 8 introductionDrupal 8 introduction
Drupal 8 introduction
 
Gartner Infrastructure and Operations Summit Berlin 2015 - DevOps Journey
Gartner Infrastructure and Operations Summit Berlin 2015 - DevOps JourneyGartner Infrastructure and Operations Summit Berlin 2015 - DevOps Journey
Gartner Infrastructure and Operations Summit Berlin 2015 - DevOps Journey
 
Cloud Native Transformation (Alexis Richardson) - Continuous Lifecycle 2018 ...
 Cloud Native Transformation (Alexis Richardson) - Continuous Lifecycle 2018 ... Cloud Native Transformation (Alexis Richardson) - Continuous Lifecycle 2018 ...
Cloud Native Transformation (Alexis Richardson) - Continuous Lifecycle 2018 ...
 
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
 
DevOps Days Ohio
DevOps Days OhioDevOps Days Ohio
DevOps Days Ohio
 
Top 10 dev ops tools (1)
Top 10 dev ops tools (1)Top 10 dev ops tools (1)
Top 10 dev ops tools (1)
 
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
 
Cincom Smalltalk: Present, Future & Smalltalk Advocacy
Cincom Smalltalk: Present, Future & Smalltalk AdvocacyCincom Smalltalk: Present, Future & Smalltalk Advocacy
Cincom Smalltalk: Present, Future & Smalltalk Advocacy
 
Why kubernetes matters
Why kubernetes mattersWhy kubernetes matters
Why kubernetes matters
 
A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...A Reference Architecture to Enable Visibility and Traceability across the Ent...
A Reference Architecture to Enable Visibility and Traceability across the Ent...
 
Hot Technologies of 2013: Hadoop 2.0
Hot Technologies of 2013: Hadoop 2.0Hot Technologies of 2013: Hadoop 2.0
Hot Technologies of 2013: Hadoop 2.0
 
Kubernetes, Toolbox to fail or succeed for beginners - Demi Ben-Ari, VP R&D @...
Kubernetes, Toolbox to fail or succeed for beginners - Demi Ben-Ari, VP R&D @...Kubernetes, Toolbox to fail or succeed for beginners - Demi Ben-Ari, VP R&D @...
Kubernetes, Toolbox to fail or succeed for beginners - Demi Ben-Ari, VP R&D @...
 
Cloud Academy Webinar: Recipe for DevOps Success: Capital One Style
Cloud Academy Webinar: Recipe for DevOps Success: Capital One StyleCloud Academy Webinar: Recipe for DevOps Success: Capital One Style
Cloud Academy Webinar: Recipe for DevOps Success: Capital One Style
 
Intalio create and cloudfoudry - short
Intalio create and cloudfoudry - shortIntalio create and cloudfoudry - short
Intalio create and cloudfoudry - short
 

Último

The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
masabamasaba
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
masabamasaba
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 

Último (20)

The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban
 
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 

Automate Hadoop Cluster Deployment in a Banking Ecosystem

  • 1. Hellmar Becker, ING Continuous Lifecycle London Automate Hadoop Cluster Deployment in a Banking Ecosystem Lessons from Practice May 4, 2016
  • 3. Automate Hadoop Cluster Deployment in a Banking Ecosystem 3 The Goal Prelude: Hadoop Patterns in ING Chapter 1: First Steps Chapter 2: Standardizing Chapter 3: The Cloud Conclusion Questions
  • 4. The Goal IN WHICH we look at the challenges that a bank has to face in the 21st century, and how this translates into decisions made in the IT landscape.
  • 5. Market leaders Benelux Growth markets Commercial Banking Challengers The world of ING – Data Driven Since 1881 5 Customers 33 Million Private, Corporate and Institutional Customers Countries 41 In Europe, Asia, Australia, North and South America Employees 52,000
  • 6. 6
  • 7. We accelerate through the Concept of One 7 Provide standardized and easy to use global capabilities and services Accelerate strategy and concentrate on business value Concept of One
  • 8. Prelude: Hadoop Patterns in ING 8 IN WHICH we describe the journey of some interesting characters that set out to get Hadoop adopted within a large, venerable institution, and across the world.
  • 9. Data Lake and Advanced Analytics within ING 9 External and internal reporting for own or regulatory purposes Integrate all data sources within the bank into one processing platform • Batch data streams • Live transactions • Model building for customer interaction Better understand customer needs in an increasingly digital world Data can help us offering tailored products and services Empower data scientists and analysts to get the best results with advanced analytics tools and predictive models Open source software where possible – Hadoop as a core component
  • 10. 1. File Storage 2. Deep Data 3. Analytical Hadoop 4. Real Time Hadoop Usage Patterns 10
  • 11. Analytical Hadoop • Our first use case • Development and Production environments • P environment has Production level security but Test level SLA FileStore and Deep Data • Completely automated • Full DTAP street (Development, Test, Acceptance, Production) Patterns and their maintenance 11
  • 12. • Vendors give us tools to do a GUI based install • Maintain several clusters in parallel, DTAP! • Auditability! • Not for us, we need to do automated installs • APIs and scripting facilities do exist, but are often poorly tested and documented Standard installation doesn’t cut it
  • 13. • Layers – IaaS, PaaS, Application (we want IaaS not PaaS) • Organizational divide: Platform team vs. Infra team • Different privileges • Different tool choices • Trust and collaboration need to be actively built • Convince security audit teams! Organizational challenge
  • 14. Chapter 1: First Steps IN WHICH a first expedition ventures into uncharted territory, encounters strange monsters and reconsiders their equipment.
  • 15. • First take by Exploration teams (Analytical Pattern) • Unusual Ops mode: No Production system (although we use production data) • Install everything with Ansible • YAML based, ssh based access • All text files. Easy to put in git and to document • The Power of Root • Great power and flexibility • Risk people and GUI users do not like it • You are on your own • We tried to learn from this! Tooling part 1
  • 16. Chapter 2: Standardizing 16 IN WHICH a larger party sets out with better equipment, reaches the shores of a new world but finds that still, much is to be improved.
  • 17. • Now we needed a Datalake integrated solution with full support • Also need a full DTAP street Infra team has legacy tooling (proprietary tools) but limited flexibility. • Basically, we roll our specific configuration into homemade rpm packages. Tool choice for application deployment: CA Lisa aka Nolio • GUI based • No version control (tagging added as an afterthought) • Slow and awkward to use • Dumbed down by organizational restrictions Conclusion: Don’t go there! Implementing the FileStore and DeepData patterns
  • 18. • By then, we had a lot of structure to help us • Standardized build server with GitBlit, Artifactory, Jenkings • Agile Way of Working • Now deployment is a split approach • Infra parts use TEM (and Ambari blueprint) to deploy full Hadoop stack • On top of the stack we deploy our own applications with Nolio • Handovers CIO-Infra still hurt us • We do have: Deployment on a given system at the press of a button • We do have: Automatic propagation of Git changes into Artifactory via Jenkins • We do not (yet) have: Automated propagation D->T->A->P via Jenkins Implementing the FileStore and DeepData patterns
  • 19. Chapter 3: The Cloud IN WHICH our heroes learn from the cloud experience and from explorers around the world, and make deployment a safe experience for everyone. Chapter 3: The Cloud
  • 20. • ING Private Cloud: is essentially Datacenter v2.0 • However, we get the chance to rethink our tooling • Puppet integrates nicely with RH Satellite and is used to provision PaaS solutions • Ansible is gaining ground in the internal discussion • External Ansible community: Meetup grown a lot over the last year. Now more than Puppet and Chef combined • ING has an initiative to come up with a standardized way to deploy packaged software, based on Ansible The Cloud
  • 22. • Be aware: Deployment of mostly prepackaged software is different from developing your own software • Full automation might not be needed because we do not change as quickly as e.g. mobile app • Use tools that are scriptable • GUIs suck • Own your stack Conclusion 22
  • 24. • Crane Gears by Kevin Utting is licensed under CC BY 2.0 • Hellmar in Nîmes / With Python in Mindanao, by the author • Domtoren in het oranje licht by helena_is_here is licensed under CC BY 2.0 Attributions 24