SlideShare una empresa de Scribd logo
1 de 26
Descargar para leer sin conexión
1
© Worldpay 2016. All rights reserved.
Delivering Multi-Tenancy Applications on Hadoop
David M Walker
Enterprise Data Platform Programme Director & Technical Architect
5th April 2017
2 © Worldpay 2017. All rights reserved.2
Transactions Daily.
On average that’s per second.
merchants using >
payment methods & currencies
in countries and in the UK we
process % of all non-cash transactions
Worldpay In (Big) Numbers
In Store
Online
Mobile
3 © Worldpay 2017. All rights reserved.3
Who are our customers?
• You probably interact with Worldpay several times a day without realising it:
• And we are also process the payments for over:
̶ 16,000 hairdressers - 24,000 restaurants - 9,000 pubs - etc.
• After today you will probably notice everywhere
4 © Worldpay 2017. All rights reserved.4
Worldpay & Big Data
• In April 2015 we made the strategic decision to commit to a new enterprise
wide data platform to:
̶ Provide deep analytics and data driven decisions as well as traditional
reporting
̶ Source information from across all our platforms and bring it to one place
̶ Make this information available to our colleagues, our customers and our
partners
̶ Exploit disruptive open-source technologies
̶ Full commitment from CEO, CIO and the Head of Data who initiated the
project
• But with 13.1 billion transactions to a total value of £402bn from 2015 alone
and with a significant proportion of both your card and my card transaction
history in the system it had to be SECURE
5 © Worldpay 2017. All rights reserved.5
Some Stats About Our Environment
• Two Production (PRD) Clusters (96 nodes), Two PPE Clusters (16 nodes) One DTE Cluster (8 Nodes)
̶ All environments are built using the same templates and build instructions
̶ The average Data Node has 12x4Tb disk, 256Gb Memory and 20 cores
̶ Our clusters are on premise and we have the capability to burst to cloud infrastructure with secured
(tokenised) data
̶ We have plans to expand rapidly over the next 24 months
• Security is Key
̶ Because we have so much PCI & PII data we must be both secure and comply with regulators
• We’ve upgraded from HDP 2.3 to HDP 2.4 to HDP 2.5 in 18 months including many point releases
̶ And security and ease of management have improved with each release
• We’ve loaded 80+ Billion Card Transactions from two of Worldpay’s systems
̶ And we are busy at work to get all the other systems on board as both batch and real-time streams
• We’re in the process of delivering to Users and Systems
̶ Users have secure data access with a range of desktop and web tools to the Transaction History
̶ We are in the process of deploying Machine Learning Derived Algorithms back into payment platforms
6
© Worldpay 2016. All rights reserved.
What Is A Multi-Tenancy Cluster?
7 © Worldpay 2017. All rights reserved.7
Large Multi-Tenancy Developments
• A building like The Shard in London has many tenant types
and many tenants. These will include:
• Offices
• Retail Arcade
• Restaurants & Bars
• Hotel
• etc.
• They will also have many components and services
provided in the building including:
• Water
• Gas
• Electricity
• Air Conditioning
• Internet
• Security
• Building Management
8 © Worldpay 2017. All rights reserved.8
The analogy with the Enterprise Data Platform
• We have many tenant types
• Data Warehousing
• Decision Services
• Data APIs
• Technical Insights
• Search
• etc.
• These tenancy type each have components and services
they need to operate
• Data Sources
• Batch, Stream/CDC data, Log Files
• Data Lake and Derived Data Sets
• Data Ingest and Manipulation Tools
• Reporting and Analytic Tooling
• Engines for running models
• Governance (Building Management)
• Security
9 © Worldpay 2017. All rights reserved.9
Our Tenancy Types: Data Warehousing
• It will surprise many but despite the
innovations of big data there is still a
requirement inside the business for
reports and dashboards
• We don’t have a single ‘Enterprise Data Model’
but we do have a number of ‘Narrative
Models’ – third normal form data models that
describe aspects of the business and are used
to populate data marts in Hive and reported
on with tools such as Tableau
Data Warehousing
10 © Worldpay 2017. All rights reserved.10
Our Tenancy Types: Decision Services
• Our data scientists can use the historical data
we have available to examine the factors
including
• What affects whether a transaction
successfully completes?
• How smooth the transaction from a
customer perspective (did a 3D Secure
appear, etc.)?
• Is it fraud?
• Using this information we can generate
Predictive Models that can be seeded back
into the transaction path and used to optimise
the way in which to process a transaction
Data Warehousing
Decision Services
11 © Worldpay 2017. All rights reserved.11
Our Tenancy Types: Data APIs
• Our data API tenants share data from EDP with
other systems
• This may include either de-tokenising our data
into clear (e.g. for sharing with fraud agencies)
or double encrypting the data (e.g. when
sharing it with another company so we can
trace lineage)
• Data is shared with both internal and external
organisations
Data Warehousing
Decision Services
Data APIs
12 © Worldpay 2017. All rights reserved.12
Our Tenancy Types: Technical Insights
• Our Technical Insights tenancy type stores systems
and security monitoring and logs
• These are gathered from various platforms and
made available for analysis and reporting
• The data can be used for both simple and complex
analysis
• We start with simple examples about which
systems need patching, how many support calls
were opened against a specific system, uptime of
servers, etc.
• But we are looking for the complex relationships –
given a pattern of events in routers and servers we
need to add more capacity or take preventative
maintenance allowing us to offer better outcomes
to our merchants
Data Warehousing
Decision Services
Data APIs
Technical Insights
13 © Worldpay 2017. All rights reserved.13
Our Tenancy Types: Search
• Search is a growing requirement
• Our business is moving to use a concept of
‘Operational Events’ where all systems
generate an event when something changes
• These will be stored in our Operational Event
Store within the cluster
• We need to make this data quickly and easily
available via search tools
• The data will help satisfy queries from our
colleagues, merchants, consumers and
business partners
Data Warehousing
Decision Services
Data APIs
Technical Insights
Search
14 © Worldpay 2017. All rights reserved.14
Why Tenancy Types and not just Tenants ?
• Many ‘Big Data’ environments talk about and have multiple tenants
• Each use case developed using the best tool for the job be an agile team
• But behind this is a wave of hidden costs relating to management and upgrades
• The long term operability of the cluster will depend on being able to easily identify:
• Which product components are being used and can the be upgraded?
• Which data sets are required and when are they available?
• How to manage the SLAs and isolate different components with different SLAs ?
• A tenancy type is defined by a common collection of tools and data sets used for a functionally
similar purposes – it provides a design pattern for engineering teams to work towards
15 © Worldpay 2017. All rights reserved.15
How our Multi-Tenancy model changes with Hortonworks 3
• Our model of tenancy types is geared towards a move to Hortonworks 3
• We want to define a tenancy type that uses a collection of containers
• As opposed to the current concept of multiple components
• This allows us to better manage versions of the components
• We want to blueprint the set of containers for a tenancy type
• Then we can rapidly roll out tenants of that tenancy type
• And then (as we are on-premise) we can burst our workloads to the cloud
16 © Worldpay 2017. All rights reserved.16
Optimizing The Platform Support
• We operate a ‘metalic’ service level: Gold, Silver, Bronze
• Good architecture means being able to isolate components for maintenace
• The optimal solution is to keep each component at the lowest possible support level
• Allows us to take components down to perform upgrades, etc.
• Reduces in-house and external support costs
• We have a Hortonworks First Policy for the Hadoop Platform
• If the functionality exists in the Hortonworks Data Platform we will strive to use it over
purchasing/using another product
17 © Worldpay 2017. All rights reserved.17
An Example: The Decision Service Tenancy Type
Decision Service is a capability that is created using components from the Enterprise Data Platform (EDP) and
encompasses:
• The ability to ingest data in real time from operational systems (Attunity/Kafka/Flume/Data Capture Job)
• The ability to analyse data either on the stream as it arrives (Kafka) or historically in a database (Hive)
• The tools to allow data scientists to do machine learning (Spark, using Python and Scala ML libraries)
• The ability to publish and run machine learning models to offer a service (PMML and OpenScoring.io)
• The ability to allow other systems to access the decision service via a RESTful API
• The ability to support decision services in production - the required DR, Integration Testing, Performance
Testing, Service Transition and Governance
18 © Worldpay 2017. All rights reserved.18
PMML
Many decision mechanisms will be individually deployed to form a complete service
Workflow Management
Version Control
Intelligent Account Verification
Predict Fraud
Dynamic 3DS
Payment Recycling
Other Similar Decisions
RESTAPI
OperationalPlatform
Customer
Core
Data
Modelling
Data
Scoring
Data
Data Lake
Batch
Stream
Data Ingest
Batch
Stream
Data Ingest
Other
Platforms
Data Profiling
Feature Engineering
Provisioning
Lifecycle Dashboard
Tools
Algorithms
Scoring Libraries
A/B Testing
Model Health Scoring/Validation
Data Refresh
Deployment
Data Science
Model Management
Event Calendar
Decision Service
19 © Worldpay 2017. All rights reserved.19
Our vision is to optimise every single transaction balanced across Cost,
Acceptance & Risk weighted to meet customer preferences
Card
History
Card Type
Regional
Customs
Credit
History
Currency
Timing
Account
Updater
ATV
Issuer
History
Security
AcceptanceCost Risk
Outcome Priority
P
O
P
O
O
ABC
Fraud
CV2
AVS
3DS
Retry
Route
20 © Worldpay 2017. All rights reserved.20
We have begun to analyze the potential customer outcomes
Existing client solution
Hybrid Model
Pure Machine Learning
ML model
performance only
current data
Disclaimer: These numbers are the results for only one merchant
21 © Worldpay 2017. All rights reserved.21
Operation & Security Infrastructure
The Technical Insights Tenancy Type
Windows
Servers
Web
& File
Servers
Virtual-
isation
Servers
Linux
Servers
including
syslog
Database
Servers
(Oracle
MSSQL)
Firewalls
& Anti-
DDOS
SNMP &
Other
Event
Traps
Physical
Access
Logs
CMDB &
Service
Now
Anti-
Virus
Logs
Vulner-
ability
Scans
Enterprise Data Platform
EventCapture
EventStore
Security & IT
Ops
AnalyticsWorkbench
Reports
Dashboards
Investigations
Advanced Analytics
Machine Learning
Search
Data
Science
Security
Single Pane
Of Glass
IT
Single Pane
Of Glass
Third Party Security Products
Beginners
Advanced
22 © Worldpay 2017. All rights reserved.22
One of our live dashboards – Sensitive data obscured!
23 © Worldpay 2017. All rights reserved.23
Technical Insights:
Eat Your Own Dogfood
Using our own data load
metrics to look for
technical debt and
necessary remedial work
24 © Worldpay 2017. All rights reserved.24
So where are we now and where do we expect to be in two years?
• Data Warehousing
• Two Live Tenants – one for Shopper
Insight and one for Financial Reporting
• We would expect around around ten
narrative models and three reporting
tools to be deployed
• Decision Services
• Multiple decision services being
developed now
• Expect there to be at least tens of decision
services to be deployed
• Search
• PoC Starting
• Data API
• 1 API live
• 3 more planned for the coming months
• As many as required on-going
• Technical Insights
• 15 dashboards delivered from two source
systems
• Deploying now to access hundreds of
sources and devices
• Other Tenancy Types
• More to come – we just don’t know what
they are yet
25 © Worldpay 2017. All rights reserved.25
ENTERPRISE DATA PLATFORM
Who are our technology partners?
26
© Worldpay 2016. All rights reserved.
Leaders in Modern Money
Innovating In Secure Modern Data Analytics
Thank You
David M Walker (david.walker@worldpay.com)
Enterprise Data Platform Programme Director

Más contenido relacionado

La actualidad más candente

ETIS11 - Agile Business Intelligence - Presentation
ETIS11 -  Agile Business Intelligence - PresentationETIS11 -  Agile Business Intelligence - Presentation
ETIS11 - Agile Business Intelligence - PresentationDavid Walker
 
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...Eric Javier Espino Man
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016Kent Graziano
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Edureka!
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Empowered Holdings, LLC
 
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...Denodo
 
Can data virtualization uphold performance with complex queries?
Can data virtualization uphold performance with complex queries?Can data virtualization uphold performance with complex queries?
Can data virtualization uphold performance with complex queries?Denodo
 
Govern and Protect Your End User Information
Govern and Protect Your End User InformationGovern and Protect Your End User Information
Govern and Protect Your End User InformationDenodo
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse conceptsobieefans
 
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...Denodo
 
Tag productos y servicios
Tag  productos y serviciosTag  productos y servicios
Tag productos y servicioswebanexo
 
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Seeling Cheung
 
New Database and Application Development Technology
New Database and Application Development TechnologyNew Database and Application Development Technology
New Database and Application Development TechnologyMaurice Staal
 
Enterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationEnterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationSumya Abdelrazek
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Miningcpjcollege
 
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)Denodo
 

La actualidad más candente (20)

ETIS11 - Agile Business Intelligence - Presentation
ETIS11 -  Agile Business Intelligence - PresentationETIS11 -  Agile Business Intelligence - Presentation
ETIS11 - Agile Business Intelligence - Presentation
 
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
 
DW 101
DW 101DW 101
DW 101
 
Can data virtualization uphold performance with complex queries?
Can data virtualization uphold performance with complex queries?Can data virtualization uphold performance with complex queries?
Can data virtualization uphold performance with complex queries?
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Govern and Protect Your End User Information
Govern and Protect Your End User InformationGovern and Protect Your End User Information
Govern and Protect Your End User Information
 
Neelesh it assignment
Neelesh it assignmentNeelesh it assignment
Neelesh it assignment
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
Denodo Data Virtualization Platform: Overview (session 1 from Architect to Ar...
 
Tag productos y servicios
Tag  productos y serviciosTag  productos y servicios
Tag productos y servicios
 
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
 
New Database and Application Development Technology
New Database and Application Development TechnologyNew Database and Application Development Technology
New Database and Application Development Technology
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Enterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationEnterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementation
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Mining
 
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)
 

Similar a Data Works Summit Munich 2017 - Worldpay - Multi Tenancy Clusters

MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR Technologies
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Cloudera, Inc.
 
Data Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI ComplianceData Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI ComplianceDavid Walker
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...DataWorks Summit
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint MonitoringGain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint MonitoringInfluxData
 
Simply Business' Data Platform
Simply Business' Data PlatformSimply Business' Data Platform
Simply Business' Data PlatformDani Solà Lagares
 
Redefining the Role of IT in a Self-Help Data Integration Environment
Redefining the Role of IT in a Self-Help Data Integration EnvironmentRedefining the Role of IT in a Self-Help Data Integration Environment
Redefining the Role of IT in a Self-Help Data Integration EnvironmentUNIFI Software
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...MapR Technologies
 
8 Things to Consider as SharePoint Moves to the Cloud
8 Things to Consider as SharePoint Moves to the Cloud8 Things to Consider as SharePoint Moves to the Cloud
8 Things to Consider as SharePoint Moves to the CloudChristian Buckley
 
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...Flink Forward
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Precisely
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?OVHcloud
 
InfoSphere BigInsights
InfoSphere BigInsightsInfoSphere BigInsights
InfoSphere BigInsightsWilfried Hoge
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMBig Data Joe™ Rossi
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMBig Data Joe™ Rossi
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 

Similar a Data Works Summit Munich 2017 - Worldpay - Multi Tenancy Clusters (20)

MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
 
Data Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI ComplianceData Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI Compliance
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
 
Ask bigger questions
Ask bigger questionsAsk bigger questions
Ask bigger questions
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint MonitoringGain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
 
Simply Business' Data Platform
Simply Business' Data PlatformSimply Business' Data Platform
Simply Business' Data Platform
 
Redefining the Role of IT in a Self-Help Data Integration Environment
Redefining the Role of IT in a Self-Help Data Integration EnvironmentRedefining the Role of IT in a Self-Help Data Integration Environment
Redefining the Role of IT in a Self-Help Data Integration Environment
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
 
8 Things to Consider as SharePoint Moves to the Cloud
8 Things to Consider as SharePoint Moves to the Cloud8 Things to Consider as SharePoint Moves to the Cloud
8 Things to Consider as SharePoint Moves to the Cloud
 
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?
 
InfoSphere BigInsights
InfoSphere BigInsightsInfoSphere BigInsights
InfoSphere BigInsights
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBM
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 

Más de David Walker

Moving To MicroServices
Moving To MicroServicesMoving To MicroServices
Moving To MicroServicesDavid Walker
 
Big Data Analytics 2017 - Worldpay - Empowering Payments
Big Data Analytics 2017  - Worldpay - Empowering PaymentsBig Data Analytics 2017  - Worldpay - Empowering Payments
Big Data Analytics 2017 - Worldpay - Empowering PaymentsDavid Walker
 
Data Driven Insurance Underwriting
Data Driven Insurance UnderwritingData Driven Insurance Underwriting
Data Driven Insurance UnderwritingDavid Walker
 
Data Driven Insurance Underwriting (Dutch Language Version)
Data Driven Insurance Underwriting (Dutch Language Version)Data Driven Insurance Underwriting (Dutch Language Version)
Data Driven Insurance Underwriting (Dutch Language Version)David Walker
 
An introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligenceAn introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligenceDavid Walker
 
BI SaaS & Cloud Strategies for Telcos
BI SaaS & Cloud Strategies for TelcosBI SaaS & Cloud Strategies for Telcos
BI SaaS & Cloud Strategies for TelcosDavid Walker
 
Gathering Business Requirements for Data Warehouses
Gathering Business Requirements for Data WarehousesGathering Business Requirements for Data Warehouses
Gathering Business Requirements for Data WarehousesDavid Walker
 
Building a data warehouse of call data records
Building a data warehouse of call data recordsBuilding a data warehouse of call data records
Building a data warehouse of call data recordsDavid Walker
 
Struggling with data management
Struggling with data managementStruggling with data management
Struggling with data managementDavid Walker
 
A linux mac os x command line interface
A linux mac os x command line interfaceA linux mac os x command line interface
A linux mac os x command line interfaceDavid Walker
 
Connections a life in the day of - david walker
Connections   a life in the day of - david walkerConnections   a life in the day of - david walker
Connections a life in the day of - david walkerDavid Walker
 
Conspectus data warehousing appliances – fad or future
Conspectus   data warehousing appliances – fad or futureConspectus   data warehousing appliances – fad or future
Conspectus data warehousing appliances – fad or futureDavid Walker
 
An introduction to social network data
An introduction to social network dataAn introduction to social network data
An introduction to social network dataDavid Walker
 
Using the right data model in a data mart
Using the right data model in a data martUsing the right data model in a data mart
Using the right data model in a data martDavid Walker
 
Implementing Netezza Spatial
Implementing Netezza SpatialImplementing Netezza Spatial
Implementing Netezza SpatialDavid Walker
 
UKOUG06 - An Introduction To Process Neutral Data Modelling - Presentation
UKOUG06 - An Introduction To Process Neutral Data Modelling - PresentationUKOUG06 - An Introduction To Process Neutral Data Modelling - Presentation
UKOUG06 - An Introduction To Process Neutral Data Modelling - PresentationDavid Walker
 
Oracle BI06 From Volume To Value - Presentation
Oracle BI06   From Volume To Value - PresentationOracle BI06   From Volume To Value - Presentation
Oracle BI06 From Volume To Value - PresentationDavid Walker
 
IRM09 - What Can IT Really Deliver For BI and DW - Presentation
IRM09 - What Can IT Really Deliver For BI and DW - PresentationIRM09 - What Can IT Really Deliver For BI and DW - Presentation
IRM09 - What Can IT Really Deliver For BI and DW - PresentationDavid Walker
 
ETIS11 - Enterprise Metadata Management
ETIS11 -  Enterprise Metadata ManagementETIS11 -  Enterprise Metadata Management
ETIS11 - Enterprise Metadata ManagementDavid Walker
 
ETIS10 - BI Governance Models & Strategies - Presentation
ETIS10 - BI Governance Models & Strategies - PresentationETIS10 - BI Governance Models & Strategies - Presentation
ETIS10 - BI Governance Models & Strategies - PresentationDavid Walker
 

Más de David Walker (20)

Moving To MicroServices
Moving To MicroServicesMoving To MicroServices
Moving To MicroServices
 
Big Data Analytics 2017 - Worldpay - Empowering Payments
Big Data Analytics 2017  - Worldpay - Empowering PaymentsBig Data Analytics 2017  - Worldpay - Empowering Payments
Big Data Analytics 2017 - Worldpay - Empowering Payments
 
Data Driven Insurance Underwriting
Data Driven Insurance UnderwritingData Driven Insurance Underwriting
Data Driven Insurance Underwriting
 
Data Driven Insurance Underwriting (Dutch Language Version)
Data Driven Insurance Underwriting (Dutch Language Version)Data Driven Insurance Underwriting (Dutch Language Version)
Data Driven Insurance Underwriting (Dutch Language Version)
 
An introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligenceAn introduction to data virtualization in business intelligence
An introduction to data virtualization in business intelligence
 
BI SaaS & Cloud Strategies for Telcos
BI SaaS & Cloud Strategies for TelcosBI SaaS & Cloud Strategies for Telcos
BI SaaS & Cloud Strategies for Telcos
 
Gathering Business Requirements for Data Warehouses
Gathering Business Requirements for Data WarehousesGathering Business Requirements for Data Warehouses
Gathering Business Requirements for Data Warehouses
 
Building a data warehouse of call data records
Building a data warehouse of call data recordsBuilding a data warehouse of call data records
Building a data warehouse of call data records
 
Struggling with data management
Struggling with data managementStruggling with data management
Struggling with data management
 
A linux mac os x command line interface
A linux mac os x command line interfaceA linux mac os x command line interface
A linux mac os x command line interface
 
Connections a life in the day of - david walker
Connections   a life in the day of - david walkerConnections   a life in the day of - david walker
Connections a life in the day of - david walker
 
Conspectus data warehousing appliances – fad or future
Conspectus   data warehousing appliances – fad or futureConspectus   data warehousing appliances – fad or future
Conspectus data warehousing appliances – fad or future
 
An introduction to social network data
An introduction to social network dataAn introduction to social network data
An introduction to social network data
 
Using the right data model in a data mart
Using the right data model in a data martUsing the right data model in a data mart
Using the right data model in a data mart
 
Implementing Netezza Spatial
Implementing Netezza SpatialImplementing Netezza Spatial
Implementing Netezza Spatial
 
UKOUG06 - An Introduction To Process Neutral Data Modelling - Presentation
UKOUG06 - An Introduction To Process Neutral Data Modelling - PresentationUKOUG06 - An Introduction To Process Neutral Data Modelling - Presentation
UKOUG06 - An Introduction To Process Neutral Data Modelling - Presentation
 
Oracle BI06 From Volume To Value - Presentation
Oracle BI06   From Volume To Value - PresentationOracle BI06   From Volume To Value - Presentation
Oracle BI06 From Volume To Value - Presentation
 
IRM09 - What Can IT Really Deliver For BI and DW - Presentation
IRM09 - What Can IT Really Deliver For BI and DW - PresentationIRM09 - What Can IT Really Deliver For BI and DW - Presentation
IRM09 - What Can IT Really Deliver For BI and DW - Presentation
 
ETIS11 - Enterprise Metadata Management
ETIS11 -  Enterprise Metadata ManagementETIS11 -  Enterprise Metadata Management
ETIS11 - Enterprise Metadata Management
 
ETIS10 - BI Governance Models & Strategies - Presentation
ETIS10 - BI Governance Models & Strategies - PresentationETIS10 - BI Governance Models & Strategies - Presentation
ETIS10 - BI Governance Models & Strategies - Presentation
 

Último

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Último (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Data Works Summit Munich 2017 - Worldpay - Multi Tenancy Clusters

  • 1. 1 © Worldpay 2016. All rights reserved. Delivering Multi-Tenancy Applications on Hadoop David M Walker Enterprise Data Platform Programme Director & Technical Architect 5th April 2017
  • 2. 2 © Worldpay 2017. All rights reserved.2 Transactions Daily. On average that’s per second. merchants using > payment methods & currencies in countries and in the UK we process % of all non-cash transactions Worldpay In (Big) Numbers In Store Online Mobile
  • 3. 3 © Worldpay 2017. All rights reserved.3 Who are our customers? • You probably interact with Worldpay several times a day without realising it: • And we are also process the payments for over: ̶ 16,000 hairdressers - 24,000 restaurants - 9,000 pubs - etc. • After today you will probably notice everywhere
  • 4. 4 © Worldpay 2017. All rights reserved.4 Worldpay & Big Data • In April 2015 we made the strategic decision to commit to a new enterprise wide data platform to: ̶ Provide deep analytics and data driven decisions as well as traditional reporting ̶ Source information from across all our platforms and bring it to one place ̶ Make this information available to our colleagues, our customers and our partners ̶ Exploit disruptive open-source technologies ̶ Full commitment from CEO, CIO and the Head of Data who initiated the project • But with 13.1 billion transactions to a total value of £402bn from 2015 alone and with a significant proportion of both your card and my card transaction history in the system it had to be SECURE
  • 5. 5 © Worldpay 2017. All rights reserved.5 Some Stats About Our Environment • Two Production (PRD) Clusters (96 nodes), Two PPE Clusters (16 nodes) One DTE Cluster (8 Nodes) ̶ All environments are built using the same templates and build instructions ̶ The average Data Node has 12x4Tb disk, 256Gb Memory and 20 cores ̶ Our clusters are on premise and we have the capability to burst to cloud infrastructure with secured (tokenised) data ̶ We have plans to expand rapidly over the next 24 months • Security is Key ̶ Because we have so much PCI & PII data we must be both secure and comply with regulators • We’ve upgraded from HDP 2.3 to HDP 2.4 to HDP 2.5 in 18 months including many point releases ̶ And security and ease of management have improved with each release • We’ve loaded 80+ Billion Card Transactions from two of Worldpay’s systems ̶ And we are busy at work to get all the other systems on board as both batch and real-time streams • We’re in the process of delivering to Users and Systems ̶ Users have secure data access with a range of desktop and web tools to the Transaction History ̶ We are in the process of deploying Machine Learning Derived Algorithms back into payment platforms
  • 6. 6 © Worldpay 2016. All rights reserved. What Is A Multi-Tenancy Cluster?
  • 7. 7 © Worldpay 2017. All rights reserved.7 Large Multi-Tenancy Developments • A building like The Shard in London has many tenant types and many tenants. These will include: • Offices • Retail Arcade • Restaurants & Bars • Hotel • etc. • They will also have many components and services provided in the building including: • Water • Gas • Electricity • Air Conditioning • Internet • Security • Building Management
  • 8. 8 © Worldpay 2017. All rights reserved.8 The analogy with the Enterprise Data Platform • We have many tenant types • Data Warehousing • Decision Services • Data APIs • Technical Insights • Search • etc. • These tenancy type each have components and services they need to operate • Data Sources • Batch, Stream/CDC data, Log Files • Data Lake and Derived Data Sets • Data Ingest and Manipulation Tools • Reporting and Analytic Tooling • Engines for running models • Governance (Building Management) • Security
  • 9. 9 © Worldpay 2017. All rights reserved.9 Our Tenancy Types: Data Warehousing • It will surprise many but despite the innovations of big data there is still a requirement inside the business for reports and dashboards • We don’t have a single ‘Enterprise Data Model’ but we do have a number of ‘Narrative Models’ – third normal form data models that describe aspects of the business and are used to populate data marts in Hive and reported on with tools such as Tableau Data Warehousing
  • 10. 10 © Worldpay 2017. All rights reserved.10 Our Tenancy Types: Decision Services • Our data scientists can use the historical data we have available to examine the factors including • What affects whether a transaction successfully completes? • How smooth the transaction from a customer perspective (did a 3D Secure appear, etc.)? • Is it fraud? • Using this information we can generate Predictive Models that can be seeded back into the transaction path and used to optimise the way in which to process a transaction Data Warehousing Decision Services
  • 11. 11 © Worldpay 2017. All rights reserved.11 Our Tenancy Types: Data APIs • Our data API tenants share data from EDP with other systems • This may include either de-tokenising our data into clear (e.g. for sharing with fraud agencies) or double encrypting the data (e.g. when sharing it with another company so we can trace lineage) • Data is shared with both internal and external organisations Data Warehousing Decision Services Data APIs
  • 12. 12 © Worldpay 2017. All rights reserved.12 Our Tenancy Types: Technical Insights • Our Technical Insights tenancy type stores systems and security monitoring and logs • These are gathered from various platforms and made available for analysis and reporting • The data can be used for both simple and complex analysis • We start with simple examples about which systems need patching, how many support calls were opened against a specific system, uptime of servers, etc. • But we are looking for the complex relationships – given a pattern of events in routers and servers we need to add more capacity or take preventative maintenance allowing us to offer better outcomes to our merchants Data Warehousing Decision Services Data APIs Technical Insights
  • 13. 13 © Worldpay 2017. All rights reserved.13 Our Tenancy Types: Search • Search is a growing requirement • Our business is moving to use a concept of ‘Operational Events’ where all systems generate an event when something changes • These will be stored in our Operational Event Store within the cluster • We need to make this data quickly and easily available via search tools • The data will help satisfy queries from our colleagues, merchants, consumers and business partners Data Warehousing Decision Services Data APIs Technical Insights Search
  • 14. 14 © Worldpay 2017. All rights reserved.14 Why Tenancy Types and not just Tenants ? • Many ‘Big Data’ environments talk about and have multiple tenants • Each use case developed using the best tool for the job be an agile team • But behind this is a wave of hidden costs relating to management and upgrades • The long term operability of the cluster will depend on being able to easily identify: • Which product components are being used and can the be upgraded? • Which data sets are required and when are they available? • How to manage the SLAs and isolate different components with different SLAs ? • A tenancy type is defined by a common collection of tools and data sets used for a functionally similar purposes – it provides a design pattern for engineering teams to work towards
  • 15. 15 © Worldpay 2017. All rights reserved.15 How our Multi-Tenancy model changes with Hortonworks 3 • Our model of tenancy types is geared towards a move to Hortonworks 3 • We want to define a tenancy type that uses a collection of containers • As opposed to the current concept of multiple components • This allows us to better manage versions of the components • We want to blueprint the set of containers for a tenancy type • Then we can rapidly roll out tenants of that tenancy type • And then (as we are on-premise) we can burst our workloads to the cloud
  • 16. 16 © Worldpay 2017. All rights reserved.16 Optimizing The Platform Support • We operate a ‘metalic’ service level: Gold, Silver, Bronze • Good architecture means being able to isolate components for maintenace • The optimal solution is to keep each component at the lowest possible support level • Allows us to take components down to perform upgrades, etc. • Reduces in-house and external support costs • We have a Hortonworks First Policy for the Hadoop Platform • If the functionality exists in the Hortonworks Data Platform we will strive to use it over purchasing/using another product
  • 17. 17 © Worldpay 2017. All rights reserved.17 An Example: The Decision Service Tenancy Type Decision Service is a capability that is created using components from the Enterprise Data Platform (EDP) and encompasses: • The ability to ingest data in real time from operational systems (Attunity/Kafka/Flume/Data Capture Job) • The ability to analyse data either on the stream as it arrives (Kafka) or historically in a database (Hive) • The tools to allow data scientists to do machine learning (Spark, using Python and Scala ML libraries) • The ability to publish and run machine learning models to offer a service (PMML and OpenScoring.io) • The ability to allow other systems to access the decision service via a RESTful API • The ability to support decision services in production - the required DR, Integration Testing, Performance Testing, Service Transition and Governance
  • 18. 18 © Worldpay 2017. All rights reserved.18 PMML Many decision mechanisms will be individually deployed to form a complete service Workflow Management Version Control Intelligent Account Verification Predict Fraud Dynamic 3DS Payment Recycling Other Similar Decisions RESTAPI OperationalPlatform Customer Core Data Modelling Data Scoring Data Data Lake Batch Stream Data Ingest Batch Stream Data Ingest Other Platforms Data Profiling Feature Engineering Provisioning Lifecycle Dashboard Tools Algorithms Scoring Libraries A/B Testing Model Health Scoring/Validation Data Refresh Deployment Data Science Model Management Event Calendar Decision Service
  • 19. 19 © Worldpay 2017. All rights reserved.19 Our vision is to optimise every single transaction balanced across Cost, Acceptance & Risk weighted to meet customer preferences Card History Card Type Regional Customs Credit History Currency Timing Account Updater ATV Issuer History Security AcceptanceCost Risk Outcome Priority P O P O O ABC Fraud CV2 AVS 3DS Retry Route
  • 20. 20 © Worldpay 2017. All rights reserved.20 We have begun to analyze the potential customer outcomes Existing client solution Hybrid Model Pure Machine Learning ML model performance only current data Disclaimer: These numbers are the results for only one merchant
  • 21. 21 © Worldpay 2017. All rights reserved.21 Operation & Security Infrastructure The Technical Insights Tenancy Type Windows Servers Web & File Servers Virtual- isation Servers Linux Servers including syslog Database Servers (Oracle MSSQL) Firewalls & Anti- DDOS SNMP & Other Event Traps Physical Access Logs CMDB & Service Now Anti- Virus Logs Vulner- ability Scans Enterprise Data Platform EventCapture EventStore Security & IT Ops AnalyticsWorkbench Reports Dashboards Investigations Advanced Analytics Machine Learning Search Data Science Security Single Pane Of Glass IT Single Pane Of Glass Third Party Security Products Beginners Advanced
  • 22. 22 © Worldpay 2017. All rights reserved.22 One of our live dashboards – Sensitive data obscured!
  • 23. 23 © Worldpay 2017. All rights reserved.23 Technical Insights: Eat Your Own Dogfood Using our own data load metrics to look for technical debt and necessary remedial work
  • 24. 24 © Worldpay 2017. All rights reserved.24 So where are we now and where do we expect to be in two years? • Data Warehousing • Two Live Tenants – one for Shopper Insight and one for Financial Reporting • We would expect around around ten narrative models and three reporting tools to be deployed • Decision Services • Multiple decision services being developed now • Expect there to be at least tens of decision services to be deployed • Search • PoC Starting • Data API • 1 API live • 3 more planned for the coming months • As many as required on-going • Technical Insights • 15 dashboards delivered from two source systems • Deploying now to access hundreds of sources and devices • Other Tenancy Types • More to come – we just don’t know what they are yet
  • 25. 25 © Worldpay 2017. All rights reserved.25 ENTERPRISE DATA PLATFORM Who are our technology partners?
  • 26. 26 © Worldpay 2016. All rights reserved. Leaders in Modern Money Innovating In Secure Modern Data Analytics Thank You David M Walker (david.walker@worldpay.com) Enterprise Data Platform Programme Director