Global data management is not a newly coined term. However, what it stands for is actually widening in scope particularly around data-in-motion and data-at-rest. Significant technology trends such as IoT, cloud, AI/ML, blockchain, and streaming data have given rise to excessive data volumes and also innovative use cases. The scope for global data management now extends all the way from ingestion, processing, storage, governance, security to analysis. With a good number of endpoints served through the cloud and major application footprints remaining on-premisess, it is pertinent to have a global data management strategy that supports hybrid models and more specifically, a multi-cloud model.
Many modern businesses struggle to balance the demands of rapidly innovating through new technologies like machine learning with the need to keep data safe and secure, all while responding to a constantly changing regulatory landscape. This puts data stewards, data engineers, architects, data scientists, and analysts under intense pressure as they must contend with existing and new applications, multiple logical and physical data stores and sources, diverse data types, and data spread across several deployment environments.
Attend this session led by Matt Aslett, Research Director at 451 Research and Dinesh Chandrasekhar, Director, Hortonworks to learn more about creating a framework for your enterprise that offers guidance on how to think about global data management—priorities, responsibilities, key stakeholders, compliance, and growth.
Speakers
Dinesh Chandrasekhar, Hortonworks, Director Product Marketing
Matt Aslett, 451 Research, Research Director, Data platforms and Analytics
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Global Data Management – a practical framework to rethinking enterprise, operational, and analytical data
1. Copyright (C) 2018 451 Research LLC
Global Data Management
A practical framework for rethinking enterprise,
operational, and analytical data
Matt Aslett, Research Director, Data Platforms & Analytics
Dinesh Chandrasekhar, Director, Product Marketing
12. Copyright (C) 2018 451 Research LLC
Global Data Management
A practical framework for rethinking enterprise,
operational, and analytical data
Matt Aslett, Research Director, Data Platforms & Analytics
13. Copyright (C) 2018 451 Research LLC
451 Research is a leading IT research & advisory company
13
Founded in 2000
300+ employees, including over 120 analysts
2,000+ clients: Technology & Service providers, corporate
advisory, finance, professional services, and IT decision makers
70,000+ IT professionals, business users and consumers in our 451 Alliance
research community
Over 52 million data points published each quarter and 4,500+ reports
published each year
3,000+ technology & service providers under coverage
451 Research and its sister company, Uptime Institute, are the two divisions
of The 451 Group
Headquartered in New York City, with offices in London, Boston, San
Francisco, Washington DC, Mexico, Costa Rica, Brazil, Spain, UAE, Russia,
Taiwan, Singapore and Malaysia
Research & Data
Advisory
Events
Go 2 Market
14. Copyright (C) 2018 451 Research LLC
14
Copyright (C) 2018 451 Research LLC
15. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018
Q1. Which of the following types of cloud or hosted services, if any, does your organization currently use? Please select all that apply.
15
68%
46%
37%
32%
31%
29%
7%
Software as a service (SaaS) and hosted applications
Infrastructure as a service (IaaS)/public cloud
On-premises private cloud
Hosted private cloud
Platform as a service (PaaS)
Hosted, non-cloud infrastructure
None
% of respondents (n = 707)
Types of cloud or hosted services currently in use
All respondents
16. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018
Q4. Which of the following IaaS features is your organization using in connection with your IaaS/public cloud deployment? Please select all
that apply.
16
45%
42%
41%
37%
33%
30%
25%
23%
22%
21%
16%
14%
5%
8%
Relational database
Data/business analytics
Containers
Auto-scaling
Data warehouse
Serverless compute/function as a service
NoSQL database
Real-time/streaming data processing
Machine learning
Mobile services
IoT platform
Large-scale/batch data transfer
Other
None
% of respondents (n = 322)
Public cloud features currently in use
IaaS/public cloud users
17. Source: 451 Research, Voice of the Enterprise: Digital Pulse, Workloads and Key Projects 2018
Q15. You indicated that the majority of your organization’s workloads/applications will be deployed in off-premises cloud/hosted IT
environments two years from now. What are the drivers behind this? Please select up to 3.
17
38%
37%
35%
32%
30%
27%
27%
19%
16%
1%
Reduce IT costs
Enhance IT systems agility
Improved access to new technology resources/capabilities/features
Modernize IT infrastructure
Deploy new applications/features faster
Enhance application performance and resiliency
Move from capital-intensive IT to an operating expense model
Improve security
Overcome lack of in-house IT staff/expertise
Other
% of respondents (n=448)
Drivers of Deploying Workloads/Applications in
Off-Premises Environments
Base: Respondents Plan To Deploy Majority Workloads/Applications in Off-Premises IT Environments
19. Migration patterns
44%
18%
14%
12%
11%
% of respondents (n=1049)
Source: 451 Research, Voice of the Enterprise: Digital Pulse, Workloads and Key Projects 2018
Q7. Which of the following best describes your organization’s overall IT infrastructure approach to mission-critical legacy applications and
workloads going forward?
19
Modernize
Retain existing applications on-
premises but move to more modern
application and infrastructure
architectures.
Refactor and shift
Re-architect or redesign existing
applications using cloud-native
frameworks and deploy in off-premises
cloud environments.
Retain
Keep current applications unchanged
on existing on-premises infrastructure.
Repurchase and shift
Replace current on-premises
applications with SaaS or off-premises
hosted versions of the applications.
Lift and shift
Migrate applications to off-
premises/cloud with minimal changes to
the application code or business logic.
21. Copyright (C) 2018 451 Research LLC
A bridge to the cloud
21
Copyright (C) 2018 451 Research LLC
22. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018
Q3. Which of the following vendors is your organization currently using for IaaS/public cloud? Please select all that apply.
22
71%
69%
23%
10%
8%
7%
3%
3%
3%
3%
3%
10%
Amazon Web Services (AWS)
Microsoft Azure
Google Cloud Platform
Oracle Cloud
IBM Cloud
Rackspace
CenturyLink Cloud
Digital Ocean
Alibaba Cloud
NTT
OVH (VMware vCloud Air)
Other
% of respondents (n = 319)
IaaS/public cloud vendors currently in use
IaaS/public cloud users
• Azure access also comes bundled with a Microsoft EA license
• 59% of organizations using public cloud use more than one provider
• 64% of Azure users also use AWS, 18% use Google Cloud
• 62% of AWS users also use Azure, 22% use Google Cloud
• 82% of Google Cloud users also use AWS, 65% use Azure
23. Copyright (C) 2018 451 Research LLC
It’s a multi-cloud world (increasingly deliberately)
23
Which Flavor of Multi-Cloud?
+
Q. Has your organization configured any of the following cloud deployments for interoperability for the seamless delivery of a business function?
Source: 451 Research, Voice of the Enterprise: Cloud Transformation, Vendor Evaluations, 2016
24. Copyright (C) 2018 451 Research LLC
It’s a multi-cloud world (increasingly deliberately)
24
Which Flavor of Multi-Cloud?
Q. Has your organization configured any of the following cloud deployments for interoperability for the seamless delivery of a business function?
Source: 451 Research, Voice of the Enterprise: Cloud Transformation, Vendor Evaluations, 2016
+
25. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018
Q20. Is your organization currently using multiple infrastructure environments in the operation of (Workload)?
25
47%
6%
18%
29%
Yes
No, but we plan to
No, but we are considering it
No, and we don’t plan to
% of respondents (n = 621)
Multiple infrastructure environments in the operation of a
specific workload
Respondents identifying workloads as currently in place
26. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018
Q21. Which, if any, of the following best describe your organization’s reasons for using multiple infrastructure environments to operate
(Workload)? Please select all that apply.
26
47%
40%
33%
33%
33%
32%
26%
6%
1%
Improving performance/availability
Optimizing for cost
Isolating sensitive business data
Migrating between infrastructure environments
Meeting regulatory or data sovereignty requirements
Adding new functions to existing deployments
Adding geographic diversity/reducing latency
Other
None of the above
% of respondents (n = 419)
Reasons for multiple infrastructure environments in the
operation of workload
Multiple infrastructure environment user/potential user
27. Copyright (C) 2018 451 Research LLC
The trouble with data processing in a globally distributed,
multi-location environment
27
EDW
In theory: one data warehouse,
at the heart of the enterprise
ENTERPRISE
28. Copyright (C) 2018 451 Research LLC
The trouble with data processing in a globally distributed,
multi-location environment
28
EDW
In practice: data marts/
departmental data warehouses/
data lakes
ENTERPRISE
29. Copyright (C) 2018 451 Research LLC
The trouble with data processing in a globally distributed,
multi-location environment
29
In theory: cloud
EDW
ENTERPRISE
30. Copyright (C) 2018 451 Research LLC
The trouble with data processing in a globally distributed,
multi-location environment
30
In practice: multiple clouds,
multiple database services
EDW
ENTERPRISE
31. Copyright (C) 2018 451 Research LLC
The trouble with data processing in a globally distributed,
multi-location environment
31
Data processing at the edge
EDW
ENTERPRISE
32. Copyright (C) 2018 451 Research LLC
Standardization – uniformity across multiple clouds
32
EDW
ENTERPRISE
33. Copyright (C) 2018 451 Research LLC
Catalog – index and discovery
33
Data Catalog
EDW
ENTERPRISE
34. Copyright (C) 2018 451 Research LLC
Schrodinger’s data
(credit Chris Hallenbeck, SAP SVP
database and data management)
How do you know the data is
identifiable/to be forgotten without
looking at it?
Identification of PII data
34Source: Jie Qi on Flickr https://www.flickr.com/photos/jieq/2371783536
35. Copyright (C) 2018 451 Research LLC
Amid the potential complexity – the data fabric
35
EDW
The structure of
anything; the manner in
which the parts of a
thing are united
• A material made of
fibers, a textile or
cloth.
More intangible -
The framework
underlying a structure
• The fabric of space
and time
• The fabric of society
36. Copyright (C) 2018 451 Research LLC
Data fabric – interconnective tissue
36
A consistent user
experience across
multiple environments
(both on-premises and
cloud)
The ability to
(increasingly
automatically) identify
data in multiple
environments (both
on-premises and cloud)
The ability to manage
data in multiple
environments (both
on-premises and cloud)
– data lifecycle, data
governance, data
security
The ability to manage
data across multiple
environments as if it
was a single globally
distributed architecture
(work in progress)
37. Copyright (C) 2018 451 Research LLC
Building blocks for global data management
37
EDW
Data Catalog