SlideShare una empresa de Scribd logo
1 de 31
1© Cloudera, Inc. All rights reserved.
How to Lower TCO and
Avoid Cloud Lock-in
Jim Fisher, Director of Systems Engineering at Cloudera
Ifi Derekli, Systems Engineer at Cloudera
Susan Greslik, Systems Engineer at Cloudera
2© Cloudera, Inc. All rights reserved.
● Moving from on-prem to Cloud: Best practices for lowering your TCO
● Portability and selecting the right Cloud provider
● Demo of using multiple Cloud providers
● Bringing it all together with Cloudera Altus
Agenda
3© Cloudera, Inc. All rights reserved.
Big Data deployments in cloud are
accelerating
● Increased agility through end-user self-
service
● Organization focused on higher value
items
● Perceived lower overall TCO by
optimizing infrastructure usage
Benefits of the Cloud
4© Cloudera, Inc. All rights reserved.
Three Types of Deployment Models in Cloud
1hr
SPIN UP SPIN
DOWN
24/7
24/7
1hr
SPIN UP SPIN
DOWN
Persistent
Transient
Elastic
5© Cloudera, Inc. All rights reserved.
Characteristics are Different for Each
Usage Requirements Example Workloads
Persistent ● Runs 24/7
● Only expands when
new capacity is
needed
● High availability & disaster
recovery
● Cluster operational management
● Resource management
● Security
● NoSQL
● Streaming
● BI analytics
● Multi-user
Transient ● Runs on an
intermittent basis
(e.g.: daily, weekly,
hourly)
● Object store integration
● Fast cluster provisioning
● Cluster metadata persistence
● Usage-based pricing
● ETL workflows
● Model training
● Ad hoc analytics
● Dev/Test workflows
Elastic ● Some nodes run 24/7
● Others added &
removed as needed
(e.g.: daily, weekly,
monthly, quarterly)
● Combination of requirements
from persistent and transient
clusters
● BI analytics during
peak hours
● End of week, month,
quarter processing
6© Cloudera, Inc. All rights reserved.
Cost Models are Different
Cost Structure Cost Optimization Option
Persistent Yearly “rental” of infrastructure Multi-year agreements (e.g.: Reserved
Instances)
Transient Hourly “rental” of infrastructure Preemptible VMs (e.g.: Spot Instances)
Elastic Yearly “rental” for persistent nodes and
hourly “rental” for transient nodes
Multi-year agreements for persistent nodes
and preemptible VMs for transient nodes
On-Premise Purchase of infrastructure that is
typically depreciated over 3 years
Limited
7© Cloudera, Inc. All rights reserved.
● Utilizing publicly available list pricing with no discounts
● AWS pricing used since they are the market leader but same concepts apply
to other Cloud providers
● Your mileage may vary - These are going to be different for every
organization but the concepts and numbers are directionally correct
● Only using infrastructure costs since they are often the majority of TCO
Some Assumptions
8© Cloudera, Inc. All rights reserved.
How Much Does a Server Cost?
Component Details Cost Estimates
Server 20 cores, 256GB
RAM, 12 4TB disks
$18,000
Data Center Power, cooling, and
data center space*
$4,000
Networking Switches &
networking equip.
$5,000
Administrator One person who
manages 100 servers
$6,000
TOTAL (3 Years) $33,000
TOTAL
(Annually)
$11,000
* https://ongoingoperations.com/data-center-pricing-credit-unions/
9© Cloudera, Inc. All rights reserved.
How Much is a Similar Server in the Cloud?
Component Details
d2.8xlarge 36 vCPUs, 244 GB RAM,
12 4TB disks
Option Unit
Cost
Total
(3 Years)
Total
(Annual)
On-Premise Server $33,000 / 3 years $33,000 $11,000
On-Demand Pricing $5.52 / hour $145,065 $48,355
Reserved Instance (1-Year) $23,616 / year $70,848 $23,616
Reserved Instance (3-Year) $41,560 / 3 years $41,560 $13,853
10© Cloudera, Inc. All rights reserved.
Benefits of using object storage
● Create a data lake in object store for multiple
clusters and eliminate data silos
● Provides durability for the data, so you don’t
have to worry about replication.
● Allows you to separate compute and storage so
you can grow independently.
● ..which leads to less costs than local storage
Except when…
● Performance is critical. Each attached disk is
roughly 100Mb/s and that’s rough estimate per
server to object storage
What if You Want to Use Object Storage?
Object Store
11© Cloudera, Inc. All rights reserved.
How Does Object Storage Affect the Costs?
Component Details
r4.8xlarge 32 vCPUs, 244 GB RAM
EBS Disks 640GB
S3 Storage 12 TB* (no replication required)
Option Compute Storage Total (Annual)
On-Premise Server $11,000 $0 $11,000
d2.8xl Reserved Instance (3-
Year)
$13,853 $0 $13,853
r4.8xl Reserved Instance (3-
Year)
$7,009 $5,458 $12,467
Object Store
* Object Storage costs may be less since you pay for what you use
12© Cloudera, Inc. All rights reserved.
Benefits of Transient Clusters
● Pay only for what you use
● Right-size cluster based on
workload needs
● Better isolation between different
users and groups
Cloud Workloads are often Transient
13© Cloudera, Inc. All rights reserved.
What if you only needed 6 Hours per Day?
Component Details
r4.8xlarge 32 vCPUs, 244 GB RAM, 6 hours/day
EBS Disks 640GB, 6 hours/day
S3 Storage 12 TB*, 24x7
Option Compute Storage Total (Annual)
On-Premise Server $11,000 $0 $11,000
Reserved Instance (3-Year) $7,009 $5,458 $12,467
On Demand Pricing (6
hours/day)
$4,648 $4,728 $9,376
* Object Storage costs may be less since you pay for what you use
14© Cloudera, Inc. All rights reserved.
What are Preemptible Instances?
Spare computing capacity that you can bid on a significant discount levels as
compared to on-demand pricing. AWS suggests that costs can be 50-90% less
than On-Demand and Google says they can be up to 80% cheaper.
Known as Spot pricing in AWS and Preemptible VMs in Google.
Preemptible Instances can be used to lower costs
15© Cloudera, Inc. All rights reserved.
How much can you save with Spot at 70% Discount?
Component Details
r4.8xlarge 32 vCPUs cores, 244 GB RAM, 6
hours/day
EBS Disks 640GB, 6 hours/day
S3 Storage 12 TB*, 24x7
Option Compute Storage Total (Annual)
On-Premise Server $11,000 $0 $11,000
On Demand Pricing (6
hours/day)
$4,648 $4,728 $9,376
Spot Pricing - 70% (6 hours/day) $1,3941 $4,728 $6,122
* Object Storage costs may be less since you pay for what you use
16© Cloudera, Inc. All rights reserved.
25 Servers cost about $275,000 per year
35 Servers cost about $385,000 per year
Let’s Walk through a Scenario
Workload Requirements
ETL ● SLAs to complete within 6 hours
● Need 20 servers to meet SLAs
BI ● 15 servers to meet minimum workload
● 25 servers needed during business hours
17© Cloudera, Inc. All rights reserved.
Two Clusters in the Cloud
Component Details
r4.2xlarge 100, with 8 vCPUs cores, 61 GB RAM
EBS Disks 15.6 TB, 24 hours/day
S3 Storage 300 TB
Object Store
BI - Persistent running 24/7
Component Details
r4.2xlarge 80 with 8 vCPUs, 61 GB RAM, 6 hours/day
EBS Disks 12.8 TB, 6 hours/day
S3 Storage (included with BI workload)
ETL - Transient Running 6 hrs/daily
18© Cloudera, Inc. All rights reserved.
● Cloud can be less when on-
premise is not as highly utilized
and more when on-premise is
efficiently utilized
● Cloud TCO best practices used
● Cloud provides benefits of
isolation and on-demand flexibility
How Do the Costs Compare?
Option Compute Storage Total (Annual)
On-Premise (25 Servers) $275,000 $0 $275,000
On-Premise (35 Servers) $385,000 $0 $385,000
Cloud (ETL) $27,888 $4,853 $32,741
Cloud (BI) $175,233 $136,395 $311,628
19© Cloudera, Inc. All rights reserved.
General
● Don’t look as Cloud as infrastructure hosted in another data center
● Understand the workloads so you use the right tool to optimize the TCO
Best Practices on How to Lower Cloud TCO
Storage
● Utilize object storage when possible to eliminate data silos
● Use local storage when performance SLAs are more critical
Persistent
● Use Reserved Instances when workloads are
known and can be committed to for multiple
years
Transient
● Use Preemptible instances when possible
but you may have to re-design application
20© Cloudera, Inc. All rights reserved.
Portability and selecting the right Cloud
provider
21© Cloudera, Inc. All rights reserved.
Why is portability important?
Financial considerations
• Price negotiation position
• Instance pricing fluctuation
• Project type can dictate cost
(storage, processing power)
Conflict of Interest
• Competition with cloud vendors
• Freedom to have choices for
given projects is critical
Maximize Capabilities
• Freedom to leverage all
features available across
vendors
Diversify Risk
• Eliminate 100% dependency on
vendor’s technology
• Ensure uptime of your
environment despite Cloud
vendor potential issues
22© Cloudera, Inc. All rights reserved.
What to consider?
Questions to ask
Pricing model ● Does the vendor round to nearest minute? Hour? Do
they offer discounts for upfront commitment?
Variety of services provided ● Does the vendor provide enough options for
instances (e.g. dense disk, memory-optimized, cpu-
optimized), storage options (e.g. local disk, object-
store), or network capabilities to meet your needs?
Ease of Use ● Do you have an existing skill-set for a particular
vendor?
● Is the platform simple to deploy and easy to learn?
Support & Management ● What type of assistance will the vendor provide?
● How easy is it to troubleshoot your cloud
environment?
23© Cloudera, Inc. All rights reserved.
Orchestration tool for deploying, monitoring and scaling Cloudera EDH on cloud
infrastructure
Characteristics:
• Embodies Cloudera best-practices and reference
Architectures
• Complements on-prem offerings for IaaS users
• Extends capabilities of Cloudera Manager
• Grows and shrinks clusters via single pane of glass
Main Goals:
• Reduce time-to-value
• Enable new usage patterns (on-demand clusters,
self-service)
• Facilitate portability amongst cloud vendors
• Allow predictability of workloads
What is Cloudera Director?
Cloudera Director
24© Cloudera, Inc. All rights reserved.
• Re-usability
• Cluster configuration files
• Node templates (Master,
Workers, Edge)
• “Standard” cluster configs ->
predictable project costs
• Only Pay for What You Use
• Automatic billing
• Flexible SKUs based on use
case
Cloudera Director Economics
25© Cloudera, Inc. All rights reserved.
How Does Director Help with Portability?
IFI TODO: Screenshot of director
26© Cloudera, Inc. All rights reserved.
How Does Director Help with Portability?
27© Cloudera, Inc. All rights reserved.
Demo
28© Cloudera, Inc. All rights reserved.
● Brand for Cloudera PaaS
offerings
● Foundation acts as framework
for building services.
● Altus for data engineers is first
user-facing service.
Cloudera Altus is a PaaS for big data analytics
29© Cloudera, Inc. All rights reserved.
● Understand your workloads so you use the right tools and right vendor to
optimize the TCO
● Plan for portability to reduce risk and costs and enable options
● Cloudera can help you plan and architect efficiently
● Cloudera Altus - PaaS offering so you can focus on your applications
Key Takeaways
30© Cloudera, Inc. All rights reserved.
Thank you
31© Cloudera, Inc. All rights reserved.
An Enterprise Data Hub reimagined in the cloud
Object Store Object Store
Data Science
Workbench
Common Governance
Common Security
Common: Operations, Governance, Security, Schema, Catalog
SQL WorkbenchPartner EcosystemWorkload Management

Más contenido relacionado

La actualidad más candente

Making Self-Service BI a Reality in the Enterprise
Making Self-Service BI a Reality in the EnterpriseMaking Self-Service BI a Reality in the Enterprise
Making Self-Service BI a Reality in the Enterprise
Cloudera, Inc.
 

La actualidad más candente (20)

The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data Journey
 
The Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in ChurnThe Big Picture: Learned Behaviors in Churn
The Big Picture: Learned Behaviors in Churn
 
How Data Drives Business at Choice Hotels
How Data Drives Business at Choice HotelsHow Data Drives Business at Choice Hotels
How Data Drives Business at Choice Hotels
 
Intuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchIntuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with Search
 
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester Webinar
 
Making Self-Service BI a Reality in the Enterprise
Making Self-Service BI a Reality in the EnterpriseMaking Self-Service BI a Reality in the Enterprise
Making Self-Service BI a Reality in the Enterprise
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache Impala
 
Apache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance UpdateApache Impala (incubating) 2.5 Performance Update
Apache Impala (incubating) 2.5 Performance Update
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
Advanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine LearningAdvanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine Learning
 
Moving Beyond Lambda Architectures with Apache Kudu
Moving Beyond Lambda Architectures with Apache KuduMoving Beyond Lambda Architectures with Apache Kudu
Moving Beyond Lambda Architectures with Apache Kudu
 
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
 
Big data journey to the cloud rohit pujari 5.30.18
Big data journey to the cloud   rohit pujari 5.30.18Big data journey to the cloud   rohit pujari 5.30.18
Big data journey to the cloud rohit pujari 5.30.18
 
How to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of ThingsHow to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of Things
 
Cloudera Altus: Big Data in der Cloud einfach gemacht
Cloudera Altus: Big Data in der Cloud einfach gemachtCloudera Altus: Big Data in der Cloud einfach gemacht
Cloudera Altus: Big Data in der Cloud einfach gemacht
 
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
 
Cloudera - The Modern Platform for Analytics
Cloudera - The Modern Platform for AnalyticsCloudera - The Modern Platform for Analytics
Cloudera - The Modern Platform for Analytics
 
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made EasyCloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made Easy
 

Similar a How to Lower TCO and Avoid Cloud Lock-in


Discover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptxDiscover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptx
Neo4j
 

Similar a How to Lower TCO and Avoid Cloud Lock-in
 (20)

CloudOpen Japan - Controlling the cost of your first cloud
CloudOpen Japan - Controlling the cost of your first cloudCloudOpen Japan - Controlling the cost of your first cloud
CloudOpen Japan - Controlling the cost of your first cloud
 
Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014
 
Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18
 
AWS Summit Stockholm 2014 – B5 – The TCO of cloud applications
AWS Summit Stockholm 2014 – B5 – The TCO of cloud applicationsAWS Summit Stockholm 2014 – B5 – The TCO of cloud applications
AWS Summit Stockholm 2014 – B5 – The TCO of cloud applications
 
B4 - The TCO of cloud applications
B4 - The TCO of cloud applicationsB4 - The TCO of cloud applications
B4 - The TCO of cloud applications
 
Oracle Cloud Infrastructure Introduction
Oracle Cloud Infrastructure IntroductionOracle Cloud Infrastructure Introduction
Oracle Cloud Infrastructure Introduction
 
Discover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptxDiscover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptx
 
Apache Druid Design and Future prospect
Apache Druid Design and Future prospectApache Druid Design and Future prospect
Apache Druid Design and Future prospect
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudPart 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
 
Stor simple presentation customers
 Stor simple presentation customers Stor simple presentation customers
Stor simple presentation customers
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
 
NVMe and Flash – Make Your Storage Great Again!
NVMe and Flash – Make Your Storage Great Again!NVMe and Flash – Make Your Storage Great Again!
NVMe and Flash – Make Your Storage Great Again!
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
 
Automating a PostgreSQL High Availability Architecture with Ansible
Automating a PostgreSQL High Availability Architecture with AnsibleAutomating a PostgreSQL High Availability Architecture with Ansible
Automating a PostgreSQL High Availability Architecture with Ansible
 
[Cloudera World Tokyo 2018] Cloudera on Oracle Cloud Infrastructure
[Cloudera World Tokyo 2018] Cloudera on Oracle Cloud Infrastructure[Cloudera World Tokyo 2018] Cloudera on Oracle Cloud Infrastructure
[Cloudera World Tokyo 2018] Cloudera on Oracle Cloud Infrastructure
 
Oracle & SQL Server on the Cloud: Database Week SF
Oracle & SQL Server on the Cloud: Database Week SFOracle & SQL Server on the Cloud: Database Week SF
Oracle & SQL Server on the Cloud: Database Week SF
 
Oracle & SQL Server on the Cloud: Database Week San Francisco
Oracle & SQL Server on the Cloud: Database Week San FranciscoOracle & SQL Server on the Cloud: Database Week San Francisco
Oracle & SQL Server on the Cloud: Database Week San Francisco
 
How to Integrate Hyperconverged Systems with Existing SANs
How to Integrate Hyperconverged Systems with Existing SANsHow to Integrate Hyperconverged Systems with Existing SANs
How to Integrate Hyperconverged Systems with Existing SANs
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Oracle and SQL Server on the Cloud
Oracle and SQL Server on the CloudOracle and SQL Server on the Cloud
Oracle and SQL Server on the Cloud
 

Más de Cloudera, Inc.

Más de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 

Último

%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
masabamasaba
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
VictoriaMetrics
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
masabamasaba
 
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
masabamasaba
 
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
chiefasafspells
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg
 

Último (20)

%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the Situation
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
 
tonesoftg
tonesoftgtonesoftg
tonesoftg
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
 
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
 

How to Lower TCO and Avoid Cloud Lock-in


  • 1. 1© Cloudera, Inc. All rights reserved. How to Lower TCO and Avoid Cloud Lock-in Jim Fisher, Director of Systems Engineering at Cloudera Ifi Derekli, Systems Engineer at Cloudera Susan Greslik, Systems Engineer at Cloudera
  • 2. 2© Cloudera, Inc. All rights reserved. ● Moving from on-prem to Cloud: Best practices for lowering your TCO ● Portability and selecting the right Cloud provider ● Demo of using multiple Cloud providers ● Bringing it all together with Cloudera Altus Agenda
  • 3. 3© Cloudera, Inc. All rights reserved. Big Data deployments in cloud are accelerating ● Increased agility through end-user self- service ● Organization focused on higher value items ● Perceived lower overall TCO by optimizing infrastructure usage Benefits of the Cloud
  • 4. 4© Cloudera, Inc. All rights reserved. Three Types of Deployment Models in Cloud 1hr SPIN UP SPIN DOWN 24/7 24/7 1hr SPIN UP SPIN DOWN Persistent Transient Elastic
  • 5. 5© Cloudera, Inc. All rights reserved. Characteristics are Different for Each Usage Requirements Example Workloads Persistent ● Runs 24/7 ● Only expands when new capacity is needed ● High availability & disaster recovery ● Cluster operational management ● Resource management ● Security ● NoSQL ● Streaming ● BI analytics ● Multi-user Transient ● Runs on an intermittent basis (e.g.: daily, weekly, hourly) ● Object store integration ● Fast cluster provisioning ● Cluster metadata persistence ● Usage-based pricing ● ETL workflows ● Model training ● Ad hoc analytics ● Dev/Test workflows Elastic ● Some nodes run 24/7 ● Others added & removed as needed (e.g.: daily, weekly, monthly, quarterly) ● Combination of requirements from persistent and transient clusters ● BI analytics during peak hours ● End of week, month, quarter processing
  • 6. 6© Cloudera, Inc. All rights reserved. Cost Models are Different Cost Structure Cost Optimization Option Persistent Yearly “rental” of infrastructure Multi-year agreements (e.g.: Reserved Instances) Transient Hourly “rental” of infrastructure Preemptible VMs (e.g.: Spot Instances) Elastic Yearly “rental” for persistent nodes and hourly “rental” for transient nodes Multi-year agreements for persistent nodes and preemptible VMs for transient nodes On-Premise Purchase of infrastructure that is typically depreciated over 3 years Limited
  • 7. 7© Cloudera, Inc. All rights reserved. ● Utilizing publicly available list pricing with no discounts ● AWS pricing used since they are the market leader but same concepts apply to other Cloud providers ● Your mileage may vary - These are going to be different for every organization but the concepts and numbers are directionally correct ● Only using infrastructure costs since they are often the majority of TCO Some Assumptions
  • 8. 8© Cloudera, Inc. All rights reserved. How Much Does a Server Cost? Component Details Cost Estimates Server 20 cores, 256GB RAM, 12 4TB disks $18,000 Data Center Power, cooling, and data center space* $4,000 Networking Switches & networking equip. $5,000 Administrator One person who manages 100 servers $6,000 TOTAL (3 Years) $33,000 TOTAL (Annually) $11,000 * https://ongoingoperations.com/data-center-pricing-credit-unions/
  • 9. 9© Cloudera, Inc. All rights reserved. How Much is a Similar Server in the Cloud? Component Details d2.8xlarge 36 vCPUs, 244 GB RAM, 12 4TB disks Option Unit Cost Total (3 Years) Total (Annual) On-Premise Server $33,000 / 3 years $33,000 $11,000 On-Demand Pricing $5.52 / hour $145,065 $48,355 Reserved Instance (1-Year) $23,616 / year $70,848 $23,616 Reserved Instance (3-Year) $41,560 / 3 years $41,560 $13,853
  • 10. 10© Cloudera, Inc. All rights reserved. Benefits of using object storage ● Create a data lake in object store for multiple clusters and eliminate data silos ● Provides durability for the data, so you don’t have to worry about replication. ● Allows you to separate compute and storage so you can grow independently. ● ..which leads to less costs than local storage Except when… ● Performance is critical. Each attached disk is roughly 100Mb/s and that’s rough estimate per server to object storage What if You Want to Use Object Storage? Object Store
  • 11. 11© Cloudera, Inc. All rights reserved. How Does Object Storage Affect the Costs? Component Details r4.8xlarge 32 vCPUs, 244 GB RAM EBS Disks 640GB S3 Storage 12 TB* (no replication required) Option Compute Storage Total (Annual) On-Premise Server $11,000 $0 $11,000 d2.8xl Reserved Instance (3- Year) $13,853 $0 $13,853 r4.8xl Reserved Instance (3- Year) $7,009 $5,458 $12,467 Object Store * Object Storage costs may be less since you pay for what you use
  • 12. 12© Cloudera, Inc. All rights reserved. Benefits of Transient Clusters ● Pay only for what you use ● Right-size cluster based on workload needs ● Better isolation between different users and groups Cloud Workloads are often Transient
  • 13. 13© Cloudera, Inc. All rights reserved. What if you only needed 6 Hours per Day? Component Details r4.8xlarge 32 vCPUs, 244 GB RAM, 6 hours/day EBS Disks 640GB, 6 hours/day S3 Storage 12 TB*, 24x7 Option Compute Storage Total (Annual) On-Premise Server $11,000 $0 $11,000 Reserved Instance (3-Year) $7,009 $5,458 $12,467 On Demand Pricing (6 hours/day) $4,648 $4,728 $9,376 * Object Storage costs may be less since you pay for what you use
  • 14. 14© Cloudera, Inc. All rights reserved. What are Preemptible Instances? Spare computing capacity that you can bid on a significant discount levels as compared to on-demand pricing. AWS suggests that costs can be 50-90% less than On-Demand and Google says they can be up to 80% cheaper. Known as Spot pricing in AWS and Preemptible VMs in Google. Preemptible Instances can be used to lower costs
  • 15. 15© Cloudera, Inc. All rights reserved. How much can you save with Spot at 70% Discount? Component Details r4.8xlarge 32 vCPUs cores, 244 GB RAM, 6 hours/day EBS Disks 640GB, 6 hours/day S3 Storage 12 TB*, 24x7 Option Compute Storage Total (Annual) On-Premise Server $11,000 $0 $11,000 On Demand Pricing (6 hours/day) $4,648 $4,728 $9,376 Spot Pricing - 70% (6 hours/day) $1,3941 $4,728 $6,122 * Object Storage costs may be less since you pay for what you use
  • 16. 16© Cloudera, Inc. All rights reserved. 25 Servers cost about $275,000 per year 35 Servers cost about $385,000 per year Let’s Walk through a Scenario Workload Requirements ETL ● SLAs to complete within 6 hours ● Need 20 servers to meet SLAs BI ● 15 servers to meet minimum workload ● 25 servers needed during business hours
  • 17. 17© Cloudera, Inc. All rights reserved. Two Clusters in the Cloud Component Details r4.2xlarge 100, with 8 vCPUs cores, 61 GB RAM EBS Disks 15.6 TB, 24 hours/day S3 Storage 300 TB Object Store BI - Persistent running 24/7 Component Details r4.2xlarge 80 with 8 vCPUs, 61 GB RAM, 6 hours/day EBS Disks 12.8 TB, 6 hours/day S3 Storage (included with BI workload) ETL - Transient Running 6 hrs/daily
  • 18. 18© Cloudera, Inc. All rights reserved. ● Cloud can be less when on- premise is not as highly utilized and more when on-premise is efficiently utilized ● Cloud TCO best practices used ● Cloud provides benefits of isolation and on-demand flexibility How Do the Costs Compare? Option Compute Storage Total (Annual) On-Premise (25 Servers) $275,000 $0 $275,000 On-Premise (35 Servers) $385,000 $0 $385,000 Cloud (ETL) $27,888 $4,853 $32,741 Cloud (BI) $175,233 $136,395 $311,628
  • 19. 19© Cloudera, Inc. All rights reserved. General ● Don’t look as Cloud as infrastructure hosted in another data center ● Understand the workloads so you use the right tool to optimize the TCO Best Practices on How to Lower Cloud TCO Storage ● Utilize object storage when possible to eliminate data silos ● Use local storage when performance SLAs are more critical Persistent ● Use Reserved Instances when workloads are known and can be committed to for multiple years Transient ● Use Preemptible instances when possible but you may have to re-design application
  • 20. 20© Cloudera, Inc. All rights reserved. Portability and selecting the right Cloud provider
  • 21. 21© Cloudera, Inc. All rights reserved. Why is portability important? Financial considerations • Price negotiation position • Instance pricing fluctuation • Project type can dictate cost (storage, processing power) Conflict of Interest • Competition with cloud vendors • Freedom to have choices for given projects is critical Maximize Capabilities • Freedom to leverage all features available across vendors Diversify Risk • Eliminate 100% dependency on vendor’s technology • Ensure uptime of your environment despite Cloud vendor potential issues
  • 22. 22© Cloudera, Inc. All rights reserved. What to consider? Questions to ask Pricing model ● Does the vendor round to nearest minute? Hour? Do they offer discounts for upfront commitment? Variety of services provided ● Does the vendor provide enough options for instances (e.g. dense disk, memory-optimized, cpu- optimized), storage options (e.g. local disk, object- store), or network capabilities to meet your needs? Ease of Use ● Do you have an existing skill-set for a particular vendor? ● Is the platform simple to deploy and easy to learn? Support & Management ● What type of assistance will the vendor provide? ● How easy is it to troubleshoot your cloud environment?
  • 23. 23© Cloudera, Inc. All rights reserved. Orchestration tool for deploying, monitoring and scaling Cloudera EDH on cloud infrastructure Characteristics: • Embodies Cloudera best-practices and reference Architectures • Complements on-prem offerings for IaaS users • Extends capabilities of Cloudera Manager • Grows and shrinks clusters via single pane of glass Main Goals: • Reduce time-to-value • Enable new usage patterns (on-demand clusters, self-service) • Facilitate portability amongst cloud vendors • Allow predictability of workloads What is Cloudera Director? Cloudera Director
  • 24. 24© Cloudera, Inc. All rights reserved. • Re-usability • Cluster configuration files • Node templates (Master, Workers, Edge) • “Standard” cluster configs -> predictable project costs • Only Pay for What You Use • Automatic billing • Flexible SKUs based on use case Cloudera Director Economics
  • 25. 25© Cloudera, Inc. All rights reserved. How Does Director Help with Portability? IFI TODO: Screenshot of director
  • 26. 26© Cloudera, Inc. All rights reserved. How Does Director Help with Portability?
  • 27. 27© Cloudera, Inc. All rights reserved. Demo
  • 28. 28© Cloudera, Inc. All rights reserved. ● Brand for Cloudera PaaS offerings ● Foundation acts as framework for building services. ● Altus for data engineers is first user-facing service. Cloudera Altus is a PaaS for big data analytics
  • 29. 29© Cloudera, Inc. All rights reserved. ● Understand your workloads so you use the right tools and right vendor to optimize the TCO ● Plan for portability to reduce risk and costs and enable options ● Cloudera can help you plan and architect efficiently ● Cloudera Altus - PaaS offering so you can focus on your applications Key Takeaways
  • 30. 30© Cloudera, Inc. All rights reserved. Thank you
  • 31. 31© Cloudera, Inc. All rights reserved. An Enterprise Data Hub reimagined in the cloud Object Store Object Store Data Science Workbench Common Governance Common Security Common: Operations, Governance, Security, Schema, Catalog SQL WorkbenchPartner EcosystemWorkload Management