SlideShare una empresa de Scribd logo
1 de 23
www.datacenterdynamics.com
THE RISE OF BIG DATA..
WHEN DO “YOU” HAVE TO FACE IT
Dez Blanchfield
June 24th, 2014
Big data “hammer” Created by Rachel Jones of Wink Design Studio using: © Tagxedo.com
www.datacenterdynamics.com
BIG DATA & THE DATA CENTRE
BIG DATA HAS EVOLVED TO ESTABLISH ITS PLACE IN THE ENTERPRISE DATA CENTER
Data volumes are growing exponentially year on year, and the ‘stress’ being placed on data center infrastructure
across networks, storage and compute is overloading many data centers ability to service it.
Data Center infrastructure need to adapt and evolve in order to support new workloads. Let’s take a quick look at
what a data center is today, what big data is, and the types of workloads and technologies we’re having to
consider as upcoming growth markets.
Frist let’s just state once and for all, we are sick and tired of hearing about the 4 x V’s
» Volume
» Velocity
» Variety
» Veracity
www.datacenterdynamics.com
WHAT IS BIG DATA
LET’S BE CLEAR ABOUT WHAT IS NOT BIG DATA
Everyone has an opinion about what Big Data is and is not. Let’s be clear about what Big Data is NOT.
Just putting a “Big Data” stamp on it does not make it Big Data.
Big Data is not:
» Lots of data
» Fast data
» Messy data
» Badly managed data
» Bigger databases / bigger SAN’s
» Individual silos of data
» The result of regulatory data retention
Analysis of bad data will result in bad information.
FACT: A growth of data in a a traditional enterprise
Databases from 20 GB to 200 GB is not Big Data, that’s
Just lots of data. The are not the same by any measure.
www.datacenterdynamics.com
WHAT IS BIG DATA
A PLAIN ENGLISH DEFINITION AND A FEW EXAMPLES OF WHAT BIG DATA IS
BIG DATA: “The collection, processing and usage of large volumes of digitized data to improve how companies
make important decisions and operate the business.”
What is Big Data:
» Unstructured data / Machine data
» Aggregated click streams
» Previously unconnected data feeds
» Horizontal analysis across vertical silos
» Insights from analysis of multiple data pools
» Sentiment analysis within communities ( staff / consumers )
» Predictive Analytics replaces Routine Maintenance
Big Data analytics can give you a key points of differentiation.
IBM: “Over the next decade we will see significant gaps open
up between enterprises that proactively transform their operations
for the digital age and those that continue with business as usual.”
Era of Smart - Reinventing Australian Enterprises for the Digital Economy
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
LET’S BE HONEST - DO WE ACTUALLY KNOW WHAT A DATA CENTRE IS ANY MORE
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
IS IT TRADITIONAL PURPOSE BUILT DEDICATED IT ACCOMODATION
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
IS IT A HONKING BIG UBER SECRET CAMPUS OF WAREHOUSE SCALE DATA HALLS
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
IS IT PURPOSE BUILT CONTAINERISED MOBILE DATA ROOMS
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
IS IT A HYBRID OF CONTAINERISED COMPUTE FABRIC AND TRADITIONAL DATA HALLS
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
THE GIANTS OF THE INDUSTRY ARE DOING MORE THAN JUST DIPPING THEIR TOES
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
IS IT DEDICATED PURPOSE BUILT IT ACCOMODATION
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
IF IT WALKS LIKE A DUCK, IF IT QUACKS LIKE A DUCK
www.datacenterdynamics.com
WHAT IS A DATA CENTRE
DATA CENTRE INNOVAITON – THE SKY’S THE LIMIT
www.datacenterdynamics.com
THE DATA CENTRE LANDSCAPE
NOT ALL DATA CENTERS ARE CREATED EQUAL
When the media talk about Big Data data centers they all too often default to what I call The Usual Suspects
The Usual Suspects have specialist niche application workloads to service, i.e. not Enterprise workloads
» Facebook
» Google
» YaHoo
» eBay
» PayPal
» NASA
» CERN
» CIA / TSA / FBI / NSA
Some have created disruptive technologies
» Hadoop ( Google / YaHoo )
» OpenStack ( NASA / Rackspace )
» Open Compute ( Facebook )
www.datacenterdynamics.com
THE BIG DATA LANDSCAPE
NOT ALL DATA IS CREATED EQUAL
In this era of big data, we are fast learning, all too often the hard way, that not all data is created equal.
Raw data originating from machine log files, social media, or years of original transaction data is often considered to
be of lower value until it has been prepared & refined for analysis
key points to keep in mind about your data
» Treat data as if it is perishable goods
» Make timely relevant use of data in decision-making
» Know where your data is at all times
» Know who has access to your data, when and how
» Be able to provide access to your data in multiple forms
» Structure data consistently, ETL is your friend
» Silos make data less valuable over time
» Data documentation is critical
» Communicate within the business on exactly
what data you have and why
www.datacenterdynamics.com
BIG DATA PLATFORMS ARE PLENTIFUL
THE GROWTH IN BIG DATA PLATFORMS IS NOTHING SHORT OF EXPLOSIVE
www.datacenterdynamics.com
NOT ALWAYS DATA CENTRE FRIENDLY
BOTH ENTERPRISE & OPEN SOURCE PLATFORMS PRESENT THEIR OWN CHALLENGES
Deploying any large scale Big Data platform into a modern data center will present challenges not faced with
traditional enterprise network, storage and compute workloads – especially in the area of “rack awareness”.
Platforms which span multiple racks should ensure that replicas of data exist on multiple racks. This way, the loss of a
switch does not render portions of the data unavailable due to all replicas being underneath it.
Rack awareness is critical in datacenters - Not all Big Data platforms are capable of being “rack aware”
» Hadoop 1.2.1 and 2.0 can be made rack aware
» OpenStack not so much ( SWIFT has zones )
» SAP Hana is not
» CSC Infochimps is in part
» Spark can schedule for locality
» Ceph + RADOS = rack aware object store
» Moose FS has “proximity” settings
» Aerospike has “paxos protocol”
» Couchbase 2.5 now has rack awareness
» Traditional enterprise workloads don’t apply
» HPC platforms eat networks for breakfast
» A full Hadoop rebalance is quite the joyride
FACT: If you make the wrong design decisions in either bare metal or virtualized deployments of any of these types
of ecosystems, your network, storage, compute & data center infrastructure are in for a whole new world of pain.
www.datacenterdynamics.com
TRY HOSTING THESE EXAMPLES
CONSIDER THE CHALLENGES HOSTING THESE BIG DATA CUSTOMERS
A provincial Chinese phone company payroll system
» One million full time staff
Queensland power utility
» Asset & Vegetation management system
» Drone planes acquiring 8TB of data per flight
» 2 flights per day per plane, plans for a fleet of 6 planes
» HPC & Big Data storage & compute resources on-prem and in-cloud
Virgin Atlantic ( airline )
» The new Boeing 787 aircraft create half a terabyte of data per flight
» There are an average of 87,000 domestic flights a day inside US airspace
» 87,000 flights per day x 0.5 TB p/flight = 43,500 TB of data created per day
» In other words, approx. 42 Petabytes a day ( the meaning of life the universe and everything !? )
NOTE: That’s just the “storage” problem, consider the network and compute scale required to
perform even the most rudimentary analysis on a dataset of that scale !!
!?
www.datacenterdynamics.com
OTHER KEY TOPICS IMPACTING DATA CENTRES
BIG DATA AND THE MIRIAD OF SUPPORTING TECH TOPICS YOU NEED TO BE WATCHING
On-prem / Off-prem / Hybrid / Cloud
» Azure / AWS / RAX / Softlayer / Smartcloud
» VMware / OpenStack / Citrix / Hyper-V / KVM / Xen / LXC
VM Instantiation / App Containers
» Vagrant / Packer / Docker
» OSv / Capstan
» Joyent / SmartOS
DevOps / Automation / Service Catalogues
» FAI / Pre-Seed / Kickstart / Cobbler
» Puppet / Chef / cfengine / Salt / Ansible / Razor / Juju
Bursting into 3rd party clouds
» Private
» Public
» Hybrid
» Storage, Network & Compute
www.datacenterdynamics.com
TAKEAWAY POINTS – PART 1
30 MINUTES BARELY LETS US SCRATCH THE SURFACE
Food for thought when you are gazing into your crystal ball trying to map out your 1, 2, 3 and 5 year roadmap
» Traditional data center thinking no longer valid
» Your old cost models need to be completely rebuilt
» Starting with a clean sheet of paper is a valid decision
» Distributed beats centralized
» Girds beat Networks beat Clusters beat Scale
» 2 x Tier 1 beats 1 x Tier 3 hands down
» Edge of network is as important as your network core
» Economists outnumber engineers
» Actuaries and Data Scientists are now cool
» Software & hardware developers must be Infrastructure savvy
» Web scale has taught us some valuable things
» Modularity is the key
» Build it by the rack at the factory and ship it to me
» Reference architectures now include applications stacks
» Infrastructure as a Service is the new normal
» Platform and Software “as a service” by default
» Containers are a good start
www.datacenterdynamics.com
TAKEAWAY POINTS – PART 2
30 MINUTES BARELY LETS US SCRATCH THE SURFACE
Food for thought when you are gazing into your crystal ball trying to map out your 1, 2, 3 and 5 year roadmap
» Whole of rack “forklift installs”
» Petabyte is the new Terabyte
» Everything including the kitchen sink is purpose built
» Property developers get kitchens and bathrooms made in China
» Containers of kitchens and bathrooms are “dropped in from the roof”
» Buildings are not necessary
» Tin sheds and razor wire are more and more acceptable
» 42 foot containers plugged into the side of buildings is quite normal
» Software defined everything
» Networks glue it all together
» Capex is dead / Opex is king / Customers want to rent everything
» Contracts are no longer relevant
» Focus on delivering good products & services
» Everything is a service and paid for “by the month”
» Vendor lock in is to be avoided like the plague
» Everything gets smaller and we rack and pack more of them
» Power consumption of 1x rack in 2014 equals 10 racks in 2004
www.datacenterdynamics.com
THANK YOU
THANKS FOR YOUR TIME – FEEL FREE TO PING ME ON TWITTER OR LINKEDIN
QUESTIONS ?
www.datacenterdynamics.com
ABOUT ME
DEZ BLANCHFIELD
Strategy & Architecture
Australian Federal Government
Cloud Computing, Big Data, Hadoop & OpenStack solutions
all start with a conversation. You name the time & place,
and I'll pay for coffee.
{ email } dez@gara.guru
{ mobile } +61 414 464 356
{ phone } +61 2 8006 4700
{ twitter } @dez_blanchfield
{ linkedin } http://linkedin.com/in/dezblanchfield

Más contenido relacionado

La actualidad más candente

Introduction to Big Data
Introduction to Big Data Introduction to Big Data
Introduction to Big Data Srinath Perera
 
Addressing Big Data Challenges - The Hadoop Way
Addressing Big Data Challenges - The Hadoop WayAddressing Big Data Challenges - The Hadoop Way
Addressing Big Data Challenges - The Hadoop WayXoriant Corporation
 
The evolution of data analytics
The evolution of data analyticsThe evolution of data analytics
The evolution of data analyticsNatalino Busa
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big datakk1718
 
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
 Unexpected Challenges in Large Scale Machine Learning by Charles Parker Unexpected Challenges in Large Scale Machine Learning by Charles Parker
Unexpected Challenges in Large Scale Machine Learning by Charles ParkerBigMine
 
Tools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersTools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersMelinda Thielbar
 
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...yashbheda
 
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...BigMine
 
Big Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsBig Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsChandan Rajah
 
Big Tools for Big Data
Big Tools for Big DataBig Tools for Big Data
Big Tools for Big DataLewis Crawford
 
Big data ppt
Big data pptBig data ppt
Big data pptYash Raj
 
Big data introduction
Big data introductionBig data introduction
Big data introductionChirag Ahuja
 

La actualidad más candente (20)

Introduction to Big Data
Introduction to Big Data Introduction to Big Data
Introduction to Big Data
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Addressing Big Data Challenges - The Hadoop Way
Addressing Big Data Challenges - The Hadoop WayAddressing Big Data Challenges - The Hadoop Way
Addressing Big Data Challenges - The Hadoop Way
 
The evolution of data analytics
The evolution of data analyticsThe evolution of data analytics
The evolution of data analytics
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
 
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
 Unexpected Challenges in Large Scale Machine Learning by Charles Parker Unexpected Challenges in Large Scale Machine Learning by Charles Parker
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
 
Big Data Hadoop
Big Data HadoopBig Data Hadoop
Big Data Hadoop
 
Tools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersTools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl Winters
 
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
 
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
 
Big Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsBig Data Science: Intro and Benefits
Big Data Science: Intro and Benefits
 
Big Tools for Big Data
Big Tools for Big DataBig Tools for Big Data
Big Tools for Big Data
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data introduction
Big data introductionBig data introduction
Big data introduction
 
Big Data
Big DataBig Data
Big Data
 
Big data
Big dataBig data
Big data
 
Big data 101
Big data 101Big data 101
Big data 101
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 

Destacado

Presentation Big Data
Presentation Big DataPresentation Big Data
Presentation Big DataRené Kuipers
 
Big Data
Big DataBig Data
Big DataNGDATA
 
Digital Transformation briefing to CAUDIT - CIO’s of Australian universities
Digital Transformation briefing to CAUDIT - CIO’s of Australian universitiesDigital Transformation briefing to CAUDIT - CIO’s of Australian universities
Digital Transformation briefing to CAUDIT - CIO’s of Australian universitiesDez Blanchfield
 
Money and the global debt crisis
Money and the global debt crisisMoney and the global debt crisis
Money and the global debt crisisJohn Bradford
 
Big Data: Big Numbers Bigger Questions, A presentation at Big Data Week
Big Data: Big Numbers Bigger Questions, A presentation at Big Data WeekBig Data: Big Numbers Bigger Questions, A presentation at Big Data Week
Big Data: Big Numbers Bigger Questions, A presentation at Big Data WeekChloe Thomas
 
Big data presentation, explanations and use cases in industrial sector
Big data presentation, explanations and use cases in industrial sectorBig data presentation, explanations and use cases in industrial sector
Big data presentation, explanations and use cases in industrial sectorNicolas Sarramagna
 
Big data presentation on Crystal Ball Event Prediction
Big data presentation on Crystal Ball Event PredictionBig data presentation on Crystal Ball Event Prediction
Big data presentation on Crystal Ball Event PredictionSujan Thapa
 
Apache Drill (ver. 0.1, check ver. 0.2)
Apache Drill (ver. 0.1, check ver. 0.2)Apache Drill (ver. 0.1, check ver. 0.2)
Apache Drill (ver. 0.1, check ver. 0.2)Camuel Gilyadov
 
Big Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBig Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBernard Marr
 

Destacado (14)

Presentation Big Data
Presentation Big DataPresentation Big Data
Presentation Big Data
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big Data
Big DataBig Data
Big Data
 
Digital Transformation briefing to CAUDIT - CIO’s of Australian universities
Digital Transformation briefing to CAUDIT - CIO’s of Australian universitiesDigital Transformation briefing to CAUDIT - CIO’s of Australian universities
Digital Transformation briefing to CAUDIT - CIO’s of Australian universities
 
Money and the global debt crisis
Money and the global debt crisisMoney and the global debt crisis
Money and the global debt crisis
 
Big Data: Big Numbers Bigger Questions, A presentation at Big Data Week
Big Data: Big Numbers Bigger Questions, A presentation at Big Data WeekBig Data: Big Numbers Bigger Questions, A presentation at Big Data Week
Big Data: Big Numbers Bigger Questions, A presentation at Big Data Week
 
Big data presentation, explanations and use cases in industrial sector
Big data presentation, explanations and use cases in industrial sectorBig data presentation, explanations and use cases in industrial sector
Big data presentation, explanations and use cases in industrial sector
 
Big data presentation on Crystal Ball Event Prediction
Big data presentation on Crystal Ball Event PredictionBig data presentation on Crystal Ball Event Prediction
Big data presentation on Crystal Ball Event Prediction
 
Apache Drill (ver. 0.1, check ver. 0.2)
Apache Drill (ver. 0.1, check ver. 0.2)Apache Drill (ver. 0.1, check ver. 0.2)
Apache Drill (ver. 0.1, check ver. 0.2)
 
Big Data simplified
Big Data simplifiedBig Data simplified
Big Data simplified
 
Big Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBig Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must Know
 
Big Idea For Big Data
Big Idea For Big DataBig Idea For Big Data
Big Idea For Big Data
 

Similar a Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield

The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Sciencesarith divakar
 
Big Data Basic Concepts | Presented in 2014
Big Data Basic Concepts  | Presented in 2014Big Data Basic Concepts  | Presented in 2014
Big Data Basic Concepts | Presented in 2014Kenneth Igiri
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptxElsonPaul2
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKRajesh Jayarman
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyRohit Dubey
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-HadoopNagarjuna D.N
 
Hadoop Master Class : A concise overview
Hadoop Master Class : A concise overviewHadoop Master Class : A concise overview
Hadoop Master Class : A concise overviewAbhishek Roy
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
 
Lean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataLean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataStylight
 
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseDebunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseStavros Papadopoulos
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptalmaraniabwmalk
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 
Innovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerInnovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerMicrosoft
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusersBob Hardaway
 
Planning and Optimizing Data Lake Architecture - Milos Milovanovic
 Planning and Optimizing Data Lake Architecture - Milos Milovanovic Planning and Optimizing Data Lake Architecture - Milos Milovanovic
Planning and Optimizing Data Lake Architecture - Milos MilovanovicInstitute of Contemporary Sciences
 
Planing and optimizing data lake architecture
Planing and optimizing data lake architecturePlaning and optimizing data lake architecture
Planing and optimizing data lake architectureMilos Milovanovic
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...DataStax
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoopMohit Tare
 

Similar a Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield (20)

The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
Big Data Basic Concepts | Presented in 2014
Big Data Basic Concepts  | Presented in 2014Big Data Basic Concepts  | Presented in 2014
Big Data Basic Concepts | Presented in 2014
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RK
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
Hadoop Master Class : A concise overview
Hadoop Master Class : A concise overviewHadoop Master Class : A concise overview
Hadoop Master Class : A concise overview
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Lean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataLean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big Data
 
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseDebunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Innovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerInnovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringer
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
 
Planning and Optimizing Data Lake Architecture - Milos Milovanovic
 Planning and Optimizing Data Lake Architecture - Milos Milovanovic Planning and Optimizing Data Lake Architecture - Milos Milovanovic
Planning and Optimizing Data Lake Architecture - Milos Milovanovic
 
Planing and optimizing data lake architecture
Planing and optimizing data lake architecturePlaning and optimizing data lake architecture
Planing and optimizing data lake architecture
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
 
Big data business case
Big data   business caseBig data   business case
Big data business case
 
Addressing dm-cloud
Addressing dm-cloudAddressing dm-cloud
Addressing dm-cloud
 

Más de Dez Blanchfield

Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slidesHot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slidesDez Blanchfield
 
CDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez BlanchfieldCDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez BlanchfieldDez Blanchfield
 
Briefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business StorageBriefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business StorageDez Blanchfield
 
Young it - digital transformation on a personal level
Young it - digital transformation on a personal levelYoung it - digital transformation on a personal level
Young it - digital transformation on a personal levelDez Blanchfield
 
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BIHot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BIDez Blanchfield
 
Smart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez BlanchfieldSmart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez BlanchfieldDez Blanchfield
 
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...Dez Blanchfield
 
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...Dez Blanchfield
 
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...Dez Blanchfield
 
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...Dez Blanchfield
 
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...Dez Blanchfield
 
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...Dez Blanchfield
 
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...Dez Blanchfield
 
OpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez BlanchfieldOpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez BlanchfieldDez Blanchfield
 
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...Dez Blanchfield
 
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...Dez Blanchfield
 
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...Dez Blanchfield
 
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...Dez Blanchfield
 
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...Dez Blanchfield
 
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...Dez Blanchfield
 

Más de Dez Blanchfield (20)

Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slidesHot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
 
CDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez BlanchfieldCDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
 
Briefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business StorageBriefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
 
Young it - digital transformation on a personal level
Young it - digital transformation on a personal levelYoung it - digital transformation on a personal level
Young it - digital transformation on a personal level
 
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BIHot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
 
Smart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez BlanchfieldSmart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
 
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
 
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
 
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
 
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
 
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
 
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
 
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
 
OpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez BlanchfieldOpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez Blanchfield
 
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
 
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
 
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
 
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
 
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
 
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
 

Último

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Último (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield

  • 1. www.datacenterdynamics.com THE RISE OF BIG DATA.. WHEN DO “YOU” HAVE TO FACE IT Dez Blanchfield June 24th, 2014 Big data “hammer” Created by Rachel Jones of Wink Design Studio using: © Tagxedo.com
  • 2. www.datacenterdynamics.com BIG DATA & THE DATA CENTRE BIG DATA HAS EVOLVED TO ESTABLISH ITS PLACE IN THE ENTERPRISE DATA CENTER Data volumes are growing exponentially year on year, and the ‘stress’ being placed on data center infrastructure across networks, storage and compute is overloading many data centers ability to service it. Data Center infrastructure need to adapt and evolve in order to support new workloads. Let’s take a quick look at what a data center is today, what big data is, and the types of workloads and technologies we’re having to consider as upcoming growth markets. Frist let’s just state once and for all, we are sick and tired of hearing about the 4 x V’s » Volume » Velocity » Variety » Veracity
  • 3. www.datacenterdynamics.com WHAT IS BIG DATA LET’S BE CLEAR ABOUT WHAT IS NOT BIG DATA Everyone has an opinion about what Big Data is and is not. Let’s be clear about what Big Data is NOT. Just putting a “Big Data” stamp on it does not make it Big Data. Big Data is not: » Lots of data » Fast data » Messy data » Badly managed data » Bigger databases / bigger SAN’s » Individual silos of data » The result of regulatory data retention Analysis of bad data will result in bad information. FACT: A growth of data in a a traditional enterprise Databases from 20 GB to 200 GB is not Big Data, that’s Just lots of data. The are not the same by any measure.
  • 4. www.datacenterdynamics.com WHAT IS BIG DATA A PLAIN ENGLISH DEFINITION AND A FEW EXAMPLES OF WHAT BIG DATA IS BIG DATA: “The collection, processing and usage of large volumes of digitized data to improve how companies make important decisions and operate the business.” What is Big Data: » Unstructured data / Machine data » Aggregated click streams » Previously unconnected data feeds » Horizontal analysis across vertical silos » Insights from analysis of multiple data pools » Sentiment analysis within communities ( staff / consumers ) » Predictive Analytics replaces Routine Maintenance Big Data analytics can give you a key points of differentiation. IBM: “Over the next decade we will see significant gaps open up between enterprises that proactively transform their operations for the digital age and those that continue with business as usual.” Era of Smart - Reinventing Australian Enterprises for the Digital Economy
  • 5. www.datacenterdynamics.com WHAT IS A DATA CENTRE LET’S BE HONEST - DO WE ACTUALLY KNOW WHAT A DATA CENTRE IS ANY MORE
  • 6. www.datacenterdynamics.com WHAT IS A DATA CENTRE IS IT TRADITIONAL PURPOSE BUILT DEDICATED IT ACCOMODATION
  • 7. www.datacenterdynamics.com WHAT IS A DATA CENTRE IS IT A HONKING BIG UBER SECRET CAMPUS OF WAREHOUSE SCALE DATA HALLS
  • 8. www.datacenterdynamics.com WHAT IS A DATA CENTRE IS IT PURPOSE BUILT CONTAINERISED MOBILE DATA ROOMS
  • 9. www.datacenterdynamics.com WHAT IS A DATA CENTRE IS IT A HYBRID OF CONTAINERISED COMPUTE FABRIC AND TRADITIONAL DATA HALLS
  • 10. www.datacenterdynamics.com WHAT IS A DATA CENTRE THE GIANTS OF THE INDUSTRY ARE DOING MORE THAN JUST DIPPING THEIR TOES
  • 11. www.datacenterdynamics.com WHAT IS A DATA CENTRE IS IT DEDICATED PURPOSE BUILT IT ACCOMODATION
  • 12. www.datacenterdynamics.com WHAT IS A DATA CENTRE IF IT WALKS LIKE A DUCK, IF IT QUACKS LIKE A DUCK
  • 13. www.datacenterdynamics.com WHAT IS A DATA CENTRE DATA CENTRE INNOVAITON – THE SKY’S THE LIMIT
  • 14. www.datacenterdynamics.com THE DATA CENTRE LANDSCAPE NOT ALL DATA CENTERS ARE CREATED EQUAL When the media talk about Big Data data centers they all too often default to what I call The Usual Suspects The Usual Suspects have specialist niche application workloads to service, i.e. not Enterprise workloads » Facebook » Google » YaHoo » eBay » PayPal » NASA » CERN » CIA / TSA / FBI / NSA Some have created disruptive technologies » Hadoop ( Google / YaHoo ) » OpenStack ( NASA / Rackspace ) » Open Compute ( Facebook )
  • 15. www.datacenterdynamics.com THE BIG DATA LANDSCAPE NOT ALL DATA IS CREATED EQUAL In this era of big data, we are fast learning, all too often the hard way, that not all data is created equal. Raw data originating from machine log files, social media, or years of original transaction data is often considered to be of lower value until it has been prepared & refined for analysis key points to keep in mind about your data » Treat data as if it is perishable goods » Make timely relevant use of data in decision-making » Know where your data is at all times » Know who has access to your data, when and how » Be able to provide access to your data in multiple forms » Structure data consistently, ETL is your friend » Silos make data less valuable over time » Data documentation is critical » Communicate within the business on exactly what data you have and why
  • 16. www.datacenterdynamics.com BIG DATA PLATFORMS ARE PLENTIFUL THE GROWTH IN BIG DATA PLATFORMS IS NOTHING SHORT OF EXPLOSIVE
  • 17. www.datacenterdynamics.com NOT ALWAYS DATA CENTRE FRIENDLY BOTH ENTERPRISE & OPEN SOURCE PLATFORMS PRESENT THEIR OWN CHALLENGES Deploying any large scale Big Data platform into a modern data center will present challenges not faced with traditional enterprise network, storage and compute workloads – especially in the area of “rack awareness”. Platforms which span multiple racks should ensure that replicas of data exist on multiple racks. This way, the loss of a switch does not render portions of the data unavailable due to all replicas being underneath it. Rack awareness is critical in datacenters - Not all Big Data platforms are capable of being “rack aware” » Hadoop 1.2.1 and 2.0 can be made rack aware » OpenStack not so much ( SWIFT has zones ) » SAP Hana is not » CSC Infochimps is in part » Spark can schedule for locality » Ceph + RADOS = rack aware object store » Moose FS has “proximity” settings » Aerospike has “paxos protocol” » Couchbase 2.5 now has rack awareness » Traditional enterprise workloads don’t apply » HPC platforms eat networks for breakfast » A full Hadoop rebalance is quite the joyride FACT: If you make the wrong design decisions in either bare metal or virtualized deployments of any of these types of ecosystems, your network, storage, compute & data center infrastructure are in for a whole new world of pain.
  • 18. www.datacenterdynamics.com TRY HOSTING THESE EXAMPLES CONSIDER THE CHALLENGES HOSTING THESE BIG DATA CUSTOMERS A provincial Chinese phone company payroll system » One million full time staff Queensland power utility » Asset & Vegetation management system » Drone planes acquiring 8TB of data per flight » 2 flights per day per plane, plans for a fleet of 6 planes » HPC & Big Data storage & compute resources on-prem and in-cloud Virgin Atlantic ( airline ) » The new Boeing 787 aircraft create half a terabyte of data per flight » There are an average of 87,000 domestic flights a day inside US airspace » 87,000 flights per day x 0.5 TB p/flight = 43,500 TB of data created per day » In other words, approx. 42 Petabytes a day ( the meaning of life the universe and everything !? ) NOTE: That’s just the “storage” problem, consider the network and compute scale required to perform even the most rudimentary analysis on a dataset of that scale !! !?
  • 19. www.datacenterdynamics.com OTHER KEY TOPICS IMPACTING DATA CENTRES BIG DATA AND THE MIRIAD OF SUPPORTING TECH TOPICS YOU NEED TO BE WATCHING On-prem / Off-prem / Hybrid / Cloud » Azure / AWS / RAX / Softlayer / Smartcloud » VMware / OpenStack / Citrix / Hyper-V / KVM / Xen / LXC VM Instantiation / App Containers » Vagrant / Packer / Docker » OSv / Capstan » Joyent / SmartOS DevOps / Automation / Service Catalogues » FAI / Pre-Seed / Kickstart / Cobbler » Puppet / Chef / cfengine / Salt / Ansible / Razor / Juju Bursting into 3rd party clouds » Private » Public » Hybrid » Storage, Network & Compute
  • 20. www.datacenterdynamics.com TAKEAWAY POINTS – PART 1 30 MINUTES BARELY LETS US SCRATCH THE SURFACE Food for thought when you are gazing into your crystal ball trying to map out your 1, 2, 3 and 5 year roadmap » Traditional data center thinking no longer valid » Your old cost models need to be completely rebuilt » Starting with a clean sheet of paper is a valid decision » Distributed beats centralized » Girds beat Networks beat Clusters beat Scale » 2 x Tier 1 beats 1 x Tier 3 hands down » Edge of network is as important as your network core » Economists outnumber engineers » Actuaries and Data Scientists are now cool » Software & hardware developers must be Infrastructure savvy » Web scale has taught us some valuable things » Modularity is the key » Build it by the rack at the factory and ship it to me » Reference architectures now include applications stacks » Infrastructure as a Service is the new normal » Platform and Software “as a service” by default » Containers are a good start
  • 21. www.datacenterdynamics.com TAKEAWAY POINTS – PART 2 30 MINUTES BARELY LETS US SCRATCH THE SURFACE Food for thought when you are gazing into your crystal ball trying to map out your 1, 2, 3 and 5 year roadmap » Whole of rack “forklift installs” » Petabyte is the new Terabyte » Everything including the kitchen sink is purpose built » Property developers get kitchens and bathrooms made in China » Containers of kitchens and bathrooms are “dropped in from the roof” » Buildings are not necessary » Tin sheds and razor wire are more and more acceptable » 42 foot containers plugged into the side of buildings is quite normal » Software defined everything » Networks glue it all together » Capex is dead / Opex is king / Customers want to rent everything » Contracts are no longer relevant » Focus on delivering good products & services » Everything is a service and paid for “by the month” » Vendor lock in is to be avoided like the plague » Everything gets smaller and we rack and pack more of them » Power consumption of 1x rack in 2014 equals 10 racks in 2004
  • 22. www.datacenterdynamics.com THANK YOU THANKS FOR YOUR TIME – FEEL FREE TO PING ME ON TWITTER OR LINKEDIN QUESTIONS ?
  • 23. www.datacenterdynamics.com ABOUT ME DEZ BLANCHFIELD Strategy & Architecture Australian Federal Government Cloud Computing, Big Data, Hadoop & OpenStack solutions all start with a conversation. You name the time & place, and I'll pay for coffee. { email } dez@gara.guru { mobile } +61 414 464 356 { phone } +61 2 8006 4700 { twitter } @dez_blanchfield { linkedin } http://linkedin.com/in/dezblanchfield