SlideShare a Scribd company logo
1 of 73
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
THE POWER OF BIG DATA
Ben Snively, Solutions Architect, Amazon Web Services
Jacob Tomlinson, Lead Engineer, Met Office Informatics Lab
Why Big Data?
Rapid Data Growth
1.2 ZB in 2015
44 ZB by 2020
180 ZB by 2025
Data Generation
• Speed of the data.
G
B
T
B
PB
ZB
E
B
Source:
IDC,
2015
IoT
Social Media
Enterprise
Systems
Data Variety
• 80 billion devices by 2025
• 500 million tweets daily
Volume Velocity Variety
There will be more words written on Twitter
than contained in all books ever printed.
Source: The Huffington Post, Is Twitter Bad For Language? Statistical Analysis Says No
(New Book from Christian Rudder, Dataclysm: Who We Are)In 2014…
Artificial
Intelligence
What is Big Data Powering on AWS?
The Modern Data Architecture
… in Public Sector
Storage	&	Streams
Catalogue	&	Search
Entitlements
API	&	UI
Attributes of a Modern
Data Architecture
Key Pillars of a
Data Lake
Key Components of a Successful Data Strategy
Putting it all together
One tool to
rule them all
Different Types of Services…
Virtualized Managed Serverless
You can easily
provision servers and
focus on OS and
above.
You focus higher in the
stack but still need to
consider servers, how
much CPU is needed, and
how much RAM.
AWS manages based the
customer configuration.
Build applications and services
without thinking of servers.
Don’t be concerned about
provisioning, scaling, and
maintaining servers for fault
tolerance and availability.
AWS does all of this for you.
Real-time Analytics
Collect, process, analyze, and visualize insights in real-
time with AWS.
Easy add On-Demand Analytics
Real-time
On-
Demand
Building a Data Strategy on AWS
Kinesis Firehose
1
2
3
4
5
6
Athena
Query Service
7
8
Glue
Batch
9
10
AWS
IoT
Glue ETL
Petabytes to Exabytes…
§ AWS Snowball
§ AWS Snowball Edge
§ AWS Snowmobile
Processing Data for Analytics
on your data lake
Processing	&	Analytics
Transactional	&	
RDBMS
DynamoDB
NoSQL DB Relational Database
Aurora
BI	&	Data	Visualization
Kinesis Streams
& Firehose
Batch
EMR
Hadoop, Spark,
Presto
Redshift
Data Warehouse
Athena
Query Service
AWS Batch
Predictive
Real-time
AWS Lambda
Apache Storm
on EMR
Apache Flink
on EMR
Spark Streaming
on EMR
Elasticsearch
Service
Kinesis Analytics,
Kinesis Streams
EastiCache DAX
Amazon AI
“Inside	AWS,	we’re	excited	to	lower	the	costs	
and	barriers	to	machine	learning	and	AI	so	
organizations	of	all	sizes	can	take	advantage	of	
these	advanced	techniques”
Amazon AI
What does the customer say?
https://aws.amazon.com/solutions/case-studies/analytics/
https://aws.amazon.com/solutions/case-studies/big-data/
Empowering Smart Cities and IoT
Royal National Institute of Blind People creates and
distributes accessible information in the form of
synthesized content
Amazon Polly delivers
incredibly lifelike voices
which captivate and engage
our readers.
John Worsfold
Solutions Implementation Manager, RNIB
”
“ • RNIB delivers largest library of
audiobooks in the UK for nearly 2
million people with sight loss
• Naturalness of generated speech is
critical to captivate and engage readers
• No restrictions on speech
redistributions enables RNIB to create
and distribute accessible information in
a form of synthesized content
RNIB provides the largest library in the UK for people with sight loss
JustGiving Supports 24 Million Users on Charity Site Using AWS
• Needed a new platform to support general
operations and new analytics service
• Moved to AWS, using a wide range of services
• Can scale system faster in response to
unanticipated spikes in traffic
• Receives query results in seconds compared
to 30 minutes under old system
• Obtains deeper insights into billions of data
points, using information to deliver better
services
Using the new AWS tools, we can
extract much finer-grained data
points based on millions of
donations and billions of visits, and
then use that information to provide
a better platform for our visitors.
Richard Atkinson
Chief Information Officer, JustGiving
”
“
JustGiving is a major online platform that supports
charitable giving. The organization is based in London.
We have an analysis kit we run
every day, looking at data,
comparing patterns over
previous years’ information,
and in a matter of seconds, we
can tell if a student is likely to
succeed or fail. The results
have been phenomenal
Lige Hensley
CTO
Ivy Tech
”
“
Ivy Tech – Improving the classroom and beyond…
Predicting Student Success:
Identify	behaviors	of	successful	and	unsuccessful	students.
We	can	now	predict,	with	over	80%	accuracy,	which	students	are	
likely	to	fail	a	course	within	the	first	two	weeks	of	a	16-week	term.
Improvements across the University:
Struggling	faculty	can	be	identifies	and	addressed	well	before	they	
have	an	impact	on	students
Using	the	same	behavior	analysis	techniques,	we	can	flag	
fraudulent	activity	far	earlier	than	ever	before.
Using	Natural	Language	Understanding	tools	to	analyze	course	
evaluations.
FINRA: Monitor and enforce trading regulations
FINRA handles approximately
75 billion market events every
day to build a holistic picture of
trading in the U.S. Hundreds of
surveillance algorithms against
massive amounts of data.
FINRA mission
§ Deter misconduct by enforcing the rules.
§ Detect and prevent wrongdoing in US markets
§ Discipline those who break the rules
Scale brings unique challenges
§ Market volumes are volatile and increasing
§ Exchanges are dynamically evolving
§ Regulatory rules are created and enhanced
§ New securities products are introduced
§ Market manipulators innovate
Fraud Detection
FINRA	uses	Amazon	EMR	and	Amazon	S3	to	process	up	to	75	billion	
trading	events	per	day	and	securely	store	over	5	petabytes	of	data,	
attaining	savings	of	$10-20MM	per	year.
Met Office Uses AWS to Deliver Tailored Meteorological Data
The Met Office has been a widely respected
national weather service in the United Kingdom
for 160 years.
“We are using the AWS
Cloud to drive the mass-
market availability of
customizable weather
information.
• Needed the means to send weather data to device
users and third-party customers.
• Deployed Amazon ElastiCache to respond to peak
demands.
• Attracted more than half a million users with its app.
• Scaled data storage tenfold and reduced solution
costs by 50 percent.
• Enabled innovation of big data services in a
competitive landscape.
James Tomkins
Head of Enterprise IT Architecture
Met Office
”
“
Please welcome
Jacob Tomlinson
Met Office
Big Data in the
Met Office
Informatics Lab
Jacob Tomlinson
Met Office Informatics Lab
@_jacobtomlinson
Laziness
Iris
What does the Met Office do with this?
What can we do together?
Value for money
Spot pricing
Storytime
Packing workloads
Vertical scaling
Horizontal scaling
Spot fleet scaling
Statelessness
CC-BY-2.0 johnjack https://flic.kr/p/6fNsTY
External storage
Distributed queues
Conclusions
• Prescriptive	guidance	and	rapidly	deployable	solutions	to	help	
you	store,	analyze,	and	process	big	data	on	the	AWS	Cloud
• Derive	Insights	from	IoT in	Minutes	using	AWS	IoT,	Amazon	
Kinesis	Firehose,	Amazon	Athena,	and	Amazon	QuickSight
• Deploying	a	Data	Lake	on	AWS	- March	2017	AWS	Online	Tech	
Talks
• Harmonize,	Search,	and	Analyze	Loosely	Coupled	Datasets	on	
AWS	with	Glue,	Athena	and	QuickSight
• From	Data	Lake	to	Data	Warehouse:	Enhancing	Customer	360	
with	Amazon	Redshift	Spectrum
• Implement	Continuous	Integration	and	Delivery	of	Apache	
Spark	Applications	using	AWS
http://amzn.to/2vHIwBq
http://amzn.to/2i9gqZn
http://bit.ly/2qipA8h
http://amzn.to/2qpiFaK
http://amzn.to/2lpbc8p
Takeaways
https://aws.amazon.com/blogs/big-data/
https://aws.amazon.com/answers/big-data/
http://amzn.to/2gIJcj8
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
THANK YOU!

More Related Content

What's hot

Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data Presentation
Matthew Urdan
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
Srinath Perera
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013
Sujit Ghosh
 

What's hot (20)

Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data Presentation
 
AI State of Play Dec 2016 NYC
AI State of Play Dec 2016 NYCAI State of Play Dec 2016 NYC
AI State of Play Dec 2016 NYC
 
Is big data just a buzzword -Big data simply explained
Is big data just a buzzword -Big data simply explainedIs big data just a buzzword -Big data simply explained
Is big data just a buzzword -Big data simply explained
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Artificial Intelligence and Current State of It
Artificial Intelligence and Current State of ItArtificial Intelligence and Current State of It
Artificial Intelligence and Current State of It
 
AI Predictions 2017
AI Predictions 2017AI Predictions 2017
AI Predictions 2017
 
Solve for X with AI: a VC view of the Machine Learning & AI landscape
Solve for X with AI: a VC view of the Machine Learning & AI landscapeSolve for X with AI: a VC view of the Machine Learning & AI landscape
Solve for X with AI: a VC view of the Machine Learning & AI landscape
 
Business Transformation and Strategy for Large Companies in the Age of AI - P...
Business Transformation and Strategy for Large Companies in the Age of AI - P...Business Transformation and Strategy for Large Companies in the Age of AI - P...
Business Transformation and Strategy for Large Companies in the Age of AI - P...
 
Digital-Warriors-Marketing Roadmap with Big Data Analytics
Digital-Warriors-Marketing Roadmap with Big Data AnalyticsDigital-Warriors-Marketing Roadmap with Big Data Analytics
Digital-Warriors-Marketing Roadmap with Big Data Analytics
 
Big data Introduction by Mohan
Big data Introduction by MohanBig data Introduction by Mohan
Big data Introduction by Mohan
 
IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...
IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...
IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...
 
Big Data Predictions ebook
Big Data Predictions ebookBig Data Predictions ebook
Big Data Predictions ebook
 
Building an AI Startup: Realities & Tactics
Building an AI Startup: Realities & TacticsBuilding an AI Startup: Realities & Tactics
Building an AI Startup: Realities & Tactics
 
Big Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future FoundationBig Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future Foundation
 
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
 
Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013Big Data Presentation at SCQAA-SF on June 12 2013
Big Data Presentation at SCQAA-SF on June 12 2013
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AI
 
12 Interesting Facts about Big Data
12 Interesting Facts about Big Data12 Interesting Facts about Big Data
12 Interesting Facts about Big Data
 
Transforming a Business Through Analytics
Transforming a Business Through AnalyticsTransforming a Business Through Analytics
Transforming a Business Through Analytics
 
Kai-Fu Lee at AI Frontiers : The Era of Artificial Intelligence
Kai-Fu Lee at AI Frontiers : The Era of Artificial IntelligenceKai-Fu Lee at AI Frontiers : The Era of Artificial Intelligence
Kai-Fu Lee at AI Frontiers : The Era of Artificial Intelligence
 

Viewers also liked

(Slides) Task scheduling algorithm for multicore processor system for minimiz...
(Slides) Task scheduling algorithm for multicore processor system for minimiz...(Slides) Task scheduling algorithm for multicore processor system for minimiz...
(Slides) Task scheduling algorithm for multicore processor system for minimiz...
Naoki Shibata
 

Viewers also liked (20)

電子商務資料分析 上課投影片
電子商務資料分析 上課投影片電子商務資料分析 上課投影片
電子商務資料分析 上課投影片
 
Introduction to AWS Glue
Introduction to AWS GlueIntroduction to AWS Glue
Introduction to AWS Glue
 
初探AWS 平台上的 NoSQL 雲端資料庫服務
初探AWS 平台上的 NoSQL 雲端資料庫服務初探AWS 平台上的 NoSQL 雲端資料庫服務
初探AWS 平台上的 NoSQL 雲端資料庫服務
 
大數據運算媒體業案例分享 (Big Data Compute Case Sharing for Media Industry)
大數據運算媒體業案例分享 (Big Data Compute Case Sharing for Media Industry)大數據運算媒體業案例分享 (Big Data Compute Case Sharing for Media Industry)
大數據運算媒體業案例分享 (Big Data Compute Case Sharing for Media Industry)
 
(Slides) Task scheduling algorithm for multicore processor system for minimiz...
(Slides) Task scheduling algorithm for multicore processor system for minimiz...(Slides) Task scheduling algorithm for multicore processor system for minimiz...
(Slides) Task scheduling algorithm for multicore processor system for minimiz...
 
Cephfs架构解读和测试分析
Cephfs架构解读和测试分析Cephfs架构解读和测试分析
Cephfs架构解读和测试分析
 
BDA303 Serverless big data architectures: Design patterns and best practices
BDA303 Serverless big data architectures: Design patterns and best practicesBDA303 Serverless big data architectures: Design patterns and best practices
BDA303 Serverless big data architectures: Design patterns and best practices
 
管理程式對AWS LAMBDA持續交付
管理程式對AWS LAMBDA持續交付管理程式對AWS LAMBDA持續交付
管理程式對AWS LAMBDA持續交付
 
Connecting Your Data Analytics Pipeline
Connecting Your Data Analytics PipelineConnecting Your Data Analytics Pipeline
Connecting Your Data Analytics Pipeline
 
GLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid ComputingGLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid Computing
 
可靠分布式系统基础 Paxos的直观解释
可靠分布式系统基础 Paxos的直观解释可靠分布式系统基础 Paxos的直观解释
可靠分布式系统基础 Paxos的直观解释
 
Amazon Athena Capabilities and Use Cases Overview
Amazon Athena Capabilities and Use Cases Overview Amazon Athena Capabilities and Use Cases Overview
Amazon Athena Capabilities and Use Cases Overview
 
Tackle Your Dark Data Challenge with AWS Glue - AWS Online Tech Talks
Tackle Your Dark Data  Challenge with AWS Glue - AWS Online Tech TalksTackle Your Dark Data  Challenge with AWS Glue - AWS Online Tech Talks
Tackle Your Dark Data Challenge with AWS Glue - AWS Online Tech Talks
 
The Power of Big Data - AWS Summit Bahrain 2017
The Power of Big Data - AWS Summit Bahrain 2017The Power of Big Data - AWS Summit Bahrain 2017
The Power of Big Data - AWS Summit Bahrain 2017
 
淺談系統監控與 AWS CloudWatch 的應用
淺談系統監控與 AWS CloudWatch 的應用淺談系統監控與 AWS CloudWatch 的應用
淺談系統監控與 AWS CloudWatch 的應用
 
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
如何利用 Amazon EMR 及Athena 打造高成本效益的大數據環境
 
Introduction to YARN and MapReduce 2
Introduction to YARN and MapReduce 2Introduction to YARN and MapReduce 2
Introduction to YARN and MapReduce 2
 
The Impala Cookbook
The Impala CookbookThe Impala Cookbook
The Impala Cookbook
 
唯品会大数据实践 Sacc pub
唯品会大数据实践 Sacc pub唯品会大数据实践 Sacc pub
唯品会大数据实践 Sacc pub
 
Hadoop, the Apple of Our Eyes (這些年,我們一起追的 Hadoop)
Hadoop, the Apple of Our Eyes (這些年,我們一起追的 Hadoop)Hadoop, the Apple of Our Eyes (這些年,我們一起追的 Hadoop)
Hadoop, the Apple of Our Eyes (這些年,我們一起追的 Hadoop)
 

Similar to The Power of Big Data - Transformation Day Public Sector London 2017

Similar to The Power of Big Data - Transformation Day Public Sector London 2017 (20)

The Rise of People Analytics
The Rise of People AnalyticsThe Rise of People Analytics
The Rise of People Analytics
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
 
Data mining with big data implementation
Data mining with big data implementationData mining with big data implementation
Data mining with big data implementation
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
 
IBM Solutions Connect 2013 IT Day Keynote
IBM Solutions Connect 2013 IT Day KeynoteIBM Solutions Connect 2013 IT Day Keynote
IBM Solutions Connect 2013 IT Day Keynote
 
Eric van Tol - Businesscases & Verdienmodellen
Eric van Tol - Businesscases & VerdienmodellenEric van Tol - Businesscases & Verdienmodellen
Eric van Tol - Businesscases & Verdienmodellen
 
Search++: Cognitive transformation of human-system interaction: Presented by ...
Search++: Cognitive transformation of human-system interaction: Presented by ...Search++: Cognitive transformation of human-system interaction: Presented by ...
Search++: Cognitive transformation of human-system interaction: Presented by ...
 
AWSome Day 2019 Keynote
AWSome Day 2019 KeynoteAWSome Day 2019 Keynote
AWSome Day 2019 Keynote
 
Mark Phillips: General Session
Mark Phillips: General SessionMark Phillips: General Session
Mark Phillips: General Session
 
Video AI for Media and Entertainment Industry
Video AI for Media and Entertainment IndustryVideo AI for Media and Entertainment Industry
Video AI for Media and Entertainment Industry
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
 
AWS Financial Services Cloud Symposium | Hong Kong - Keynote
AWS Financial Services Cloud Symposium | Hong Kong - KeynoteAWS Financial Services Cloud Symposium | Hong Kong - Keynote
AWS Financial Services Cloud Symposium | Hong Kong - Keynote
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
IBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM WatsonIBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM Watson
 
SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...
SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...
SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...
 
What is Big Data
What is Big Data What is Big Data
What is Big Data
 
Big Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightBig Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to Foresight
 
Intro to Artificial Intelligence w/ Target's Director of PM
 Intro to Artificial Intelligence w/ Target's Director of PM Intro to Artificial Intelligence w/ Target's Director of PM
Intro to Artificial Intelligence w/ Target's Director of PM
 
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air FranceQu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
 

More from Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

The Power of Big Data - Transformation Day Public Sector London 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. THE POWER OF BIG DATA Ben Snively, Solutions Architect, Amazon Web Services Jacob Tomlinson, Lead Engineer, Met Office Informatics Lab
  • 2. Why Big Data? Rapid Data Growth 1.2 ZB in 2015 44 ZB by 2020 180 ZB by 2025 Data Generation • Speed of the data. G B T B PB ZB E B Source: IDC, 2015 IoT Social Media Enterprise Systems Data Variety • 80 billion devices by 2025 • 500 million tweets daily Volume Velocity Variety
  • 3. There will be more words written on Twitter than contained in all books ever printed. Source: The Huffington Post, Is Twitter Bad For Language? Statistical Analysis Says No (New Book from Christian Rudder, Dataclysm: Who We Are)In 2014…
  • 4. Artificial Intelligence What is Big Data Powering on AWS?
  • 5. The Modern Data Architecture … in Public Sector
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Storage & Streams Catalogue & Search Entitlements API & UI Attributes of a Modern Data Architecture Key Pillars of a Data Lake Key Components of a Successful Data Strategy
  • 21.
  • 22. Putting it all together
  • 23. One tool to rule them all
  • 24. Different Types of Services… Virtualized Managed Serverless You can easily provision servers and focus on OS and above. You focus higher in the stack but still need to consider servers, how much CPU is needed, and how much RAM. AWS manages based the customer configuration. Build applications and services without thinking of servers. Don’t be concerned about provisioning, scaling, and maintaining servers for fault tolerance and availability. AWS does all of this for you.
  • 25. Real-time Analytics Collect, process, analyze, and visualize insights in real- time with AWS.
  • 26. Easy add On-Demand Analytics Real-time On- Demand
  • 27. Building a Data Strategy on AWS Kinesis Firehose 1 2 3 4 5 6 Athena Query Service 7 8 Glue Batch 9 10 AWS IoT Glue ETL
  • 28. Petabytes to Exabytes… § AWS Snowball § AWS Snowball Edge § AWS Snowmobile
  • 29. Processing Data for Analytics on your data lake
  • 30. Processing & Analytics Transactional & RDBMS DynamoDB NoSQL DB Relational Database Aurora BI & Data Visualization Kinesis Streams & Firehose Batch EMR Hadoop, Spark, Presto Redshift Data Warehouse Athena Query Service AWS Batch Predictive Real-time AWS Lambda Apache Storm on EMR Apache Flink on EMR Spark Streaming on EMR Elasticsearch Service Kinesis Analytics, Kinesis Streams EastiCache DAX
  • 33. What does the customer say? https://aws.amazon.com/solutions/case-studies/analytics/ https://aws.amazon.com/solutions/case-studies/big-data/
  • 35. Royal National Institute of Blind People creates and distributes accessible information in the form of synthesized content Amazon Polly delivers incredibly lifelike voices which captivate and engage our readers. John Worsfold Solutions Implementation Manager, RNIB ” “ • RNIB delivers largest library of audiobooks in the UK for nearly 2 million people with sight loss • Naturalness of generated speech is critical to captivate and engage readers • No restrictions on speech redistributions enables RNIB to create and distribute accessible information in a form of synthesized content RNIB provides the largest library in the UK for people with sight loss
  • 36. JustGiving Supports 24 Million Users on Charity Site Using AWS • Needed a new platform to support general operations and new analytics service • Moved to AWS, using a wide range of services • Can scale system faster in response to unanticipated spikes in traffic • Receives query results in seconds compared to 30 minutes under old system • Obtains deeper insights into billions of data points, using information to deliver better services Using the new AWS tools, we can extract much finer-grained data points based on millions of donations and billions of visits, and then use that information to provide a better platform for our visitors. Richard Atkinson Chief Information Officer, JustGiving ” “ JustGiving is a major online platform that supports charitable giving. The organization is based in London.
  • 37. We have an analysis kit we run every day, looking at data, comparing patterns over previous years’ information, and in a matter of seconds, we can tell if a student is likely to succeed or fail. The results have been phenomenal Lige Hensley CTO Ivy Tech ” “ Ivy Tech – Improving the classroom and beyond… Predicting Student Success: Identify behaviors of successful and unsuccessful students. We can now predict, with over 80% accuracy, which students are likely to fail a course within the first two weeks of a 16-week term. Improvements across the University: Struggling faculty can be identifies and addressed well before they have an impact on students Using the same behavior analysis techniques, we can flag fraudulent activity far earlier than ever before. Using Natural Language Understanding tools to analyze course evaluations.
  • 38. FINRA: Monitor and enforce trading regulations FINRA handles approximately 75 billion market events every day to build a holistic picture of trading in the U.S. Hundreds of surveillance algorithms against massive amounts of data. FINRA mission § Deter misconduct by enforcing the rules. § Detect and prevent wrongdoing in US markets § Discipline those who break the rules Scale brings unique challenges § Market volumes are volatile and increasing § Exchanges are dynamically evolving § Regulatory rules are created and enhanced § New securities products are introduced § Market manipulators innovate
  • 40. Met Office Uses AWS to Deliver Tailored Meteorological Data The Met Office has been a widely respected national weather service in the United Kingdom for 160 years. “We are using the AWS Cloud to drive the mass- market availability of customizable weather information. • Needed the means to send weather data to device users and third-party customers. • Deployed Amazon ElastiCache to respond to peak demands. • Attracted more than half a million users with its app. • Scaled data storage tenfold and reduced solution costs by 50 percent. • Enabled innovation of big data services in a competitive landscape. James Tomkins Head of Enterprise IT Architecture Met Office ” “
  • 42. Big Data in the Met Office Informatics Lab Jacob Tomlinson Met Office Informatics Lab @_jacobtomlinson
  • 43.
  • 44.
  • 46.
  • 47.
  • 48.
  • 49. Iris
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55. What does the Met Office do with this?
  • 56. What can we do together?
  • 60.
  • 70.
  • 71.
  • 72. • Prescriptive guidance and rapidly deployable solutions to help you store, analyze, and process big data on the AWS Cloud • Derive Insights from IoT in Minutes using AWS IoT, Amazon Kinesis Firehose, Amazon Athena, and Amazon QuickSight • Deploying a Data Lake on AWS - March 2017 AWS Online Tech Talks • Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS with Glue, Athena and QuickSight • From Data Lake to Data Warehouse: Enhancing Customer 360 with Amazon Redshift Spectrum • Implement Continuous Integration and Delivery of Apache Spark Applications using AWS http://amzn.to/2vHIwBq http://amzn.to/2i9gqZn http://bit.ly/2qipA8h http://amzn.to/2qpiFaK http://amzn.to/2lpbc8p Takeaways https://aws.amazon.com/blogs/big-data/ https://aws.amazon.com/answers/big-data/ http://amzn.to/2gIJcj8
  • 73. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. THANK YOU!