Businesses are generating more data than ever before.
Doing real-time data analytics requires IT infrastructure that often needs to be scaled up quickly, and running an on-premises environment in this setting has its limitations.
Organisations often require a massive amount of IT resources to analyse their data and the upfront capital cost can deter them from embarking on these projects.
What’s needed is scalable, agile and secure cloud-based infrastructure at the lowest possible cost so they can spin up servers that support their data analysis projects exactly when they are required. This infrastructure must enable them to create proof-of-concepts quickly and cheaply – to fail fast and move on.
2. What is Big Data?
Volume: quantum of data (TB to PB of data)
Velocity: speed of data (millisecond latency)
Variety: types of data (hundreds of data sources)
Veracity: quality of data (varies greatly; affects accuracy of analysis)
Value: business relevance (how does it help the business?)
25+ TB of data being generated per second globally
90+% of the world's data created in the last 2 years
90+% of data generated is unstructured
3. Evolution of Big Data Processing
The chart plots speed of analysis (batch to real-time) against the type of analytics it enables:
Descriptive (batch): what & why it happened. Dashboards; traditional query & reporting
Descriptive (real-time): it is happening! Alerts, analysis & detection; what is going wrong, fraudulent use
Predictive: probability of 'x' happening. Prediction engines; inventory forecasting, cross-sell analysis
Prescriptive: what to do if 'x' happens. Recommendation engines; routes, content recommendations
4. Big Data was built for the Cloud
Big Data: potentially massive datasets. AWS Cloud: massive, virtually unlimited capacity.
Big Data: iterative, experimental style of data manipulation & analysis. AWS Cloud: on-demand infrastructure allows iterative, experimental deployment/usage.
Big Data: frequently not a steady-state workload; peaks & valleys. AWS Cloud: most efficient with highly variable workloads.
Big Data: variety & velocity of data make tool management complex. AWS Cloud: fully managed tools & services for structured & unstructured, batch & stream data.
5. Broad, Tightly Integrated Capabilities
AWS provides the broadest platform for big data analytics today. A typical pipeline runs Data → Ingest/Collect → Store → Process & Analyze → Consume/Visualize → Answers & Insights. Start here with a business case, and measure the pipeline on time to answer (latency), throughput and cost.
Ingest/Collect: Amazon Kinesis Firehose (real-time); AWS Import/Export Snowball (data import); AWS Direct Connect (data connect); AWS Storage Gateway (storage gateway); AWS Database Migration Service (database migration)
Store: Amazon S3 (object storage); Amazon Kinesis Streams (real-time); Amazon RDS (relational databases); Amazon DynamoDB (NoSQL databases)
Process & Analyze: Amazon EMR (distributed: Hadoop, Spark, etc.); AWS Lambda & Amazon Kinesis Analytics (real-time); Amazon Redshift (data warehousing); Amazon Machine Learning; Amazon Elasticsearch Service
Consume/Visualize: Amazon QuickSight (BI & data visualization)
6. Amazon Redshift
Fast, fully managed, petabyte-scale data warehouse
• 10X better performance than traditional DBs
• Less than one tenth the cost of traditional solutions
• Simple and fully managed
• Flexible & Scalable: Easily change number or type of nodes
• ANSI SQL Compatible: Use familiar SQL clients/BI tools
• Secure: Encryption, network isolation, audit & compliance
• Ideal usage patterns: sales, historical, gaming, finance, marketing, ad, social data
[Architecture diagram: SQL clients/BI tools connect over JDBC/ODBC to a leader node, which coordinates compute nodes (each 128 GB RAM, 16 TB disk, 16 cores) over 10 GigE (HPC) networking; ingestion, backup and restore run against Amazon S3.]
7. Amazon EMR
Quickly and cost-effectively process vast amounts of data
• Largest cloud operator of Hadoop infrastructure
• Open source & MapR distributions
• Most current Hadoop distribution
• Flexible: decoupled compute & storage, select apps, resize clusters
• Simple : Launch a cluster in minutes, fully managed
• Scalable : Provision as much capacity as needed
• Multiple pricing options: On-Demand, Reserved Instances, Spot
• Typical use cases – Clickstream analysis, log processing, genomics
8. Amazon Kinesis
Easily work with real-time streaming data
Amazon Kinesis Streams
• Build custom apps to process or analyze streaming data
• Typical use cases – Log & event data collection, real-time analytics
Amazon Kinesis Firehose
• Easily load massive volumes of streaming data into S3, Redshift, AWS ES
• Typical use cases – Digital marketing, IoT, mobile data capture
Amazon Kinesis Analytics
• Easily analyze data streams using standard SQL queries
9. Amazon Elasticsearch
Fully managed service that makes it easy to set up, operate & scale Elasticsearch clusters in the cloud
• Easy set-up & configuration. Fully managed
• Flexible storage options
• Set-up for high availability
• Seamlessly scale
• Direct access to Elasticsearch APIs
• Support for the ELK stack. Built-in Kibana
• Integration with AWS IAM for controlling access to your domain
• Integration with Amazon CloudTrail for auditing
[Diagram: an Amazon Elasticsearch Service domain exposing the Elasticsearch API, fronted by Amazon Route 53 and Elastic Load Balancing and integrated with IAM, CloudWatch and AWS CloudTrail.]
10. Select Big Data & Analytics Customers
The vast majority of Big Data use cases deployed in the cloud today run on AWS
14. Businesses are literally drowning in data
2.5 quintillion bytes of data are created every day1
90% of the data in the world today was created in the last two years2
1.7 megabytes of new information will be created every second for every human on the planet by 20203
<0.5% of all data is currently being analysed and used
1-3. Source: https://www-01.ibm.com/software/data/bigdata/what-is-big-data.html
4. Source: https://www.technologyreview.com/business-report/big-data-gets-personal/download/?state=join#/join/
15. So what's the problem?
Internal systems can't cope: on-premises environments can't scale quickly enough for big data analytics projects to work well.
Cost is prohibitive: the high capital cost of upgrading server infrastructure deters organisations from embarking on projects.
Tools are outdated: data management architectures are complex and traditional data analytics tools are no longer suitable.
16. Why cloud for real-time analytics?
Drives scale: offers agile and secure cloud infrastructure, provided by AWS, at a low cost.
Provides clarity: makes it easy to forecast how much computing power is needed and ensures infrastructure is not under-utilised.
Empowers business: servers can be 'spun up' to support proofs of concept as required, enabling organisations to go to market faster and supporting a 'fail fast' culture.
18. Cloud advisory
Consulting approach to identify the suitability of a move to the cloud – examining current
apps, infrastructure tools, methods and readiness
Migration and deployment
Move web-based and ERP apps – including Oracle and SAP solutions – to the cloud
Cloud consulting services
19. DevOps
Continuous integration, deployment and release
management processes with Puppet Labs, Jenkins,
Capistrano, and ELK Stack
Managed services
Proactive monitoring of AWS infrastructure, SLA-based
resolution, 24x7 support, and account management
Cloud consulting services
20. Big data on cloud
Process data in real time using Amazon Kinesis, Apache Kafka, AWS Lambda and Hadoop
Data warehouse on cloud
Data warehouse design, management and reporting with Amazon Redshift, AWS Quicksight
and Tableau.
Cloud native app and product development
Provide microservices and event-driven architecture with tools like SQS and SNS, as sketched below
Analytics and product development services
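As a minimal sketch of what event-driven messaging with SNS and SQS can look like in practice (the topic and queue names below are illustrative, not from this deck, and a real setup also needs a queue access policy that allows SNS to deliver):

```python
# Hypothetical SNS fan-out to an SQS queue consumed by a worker service.
import json

import boto3

sns = boto3.client("sns", region_name="us-east-1")
sqs = boto3.client("sqs", region_name="us-east-1")

topic_arn = sns.create_topic(Name="order-events")["TopicArn"]
queue_url = sqs.create_queue(QueueName="order-worker")["QueueUrl"]
queue_arn = sqs.get_queue_attributes(
    QueueUrl=queue_url, AttributeNames=["QueueArn"]
)["Attributes"]["QueueArn"]

# Wire the queue to the topic (queue access policy omitted for brevity).
sns.subscribe(TopicArn=topic_arn, Protocol="sqs", Endpoint=queue_arn)

# A producer publishes once; every subscribed queue gets its own copy.
sns.publish(TopicArn=topic_arn, Message=json.dumps({"order_id": 42, "status": "created"}))
messages = sqs.receive_message(QueueUrl=queue_url, WaitTimeSeconds=5)
print(messages.get("Messages", []))
```

The design point is decoupling: the publisher never knows who consumes, so new services can subscribe without touching the producer.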
21. Cloud stream
Digital content, asset management, publishing workflow
and video on demand
Cloudlytics
Provides log and billing analytics, cloud automation and
monitoring
CloudScale
Load testing and resilience, and testing automation in the
cloud
BlazeNAS
A highly available and fault-tolerant storage solution
Our products and frameworks
22. Our in-house developed big data framework Cloudlytics 2.0 is an analytics engine that addresses
applications from different domains like infrastructure, application monitoring and IoT.
It gives organizations an edge over their competition by providing real-time insights which help reduce the
time to market for products and services.
Big data analytics engine
24. 5Abox is a software company building embedded solutions for the IoT world. It is focused on energy and
domotics gateways, and ‘VPN on request’ solutions.
Case study: 5Abox
Analyzing IoT data in real time
25. Case study: 5Abox
The problem/challenge:
Streaming real-time data
Complex transformation
Visualization
26. Case study: 5Abox
The solution:
[Diagram: weather and voltage fluctuation data flows from the IoT device, over the MQTT protocol, into Cloudlytics 2.0 for real-time transformation and visualization of the data.]
The BlazeClan solution gave the customer real-time insights into weather and voltage fluctuation data.
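As a minimal sketch of the device side of such a pipeline, using the paho-mqtt 1.x client; the broker endpoint, topic and payload fields are illustrative assumptions, since the case study does not specify them:

```python
# Hypothetical IoT device publishing voltage readings over MQTT.
import json
import time

import paho.mqtt.client as mqtt

client = mqtt.Client(client_id="voltage-sensor-01")  # paho-mqtt 1.x constructor
client.connect("ingest.example.com", 1883)           # placeholder ingestion endpoint
client.loop_start()

for _ in range(3):
    reading = {"device": "sensor-01", "voltage": 229.7, "ts": time.time()}
    client.publish("telemetry/voltage", json.dumps(reading), qos=1)
    time.sleep(5)

client.loop_stop()
client.disconnect()
```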
Data in the 21st century is like oil in the 18th century: an immensely valuable yet largely untapped asset. As with oil, there will be huge rewards for those who see data's fundamental value and learn to extract and use it
Big data is typically described in terms of the 3 Vs (the ever-increasing volume, variety & velocity of data), and recently two more have been added (value & veracity):
Value: Refers to the business relevance of the captured data i.e. how does it help the business ?
Veracity: Refers to the quality of captured data as it varies greatly. This is important as it affects the accuracy of the analysis
Variety: Refers to the nature of the captured data. You have a plethora of data sources today and hence a broad variety of data, be it log/streaming/IoT data or transactional data. Then you have, for example, file data with a fixed schema (CSV, Parquet, Avro) and file data which is schema-free (JSON, key-value). Then you have small files and large files, and I could go on
Velocity: Refers to the speed at which the data is generated and processed. Today, for real-time use cases, we are talking about millisecond latency. One million reads and writes per second is becoming the norm, for example, for customers in the digital advertising business
Volume: Refers to the quantity of data being generated and stored. The size of the data determines the value and potential insight, and whether it can actually be considered big data or not. Customers generating 100-150 TB a day is not very uncommon now
25+TB of data being generated per second globally
90+% of world’s data created in last 2 years
90+% of data generated is unstructured and hence needs some work before it can be meaningfully used
Now let's look at how big data processing is evolving
On the x-axis you have the speed of analysis, while on the y-axis you have the type of analytics you can derive from it
With batch analysis it's typically descriptive analytics. Descriptive analytics answers the questions what happened and why did it happen. It looks at past performance and understands that performance by mining historical data for the reasons behind past success or failure. Most management reporting, such as sales, marketing, operations, and finance, uses this type of post-mortem analysis. Good for dashboards, reports in response to queries, looking at trends, looking at outcomes, e.g. (i) a daily customer-preferences report from your web site's click stream: helps you decide how to optimize deals and what ad to try next time, (ii) daily fraud reports: was there fraud yesterday?
Then comes dealing with data in real time, which moves the question from what happened to what is happening. Great for real-time alerts (what is happening now, what is going wrong now), real-time analysis (what to offer the current customer now), and real-time spending caps (a transaction gets denied because it exceeds your balance, for example)
The next phase is predictive analytics. Predictive analytics answers the question what might happen. This is when historical performance data is combined with a variety of statistical, modeling, data mining, and machine learning techniques, and occasionally external data to determine the probable future outcome of an event or the likelihood of a situation occurring
The final phase is prescriptive analytics, which goes beyond predicting future outcomes by also suggesting actions to benefit from the predictions and showing the implications of each decision option. e.g. Think of a traffic navigation app. Pick an origin and a destination — a multitude of factors get mashed together, and it advises you on different route choices, each with a predicted ETA. This is everyday prescriptive analytics at work. Prescriptive analytics can continually take in new data to re-predict and re-prescribe, thus automatically improving prediction accuracy and prescribing better decision options. So prescriptive analytics provide intelligent recommendations for the optimal next steps for almost any application or business process to drive desired outcomes. So while predictive analytics forecasts what might happen in the future, prescriptive analytics can help alter the future
Example of a retailer that offers free expedited shipping to loyal customers.
Descriptive analysis would provide the trends on which this program was structured
Based on past customer behavior, a predictive model would assume that customers will keep the majority of what they purchase with this promotion. However, one customer purchases eight items of clothing but decides to keep only one.
The retailer paid for expedited shipping with the assumption that there's this great consumer out there who bought eight items, so they're willing to invest and lose a little margin on shipping. The algorithm didn't take return behavior into account.
For this retailer, reducing its losses on "outlier" customers who don't follow what predictive analytics forecasted means having policies in place to cover itself. Using prescriptive analytics, the retailer might come up with the options of giving an in-store-only coupon to customers who make returns (to encourage another purchase in which shipping isn't a factor) or notifying customers that they must pay for return shipping
Big Data was built for the cloud: if you aren't using the cloud for big data then either you aren't dealing with big data, or you are struggling or going to run into issues very soon. Let's understand why that's the case
With big data you are typically dealing with very large, or large and fast-growing, data sets, and with on-prem infrastructure you will run into capacity issues sooner rather than later. You have no such capacity issues with the cloud
With big data there are typically peaks and valleys and it's rarely a persistent volume, which creates challenges for on-prem infrastructure as you have to provision for peak load, which is highly inefficient. The cloud, in contrast, is most efficient with highly variable workloads
Given the variety and velocity of big data you will need a set of services & tools to manage it; managing them yourself is complex, while in the AWS cloud the same tools & services are fully managed
If you look at a typical big data pipeline, data comes in one side and answers/insights come out the other, with multiple stages in between: ingest, store, process & analyze, consume/visualize, with store and process repeating multiple times to shape the data into a format the end consuming application can consume at whatever rate or characteristic it demands. What goes on in between determines time to answer (pipeline latency), pipeline throughput = f(volume, request rate), and cost
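To make the "process & analyze" stage concrete, here is a minimal, illustrative AWS Lambda handler consuming records from a Kinesis stream; the payload fields and the 500 ms alerting threshold are assumptions for the sketch, not details from the deck:

```python
# Hypothetical Lambda handler triggered by a Kinesis stream.
import base64
import json

def handler(event, context):
    alerts = 0
    for record in event["Records"]:
        # Kinesis delivers each payload base64-encoded inside the event.
        payload = json.loads(base64.b64decode(record["kinesis"]["data"]))
        if payload.get("latency_ms", 0) > 500:   # assumed alerting rule
            alerts += 1
            print("ALERT slow request:", payload)
    return {"records": len(event["Records"]), "alerts": alerts}
```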
Before we get to the components that enable this, it's important to emphasize that it's imperative to start with understanding the use case, in other words the answers and insights that are required, why they are required and how they will help the business, before embarking on building out the solution and piecing together the elements to enable it. What's important is leveraging the data, not the technology stack. The technology exists today to make it all happen quickly, securely & cost efficiently!
Amazon Machine Learning is a service that makes it easy for developers of all skill levels to use machine learning technology. Amazon Machine Learning provides visualization tools and wizards that guide you through the process of creating machine learning (ML) models without having to learn complex ML algorithms and technology. Once your models are ready, Amazon Machine Learning makes it easy to obtain predictions for your application using simple APIs, without having to implement custom prediction generation code, or manage any infrastructure. Amazon Machine Learning is based on the same proven, highly scalable, ML technology used for years by Amazon’s internal data scientist community
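As a hedged sketch of the real-time prediction API mentioned above, assuming a model has already been trained (the model ID and feature names below are placeholders):

```python
# Hypothetical real-time prediction request against Amazon Machine Learning.
import boto3

ml = boto3.client("machinelearning", region_name="us-east-1")
response = ml.predict(
    MLModelId="ml-EXAMPLEMODEL",              # placeholder model ID
    Record={"age": "34", "plan": "premium"},  # feature values are passed as strings
    PredictEndpoint="https://realtime.machinelearning.us-east-1.amazonaws.com",
)
print(response["Prediction"])
```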
And then we have a new addition to the analytics portfolio by way of Amazon QuickSight, our very fast, easy-to-use, cloud-powered business intelligence service at 1/10th the cost of traditional BI solutions, at $9/user/month. Amazon QuickSight is currently in preview
Fast, fully managed, petabyte-scale data warehouse
Fast - Optimized for data warehousing. Redshift has a massively parallel processing (MPP) architecture with columnar storage, data compression and 10GigE networking between nodes for up to 10x better performance than traditional relational, row-based databases
Cheap - No upfront costs, pay only for the resources you provision. Start small at $0.25 per hour and scale to over a PB at $935 per TB per year, less than a tenth of most other data warehousing solutions
Simple – Get started in minutes with a few clicks or a simple API call. Fully managed and fault tolerant. Easy to set up, operate and scale. We take care of provisioning, installation, monitoring, backup, restore and patching
Scalable – With a few clicks via the Console or a simple API call, you can change the type or number of nodes as your performance or capacity needs change. While resizing, your cluster still runs in read-only mode
ANSI SQL Compliant – Uses standard JDBC and ODBC drivers, allowing you to use a wide range of familiar SQL clients/BI tools
Secure – You can encrypt data at rest and in transit using hardware-accelerated AES-256 and SSL, isolate your clusters using Amazon VPC and even manage your keys using hardware security modules (HSMs). Compliant with SOC1, SOC2 & SOC3, FedRAMP, HIPAA and PCI DSS Level 1
Durability and availability: replication, backup, automated recovery from failed drives & nodes
Interfaces: JDBC/ODBC interface with BI/ETL tools; load data from Amazon S3 or DynamoDB (see the sketch below)
Cost model: no upfront costs or long-term commitments; free backup storage equivalent to 100% of provisioned storage
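Because Redshift speaks the PostgreSQL wire protocol, a load-and-query round trip can be sketched with any Postgres driver; the cluster endpoint, credentials, table and bucket names below are all placeholders:

```python
# Hypothetical COPY-from-S3 bulk load followed by a plain SQL aggregate.
import psycopg2

conn = psycopg2.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",  # placeholder endpoint
    port=5439, dbname="analytics", user="admin", password="***",
)
with conn, conn.cursor() as cur:
    # Bulk loading is done with COPY straight from S3, not row-by-row INSERTs.
    cur.execute("""
        COPY sales FROM 's3://my-bucket/sales/'
        IAM_ROLE 'arn:aws:iam::123456789012:role/RedshiftCopyRole'
        FORMAT AS CSV;
    """)
    cur.execute("SELECT region, SUM(amount) FROM sales GROUP BY region;")
    for region, total in cur.fetchall():
        print(region, total)
```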
Amazon Elastic MapReduce (EMR) simplifies big data processing by providing a managed Hadoop framework that makes it easy, fast, and cost-effective for you to distribute and process vast amounts of your data across dynamically scalable Amazon EC2 instances
You can also run other popular distributed frameworks such as Apache Spark and Presto or any other application in the Apache Hadoop stack in Amazon EMR, and interact with data in other AWS data stores such as Amazon S3 and Amazon DynamoDB
A little-trumpeted fact: EMR is the largest cloud operator of Hadoop infrastructure, having spun up tens of millions of clusters for customers since 2009
EMR supports the open source & MapR distributions and has the most current Hadoop distribution in the market today, with the current versions of the most popular Hadoop apps
Fully managed and hence simple, allowing you to launch a cluster in minutes while EMR takes care of provisioning, set-up, configuration, tuning and monitoring
Extremely flexible as we have decoupled compute & storage (which also provides a very significant cost benefit), you can select the apps you need as also easily resize a running cluster
Elastic as you can provision one, hundreds or thousands of instances to process data at any scale
Typical use cases – Clickstream analysis, log processing, genomics
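As a minimal sketch of launching a Spark-enabled cluster through boto3; the cluster name, release label, instance types and counts are illustrative assumptions:

```python
# Hypothetical EMR cluster launch with Spark installed.
import boto3

emr = boto3.client("emr", region_name="us-east-1")
cluster = emr.run_job_flow(
    Name="clickstream-analysis",
    ReleaseLabel="emr-5.0.0",                 # assumed release
    Applications=[{"Name": "Spark"}],
    Instances={
        "InstanceGroups": [
            {"InstanceRole": "MASTER", "InstanceType": "m4.large", "InstanceCount": 1},
            {"InstanceRole": "CORE", "InstanceType": "m4.large", "InstanceCount": 4},
        ],
        "KeepJobFlowAliveWhenNoSteps": True,  # keep cluster up for interactive work
    },
    JobFlowRole="EMR_EC2_DefaultRole",
    ServiceRole="EMR_DefaultRole",
)
print(cluster["JobFlowId"])
```

Resizing a running cluster later is a single API call against the instance groups, which is what "easily resize" amounts to in practice.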
Amazon Kinesis services make it easy to work with real-time streaming data. Let's look at the components and their functionalities
Amazon Kinesis Streams enables you to build custom applications that process or analyze streaming data for specialized needs. Amazon Kinesis Streams can continuously capture and store terabytes of data per hour from hundreds of thousands of sources such as website clickstreams, financial transactions, social media feeds, IT logs, and location-tracking events. With Amazon Kinesis Client Library (KCL), you can build Amazon Kinesis Applications and use streaming data to power real-time dashboards, generate alerts, implement dynamic pricing and advertising, and more
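As a minimal sketch of the producer side (the stream name and record shape are assumptions, not from the deck):

```python
# Hypothetical Kinesis Streams producer pushing one clickstream event.
import json
import time

import boto3

kinesis = boto3.client("kinesis", region_name="us-east-1")

event = {"user_id": "u-123", "page": "/checkout", "ts": time.time()}
kinesis.put_record(
    StreamName="clickstream",        # assumed stream, created beforehand
    Data=json.dumps(event),
    PartitionKey=event["user_id"],   # same key -> same shard, preserving per-user order
)
```

A consumer built with the KCL would then read these records shard by shard to drive dashboards or alerts.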
Next is Amazon Kinesis Firehose which is the easiest way to load streaming data into AWS. It can capture and automatically load streaming data into Amazon S3, Amazon Redshift, and Amazon Elasticsearch Service, enabling near real-time analytics with existing business intelligence tools and dashboards you’re already using today
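A similarly hedged sketch of the Firehose side, assuming a delivery stream named clicks-to-s3 has already been configured to point at an S3 bucket:

```python
# Hypothetical Firehose producer; Firehose buffers and delivers to S3/Redshift/ES.
import json

import boto3

firehose = boto3.client("firehose", region_name="us-east-1")
firehose.put_record(
    DeliveryStreamName="clicks-to-s3",                      # assumed delivery stream
    Record={"Data": json.dumps({"page": "/home"}) + "\n"},  # newline-delimited JSON
)
```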
And then you have Amazon Kinesis Analytics which allows you to easily analyze data streams using standard SQL queries
Easy set-up & configuration: create domains via console, SDK or CLI; specify instance types, number of instances & storage options; modify or delete existing domains at any time
Fully managed: addresses time-consuming management tasks; ensures high availability, patch management and backups; monitors the cluster and replaces nodes as required
Flexible storage options: choose between local on-instance storage or Amazon EBS volumes to store your Elasticsearch indices; specify the size and type of the Amazon EBS volume; modify the storage options after domain creation as needed
Set up for high availability: Zone Awareness distributes the instances supporting the domain across two different AZs; with replicas enabled, instances are automatically distributed to deliver cross-zone replication (see the sketch below)
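A hedged sketch of domain creation with boto3, showing the zone-awareness and EBS options described above; the domain name, instance type and sizes are illustrative:

```python
# Hypothetical Amazon Elasticsearch Service domain with zone awareness.
import boto3

es = boto3.client("es", region_name="us-east-1")
domain = es.create_elasticsearch_domain(
    DomainName="logs",
    ElasticsearchClusterConfig={
        "InstanceType": "m4.large.elasticsearch",
        "InstanceCount": 4,              # spread across two AZs when zone aware
        "ZoneAwarenessEnabled": True,
    },
    EBSOptions={"EBSEnabled": True, "VolumeType": "gp2", "VolumeSize": 100},
)
print(domain["DomainStatus"]["ARN"])
```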
Here is a select set of referenceable customers using our analytics services
The vast majority of Big Data use cases deployed in the cloud today run on AWS
We now have a large and growing user base in India too