In this session, we will describe the key elements of a Dell EMC Isilon Data Lake and its key advantages including reduced IT costs, simplified management, increased operational flexibility and in-place data analytics. Dell EMC products to be featured include Isilon, ECS and Virtustream.
In this session, you will learn about the Dell EMC Isilon Data Lake and its advantages including:
• Data consolidation and increased efficiency to lower capital costs
• Streamlined management to reduce operating costs
• Improved operational flexibility and scalability to meet growing storage requirements
• Simple integration with a choice of public or private cloud storage providers
• Powerful in-place data analytics that accelerate time to insight while eliminating the need for a separate analytics storage infrastructure.
You will also hear how this solution can be easily extended to include data from remote and branch office locations with an efficient software defined storage solution.
5. 5
Collect, store, analyze and use
Traditional and emerging sources
Social Networks,
User Generated Content
Public records
Location DataInternet Of Things
Emerging
Enterprise File Data
Machine Data
Traditional
Video Archive
6. 6
CURRENT AND FUTURE CHALLENGES
Surveillance
Next-Gen Application
Hadoop & Analytics
Transaction
Logs
BLOBSSync & Share
Content
Shares
Marketing M&E
Social & Next-Gen
Archive &
Backup Target
Data Monetization
Design, Test
& Manufacture
Application Test
Home Directories & File Shares
6
DATA SILOS MULTIPLYING PLATFORM 3 INTEGRATION
12. 12
Isilon scales from
16TB to 68PB
in a single file system,
single volume cluster
Under 60 seconds to
scale with no downtime
Migration-less replacement of
old nodes
Isilon is massively scalable
More scalable than traditional storage
13. 13
Industry leading NAS efficiency
Traditional NAS: 50-60%
Isilon: 80+%
STORAGE CAPACITY
UTILIZATION
FINANCIAL
EFFICIENCY
capacity
future
Isilon matches need
and financial timing
Utilization increases with scale
Space-efficient data protection
Single OneFS volume
No dedicated spares
Automated workload balance
55%
Traditional NAS
$0.55
$0.80+
14. 14
Enterprise grade software
DATA MANAGEMENTDATA MANAGEMENTDATA PROTECTION & EFFICIENCY
SnapshotIQ
Fast, Efficient Data Backup And Recovery
SyncIQ
Fast And Flexible Asynchronous Replication
For Disaster Recovery Protection
SmartConnect
Policy-based Client Failover With Load
Balancing
SmartLock
Policy-based Compliance and WORM Data
Protection
SmartDedupe
Data Deduplication to reduce storage
requirements and costs
SmartPools
Policy-based Automated Tiering
SmartQuotas
Quota Management And Thin Provisioning
InsightIQ
Performance Monitoring And Reporting To
Manage Storage Resources
CloudPools
Cloud-scale Capacity
15. 15
Isilon product family
Linear Scaling of Performance and Capacity
High Performance
Platform
Nearline
Platform
S-Series
Capacity
Performance
NL-Series
High Density
Platform
HD-SeriesHighly Versatile
Platform
X-Series
Internal Cloud
External Cloud
Software Defined
Your hardware
16. 16
Optimize with automated storage tiering
Single point of management
• Single file system/single volume
• Multiple performance tiers
Automatic data movement
• Policy-based tiering management
• Transparent reallocation
• NO application changes
Optimize storage resources
• Automatically match storage resources
with data requirements
• Eliminate data migration
• Isilon SmartPools
S-Series
Performance
HD-Series
High Density
X-Series
Throughput
NL-Series
Nearline
Performance
Capacity
17. 17
Isilon advantage for Hadoop
In-place analytics
• Native integration speeds time to insight
Enterprise data protection
• Simple, efficient data replication for DR
Lower costs
• Eliminates the need for dedicated Hadoop
infrastructure
Increase flexibility
• Simultaneous support for any Apache-
compliant Hadoop distribution
• Scale-out storage with native Hadoop integration
“EMC Isilon is indeed an easy to
operate, highly scalable and efficient
Enterprise Data Lake Platform (EDLP).
IDC validated that a shared storage
model based on the Data Lake can in
fact provide enterprise-grade service-
levels while performing better than
dedicated commodity off-the shelf
(COTS) storage for Hadoop workloads”
20. 20
CORE
CloudPools - seamless tiering of frozen data
CLOUD PROVIDER
HOT DATA
>30 days
WARM DATA
1-2 Months
FROZEN DATA
1-2 years
21. 21
CloudPools - seamless tiering of frozen data
APPS & USERS
Access time
SEAMLESS
CLOUD
INTEGRATION
CORE CLOUD PROVIDER
22. 22
PRIVATE
PUBLICHOSTED
CORE
DATACENTER
Data Lake Benefits
• Lower IT costs
• Seamless Cloud
advantage
• Flexible SDS
deployments
• Future proofed for
Emerging workloads
• Agility with automated
control
• Linear scalable
performance
EDGE
CLOUDSCALE-OUT FLASHSOFTWARE DEFINED
24. 24
Customer Adoption of All-flash is Accelerating
Data in the Data CenterProjection of Capacity Disk & Scale-
out Capacity NAND Flash
Unstructured
Data
20%
structured
80%
unstructured data
$1500
$1000
$500
$0
2015 2016 2017 2018 2019 2020
Cost/TB for Disk
Cost/TB for Flash
By 2020, Flash $/TB
reaches parity with
HDDs
4-yr Cost/TB
Analysis of Unstructured Data: Applications of Text Analytics and Sentiment Mining
SAS. Retrieved June 24, 2016.
25. 25
Unstructured Data Drives Final Frontier
Media
EDA
Big Data
Life
Sciences
25
Unstructured Data
Final Frontier
30. 30
SOFTWAREDell - Internal Use - Confidential30
ANY UNSTRUCTURED PROTOCOL
• Full, multi-protocol support
SMB, NFS, HDFS, Object, NDMP and more
• In-place Hadoop analytics
• Enable new workloads
• Eliminate silos of storage
31. 31
SECURITYDell - Internal Use - Confidential31
ENTERPRISE GRADE PROTECTION
& SECURITY
• Iron clad data protection,
Up to N+4 redundancy
• Snapshots, Replication
• NFS/SMB automatic failover & load
balancing
• Backup and recovery
• Secure Access, WORM, SEC 17a-4
32. 32
TIERING32
TIERING ECONOMICS = <$.50/GB
Extreme
PerformancetierStoragetier
• Reduce Capital
Expense
• Just Enough
Flash
• Transparent to
Users and
Applications
• Flexible Admin
defined policies
• Deduplication
1PB