At last week's Strata + Hadoop World in San Jose, CA SnapLogic Chief Scientist Greg Benson talked to big data experts, data scientists and other enterprise IT leaders about the data lake and how SnapLogic comes into play with Hadoop-scale data integration.
Check out this presentation to learn how SnapLogic helps customers adopt Hadoop and automate data integration workflows.
To learn more, visit: www.snaplogic.com/big-data
4. Elastic Integration, Hadoop-Scale!
• Cloud to Cloud
• Cloud to Ground!
• Groud to Groud!
• Elastic: Scales in the
cloud or on premise.
Metadata
Data
5. SnapLogic Key Technologies !
• SaaS model for Integration: iPaaS
• Modern HTML5-based user
interface
• No programming required
• Intelligent connectivity: Snaps
• High-performance pipeline
execution engine: Snaplex
• Hybrid execution:
cloud or ground
• Streaming and accumulating
(batch) support
• JSON native data processing
• Pipelines as APIs
• Integration automation
• Hadooplex, SnapReduce, and SnapSpark
7. Hadooplex: Snaplex YARN Application
= Snaplex Container
• SnapLogic is a first-class
citizen in Hadoop
• Multiplex Hadoop Cluster
for integration, data
staging, and data prep.
• Scale out Snaplex
processes via Resource
Manager
• Kerberos Authentication
• Certified by Cloudera and
Hortonworks
8. SnapReduce: Pipelines Generate MapReduce
MAP MAP MAPMAP
REDUCEMAP MAPREDUCE
SnapReduce
Compiler
Map Reduce
• A checkbox option to
SnapReduce-enable a pipeline
• Support for SequenceFile,
RCFile, document (JSON)
processing for MapReduce jobs
YARN
9. SnapLogic, Hadoop, and the Data Lake !
• Augment Hadoop ecosystem
• Open up Hadoop to more IT/Business professionals
• Automate data ingest into Hadoop
• Prepare data for Data Scientists and Analytics
• Generate MapReduce and Spark code for pipeline execution
• Deliver data to DBs, BI Tools, and Cloud Apps
10. Big Data Integration in a Snap!
@SnapLogic
Facebook.com/SnapLogic
Plus.google.com/+SnapLogic
• Helping customers
adopt Hadoop
• Automate your data
integration workflows
Learn more at www.SnapLogic.com!
!
Notas del editor
I’m Greg Benson
Chief Scientist at SnapLogic
Also a Professor of Computer Science at the University of San Francisco
Unified Data Integration Platform
Scale in the Cloud or on Premise
Intelligent connectivity via Snaps
Beyond….
Relational Data
Batch
Point to Point
A Few Users
The Firewall
The Surface
Modern, scalable HTML5 user interface
Snaps
Pipelines
Execution
No programming required, Excel-level skills.
Beyond….
Relational Data
Batch
Point to Point
A Few Users
The Firewall
Magnify the containers? May it magnify out.
MapReduce and Spark code generation
Augment Hadoop ecosystem
Open up Hadoop to more IT/Business professionals
Automate data ingest into Hadoop
De-normalize relational data
Prepare data for Data Scientists and Analytics
Generate MapReduce and Spark code for pipeline execution
Deliver data to DBs, BI Tools, and Cloud Apps
Metadata management and lineage