This document summarizes a webinar about data modeling and indexing for Apache Accumulo using Sqrrl. It discusses Accumulo and Sqrrl technology, including table designs for dynamic documents, graphs and inverted indexes. It also describes how Sqrrl Enterprise allows building advanced indexes and the real-time operational applications it enables.
1. Securely explore your data
DATA MODELING AND
INDEXING FOR
APACHE ACCUMULO
Sqrrl Webinar Series
October, 2013
Adam Fuchs, CTO
Sqrrl Data, Inc.
2. RECAP
1. Introduction to Sqrrl and Accumulo
2. Security In The Wild
3. Sqrrl and Accumulo Technology
4. The Data-Centric Security Ecosystem
In our September Webinar:
Sqrrl, Apache Accumulo, and Cell-Level Security
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 2%
3. TODAY’S DISCUSSION
1. Sqrrl and Accumulo Technology Review
2. Table Designs
1. Dynamic Documents
2. Graphs
3. Inverted Indexes
3. Putting It All Together with Sqrrl
Data Modeling and Indexing for Apache Accumulo
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 3%
4. LAYERED ARCHITECTURE
Turtles all the way down...
Accumulo'RPC'
(Sorted(Key/Value(I/O)(
Hadoop'RPC'
(File(I/O)(
Application
Sqrrl Enterprise
Sqrrl'API'over'Apache'Thri8'RPC'
(JSON,(Graph,(Aggrega=on,(
Search,(etc.)(
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 4%
5. An Accumulo key is a 5-tuple, consisting of:
" Row: Controls Atomicity
" Column Family: Controls Locality
" Column Qualifier: Controls Uniqueness
" Visibility Label: Controls Access
" Timestamp: Controls Versioning
Row Col. Fam. Col. Qual. Visibility Timestamp Value
John Doe Notes PCP PCP_JD 20120912
Patient suffers
from an acute …
John Doe Test Results Cholesterol JD|PCP_JD 20120912 183
John Doe Test Results Mental Health JD|PSYCH_JD 20120801 Pass
John Doe Test Results X-Ray JD|PHYS_JD 20120513 1010110110100…
Accumulo(Key/Value(Example(
ACCUMULO DATA FORMAT
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 5%
8. TODAY’S DISCUSSION
1. Sqrrl and Accumulo Technology Review
2. Table Designs
1. Dynamic Documents
2. Graphs
3. Inverted Indexes
3. Putting It All Together with Sqrrl
Data Modeling and Indexing for Apache Accumulo
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 8%
17. TODAY’S DISCUSSION
1. Sqrrl and Accumulo Technology Review
2. Table Designs
1. Dynamic Documents
2. Graphs
3. Inverted Indexes
3. Putting It All Together with Sqrrl
Data Modeling and Indexing for Apache Accumulo
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 17%
18. SQRRL ENTERPRISE
• Dynamic Documents
• JSON I/O support
• Cell-level Security and Efficient Aggregation Extensions
• Dynamic Graphs
• Co-partitioned with Documents for Integrated Search and
Discovery
• Search
• Lucene Query Syntax
• Accumulo Indexes Preserve Security Model
• Processing
• SQL-Like Language for Transforming and Aggregating Results
• Parallel Slicing and Extraction
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 18%
Simple API for Advanced Accumulo Usage
20. HOW TO LEARN MORE
Download our White Paper
" www.sqrrl.com/whitepaper
Watch a video
" www.sqrrl.com/downloads#videos
Request a demo or one-on-one workshop
" www.sqrrl.com/contact
Come meet us
" Accumulo Meetup (October 28, New York)
" Strata + Hadoop World (October 28-30, New York)
" IBM IOD (November 4-7, Las Vegas)
" SC13 (November 18-21, Denver)
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 20%
21. THANK YOU
Thanks for attending!
To keep up to date
with Sqrrl, check out
or social media sites:
www.twitter.com/sqrrl_inc
www.linkedin.com/company/sqrrl
Sqrrl%Data,%Inc.%Confiden3al%and%Proprietary% 21%