SlideShare una empresa de Scribd logo
1 de 23
Descargar para leer sin conexión
Transforming Data Management and Time to Insight
with Anzo Smart Data Lake®
©2016 Cambridge Semantics Inc. All Rights Reserved
Steve Hamby
Managing Director, Government
Cambridge Semantics Inc.
678.346.6386
steve@cambridgesemantics.com
This webinar is targeted to Federal Government CIOs and
staff that are researching enterprise data management and
mining tools to help them understand how Smart Data Lakes
enable a viable mechanism for addressing their top priorities
©2016 Cambridge Semantics Inc. All rights reserved. 3
Cambridge Semantics (CSI)
The Anzo Smart Data Lake® enables enterprise-wide, governed,
self-serve, diverse data discovery & analytics


Company:
▪ Founded by senior team from IBM’s Advanced Internet Technology Group
▪ Complemented by MPP technology team previously at Netezza & Paraccel (Amazon Redshift)
▪ Select customers:
Software:
▪ Anzo: Links & harmonizes diverse data using Semantic knowledge graphs for governed self-
serve data preparation, discovery & analytics
▪ Currently 3rd generation of the product in production use
MIT Innovation
Showcase
Business
Intelligence /
Analytics Solutions
©2016 Cambridge Semantics Inc. All rights reserved. 4
What is the Smart Data Lake?
• Ovum: a governed, managed, and transparent default ingest point for raw data that
provides inventory, security, and integration to enable holistic enterprise analytics
• CSI: a flexible and scalable system of parallelized data ingestion for structured and
unstructured data that uses semantic graph models to link and contextualize the diverse
enterprise data, and a clustered, in-memory graph query engine to provide users self-
service data discovery, analytics and visualization capabilities across enterprise concepts
representing real-world entities & relationships
Machine-Assisted Smart Data Prep &
Integration
Semantic Knowledge Graphs
In-Memory Clustered Graph Query
Engine
Interactive Discovery & Analytics with
Query Builder Dashboard
©2016 Cambridge Semantics Inc. All rights reserved. 5
The Data Prep, Discovery & Analytics Problem
Current Approach: Labor-intensive, Time-consuming with Limited Insight Capabilities
• Data Preparation Process Iterative and Labor Intensive
• Data Models & Data Prepared to Respond to Specific Questions – Full granularity not
exposed
• Harmonizing Unstructured Data is Too Complex
• Understanding & Overview of Available Data Limited
• Poor/Limited Governance & Provenance across Diverse Data
©2016 Cambridge Semantics Inc. All rights reserved. 6
Enterprise Data Management and Mining
• Enterprise Data Warehouse (1980s and 1990s): central repository(ies) of integrated,
processed data from one or more disparate sources, containing current and historical data
that is used for creating analytical reports for operational users throughout the enterprise
Single, “boil-the-ocean”, static view of the enterprise
• Data Mart (1990s and 2000s): simple form of a data warehouse that is focused on a single
subject / functional area (e.g. sales, finance or marketing), and are often created and
controlled by a single department within an organization
Several, often competing, “authoritative” views of the enterprise
• Data Lake (2010 - present): a collection of storage instances of various data assets for the
purpose of presenting an unrefined view of data to only the most highly skilled analysts, to
help them explore their data refinement and analysis techniques independent of any of the
system-of-record compromises that may exist in a traditional analytic data store (such as a
data mart or data warehouse)
“Data shredder / blender” —> Hire smart data scientists
©2016 Cambridge Semantics Inc. All rights reserved. 7
Feature Comparison of Legacy Approaches
1. Governed: Enforce processes on integrating data sources to enhance enterprise data;
2. Non-volatile: Do not overlay existing data values, but create a new record with the new value;
3. Integrated: Provide holistic enterprise view of data across business functions and over time;
4. Agile: Rapidly evolve use cases supported as organization needs change;
5. Data Agnostic: Support all data, regardless of structure or implied authoritativeness;
6. Data Centric: Analytics are attached to data in a horizontally scalable environment versus pushing data to
analytics, which creates significant network load;
7. Standards-based: Uses open standards to promote interoperability and reduce vendor lock-in;
8. Low TCO: Total cost of ownership is directly proportional to number of supported use cases.
©2016 Cambridge Semantics Inc. All rights reserved. 8
Comparison of Total Cost of Ownership
©2016 Cambridge Semantics Inc. All rights reserved. 9
Introducing Anzo Smart Data Lake
Anzo Smart Data Lake
•Data Prep - Combine
•Data Discovery - Explore
•Data Analytics - Act
Provide a Knowledge Graph of All Diverse Data,
Structured, Unstructured, Internal or External,
at Big Data Scale while Providing End-User Governed Self-Service Discovery & Analytics
A fully integrated Data Prep, Data Discovery and Data Analytics product
which is an overlay to existing systems
©2016 Cambridge Semantics Inc. All rights reserved. 10
Data Prep: Combine to Explore the Whole Picture
©2016 Cambridge Semantics Inc. All rights reserved. 11
Data Discovery: Explore All Your Data
©2016 Cambridge Semantics Inc. All rights reserved. 12
Data Analytics: Act - Governed Self-Service on Trusted
Data
Traditional
Approach
Smart Data
Lake
● Explore entities & relationships across all
data sets to discover & analyze new
insights
● Create individualized analytics
dashboards based on custom extract
without having to learn a query language
● No more waiting on IT for DBA’s or data
wranglers to integrate data
● Trusted and validated data. Data
harmonization for synonyms
● Governance of users.. IT can set data
policies
● Self-serve analytics post data loading
into self-serve BI tool
● Little governance or control
● Poor handling of unstructured and
diverse data
©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 13
Open Standards Enable Smart Data
Logical models represented as OWL Ontologies are the key to a Smart Data Lake
• Next step in a decades-long evolution in IT
– Relational theory started a movement focused on modeling data in rich queryable
structures
– Logical models are data too and not conflated with database storage constructs
– Rich canonical models allowing unlimited entity types & relationships
– Open Standards based representations (W3C’s OWL & RDF)
• CSI is the market leader in operationalizing ontologies
– Rich detailed data representations in the language of the business or domain
– Simplifies data ingestion through ETL/ELT automation, interlinking & harmonization
– Ideal for merging data from both structured and unstructured sources
– Dramatically simplified end-user data self-service through automated query
(SPARQL) generation
©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 14
Work the Way you Think

Graph-Based Data Discovery and Analytics
©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 15
Work the Way you Think

Graph-Based Data Discovery and Analytics
©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 16
Anzo Smart Data Lake®

Graph-based Data Discovery & Analytics for the Enterprise
An enterprise-scale offering to easily harmonize, discover and analyze all data
whether from inside or outside the enterprise
Smart Data Prep &
Integration
Combine
Smart Data
Discovery
Enterprise Smart
Data Lake
Explore
Smart Data
Analytics
Act
Standard
Ontology
(e.g. NIEM,
BFO)
©2016 Cambridge Semantics Inc. All rights reserved. 17
Anzo Smart Data Lake®
Graph-based Data Discovery for the Enterprise
A Semantic knowledge graph links & harmonizes
diverse data to provide governed self-service
discovery, analytics & data management at big data
scale
Enterprise
Corporate Systems
Enterprise
Unstructured Data
External Data
Feeds
Social Media &
Diverse Data
Sources
Third-Party &
Partner Systems
Cambridge Semantics
Query Builder Dashboards
Last Mile Analytics Tools
Examples:
Machine-Assisted Smart Data
Prep & Integration
Semantic Knowledge Graphs
In-Memory Clustered Graph
Query Engine
Interactive Discovery &
Analytics with Query Builder
Dashboard
Data
Prep & Ingestion
Data
Discovery &
Analytics
Easily harmonize, discover and analyze all data whether from inside or outside the
enterprise
©2016 Cambridge Semantics Inc. All rights reserved. 18
Anzo Smart Data Lake®

Processing Unstructured Data
©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 19
“ “
It would be nice if somehow all
data and information was coming
from a single place, like spokes
on a wheel.
Our highly qualified, highly
paid PhD’s should not be
spending their time searching
for and formatting data.
“
In a recent regulatory investigation, our
scientists had to sift through tons of
documents and systems to manually
trace data. All together, the investigation
spanned multiple months
Anzo Smart Data Lake®

Example Customer Challenges and Customer Satisfaction Quotes
“
The integration/connectivity
that the tool affords is very
impressive
“
Very powerful …
Far exceeded
expectations
“
The ability to handle structured
and unstructured data is very
impressive
©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 20
Example Government Use Cases Already Transforming
Data Management and Time to Insight
• U.S. Intelligence Organization
– Problem: Need better insight into contract performance, trends, and patterns
– Background:
• Recent push from Congress to leverage small business
• Diverse mission set and classification of specific contracts complicates insight issues
– Solution: Use Anzo Smart Data Platform® to ingest contracts data to provide contract analysts
better insight
– Result: Analysts can search, analyze, and conduct interactive data discovery across contracts that
are within their “Need to Know” purview to better understand and optimize contracting
• Defense Organization
– Problem: New multi-billion dollar program with 500+ legacy data sources over-spending and
under-delivering; had immediate need for determining the optimal mix of legacy data sources to
integrate to perform mission, and which ones to terminate sustainment
– Background:
• Data in relational databases, proprietary flat files, XML, and some .csv and Excel
• Mission requirements and available data sources were documented in Excel
– Solution: Use Anzo Smart Data Platform® to integrate Excel containing mission requirements and
available data sources; generate a data lineage for all data sources; create reports detailing
optimal data sources to integrate to accomplish mission with least cost
– Result: Mission-oriented data integration roadmap with faster time to implementation and
significant cost reduction
©2016 Cambridge Semantics Inc. All rights reserved. 21
Anzo Smart Data Lake®

Driving Business Efficiency Use Case
• Every agency has key
performance metrics related to
their mission
– They manage their performance based
on these KPPs / KPIs
• Every agency has a requirement
to share “business” related data
– Energy efficiency, personnel data,
contracting performance data, etc.
• Anzo Smart Data Lake® can help
agencies drive business efficiency
through analyzing this data
– Better understand the data they share
– Analyze trends in the data
– Compare with other agencies
– Develop metrics (similar to KPP / KPI
on the mission) that provide a
“scoring” mechanism
– Rationalize investments / needed
contractual obligations
– Predict future performance
©2016 Cambridge Semantics Inc. All rights reserved. 22
Anzo Smart Data Lake®

Enabling Self-Service to Citizens
• Data.gov was created to provide
citizens data to enable them to:
– Conduct research
– Develop web and mobile
applications
– Design data visualizations
– And more
• Filtering through the data is a
timely task for citizens in using
the data
– Anzo Smart Data Lake® can
provide a robust, scalable solution
to process large amounts of data
• Including text and structured
– Using Ontology to easily
categorize data for the citizen
– Enabling citizens to produce
better analysis and do better
services with the results of their
analysis
©2016 Cambridge Semantics Inc. All rights reserved. 23
Summary, Q&A and Next Steps
• Anzo Smart Data Lake can help your agency
transform its data management to drive faster
time to insight and cost reduction
– Platform has proven success in Financial Services and
Life Sciences (very large, complex data)
• Business Efficiency is a missing metric for agencies
– Data is already being shared
• Citizens are already doing wonderful things with
data.gov
– Analysis is expensive; Anzo can help
• Next Steps
– Proof of Concept
– Unsolicited Proposal

Más contenido relacionado

La actualidad más candente

Harnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie MacHarnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie Mac
DataWorks Summit
 
Big data competitive landscape overview
Big data competitive landscape overviewBig data competitive landscape overview
Big data competitive landscape overview
Bisakha Praharaj
 
Automating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge BaseAutomating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge Base
Vaticle
 

La actualidad más candente (20)

From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
 
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricUsing Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 
Modern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in InsuranceModern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in Insurance
 
Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data Fabric
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
 
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
 
Semantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesSemantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational Databases
 
Harnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie MacHarnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie Mac
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture Capabilities
 
Fighting Financial Crime with Artificial Intelligence
Fighting Financial Crime with Artificial IntelligenceFighting Financial Crime with Artificial Intelligence
Fighting Financial Crime with Artificial Intelligence
 
Big Data Landscape 2016
Big Data Landscape 2016 Big Data Landscape 2016
Big Data Landscape 2016
 
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
 
Big data competitive landscape overview
Big data competitive landscape overviewBig data competitive landscape overview
Big data competitive landscape overview
 
Himss DC meet mark logic
Himss DC meet   mark logicHimss DC meet   mark logic
Himss DC meet mark logic
 
Automating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge BaseAutomating Data Science over a Human Genomics Knowledge Base
Automating Data Science over a Human Genomics Knowledge Base
 
Study: #Big Data in #Austria
Study: #Big Data in #AustriaStudy: #Big Data in #Austria
Study: #Big Data in #Austria
 
Denodo Platform 7.0: What's New?
Denodo Platform 7.0: What's New?Denodo Platform 7.0: What's New?
Denodo Platform 7.0: What's New?
 

Destacado

Destacado (19)

PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentation
 
Data: The Good, The Bad & The Ugly
Data: The Good, The Bad & The UglyData: The Good, The Bad & The Ugly
Data: The Good, The Bad & The Ugly
 
Visualization 101 BA4All
Visualization 101 BA4AllVisualization 101 BA4All
Visualization 101 BA4All
 
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
 
A Journey to Modern Apps with Containers, Microservices and Big Data
A Journey to Modern Apps with Containers, Microservices and Big DataA Journey to Modern Apps with Containers, Microservices and Big Data
A Journey to Modern Apps with Containers, Microservices and Big Data
 
Lambda at Weather Scale - Cassandra Summit 2015
Lambda at Weather Scale - Cassandra Summit 2015Lambda at Weather Scale - Cassandra Summit 2015
Lambda at Weather Scale - Cassandra Summit 2015
 
Data Scientist 101 BI Dutch
Data Scientist 101 BI DutchData Scientist 101 BI Dutch
Data Scientist 101 BI Dutch
 
Hi Speed Datawarehousing
Hi Speed DatawarehousingHi Speed Datawarehousing
Hi Speed Datawarehousing
 
SnappyData overview NikeTechTalk 11/19/15
SnappyData overview NikeTechTalk 11/19/15SnappyData overview NikeTechTalk 11/19/15
SnappyData overview NikeTechTalk 11/19/15
 
Always On: Building Highly Available Applications on Cassandra
Always On: Building Highly Available Applications on CassandraAlways On: Building Highly Available Applications on Cassandra
Always On: Building Highly Available Applications on Cassandra
 
Database Shootout: What's best for BI?
Database Shootout: What's best for BI?Database Shootout: What's best for BI?
Database Shootout: What's best for BI?
 
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch AnalysisNoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
 
Scalable On-Demand Hadoop Clusters with Docker and Mesos
Scalable On-Demand Hadoop Clusters with Docker and MesosScalable On-Demand Hadoop Clusters with Docker and Mesos
Scalable On-Demand Hadoop Clusters with Docker and Mesos
 
Online Analytics with Hadoop and Cassandra
Online Analytics with Hadoop and CassandraOnline Analytics with Hadoop and Cassandra
Online Analytics with Hadoop and Cassandra
 
Streaming Big Data & Analytics For Scale
Streaming Big Data & Analytics For ScaleStreaming Big Data & Analytics For Scale
Streaming Big Data & Analytics For Scale
 
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spa...
 
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
[db tech showcase Tokyo 2015] A14:Amazon Redshiftの元となったスケールアウト型カラムナーDB徹底解説 その...
 
Big Data: Architecture and Performance Considerations in Logical Data Lakes
Big Data: Architecture and Performance Considerations in Logical Data LakesBig Data: Architecture and Performance Considerations in Logical Data Lakes
Big Data: Architecture and Performance Considerations in Logical Data Lakes
 

Similar a Transforming Data Management and Time to Insight with Anzo Smart Data Lake®

Anzo smart data integration dgiq 2014
Anzo smart data integration dgiq 2014Anzo smart data integration dgiq 2014
Anzo smart data integration dgiq 2014
Marty Loughlin
 
Anzo smart data integration february 2015
Anzo smart data integration february 2015Anzo smart data integration february 2015
Anzo smart data integration february 2015
John Rueter
 

Similar a Transforming Data Management and Time to Insight with Anzo Smart Data Lake® (20)

Anzo smart data integration dgiq 2014
Anzo smart data integration dgiq 2014Anzo smart data integration dgiq 2014
Anzo smart data integration dgiq 2014
 
Anzo smart data integration february 2015
Anzo smart data integration february 2015Anzo smart data integration february 2015
Anzo smart data integration february 2015
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbench
 
Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jr
 
Oracle big data publix sector 1
Oracle big data publix sector 1Oracle big data publix sector 1
Oracle big data publix sector 1
 
Structuring Serendipitous Collaboration
Structuring Serendipitous CollaborationStructuring Serendipitous Collaboration
Structuring Serendipitous Collaboration
 
Anzo Smart Data Integration
Anzo Smart Data IntegrationAnzo Smart Data Integration
Anzo Smart Data Integration
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Ask bigger questions
Ask bigger questionsAsk bigger questions
Ask bigger questions
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
 
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
 
BigData Analysis
BigData AnalysisBigData Analysis
BigData Analysis
 
Business Visualization: Dashboard & Storyboarding
Business Visualization: Dashboard & StoryboardingBusiness Visualization: Dashboard & Storyboarding
Business Visualization: Dashboard & Storyboarding
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiry
 
IARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptxIARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptx
 

Más de Cambridge Semantics

Más de Cambridge Semantics (15)

Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
 
Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
Using Machine Teaching in Text Analysis: Case Study on Using Machine Teaching...
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
 
The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge Graph
 
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
 
Introduction to RDF*
Introduction to RDF*Introduction to RDF*
Introduction to RDF*
 
AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101
 
Healthcare and Life Sciences: Two Industries Separated by Common Data
Healthcare and Life Sciences: Two Industries Separated by Common DataHealthcare and Life Sciences: Two Industries Separated by Common Data
Healthcare and Life Sciences: Two Industries Separated by Common Data
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
Scalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and HowScalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and How
 
Sustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive AnalyticsSustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive Analytics
 
Should a Graph Database Be in Your Next Data Warehouse Stack?
Should a Graph Database Be in Your Next Data Warehouse Stack?Should a Graph Database Be in Your Next Data Warehouse Stack?
Should a Graph Database Be in Your Next Data Warehouse Stack?
 
Going Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph AnalyticsGoing Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph Analytics
 
Accelerate Pharma R&D with Cross-Study Analytics
Accelerate Pharma R&D with Cross-Study AnalyticsAccelerate Pharma R&D with Cross-Study Analytics
Accelerate Pharma R&D with Cross-Study Analytics
 
Large Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel ProcessingLarge Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel Processing
 

Último

Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 

Transforming Data Management and Time to Insight with Anzo Smart Data Lake®

  • 1. Transforming Data Management and Time to Insight with Anzo Smart Data Lake® ©2016 Cambridge Semantics Inc. All Rights Reserved Steve Hamby Managing Director, Government Cambridge Semantics Inc. 678.346.6386 steve@cambridgesemantics.com
  • 2. This webinar is targeted to Federal Government CIOs and staff that are researching enterprise data management and mining tools to help them understand how Smart Data Lakes enable a viable mechanism for addressing their top priorities
  • 3. ©2016 Cambridge Semantics Inc. All rights reserved. 3 Cambridge Semantics (CSI) The Anzo Smart Data Lake® enables enterprise-wide, governed, self-serve, diverse data discovery & analytics 
 Company: ▪ Founded by senior team from IBM’s Advanced Internet Technology Group ▪ Complemented by MPP technology team previously at Netezza & Paraccel (Amazon Redshift) ▪ Select customers: Software: ▪ Anzo: Links & harmonizes diverse data using Semantic knowledge graphs for governed self- serve data preparation, discovery & analytics ▪ Currently 3rd generation of the product in production use MIT Innovation Showcase Business Intelligence / Analytics Solutions
  • 4. ©2016 Cambridge Semantics Inc. All rights reserved. 4 What is the Smart Data Lake? • Ovum: a governed, managed, and transparent default ingest point for raw data that provides inventory, security, and integration to enable holistic enterprise analytics • CSI: a flexible and scalable system of parallelized data ingestion for structured and unstructured data that uses semantic graph models to link and contextualize the diverse enterprise data, and a clustered, in-memory graph query engine to provide users self- service data discovery, analytics and visualization capabilities across enterprise concepts representing real-world entities & relationships Machine-Assisted Smart Data Prep & Integration Semantic Knowledge Graphs In-Memory Clustered Graph Query Engine Interactive Discovery & Analytics with Query Builder Dashboard
  • 5. ©2016 Cambridge Semantics Inc. All rights reserved. 5 The Data Prep, Discovery & Analytics Problem Current Approach: Labor-intensive, Time-consuming with Limited Insight Capabilities • Data Preparation Process Iterative and Labor Intensive • Data Models & Data Prepared to Respond to Specific Questions – Full granularity not exposed • Harmonizing Unstructured Data is Too Complex • Understanding & Overview of Available Data Limited • Poor/Limited Governance & Provenance across Diverse Data
  • 6. ©2016 Cambridge Semantics Inc. All rights reserved. 6 Enterprise Data Management and Mining • Enterprise Data Warehouse (1980s and 1990s): central repository(ies) of integrated, processed data from one or more disparate sources, containing current and historical data that is used for creating analytical reports for operational users throughout the enterprise Single, “boil-the-ocean”, static view of the enterprise • Data Mart (1990s and 2000s): simple form of a data warehouse that is focused on a single subject / functional area (e.g. sales, finance or marketing), and are often created and controlled by a single department within an organization Several, often competing, “authoritative” views of the enterprise • Data Lake (2010 - present): a collection of storage instances of various data assets for the purpose of presenting an unrefined view of data to only the most highly skilled analysts, to help them explore their data refinement and analysis techniques independent of any of the system-of-record compromises that may exist in a traditional analytic data store (such as a data mart or data warehouse) “Data shredder / blender” —> Hire smart data scientists
  • 7. ©2016 Cambridge Semantics Inc. All rights reserved. 7 Feature Comparison of Legacy Approaches 1. Governed: Enforce processes on integrating data sources to enhance enterprise data; 2. Non-volatile: Do not overlay existing data values, but create a new record with the new value; 3. Integrated: Provide holistic enterprise view of data across business functions and over time; 4. Agile: Rapidly evolve use cases supported as organization needs change; 5. Data Agnostic: Support all data, regardless of structure or implied authoritativeness; 6. Data Centric: Analytics are attached to data in a horizontally scalable environment versus pushing data to analytics, which creates significant network load; 7. Standards-based: Uses open standards to promote interoperability and reduce vendor lock-in; 8. Low TCO: Total cost of ownership is directly proportional to number of supported use cases.
  • 8. ©2016 Cambridge Semantics Inc. All rights reserved. 8 Comparison of Total Cost of Ownership
  • 9. ©2016 Cambridge Semantics Inc. All rights reserved. 9 Introducing Anzo Smart Data Lake Anzo Smart Data Lake •Data Prep - Combine •Data Discovery - Explore •Data Analytics - Act Provide a Knowledge Graph of All Diverse Data, Structured, Unstructured, Internal or External, at Big Data Scale while Providing End-User Governed Self-Service Discovery & Analytics A fully integrated Data Prep, Data Discovery and Data Analytics product which is an overlay to existing systems
  • 10. ©2016 Cambridge Semantics Inc. All rights reserved. 10 Data Prep: Combine to Explore the Whole Picture
  • 11. ©2016 Cambridge Semantics Inc. All rights reserved. 11 Data Discovery: Explore All Your Data
  • 12. ©2016 Cambridge Semantics Inc. All rights reserved. 12 Data Analytics: Act - Governed Self-Service on Trusted Data Traditional Approach Smart Data Lake ● Explore entities & relationships across all data sets to discover & analyze new insights ● Create individualized analytics dashboards based on custom extract without having to learn a query language ● No more waiting on IT for DBA’s or data wranglers to integrate data ● Trusted and validated data. Data harmonization for synonyms ● Governance of users.. IT can set data policies ● Self-serve analytics post data loading into self-serve BI tool ● Little governance or control ● Poor handling of unstructured and diverse data
  • 13. ©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 13 Open Standards Enable Smart Data Logical models represented as OWL Ontologies are the key to a Smart Data Lake • Next step in a decades-long evolution in IT – Relational theory started a movement focused on modeling data in rich queryable structures – Logical models are data too and not conflated with database storage constructs – Rich canonical models allowing unlimited entity types & relationships – Open Standards based representations (W3C’s OWL & RDF) • CSI is the market leader in operationalizing ontologies – Rich detailed data representations in the language of the business or domain – Simplifies data ingestion through ETL/ELT automation, interlinking & harmonization – Ideal for merging data from both structured and unstructured sources – Dramatically simplified end-user data self-service through automated query (SPARQL) generation
  • 14. ©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 14 Work the Way you Think
 Graph-Based Data Discovery and Analytics
  • 15. ©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 15 Work the Way you Think
 Graph-Based Data Discovery and Analytics
  • 16. ©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 16 Anzo Smart Data Lake®
 Graph-based Data Discovery & Analytics for the Enterprise An enterprise-scale offering to easily harmonize, discover and analyze all data whether from inside or outside the enterprise Smart Data Prep & Integration Combine Smart Data Discovery Enterprise Smart Data Lake Explore Smart Data Analytics Act Standard Ontology (e.g. NIEM, BFO)
  • 17. ©2016 Cambridge Semantics Inc. All rights reserved. 17 Anzo Smart Data Lake® Graph-based Data Discovery for the Enterprise A Semantic knowledge graph links & harmonizes diverse data to provide governed self-service discovery, analytics & data management at big data scale Enterprise Corporate Systems Enterprise Unstructured Data External Data Feeds Social Media & Diverse Data Sources Third-Party & Partner Systems Cambridge Semantics Query Builder Dashboards Last Mile Analytics Tools Examples: Machine-Assisted Smart Data Prep & Integration Semantic Knowledge Graphs In-Memory Clustered Graph Query Engine Interactive Discovery & Analytics with Query Builder Dashboard Data Prep & Ingestion Data Discovery & Analytics Easily harmonize, discover and analyze all data whether from inside or outside the enterprise
  • 18. ©2016 Cambridge Semantics Inc. All rights reserved. 18 Anzo Smart Data Lake®
 Processing Unstructured Data
  • 19. ©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 19 “ “ It would be nice if somehow all data and information was coming from a single place, like spokes on a wheel. Our highly qualified, highly paid PhD’s should not be spending their time searching for and formatting data. “ In a recent regulatory investigation, our scientists had to sift through tons of documents and systems to manually trace data. All together, the investigation spanned multiple months Anzo Smart Data Lake®
 Example Customer Challenges and Customer Satisfaction Quotes “ The integration/connectivity that the tool affords is very impressive “ Very powerful … Far exceeded expectations “ The ability to handle structured and unstructured data is very impressive
  • 20. ©2016 Cambridge Semantics Inc. All rights reserved. Company Confidential 20 Example Government Use Cases Already Transforming Data Management and Time to Insight • U.S. Intelligence Organization – Problem: Need better insight into contract performance, trends, and patterns – Background: • Recent push from Congress to leverage small business • Diverse mission set and classification of specific contracts complicates insight issues – Solution: Use Anzo Smart Data Platform® to ingest contracts data to provide contract analysts better insight – Result: Analysts can search, analyze, and conduct interactive data discovery across contracts that are within their “Need to Know” purview to better understand and optimize contracting • Defense Organization – Problem: New multi-billion dollar program with 500+ legacy data sources over-spending and under-delivering; had immediate need for determining the optimal mix of legacy data sources to integrate to perform mission, and which ones to terminate sustainment – Background: • Data in relational databases, proprietary flat files, XML, and some .csv and Excel • Mission requirements and available data sources were documented in Excel – Solution: Use Anzo Smart Data Platform® to integrate Excel containing mission requirements and available data sources; generate a data lineage for all data sources; create reports detailing optimal data sources to integrate to accomplish mission with least cost – Result: Mission-oriented data integration roadmap with faster time to implementation and significant cost reduction
  • 21. ©2016 Cambridge Semantics Inc. All rights reserved. 21 Anzo Smart Data Lake®
 Driving Business Efficiency Use Case • Every agency has key performance metrics related to their mission – They manage their performance based on these KPPs / KPIs • Every agency has a requirement to share “business” related data – Energy efficiency, personnel data, contracting performance data, etc. • Anzo Smart Data Lake® can help agencies drive business efficiency through analyzing this data – Better understand the data they share – Analyze trends in the data – Compare with other agencies – Develop metrics (similar to KPP / KPI on the mission) that provide a “scoring” mechanism – Rationalize investments / needed contractual obligations – Predict future performance
  • 22. ©2016 Cambridge Semantics Inc. All rights reserved. 22 Anzo Smart Data Lake®
 Enabling Self-Service to Citizens • Data.gov was created to provide citizens data to enable them to: – Conduct research – Develop web and mobile applications – Design data visualizations – And more • Filtering through the data is a timely task for citizens in using the data – Anzo Smart Data Lake® can provide a robust, scalable solution to process large amounts of data • Including text and structured – Using Ontology to easily categorize data for the citizen – Enabling citizens to produce better analysis and do better services with the results of their analysis
  • 23. ©2016 Cambridge Semantics Inc. All rights reserved. 23 Summary, Q&A and Next Steps • Anzo Smart Data Lake can help your agency transform its data management to drive faster time to insight and cost reduction – Platform has proven success in Financial Services and Life Sciences (very large, complex data) • Business Efficiency is a missing metric for agencies – Data is already being shared • Citizens are already doing wonderful things with data.gov – Analysis is expensive; Anzo can help • Next Steps – Proof of Concept – Unsolicited Proposal