SlideShare una empresa de Scribd logo
1 de 49
Data-centric market status,
case studies and outlook
Alan Morrison
February 2019
PwC
Time Activity
2:00 – 2:10 Introduction
2:10 – 2:20 Larger market context: buzz, trends and dynamics
2:20 – 2:30 Current approaches to the market
2:30 – 2:40 Case studies
2:40 – 3:15 Conclusion and discussion
Agenda
February 5, 2019Data-Centric Conference
2
Introduction
PwC
• Market oriented, passive, laissez faire
• Architecture and vision free
• “The market will fix it.”
• Big vendor dominated
• Legacy of Nick Carr’s IT Doesn’t Matter
• Siloed, isolated efforts
• Startups venture funded in silos, with waves of new
silos being generated every year
• Users themselves just passive and disempowered
buyers or subscribers
• Consumer services mostly self-service, with no one
to call if a problem arises
Product-centric IT: Got an IT problem? Buy a packaged solution.
February 5, 2019Data-Centric Conference
Products and services
Data management
Applications
Computer graphics and web design
Development
IT administration
IT certifications
IT security
Service providers
Technologies and fields
Categories from wand® precision
classification and search, 2019
PwC
Data-centric IT: Own the problem first. Then build a solution.
February 5, 2019Data-Centric Conference
5
• In the data-centric view, every IT category is subordinated to centrally managed, model-driven data via data
strategy, GRC and data-centric architecture (DCA)
• Relationship-rich modeling leads development for reasons of efficiency and effectiveness
• Standards based, open source enabled, build versus buy
• Empowers user communities, activism, large scale collaboration, shared infrastructure
Goals for data to
be obtained,
enriched and used
Data strategy
Data governance,
risk and
compliance
Data-centric
architecture (DCA)
Strategy and planning
Execution
Data-centric infrastructure
Data and logic lifecycle management
Model-driven development
Cross-enterprise intelligence
Relationship-rich modeling
Data-centric security
Process, pipeline and delivery automation
Human and machine learning loops
Data-centric design thinking
PwC
Think beyond end-to-end packaged software and
SaaS solutions
• “Cognitive computing” platforms
• “Big data” platforms
• Repackaged legacy MDM
• Application-centric integration suites
• Data preparation suites
• Robotic or intelligent process automation
• Any other “AI-enabled” X, Y or Z that simply generates
lot of market noise and gets in the way
The elephant in the room is always organizational change.
However, wrong technology and data strategy approaches
prevent change also.
Those who succeed have effective strategies, means and
execution all going for them at the same time.
Consider how to get the business and IT together
building and solving foundational data problems
• Bespoke projects with worthy objectives and practical
means of success
• Human-in-the-loop computing—constructing data-
description feedback loops for knowledge foundations
• Software and SaaSes that do encourage better data
modeling and relationship-rich integration
– semantic graph databases
– smart data hubs
– NoSQL + SQL modeling—building bridges between
tabular, document and graph
– automated taxonomy and ontology generation, but
as a starting point
– Automated process mapping, but as a starting point
– Clever modeling or visualization tools to encourage
deeper, system-level understanding
Ways to think about truly data-centric opportunities
6
February 5, 2019Data-Centric Conference
PwC
The real inhibitors to adoption aren’t technological – they’re rooted
in tribal biases and resistance to change
7
Tribalism CollectivismIndividualism
Anarchy TotalitarianismLocus of inertia
Daniel Quinn, Beyond Civilization and Alice Linsley, “Daniel Quinn: A Return to Tribalism?”, college-ethics.blogspot.com, 2018
February 5, 2019
Data-Centric Conference
PwC
Tribalism – Machine learning edition
8
Source: Pedro Domingos, The Master Algorithm, 2015
More at “Machine learning evolution”: http://usblogs.pwc.com/emerging-technology/machine-learning-evolution-infographic/, PwC, 2017
Symbolists Bayesians Connectionists Evolutionaries Analogizers
Use symbols, rules,
and logic to
represent
knowledge and
draw logical
inference
Assess the
likelihood of
occurrence for
probabilistic
inference
Recognize and
generalize patterns
dynamically with
matrices of
probabilistic,
weighted neurons
Generate variations
and then assess the
fitness of each for a
given purpose
Optimize a function
in light of
constraints (“going
as high as you
can while staying
on the road”)
Favored
algorithm
Rules and decision
trees
Favored
algorithm
Naïve Bayes or
Markov
Favored
algorithm
Neural network
Favored
algorithm
Genetic programs
Favored
algorithm
Support vectors
February 5, 2019
Data-Centric Conference
PwC
Tribalism – Data integration edition
9
Trend toward more data centricity this way
Application-
centric
RESTful
developers
Relational
database linkers
Data-centric
knowledge graphers
Application-
centric
ESB advocates
Semantic Web Company, 2018
Computerscience
wiki.org, 2018
TIBCO, 2014
Oracle DBA’s Guide, 2018
User Scott
Select FROM emp
Local
database
PUBLIC SYNONYM
Emp - >
emp@HQ.ACME>COM
Database link
(undirected)
Remote
database
EMP table
Portals
Net
Application
B2B
Interactions
Enterprise
data
Business process
management
Web Services
Mobile
Applications
DEE
Application
ERP
CRM SFA
Legacy
System
ESBCustom environment Common environment
API
Unstructured
Data
Semi-
structured Data
Structured
Data
Schema mapping based
on ontologies
Entity Extractor informs all
incoming data streams about
its semantics and links them
Unified Views
RDF Graph
Database
PoolParty
Graph Search
February 5, 2019Data-Centric Conference
PwC
Crossing the chasm between the tribes
10
Reducing the amount of unfamiliarity developers confront--familiar document means to achieve comparable ends
to graph:
• Semantic suites that use the JSON format and familiar hierarchies: SWC’s PoolParty is an example
• GraphQL: A popular document shape language that talks to APIs using SELECT-like statements and tree
shapes; retrieves only the data you need, provides needed feedback to API owners, helps with API sprawl
• Accessible web as database methods: JSON-LD and Schema.org, etc. vocabularies
• Document “schemas” via data objects: JavaScript objects to developers = documents to NoSQL DB types;
Object Data Modeling instead of database semantics
• Mongoose or MongoDB JSON schema features + GraphQL: MongoDB object modeling and querying that can
be used for subdocument filtering within a GraphQL context
• HyperGraphQL: A GraphQL UI for Linked Data, restricted to certain tree-shaped queries
• Universal Schema Language: Mike Bowers’ document/graph query and modeling language still in development
• COMN: Concept and Object Modeling Notation, Ted Hills’ well-defined NoSQL + SQL data modeling notation
February 5, 2019Data-Centric Conference
PwC
Knowledge graphs
• Large-scale, heterogeneous
integration for data discovery and
asset tracking
• Platforms for advanced analytics
and machine learning
• Knowledge bases for intelligent
assistants
Smart data hubs
• Alternatives or adjuncts to data
warehousing, or
• Integration across both operational
and analytics data
Off-chain to on-chain data
quality for blockchain networks
• Ways to avoid garbage-in,
garbage-out
• Supply chain integration
• Smart contracts for automated
transactions and compliance
• Personal data protection
• Self-sovereign identity
– Individuals have a border and
sovereignty for personal data
control in the same way
countries exercise sovereignty
within their borders
– Peer-to-peer relationship
status with other entities on the
network
Data cataloging and auditing
• Portal-style data asset visibility
• Data inventory and curation as a
first step to privacy compliance
• Information supply chain mapping
• Back-end classification
Examples of data-centric approaches in 2019
11
February 5, 2019Data-Centric Conference
Sources; Kurt Cagle, “The Semantic Zoo,” Forbes, 18 January 2019
Bridget Botelho, “What are the main features of data catalog software?”
TechTarget, 10 May 2018
Christopher Allen, “The Path to Self-Sovereign Identity,” Life with Alacrity blog,
25 April 2016
Phil Windley, “What is ‘self-sovereignty’?” Sovrin Twitter video, 24 January
2019
Intelligent assistants
• Expanded user experience
• Cross-domain capability
Cybersecurity
• Threat intelligence, but active
measures too
• Network analysis
• Identity verification
Market buzz, trends
and dynamics to be
aware of
PwC
Emerging technologies and the data value chain
PwC and http://doi.ieeecomputersociety.org/cms/Computer.org/dl/mags/it/2013/01/figures/mit20130100571.gif
IoT and
drone data
collection
AI AR/VR
Plan
Create
Refine
Execute
Optimize
Blockchain
(immutable ledger sharing +
autonomous process coding)
3D printing output, IoT
distribution, robotics
and drone delivery
Operational data
generation and
use
Manage
&
Monitor
February 5, 2019Data-Centric Conference
13
PwC
SaaSes and clouds generally are incredibly popular.
What are the implications?
14
February 5, 2019Data-Centric Conference
Trend toward owning and managing less and less of the stack
PwC
Enterprises used an average of 1,181 cloud services each by the
end of 2017
15
February 5, 2019Data-Centric Conference
Netskope’s 2017 Cloud Report
• Enterprises used nearly 1,200 cloud
services each in Q4 2017, according to
Netskope
• Most of these are SaaSes such as
Salesforce, Workday, SAP Success
Factors….
• Buy rather than build continues
• Even with enthusiasm for AI, data and
analytics skills continue to be scarce
PwC
Means of integration, and a database popularity ranking per
DB-Engines
16
February 5, 2019Data-Centric Conference
PwC
Database popularity by type (including model type)
17
February 5, 2019Data-Centric Conference
PwC 18
February 5, 2019Data-Centric Conference
0
20
40
60
80
100
120
1/28/2018 2/28/2018 3/31/2018 4/30/2018 5/31/2018 6/30/2018 7/31/2018 8/31/2018 9/30/2018 10/31/2018 11/30/2018 12/31/2018
Interestovertime
100=peakpopularity
Last 12 months
Popularity of the "O" word and "data lake" versus other data terms
(per Google Trends)
"data lake": (United States) ontology: (United States) "data catalog": (United States)
"data-centric": (United States) "master data management": (United States)
PwC 19
February 5, 2019Data-Centric Conference
0
20
40
60
80
100
120
1/28/2018 2/28/2018 3/31/2018 4/30/2018 5/31/2018 6/30/2018 7/31/2018 8/31/2018 9/30/2018 10/31/2018 11/30/2018 12/31/2018
Interestlevel
100=peakpopularity
Past 12 months
Popularity of data lake + storage terms
data lake aws: (United States) data lake azure: (United States) data lake hadoop: (United States)
PwC 20
February 5, 2019Data-Centric Conference
0
20
40
60
80
100
120
1/28/2018 2/28/2018 3/31/2018 4/30/2018 5/31/2018 6/30/2018 7/31/2018 8/31/2018 9/30/2018 10/31/2018 11/30/2018 12/31/2018
Interestlevel
100=peakpopularity
Last 12 months
Graph and related search term popularity
graph database: (United States) GraphQL: (United States) NoSQL: (United States)
Current approaches
to the market
PwC
Data catalogs
Open data and data catalog development sites
• World Bank
• Data.gov
• Data.gov.uk
• OpenAfrica
• Data.world
Data-centric audit and protection (DCAP)
• Can be heavily vendor-driven (Protegrity and
Informatica lead Gartner’s ranking, e.g.)
• Bespoke methods would be more in line with
data-centric build-vs.-buy principles
Open data initiatives provide examples to follow, but what about data audit?
February 5, 2019Data-Centric Conference
22
PwC
Gemini Data vs. Palantir
“Placed in the hands of an analyst, Gemini
[Data] allows them to start with an event,
such as an anomalous router message or a
suspicious email address, and then work out
from there. The product’s GUI guides the
analyst through possible connections to
that particular piece of data, allowing the
analyst to quickly and iteratively explore how
different pieces of data might be connected.”
--Alex Woodie, Datanami
Cybersecurity: Will black box services give way to open graphs that
ordinary analysts can use?
February 5, 2019Data-Centric Conference
23
Cybersecurity at the DNC in 2016
“Take more vulnerable organizations that feel like they don’t
have the resources. A good example from the book, the
Democratic National Committee. OK? So before the election
cycle gets going, they bring in Dick Clarke…he now runs a
cybersecurity firm. They do a quick survey of the DNC’s
computing system and they come back and they basically
say you guys are hopeless. OK? Like, you’re down in
kindergarten levels….he showed them how much it
was going to cost. And they said, great, this is too
much money, we’ll pay for it after the election. OK?
And then the FBI calls and says, by the way, the
Russians are inside your system. Well, I’m sorry. They
called and they asked to be connected to somebody to who
they could tell that to. And they got connected to the
help desk.”
--David Sanger, author of The Perfect Weapon, as quoted on
the Council of Foreign Relations website
PwC
Highlighted features of Jules Data-Centric Design
• Applications pull data from Jules and push the processed
data back to Jules
• Defined metadata model, lineage, ontologies, semantics
• Data controls, governance, stewarded centrally
• Data as a platform
Source: “Journey to the Centre of Data,” Giridhar Vugrala, Managing Consultant – Capital Markets,
Wipro, Sept. 13, 2018
Data-centric architecture: Designing an investment bank from
the inside out at Wipro
24
February 5, 2019Data-Centric Conference
PwC
At the annual Internet Identity Workshop,
members of the Decentralized Identity Foundation
and the W3C Verifiable Claims Working Group
cobbled together a standards-based self-
sovereign identity (SSI) stack, which included
JSON-LD, as well as two other options.
SSI is currently operational in the Sovrin Network,
a public, permissioned blockchain run by 60
different stewards on six different continents. That
network is largely standards based, particularly via
the W3C and Decentralized Identity Foundation.
Sources: Oliver Terbu, “The Self-sovereign Identity Stack,” Medium post, and the Sovrin website, 2019
Personal data protection:
A preliminary self-sovereign identity stack proposes JSON-LD as a messaging payload format
25
February 5, 2019Data-Centric Conference
PwC
Dynamic knowledge graphs: Tracking virus mutations with the
help of graph databases
26
February 5, 2019Data-Centric Conference
Kadir Bölükbasi, “One Graph to Find Them All,”
G Data Security Blog, 8 January 2019
PwC
Alexa +
Cortana:
Siri:
Intelligent assistants: Amazon (Alexa) and Microsoft (Cortana)
share resources, while Apple (Siri) contemplates its next move
27
February 5, 2019Data-Centric Conference
“Apple executive Bill Stasior, who has led the Siri team since joining the company in 2012, has
been removed as head of the project in a sweeping strategy shift favoring long-term research
over incremental updates, according to a report on Friday.”
Source: Mikey Campbell, “Apple removes Siri team lead as part of AI strategy shift,”AppleInsider,
Friday, February 01, 2019, 02:47 pm PT (05:47 pm ET)
“With the new Alexa + Cortana world, one could reach across the limitations of each platform
domains and access the power of each platform. This has a synergistic effect where one could at
a future date construct meta Skills/Apps that use features from both platforms
simultaneously…. I see Voice First platforms as a uniform way to access AI-assisted
ontologies, taxonomies, and domains. Apps and Skills can be seen as a domain but also an
extended taxonomy.”
Source: Brian Roemmele, “Why Are Microsoft And Amazon Joining Forces With Cortana And
Alexa?” Quora contributor on Forbes, Sep. 25, 2017
PwC
When will voice recognition accuracy reach 98 percent?
What happens when it does?
Should Apple re-hire Tom Gruber?
28
February 5, 2019Data-Centric Conference
PwC
Data-centric simplified: Q6FSA’s Universal
Information Classification
business
Vocabulary
Who
OrganizationPerson
Agent
Concept
Physical
Asset
Digital
Asset
What
Asset
Where
Coordinates PositionPlace
Location
How
Activity Event Process Task
Action
Why
Discipline UseRole
When
Cycle Point Span Status
TimeFunction
The goal – simplify the classification of information
Copyright © 2019 Q6FSA
Used with permission. February 5, 2019Data-Centric Conference
29
Case studies
PwC
With a knowledge graph base, companies can skate to new
business models = Deep transformation
• Once relationship data-enabled, organizations play
different roles than they've been accustomed to in the
digitized ecosystem.
• Some because of their data collection heritage can
become data providers.
• Others take up roles in the data supply chain, or
position themselves as industry platforms or
marketplaces.
• Why are top companies able to cross industry
boundaries?
• Why can unicorns extend the reach of their
business models?
31
February 5, 2019Data-Centric Conference
PwC
Largest changes in market cap by global company
Cross industry, 2018
32
Known
knowledge
graph builders
Known KG
builders
Operator of Taobao
and AliBot KG builder
(1)Change in market cap from IPO date
(2)Market cap at IPO date
Source: Bloomberg and PwC analysis
Company name Location Industry
Change in market cap
2009-2018 ($bn) Market cap 2018 ($bn)
1 Apple United States Technology 757 851
2 Amazon.Com United States Consumer Services 670 701
3 Alphabet United States Technology 609 719
4 Microsoft Corp United States Technology 540 703
5 Tencent Holdings China Technology 483 496
6 Facebook United States Technology 383(1) 464
7 Berkshire Hathaway United States Financial 358 492
8 Alibaba China Consumer Services 302(1) 470
9 JPMorgan Chase United States Financials 275 375
10 Bank of America United States Financials 263 307
v
• IBM and Citi are also working on cross-enterprise knowledge graphs
• Many have cross-enterprise knowledge graph ambitions, but most are focused on a single use case
• S&P does cross-enterprise data management using relational tech
February 5, 2019
Data-Centric Conference
PwC
Graphs (including hybrids) complete the picture of your
transformed data lifecycle and how it’s managed
33
February 5, 2019Data-Centric Conference
PwC
Transformation scalability – The AirBnB knowledge
graph example
“In order to surface relevant context to people, we
need to have some way of representing
relationships between distinct but related entities
(think cities, activities, cuisines, etc.) on Airbnb to
easily access important and relevant information
about them….
These types of information will become
increasingly important as we move towards
becoming an end-to-end travel platform as
opposed to just a place for staying in homes. The
knowledge graph is our solution to this need,
giving us the technical scalability we need to power
all of Airbnb’s verticals and the flexibility to define
abstract relationships.”
--Spencer Chang, AirBnB Engineering
34
Events
Neighborhoods
Tags
Restaurants
Users
Homes
Experiences
Places
Airbnb Engineering, 2018
Markets
February 5, 2019Data-Centric Conference
PwC
Most automated knowledge graph – Diffbot?
“Diffbot’s crawler regularly refreshes the DKG with new information and its machine learning algorithms are
smart enough to pass over sites with histories of producing ‘logically inconsistent’ facts.
“‘That’s one of the reasons why we fuse information together from different sources,’ Tung said. ‘Our scale is
such that there’s minimal potential for errors. We’d bet the business on it.’
“Diffbot launched in 2008 and counts 28 employees among its core staff of engineers and data scientists.”
--Mike Tung of Diffbot, quoted in VentureBeat
Diffbot claims an automated knowledge graph of 1 trillion + facts, designed to grow without humans in the
loop.
That compares with 1.6 billion crowdsourced facts in Google’s knowledge graph, according to VentureBeat.
35
Kyle Wiggers, “Diffbot launches AI-powered knowledge graph of 1 trillion facts about people, places, and things,”
VentureBeat, 30 August 2018
February 5, 2019Data-Centric Conference
PwC
Versus more explicit, precise, contextualized meaning with a
triadic, Peircean knowledge graph and less than 1M concepts?
“There are many different approaches for distinguishing a logical basis for ontologies, but Peirce basically
says to base everything around 3s, explains [Mike Bergman of Cognonto]. That is,
1.the object itself;
2.what a particular agent perceives about the object;
3.and the way that agent needs to try to communicate what that is.
‘Without that triad it’s hard to ever get at differences of interpretation, context or meaning,’ he says, whether
that be between something like events and activities or individuals and classes.
Once you adopt that mindset, a lot of things that seemingly were irreconcilable differences begin to fall away,
and the categorization of information becomes really very easy and smooth....”
--Mike Bergman of Cognonto, quoted in Dataversity
36
Jennifer Zaino, “Cognonto Takes On Knowledge-Based Artificial Intelligence,” Dataversity, 23 November 2016
February 5, 2019Data-Centric Conference
PwC
Contextual AI via a large knowledge graph at Fairhair.ai
37
Media Intelligence Apps
Global Monitoring Analyze & Report
Distribute Influence & Engage
New Apps
Employee
App
Freemium
At-Powered
Reporting
Outside
Insight
Enterprise
Solutions
Custom Solutions
3rd party Apps
PaaS
100M
documents ingested
daily
150 NLP/IR
pipelines
100’s Billions of
Searches
Service Layer
Context Building
Enriching & Analysis
Outside Data
Streaming, Search, Analytics, APIs
Building block to leverage the platform
Knowledge Graph
Enable cognitive applications on top of our data by connecting the dots
Data Enrichment Platform
Enrich, analyze & build by interoperating with all major players
AI-driven data Acquisition
Bring high quality outside to our repository with minimal human effort
February 5, 2019Data-Centric Conference
PwC
Montefiore’s semantic data lake
38
HL7
feed
Web
services
EMR LIMS Legacy
OMICs CTMS
Claims
Annotation
engine
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
HDFS
Hadoop
AllegrographAllegrographAllegrographAllegrograph Allegrograph
SDL loader
ML-LIB/R SPARQL
Prolog
Spark
Java API
Various data sources, some
structured, some not, now all
part of a knowledge graph
with a simple patient care-
centric ontology
Hadoop cluster with high-
performance processors
and memory
Scalable graph database
supporting open W3C
semantic standards
Standard open source
querying,ML and analytics
frameworks,
API accessibility
Doctors can query the graph
or harness ML + analytics and
receive answers from the
system at the point of care via
their handhelds.
The system also acts as a giant
feedback-response or learning
loop which learns
from the data collected via
user/system interactions.
Montefiore Health, Franz, Intel and PwC research, 2017
February 5, 2019Data-Centric Conference
PwC
Siemens’ industrial knowledge graph
39
AI Algorithms
1 09:00 – Analyze
Turbine data hub
2 11:00 – Configure
Configure turbine
3 12:00 – Maintain
Master data Mgmt.
4 13:00 – Mitigate
Financial Risk Analysis
5 15:00 – Contact
Expert &
Communities
6 18:00 – Guide
Rules & Regulations
3
4
5
4
2
1
6
Industrial
Knowledge Graph
“Deep learning fails when it comes to context. Knowledge graphs can handle context
and enable us to address things that deep learning cannot address on its own.”
--Michael May, Head of Company Core Technology, Data Analysis and AI, Siemens
February 5, 2019Data-Centric Conference
PwC
Pharma knowledge graphs for patient safety
Challenges
40
Solutions
Drug safety
Heightened
focus on safety
Evolving
regulatory
demands
Increasing
public scrutiny
Focus on
analytics
Increased
sharing &
transparency
Doing more
with the same
or less
Graph integration Natural language
processing
Data cleaning
during analysis
In-memory
query engine
PwC and Cambridge Semantics, 2018
February 5, 2019Data-Centric Conference
PwC
NuMedii’s precision therapeutics knowledge graph
41
goTerm
Calcium ion
binding
2201
Protein
binding
Extracellular
region
ENSG
00000138829
Extracellular
matrix
disassembly
Extracellular
matrix
Organization
proteinaceous
extracellular
matrix
positive
regulation
of bone
mineralization
Fibrillin - 2
Extracellular
matrix
Structural
constituent
Extracellular
matrix
micro fibril
Camera-type
eye
development
CHEMBL_TC_
10038
Go Function
Reference_
Gosubset
prok
100001650
100001532
100001739
100000687
100002060
Ontotext and NuMedii, 2018
February 5, 2019
Data-Centric Conference
PwC
Thomson Reuters’ financial knowledge graph as a service
42
Thomson Reuters, 2018
February 5, 2019
Data-Centric Conference
PwC
“MicroStrategy 2019 introduces the industry’s first
Enterprise Semantic Graph.
• It elevates the potential of enterprise data assets, makes
true federated analytics possible, and delivers
personalized insights based on who you are, where you
are, and what you’re doing.
• It delivers powerful search capabilities on top of all
business information systems or data assets, making it
incredibly easy to find insights.
• It categorizes and federates each of your data
investments in real time, constantly enriching the
index with location intelligence and usage telemetry.
• It delivers the underlying strength to fuel AI experiences
for every role—with smart recommendations on
authoring actions for analysts who build dashboards, to
smart suggestions on content for business users who
are looking for new insights.”
--Vijay Anand,
Microstrategy blog, January 15, 2019
Data-centric, or product-centric?
43
February 5, 2019Data-Centric Conference
Conclusion
PwC
The problem: System-level complexity and disconnectedness
(product- and app-centric sprawl)
45
Hardware
DBMS
OS
Custom code
Hardware
Lots of OSes
1,000+
SQL/NoSQL DBs
Custom code
ERP+ suites
Hardware
A few more OSes
More
DBMSes
Custom code
ERP+ suites
Hardware
Lots more OSes
5,000+ databases
Componentized
suites
Custom code
Cloud layer
Hardware
More types of OSes
10,000+ DBs +
blockchains
Multicloud layer
Suites as services
Various SaaSes
Custom code
Hardware
A few
DBMSes
A few OSes
ERP+ suites
Custom code
Threat of more
application centric
sprawl
Early1990s Late 1990s 2000s 2010s1973-1990sPre 1970 2020s
February 5, 2019
Data-Centric Conference
PwC
The key opportunity: Large-scale integration and model-driven AI
Rule-based systems (includes KR)
“Handcrafted knowledge” is the term DARPA
uses; rule-based programming + procedure
replication in process automation, + some
knowledge representation (KR)
• Strong on logical reasoning in specific
concrete contexts
- Procedural + declarative programming +
set theory, etc.
- Deterministic
• Can’t learn or abstract
• Still exceptionally common and useful
Statistical machine learning
• Probabilistic
• From Bayesian algorithms to neural nets
(yes, deep learning also)
• Strong on perceiving and learning
(classifying, predicting)
• Weak on abstracting and reasoning
• Quite powerful in the aggregate but
individually (instance by instance)
unreliable
• Can require lots of data
Contextualized, model-driven approach
• Contextualized modeling approach—
allows efficiency, precision and certainty
• Combines power of deterministic,
probabilistic and description logic
• Allows explanations to be added to
decisions
• Accelerates the training process with the
help of specific, contextual human input
• Takes less data
Example: Consumer tax software
Perceiving
Learning
Abstracting
Reasoning
Perceiving
Learning
Abstracting
Reasoning
Perceiving
Learning
Abstracting
Reasoning
Example: Facial recognition
using deep learning/neural nets
Example: Explains first how handwritten
letters are formed so machines can decide—
less data needed, more transparency.
John Launchbury of DARPA (https://www.youtube.com/watch?v=N2L8AqkEDLs), Estes Park Group and PwC research, 2017
Previously dominant On the rise and rapidly improving Nascent, just beginning
1
Data-Centric Conference
February 5, 2019
46
PwC
The key means: The right level of relationship richness
47
Use tables, document trees and
graphs.
• Graphs articulate relationship-rich data
• Tables: Relationships are what’s missing from
most large-scale data, but table are too useful
andhuman-friendly to ignore at smaller scale
• Document trees (e.g., taxonomies) are a
stepping stone to graph models
• Graphs are the parents that bring the data
model family all together, Tinker Toy-style
• Bottom line: A machine readable,
extensible model of your organization
• Build and maintain your advanced
analytics/AI foundation with that graph model
Data-Centric Conference
February 5, 2019
PwC
How to get started: Data-centric strategy, planning, architecture and
execution
February 5, 2019Data-Centric Conference
48
• In the data-centric view, every IT category is subordinated to centrally managed, model-driven data via data
strategy, GRC and data-centric architecture (DCA)
• Relationship-rich modeling leads development for reasons of efficiency and effectiveness
• Standards based, open source enabled, build versus buy
• Empowers user communities, activism, large scale collaboration, shared infrastructure
Goals for data to
be obtained,
enriched and used
Data strategy
Data governance,
risk and
compliance
Data-centric
architecture (DCA)
Strategy and planning
Execution
Data-centric infrastructure
Data and logic lifecycle management
Model-driven development
Cross-enterprise intelligence
Relationship-rich modeling
Data-centric security
Process, pipeline and delivery automation
Human and machine learning loops
Data-centric design thinking
PwC
Questions or comments?
© 2019 PwC. All rights reserved. PwC refers to the US member firm or one of its subsidiaries or affiliates, and may sometimes refer
to the PwC network. Each member firm is a separate legal entity. Please see www.pwc.com/structure for further details.
pwc.com Alan Morrison
Sr. Research Fellow
PwC | Integrated Content | Emerging Tech
Mobile: +1(408) 205 5109
Email: alan.s.morrison@pwc.com
PricewaterhouseCoopers LLP
488 Almaden Blvd., Suite 1800
San Jose, CA 95110 USA
https://www.linkedin.com/in/alanmorrison
@AlanMorrison quora.com/profile/Alan-Morrison
Data-Centric Conference
February 5, 2019

Más contenido relacionado

La actualidad más candente

Modeling the Insurance Enterprise
Modeling the Insurance EnterpriseModeling the Insurance Enterprise
Modeling the Insurance Enterprise
Iver Band
 
Object Oriented Business Capability Map - IIBA 2022 - Draft.pptx
Object Oriented Business Capability Map - IIBA 2022 - Draft.pptxObject Oriented Business Capability Map - IIBA 2022 - Draft.pptx
Object Oriented Business Capability Map - IIBA 2022 - Draft.pptx
AustraliaChapterIIBA
 

La actualidad más candente (20)

Modeling the Insurance Enterprise
Modeling the Insurance EnterpriseModeling the Insurance Enterprise
Modeling the Insurance Enterprise
 
ArchiSurance Case Study
ArchiSurance Case StudyArchiSurance Case Study
ArchiSurance Case Study
 
iSeries Modernization: RPG/400 to Java Migration
iSeries Modernization: RPG/400 to Java MigrationiSeries Modernization: RPG/400 to Java Migration
iSeries Modernization: RPG/400 to Java Migration
 
SAP Strategy on Sustainability and AI
SAP Strategy on Sustainability and AISAP Strategy on Sustainability and AI
SAP Strategy on Sustainability and AI
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
 
Object Oriented Business Capability Map - IIBA 2022 - Draft.pptx
Object Oriented Business Capability Map - IIBA 2022 - Draft.pptxObject Oriented Business Capability Map - IIBA 2022 - Draft.pptx
Object Oriented Business Capability Map - IIBA 2022 - Draft.pptx
 
Digital Operating Model & IT4IT
Digital Operating Model & IT4ITDigital Operating Model & IT4IT
Digital Operating Model & IT4IT
 
Demystifying Open Banking
Demystifying Open BankingDemystifying Open Banking
Demystifying Open Banking
 
Axway amplify api management platform
Axway amplify api management platformAxway amplify api management platform
Axway amplify api management platform
 
Global Open Banking Landscape
Global Open Banking LandscapeGlobal Open Banking Landscape
Global Open Banking Landscape
 
BIAN Applied to Open Banking - Thoughts on Architecture and Implementation
BIAN Applied to Open Banking - Thoughts on Architecture and ImplementationBIAN Applied to Open Banking - Thoughts on Architecture and Implementation
BIAN Applied to Open Banking - Thoughts on Architecture and Implementation
 
Power BI Overview, Deployment and Governance
Power BI Overview, Deployment and GovernancePower BI Overview, Deployment and Governance
Power BI Overview, Deployment and Governance
 
Data-centric design and the knowledge graph
Data-centric design and the knowledge graphData-centric design and the knowledge graph
Data-centric design and the knowledge graph
 
The Evolution of Integration
The Evolution of IntegrationThe Evolution of Integration
The Evolution of Integration
 
Open Banking APIs on AWS
Open Banking APIs on AWSOpen Banking APIs on AWS
Open Banking APIs on AWS
 
McKinsey | When Things Get Complex: Complex Systems, Challenges and Where to ...
McKinsey | When Things Get Complex: Complex Systems, Challenges and Where to ...McKinsey | When Things Get Complex: Complex Systems, Challenges and Where to ...
McKinsey | When Things Get Complex: Complex Systems, Challenges and Where to ...
 
Oracle Business Intelligence Overview PPT
Oracle Business Intelligence Overview PPTOracle Business Intelligence Overview PPT
Oracle Business Intelligence Overview PPT
 
Benefits of integration with the Mulesoft Anypoint Platform
Benefits of integration with the Mulesoft Anypoint PlatformBenefits of integration with the Mulesoft Anypoint Platform
Benefits of integration with the Mulesoft Anypoint Platform
 
Modernizing Infrastructure Monitoring and Management with AIOps
Modernizing Infrastructure Monitoring and Management with AIOpsModernizing Infrastructure Monitoring and Management with AIOps
Modernizing Infrastructure Monitoring and Management with AIOps
 
The best digital transformation frameworks in 2020
The best digital transformation frameworks in 2020The best digital transformation frameworks in 2020
The best digital transformation frameworks in 2020
 

Similar a Data-centric market status, case studies and outlook

Similar a Data-centric market status, case studies and outlook (20)

Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
 
uae views on big data
  uae views on  big data  uae views on  big data
uae views on big data
 
DAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data ArchitectureDAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data Architecture
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
10 Keynotes in STRATA and HADOOP World Conference
10 Keynotes in STRATA and HADOOP World Conference10 Keynotes in STRATA and HADOOP World Conference
10 Keynotes in STRATA and HADOOP World Conference
 
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
 
DataEd Slides: Approaching Data Management Technologies
DataEd Slides:  Approaching Data Management TechnologiesDataEd Slides:  Approaching Data Management Technologies
DataEd Slides: Approaching Data Management Technologies
 
Data Governance in a big data era
Data Governance in a big data eraData Governance in a big data era
Data Governance in a big data era
 
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMO
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
 
Monetize Big Data
Monetize Big DataMonetize Big Data
Monetize Big Data
 
Trends in Data Modeling
Trends in Data ModelingTrends in Data Modeling
Trends in Data Modeling
 

Más de Alan Morrison

The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
Alan Morrison
 

Más de Alan Morrison (8)

DCAF 2023 1 and 2.pdf
DCAF 2023 1 and 2.pdfDCAF 2023 1 and 2.pdf
DCAF 2023 1 and 2.pdf
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdf
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
 
DCA Symposium 6 Feb 2023.pdf
DCA Symposium 6 Feb 2023.pdfDCA Symposium 6 Feb 2023.pdf
DCA Symposium 6 Feb 2023.pdf
 
Graph Foundations for Advanced Analytics and Collaboration
Graph Foundations for Advanced Analytics and CollaborationGraph Foundations for Advanced Analytics and Collaboration
Graph Foundations for Advanced Analytics and Collaboration
 
Dcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrisonDcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrison
 
Paths to more personal and collaborative knowledge graphs
Paths to more personal and collaborative knowledge graphsPaths to more personal and collaborative knowledge graphs
Paths to more personal and collaborative knowledge graphs
 
Blockchain demystified
Blockchain demystifiedBlockchain demystified
Blockchain demystified
 

Último

Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
amitlee9823
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
Renandantas16
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
lizamodels9
 
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service NoidaCall Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
dlhescort
 
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
amitlee9823
 
Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...
Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...
Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...
lizamodels9
 

Último (20)

Value Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsValue Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and pains
 
John Halpern sued for sexual assault.pdf
John Halpern sued for sexual assault.pdfJohn Halpern sued for sexual assault.pdf
John Halpern sued for sexual assault.pdf
 
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
 
Falcon's Invoice Discounting: Your Path to Prosperity
Falcon's Invoice Discounting: Your Path to ProsperityFalcon's Invoice Discounting: Your Path to Prosperity
Falcon's Invoice Discounting: Your Path to Prosperity
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 
Uneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration PresentationUneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration Presentation
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service NoidaCall Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
 
👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...
👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...
👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 
It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 May
 
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
Call Girls Electronic City Just Call 👗 7737669865 👗 Top Class Call Girl Servi...
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 
Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...
Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...
Call Girls From Pari Chowk Greater Noida ❤️8448577510 ⊹Best Escorts Service I...
 
Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
BAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
BAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRLBAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
BAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
 

Data-centric market status, case studies and outlook

  • 1. Data-centric market status, case studies and outlook Alan Morrison February 2019
  • 2. PwC Time Activity 2:00 – 2:10 Introduction 2:10 – 2:20 Larger market context: buzz, trends and dynamics 2:20 – 2:30 Current approaches to the market 2:30 – 2:40 Case studies 2:40 – 3:15 Conclusion and discussion Agenda February 5, 2019Data-Centric Conference 2
  • 4. PwC • Market oriented, passive, laissez faire • Architecture and vision free • “The market will fix it.” • Big vendor dominated • Legacy of Nick Carr’s IT Doesn’t Matter • Siloed, isolated efforts • Startups venture funded in silos, with waves of new silos being generated every year • Users themselves just passive and disempowered buyers or subscribers • Consumer services mostly self-service, with no one to call if a problem arises Product-centric IT: Got an IT problem? Buy a packaged solution. February 5, 2019Data-Centric Conference Products and services Data management Applications Computer graphics and web design Development IT administration IT certifications IT security Service providers Technologies and fields Categories from wand® precision classification and search, 2019
  • 5. PwC Data-centric IT: Own the problem first. Then build a solution. February 5, 2019Data-Centric Conference 5 • In the data-centric view, every IT category is subordinated to centrally managed, model-driven data via data strategy, GRC and data-centric architecture (DCA) • Relationship-rich modeling leads development for reasons of efficiency and effectiveness • Standards based, open source enabled, build versus buy • Empowers user communities, activism, large scale collaboration, shared infrastructure Goals for data to be obtained, enriched and used Data strategy Data governance, risk and compliance Data-centric architecture (DCA) Strategy and planning Execution Data-centric infrastructure Data and logic lifecycle management Model-driven development Cross-enterprise intelligence Relationship-rich modeling Data-centric security Process, pipeline and delivery automation Human and machine learning loops Data-centric design thinking
  • 6. PwC Think beyond end-to-end packaged software and SaaS solutions • “Cognitive computing” platforms • “Big data” platforms • Repackaged legacy MDM • Application-centric integration suites • Data preparation suites • Robotic or intelligent process automation • Any other “AI-enabled” X, Y or Z that simply generates lot of market noise and gets in the way The elephant in the room is always organizational change. However, wrong technology and data strategy approaches prevent change also. Those who succeed have effective strategies, means and execution all going for them at the same time. Consider how to get the business and IT together building and solving foundational data problems • Bespoke projects with worthy objectives and practical means of success • Human-in-the-loop computing—constructing data- description feedback loops for knowledge foundations • Software and SaaSes that do encourage better data modeling and relationship-rich integration – semantic graph databases – smart data hubs – NoSQL + SQL modeling—building bridges between tabular, document and graph – automated taxonomy and ontology generation, but as a starting point – Automated process mapping, but as a starting point – Clever modeling or visualization tools to encourage deeper, system-level understanding Ways to think about truly data-centric opportunities 6 February 5, 2019Data-Centric Conference
  • 7. PwC The real inhibitors to adoption aren’t technological – they’re rooted in tribal biases and resistance to change 7 Tribalism CollectivismIndividualism Anarchy TotalitarianismLocus of inertia Daniel Quinn, Beyond Civilization and Alice Linsley, “Daniel Quinn: A Return to Tribalism?”, college-ethics.blogspot.com, 2018 February 5, 2019 Data-Centric Conference
  • 8. PwC Tribalism – Machine learning edition 8 Source: Pedro Domingos, The Master Algorithm, 2015 More at “Machine learning evolution”: http://usblogs.pwc.com/emerging-technology/machine-learning-evolution-infographic/, PwC, 2017 Symbolists Bayesians Connectionists Evolutionaries Analogizers Use symbols, rules, and logic to represent knowledge and draw logical inference Assess the likelihood of occurrence for probabilistic inference Recognize and generalize patterns dynamically with matrices of probabilistic, weighted neurons Generate variations and then assess the fitness of each for a given purpose Optimize a function in light of constraints (“going as high as you can while staying on the road”) Favored algorithm Rules and decision trees Favored algorithm Naïve Bayes or Markov Favored algorithm Neural network Favored algorithm Genetic programs Favored algorithm Support vectors February 5, 2019 Data-Centric Conference
  • 9. PwC Tribalism – Data integration edition 9 Trend toward more data centricity this way Application- centric RESTful developers Relational database linkers Data-centric knowledge graphers Application- centric ESB advocates Semantic Web Company, 2018 Computerscience wiki.org, 2018 TIBCO, 2014 Oracle DBA’s Guide, 2018 User Scott Select FROM emp Local database PUBLIC SYNONYM Emp - > emp@HQ.ACME>COM Database link (undirected) Remote database EMP table Portals Net Application B2B Interactions Enterprise data Business process management Web Services Mobile Applications DEE Application ERP CRM SFA Legacy System ESBCustom environment Common environment API Unstructured Data Semi- structured Data Structured Data Schema mapping based on ontologies Entity Extractor informs all incoming data streams about its semantics and links them Unified Views RDF Graph Database PoolParty Graph Search February 5, 2019Data-Centric Conference
  • 10. PwC Crossing the chasm between the tribes 10 Reducing the amount of unfamiliarity developers confront--familiar document means to achieve comparable ends to graph: • Semantic suites that use the JSON format and familiar hierarchies: SWC’s PoolParty is an example • GraphQL: A popular document shape language that talks to APIs using SELECT-like statements and tree shapes; retrieves only the data you need, provides needed feedback to API owners, helps with API sprawl • Accessible web as database methods: JSON-LD and Schema.org, etc. vocabularies • Document “schemas” via data objects: JavaScript objects to developers = documents to NoSQL DB types; Object Data Modeling instead of database semantics • Mongoose or MongoDB JSON schema features + GraphQL: MongoDB object modeling and querying that can be used for subdocument filtering within a GraphQL context • HyperGraphQL: A GraphQL UI for Linked Data, restricted to certain tree-shaped queries • Universal Schema Language: Mike Bowers’ document/graph query and modeling language still in development • COMN: Concept and Object Modeling Notation, Ted Hills’ well-defined NoSQL + SQL data modeling notation February 5, 2019Data-Centric Conference
  • 11. PwC Knowledge graphs • Large-scale, heterogeneous integration for data discovery and asset tracking • Platforms for advanced analytics and machine learning • Knowledge bases for intelligent assistants Smart data hubs • Alternatives or adjuncts to data warehousing, or • Integration across both operational and analytics data Off-chain to on-chain data quality for blockchain networks • Ways to avoid garbage-in, garbage-out • Supply chain integration • Smart contracts for automated transactions and compliance • Personal data protection • Self-sovereign identity – Individuals have a border and sovereignty for personal data control in the same way countries exercise sovereignty within their borders – Peer-to-peer relationship status with other entities on the network Data cataloging and auditing • Portal-style data asset visibility • Data inventory and curation as a first step to privacy compliance • Information supply chain mapping • Back-end classification Examples of data-centric approaches in 2019 11 February 5, 2019Data-Centric Conference Sources; Kurt Cagle, “The Semantic Zoo,” Forbes, 18 January 2019 Bridget Botelho, “What are the main features of data catalog software?” TechTarget, 10 May 2018 Christopher Allen, “The Path to Self-Sovereign Identity,” Life with Alacrity blog, 25 April 2016 Phil Windley, “What is ‘self-sovereignty’?” Sovrin Twitter video, 24 January 2019 Intelligent assistants • Expanded user experience • Cross-domain capability Cybersecurity • Threat intelligence, but active measures too • Network analysis • Identity verification
  • 12. Market buzz, trends and dynamics to be aware of
  • 13. PwC Emerging technologies and the data value chain PwC and http://doi.ieeecomputersociety.org/cms/Computer.org/dl/mags/it/2013/01/figures/mit20130100571.gif IoT and drone data collection AI AR/VR Plan Create Refine Execute Optimize Blockchain (immutable ledger sharing + autonomous process coding) 3D printing output, IoT distribution, robotics and drone delivery Operational data generation and use Manage & Monitor February 5, 2019Data-Centric Conference 13
  • 14. PwC SaaSes and clouds generally are incredibly popular. What are the implications? 14 February 5, 2019Data-Centric Conference Trend toward owning and managing less and less of the stack
  • 15. PwC Enterprises used an average of 1,181 cloud services each by the end of 2017 15 February 5, 2019Data-Centric Conference Netskope’s 2017 Cloud Report • Enterprises used nearly 1,200 cloud services each in Q4 2017, according to Netskope • Most of these are SaaSes such as Salesforce, Workday, SAP Success Factors…. • Buy rather than build continues • Even with enthusiasm for AI, data and analytics skills continue to be scarce
  • 16. PwC Means of integration, and a database popularity ranking per DB-Engines 16 February 5, 2019Data-Centric Conference
  • 17. PwC Database popularity by type (including model type) 17 February 5, 2019Data-Centric Conference
  • 18. PwC 18 February 5, 2019Data-Centric Conference 0 20 40 60 80 100 120 1/28/2018 2/28/2018 3/31/2018 4/30/2018 5/31/2018 6/30/2018 7/31/2018 8/31/2018 9/30/2018 10/31/2018 11/30/2018 12/31/2018 Interestovertime 100=peakpopularity Last 12 months Popularity of the "O" word and "data lake" versus other data terms (per Google Trends) "data lake": (United States) ontology: (United States) "data catalog": (United States) "data-centric": (United States) "master data management": (United States)
  • 19. PwC 19 February 5, 2019Data-Centric Conference 0 20 40 60 80 100 120 1/28/2018 2/28/2018 3/31/2018 4/30/2018 5/31/2018 6/30/2018 7/31/2018 8/31/2018 9/30/2018 10/31/2018 11/30/2018 12/31/2018 Interestlevel 100=peakpopularity Past 12 months Popularity of data lake + storage terms data lake aws: (United States) data lake azure: (United States) data lake hadoop: (United States)
  • 20. PwC 20 February 5, 2019Data-Centric Conference 0 20 40 60 80 100 120 1/28/2018 2/28/2018 3/31/2018 4/30/2018 5/31/2018 6/30/2018 7/31/2018 8/31/2018 9/30/2018 10/31/2018 11/30/2018 12/31/2018 Interestlevel 100=peakpopularity Last 12 months Graph and related search term popularity graph database: (United States) GraphQL: (United States) NoSQL: (United States)
  • 22. PwC Data catalogs Open data and data catalog development sites • World Bank • Data.gov • Data.gov.uk • OpenAfrica • Data.world Data-centric audit and protection (DCAP) • Can be heavily vendor-driven (Protegrity and Informatica lead Gartner’s ranking, e.g.) • Bespoke methods would be more in line with data-centric build-vs.-buy principles Open data initiatives provide examples to follow, but what about data audit? February 5, 2019Data-Centric Conference 22
  • 23. PwC Gemini Data vs. Palantir “Placed in the hands of an analyst, Gemini [Data] allows them to start with an event, such as an anomalous router message or a suspicious email address, and then work out from there. The product’s GUI guides the analyst through possible connections to that particular piece of data, allowing the analyst to quickly and iteratively explore how different pieces of data might be connected.” --Alex Woodie, Datanami Cybersecurity: Will black box services give way to open graphs that ordinary analysts can use? February 5, 2019Data-Centric Conference 23 Cybersecurity at the DNC in 2016 “Take more vulnerable organizations that feel like they don’t have the resources. A good example from the book, the Democratic National Committee. OK? So before the election cycle gets going, they bring in Dick Clarke…he now runs a cybersecurity firm. They do a quick survey of the DNC’s computing system and they come back and they basically say you guys are hopeless. OK? Like, you’re down in kindergarten levels….he showed them how much it was going to cost. And they said, great, this is too much money, we’ll pay for it after the election. OK? And then the FBI calls and says, by the way, the Russians are inside your system. Well, I’m sorry. They called and they asked to be connected to somebody to who they could tell that to. And they got connected to the help desk.” --David Sanger, author of The Perfect Weapon, as quoted on the Council of Foreign Relations website
  • 24. PwC Highlighted features of Jules Data-Centric Design • Applications pull data from Jules and push the processed data back to Jules • Defined metadata model, lineage, ontologies, semantics • Data controls, governance, stewarded centrally • Data as a platform Source: “Journey to the Centre of Data,” Giridhar Vugrala, Managing Consultant – Capital Markets, Wipro, Sept. 13, 2018 Data-centric architecture: Designing an investment bank from the inside out at Wipro 24 February 5, 2019Data-Centric Conference
  • 25. PwC At the annual Internet Identity Workshop, members of the Decentralized Identity Foundation and the W3C Verifiable Claims Working Group cobbled together a standards-based self- sovereign identity (SSI) stack, which included JSON-LD, as well as two other options. SSI is currently operational in the Sovrin Network, a public, permissioned blockchain run by 60 different stewards on six different continents. That network is largely standards based, particularly via the W3C and Decentralized Identity Foundation. Sources: Oliver Terbu, “The Self-sovereign Identity Stack,” Medium post, and the Sovrin website, 2019 Personal data protection: A preliminary self-sovereign identity stack proposes JSON-LD as a messaging payload format 25 February 5, 2019Data-Centric Conference
  • 26. PwC Dynamic knowledge graphs: Tracking virus mutations with the help of graph databases 26 February 5, 2019Data-Centric Conference Kadir Bölükbasi, “One Graph to Find Them All,” G Data Security Blog, 8 January 2019
  • 27. PwC Alexa + Cortana: Siri: Intelligent assistants: Amazon (Alexa) and Microsoft (Cortana) share resources, while Apple (Siri) contemplates its next move 27 February 5, 2019Data-Centric Conference “Apple executive Bill Stasior, who has led the Siri team since joining the company in 2012, has been removed as head of the project in a sweeping strategy shift favoring long-term research over incremental updates, according to a report on Friday.” Source: Mikey Campbell, “Apple removes Siri team lead as part of AI strategy shift,”AppleInsider, Friday, February 01, 2019, 02:47 pm PT (05:47 pm ET) “With the new Alexa + Cortana world, one could reach across the limitations of each platform domains and access the power of each platform. This has a synergistic effect where one could at a future date construct meta Skills/Apps that use features from both platforms simultaneously…. I see Voice First platforms as a uniform way to access AI-assisted ontologies, taxonomies, and domains. Apps and Skills can be seen as a domain but also an extended taxonomy.” Source: Brian Roemmele, “Why Are Microsoft And Amazon Joining Forces With Cortana And Alexa?” Quora contributor on Forbes, Sep. 25, 2017
  • 28. PwC When will voice recognition accuracy reach 98 percent? What happens when it does? Should Apple re-hire Tom Gruber? 28 February 5, 2019Data-Centric Conference
  • 29. PwC Data-centric simplified: Q6FSA’s Universal Information Classification business Vocabulary Who OrganizationPerson Agent Concept Physical Asset Digital Asset What Asset Where Coordinates PositionPlace Location How Activity Event Process Task Action Why Discipline UseRole When Cycle Point Span Status TimeFunction The goal – simplify the classification of information Copyright © 2019 Q6FSA Used with permission. February 5, 2019Data-Centric Conference 29
  • 31. PwC With a knowledge graph base, companies can skate to new business models = Deep transformation • Once relationship data-enabled, organizations play different roles than they've been accustomed to in the digitized ecosystem. • Some because of their data collection heritage can become data providers. • Others take up roles in the data supply chain, or position themselves as industry platforms or marketplaces. • Why are top companies able to cross industry boundaries? • Why can unicorns extend the reach of their business models? 31 February 5, 2019Data-Centric Conference
  • 32. PwC Largest changes in market cap by global company Cross industry, 2018 32 Known knowledge graph builders Known KG builders Operator of Taobao and AliBot KG builder (1)Change in market cap from IPO date (2)Market cap at IPO date Source: Bloomberg and PwC analysis Company name Location Industry Change in market cap 2009-2018 ($bn) Market cap 2018 ($bn) 1 Apple United States Technology 757 851 2 Amazon.Com United States Consumer Services 670 701 3 Alphabet United States Technology 609 719 4 Microsoft Corp United States Technology 540 703 5 Tencent Holdings China Technology 483 496 6 Facebook United States Technology 383(1) 464 7 Berkshire Hathaway United States Financial 358 492 8 Alibaba China Consumer Services 302(1) 470 9 JPMorgan Chase United States Financials 275 375 10 Bank of America United States Financials 263 307 v • IBM and Citi are also working on cross-enterprise knowledge graphs • Many have cross-enterprise knowledge graph ambitions, but most are focused on a single use case • S&P does cross-enterprise data management using relational tech February 5, 2019 Data-Centric Conference
  • 33. PwC Graphs (including hybrids) complete the picture of your transformed data lifecycle and how it’s managed 33 February 5, 2019Data-Centric Conference
  • 34. PwC Transformation scalability – The AirBnB knowledge graph example “In order to surface relevant context to people, we need to have some way of representing relationships between distinct but related entities (think cities, activities, cuisines, etc.) on Airbnb to easily access important and relevant information about them…. These types of information will become increasingly important as we move towards becoming an end-to-end travel platform as opposed to just a place for staying in homes. The knowledge graph is our solution to this need, giving us the technical scalability we need to power all of Airbnb’s verticals and the flexibility to define abstract relationships.” --Spencer Chang, AirBnB Engineering 34 Events Neighborhoods Tags Restaurants Users Homes Experiences Places Airbnb Engineering, 2018 Markets February 5, 2019Data-Centric Conference
  • 35. PwC Most automated knowledge graph – Diffbot? “Diffbot’s crawler regularly refreshes the DKG with new information and its machine learning algorithms are smart enough to pass over sites with histories of producing ‘logically inconsistent’ facts. “‘That’s one of the reasons why we fuse information together from different sources,’ Tung said. ‘Our scale is such that there’s minimal potential for errors. We’d bet the business on it.’ “Diffbot launched in 2008 and counts 28 employees among its core staff of engineers and data scientists.” --Mike Tung of Diffbot, quoted in VentureBeat Diffbot claims an automated knowledge graph of 1 trillion + facts, designed to grow without humans in the loop. That compares with 1.6 billion crowdsourced facts in Google’s knowledge graph, according to VentureBeat. 35 Kyle Wiggers, “Diffbot launches AI-powered knowledge graph of 1 trillion facts about people, places, and things,” VentureBeat, 30 August 2018 February 5, 2019Data-Centric Conference
  • 36. PwC Versus more explicit, precise, contextualized meaning with a triadic, Peircean knowledge graph and less than 1M concepts? “There are many different approaches for distinguishing a logical basis for ontologies, but Peirce basically says to base everything around 3s, explains [Mike Bergman of Cognonto]. That is, 1.the object itself; 2.what a particular agent perceives about the object; 3.and the way that agent needs to try to communicate what that is. ‘Without that triad it’s hard to ever get at differences of interpretation, context or meaning,’ he says, whether that be between something like events and activities or individuals and classes. Once you adopt that mindset, a lot of things that seemingly were irreconcilable differences begin to fall away, and the categorization of information becomes really very easy and smooth....” --Mike Bergman of Cognonto, quoted in Dataversity 36 Jennifer Zaino, “Cognonto Takes On Knowledge-Based Artificial Intelligence,” Dataversity, 23 November 2016 February 5, 2019Data-Centric Conference
  • 37. PwC Contextual AI via a large knowledge graph at Fairhair.ai 37 Media Intelligence Apps Global Monitoring Analyze & Report Distribute Influence & Engage New Apps Employee App Freemium At-Powered Reporting Outside Insight Enterprise Solutions Custom Solutions 3rd party Apps PaaS 100M documents ingested daily 150 NLP/IR pipelines 100’s Billions of Searches Service Layer Context Building Enriching & Analysis Outside Data Streaming, Search, Analytics, APIs Building block to leverage the platform Knowledge Graph Enable cognitive applications on top of our data by connecting the dots Data Enrichment Platform Enrich, analyze & build by interoperating with all major players AI-driven data Acquisition Bring high quality outside to our repository with minimal human effort February 5, 2019Data-Centric Conference
  • 38. PwC Montefiore’s semantic data lake 38 HL7 feed Web services EMR LIMS Legacy OMICs CTMS Claims Annotation engine HDFS Hadoop HDFS Hadoop HDFS Hadoop HDFS Hadoop HDFS Hadoop HDFS Hadoop HDFS Hadoop HDFS Hadoop HDFS Hadoop HDFS Hadoop AllegrographAllegrographAllegrographAllegrograph Allegrograph SDL loader ML-LIB/R SPARQL Prolog Spark Java API Various data sources, some structured, some not, now all part of a knowledge graph with a simple patient care- centric ontology Hadoop cluster with high- performance processors and memory Scalable graph database supporting open W3C semantic standards Standard open source querying,ML and analytics frameworks, API accessibility Doctors can query the graph or harness ML + analytics and receive answers from the system at the point of care via their handhelds. The system also acts as a giant feedback-response or learning loop which learns from the data collected via user/system interactions. Montefiore Health, Franz, Intel and PwC research, 2017 February 5, 2019Data-Centric Conference
  • 39. PwC Siemens’ industrial knowledge graph 39 AI Algorithms 1 09:00 – Analyze Turbine data hub 2 11:00 – Configure Configure turbine 3 12:00 – Maintain Master data Mgmt. 4 13:00 – Mitigate Financial Risk Analysis 5 15:00 – Contact Expert & Communities 6 18:00 – Guide Rules & Regulations 3 4 5 4 2 1 6 Industrial Knowledge Graph “Deep learning fails when it comes to context. Knowledge graphs can handle context and enable us to address things that deep learning cannot address on its own.” --Michael May, Head of Company Core Technology, Data Analysis and AI, Siemens February 5, 2019Data-Centric Conference
  • 40. PwC Pharma knowledge graphs for patient safety Challenges 40 Solutions Drug safety Heightened focus on safety Evolving regulatory demands Increasing public scrutiny Focus on analytics Increased sharing & transparency Doing more with the same or less Graph integration Natural language processing Data cleaning during analysis In-memory query engine PwC and Cambridge Semantics, 2018 February 5, 2019Data-Centric Conference
  • 41. PwC NuMedii’s precision therapeutics knowledge graph 41 goTerm Calcium ion binding 2201 Protein binding Extracellular region ENSG 00000138829 Extracellular matrix disassembly Extracellular matrix Organization proteinaceous extracellular matrix positive regulation of bone mineralization Fibrillin - 2 Extracellular matrix Structural constituent Extracellular matrix micro fibril Camera-type eye development CHEMBL_TC_ 10038 Go Function Reference_ Gosubset prok 100001650 100001532 100001739 100000687 100002060 Ontotext and NuMedii, 2018 February 5, 2019 Data-Centric Conference
  • 42. PwC Thomson Reuters’ financial knowledge graph as a service 42 Thomson Reuters, 2018 February 5, 2019 Data-Centric Conference
  • 43. PwC “MicroStrategy 2019 introduces the industry’s first Enterprise Semantic Graph. • It elevates the potential of enterprise data assets, makes true federated analytics possible, and delivers personalized insights based on who you are, where you are, and what you’re doing. • It delivers powerful search capabilities on top of all business information systems or data assets, making it incredibly easy to find insights. • It categorizes and federates each of your data investments in real time, constantly enriching the index with location intelligence and usage telemetry. • It delivers the underlying strength to fuel AI experiences for every role—with smart recommendations on authoring actions for analysts who build dashboards, to smart suggestions on content for business users who are looking for new insights.” --Vijay Anand, Microstrategy blog, January 15, 2019 Data-centric, or product-centric? 43 February 5, 2019Data-Centric Conference
  • 45. PwC The problem: System-level complexity and disconnectedness (product- and app-centric sprawl) 45 Hardware DBMS OS Custom code Hardware Lots of OSes 1,000+ SQL/NoSQL DBs Custom code ERP+ suites Hardware A few more OSes More DBMSes Custom code ERP+ suites Hardware Lots more OSes 5,000+ databases Componentized suites Custom code Cloud layer Hardware More types of OSes 10,000+ DBs + blockchains Multicloud layer Suites as services Various SaaSes Custom code Hardware A few DBMSes A few OSes ERP+ suites Custom code Threat of more application centric sprawl Early1990s Late 1990s 2000s 2010s1973-1990sPre 1970 2020s February 5, 2019 Data-Centric Conference
  • 46. PwC The key opportunity: Large-scale integration and model-driven AI Rule-based systems (includes KR) “Handcrafted knowledge” is the term DARPA uses; rule-based programming + procedure replication in process automation, + some knowledge representation (KR) • Strong on logical reasoning in specific concrete contexts - Procedural + declarative programming + set theory, etc. - Deterministic • Can’t learn or abstract • Still exceptionally common and useful Statistical machine learning • Probabilistic • From Bayesian algorithms to neural nets (yes, deep learning also) • Strong on perceiving and learning (classifying, predicting) • Weak on abstracting and reasoning • Quite powerful in the aggregate but individually (instance by instance) unreliable • Can require lots of data Contextualized, model-driven approach • Contextualized modeling approach— allows efficiency, precision and certainty • Combines power of deterministic, probabilistic and description logic • Allows explanations to be added to decisions • Accelerates the training process with the help of specific, contextual human input • Takes less data Example: Consumer tax software Perceiving Learning Abstracting Reasoning Perceiving Learning Abstracting Reasoning Perceiving Learning Abstracting Reasoning Example: Facial recognition using deep learning/neural nets Example: Explains first how handwritten letters are formed so machines can decide— less data needed, more transparency. John Launchbury of DARPA (https://www.youtube.com/watch?v=N2L8AqkEDLs), Estes Park Group and PwC research, 2017 Previously dominant On the rise and rapidly improving Nascent, just beginning 1 Data-Centric Conference February 5, 2019 46
  • 47. PwC The key means: The right level of relationship richness 47 Use tables, document trees and graphs. • Graphs articulate relationship-rich data • Tables: Relationships are what’s missing from most large-scale data, but table are too useful andhuman-friendly to ignore at smaller scale • Document trees (e.g., taxonomies) are a stepping stone to graph models • Graphs are the parents that bring the data model family all together, Tinker Toy-style • Bottom line: A machine readable, extensible model of your organization • Build and maintain your advanced analytics/AI foundation with that graph model Data-Centric Conference February 5, 2019
  • 48. PwC How to get started: Data-centric strategy, planning, architecture and execution February 5, 2019Data-Centric Conference 48 • In the data-centric view, every IT category is subordinated to centrally managed, model-driven data via data strategy, GRC and data-centric architecture (DCA) • Relationship-rich modeling leads development for reasons of efficiency and effectiveness • Standards based, open source enabled, build versus buy • Empowers user communities, activism, large scale collaboration, shared infrastructure Goals for data to be obtained, enriched and used Data strategy Data governance, risk and compliance Data-centric architecture (DCA) Strategy and planning Execution Data-centric infrastructure Data and logic lifecycle management Model-driven development Cross-enterprise intelligence Relationship-rich modeling Data-centric security Process, pipeline and delivery automation Human and machine learning loops Data-centric design thinking
  • 49. PwC Questions or comments? © 2019 PwC. All rights reserved. PwC refers to the US member firm or one of its subsidiaries or affiliates, and may sometimes refer to the PwC network. Each member firm is a separate legal entity. Please see www.pwc.com/structure for further details. pwc.com Alan Morrison Sr. Research Fellow PwC | Integrated Content | Emerging Tech Mobile: +1(408) 205 5109 Email: alan.s.morrison@pwc.com PricewaterhouseCoopers LLP 488 Almaden Blvd., Suite 1800 San Jose, CA 95110 USA https://www.linkedin.com/in/alanmorrison @AlanMorrison quora.com/profile/Alan-Morrison Data-Centric Conference February 5, 2019

Notas del editor

  1. Our definition of AI has shifted over the years. In earlier decades, rule-based systems were the norm, and knowledge representation was among thm.