SlideShare una empresa de Scribd logo
1 de 28
Descargar para leer sin conexión
Grab some coffee and enjoy
the pre-show banter before
the top of the hour!
The Dirty Work – Why Data Must Be Reconciled

The Briefing Room
Welcome

Host & Analyst:
Eric Kavanagh

Guest:
Geoffrey Malafsky
Twitter Tag: #briefr

The Briefing Room
Mission

!   Reveal the essential characteristics of enterprise software,
good and bad
!   Provide a forum for detailed analysis of today s innovative
technologies
!   Give vendors a chance to explain their product to savvy
analysts
!   Allow audience members to pose serious questions... and get
answers!

Twitter Tag: #briefr

The Briefing Room
Data Reconciliation
Garbage In

GIGO

Garbage Out

GARBAGE
DATA

PERFECT
MODEL

GARBAGE
RESULTS

PERFECT
DATA

GARBAGE
MODEL

GARBAGE
RESULTS

Twitter Tag: #briefr

The Briefing Room
§ Current	
  data	
  is	
  disjointed	
  and	
  of	
  low	
  quality	
  
§ Variable	
  use	
  and	
  meaning	
  among	
  systems	
  even	
  for	
  “same”	
  
data	
  elements	
  
§ Undocumented	
  defini=ons	
  and	
  data	
  mgmt	
  processes	
  
§ Errors	
  in	
  data	
  systems	
  
§ Disagreement	
  among	
  data	
  systems	
  
§ Lack	
  of	
  exis=ng	
  descrip=ons	
  for	
  key	
  readiness	
  use	
  cases	
  

§ Legacy	
  data	
  systems	
  have	
  failed	
  to	
  overcome	
  these	
  
problems	
  despite	
  several	
  years	
  of	
  new	
  	
  marts/houses/
brokers/IPTs/applica=ons	
  
“Many CIOs believe data is inexpensive because storage has become
inexpensive. But data is inherently messy – it can be wrong, it can be
duplicative, and it can be irrelevant – which means it requires handling,
which is where the real expenses come in. ‘The cost of more data is the
application and the computing power and the processes to reconcile all
these things’,”
"While there are a myriad of analytical tools that can be leveraged, a
recent study indicated that more than 70% of CMOs feel they are
underprepared to manage the explosion of data and ‘lack true insight.’ “

1.  Wall Street Journal, CIO‘s Big Problem with Big Data, 2012-08-02
2.  Forbes, The CEO/CMO Dilemma: So Much Data, So Little Impact, 2012-07-18
8	
  
§  Suffix	
  in	
  source	
  A,	
  prefix	
  in	
  B,	
  neither	
  in	
  C	
  for	
  same	
  (part	
  number,	
  =tle,	
  …)?	
  
§  Conflict	
  syntac=cally	
  (simplest	
  case)	
  and	
  seman=cally	
  (most	
  difficult)	
  
§  Other	
  tools	
  &	
  methods	
  never	
  solve	
  this	
  because	
  they	
  deal	
  with	
  the	
  
obstacles	
  independently	
  or	
  not	
  at	
  all:	
  	
  Data	
  values	
  out-­‐of-­‐sync	
  with	
  
metadata,	
  data	
  models	
  
Different	
  Meanings	
  (Legal	
  and	
  Business	
  Ac=vi=es)	
  	
  

NKY

HomeSeekers

1. 
2. 
3. 
4. 
5. 

Texas

Create	
  table	
  –	
  =tle	
  aligned	
  to	
  business	
  =	
  Garage	
  
Create	
  vocabulary:	
  spaces.descrip=on,	
  spaces.na=onal,	
  spaces.state,	
  .	
  
Define	
  ETL	
  logic	
  
Merge	
  in	
  warehouse	
  and	
  process	
  in	
  virtualiza=on	
  layer	
  
Change	
  as	
  needed	
  

9	
  
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  
§  Data	
  Ra=onaliza=on	
  is	
  the	
  process	
  of	
  building	
  and	
  managing	
  a	
  con=nuously	
  
adap=ve	
  data	
  environment	
  that	
  fuels	
  current	
  and	
  future	
  business	
  needs	
  for	
  
decision	
  making	
  and	
  system	
  opera=ons	
  
§  It	
  ensures	
  data	
  (i.e.	
  not	
  just	
  metadata)	
  is	
  as	
  accurate,	
  meaningful,	
  and	
  
useful	
  as	
  possible	
  while	
  con=nuously	
  adjus=ng	
  to	
  improve	
  and	
  add	
  
capability	
  
§  It	
  provides	
  collabora=ve	
  management	
  of	
  data	
  assets,	
  the	
  designs	
  governing	
  
who,	
  why,	
  and	
  how	
  of	
  data	
  ,	
  and	
  the	
  where,	
  when,	
  how	
  of	
  data	
  use	
  in	
  
opera=onal	
  systems	
  
§  It	
  solves	
  the	
  great	
  challenge	
  of	
  mapping	
  all	
  source	
  values	
  to	
  each	
  target	
  
along	
  the	
  en=re	
  complex	
  paths	
  of	
  enterprise	
  data	
  use	
  
§  Consolidated	
  values	
  when	
  possible	
  with	
  con=nuous	
  improvement	
  
§  Simplified	
  and	
  adap=ve	
  mapping	
  with	
  Corporate	
  NoSQL	
  
10	
  
Design	
  Ra-onaliza-on	
  Issues	
  
•  Mul=ple	
  data	
  models	
  
•  Conflic=ng	
  defini=ons	
  
•  Similar,	
  supposedly	
  similar,	
  opera=onally	
  
dis=nct	
  values	
  
•  Unknown	
  business	
  logic	
  
•  Mul=ple	
  ETL	
  mappings	
  

Design	
  Ra-onaliza-on	
  
• 
• 
• 
• 
• 

Consolidated,	
  adap=ve	
  data	
  models	
  
Standardized	
  defini=ons	
  
Synchronized	
  dis=nct	
  opera=onal	
  values	
  
Managed	
  business	
  logic	
  
Coordinated	
  ETL	
  mappings	
  

System	
  Ra-onaliza-on	
  Issues	
  
• 
• 
• 
• 
• 

Mul=ple	
  database	
  systems	
  
Conflic=ng	
  formats	
  
Redundant	
  storage	
  
Unsynchronized	
  values	
  
Mul=ple	
  integra=on	
  points	
  

System	
  Ra-onaliza-on	
  
• 
• 
• 
• 
• 

Consolidated,	
  adap=ve	
  systems	
  
Common,	
  interoperable	
  formats	
  
Common	
  storage	
  
Synchronized	
  interfaces	
  
Coordinated	
  integra=on	
  
11	
  
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  
Ra=onalized	
  Data=Meaningful	
  Analysis,	
  Decision	
  Support,
	
  
Enterprise	
  Applica=ons
	
  

12	
  
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  
§ Example	
  from	
  DARPA	
  Evidence	
  
Extrac=on	
  &	
  Link	
  Discovery	
  
§ Today’s	
  Situa=on:	
  	
  ~10k	
  
messages/day	
  from	
  mul=ple	
  
sources	
  read	
  by	
  mul=ple	
  
analysts	
  and	
  analyzed	
  in	
  
mul=ple	
  manual	
  non-­‐integrated	
  
tools	
  
§ Similar	
  to	
  Social	
  Network	
  
Analysis	
  

13	
  
Complicated	
  Mixture	
  of	
  Commercial,	
  Custom,	
  Legacy,	
  Services	
  Applica=ons,	
  Data	
  Stores	
  

14	
  
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  
15	
  
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  
Costs	
  
Business	
  Alignment:	
  Goal,	
  Capability,	
  Architecture	
  
Data	
  Assets:	
  Systems,	
  Owners,	
  Use	
  

16
17	
  
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  
The Ψ–KORS™ System Model

Point-select data models, codes, entities	
  

18
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  
Corporate NoSQL™

19
§ DOD	
  CIO	
  
§ Adap=vely	
  blend	
  financial	
  and	
  program	
  data	
  from	
  
mul=ple	
  sources	
  with	
  unclear,	
  undocumented	
  
alignment	
  and	
  integra=on	
  logic	
  (i.e.	
  this	
  is	
  an	
  
intelligence	
  challenge)	
  into	
  BI	
  tools	
  (QlikView,	
  Tableau,	
  
PentaHo,	
  Excel	
  Web	
  Apps-­‐Sharepoint)	
  

§ Export	
  Development	
  Canada	
  
§ Ra=onalize	
  core	
  data	
  distributed	
  and	
  undocumented	
  to	
  
feed	
  cross-­‐enterprise	
  governance	
  and	
  develop	
  
Enterprise	
  Data	
  Model	
  with	
  seman=cally	
  adjudicated	
  
canonical	
  en==es	
  
Copyright	
  Phasic	
  Systems	
  Inc	
  2013	
  

20	
  
§ Challenge:	
  Complicated	
  environment	
  with	
  conflic=ng	
  data	
  
values,	
  standards,	
  business	
  uses	
  cases,	
  and	
  lack	
  of	
  
documenta=on.	
  Data	
  owned	
  by	
  4	
  major	
  organiza=on,	
  in	
  mul=ple	
  
Warehouses	
  and	
  data	
  stores,	
  redundant	
  non-­‐reconciled	
  sets	
  of	
  
data	
  
§ Requirement:	
  Integrated,	
  common,	
  accurate	
  data	
  to	
  enable	
  new	
  
Integrated	
  workforce	
  planning,	
  training,	
  management	
  
applica=on	
  (“Sailor	
  of	
  the	
  Future”)	
  for	
  1	
  million	
  people	
  
§ Prior	
  Ac-vi-es:	
  10+	
  years	
  of	
  system	
  integra=on,	
  data	
  
warehouse,	
  data	
  governance	
  efforts	
  à	
  no	
  improvement,	
  poor	
  
coordina=on	
  across	
  organiza=ons	
  and	
  systems	
  
21	
  
§ Yet,	
  there	
  were	
  problems	
  with	
  the	
  most	
  
basic	
  data	
  fields,	
  which	
  for	
  the	
  Navy,	
  include	
  
things	
  like	
  	
  
§ billet	
  (effec=vely	
  a	
  job	
  but	
  also	
  includes	
  other	
  
characteris=cs),	
  	
  
§ rank	
  (similar	
  to	
  seniority	
  but	
  with	
  formal	
  rules	
  that	
  change	
  
over	
  =me),	
  	
  
§ ra=ng	
  (similar	
  to	
  voca=onal	
  ability	
  but	
  also	
  with	
  changing	
  
rules),	
  	
  
§ and	
  even	
  the	
  primary	
  iden=fier	
  of	
  a	
  person	
  the	
  Social	
  
Security	
  Number	
  (SSN).	
  	
  

22	
  
§ Bridge	
  Organiza=ons,	
  Processes,	
  Technologies	
  to	
  Data	
  
Concepts	
  

23	
  
Logical	
  Models	
  derive	
  directly	
  from	
  conceptual	
  and	
  use	
  business	
  terms	
  

24	
  
•  Promulgate	
  key	
  technologies	
  to	
  help	
  field	
  overcome	
  major	
  
obstacles	
  
•  Iden=fy	
  cause	
  and	
  existence	
  of	
  seman=c	
  conflicts	
  
•  Determine	
  op=ons	
  
•  Promote	
  enterprise	
  decision	
  making	
  on	
  solu=on	
  
•  Implement	
  solu=on	
  into	
  opera=onal	
  data	
  
•  Visible	
  direct	
  line	
  from	
  governance	
  to	
  data	
  modeling	
  to	
  
integra=on	
  to	
  database	
  engineering	
  to	
  analysis	
  and	
  back	
  
again	
  
•  Rapid	
  cycle	
  =me:	
  iden=fy,	
  assess,	
  decide,	
  execute	
  
con=nuously	
  in	
  natural	
  organiza=onal	
  =meline	
  (days/weeks)	
  
•  Community	
  version	
  DataStar	
  for	
  non-­‐commercial	
  use	
  
•  Collabora=ve	
  community	
  communica=on	
  and	
  design	
  of	
  common,	
  
seman=cally	
  clear	
  Corporate	
  NoSQL	
  models	
  
Twitter Tag: #briefr

The Briefing Room
Upcoming Topics

November: DATA DISCOVERY & VISUALIZATION
December: INNOVATORS
2014 Editorial Calendar at

www.insideanalysis.com/webcasts/the-briefing-room

www.insideanalysis.com

Twitter Tag: #briefr

The Briefing Room
Thank You
for Your
Attention

Twitter Tag: #briefr

The Briefing Room

Más contenido relacionado

La actualidad más candente

Bringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationBringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationDATAVERSITY
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Erik Fransen
 
Total Data Industry Report
Total Data Industry ReportTotal Data Industry Report
Total Data Industry ReportRan Zhang
 
Best Practices: Data Admin & Data Management
Best Practices: Data Admin & Data ManagementBest Practices: Data Admin & Data Management
Best Practices: Data Admin & Data ManagementEmpowered Holdings, LLC
 
Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-AshishGuleria
 
The technology of the business data lake
The technology of the business data lakeThe technology of the business data lake
The technology of the business data lakeCapgemini
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousinguncleRhyme
 
Smarter Management for Your Data Growth
Smarter Management for Your Data GrowthSmarter Management for Your Data Growth
Smarter Management for Your Data GrowthRainStor
 
Architecting a Data Warehouse: A Case Study
Architecting a Data Warehouse: A Case StudyArchitecting a Data Warehouse: A Case Study
Architecting a Data Warehouse: A Case StudyMark Ginnebaugh
 
Diane England Resume
Diane England ResumeDiane England Resume
Diane England Resumedmengland
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEdureka!
 
Data Warehouse Methodology
Data Warehouse MethodologyData Warehouse Methodology
Data Warehouse MethodologySQL Power
 
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...Edureka!
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bijeffd00
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperImpetus Technologies
 
How Real TIme Data Changes the Data Warehouse
How Real TIme Data Changes the Data WarehouseHow Real TIme Data Changes the Data Warehouse
How Real TIme Data Changes the Data Warehousemark madsen
 
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...Rhapsody Technologies, Inc.
 

La actualidad más candente (20)

Bringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationBringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and Integration
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
 
Total Data Industry Report
Total Data Industry ReportTotal Data Industry Report
Total Data Industry Report
 
Best Practices: Data Admin & Data Management
Best Practices: Data Admin & Data ManagementBest Practices: Data Admin & Data Management
Best Practices: Data Admin & Data Management
 
Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-
 
The technology of the business data lake
The technology of the business data lakeThe technology of the business data lake
The technology of the business data lake
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
 
Smarter Management for Your Data Growth
Smarter Management for Your Data GrowthSmarter Management for Your Data Growth
Smarter Management for Your Data Growth
 
Bi&dw methodology
Bi&dw methodologyBi&dw methodology
Bi&dw methodology
 
Architecting a Data Warehouse: A Case Study
Architecting a Data Warehouse: A Case StudyArchitecting a Data Warehouse: A Case Study
Architecting a Data Warehouse: A Case Study
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Diane England Resume
Diane England ResumeDiane England Resume
Diane England Resume
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Data Warehouse Methodology
Data Warehouse MethodologyData Warehouse Methodology
Data Warehouse Methodology
 
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bi
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White Paper
 
How Real TIme Data Changes the Data Warehouse
How Real TIme Data Changes the Data WarehouseHow Real TIme Data Changes the Data Warehouse
How Real TIme Data Changes the Data Warehouse
 
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 

Similar a The Dirty Work -- Why Data Must Be Reconciled

Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Denodo
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceSense Corp
 
Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5Ufuk Cebeci
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricNathan Bijnens
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Denodo
 
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceAll Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceInside Analysis
 
Agile Data Rationalization for Operational Intelligence
Agile Data Rationalization for Operational IntelligenceAgile Data Rationalization for Operational Intelligence
Agile Data Rationalization for Operational IntelligenceInside Analysis
 
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...Precisely
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
Digital intelligence satish bhatia
Digital intelligence satish bhatiaDigital intelligence satish bhatia
Digital intelligence satish bhatiaSatish Bhatia
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationDenodo
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...Denodo
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseCaserta
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsJane Roberts
 
How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management Abhishek Sood
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Salim Khan.Resume_3.8
Salim Khan.Resume_3.8Salim Khan.Resume_3.8
Salim Khan.Resume_3.8Salim Khan
 

Similar a The Dirty Work -- Why Data Must Be Reconciled (20)

Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
 
AtomicDBCoreTech_White Papaer
AtomicDBCoreTech_White PapaerAtomicDBCoreTech_White Papaer
AtomicDBCoreTech_White Papaer
 
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceAll Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data Governance
 
Agile Data Rationalization for Operational Intelligence
Agile Data Rationalization for Operational IntelligenceAgile Data Rationalization for Operational Intelligence
Agile Data Rationalization for Operational Intelligence
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...
 
Big Data Boom
Big Data BoomBig Data Boom
Big Data Boom
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Digital intelligence satish bhatia
Digital intelligence satish bhatiaDigital intelligence satish bhatia
Digital intelligence satish bhatia
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the Enterprise
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
 
How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Salim Khan.Resume_3.8
Salim Khan.Resume_3.8Salim Khan.Resume_3.8
Salim Khan.Resume_3.8
 

Más de Inside Analysis

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIInside Analysis
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationInside Analysis
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownInside Analysis
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security Inside Analysis
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeInside Analysis
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataInside Analysis
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionInside Analysis
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsInside Analysis
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingInside Analysis
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLInside Analysis
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelInside Analysis
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureInside Analysis
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskInside Analysis
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataInside Analysis
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseInside Analysis
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldInside Analysis
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave DuggalInside Analysis
 

Más de Inside Analysis (20)

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On Time
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop Adoption
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of Everything
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global Level
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your Architecture
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the Risk
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big Data
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data Warehouse
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave Duggal
 
Modus Operandi
Modus OperandiModus Operandi
Modus Operandi
 

Último

Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 

Último (20)

Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 

The Dirty Work -- Why Data Must Be Reconciled

  • 1. Grab some coffee and enjoy the pre-show banter before the top of the hour!
  • 2. The Dirty Work – Why Data Must Be Reconciled The Briefing Room
  • 3. Welcome Host & Analyst: Eric Kavanagh Guest: Geoffrey Malafsky Twitter Tag: #briefr The Briefing Room
  • 4. Mission !   Reveal the essential characteristics of enterprise software, good and bad !   Provide a forum for detailed analysis of today s innovative technologies !   Give vendors a chance to explain their product to savvy analysts !   Allow audience members to pose serious questions... and get answers! Twitter Tag: #briefr The Briefing Room
  • 5. Data Reconciliation Garbage In GIGO Garbage Out GARBAGE DATA PERFECT MODEL GARBAGE RESULTS PERFECT DATA GARBAGE MODEL GARBAGE RESULTS Twitter Tag: #briefr The Briefing Room
  • 6.
  • 7. § Current  data  is  disjointed  and  of  low  quality   § Variable  use  and  meaning  among  systems  even  for  “same”   data  elements   § Undocumented  defini=ons  and  data  mgmt  processes   § Errors  in  data  systems   § Disagreement  among  data  systems   § Lack  of  exis=ng  descrip=ons  for  key  readiness  use  cases   § Legacy  data  systems  have  failed  to  overcome  these   problems  despite  several  years  of  new    marts/houses/ brokers/IPTs/applica=ons  
  • 8. “Many CIOs believe data is inexpensive because storage has become inexpensive. But data is inherently messy – it can be wrong, it can be duplicative, and it can be irrelevant – which means it requires handling, which is where the real expenses come in. ‘The cost of more data is the application and the computing power and the processes to reconcile all these things’,” "While there are a myriad of analytical tools that can be leveraged, a recent study indicated that more than 70% of CMOs feel they are underprepared to manage the explosion of data and ‘lack true insight.’ “ 1.  Wall Street Journal, CIO‘s Big Problem with Big Data, 2012-08-02 2.  Forbes, The CEO/CMO Dilemma: So Much Data, So Little Impact, 2012-07-18 8  
  • 9. §  Suffix  in  source  A,  prefix  in  B,  neither  in  C  for  same  (part  number,  =tle,  …)?   §  Conflict  syntac=cally  (simplest  case)  and  seman=cally  (most  difficult)   §  Other  tools  &  methods  never  solve  this  because  they  deal  with  the   obstacles  independently  or  not  at  all:    Data  values  out-­‐of-­‐sync  with   metadata,  data  models   Different  Meanings  (Legal  and  Business  Ac=vi=es)     NKY HomeSeekers 1.  2.  3.  4.  5.  Texas Create  table  –  =tle  aligned  to  business  =  Garage   Create  vocabulary:  spaces.descrip=on,  spaces.na=onal,  spaces.state,  .   Define  ETL  logic   Merge  in  warehouse  and  process  in  virtualiza=on  layer   Change  as  needed   9   Copyright  Phasic  Systems  Inc  2013  
  • 10. §  Data  Ra=onaliza=on  is  the  process  of  building  and  managing  a  con=nuously   adap=ve  data  environment  that  fuels  current  and  future  business  needs  for   decision  making  and  system  opera=ons   §  It  ensures  data  (i.e.  not  just  metadata)  is  as  accurate,  meaningful,  and   useful  as  possible  while  con=nuously  adjus=ng  to  improve  and  add   capability   §  It  provides  collabora=ve  management  of  data  assets,  the  designs  governing   who,  why,  and  how  of  data  ,  and  the  where,  when,  how  of  data  use  in   opera=onal  systems   §  It  solves  the  great  challenge  of  mapping  all  source  values  to  each  target   along  the  en=re  complex  paths  of  enterprise  data  use   §  Consolidated  values  when  possible  with  con=nuous  improvement   §  Simplified  and  adap=ve  mapping  with  Corporate  NoSQL   10  
  • 11. Design  Ra-onaliza-on  Issues   •  Mul=ple  data  models   •  Conflic=ng  defini=ons   •  Similar,  supposedly  similar,  opera=onally   dis=nct  values   •  Unknown  business  logic   •  Mul=ple  ETL  mappings   Design  Ra-onaliza-on   •  •  •  •  •  Consolidated,  adap=ve  data  models   Standardized  defini=ons   Synchronized  dis=nct  opera=onal  values   Managed  business  logic   Coordinated  ETL  mappings   System  Ra-onaliza-on  Issues   •  •  •  •  •  Mul=ple  database  systems   Conflic=ng  formats   Redundant  storage   Unsynchronized  values   Mul=ple  integra=on  points   System  Ra-onaliza-on   •  •  •  •  •  Consolidated,  adap=ve  systems   Common,  interoperable  formats   Common  storage   Synchronized  interfaces   Coordinated  integra=on   11   Copyright  Phasic  Systems  Inc  2013  
  • 12. Ra=onalized  Data=Meaningful  Analysis,  Decision  Support,   Enterprise  Applica=ons   12   Copyright  Phasic  Systems  Inc  2013  
  • 13. § Example  from  DARPA  Evidence   Extrac=on  &  Link  Discovery   § Today’s  Situa=on:    ~10k   messages/day  from  mul=ple   sources  read  by  mul=ple   analysts  and  analyzed  in   mul=ple  manual  non-­‐integrated   tools   § Similar  to  Social  Network   Analysis   13  
  • 14. Complicated  Mixture  of  Commercial,  Custom,  Legacy,  Services  Applica=ons,  Data  Stores   14   Copyright  Phasic  Systems  Inc  2013  
  • 15. 15   Copyright  Phasic  Systems  Inc  2013  
  • 16. Costs   Business  Alignment:  Goal,  Capability,  Architecture   Data  Assets:  Systems,  Owners,  Use   16
  • 17. 17   Copyright  Phasic  Systems  Inc  2013  
  • 18. The Ψ–KORS™ System Model Point-select data models, codes, entities   18 Copyright  Phasic  Systems  Inc  2013  
  • 20. § DOD  CIO   § Adap=vely  blend  financial  and  program  data  from   mul=ple  sources  with  unclear,  undocumented   alignment  and  integra=on  logic  (i.e.  this  is  an   intelligence  challenge)  into  BI  tools  (QlikView,  Tableau,   PentaHo,  Excel  Web  Apps-­‐Sharepoint)   § Export  Development  Canada   § Ra=onalize  core  data  distributed  and  undocumented  to   feed  cross-­‐enterprise  governance  and  develop   Enterprise  Data  Model  with  seman=cally  adjudicated   canonical  en==es   Copyright  Phasic  Systems  Inc  2013   20  
  • 21. § Challenge:  Complicated  environment  with  conflic=ng  data   values,  standards,  business  uses  cases,  and  lack  of   documenta=on.  Data  owned  by  4  major  organiza=on,  in  mul=ple   Warehouses  and  data  stores,  redundant  non-­‐reconciled  sets  of   data   § Requirement:  Integrated,  common,  accurate  data  to  enable  new   Integrated  workforce  planning,  training,  management   applica=on  (“Sailor  of  the  Future”)  for  1  million  people   § Prior  Ac-vi-es:  10+  years  of  system  integra=on,  data   warehouse,  data  governance  efforts  à  no  improvement,  poor   coordina=on  across  organiza=ons  and  systems   21  
  • 22. § Yet,  there  were  problems  with  the  most   basic  data  fields,  which  for  the  Navy,  include   things  like     § billet  (effec=vely  a  job  but  also  includes  other   characteris=cs),     § rank  (similar  to  seniority  but  with  formal  rules  that  change   over  =me),     § ra=ng  (similar  to  voca=onal  ability  but  also  with  changing   rules),     § and  even  the  primary  iden=fier  of  a  person  the  Social   Security  Number  (SSN).     22  
  • 23. § Bridge  Organiza=ons,  Processes,  Technologies  to  Data   Concepts   23  
  • 24. Logical  Models  derive  directly  from  conceptual  and  use  business  terms   24  
  • 25. •  Promulgate  key  technologies  to  help  field  overcome  major   obstacles   •  Iden=fy  cause  and  existence  of  seman=c  conflicts   •  Determine  op=ons   •  Promote  enterprise  decision  making  on  solu=on   •  Implement  solu=on  into  opera=onal  data   •  Visible  direct  line  from  governance  to  data  modeling  to   integra=on  to  database  engineering  to  analysis  and  back   again   •  Rapid  cycle  =me:  iden=fy,  assess,  decide,  execute   con=nuously  in  natural  organiza=onal  =meline  (days/weeks)   •  Community  version  DataStar  for  non-­‐commercial  use   •  Collabora=ve  community  communica=on  and  design  of  common,   seman=cally  clear  Corporate  NoSQL  models  
  • 26. Twitter Tag: #briefr The Briefing Room
  • 27. Upcoming Topics November: DATA DISCOVERY & VISUALIZATION December: INNOVATORS 2014 Editorial Calendar at www.insideanalysis.com/webcasts/the-briefing-room www.insideanalysis.com Twitter Tag: #briefr The Briefing Room
  • 28. Thank You for Your Attention Twitter Tag: #briefr The Briefing Room