SlideShare una empresa de Scribd logo
1 de 35
Descargar para leer sin conexión
Data	
  Modelers	
  Save	
  Their	
  Careers:	
  
Surviving	
  and	
  Thriving	
  with	
  NoSQL	
  
	
  
Joe	
  Maguire	
  
Data	
  Quality	
  Strategies,	
  LLC	
  
h=p://www.DataQualityStrategies.com/	
  	
  
©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
  
Thesis	
  
•  RelaIonal	
  DBMS’s	
  have	
  dominated,	
  
•  ...so	
  relaIonal	
  modeling	
  subsumed	
  other	
  
forms,	
  including	
  conceptual	
  modeling.	
  
•  As	
  R-­‐DBMS	
  wanes,	
  so	
  does	
  relaIonal	
  
modeling	
  –	
  and	
  sadly,	
  whatever	
  it	
  subsumed.	
  
•  Conceptual	
  modeling	
  must	
  be	
  saved.	
  
•  RelaIonal	
  modelers	
  can	
  step	
  in	
  to	
  save	
  it...	
  
•  ...with	
  some	
  significant	
  effort.	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   2	
  
My	
  PerspecIve	
  
•  Over	
  three	
  decades	
  in	
  industry	
  
•  Career	
  is	
  a	
  three-­‐legged	
  stool	
  
–  Product	
  development	
  for	
  soVware	
  vendors	
  
–  SoluIon	
  design	
  for	
  enterprises	
  
–  Author,	
  Industry	
  Analyst,	
  Thought	
  Leader	
  	
  
•  Specialize	
  in	
  	
  
–  Modeling	
  
–  Requirements	
  analysis	
  
–  Data	
  architecture	
  
–  Data	
  quality	
  
•  Joe.Maguire@DataQualityStrategies.com	
  	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   3	
  
Agenda	
  
•  History	
  
•  Current	
  Events	
  
•  Your	
  Future	
  as	
  a	
  Data	
  Modeler	
  
•  Q&A	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   4	
  
A	
  Big-­‐Picture	
  Framework	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   5	
  
	
  
Meta-­‐model	
  
	
  
Data	
  Perspec1ve	
  
Conceptual	
   •  EnIIes	
  
•  A=ributes	
  
•  RelaIonships	
  
•  IdenIfiers	
  
Logical	
   •  Tables	
  
•  Columns	
  
•  Primary	
  and	
  foreign	
  keys	
  
Physical	
   •  Indexes	
  
•  Table	
  spaces	
  
•  VerIcal	
  and	
  horizontal	
  parIIoning	
  
•  DenormalizaIons	
  
Good	
  Ideas	
  in	
  the	
  Framework	
  
•  InformaIon	
  Hiding	
  
–  e.g.,	
  conceptual	
  excludes	
  implementaIon	
  details	
  
•  The	
  Type/Instance	
  disIncIon	
  
–  Models	
  describe	
  categories,	
  data	
  describes	
  members	
  
•  ApplicaIon/Data	
  Independence	
  
–  Data	
  modeling	
  is	
  separate	
  from	
  process	
  modeling	
  
•  User	
  Requirements	
  ≠	
  System	
  Requirements	
  
–  Users	
  should	
  not	
  parIcipate	
  in	
  logical	
  and	
  physical	
  	
  
•  Model-­‐Driven	
  Development	
  
–  Forward	
  and	
  reverse	
  engineering	
  across	
  model	
  levels	
  
	
  #Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   6	
  
A	
  Big-­‐Picture	
  Framework,	
  distorted	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   7	
  
	
  
Meta-­‐model	
  
	
  
Data	
  Perspec1ve	
  
RelaIonal	
   •  EnIIes	
  /	
  Tables	
  
•  A=ributes	
  /	
  Columns	
  
•  RelaIonships	
  /	
  FKs	
  
•  IdenIfiers	
  /	
  PKs	
  
	
  
	
  
	
  
Physical	
   •  Indexes	
  
•  Table	
  spaces	
  
•  VerIcal	
  and	
  horizontal	
  parIIoning	
  
•  DenormalizaIons	
  
How	
  the	
  DistorIon	
  Happens	
  
•  Tool	
  Vendors	
  Dismiss	
  Conceptual	
  Modeling	
  
– Because	
  their	
  tools	
  cannot	
  support	
  it	
  anyway	
  
•  Info	
  Mgmt	
  Specialists	
  Confuse	
  Models	
  w	
  Reality	
  
– E.g.,	
  believing	
  the	
  relaIonal	
  model	
  suffices	
  to	
  
describe	
  the	
  universe	
  
•  InsItuIonalized	
  Expediency	
  	
  
– We	
  know	
  about	
  conceptual	
  modeling,	
  but	
  to	
  save	
  
Ime,	
  we	
  combine	
  it	
  with	
  relaIonal	
  modeling...	
  
– ...then	
  we	
  formalize	
  that	
  into	
  our	
  dev	
  processes...	
  
– ...and	
  eventually,	
  that	
  becomes	
  the	
  “best	
  
pracIces.”	
  #Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   8	
  
DistorIons,	
  Revisited	
  
•  Summary	
  of	
  DistorIons:	
  
– DistorIon:	
  Conceptual	
  means	
  vague	
  
– DistorIon:	
  Logical	
  implies	
  relaIonal	
  
•  Rather	
  than	
  XML,	
  OO,	
  KV	
  Store,	
  Array	
  Database,	
  Graph	
  
Database	
  
•  Results	
  of	
  DistorIons:	
  
– Two	
  levels	
  only:	
  relaIonal	
  and	
  physical	
  
– RelaIonal	
  modeling	
  used	
  for	
  user	
  requirements	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   9	
  
Agenda	
  
•  History	
  
•  Current	
  Events	
  
•  Your	
  Future	
  as	
  a	
  Data	
  Modeler	
  
•  Q&A	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   10	
  
Current	
  Events:	
  NoSQL	
  
•  The	
  “Just	
  Say	
  No”	
  InterpretaIon	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   11	
  
	
  
Meta-­‐model	
  
	
  
Data	
  Perspec1ve	
  
Logical	
  
RelaIonal	
  
•  EnIIes	
  /	
  Tables	
  
•  A=ributes	
  /	
  Columns	
  
•  RelaIonships	
  /	
  FKs	
  
•  IdenIfiers	
  /	
  PKs	
  
	
  
	
  
	
  
Physical	
   NO	
  LONGER	
  RELATIONAL:	
  
•  Schemas	
  Based	
  on	
  Big	
  Table	
  ImplementaIons	
  
•  Alien	
  DDL	
  language	
  
•  Limited	
  Support	
  from	
  Modeling	
  Tools	
  
Current	
  Events:	
  NoSQL	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   12	
  
•  The	
  “Not	
  Only	
  SQL”	
  InterpretaIon	
  
– Okay,	
  so	
  there	
  might	
  be	
  some	
  work	
  for	
  you	
  
– But	
  you’re	
  at	
  risk	
  of	
  being	
  marginalized	
  
	
  
	
  
	
  
Agenda	
  
•  History	
  
•  Current	
  Events	
  
•  Your	
  Future	
  as	
  a	
  Data	
  Modeler	
  
•  Summary	
  
•  Q&A	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   13	
  
Your	
  Future	
  as	
  a	
  Modeler	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   14	
  
•  Remaining	
  Relevant	
  
– Selfishly:	
  Saving	
  your	
  career	
  
– Nobly:	
  Serving	
  your	
  client	
  /	
  company	
  /	
  customer	
  
•  What	
  you	
  can	
  do:	
  
– Wait	
  for	
  relaIonal	
  projects	
  
– Become	
  a	
  NoSQL	
  database	
  designer	
  
– Help	
  your	
  client	
  choose	
  data	
  plasorms	
  
•  That	
  starts	
  with	
  understanding	
  the	
  problems	
  
–  which	
  starts	
  with	
  CONCEPTUAL	
  MODELING.	
  
	
  
A	
  New	
  (?)	
  Modeling	
  Framework	
  
•  Conceptual	
  Modeling	
  
•  Choosing	
  a	
  Logical	
  Meta-­‐model	
  
•  Logical	
  Modeling	
  
•  Physical	
  Modeling	
  
	
  
	
  
•  Tool	
  Support?	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   15	
  
Conceptual	
  Modeling	
  
•  Behaviors	
  and	
  constructs	
  will	
  compare	
  to	
  
RelaIonal	
  Modeling:	
  
– Keep	
  some	
  
– Discard	
  some	
  
– Stress	
  some	
  
– Change	
  some	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   16	
  
Conceptual	
  Data	
  Model	
  Example	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   17	
  
Keep	
  Some	
  
•  Keep	
  EnIIes	
  
•  Keep	
  A=ributes	
  
•  Keep	
  RelaIonships	
  
•  Keep	
  IdenIfiers	
  
•  Keep	
  Maximum	
  Cardinality	
  of	
  RelaIonships	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   18	
  
Keep	
  EnIIes	
  
•  Minimum	
  Expressiveness	
  
•  EnIIes,	
  Not	
  Tables	
  
– Don’t	
  express	
  Horizontal	
  or	
  VerIcal	
  ParIIoning	
  for	
  
performance	
  
•  But	
  yes	
  is	
  moIvated	
  by	
  privacy/security/risk	
  
•  EnIty	
  names,	
  not	
  table	
  names	
  
– Honor	
  user	
  vocabulary,	
  not	
  IT	
  naming	
  standards	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   19	
  
Keep	
  A=ributes	
  
•  Honor	
  User	
  Phenomenon	
  
– A=ributes	
  are	
  part	
  of	
  user	
  discourse	
  
•  A=ributes,	
  not	
  columns	
  
– Worry	
  about	
  scale	
  (nominal,	
  numeric,	
  ordinal,	
  
Boolean,	
  cyclic),	
  not	
  data	
  type	
  
– A=ribute	
  names,	
  not	
  column	
  names	
  
•  Support	
  in-­‐progress	
  models	
  
– During	
  which	
  a=ributes	
  can	
  become	
  enIIes	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   20	
  
Keep	
  RelaIonships	
  
•  Minimum	
  Expressiveness	
  
– A=ributes	
  are	
  part	
  of	
  user	
  discourse	
  
•  Allow	
  many-­‐many	
  and	
  collecIon	
  enIIes	
  
– If	
  the	
  la=er	
  seem	
  strange,	
  you’ve	
  been	
  in	
  IT	
  too	
  
long	
  
•  RelaIonships,	
  not	
  FKs	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   21	
  
Keep	
  IdenIfiers	
  
•  IdenIfiers,	
  not	
  PKs	
  
– IDs	
  are	
  not	
  moIvated	
  by	
  computerizaIon,	
  but	
  by	
  
typography	
  
– IDs	
  predate	
  the	
  informaIon	
  revoluIon	
  
•  and	
  the	
  automoIve	
  revoluIon,	
  for	
  that	
  ma=er	
  
•  Support	
  in-­‐process	
  modeling	
  
– IDs	
  help	
  the	
  modeler	
  ferret	
  out	
  the	
  homonym	
  
problem	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   22	
  
Discard	
  Some	
  
•  Discard	
  Foreign	
  Keys	
  
– They’re	
  relaIonal	
  
•  Discard	
  Minimum	
  Cardinality	
  
– A	
  funcIon	
  of	
  process	
  or	
  policy,	
  not	
  data	
  
– Over-­‐reported	
  by	
  users	
  
•  Discard	
  Most	
  Constraints	
  
– A	
  funcIon	
  of	
  process	
  or	
  policy,	
  not	
  data	
  
– Are	
  over-­‐reported	
  by	
  users	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   23	
  
Keep/Discard	
  Rule	
  of	
  Thumb	
  
•  Keep	
  
– Anything	
  that	
  helps	
  you	
  and	
  the	
  users	
  together	
  
discover	
  and	
  name	
  the	
  user	
  categories	
  
•  Discard	
  
– Anything	
  else	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   24	
  
Conceptual	
  Data	
  Model	
  Examples	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   25	
  
Stress	
  Some	
  
•  Stress	
  Consistency	
  Requirements	
  
– RelaIonal	
  modelers	
  (of	
  non-­‐distributed	
  databases)	
  
have	
  not	
  been	
  asking	
  about	
  these.	
  
•  Stress	
  Data	
  Volume	
  /	
  Velocity	
  Requirements	
  
– Can	
  lead	
  or	
  force	
  your	
  to	
  relax	
  applicaIon-­‐data	
  
independence	
  
	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   26	
  
Change	
  Some	
  
•  Change	
  your	
  process	
  
– From	
  math-­‐y	
  normalizaIon	
  to	
  English-­‐y	
  
conversaIon	
  with	
  users	
  
– Very	
  difficult	
  to	
  achieve	
  rigor	
  conversaIonally	
  
	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   27	
  
•  More	
  help:	
  
– Mastering	
  Data	
  Modeling:	
  A	
  
User-­‐Driven	
  Approach	
  	
  
by	
  Carlis	
  &	
  Maguire	
  
– DataStax	
  Webinar:	
  25	
  June	
  
A	
  New	
  Modeling	
  Framework	
  
•  Conceptual	
  Modeling	
  
•  Choosing	
  a	
  Logical	
  Meta-­‐Model	
  
•  Logical	
  Modeling	
  
•  Physical	
  Modeling	
  
	
  
	
  
•  Tool	
  Support?	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   28	
  
Choosing	
  a	
  Logical	
  Meta-­‐Model	
  
•  Don’t	
  Assume	
  RelaIonal	
  (Duh...)	
  
•  Don’t	
  Assume	
  Big	
  Table	
  
•  Lots	
  of	
  Choices	
  
– RelaIonal	
  
– Big	
  Table	
  
– XML/Document	
  Database	
  
– Graph	
  database	
  
– Array	
  database	
  
– ...	
  
	
  #Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   29	
  
A	
  New	
  Modeling	
  Framework	
  
•  Conceptual	
  Modeling	
  
•  Choosing	
  a	
  Logical	
  Meta-­‐Model	
  
•  Logical	
  Modeling	
  
•  Physical	
  Modeling	
  
	
  
	
  
•  Tool	
  Support?	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   30	
  
Logical,	
  Physical,	
  and	
  Tool	
  Support	
  
•  Community	
  needs	
  to	
  develop	
  a	
  roster	
  of	
  shapes	
  
– And	
  the	
  a=endant	
  transformaIons	
  from	
  conceptual	
  
shapes	
  to	
  Big-­‐Table	
  shapes	
  
•  During	
  Logical	
  Big-­‐Table	
  modeling,	
  process	
  
requirements	
  will	
  infiltrate	
  
– including	
  things	
  like	
  minimum	
  cardinality	
  
•  Minimal	
  support	
  from	
  modeling	
  tools	
  
– Because	
  few	
  tools	
  support	
  conceptual	
  modeling	
  
– Because	
  vendors	
  have	
  not	
  caught	
  up	
  to	
  NoSQL	
  yet	
  
	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   31	
  
Agenda	
  
•  History	
  
•  Current	
  Events	
  
•  Your	
  Future	
  as	
  a	
  Data	
  Modeler	
  
•  Summary	
  
•  Q&A	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   32	
  
Summary	
  
•  Re-­‐commit	
  to	
  conceptual	
  modeling	
  for	
  
requirements	
  analysis	
  
– Some	
  but	
  not	
  all	
  relaIonal-­‐modeling	
  skills	
  will	
  
apply	
  
– Must	
  learn	
  to	
  focus	
  on	
  user	
  communicaIon,	
  not	
  
nerdy	
  stuff	
  like	
  intermediate	
  normal	
  forms	
  
	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   33	
  
Summary	
  
•  Remember	
  the	
  fundamentals,	
  so	
  that	
  you	
  can	
  
make	
  informed	
  decisions	
  about	
  relaxing	
  them	
  
–  ApplicaIon-­‐data	
  independence	
  
–  Consistency	
  level	
  as	
  a	
  user	
  requirement	
  
–  DeclaraIve	
  data	
  retrieval	
  (from	
  informaIon	
  hiding)	
  
•  AddiIonal	
  benefits	
  
–  Users	
  will	
  like	
  you	
  be=er	
  
–  Agile	
  developers	
  will	
  like	
  you	
  be=er	
  
–  This	
  framework	
  works	
  in	
  tradiIonal,	
  all-­‐SQL	
  
environments	
  
	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   34	
  
Q&A	
  
•  Joe.Maguire@DataQualityStrategies.com	
  
•  www.DataQualityStrategies.com	
  
#Cassandra13	
   ©	
  2013	
  Data	
  Quality	
  Strategies,	
  LLC	
   35	
  

Más contenido relacionado

La actualidad más candente

Data quality and data profiling
Data quality and data profilingData quality and data profiling
Data quality and data profilingShailja Khurana
 
Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Business Over Broadway
 
Data Quality
Data QualityData Quality
Data Qualityjerdeb
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Erik Fransen
 
Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platformHaoran Du
 
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Vladi Vexler
 
Data modeling for the business 09282010
Data modeling for the business  09282010Data modeling for the business  09282010
Data modeling for the business 09282010ERwin Modeling
 
Data Architecture Process in a BI environment
Data Architecture Process in a BI environmentData Architecture Process in a BI environment
Data Architecture Process in a BI environmentSasha Citino
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architectureCosta Pissaris
 
Big Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPS
Big Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPSBig Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPS
Big Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPSMatt Stubbs
 
The final frontier v3
The final frontier v3The final frontier v3
The final frontier v3Terry Bunio
 
Data Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing ConcernData Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing ConcernAmin Chowdhury
 
Narender Reddy Andra Profile
Narender Reddy Andra ProfileNarender Reddy Andra Profile
Narender Reddy Andra ProfileNarender Reddy
 
Data Quality Dashboards
Data Quality DashboardsData Quality Dashboards
Data Quality DashboardsWilliam Sharp
 
Bringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationBringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationDATAVERSITY
 
Establishing a Strategy for Data Quality
Establishing a Strategy for Data QualityEstablishing a Strategy for Data Quality
Establishing a Strategy for Data QualityDatabase Answers Ltd.
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationVishal Kumar
 
TeamTechnicalFinalPresentation-M5
TeamTechnicalFinalPresentation-M5TeamTechnicalFinalPresentation-M5
TeamTechnicalFinalPresentation-M5Naveed Shahid
 

La actualidad más candente (19)

Data quality and data profiling
Data quality and data profilingData quality and data profiling
Data quality and data profiling
 
Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...
 
Data Quality
Data QualityData Quality
Data Quality
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
 
Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platform
 
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
 
Data modeling for the business 09282010
Data modeling for the business  09282010Data modeling for the business  09282010
Data modeling for the business 09282010
 
Big Data Modeling
Big Data ModelingBig Data Modeling
Big Data Modeling
 
Data Architecture Process in a BI environment
Data Architecture Process in a BI environmentData Architecture Process in a BI environment
Data Architecture Process in a BI environment
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architecture
 
Big Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPS
Big Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPSBig Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPS
Big Data LDN 2018: AGILE DATA MASTERING: THE RIGHT APPROACH FOR DATAOPS
 
The final frontier v3
The final frontier v3The final frontier v3
The final frontier v3
 
Data Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing ConcernData Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing Concern
 
Narender Reddy Andra Profile
Narender Reddy Andra ProfileNarender Reddy Andra Profile
Narender Reddy Andra Profile
 
Data Quality Dashboards
Data Quality DashboardsData Quality Dashboards
Data Quality Dashboards
 
Bringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationBringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and Integration
 
Establishing a Strategy for Data Quality
Establishing a Strategy for Data QualityEstablishing a Strategy for Data Quality
Establishing a Strategy for Data Quality
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
 
TeamTechnicalFinalPresentation-M5
TeamTechnicalFinalPresentation-M5TeamTechnicalFinalPresentation-M5
TeamTechnicalFinalPresentation-M5
 

Destacado

Cassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat Videos
Cassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat VideosCassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat Videos
Cassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat VideosDataStax Academy
 
Cassandra Summit 2014: Fuzzy Entity Matching at Scale
Cassandra Summit 2014: Fuzzy Entity Matching at ScaleCassandra Summit 2014: Fuzzy Entity Matching at Scale
Cassandra Summit 2014: Fuzzy Entity Matching at ScaleDataStax Academy
 
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...DataStax
 
Data Modelers Still Have Jobs: Adjusting for the NoSQL Environment
Data Modelers Still Have Jobs: Adjusting for the NoSQL EnvironmentData Modelers Still Have Jobs: Adjusting for the NoSQL Environment
Data Modelers Still Have Jobs: Adjusting for the NoSQL EnvironmentDataStax
 
Data Modeling for NoSQL
Data Modeling for NoSQLData Modeling for NoSQL
Data Modeling for NoSQLTony Tam
 
Quelles stratégies de Recherche avec Cassandra ?
Quelles stratégies de Recherche avec Cassandra ?Quelles stratégies de Recherche avec Cassandra ?
Quelles stratégies de Recherche avec Cassandra ?Victor Coustenoble
 

Destacado (8)

Data modelling 101
Data modelling 101Data modelling 101
Data modelling 101
 
Cassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat Videos
Cassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat VideosCassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat Videos
Cassandra Day Denver 2014: A Cassandra Data Model for Serving up Cat Videos
 
Cassandra Summit 2014: Fuzzy Entity Matching at Scale
Cassandra Summit 2014: Fuzzy Entity Matching at ScaleCassandra Summit 2014: Fuzzy Entity Matching at Scale
Cassandra Summit 2014: Fuzzy Entity Matching at Scale
 
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
 
Data Modelers Still Have Jobs: Adjusting for the NoSQL Environment
Data Modelers Still Have Jobs: Adjusting for the NoSQL EnvironmentData Modelers Still Have Jobs: Adjusting for the NoSQL Environment
Data Modelers Still Have Jobs: Adjusting for the NoSQL Environment
 
Become a super modeler
Become a super modelerBecome a super modeler
Become a super modeler
 
Data Modeling for NoSQL
Data Modeling for NoSQLData Modeling for NoSQL
Data Modeling for NoSQL
 
Quelles stratégies de Recherche avec Cassandra ?
Quelles stratégies de Recherche avec Cassandra ?Quelles stratégies de Recherche avec Cassandra ?
Quelles stratégies de Recherche avec Cassandra ?
 

Similar a C* Summit 2013: Data Modelers Still Have Jobs - Adjusting For the NoSQL Environment by Joe Maguire

Lessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearLessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearYao Yao
 
How to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldHow to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldKaren Lopez
 
Exploring Business Intelligence: How BI Transforms Business Operations and Fu...
Exploring Business Intelligence: How BI Transforms Business Operations and Fu...Exploring Business Intelligence: How BI Transforms Business Operations and Fu...
Exploring Business Intelligence: How BI Transforms Business Operations and Fu...Velvetech LLC
 
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...IDERA Software
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxCarolineRebeccaD
 
Current-Active Resume
Current-Active ResumeCurrent-Active Resume
Current-Active Resumergtyh
 
Democratizing Data Science in the Enterprise
Democratizing Data Science in the EnterpriseDemocratizing Data Science in the Enterprise
Democratizing Data Science in the EnterpriseJesus Rodriguez
 
Agile & Data Modeling – How Can They Work Together?
Agile & Data Modeling – How Can They Work Together?Agile & Data Modeling – How Can They Work Together?
Agile & Data Modeling – How Can They Work Together?DATAVERSITY
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceDATAVERSITY
 
Data modelingzone geoffrey-clark-v2
Data modelingzone geoffrey-clark-v2Data modelingzone geoffrey-clark-v2
Data modelingzone geoffrey-clark-v2Geoffrey Clark
 
2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in PracticeAlejandro Jaramillo
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Caserta
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures DATAVERSITY
 
Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data Blueprint
 
Business Centric Data Modeling
Business Centric Data ModelingBusiness Centric Data Modeling
Business Centric Data ModelingDATAVERSITY
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsDATAVERSITY
 
Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)Bill Chambers
 

Similar a C* Summit 2013: Data Modelers Still Have Jobs - Adjusting For the NoSQL Environment by Joe Maguire (20)

Lessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearLessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 year
 
How to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldHow to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database World
 
Exploring Business Intelligence: How BI Transforms Business Operations and Fu...
Exploring Business Intelligence: How BI Transforms Business Operations and Fu...Exploring Business Intelligence: How BI Transforms Business Operations and Fu...
Exploring Business Intelligence: How BI Transforms Business Operations and Fu...
 
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
 
Lean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science team
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptx
 
Current-Active Resume
Current-Active ResumeCurrent-Active Resume
Current-Active Resume
 
Democratizing Data Science in the Enterprise
Democratizing Data Science in the EnterpriseDemocratizing Data Science in the Enterprise
Democratizing Data Science in the Enterprise
 
Agile & Data Modeling – How Can They Work Together?
Agile & Data Modeling – How Can They Work Together?Agile & Data Modeling – How Can They Work Together?
Agile & Data Modeling – How Can They Work Together?
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business Intelligence
 
Data modelingzone geoffrey-clark-v2
Data modelingzone geoffrey-clark-v2Data modelingzone geoffrey-clark-v2
Data modelingzone geoffrey-clark-v2
 
Data modeling
Data modelingData modeling
Data modeling
 
2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures
 
Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures
 
Business Centric Data Modeling
Business Centric Data ModelingBusiness Centric Data Modeling
Business Centric Data Modeling
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling Fundamentals
 
Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)
 

Más de DataStax Academy

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftDataStax Academy
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseDataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraDataStax Academy
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsDataStax Academy
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingDataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackDataStax Academy
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache CassandraDataStax Academy
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready CassandraDataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonDataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2DataStax Academy
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First ClusterDataStax Academy
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with DseDataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraDataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseDataStax Academy
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraDataStax Academy
 

Más de DataStax Academy (20)

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph Database
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart Labs
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data Modeling
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stack
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
 
Coursera Cassandra Driver
Coursera Cassandra DriverCoursera Cassandra Driver
Coursera Cassandra Driver
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready Cassandra
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First Cluster
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache Cassandra
 
Cassandra Core Concepts
Cassandra Core ConceptsCassandra Core Concepts
Cassandra Core Concepts
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax Enterprise
 
Bad Habits Die Hard
Bad Habits Die Hard Bad Habits Die Hard
Bad Habits Die Hard
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache Cassandra
 
Advanced Cassandra
Advanced CassandraAdvanced Cassandra
Advanced Cassandra
 

Último

Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 

Último (20)

Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 

C* Summit 2013: Data Modelers Still Have Jobs - Adjusting For the NoSQL Environment by Joe Maguire

  • 1. Data  Modelers  Save  Their  Careers:   Surviving  and  Thriving  with  NoSQL     Joe  Maguire   Data  Quality  Strategies,  LLC   h=p://www.DataQualityStrategies.com/     ©  2013  Data  Quality  Strategies,  LLC  
  • 2. Thesis   •  RelaIonal  DBMS’s  have  dominated,   •  ...so  relaIonal  modeling  subsumed  other   forms,  including  conceptual  modeling.   •  As  R-­‐DBMS  wanes,  so  does  relaIonal   modeling  –  and  sadly,  whatever  it  subsumed.   •  Conceptual  modeling  must  be  saved.   •  RelaIonal  modelers  can  step  in  to  save  it...   •  ...with  some  significant  effort.   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   2  
  • 3. My  PerspecIve   •  Over  three  decades  in  industry   •  Career  is  a  three-­‐legged  stool   –  Product  development  for  soVware  vendors   –  SoluIon  design  for  enterprises   –  Author,  Industry  Analyst,  Thought  Leader     •  Specialize  in     –  Modeling   –  Requirements  analysis   –  Data  architecture   –  Data  quality   •  Joe.Maguire@DataQualityStrategies.com     #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   3  
  • 4. Agenda   •  History   •  Current  Events   •  Your  Future  as  a  Data  Modeler   •  Q&A   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   4  
  • 5. A  Big-­‐Picture  Framework   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   5     Meta-­‐model     Data  Perspec1ve   Conceptual   •  EnIIes   •  A=ributes   •  RelaIonships   •  IdenIfiers   Logical   •  Tables   •  Columns   •  Primary  and  foreign  keys   Physical   •  Indexes   •  Table  spaces   •  VerIcal  and  horizontal  parIIoning   •  DenormalizaIons  
  • 6. Good  Ideas  in  the  Framework   •  InformaIon  Hiding   –  e.g.,  conceptual  excludes  implementaIon  details   •  The  Type/Instance  disIncIon   –  Models  describe  categories,  data  describes  members   •  ApplicaIon/Data  Independence   –  Data  modeling  is  separate  from  process  modeling   •  User  Requirements  ≠  System  Requirements   –  Users  should  not  parIcipate  in  logical  and  physical     •  Model-­‐Driven  Development   –  Forward  and  reverse  engineering  across  model  levels    #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   6  
  • 7. A  Big-­‐Picture  Framework,  distorted   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   7     Meta-­‐model     Data  Perspec1ve   RelaIonal   •  EnIIes  /  Tables   •  A=ributes  /  Columns   •  RelaIonships  /  FKs   •  IdenIfiers  /  PKs         Physical   •  Indexes   •  Table  spaces   •  VerIcal  and  horizontal  parIIoning   •  DenormalizaIons  
  • 8. How  the  DistorIon  Happens   •  Tool  Vendors  Dismiss  Conceptual  Modeling   – Because  their  tools  cannot  support  it  anyway   •  Info  Mgmt  Specialists  Confuse  Models  w  Reality   – E.g.,  believing  the  relaIonal  model  suffices  to   describe  the  universe   •  InsItuIonalized  Expediency     – We  know  about  conceptual  modeling,  but  to  save   Ime,  we  combine  it  with  relaIonal  modeling...   – ...then  we  formalize  that  into  our  dev  processes...   – ...and  eventually,  that  becomes  the  “best   pracIces.”  #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   8  
  • 9. DistorIons,  Revisited   •  Summary  of  DistorIons:   – DistorIon:  Conceptual  means  vague   – DistorIon:  Logical  implies  relaIonal   •  Rather  than  XML,  OO,  KV  Store,  Array  Database,  Graph   Database   •  Results  of  DistorIons:   – Two  levels  only:  relaIonal  and  physical   – RelaIonal  modeling  used  for  user  requirements   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   9  
  • 10. Agenda   •  History   •  Current  Events   •  Your  Future  as  a  Data  Modeler   •  Q&A   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   10  
  • 11. Current  Events:  NoSQL   •  The  “Just  Say  No”  InterpretaIon   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   11     Meta-­‐model     Data  Perspec1ve   Logical   RelaIonal   •  EnIIes  /  Tables   •  A=ributes  /  Columns   •  RelaIonships  /  FKs   •  IdenIfiers  /  PKs         Physical   NO  LONGER  RELATIONAL:   •  Schemas  Based  on  Big  Table  ImplementaIons   •  Alien  DDL  language   •  Limited  Support  from  Modeling  Tools  
  • 12. Current  Events:  NoSQL   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   12   •  The  “Not  Only  SQL”  InterpretaIon   – Okay,  so  there  might  be  some  work  for  you   – But  you’re  at  risk  of  being  marginalized        
  • 13. Agenda   •  History   •  Current  Events   •  Your  Future  as  a  Data  Modeler   •  Summary   •  Q&A   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   13  
  • 14. Your  Future  as  a  Modeler   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   14   •  Remaining  Relevant   – Selfishly:  Saving  your  career   – Nobly:  Serving  your  client  /  company  /  customer   •  What  you  can  do:   – Wait  for  relaIonal  projects   – Become  a  NoSQL  database  designer   – Help  your  client  choose  data  plasorms   •  That  starts  with  understanding  the  problems   –  which  starts  with  CONCEPTUAL  MODELING.    
  • 15. A  New  (?)  Modeling  Framework   •  Conceptual  Modeling   •  Choosing  a  Logical  Meta-­‐model   •  Logical  Modeling   •  Physical  Modeling       •  Tool  Support?   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   15  
  • 16. Conceptual  Modeling   •  Behaviors  and  constructs  will  compare  to   RelaIonal  Modeling:   – Keep  some   – Discard  some   – Stress  some   – Change  some   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   16  
  • 17. Conceptual  Data  Model  Example   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   17  
  • 18. Keep  Some   •  Keep  EnIIes   •  Keep  A=ributes   •  Keep  RelaIonships   •  Keep  IdenIfiers   •  Keep  Maximum  Cardinality  of  RelaIonships   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   18  
  • 19. Keep  EnIIes   •  Minimum  Expressiveness   •  EnIIes,  Not  Tables   – Don’t  express  Horizontal  or  VerIcal  ParIIoning  for   performance   •  But  yes  is  moIvated  by  privacy/security/risk   •  EnIty  names,  not  table  names   – Honor  user  vocabulary,  not  IT  naming  standards   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   19  
  • 20. Keep  A=ributes   •  Honor  User  Phenomenon   – A=ributes  are  part  of  user  discourse   •  A=ributes,  not  columns   – Worry  about  scale  (nominal,  numeric,  ordinal,   Boolean,  cyclic),  not  data  type   – A=ribute  names,  not  column  names   •  Support  in-­‐progress  models   – During  which  a=ributes  can  become  enIIes   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   20  
  • 21. Keep  RelaIonships   •  Minimum  Expressiveness   – A=ributes  are  part  of  user  discourse   •  Allow  many-­‐many  and  collecIon  enIIes   – If  the  la=er  seem  strange,  you’ve  been  in  IT  too   long   •  RelaIonships,  not  FKs   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   21  
  • 22. Keep  IdenIfiers   •  IdenIfiers,  not  PKs   – IDs  are  not  moIvated  by  computerizaIon,  but  by   typography   – IDs  predate  the  informaIon  revoluIon   •  and  the  automoIve  revoluIon,  for  that  ma=er   •  Support  in-­‐process  modeling   – IDs  help  the  modeler  ferret  out  the  homonym   problem   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   22  
  • 23. Discard  Some   •  Discard  Foreign  Keys   – They’re  relaIonal   •  Discard  Minimum  Cardinality   – A  funcIon  of  process  or  policy,  not  data   – Over-­‐reported  by  users   •  Discard  Most  Constraints   – A  funcIon  of  process  or  policy,  not  data   – Are  over-­‐reported  by  users   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   23  
  • 24. Keep/Discard  Rule  of  Thumb   •  Keep   – Anything  that  helps  you  and  the  users  together   discover  and  name  the  user  categories   •  Discard   – Anything  else   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   24  
  • 25. Conceptual  Data  Model  Examples   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   25  
  • 26. Stress  Some   •  Stress  Consistency  Requirements   – RelaIonal  modelers  (of  non-­‐distributed  databases)   have  not  been  asking  about  these.   •  Stress  Data  Volume  /  Velocity  Requirements   – Can  lead  or  force  your  to  relax  applicaIon-­‐data   independence     #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   26  
  • 27. Change  Some   •  Change  your  process   – From  math-­‐y  normalizaIon  to  English-­‐y   conversaIon  with  users   – Very  difficult  to  achieve  rigor  conversaIonally     #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   27   •  More  help:   – Mastering  Data  Modeling:  A   User-­‐Driven  Approach     by  Carlis  &  Maguire   – DataStax  Webinar:  25  June  
  • 28. A  New  Modeling  Framework   •  Conceptual  Modeling   •  Choosing  a  Logical  Meta-­‐Model   •  Logical  Modeling   •  Physical  Modeling       •  Tool  Support?   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   28  
  • 29. Choosing  a  Logical  Meta-­‐Model   •  Don’t  Assume  RelaIonal  (Duh...)   •  Don’t  Assume  Big  Table   •  Lots  of  Choices   – RelaIonal   – Big  Table   – XML/Document  Database   – Graph  database   – Array  database   – ...    #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   29  
  • 30. A  New  Modeling  Framework   •  Conceptual  Modeling   •  Choosing  a  Logical  Meta-­‐Model   •  Logical  Modeling   •  Physical  Modeling       •  Tool  Support?   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   30  
  • 31. Logical,  Physical,  and  Tool  Support   •  Community  needs  to  develop  a  roster  of  shapes   – And  the  a=endant  transformaIons  from  conceptual   shapes  to  Big-­‐Table  shapes   •  During  Logical  Big-­‐Table  modeling,  process   requirements  will  infiltrate   – including  things  like  minimum  cardinality   •  Minimal  support  from  modeling  tools   – Because  few  tools  support  conceptual  modeling   – Because  vendors  have  not  caught  up  to  NoSQL  yet     #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   31  
  • 32. Agenda   •  History   •  Current  Events   •  Your  Future  as  a  Data  Modeler   •  Summary   •  Q&A   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   32  
  • 33. Summary   •  Re-­‐commit  to  conceptual  modeling  for   requirements  analysis   – Some  but  not  all  relaIonal-­‐modeling  skills  will   apply   – Must  learn  to  focus  on  user  communicaIon,  not   nerdy  stuff  like  intermediate  normal  forms     #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   33  
  • 34. Summary   •  Remember  the  fundamentals,  so  that  you  can   make  informed  decisions  about  relaxing  them   –  ApplicaIon-­‐data  independence   –  Consistency  level  as  a  user  requirement   –  DeclaraIve  data  retrieval  (from  informaIon  hiding)   •  AddiIonal  benefits   –  Users  will  like  you  be=er   –  Agile  developers  will  like  you  be=er   –  This  framework  works  in  tradiIonal,  all-­‐SQL   environments     #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   34  
  • 35. Q&A   •  Joe.Maguire@DataQualityStrategies.com   •  www.DataQualityStrategies.com   #Cassandra13   ©  2013  Data  Quality  Strategies,  LLC   35