Publicidad
Publicidad

Más contenido relacionado

Presentaciones para ti(20)

Similar a Data-Ed: Metadata Strategies(20)

Publicidad
Publicidad

Data-Ed: Metadata Strategies

  1. Metadata Management Strategies Copyright 2014 by Data Blueprint Good systems development often depends on multiple data management disciplines that provide a solid foundation. One of these is metadata. While much of the discussion around metadata focuses on understanding metadata itself along with its associated technologies, this perspective often represents a typical tool-and-technology focus, which has not achieved significant results to date. A more relevant question when considering pockets of metadata is whether to include them in the scope of organizational metadata practices. By understanding what it means to include items in the scope of your metadata practices, you can begin to build systems that allow you to practice sophisticated ways to advance data management and supported business initiatives with a demonstrable ROI. After a bit of practice in this manner you can position your organization to better exploit any and all metadata technologies in support of business strategy. 
 Date: November 11, 2014 Time: 2:00 PM ET/11:00 AM PT Presenter: Peter Aiken, Ph.D.
  2. Commonly Asked Questions 2 Copyright 2014 by Data Blueprint 1)Will I get copies of the slides after the event 2)Yes this is being recorded
  3. Get Social With Us! 3 Copyright 2014 by Data Blueprint Like Us on Facebook www.facebook.com/datablueprint Post questions and comments Find industry news, insightful content and event updates. Join the Group Data Management & Business Intelligence Ask questions, gain insights and collaborate with fellow data management professionals Live Twitter Feed Join the conversation! Follow us: @datablueprint @paiken Ask questions and submit your comments: #dataed
  4. PETER AIKEN WITH JUANITA BILLINGS FOREWORD BY JOHN BOTTEGA MONETIZING DATA MANAGEMENT Unlocking the Value in Your Organization’s Most Important Asset. Peter Aiken, Ph.D. 4 Copyright 2014 by Data Blueprint The Case for the Chief Data Officer Recasting the C-Suite to Leverage Your MostValuable Asset Peter Aiken and Michael Gorman • 30+ years data management experience • Multiple international awards/ recognition • Founder, Data Blueprint (datablueprint.com) • Associate Professor of IS, VCU (vcu.edu) • (Past) President, DAMA Int. (dama.org) • 9 books and dozens of articles • Experienced w/ 500+ data management practices in 20 countries • Multi-year immersions with organizations as diverse as the US DoD, Nokia, Deutsche Bank, Wells Fargo, Walmart, and the Commonwealth of Virginia
  5. Peter Aiken, Ph.D. Metadata Management Strategies 10124 W. Broad Street, Suite C Glen Allen, Virginia 23060 804.521.4056
  6. Whither the "data dictionary? 6 Copyright 2014 by Data Blueprint • The classic "data dictionary" pretty much died by the early '90s - ... they ever-so-kindly renamed "data dictionary" to "metadata repository" & then promptly went belly up. Ask pretty much IBMer today if they've ever heard of AD/Cycle or RepositoryManager... guaranteed response will be a blank stare. • My calculation says 5% survival rate from 1973 to 2003 • Metadata has morphed into a meaningless buzzword … - Yet organizations are suffering from unprecedented amounts of new forms of seriously unmanaged metadata. • I've essentially given up on trying to grok what metadata is other than "required buzzword" (Dave Eddy/deddy@davideddy.com)
  7. Metadata Management Strategies 7 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A
  8. Metadata Management Strategies 8 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  9. Uses What is data management? 9 Copyright 2014 by Data Blueprint Sources Data Governance 
 Data Engineering 
 Data 
 Delivery 
 Data
 Storage Specialized Team Skills • Data management practices connect data sources and uses in an organized and efficient manner – Storage – Engineering – Delivery – Governance • When executed, engineering, storage, and delivery implement governance
  10. DMM℠ Structure 10 Copyright 2014 by Data Blueprint
  11. Five Integrated DM Practice Areas Manage data coherently. Share data across boundaries. Assign responsibilities for data. Engineer data delivery systems. Maintain data availability. Data Program Coordination Organizational Data Integration Data Stewardship Data Development Data Support Operations 11 Copyright 2014 by Data Blueprint
  12. DMM℠ Structure 12 Copyright 2014 by Data Blueprint
  13. Maslow's Hierarchiy of Needs 13 Copyright 2014 by Data Blueprint
  14. You can accomplish Advanced Data Practices without becoming proficient in the Foundational Data Management Practices however this will: • Take longer • Cost more • Deliver less • Present 
 greater
 risk
 (with thanks to Tom DeMarco) Data Management Practices Hierarchy Advanced 
 Data 
 Practices • MDM • Mining • Big Data • Analytics • Warehousing • SOA Foundational Data Management Practices 14 Copyright 2014 by Data Blueprint Data Platform/Architecture Data Governance Data Quality Data Operations Data Management Strategy Technologies Capabilities
  15. DataManagementBodyofKnowledge 15 Copyright 2014 by Data Blueprint Data Management Functions
  16. DAMA DM BoK & CDMP 16 Copyright 2014 by Data Blueprint • Data Management Body of Knowledge (DMBOK) – Published by DAMA International, the professional association for 
 Data Managers (40 chapters worldwide) – Organized around primary data management functions focused around data delivery to the organization and several environmental elements • Certified Data Management Professional (CDMP) – Series of 3 exams by DAMA International and ICCP – Membership in a distinct group of 
 fellow professionals – Recognition for specialized knowledge in a 
 choice of 17 specialty areas – For more information, please visit: • www.dama.org, www.iccp.org
  17. Metadata Management from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International 17 Copyright 2014 by Data Blueprint
  18. Metadata Management Strategies 18 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  19. Metadata Management Strategies 19 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  20. What is a Strategy? 20 Copyright 2014 by Data Blueprint • Current use derived from military • "a pattern in a stream of decisions" [Henry Mintzberg] • "a system of finding, formulating, and developing a doctrine that will ensure long-term success if followed faithfully [Vladimir Kvint]
  21. Meta data, Meta-data, or metadata 21 Copyright 2014 by Data Blueprint • In the history of language, whenever two words are pasted together to form a combined concept initially, a hyphen links them • With the passage of time, 
 the hyphen is lost. The 
 argument can be made 
 that that time has passed • There is a copyright on 
 the term "metadata," but 
 it has not been enforced • So, term is "metadata"
  22. Definitions 22 Copyright 2014 by Data Blueprint • Metadata is – Everywhere in every data management activity and integral 
 to all IT systems and applications. – To data what data is to real life. Data reflects real life transactions, events, objects, relationships, etc. Metadata reflects data transactions, events, objects, relations, etc. – The data that describe the structure and workings of an 
 organization’s use of information, and which describe the 
 systems it uses to manage that information. 
 [quote from David Hay's book, page 4] • Data describing various facets of a data asset, for the purpose of improving its usability throughout its life cycle [Gartner 2010] • Metadata unlocks the value of data, and therefore requires management attention [Gartner 2011] • Metadata Management is – The set of processes that ensure proper creation, storage, integration, and control to support associated use of metadata
  23. 23 Copyright 2014 by Data Blueprint
  24. Analogy: a library card catalog 24 Copyright 2014 by Data Blueprint • Identifies – What books are in the library, and – Where they are located • Search by – Subject area – Author, or – Title • Catalog shows – Author – Subject tags – Publication date and – Revision history • Determine which books will 
 meet the reader’s requirements • Without the catalog, finding things is difficult, time consuming and frustrating
 from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  25. Definition (continued) 25 Copyright 2014 by Data Blueprint • Metadata is the card catalog in a managed data environment • Abstractly, Metadata is the descriptive tags or context on the data (the content) in a managed data environment • Metadata shows business and technical users where to find information in data repositories • Metadata provides details on where the data came from, how it got there, any transformations, and its level of quality • Metadata provides assistance with what the data really means and how to interpret it from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  26. Defining Metadata 26 Copyright 2014 by Data Blueprint Metadata is any combination of any circle and the data in the center that unlocks the value of the data! Adapted from Brad Melton Data WhereWhy What How Who When Data
  27. Library Metadata Example Libraries can operate efficiently through careful 
 use of metadata (Card Catalog)
 
 Who: Author What: Title Where: Shelf Location When: Publication Date
 A small amount of metadata (Card Catalog) unlocks the value of a large amount of data (the Library) 27 Copyright 2014 by Data Blueprint Data WhereWhy What How Who When Library Book
  28. Outlook Example 28 Copyright 2014 by Data Blueprint "Outlook" metadata is used to navigate/manage email
 What: "Subject" How: "Priority" Where: "USERID/Inbox", 
 "USERID/Personal" Why: "Body" When: "Sent" & "Received”
 • Find the important stuff/weed out junk • Organize for future access/outlook rules • Imagine how managing e-mail (already non-trivial) would change if Outlook did not make use of metadata Who:"To" & "From?"
  29. Uses What is the structure of metadata practices? 29 Copyright 2014 by Data Blueprint Sources 
 Metadata Governance 
 Metadata Engineering 
 Metadata Delivery Metadata Practices Metadata
 Storage Specialized Team Skills • Metadata practices connect data sources and uses in an organized and efficient manner – Storage: repository, glossary, models, lineage - often multiple
 technologies – Engineering: identifying/harvesting/normalizing/administer 
 evolving metadata structures – Delivery: supply/access/portal/definition/lookup search 
 identify/ensure required metadata supplies to 
 meet business needs – Governance: ensure proper/creation/storage/integration/control 
 to support effective use • When executed, engineering and delivery implement governance
  30. Polling Question #1 30 Copyright 2014 by Data Blueprint • My organization began using or is planning to use a formal approach to metadata management a) Last year (2013) b) This year (2014) c) Next year (2015) d) Not at all
  31. Polling Question #1 (from last year) 31 Copyright 2014 by Data Blueprint • My organization began using or is planning to use a formal approach to metadata management a) Last year (2012) 38% b) This year (2014) 13% c) Next year (2015) 14% d) Not at all 15%
  32. Metadata Management Strategies 32 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  33. Metadata Management Strategies 33 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  34. Types of Metadata: Process Metadata 34 Copyright 2014 by Data Blueprint • Process Metadata is... – Data that defines and describes the characteristics of other system elements, e.g. processes, business rules, programs, jobs, tools, etc. • Examples of Process metadata: – Data stores and data involved – Government/regulatory bodies – Organization owners and stakeholders – Process dependencies and decomposition – Process feedback loop and documentation – Process name from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  35. Business Process Metadata 35 Copyright 2014 by Data Blueprint Who: Created the documentation? What: Are the important dependencies 
 among the processes? How: Do the business processes interact with each other? Data WhereWhy What How Who When Email Messag e
  36. Types of Metadata: Business Metadata 36 Copyright 2014 by Data Blueprint • Business Metadata describe 
 to the end user what data are 
 available, what they mean and 
 how to retrieve them. • Included are: – Business names and definitions of subject and concept areas, entities, attributes – Attribute data types and other attribute properties – Range descriptions, calculations, algorithms and business rules – Valid domain values and their definitions from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  37. Types of Metadata: Technical & Operational Metadata 37 Copyright 2014 by Data Blueprint • Technical and operational metadata provides developers and technical users with information about their systems • Technical metadata includes… – Physical database table and column names, column properties, other properties, other database object properties and database storage • Operational metadata is targeted at IT operations users’ needs, including… – Information about data movement, source and target systems, batch programs, job frequency, schedule anomalies, recovery and backup information, archive rules and usage • Examples of Technical & Operational metadata: – Audit controls and balancing information – Data archiving and retention rules – Encoding/reference table conversions – History of extracts and results from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  38. Types of Metadata: Data Stewardship 38 Copyright 2014 by Data Blueprint • Data stewardship metadata is about... – Data stewards, stewardship processes, and responsibility assignments • Data stewards… – Assure that data and Metadata are accurate, with high quality across the enterprise. – Establish and monitor data sharing. • Examples of Data stewardship metadata: – Business drivers/goals – Data CRUD rules – Data definitions – business and technical – Data owners – Data sharing rules and agreements/contracts – Data stewards, roles and responsibilities from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  39. Types of Metadata: Provenance 39 Copyright 2014 by Data Blueprint • Provenance: – the history of ownership of a valued object or work of art or literature" [Merriam Webster] – For each datum, this is the description of: • Its source (system or person or department), • Any derivation used, and • The date it was created. – Examples of Data Provenance: • The programs or 
 processes by which 
 it was created • Its owner • The steward responsible 
 for its quality • Other roles and 
 responsibilities • Rules for sharing it from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  40. Metadata Subject Areas Subject Areas Components 1) Business Analytics Data definitions, reports, users, usage, performance 2) Business Architecture Roles and organizations, goals and objectives 3) Business Definitions Business terms and explanations for a particular concept, fact, or other item found in an organization 4) Business Rules Standard calculations and derivation methods 5) Data Governance Policies, standards, procedures, programs, roles, organizations, stewardship assignments 6) Data Integration Sources, targets, transformations, lineage, ETL workflows, EAI, EII, migration/conversion 7) Data Quality Defects, metrics, ratings 8) Document Content Management Unstructured data, documents, taxonomies, ontologies, name sets, legal discovery, search engine indexes 40 Copyright 2014 by Data Blueprint from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  41. Metadata Subject Areas, continued Subject Areas Components 9) Information Technology Infrastructure Platforms, networks, configurations, licenses 10)Conceptual data models Entities, attributes, relationships and rules, business names and definitions. 11)Logical Data Models Files, tables, columns, views, business definitions, indexes, usage, performance, change management 12)Process Models Functions, activities, roles, inputs/outputs, workflow, timing, stores 13)Systems Portfolio and IT Governance Databases, applications, projects, and programs, integration roadmap, change management 14)Service-oriented Architecture (SOA) information: Components, services, messages, master data 15)System Design and Development Requirements, designs and test plans, impact 16)Systems Management Data security, licenses, configuration, reliability, service levels 41 Copyright 2014 by Data Blueprint from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  42. Metadata Management Strategies 42 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  43. Metadata Management Strategies 43 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  44. 7 Metadata Benefits 44 Copyright 2014 by Data Blueprint 1. Increase the value of strategic information (e.g. data warehousing, CRM, SCM, etc.) by providing context for the data, thus aiding analysts in making more effective decisions. 2. Reduce training costs and lower the impact of staff turnover through thorough documentation of data context, history, and origin. 3. Reduce data-oriented research time by assisting business analysts in finding the information they need in a timely manner. 4. Improve communication by bridging the gap between business users and IT professionals, leveraging work done by other teams and increasing confidence in IT system data. 5. Increased speed of system development’s time-to-market by reducing system development life-cycle time. 6. Reduce risk of project failure through better impact analysis at various levels during change management. 7. Identify and reduce redundant data and processes, thereby reducing rework and use of redundant, out-of-data, or incorrect data. from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  45. Metadata for Semistructured Data 45 Copyright 2014 by Data Blueprint • Unstructured data – Any data that is not in a database or data file, including documents or other media data • Metadata describes both structured and unstructured data • Metadata for unstructured data exists in many formats, responding to a variety of different requirements • Examples of Metadata repositories describing unstructured data: – Content management applications – University websites – Company intranet sites – Data archives – Electronic journals collections – Community resource lists • Common method for classifying Metadata in unstructured sources is to describe them as descriptive metadata, structural metadata, or administrative metadata from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  46. Metadata for Unstructured Data: Examples 46 Copyright 2014 by Data Blueprint • Examples of descriptive metadata: – Catalog information – Thesauri keyword terms • Examples of structural metadata – Dublin Core – Field structures – Format (audio/visual, booklet) – Thesauri keyword labels – XML schemas • Examples of administrative metadata – Source(s) – Integration/update schedule – Access rights – Page relationships (e.g. site navigational design)
  47. Specific Example 47 Copyright 2014 by Data Blueprint • Four metadata sources: 1. Existing reference models (i.e., ADRM) 2. Conceptual model created two years ago 3. Existing systems (to be reverse engineered) 4. Enterprise data model } from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  48. Metadata Management Strategies 48 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  49. Metadata Management Strategies 49 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  50. Metadata History 1990-2008 50 Copyright 2014 by Data Blueprint • The history of Metadata management tools and products seems to be a metaphor for the lack of a methodological approach to enterprise information management: • Lack of standards and proprietary nature of most managed Metadata solutions cause many organizations to avoid focusing on metadata • This limits organizations’ ability to develop a true enterprise information management environment • Increased attention given to information and its importance to an organization’s operations and decision-making will drive Metadata management products and solutions to become more standardized • More recognition to the need for a methodological approach to managing information and metadata
  51. Metadata History: The 1990s 51 Copyright 2014 by Data Blueprint • Business managers began to recognize the value of Metadata repositories • Newer tools expanded the scope • Potential benefits identified during this period include: – Providing semantic layer between company’s system and business users – Reducing training costs – Making strategic information more valuable as aid in decision making – Creating actionable information – Limiting incorrect decisions
  52. Metadata History: Mid-to late 1990s 52 Copyright 2014 by Data Blueprint • Metadata becomes more relevant to corporations who were struggling to understand their information resources caused by: – Y2K deadline – Emerging data warehousing initiatives – Growing focus around the World Wide Web • Beginning of efforts to try to standardize Metadata definition and exchange between applications in the enterprise • Examples of standardization: – 1995: CASE Definition Interchange Facility (CDIF) – 1995: Dublin Core Metadata Elements – 1994 – 1999: First parts of ISO 11179 standard for Specification and Standardization of Data Elements were published – 1998: Common Warehouse Metadata Model (CWM) – 1995: Metadata Coalitions’ (MDC) Open Information Model – 2000: Both standards merged into CSM. Many Metadata repositories began promising adoption of CWM standard
  53. Metadata History: 21st Century 53 Copyright 2014 by Data Blueprint • Update of existing Metadata repositories for deployment on the web • Introduction of products to support CWM • Vendors begin focusing on Metadata as an additional product offering • Few organizations purchase or develop Metadata repositories • Effective enterprise-wide Managed Metadata Environments are rare due to: – Scarcity of people with real world skills – Difficulty of the effort – Less than stellar success of some of the initial efforts at some companies – Stagnation of the tool market after the initial burst of interest in late 90s – Still less than universal understanding of the business benefits – Too heavy emphasis on legacy applications and technical metadata
  54. Metadata History: Current Decade 54 Copyright 2014 by Data Blueprint • Focus on need for and importance of metadata • Focus on how to incorporate Metadata beyond traditional structured sources and include semistructured sources • Driving factors: – Recent entry of larger vendors into the market – Challenges related to addressing regulatory requirements, e.g. Sarbanes-Oxley, and privacy requirements with unsophisticated tools – Emergence of enterprise-wide initiatives, e.g. information governance, compliance, enterprise architecture, automated software reuse – Improvements to the existing Metadata standards, e.g. RFP release of new OMG standard Information Management Metamodel (IMM), which will replace CWM – Recognition at the highest levels that information is an asset that must be actively and effectively managed
  55. Why Metadata Matters 55 Copyright 2014 by Data Blueprint • They know you rang a phone sex service at 2:24 am and spoke for 18 minutes. But they don't know what you talked about. • They know you called the suicide prevention hotline from the Golden Gate Bridge. But the topic of the call remains a secret. • They know you spoke with an HIV testing service, then your doctor, then your health insurance company in the same hour. But they don't know what was discussed. • They know you received a call from the local NRA office while it was having a campaign against gun legislation, and then called your senators and congressional representatives immediately after. But the content of those calls remains safe from government intrusion. • They know you called a gynecologist, spoke for a half hour, and then called the local Planned Parenthood's number later that day. But nobody knows what you spoke about. – https://www.eff.org/deeplinks/2013/06/why-metadata-matters
  56. Metadata Strategy 56 Copyright 2014 by Data Blueprint • Metadata Strategy is – A statement of direction in Metadata management by the enterprise – A statement of intend that acts as a reference framework for the development teams – Driven by business objectives and prioritized by the business value they bring to the organization • Build a Metadata strategy from a set of defined components • Primary focus of Metadata strategy – gain an understanding of and consensus on the organization’s key business drivers, issues, and information requirements for the enterprise Metadata program • Need to understand how well the current environment meets these requirements now and in the future • Metadata strategy objectives define the organization’s future enterprise metadata architecture and recommend logical progression of phased implementation steps • Only 1 in 10 organizations has a documented, board approved data strategy
  57. Metadata Strategy Implementation Phases 57 Copyright 2014 by Data Blueprint
  58. Polling Question #2 58 Copyright 2014 by Data Blueprint • Compliance laws have influenced my organization to pay more attention to and/or put more resources into: a) Data quality improvement efforts 29% b) Metadata management efforts 6% c) Database management, in general 27% d) No impact 13%
  59. Metadata Management Strategies 59 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  60. Metadata Management Strategies 60 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  61. Goals and Principles 61 Copyright 2014 by Data Blueprint from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International • Provide organizational understanding of terms and usage • Integrate Metadata from diverse sources • Provide easy, integrated access to metadata • Ensure Metadata quality and security
  62. Activities 62 Copyright 2014 by Data Blueprint • Understand Metadata requirements • Define the Metadata architecture • Develop and maintain Metadata standards • Implement a managed Metadata environment • Create and maintain metadata • Integrate metadata • Management Metadata repositories • Distribute and deliver metadata • Query, report and analyze metadata from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  63. Activities: Metadata Standards Types 63 Copyright 2014 by Data Blueprint • Two major types: – Industry or consensus standards – International standards
 • High level framework can show – How standards are related – How they rely on each other for context and usage from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  64. Activities: Noteworthy Metadata Standards Types Warehouse Process Warehouse Operation Transformation OLAP Data Mining Information Visualization Business Nomenclature Object Model Relational Record Multidimensional XML Business Information Data Types Expression Keys and Indexes Type Mapping Software Deployment Object Model Management Analysis Resource Foundation 64 Copyright 2014 by Data Blueprint • Common Warehouse Metadata (CWM): • Specifies the interchange of Metadata among data warehousing, BI, KM, and portal technologies. • Based on UML and depends on it to represent object- oriented data constructs. • The CWM Metamodel
  65. Information Management Metamodel (IMM) 65 Copyright 2014 by Data Blueprint • Object Management Group Project to replace CWM • Concerned with: – Business Modeling • Entity/relationship metamodel – Technology modeling • Relational Databases • XML • LDAP – Model Management • Traceability – Compatibility with related models • Semantics of business vocabulary and business rules • Ontology Definition Metamodel • Based on Core model • Used to translate from one model to another
  66. Primary Deliverables 66 Copyright 2014 by Data Blueprint • Metadata repositories • Quality metadata • Metadata analysis • Data lineage • Change impact analysis • Metadata control procedures • Metadata models and architecture • Metadata management operational analysis from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  67. Roles and Responsibilities 67 Copyright 2014 by Data Blueprint • Suppliers: – Data Stewards – Data Architects – Data Modelers – Database Administrators – Other Data Professionals – Data Brokers – Government and Industry Regulators • Participants: – Metadata Specialists – Data Integration Architects – Data Stewards – Data Architects and Modelers – Database Administrators – Other DM Professionals – Other IT Professionals – DM Executives – Business Users • Consumers: – Data Stewards – Data Professionals – Other IT Professionals – Knowledge Workers – Managers and Executives – Customers and Collaborators – Business Users from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  68. Technology 68 Copyright 2014 by Data Blueprint • Metadata repositories • Data modeling tools • Database management systems • Data integration tools • Business intelligence tools • System management tools • Object modeling tools • Process modeling tools • Report generating tools • Data quality tools • Data development and administration tools • Reference and mater data management tools from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  69. Polling Question #3 69 Copyright 2014 by Data Blueprint • Do you use metadata models and/or modeling tools to support your information quality efforts? a) Yes 49% b) No 39%
  70. Metadata Management Strategies 70 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  71. Metadata Management Strategies 71 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  72. 15 Guiding Principles 72 Copyright 2014 by Data Blueprint 1. Establish and maintain a Metadata strategy and 
 appropriate policies, especially clear goals and 
 objectives for Metadata management and usage 2. Secure sustained commitment, funding, and vocal support from senior management concerning Metadata management for the enterprise 3. Take an enterprise perspective to ensure future extensibility, but implement through iterative and incremental delivery 4. Develop a Metadata strategy before evaluating, purchasing, and installing Metadata management products 5. Create or adopt Metadata standards to ensure interoperability of Metadata across the enterprise 6. Ensure effective Metadata acquisition for internal and external metadata 7. Maximize user access since a solution that is not accessed or is under-accessed will not show business value from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  73. 15 Guiding Principles, continued 73 Copyright 2014 by Data Blueprint 8. Understand and communicate the necessity of Metadata and the purpose of each type of metadata; socialization of the value of Metadata will encourage business usage 9. Measure content and usage 10.Leverage XML, messaging and web services 11.Establish and maintain enterprise-wide business involvement in data stewardship, assigning accountability for metadata 12.Define and monitor procedures and processes to ensure correct policy implementation 13.Include a focus on roles, staffing, 
 standards, procedures, training, & metrics 14.Provide dedicated Metadata experts 
 to the project and beyond 15.Certify Metadata quality from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  74. Metadata Management Strategies 74 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  75. Metadata Management Strategies 75 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  76. • Example: – iTunes Metadata • Insert a recently purchased CD • iTunes can: – Count the number of tracks (25) – Determine the length of each track 6609/10/12 Example: iTunes Metadata 76 Copyright 2014 by Data Blueprint
  77. 6709/10/12 Example: iTunes Metadata 77 Copyright 2014 by Data Blueprint • When connected to the Internet iTunes connects to the Gracenote(.com) Media Database and retrieves: – CD Name – Artist – Track Names – Genre – Artwork • Sure would be a pain to type in all this information
  78. 6809/10/12 Example: iTunes Metadata 78 Copyright 2014 by Data Blueprint • To organize iTunes – I create a "New Smart Playlist" for Artist's containing "Miles Davis"
  79. Example: iTunes Metadata 6909/10/12 79 Copyright 2014 by Data Blueprint • Notice I didn't get the desired results • I already had another Miles Davis recording in iTunes • Must fine-tune the request to get the desired results – Album contains "The complete birth of the cool" • Now I can move the playlist "Miles Davis" to a folder
  80. Example: iTunes Metadata 7009/10/12 80 Copyright 2014 by Data Blueprint • The same: –Interface –Processing –Data Structures • are applied to –Podcasts –Movies –Books –.pdf files • Economies of scale are enormous
  81. Metadata Management Strategies 81 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  82. Metadata Management Strategies 82 Copyright 2014 by Data Blueprint 1. Data Management Overview 2. What is metadata and why is it important? 3. Major metadata types & subject areas 4. Metadata benefits, application & sources 5. Metadata strategies & implementation 6. Metadata building blocks 7. Guiding Principles 8. Specific teachable example 9. Take Aways, References and Q&A Tweeting now: #dataed
  83. Uses Metadata Take Aways 83 Copyright 2014 by Data Blueprint • Metadata unlocks the value of data, and therefore requires management attention [Gartner 2011]
 
 
 
 
 
 
 
 
 • Metadata is the language of data governance • Metadata defines the essence of integration challenges Sources 
 Metadata Governance 
 Metadata Engineering 
 Metadata Delivery Metadata Practices Metadata
 Storage Specialized Team Skills
  84. Data Management Body of Knowledge 84 Copyright 2014 by Data Blueprint Data Management Functions
  85. Metadata Management Summary from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International 85 Copyright 2014 by Data Blueprint
  86. References & Recommended Reading 86 Copyright 2014 by Data Blueprint from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  87. References, cont’d 87 Copyright 2014 by Data Blueprint from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  88. References, cont’d 88 Copyright 2014 by Data Blueprint from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  89. References, cont’d 89 Copyright 2014 by Data Blueprint from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  90. Polling Question #4 90 Copyright 2014 by Data Blueprint • My organization began using or is planning to use a metadata repository (purchased or homegrown) a) Last year (2013) b) This year (2014) c) Next year (2015) d) Not applicable
  91. Questions? It’s your turn! Use the chat feature or Twitter (#dataed) to submit your questions to Peter now. + = 91 Copyright 2014 by Data Blueprint
  92. Upcoming Events 92 Copyright 2014 by Data Blueprint Data Systems Integration & Business Value Pt. 2: Cloud August 13, 2013 @ 2:00 PM ET/11:00 AM PT Data Systems Integration & Business Value Pt. 3: Warehousing September 10, 2013 @ 2:00 PM ET/11:00 AM PT Sign up here: www.datablueprint.com/webinar-schedule or www.dataversity.net
Publicidad