SlideShare a Scribd company logo
1 of 21
Download to read offline
Twenty Years of Metadata:
                              Lessons from the
                       First Two Decades of the Web
                                    Stuart Weibel
                            University of Tsukuba Visiting Scholar
                                         May 13, 2011




Friday, May 13, 2011                                                 1
Outline

                        The Context

                        Dublin Core in the Metadata Matrix

                        What we did right

                        The major impediments

                        A few words about models

                        What about the future?
                                      Image: Carved figures (Morikawa Toen), Tokyo National Museum




Friday, May 13, 2011                                                                                 2
THe Context

               When I started working at OCLC in 1985:

                       I was 4 years away from my first email address

                       A PC hard drive wasn’t large enough to store a
                       single high resolution digital image.
                       (which was ok, because…)

                       Cameras still used film

                       Cell phones were suitcase-sized                               me… circa 1994


                       MARC Cataloging stood alone as the discovery tool for intellectual assets of
                       libraries

                       No end-user access to the global library catalogs

Friday, May 13, 2011                                                                                  3
And now?
               A cell phone has more computing power than the Space Shuttle

               An iPod will hold WorldCat

               Bandwidth is more important than computing power

               The library is still mostly mired in MARC

               There are many metadata standards (mostly struggling for traction)

               People (mostly) find things with Google

               but….


Friday, May 13, 2011                                                                4
Metadata is more than just
                      search
                          Metadata-dependent actions
                                    Describe
                                      Access
                                 Encode/Render
                                     Preserve
                               Rights Management
                                   Administer
                       “Bind” digital pages in digital books


Friday, May 13, 2011                                           5
50 years of Metadata
          MARC standards (library metadata)
                       OCLC founded (shared library cataloging)
                             ARPANET Operational - forerunner of the Internet
                                      Networking diffuses throughout academia
                                                  The Web begins... FRBR work begins
                                                             First Dublin Core Workshop
                                                               DCMI established
                                                                 Google is founded
                                                                     First Dublin Core Conference (Tokyo)
                                            my first email

                                               address
                                                                         WorldCat introduced
                                                                            RDA introduced

            1960s            1970s      1980s                1990s    2000s

Friday, May 13, 2011                                                                                        6
The confusion:
                          How bad is it?




            “This visual map of the metadata landscape is intended to assist
         planners with the selection and implementation of metadata standards.”
      http://www.dlib.indiana.edu/~jenlrile/metadatamap/

Friday, May 13, 2011                                                              7
JenN Riley’s Metadata Map
               105 standards

               30 most common across the top (3 predate the Web)

               some share common models… most do not

               much overlap

               many work together

               Who among us can choose rationally from the array of
               standards, platforms, technologies?

               Will the results have any reasonable expectation of
               interoperability?
Friday, May 13, 2011                                                  8
The real world is not
                                 standards-centric
              Metadata-
           dependent actions        Standard          Information Entities (ex.)
                  Describe         MARC, DC, MODS,                  Agents
                                  RDA, LCSH, MeSH….   (persons, corporate entities, devices)

                       Access      HTTP, FTP….                      Events
                                   RDF, media-type
           Encode/render          dependent (many)       Time intervals or eras

                  Preserve            PREMIS                      Concepts
                  Rights             CC licenses,
                Management        eCommerce systems              Collections

               Administer         METS, MARC….                  Media-types
           “Bind” digital pages     METS, eBook
             in digital books        standards            Structured data type


Friday, May 13, 2011                                                                           9
The map is much more
                            complicated
               “This visual map of the metadata landscape is intended to assist
            planners with the selection and implementation of metadata standards.”




               “selection and implementation of metadata standards requires a clear
                 understanding of the information entities, the standards, and the
                        functional requirements of the system under design”
                                                                Image: Kyoto horizon from above the Tenru-ji Temple




Friday, May 13, 2011                                                                                                  10
Dublin Core in the
                          metadata matrix
               The first metadata standard for
               the Web

               General and cross-disciplinary

               Simple starting place, but
               extensible

               International and multilingual

               Consensus-driven (bottom-up,
               rather than top-down)
                                                 Image: Jomon Pottery, Tokyo National Museum,




Friday, May 13, 2011                                                                            11
Things we did right
               We didn’t call it ‘cataloging’ (Web, not libraries)

               A hybrid of technical engineering
               and social engineering

               International - Major events on
               5 continents, element definitions
               in 20+ languages (maintained in
               Tsukuba)

               Separated syntax and semantics

               Built a community of practice

               About the right level of complexity for a core element set
                                                                            Image: Harajuku train station platform, Tokyo




Friday, May 13, 2011                                                                                                        12
Impediments that tripped
          us up
                 Too many syntaxes to support
                 (HTML, XML, RDF-XML)

                 No common data model
                 but we tried hard:
                 data model group,
                 architecture group,
                 abstract model,
                 Singapore Framework...

                 Without a data model, the story we told was not consistent: confusion resulted

                 Without a data model, details of implementation become arbitrary (and less
                 interoperable)
                                                                               Image: Netsuke, Tokyo National Museum




Friday, May 13, 2011                                                                                                   13
Data Modeling: what is it?
               Entity-relationship model defines the important concepts or things
               (entities), and the relationships among them

               A model is a model, not reality

               Designed to solve a problem,
               not to emulate the real world

               The complexity of the model
               should be mapped to the
               problem, not to reality

               Identifying the right level of abstraction is an art       Image: Edo Museum




Friday, May 13, 2011                                                                          14
Data Modeling: why is it
          necessary?
               Without a shared
               understanding of the
               important entities, and the
               relationships among them,
               systems will not
               interoperate easily

               Cross-walks become
               necessary: clumsy,            Changing rail car ‘bogeys’ on the
               inaccurate, inefficient           China/Mongolia border



Friday, May 13, 2011                                                             15
An example of modeling
          mismatch
           Citation information
                               Date
                                Title
                             Author
                          Affiliation
                       Email address


          - Which of the attributes are Dublin Core?
          - Is “email address” an attribute of the resource, or the person?
          - Should there be a distinction between Title and Subtitle?

Friday, May 13, 2011                                                          16
Is Dublin Core well-matched to the
          problem of bibliographic description?

               It is too simple to capture the precision of detailed
               bibliographic description

               BUT… It is good enough for many purposes, including the
               description of most simple internet resources

               The trade-off between perfect matching of model and
               problem, and simplicity of use is always a compromise

               DC was intended for general resource description, not to
               replace MARC


Friday, May 13, 2011                                                      17
The problem with models
               Matching the complexity of models to a diverse and evolving
               problem is challenging, and full of compromises

                       too much complexity
                       leads to failure
                       (creeping elegance)

                       too little complexity
                       leads to failure
                       (insufficient richness
                       to solve the problem)

               HOW DO YOU KNOW WHEN IT IS RIGHT?
                                                            Image: figures from a model in the Kyushu National Museum




Friday, May 13, 2011                                                                                                    18
Conceptual Models in the
                            Library World
                                                The dominant models for
                         FRBR and FRAD
                                             bibliographic and authority data
                                               Reference model for Open
                              OAIS
                                              Archive Information Systems
                                              Conceptual Reference Model for
                          CIDOC CRM           cultural heritage documentation



                                             Largely unintelligible data model
                Dublin Core Abstract Model
                                              for Dublin Core instance data
                                               A vague framework describing
                       Singapore Framework   levels of metadata interoperability


Friday, May 13, 2011                                                               19
The Next Chapters in the Web
                            Metadata story...

               ...are being written in the W3C Incubator Group on Library Linked Data (http://
               www.w3.org/2005/Incubator/lld/)

               Many questions:

                       Will the data be open?

                       Who will maintain it?

                       Is semantic web infrastructure stable?

                       Can existing metadata be integrate
                       seamlessly into the web?

                       Can a model be agreed upon?

                       Will we ever have interoperability across domain silos?
                                                                                 Image: Stone Monk in the Nezu Museum Garden




Friday, May 13, 2011                                                                                                           20
stuart.weibel@gmail.com


          http://weibel-lines.typepad.com


          @stuartweibel on twitter


          stuartweibel on Facebook

                       all photographs by the author
                                                       Image: Lantern overlooking the Irises in the Nezu Museum Garden




Friday, May 13, 2011                                                                                                     21

More Related Content

What's hot

Digital natives and virtual libraries: What does the future hold for libraries?
Digital natives and virtual libraries: What does the future hold for libraries?Digital natives and virtual libraries: What does the future hold for libraries?
Digital natives and virtual libraries: What does the future hold for libraries?Yasar Tonta
 
Un-defining digital literacies: students' day-to-day engagements with technol...
Un-defining digital literacies: students' day-to-day engagements with technol...Un-defining digital literacies: students' day-to-day engagements with technol...
Un-defining digital literacies: students' day-to-day engagements with technol...Martin Oliver
 
The Links that became a Web
The Links that became a WebThe Links that became a Web
The Links that became a WebJohan Koren
 
Orchestrating Soft and Hard Technologies
Orchestrating Soft and Hard TechnologiesOrchestrating Soft and Hard Technologies
Orchestrating Soft and Hard Technologiesjondron
 
TEL Developments & Trends
TEL Developments & TrendsTEL Developments & Trends
TEL Developments & Trendstimku
 
Network of Excellence in Internet Science (Multidisciplinarity and its Implic...
Network of Excellence in Internet Science (Multidisciplinarity and its Implic...Network of Excellence in Internet Science (Multidisciplinarity and its Implic...
Network of Excellence in Internet Science (Multidisciplinarity and its Implic...i_scienceEU
 
Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...
Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...
Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...Beat Signer
 
Learning with technology as coordinated sociomaterial practice: digital liter...
Learning with technology as coordinated sociomaterial practice: digital liter...Learning with technology as coordinated sociomaterial practice: digital liter...
Learning with technology as coordinated sociomaterial practice: digital liter...Martin Oliver
 
The learner voice: students' use and experience of technologies
The learner voice: students' use and experience of technologiesThe learner voice: students' use and experience of technologies
The learner voice: students' use and experience of technologiesgrainne
 
What is information?
What is information?What is information?
What is information?Johan Koren
 
Mist2012 panel discussion-ruo ando
Mist2012 panel discussion-ruo andoMist2012 panel discussion-ruo ando
Mist2012 panel discussion-ruo andoRuo Ando
 
Internet to web: The 40-year old Internet and the 20-year-old Web
Internet to web:  The 40-year old Internet and the 20-year-old WebInternet to web:  The 40-year old Internet and the 20-year-old Web
Internet to web: The 40-year old Internet and the 20-year-old WebJohan Koren
 
History of neural networks
History of neural networks History of neural networks
History of neural networks EliyasJain
 
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...kebepcy
 
Conole keynote edmedia
Conole keynote edmediaConole keynote edmedia
Conole keynote edmediagrainne
 
Pres oslo2013-digital-didactics-isajahnke-v4
Pres oslo2013-digital-didactics-isajahnke-v4Pres oslo2013-digital-didactics-isajahnke-v4
Pres oslo2013-digital-didactics-isajahnke-v4Isa Jahnke
 

What's hot (20)

Digital natives and virtual libraries: What does the future hold for libraries?
Digital natives and virtual libraries: What does the future hold for libraries?Digital natives and virtual libraries: What does the future hold for libraries?
Digital natives and virtual libraries: What does the future hold for libraries?
 
Un-defining digital literacies: students' day-to-day engagements with technol...
Un-defining digital literacies: students' day-to-day engagements with technol...Un-defining digital literacies: students' day-to-day engagements with technol...
Un-defining digital literacies: students' day-to-day engagements with technol...
 
The Links that became a Web
The Links that became a WebThe Links that became a Web
The Links that became a Web
 
Orchestrating Soft and Hard Technologies
Orchestrating Soft and Hard TechnologiesOrchestrating Soft and Hard Technologies
Orchestrating Soft and Hard Technologies
 
TEL Developments & Trends
TEL Developments & TrendsTEL Developments & Trends
TEL Developments & Trends
 
Final multimedia
Final multimediaFinal multimedia
Final multimedia
 
Stefan Decker
Stefan DeckerStefan Decker
Stefan Decker
 
Network of Excellence in Internet Science (Multidisciplinarity and its Implic...
Network of Excellence in Internet Science (Multidisciplinarity and its Implic...Network of Excellence in Internet Science (Multidisciplinarity and its Implic...
Network of Excellence in Internet Science (Multidisciplinarity and its Implic...
 
Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...
Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...
Introduction - Lecture 1 - Seminar Web Information Systems Technology (WE-DIN...
 
Learning with technology as coordinated sociomaterial practice: digital liter...
Learning with technology as coordinated sociomaterial practice: digital liter...Learning with technology as coordinated sociomaterial practice: digital liter...
Learning with technology as coordinated sociomaterial practice: digital liter...
 
The learner voice: students' use and experience of technologies
The learner voice: students' use and experience of technologiesThe learner voice: students' use and experience of technologies
The learner voice: students' use and experience of technologies
 
What is information?
What is information?What is information?
What is information?
 
Mist2012 panel discussion-ruo ando
Mist2012 panel discussion-ruo andoMist2012 panel discussion-ruo ando
Mist2012 panel discussion-ruo ando
 
Internet to web: The 40-year old Internet and the 20-year-old Web
Internet to web:  The 40-year old Internet and the 20-year-old WebInternet to web:  The 40-year old Internet and the 20-year-old Web
Internet to web: The 40-year old Internet and the 20-year-old Web
 
History of neural networks
History of neural networks History of neural networks
History of neural networks
 
Internet to Web
Internet to WebInternet to Web
Internet to Web
 
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...
 
Conole keynote edmedia
Conole keynote edmediaConole keynote edmedia
Conole keynote edmedia
 
Pres oslo2013-digital-didactics-isajahnke-v4
Pres oslo2013-digital-didactics-isajahnke-v4Pres oslo2013-digital-didactics-isajahnke-v4
Pres oslo2013-digital-didactics-isajahnke-v4
 
Ljc
LjcLjc
Ljc
 

Viewers also liked (16)

Loyola University Chicago
Loyola University ChicagoLoyola University Chicago
Loyola University Chicago
 
Bulgaria
BulgariaBulgaria
Bulgaria
 
Foro de matematicas 2011
Foro de matematicas 2011Foro de matematicas 2011
Foro de matematicas 2011
 
Planoghandling
PlanoghandlingPlanoghandling
Planoghandling
 
Swine Flu
Swine FluSwine Flu
Swine Flu
 
Grant Euroskin Presentation
Grant Euroskin PresentationGrant Euroskin Presentation
Grant Euroskin Presentation
 
PSP -Ecclesall Woods
PSP -Ecclesall WoodsPSP -Ecclesall Woods
PSP -Ecclesall Woods
 
Badajoz: present and past
Badajoz: present and pastBadajoz: present and past
Badajoz: present and past
 
Spain: This is our country
Spain: This is our countrySpain: This is our country
Spain: This is our country
 
Est Overview Feb 2010 Ac V2
Est Overview Feb 2010 Ac V2Est Overview Feb 2010 Ac V2
Est Overview Feb 2010 Ac V2
 
Paisajes de escuernavacas
Paisajes de escuernavacasPaisajes de escuernavacas
Paisajes de escuernavacas
 
Stigma Presentation
Stigma PresentationStigma Presentation
Stigma Presentation
 
Hungary's history
Hungary's historyHungary's history
Hungary's history
 
Bulgariacarmenkaczmar
BulgariacarmenkaczmarBulgariacarmenkaczmar
Bulgariacarmenkaczmar
 
Presentationturkiaengodollo
PresentationturkiaengodolloPresentationturkiaengodollo
Presentationturkiaengodollo
 
Brief history of Spain
Brief history of SpainBrief history of Spain
Brief history of Spain
 

Similar to Twenty Years of Metadata: Lessons from the First Two Decades of the Web

One Big Happy Family
One Big Happy FamilyOne Big Happy Family
One Big Happy FamilyDan Brickley
 
Machine learning and multimedia information retrieval
Machine learning and multimedia information retrievalMachine learning and multimedia information retrieval
Machine learning and multimedia information retrievalSi Krishan
 
Living the life electric
Living the life electricLiving the life electric
Living the life electricDoctorG
 
Soeren okfn greece meetup
Soeren okfn greece meetupSoeren okfn greece meetup
Soeren okfn greece meetupOKFN-GR
 
Introduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBCIntroduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBCFlorian Stegmaier
 
Django and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks assDjango and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks assTobias Lindaaker
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentationekansa
 
Poster Semantic Web - Abhijit Chandrasen Manepatil
Poster Semantic Web - Abhijit Chandrasen ManepatilPoster Semantic Web - Abhijit Chandrasen Manepatil
Poster Semantic Web - Abhijit Chandrasen Manepatilap
 
Ten Technology Trends That Will Change the World in Ten Years
Ten Technology Trends That Will Change the World in Ten YearsTen Technology Trends That Will Change the World in Ten Years
Ten Technology Trends That Will Change the World in Ten YearsCisco Services
 
XXIX Charleston Semantic Web Leicht
XXIX Charleston   Semantic Web LeichtXXIX Charleston   Semantic Web Leicht
XXIX Charleston Semantic Web LeichtDarrell W. Gunter
 
The Navigation Layer - Making Sense Of It All
The Navigation Layer - Making Sense Of It AllThe Navigation Layer - Making Sense Of It All
The Navigation Layer - Making Sense Of It AllJim Kalbach
 
Electronic Lab Notebooks in Biomedical Research
Electronic Lab Notebooks in Biomedical ResearchElectronic Lab Notebooks in Biomedical Research
Electronic Lab Notebooks in Biomedical ResearchAxiope Limited
 
ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides DuraSpace
 
Data-Intensive Text Processing with MapReduce
Data-Intensive Text Processing  with MapReduce Data-Intensive Text Processing  with MapReduce
Data-Intensive Text Processing with MapReduce George Ang
 
Data-Intensive Text Processing with MapReduce
Data-Intensive Text Processing with MapReduceData-Intensive Text Processing with MapReduce
Data-Intensive Text Processing with MapReduceGeorge Ang
 

Similar to Twenty Years of Metadata: Lessons from the First Two Decades of the Web (20)

One Big Happy Family
One Big Happy FamilyOne Big Happy Family
One Big Happy Family
 
Machine learning and multimedia information retrieval
Machine learning and multimedia information retrievalMachine learning and multimedia information retrieval
Machine learning and multimedia information retrieval
 
Data Research Vision
Data Research VisionData Research Vision
Data Research Vision
 
Semantic Web Nature
Semantic Web NatureSemantic Web Nature
Semantic Web Nature
 
Living the life electric
Living the life electricLiving the life electric
Living the life electric
 
Metadata extraction
Metadata extractionMetadata extraction
Metadata extraction
 
Soeren okfn greece meetup
Soeren okfn greece meetupSoeren okfn greece meetup
Soeren okfn greece meetup
 
Introduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBCIntroduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBC
 
Django and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks assDjango and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks ass
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentation
 
Poster Semantic Web - Abhijit Chandrasen Manepatil
Poster Semantic Web - Abhijit Chandrasen ManepatilPoster Semantic Web - Abhijit Chandrasen Manepatil
Poster Semantic Web - Abhijit Chandrasen Manepatil
 
Ten Technology Trends That Will Change the World in Ten Years
Ten Technology Trends That Will Change the World in Ten YearsTen Technology Trends That Will Change the World in Ten Years
Ten Technology Trends That Will Change the World in Ten Years
 
XXIX Charleston Semantic Web Leicht
XXIX Charleston   Semantic Web LeichtXXIX Charleston   Semantic Web Leicht
XXIX Charleston Semantic Web Leicht
 
Metadata 101public
Metadata 101publicMetadata 101public
Metadata 101public
 
Adfi forum 4_12
Adfi forum 4_12Adfi forum 4_12
Adfi forum 4_12
 
The Navigation Layer - Making Sense Of It All
The Navigation Layer - Making Sense Of It AllThe Navigation Layer - Making Sense Of It All
The Navigation Layer - Making Sense Of It All
 
Electronic Lab Notebooks in Biomedical Research
Electronic Lab Notebooks in Biomedical ResearchElectronic Lab Notebooks in Biomedical Research
Electronic Lab Notebooks in Biomedical Research
 
ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides
 
Data-Intensive Text Processing with MapReduce
Data-Intensive Text Processing  with MapReduce Data-Intensive Text Processing  with MapReduce
Data-Intensive Text Processing with MapReduce
 
Data-Intensive Text Processing with MapReduce
Data-Intensive Text Processing with MapReduceData-Intensive Text Processing with MapReduce
Data-Intensive Text Processing with MapReduce
 

Recently uploaded

Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 

Recently uploaded (20)

Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 

Twenty Years of Metadata: Lessons from the First Two Decades of the Web

  • 1. Twenty Years of Metadata: Lessons from the First Two Decades of the Web Stuart Weibel University of Tsukuba Visiting Scholar May 13, 2011 Friday, May 13, 2011 1
  • 2. Outline The Context Dublin Core in the Metadata Matrix What we did right The major impediments A few words about models What about the future? Image: Carved figures (Morikawa Toen), Tokyo National Museum Friday, May 13, 2011 2
  • 3. THe Context When I started working at OCLC in 1985: I was 4 years away from my first email address A PC hard drive wasn’t large enough to store a single high resolution digital image. (which was ok, because…) Cameras still used film Cell phones were suitcase-sized me… circa 1994 MARC Cataloging stood alone as the discovery tool for intellectual assets of libraries No end-user access to the global library catalogs Friday, May 13, 2011 3
  • 4. And now? A cell phone has more computing power than the Space Shuttle An iPod will hold WorldCat Bandwidth is more important than computing power The library is still mostly mired in MARC There are many metadata standards (mostly struggling for traction) People (mostly) find things with Google but…. Friday, May 13, 2011 4
  • 5. Metadata is more than just search Metadata-dependent actions Describe Access Encode/Render Preserve Rights Management Administer “Bind” digital pages in digital books Friday, May 13, 2011 5
  • 6. 50 years of Metadata MARC standards (library metadata) OCLC founded (shared library cataloging) ARPANET Operational - forerunner of the Internet Networking diffuses throughout academia The Web begins... FRBR work begins First Dublin Core Workshop DCMI established Google is founded First Dublin Core Conference (Tokyo) my first email address WorldCat introduced RDA introduced 1960s 1970s 1980s 1990s 2000s Friday, May 13, 2011 6
  • 7. The confusion: How bad is it? “This visual map of the metadata landscape is intended to assist planners with the selection and implementation of metadata standards.” http://www.dlib.indiana.edu/~jenlrile/metadatamap/ Friday, May 13, 2011 7
  • 8. JenN Riley’s Metadata Map 105 standards 30 most common across the top (3 predate the Web) some share common models… most do not much overlap many work together Who among us can choose rationally from the array of standards, platforms, technologies? Will the results have any reasonable expectation of interoperability? Friday, May 13, 2011 8
  • 9. The real world is not standards-centric Metadata- dependent actions Standard Information Entities (ex.) Describe MARC, DC, MODS, Agents RDA, LCSH, MeSH…. (persons, corporate entities, devices) Access HTTP, FTP…. Events RDF, media-type Encode/render dependent (many) Time intervals or eras Preserve PREMIS Concepts Rights CC licenses, Management eCommerce systems Collections Administer METS, MARC…. Media-types “Bind” digital pages METS, eBook in digital books standards Structured data type Friday, May 13, 2011 9
  • 10. The map is much more complicated “This visual map of the metadata landscape is intended to assist planners with the selection and implementation of metadata standards.” “selection and implementation of metadata standards requires a clear understanding of the information entities, the standards, and the functional requirements of the system under design” Image: Kyoto horizon from above the Tenru-ji Temple Friday, May 13, 2011 10
  • 11. Dublin Core in the metadata matrix The first metadata standard for the Web General and cross-disciplinary Simple starting place, but extensible International and multilingual Consensus-driven (bottom-up, rather than top-down) Image: Jomon Pottery, Tokyo National Museum, Friday, May 13, 2011 11
  • 12. Things we did right We didn’t call it ‘cataloging’ (Web, not libraries) A hybrid of technical engineering and social engineering International - Major events on 5 continents, element definitions in 20+ languages (maintained in Tsukuba) Separated syntax and semantics Built a community of practice About the right level of complexity for a core element set Image: Harajuku train station platform, Tokyo Friday, May 13, 2011 12
  • 13. Impediments that tripped us up Too many syntaxes to support (HTML, XML, RDF-XML) No common data model but we tried hard: data model group, architecture group, abstract model, Singapore Framework... Without a data model, the story we told was not consistent: confusion resulted Without a data model, details of implementation become arbitrary (and less interoperable) Image: Netsuke, Tokyo National Museum Friday, May 13, 2011 13
  • 14. Data Modeling: what is it? Entity-relationship model defines the important concepts or things (entities), and the relationships among them A model is a model, not reality Designed to solve a problem, not to emulate the real world The complexity of the model should be mapped to the problem, not to reality Identifying the right level of abstraction is an art Image: Edo Museum Friday, May 13, 2011 14
  • 15. Data Modeling: why is it necessary? Without a shared understanding of the important entities, and the relationships among them, systems will not interoperate easily Cross-walks become necessary: clumsy, Changing rail car ‘bogeys’ on the inaccurate, inefficient China/Mongolia border Friday, May 13, 2011 15
  • 16. An example of modeling mismatch Citation information Date Title Author Affiliation Email address - Which of the attributes are Dublin Core? - Is “email address” an attribute of the resource, or the person? - Should there be a distinction between Title and Subtitle? Friday, May 13, 2011 16
  • 17. Is Dublin Core well-matched to the problem of bibliographic description? It is too simple to capture the precision of detailed bibliographic description BUT… It is good enough for many purposes, including the description of most simple internet resources The trade-off between perfect matching of model and problem, and simplicity of use is always a compromise DC was intended for general resource description, not to replace MARC Friday, May 13, 2011 17
  • 18. The problem with models Matching the complexity of models to a diverse and evolving problem is challenging, and full of compromises too much complexity leads to failure (creeping elegance) too little complexity leads to failure (insufficient richness to solve the problem) HOW DO YOU KNOW WHEN IT IS RIGHT? Image: figures from a model in the Kyushu National Museum Friday, May 13, 2011 18
  • 19. Conceptual Models in the Library World The dominant models for FRBR and FRAD bibliographic and authority data Reference model for Open OAIS Archive Information Systems Conceptual Reference Model for CIDOC CRM cultural heritage documentation Largely unintelligible data model Dublin Core Abstract Model for Dublin Core instance data A vague framework describing Singapore Framework levels of metadata interoperability Friday, May 13, 2011 19
  • 20. The Next Chapters in the Web Metadata story... ...are being written in the W3C Incubator Group on Library Linked Data (http:// www.w3.org/2005/Incubator/lld/) Many questions: Will the data be open? Who will maintain it? Is semantic web infrastructure stable? Can existing metadata be integrate seamlessly into the web? Can a model be agreed upon? Will we ever have interoperability across domain silos? Image: Stone Monk in the Nezu Museum Garden Friday, May 13, 2011 20
  • 21. stuart.weibel@gmail.com http://weibel-lines.typepad.com @stuartweibel on twitter stuartweibel on Facebook all photographs by the author Image: Lantern overlooking the Irises in the Nezu Museum Garden Friday, May 13, 2011 21