SlideShare una empresa de Scribd logo
1 de 20
The Differences Problem
Or why consistency in metadata is critical in the discovery process
Shana L. McDanold
First A few caveats…
2
Inthenotso
distantpast…
There were two main options when searching for ebooks:
1. Search each individual vendor’s website/database
2. Load MARC records (one record for each title) into the
catalog for each vendor
3
Inthenotso
distantpast…
Problems with this approach:
 Loading records is a LOT of work and requires regular
maintenance
 Massaging/editing/enhancing metadata; loading;
updates; replacements; deletes
 Number of records/titles to load
 Lack of records available for loading
 Records come from numerous places and each vendor
requires a different procedure to download files
 Tracking titles in multiple places (duplicate work)
4
Now:more
options…
1. Search each individual vendor’s website/database
2. Load MARC records (one record for each title) into the
catalog for each vendor
3. Integration of various vendors metadata into
discovery layers via APIs and linked data rather than
importing records into the catalog
4. Federated search tools that index multiple databases
(e.g. unified index search tools)
…but are more options better?
5
Thegoodand
thebad
GOOD:
 fewer places to search (possibly even only one)
 most public libraries, while they have other ebook
databases, will have a single integrated discovery layer
BAD:
 MORE places to search
BUT discovery is still a challenge no matter which search
option you choose, and those challenges are centered
around:
METADATA
6
Printbook
7
Ebook
8
Differences?
 ISBN
 Subjects
 Title
 Author
 Date
9
Printbook
10
Ebook
11
Differences?
 ISBN
 Subjects
 Title
 Author
 Date
12
Printbook
13
Ebook
14
Differences?
 ISBN
 Subjects
 Title
 Author
 Date
15
Differences
defined
 Differences in description
 Current vs past rules and guidelines;
 RDA provider neutral vs individual vendor records
 Differences between vendors for same title
 Differences in how data is entered/presented
 Record proliferation
 Related to metadata differences: records cannot be
“collapsed” because the discovery layer doesn’t recognize
them as the same
 Different vocabularies and identity databases
16
More
differences
 Missing metadata/missing records
 Data changes/updates
 Branding or custom text/collections
17
Whydothese
differences
matter?
 How people search
 Keyword - forces dependency on keyword indexes
 Follow links - if you click on the subject search for
Obama, Michelle, search results include only print books
(no ebooks)
 Limits/facets - dependent on metadata, both visible
and invisible (coded)
 Missing metadata
 Discovery layer exposes ALL the metadata (good, bad,
missing)
All means items get “hidden” because they’re not
findable.
18
How dowefix
it?
 CONSISTENCY
 use of controlled vocabularies and existing authority
databases (name matching, subjects, etc.)
 Use existing metadata sources
 Follow standards and recommended/best practices
 Communication
 Data points
 complete
 consistency across vendors
19
Questions?
20

Más contenido relacionado

La actualidad más candente

Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...
Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...
Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...Crossref
 
SharePoint Folders vs. Metadata Best Practices
SharePoint Folders vs. Metadata Best PracticesSharePoint Folders vs. Metadata Best Practices
SharePoint Folders vs. Metadata Best PracticesChris Woodill
 
crossmark update
crossmark updatecrossmark update
crossmark updateCrossref
 
#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit
#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit
#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEditTerry Reese
 
Fitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystemFitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystemTerry Reese
 
Participation reports webinar December 2020
Participation reports webinar December 2020Participation reports webinar December 2020
Participation reports webinar December 2020Crossref
 
Encore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingEncore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingLaura Kohl
 
Folders vs. Metadata: SharePoint Engage Oct. 20, 2015
Folders vs. Metadata: SharePoint Engage Oct. 20, 2015Folders vs. Metadata: SharePoint Engage Oct. 20, 2015
Folders vs. Metadata: SharePoint Engage Oct. 20, 2015Donna Rodriguez
 
Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010Agnes Molnar
 
Preparing Catalogers for Linked data
Preparing Catalogers for Linked dataPreparing Catalogers for Linked data
Preparing Catalogers for Linked dataTerry Reese
 
Participation reports webinar November 2020
Participation reports webinar November 2020Participation reports webinar November 2020
Participation reports webinar November 2020Crossref
 
Tactical Fingerprinting using metadata, hidden info and lost data
Tactical Fingerprinting using metadata, hidden info and lost dataTactical Fingerprinting using metadata, hidden info and lost data
Tactical Fingerprinting using metadata, hidden info and lost dataChema Alonso
 
Database poll results
Database poll resultsDatabase poll results
Database poll resultsStephen Abram
 

La actualidad más candente (16)

MS Access 2010 tutorial 1
MS Access 2010 tutorial 1MS Access 2010 tutorial 1
MS Access 2010 tutorial 1
 
Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...
Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...
Crossref LIVE Indonesia: The Value and Use of Crossref Metadata, CRLIVE-ID 15...
 
SharePoint Folders vs. Metadata Best Practices
SharePoint Folders vs. Metadata Best PracticesSharePoint Folders vs. Metadata Best Practices
SharePoint Folders vs. Metadata Best Practices
 
crossmark update
crossmark updatecrossmark update
crossmark update
 
#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit
#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit
#mashcat: Evolving MarcEdit: Leveraging Semantic Data in MarcEdit
 
Ms access 2010
Ms access 2010Ms access 2010
Ms access 2010
 
Fitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystemFitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystem
 
Participation reports webinar December 2020
Participation reports webinar December 2020Participation reports webinar December 2020
Participation reports webinar December 2020
 
Encore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingEncore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual Meeting
 
Folders vs. Metadata: SharePoint Engage Oct. 20, 2015
Folders vs. Metadata: SharePoint Engage Oct. 20, 2015Folders vs. Metadata: SharePoint Engage Oct. 20, 2015
Folders vs. Metadata: SharePoint Engage Oct. 20, 2015
 
Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010Best Practices for Organizing Documents in SharePoint 2010
Best Practices for Organizing Documents in SharePoint 2010
 
Preparing Catalogers for Linked data
Preparing Catalogers for Linked dataPreparing Catalogers for Linked data
Preparing Catalogers for Linked data
 
Using RefWorks to Manage Your Literature
Using RefWorks to Manage Your LiteratureUsing RefWorks to Manage Your Literature
Using RefWorks to Manage Your Literature
 
Participation reports webinar November 2020
Participation reports webinar November 2020Participation reports webinar November 2020
Participation reports webinar November 2020
 
Tactical Fingerprinting using metadata, hidden info and lost data
Tactical Fingerprinting using metadata, hidden info and lost dataTactical Fingerprinting using metadata, hidden info and lost data
Tactical Fingerprinting using metadata, hidden info and lost data
 
Database poll results
Database poll resultsDatabase poll results
Database poll results
 

Similar a Differences Problem: or why consistency in metadata is critical in the discovery process

IA Summit 09 - User Interfaces with Metasearch Capabilities
IA Summit 09 - User Interfaces with Metasearch CapabilitiesIA Summit 09 - User Interfaces with Metasearch Capabilities
IA Summit 09 - User Interfaces with Metasearch Capabilitiesguestbc914e
 
Kbart Update ALA Midwinter 2010
Kbart Update ALA Midwinter 2010Kbart Update ALA Midwinter 2010
Kbart Update ALA Midwinter 2010Jason Price, PhD
 
KBART ALA Midwinter 2010 Update
KBART ALA Midwinter 2010 UpdateKBART ALA Midwinter 2010 Update
KBART ALA Midwinter 2010 UpdateJason Price, PhD
 
Building A Digital Ref Collection
Building A Digital Ref CollectionBuilding A Digital Ref Collection
Building A Digital Ref Collectiondeborah katz
 
IWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise ItIWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise ItIWMW
 
Hearst Faceted Metadata for Site Navigation and Search
Hearst Faceted Metadata for Site Navigation and SearchHearst Faceted Metadata for Site Navigation and Search
Hearst Faceted Metadata for Site Navigation and Search灿辉 葛
 
Many flavors of linked data
Many flavors of linked dataMany flavors of linked data
Many flavors of linked dataDebra Shapiro
 
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Inc
 
Information Systems For Business and BeyondChapter 4Data a.docx
Information Systems For Business and BeyondChapter 4Data a.docxInformation Systems For Business and BeyondChapter 4Data a.docx
Information Systems For Business and BeyondChapter 4Data a.docxjaggernaoma
 
Establishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBEstablishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBnw13
 
Relational database concept and technology
Relational database concept and technologyRelational database concept and technology
Relational database concept and technologyDucat
 
Sorting & Extracting Data
Sorting & Extracting DataSorting & Extracting Data
Sorting & Extracting Datamary_ramsay
 
Using metadata repositories with search
Using metadata repositories with searchUsing metadata repositories with search
Using metadata repositories with searchJean Graef
 
Being an independent & assertive learner 2
Being an independent & assertive learner 2Being an independent & assertive learner 2
Being an independent & assertive learner 2SaKuchi Saku
 

Similar a Differences Problem: or why consistency in metadata is critical in the discovery process (20)

IA Summit 09 - User Interfaces with Metasearch Capabilities
IA Summit 09 - User Interfaces with Metasearch CapabilitiesIA Summit 09 - User Interfaces with Metasearch Capabilities
IA Summit 09 - User Interfaces with Metasearch Capabilities
 
Kbart Update ALA Midwinter 2010
Kbart Update ALA Midwinter 2010Kbart Update ALA Midwinter 2010
Kbart Update ALA Midwinter 2010
 
KBART ALA Midwinter 2010 Update
KBART ALA Midwinter 2010 UpdateKBART ALA Midwinter 2010 Update
KBART ALA Midwinter 2010 Update
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
 
Building A Digital Ref Collection
Building A Digital Ref CollectionBuilding A Digital Ref Collection
Building A Digital Ref Collection
 
IWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise ItIWMW 2002: The Value of Metadata and How to Realise It
IWMW 2002: The Value of Metadata and How to Realise It
 
Metadata : Concentrating on the data, not on the scheme
Metadata : Concentrating on the data, not on the schemeMetadata : Concentrating on the data, not on the scheme
Metadata : Concentrating on the data, not on the scheme
 
Hearst Faceted Metadata for Site Navigation and Search
Hearst Faceted Metadata for Site Navigation and SearchHearst Faceted Metadata for Site Navigation and Search
Hearst Faceted Metadata for Site Navigation and Search
 
Metadata
MetadataMetadata
Metadata
 
Many flavors of linked data
Many flavors of linked dataMany flavors of linked data
Many flavors of linked data
 
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
 
Webinar: Ditching File Shares For SharePoint Metadata
Webinar: Ditching File Shares For SharePoint MetadataWebinar: Ditching File Shares For SharePoint Metadata
Webinar: Ditching File Shares For SharePoint Metadata
 
Information Systems For Business and BeyondChapter 4Data a.docx
Information Systems For Business and BeyondChapter 4Data a.docxInformation Systems For Business and BeyondChapter 4Data a.docx
Information Systems For Business and BeyondChapter 4Data a.docx
 
Establishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBEstablishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNB
 
Relational database concept and technology
Relational database concept and technologyRelational database concept and technology
Relational database concept and technology
 
Sorting & Extracting Data
Sorting & Extracting DataSorting & Extracting Data
Sorting & Extracting Data
 
Payton Eliminating Conflicts in Ebook Metadata
Payton Eliminating Conflicts in Ebook MetadataPayton Eliminating Conflicts in Ebook Metadata
Payton Eliminating Conflicts in Ebook Metadata
 
Using metadata repositories with search
Using metadata repositories with searchUsing metadata repositories with search
Using metadata repositories with search
 
A theory of Metadata enriching & filtering
A theory of  Metadata enriching & filteringA theory of  Metadata enriching & filtering
A theory of Metadata enriching & filtering
 
Being an independent & assertive learner 2
Being an independent & assertive learner 2Being an independent & assertive learner 2
Being an independent & assertive learner 2
 

Más de Shana McDanold

LODLAM Landscape NOTES
LODLAM Landscape NOTESLODLAM Landscape NOTES
LODLAM Landscape NOTESShana McDanold
 
Heretical Metadata: Abandoning Perfection in the Digital Age
Heretical Metadata: Abandoning Perfection in the Digital AgeHeretical Metadata: Abandoning Perfection in the Digital Age
Heretical Metadata: Abandoning Perfection in the Digital AgeShana McDanold
 
All About Access Points in RDA
All About Access Points in RDAAll About Access Points in RDA
All About Access Points in RDAShana McDanold
 
It's All About the Metadata
It's All About the MetadataIt's All About the Metadata
It's All About the MetadataShana McDanold
 
Importance of teaching cataloging theory and conceptual models of discovery s...
Importance of teaching cataloging theory and conceptual models of discovery s...Importance of teaching cataloging theory and conceptual models of discovery s...
Importance of teaching cataloging theory and conceptual models of discovery s...Shana McDanold
 
RDA for Original Catalogers
RDA for Original CatalogersRDA for Original Catalogers
RDA for Original CatalogersShana McDanold
 
RDA and Editing Bibliographic Records
RDA and Editing Bibliographic RecordsRDA and Editing Bibliographic Records
RDA and Editing Bibliographic RecordsShana McDanold
 
Impact of RDA on Serials Cataloging
Impact of RDA on Serials CatalogingImpact of RDA on Serials Cataloging
Impact of RDA on Serials CatalogingShana McDanold
 
RDA from Scratch for Catalogers
RDA from Scratch for CatalogersRDA from Scratch for Catalogers
RDA from Scratch for CatalogersShana McDanold
 

Más de Shana McDanold (10)

LODLAM Landscape
LODLAM LandscapeLODLAM Landscape
LODLAM Landscape
 
LODLAM Landscape NOTES
LODLAM Landscape NOTESLODLAM Landscape NOTES
LODLAM Landscape NOTES
 
Heretical Metadata: Abandoning Perfection in the Digital Age
Heretical Metadata: Abandoning Perfection in the Digital AgeHeretical Metadata: Abandoning Perfection in the Digital Age
Heretical Metadata: Abandoning Perfection in the Digital Age
 
All About Access Points in RDA
All About Access Points in RDAAll About Access Points in RDA
All About Access Points in RDA
 
It's All About the Metadata
It's All About the MetadataIt's All About the Metadata
It's All About the Metadata
 
Importance of teaching cataloging theory and conceptual models of discovery s...
Importance of teaching cataloging theory and conceptual models of discovery s...Importance of teaching cataloging theory and conceptual models of discovery s...
Importance of teaching cataloging theory and conceptual models of discovery s...
 
RDA for Original Catalogers
RDA for Original CatalogersRDA for Original Catalogers
RDA for Original Catalogers
 
RDA and Editing Bibliographic Records
RDA and Editing Bibliographic RecordsRDA and Editing Bibliographic Records
RDA and Editing Bibliographic Records
 
Impact of RDA on Serials Cataloging
Impact of RDA on Serials CatalogingImpact of RDA on Serials Cataloging
Impact of RDA on Serials Cataloging
 
RDA from Scratch for Catalogers
RDA from Scratch for CatalogersRDA from Scratch for Catalogers
RDA from Scratch for Catalogers
 

Último

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 

Último (20)

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 

Differences Problem: or why consistency in metadata is critical in the discovery process

  • 1. The Differences Problem Or why consistency in metadata is critical in the discovery process Shana L. McDanold
  • 2. First A few caveats… 2
  • 3. Inthenotso distantpast… There were two main options when searching for ebooks: 1. Search each individual vendor’s website/database 2. Load MARC records (one record for each title) into the catalog for each vendor 3
  • 4. Inthenotso distantpast… Problems with this approach:  Loading records is a LOT of work and requires regular maintenance  Massaging/editing/enhancing metadata; loading; updates; replacements; deletes  Number of records/titles to load  Lack of records available for loading  Records come from numerous places and each vendor requires a different procedure to download files  Tracking titles in multiple places (duplicate work) 4
  • 5. Now:more options… 1. Search each individual vendor’s website/database 2. Load MARC records (one record for each title) into the catalog for each vendor 3. Integration of various vendors metadata into discovery layers via APIs and linked data rather than importing records into the catalog 4. Federated search tools that index multiple databases (e.g. unified index search tools) …but are more options better? 5
  • 6. Thegoodand thebad GOOD:  fewer places to search (possibly even only one)  most public libraries, while they have other ebook databases, will have a single integrated discovery layer BAD:  MORE places to search BUT discovery is still a challenge no matter which search option you choose, and those challenges are centered around: METADATA 6
  • 9. Differences?  ISBN  Subjects  Title  Author  Date 9
  • 12. Differences?  ISBN  Subjects  Title  Author  Date 12
  • 15. Differences?  ISBN  Subjects  Title  Author  Date 15
  • 16. Differences defined  Differences in description  Current vs past rules and guidelines;  RDA provider neutral vs individual vendor records  Differences between vendors for same title  Differences in how data is entered/presented  Record proliferation  Related to metadata differences: records cannot be “collapsed” because the discovery layer doesn’t recognize them as the same  Different vocabularies and identity databases 16
  • 17. More differences  Missing metadata/missing records  Data changes/updates  Branding or custom text/collections 17
  • 18. Whydothese differences matter?  How people search  Keyword - forces dependency on keyword indexes  Follow links - if you click on the subject search for Obama, Michelle, search results include only print books (no ebooks)  Limits/facets - dependent on metadata, both visible and invisible (coded)  Missing metadata  Discovery layer exposes ALL the metadata (good, bad, missing) All means items get “hidden” because they’re not findable. 18
  • 19. How dowefix it?  CONSISTENCY  use of controlled vocabularies and existing authority databases (name matching, subjects, etc.)  Use existing metadata sources  Follow standards and recommended/best practices  Communication  Data points  complete  consistency across vendors 19

Notas del editor

  1. Usually differences are a GOOD thing, providing diversity; but not in this case Caveat: speaking from a public library perspective mainly; although most of the issues public libraries have are present in academic environments; differences are resource types and focus on currency/popularity of materials (collection is more ephemeral than permanent) BUT my background is serials and nonprint format cataloging – been dealing with managing metadata/cataloging for ejournals and ebooks for almost 2 decades now My philosophy: job of cataloging/metadata is to make stuff findable, which includes unique identification of resources I don’t believe in the “perfect” record If it’s not wrong, leave it alone (don’t delete data, just exclude it from indexes…you may want it in the future) When editing: Fix errors or delete if wrong Add access points Enhance content/description (add value) Make it pretty
  2. Number of vendors increased – more complex  more time Each vendor: different procedure for downloading; different edits (some need proxy added, some don’t); files may be in various formats and require conversion to MARC Tools to help streamline (MarcEdit – TASK LISTS saving the edits for each vendor are a savior) BUT still very time consuming Multiple places: ERM and the Catalog and possibly the vendor website – have to keep in sync
  3. Looking at a single search option for ebooks and print books, where an API is used to search both ebook vendor and the catalog in one search So lets look at examples – examples are current popular titles or authors
  4. Who’s watching the show on Netflix?
  5. ISBN: this is often a key match point for OpenURL resolvers or other API/linked data tools Title: ebook version is incomplete Author: translator is missing, an issue when looking for a specific translation or if searching by translator name Date format – indexing issue – how does your system handle dates?
  6. ISBN: this is often a key match point for OpenURL resolvers or other API/linked data tools Title: ebook version is incomplete Author: indexing issues; identity management/authority control issues Date format – indexing issue – how does your system handle dates?
  7. ISBN: this is often a key match point for OpenURL resolvers or other API/linked data tools Subject: where’s DC?? Title: ebook version is different Author: indexing issues; identity management/authority control issues Date format – indexing issue – how does your system handle dates? Do you see a trend yet?
  8. Description: AACR2 vs RDA – fundamental change in how you approach describing a resource Provider neutral – one records for ALL online versions of a title (formats, platform, etc.) – just have multiple links/URLs to various options; Hard to do that with APIs/linked data tools Date format, author format (last, first or first last?) Proliferation: more vendors = more records We get patron complaints about ebook display all the time Different vocabularies and identity databases – name formats, subjects, locations, etc.  Creates indexing and filing issues; split indexes
  9. Missing: sometimes records just don’t appear – API/linked data tool errors, delays, Data changes: records get “out of sync” – print book may be complete but ebook is still minimal/prepublication Branding: can’t add custom text to create collections, or other data to ebook records; limits to control over display and what data is included – stuck with what the vendor sends/makes available
  10. Forcing dependency on keyword indexing or indexing of the WHOLE records – specific author indexes, etc. become not useful How people search: Subjects/identities – FORM matters “see also” Collections Links – find something the want/like, follow links to “similar” or “like” items using subjects, authors, etc. (internet rabbit hole…) Limits/facets – such as format, publication date, location, etc. Missing metadata – subjects, ISBN, names, locations, etc.; lose match points; may result in records not appearing – search ISBN and the ebooks don’t show up Discovery layers – good at exposing EVERYTHING (great way to identify database cleanup projects…)
  11. Communication – between libraries and vendors Data points – more is better, even if they don’t display