eBooks: Why they break ISBNs
Stuart Yeates
http://www.nzetc.org/
Digital Publishing
It's different from print publishing
Who we are
● Unit of the Victoria University library
● Digital (re)publisher of documents used in
teaching, learning and research
● TEI/XML, tomcat/cocoon/XSLT
● Out-sourced digitisation
● In-house authority control
Demo 1
Search “ture pooti”
Demo 2
Search “William Williams”
Demo 3
Search “Robin Hyde”
ePubs
● Open standard for eBooks
● A zip file of all the same stuff you can put on a
static website
● DAISY metadata for navigation
● XHTML, CSS, etc
● We create ePubs by crawling our website
● The device, not the page, does navigation
● grep dimensioned measurements from CSS
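Two of the points above (an ePub is a zip file, and dimensioned measurements can be grepped out of its CSS) can be sketched together. This is only an illustration, not NZETC's actual tooling: the file names, the sample CSS, the function names and the choice of which units count as "dimensioned" (px, pt, pc, cm, mm, in) are all assumptions, and the sample archive is deliberately minimal rather than a valid ePub.

```python
import io
import re
import zipfile

# Assumed reading of "dimensioned": lengths in absolute units.
# Relative units such as em or % are deliberately not matched.
DIMENSIONED = re.compile(r"(?<![\w.#-])\d*\.?\d+(?:px|pt|pc|cm|mm|in)\b")


def build_sample_epub():
    """Build a tiny ePub-like zip in memory (hypothetical content, not a valid ePub)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("mimetype", "application/epub+zip")
        zf.writestr(
            "OEBPS/style.css",
            "img.plate { width: 600px; height: 4.5in; }\n"
            "p { text-indent: 1.5em; margin-top: 12pt; }\n",
        )
        zf.writestr("OEBPS/chapter1.xhtml", "<html><body><p>Text</p></body></html>")
    return buf.getvalue()


def grep_css_dimensions(epub_bytes):
    """Return (member name, line number, match) for each absolute length in any CSS file."""
    hits = []
    with zipfile.ZipFile(io.BytesIO(epub_bytes)) as zf:
        for name in zf.namelist():
            if not name.endswith(".css"):
                continue
            css = zf.read(name).decode("utf-8")
            for lineno, line in enumerate(css.splitlines(), start=1):
                for match in DIMENSIONED.finditer(line):
                    hits.append((name, lineno, match.group(0)))
    return hits


hits = grep_css_dimensions(build_sample_epub())
for name, lineno, value in hits:
    print(f"{name}:{lineno}: {value}")
```

In this sample the 1.5em indent is not reported, since relative units adapt to the reading device, while the pixel, inch and point values are flagged.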
ISBNs
● Widely used in the print world to track editions
● Issued to publishers by a bureaucracy
● Used end-to-end in supply chain
● Printing, warehousing, distribution,
wholesaling, retailing, purchase, cataloging,
circulation, …
Print Runs
● 99% of the time in traditional publishing ISBNs
are print run identifiers
● Print runs are extraordinarily expensive
● Print runs are a speculative gamble on the part
of publishers
● Print runs have no direct analogue in the pure-
digital model
What's an edition?
● Correcting a single-character OCR error?
● Authority control change in body?
● Authority control change in metadata?
● Decreasing image quality?
● Increasing image quality?
● Factual corrections?
What's an edition?
● It doesn't matter because all non-commercial
ePubs are “digital photocopies” and don't
qualify for ISBNs anyway.
What kind of identifier do we need?
Free of bureaucracy
● Arguments about what a “book” / “eBook” is
● Arguments about what an “edition” is
● Arguments about jurisdiction (cloud, ISO, etc)
● Baked-in assumptions about who produces
what, why and for whom
● $$$ to support
Enormously plentiful
● Many more things appear to qualify as eBooks
than books
● ISBNs are being reused
● Versions / updates
● NZETC: 1300 works x regenerated monthly
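The scale behind the NZETC figure can be worked through in a few lines. This is a back-of-the-envelope sketch that assumes (the slide does not say so) that every monthly regeneration of every work would need its own identifier; the function name is hypothetical.

```python
def identifiers_per_year(works, regenerations_per_year):
    """Identifiers consumed in a year if each regeneration of each work gets its own."""
    return works * regenerations_per_year


nzetc_works = 1300            # figure from the slide
regenerations_per_year = 12   # "regenerated monthly"

print(identifiers_per_year(nzetc_works, regenerations_per_year))  # 15600
```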
Naïve hashes insufficient
● “Use a hash of the ePub as the identifier”
● Needs to be an identifier, not the identifier
● The identifier can't be used within the ePub
● Many tools in the tool chain alter the ePub
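The last two points can be demonstrated with a small sketch. It assumes SHA-256 as the hash and uses hypothetical file names and timestamps. It shows that repackaging identical content with a different zip timestamp changes the hash, and that writing the hash into the ePub produces a file whose hash no longer matches it.

```python
import hashlib
import io
import zipfile


def make_epub(files, timestamp):
    """Zip the given {name: text} mapping, stamping every member with `timestamp`."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        for name, text in files.items():
            info = zipfile.ZipInfo(name, date_time=timestamp)
            zf.writestr(info, text)
    return buf.getvalue()


def sha256_hex(data):
    return hashlib.sha256(data).hexdigest()


# Hypothetical, deliberately minimal content (not a valid ePub).
content = {
    "mimetype": "application/epub+zip",
    "OEBPS/chapter1.xhtml": "<p>Same text in every build</p>",
}

# 1. A tool that only re-zips the file (new timestamps) changes the hash.
january = make_epub(content, (2010, 1, 1, 0, 0, 0))
february = make_epub(content, (2010, 2, 1, 0, 0, 0))
rezip_changes_hash = sha256_hex(january) != sha256_hex(february)

# 2. Embedding the hash inside the ePub changes the bytes being hashed,
#    so the embedded value no longer identifies the file that carries it.
identifier = sha256_hex(january)
stamped = dict(content)
stamped["OEBPS/identifier.txt"] = identifier
stamped_epub = make_epub(stamped, (2010, 1, 1, 0, 0, 0))
embedded_id_still_matches = sha256_hex(stamped_epub) == identifier

print(rezip_changes_hash)         # True
print(embedded_id_still_matches)  # False
```

A hash over the extracted, normalised content rather than the zip bytes would sidestep the first problem but not the second; that variation is outside what the slides propose.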
Questions
● Does a bookseller's sticker on a book make it a
“different” book?
● Does an author's signature?
● Does the intended market?
