SlideShare una empresa de Scribd logo
1 de 21
JPEG 2000 at the Wellcome
Library
Christy Henshaw
Digitisation Programme Manager – Wellcome Library
JP2 Summit
12-13 May 2011
Library of Congress
The Wellcome Trust
•A global charitable foundation
•Achieving extraordinary improvements in human and animal health
•Supporting the brightest minds in biomedical research and the
medical humanities
•Exploring medicine in historical and cultural contexts
The Wellcome Library
The Wellcome Library
Collections of books, manuscripts, archives, films and pictures on the
history of medicine from the earliest times to the present day .
The Wellcome Digital Library pilot,
2010-2013
Genetics and its Modern Foundations
A new online resource for everyone interested in the history of
human and animal health.

Aims
• build sustainable/expandable mechanism – foundation stone
for WDL
• digitise key library holdings - relating to a major Trust
challenge area
• digitise important third party content – linked to theme
• use innovative content and tools – to encourage discovery and
use
• explore commercial partnerships – enhance access to nontheme material
JPEG 2000 conversion – scope
•Wellcome Images – image library, legacy images, 300,000
images in the archive
•Current projects – pilot digitisation projects, 7m images 2010 2014

• Long-term plans – digitisation of large proportion of our
collections (mainly special collections), 15m – 25m images 2014
and beyond
Type of content
•Printed books – early printed books, modern books
(monographs), pamphlets, reports
•Archives – personal papers, institutional papers, unpublished
works, mostly 20th century
• Manuscripts – unpublished, handwritten “manuscript books” and
related materials, mostly 17th, 18th and 19th century, can be fragile

• Artworks – prints, paintings, posters, drawings, glass slides, etc.
The Francis Crick Archive
Books related to genetic research
Early printed books
Artworks, manuscripts
Decision to adopt JP2
JPEG 2000 was found to answer the following needs:
•Storage costs –20/30m TIFFs stored on online, backed-up
storage = multiple petabytes. Needed something cost-effective.
•Quality – needed a high-quality compressed format that would
cover a wide range of content types.
• Robustness – needed a well-established image format with a
high chance of long-term support.
• Practical – feasible to use in a Library digitisation workflow.
Finding our way
Working with JP2 opened up a whole new world – reading
specifications, finding conversion software, so many choices.

Commissioned the report:
JPEG 2000 as a Preservation and Access Format for the Wellcome Trust Libr
Goal to find a single version of JPEG 2000 that would meet the
needs of both long-term preservation and flexible delivery needs.
The result
Parameter

Settings

File format

Part 1 (.jp2)

Compression

Lossy (6:1, 10:1)

Tiling

1024 x 1024

Progression order

RLCP

Decomp levels

5

Quality layers

8

Code block size

6, 64x64

Regions of interest

No

TLM markers

Yes

Bypass

N/A
Embedding JP2
Chose LuraWave command line tool
• Some issues (bugs, or inconvenient implementations) arose, and
all have been successfully addressed by LuraTech
• Created a firm consensus to use JP2 as the format for all stillimage digital imaging (with one or two exceptions)
• No plans to use JP2 for digital video – but never say never
• Internal information sharing – digital archivists, systems
administrators, IT department, programme board members
• External communication and networking
Current status, future plans
• Conversion of all new digital images is now carried out as
standard
• Nearing the final stages of a project to convert 450k image
backlog to JP2 (reducing current footprint from 20 Tb to 5.5 Tb)
• Large projects use lossy JP2, legacy picture library uses lossless
• Developed a strategy to determine compression levels
• Currently using the GUI, but will use the command line interface
with our new workflow system, streamlining conversion and QA
• Medium term, will look at automating compression level selection
Quality control for compression
• Visual inspection
• Color shifts, loss of detail, halo effects, pixelation, blurring, etc.
• Collection-based, representative sample
• Test range of compressions with intervals such as 2:1, 4:1, 6:1
• Once artefacts are discovered, step back to previous
compression ratio
• Worst-performing image rules, for any particular collection
• Efficient for homogenous collections – less so for heterogenous
collections with wide variety of content
• Archives particularly difficult – black and white compresses very
well – colour drawings and photographs, not so well
Establishing the JP2K-UK group
• Unknown who in the UK were using JPEG 2000, or considering it
• Unknown who was even interested in JPEG 2000
• No one wants to work in a vacuum…
• Discovered a high level of interest: British Library, The National
Archives, Oxford, King’s College London, Cambridge and
Southampton Universities, Digital Preservation Coalition,
commercial companies/consultants
• Loose affiliation of the like-minded – a user group
Remit of the JP2K-UK group
• Initial meeting in December 2009
• Everyone had a little knowledge – no one knew enough
• Agreed the need to approach JP2 implementation from
practitioner’s point of view
• “Practitioner” meaning those who manage digital imaging
strategies and implementation
• Agreed need to share information and collaborate
• Discussed ideas for a conference, and creating some guidelines
for the user community
• Wellcome encouraged to write a blog about specific experiences
working with JP2
Ouputs
• JPEG 2000 Seminar, held in London in November 2010
> 80 attendees
> UK and European speakers and delegates
> mostly non-technical audience
• Advocacy for practitioner’s needs
> discussing and airing the needs and concerns of
practitioners has influenced software developers, and even the
JPEG Committee
> JPEG

2000 at the Wellcome Library blog
www.jpeg2000wellcomelibrary.blogspot.com
Future plans for JP2K-UK
• Guidance for practitioners
> Human readable
> Focus on practicalities
> Enable practitioners to make informed choices
> Advice on implementation
• Community building
> Case studies
> Lessons learned
> Networking (nationally and internationally)

Más contenido relacionado

Similar a Jpeg2000 at Wellcome Library

Newman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property ManagementNewman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property ManagementAlan Newman
 
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2Johannes Phaladi
 
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2Johannes Phaladi
 
Of Communities and Practices: Digital Preservation Innovation & Research
Of Communities  and Practices: Digital Preservation Innovation & ResearchOf Communities  and Practices: Digital Preservation Innovation & Research
Of Communities and Practices: Digital Preservation Innovation & ResearchErwin Verbruggen
 
Digitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital ProgramDigitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital ProgramRobert Frech
 
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...FIAT/IFTA
 
Evolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of MedicineEvolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of MedicineJohn Rees
 
20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journals20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journalsNeil Beagrie
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
 
Digital projects best practices [xxxiii reunión nacional de archivos 201111]
Digital projects best practices [xxxiii reunión nacional de archivos 201111]Digital projects best practices [xxxiii reunión nacional de archivos 201111]
Digital projects best practices [xxxiii reunión nacional de archivos 201111]Frederick Zarndt
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyArchiver
 
Research in the digital age - circa 2005
Research in the digital age - circa 2005Research in the digital age - circa 2005
Research in the digital age - circa 2005Larry Naukam
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12ASIS&T
 
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)EDINA, University of Edinburgh
 
Constructing bottomup
Constructing bottomupConstructing bottomup
Constructing bottomupAlex Hardisty
 
SCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/BelgiumSCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/BelgiumSven Schlarb
 
Digitising Hansard
Digitising HansardDigitising Hansard
Digitising HansardALISS
 
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with ArchivematicaJenny Mitcham
 
Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...UCD Library
 

Similar a Jpeg2000 at Wellcome Library (20)

Newman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property ManagementNewman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property Management
 
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2
 
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2
 
Of Communities and Practices: Digital Preservation Innovation & Research
Of Communities  and Practices: Digital Preservation Innovation & ResearchOf Communities  and Practices: Digital Preservation Innovation & Research
Of Communities and Practices: Digital Preservation Innovation & Research
 
Digitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital ProgramDigitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital Program
 
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
 
Ariadne overview
Ariadne overviewAriadne overview
Ariadne overview
 
Evolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of MedicineEvolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of Medicine
 
20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journals20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journals
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
 
Digital projects best practices [xxxiii reunión nacional de archivos 201111]
Digital projects best practices [xxxiii reunión nacional de archivos 201111]Digital projects best practices [xxxiii reunión nacional de archivos 201111]
Digital projects best practices [xxxiii reunión nacional de archivos 201111]
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and Ceremony
 
Research in the digital age - circa 2005
Research in the digital age - circa 2005Research in the digital age - circa 2005
Research in the digital age - circa 2005
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
 
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)
 
Constructing bottomup
Constructing bottomupConstructing bottomup
Constructing bottomup
 
SCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/BelgiumSCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/Belgium
 
Digitising Hansard
Digitising HansardDigitising Hansard
Digitising Hansard
 
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica
 
Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...
 

Más de Wellcome Library

ProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveWellcome Library
 
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wellcome Library
 
Doing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisationDoing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisationWellcome Library
 
Systems and Processes: making order out of chaos
Systems and Processes: making order out of chaosSystems and Processes: making order out of chaos
Systems and Processes: making order out of chaosWellcome Library
 
Copyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome LibraryCopyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome LibraryWellcome Library
 
Systems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling offSystems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling offWellcome Library
 
Digitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryDigitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryWellcome Library
 
How will history remember you…?
How will history remember you…?How will history remember you…?
How will history remember you…?Wellcome Library
 
Conservation for Digitisation
Conservation for DigitisationConservation for Digitisation
Conservation for DigitisationWellcome Library
 
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryCopyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryWellcome Library
 
Mandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustMandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustWellcome Library
 

Más de Wellcome Library (12)

ProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner Perspective
 
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9
 
Doing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisationDoing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisation
 
Systems and Processes: making order out of chaos
Systems and Processes: making order out of chaosSystems and Processes: making order out of chaos
Systems and Processes: making order out of chaos
 
Copyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome LibraryCopyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome Library
 
Systems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling offSystems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling off
 
Digitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryDigitisation Projects at Wellcome Library
Digitisation Projects at Wellcome Library
 
How will history remember you…?
How will history remember you…?How will history remember you…?
How will history remember you…?
 
Image Capture
Image CaptureImage Capture
Image Capture
 
Conservation for Digitisation
Conservation for DigitisationConservation for Digitisation
Conservation for Digitisation
 
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryCopyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
 
Mandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustMandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome Trust
 

Último

Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Último (20)

Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

Jpeg2000 at Wellcome Library

  • 1. JPEG 2000 at the Wellcome Library Christy Henshaw Digitisation Programme Manager – Wellcome Library JP2 Summit 12-13 May 2011 Library of Congress
  • 2. The Wellcome Trust •A global charitable foundation •Achieving extraordinary improvements in human and animal health •Supporting the brightest minds in biomedical research and the medical humanities •Exploring medicine in historical and cultural contexts
  • 4. The Wellcome Library Collections of books, manuscripts, archives, films and pictures on the history of medicine from the earliest times to the present day .
  • 5. The Wellcome Digital Library pilot, 2010-2013 Genetics and its Modern Foundations A new online resource for everyone interested in the history of human and animal health. Aims • build sustainable/expandable mechanism – foundation stone for WDL • digitise key library holdings - relating to a major Trust challenge area • digitise important third party content – linked to theme • use innovative content and tools – to encourage discovery and use • explore commercial partnerships – enhance access to nontheme material
  • 6. JPEG 2000 conversion – scope •Wellcome Images – image library, legacy images, 300,000 images in the archive •Current projects – pilot digitisation projects, 7m images 2010 2014 • Long-term plans – digitisation of large proportion of our collections (mainly special collections), 15m – 25m images 2014 and beyond
  • 7. Type of content •Printed books – early printed books, modern books (monographs), pamphlets, reports •Archives – personal papers, institutional papers, unpublished works, mostly 20th century • Manuscripts – unpublished, handwritten “manuscript books” and related materials, mostly 17th, 18th and 19th century, can be fragile • Artworks – prints, paintings, posters, drawings, glass slides, etc.
  • 9. Books related to genetic research
  • 12. Decision to adopt JP2 JPEG 2000 was found to answer the following needs: •Storage costs –20/30m TIFFs stored on online, backed-up storage = multiple petabytes. Needed something cost-effective. •Quality – needed a high-quality compressed format that would cover a wide range of content types. • Robustness – needed a well-established image format with a high chance of long-term support. • Practical – feasible to use in a Library digitisation workflow.
  • 13. Finding our way Working with JP2 opened up a whole new world – reading specifications, finding conversion software, so many choices. Commissioned the report: JPEG 2000 as a Preservation and Access Format for the Wellcome Trust Libr Goal to find a single version of JPEG 2000 that would meet the needs of both long-term preservation and flexible delivery needs.
  • 14. The result Parameter Settings File format Part 1 (.jp2) Compression Lossy (6:1, 10:1) Tiling 1024 x 1024 Progression order RLCP Decomp levels 5 Quality layers 8 Code block size 6, 64x64 Regions of interest No TLM markers Yes Bypass N/A
  • 15. Embedding JP2 Chose LuraWave command line tool • Some issues (bugs, or inconvenient implementations) arose, and all have been successfully addressed by LuraTech • Created a firm consensus to use JP2 as the format for all stillimage digital imaging (with one or two exceptions) • No plans to use JP2 for digital video – but never say never • Internal information sharing – digital archivists, systems administrators, IT department, programme board members • External communication and networking
  • 16. Current status, future plans • Conversion of all new digital images is now carried out as standard • Nearing the final stages of a project to convert 450k image backlog to JP2 (reducing current footprint from 20 Tb to 5.5 Tb) • Large projects use lossy JP2, legacy picture library uses lossless • Developed a strategy to determine compression levels • Currently using the GUI, but will use the command line interface with our new workflow system, streamlining conversion and QA • Medium term, will look at automating compression level selection
  • 17. Quality control for compression • Visual inspection • Color shifts, loss of detail, halo effects, pixelation, blurring, etc. • Collection-based, representative sample • Test range of compressions with intervals such as 2:1, 4:1, 6:1 • Once artefacts are discovered, step back to previous compression ratio • Worst-performing image rules, for any particular collection • Efficient for homogenous collections – less so for heterogenous collections with wide variety of content • Archives particularly difficult – black and white compresses very well – colour drawings and photographs, not so well
  • 18. Establishing the JP2K-UK group • Unknown who in the UK were using JPEG 2000, or considering it • Unknown who was even interested in JPEG 2000 • No one wants to work in a vacuum… • Discovered a high level of interest: British Library, The National Archives, Oxford, King’s College London, Cambridge and Southampton Universities, Digital Preservation Coalition, commercial companies/consultants • Loose affiliation of the like-minded – a user group
  • 19. Remit of the JP2K-UK group • Initial meeting in December 2009 • Everyone had a little knowledge – no one knew enough • Agreed the need to approach JP2 implementation from practitioner’s point of view • “Practitioner” meaning those who manage digital imaging strategies and implementation • Agreed need to share information and collaborate • Discussed ideas for a conference, and creating some guidelines for the user community • Wellcome encouraged to write a blog about specific experiences working with JP2
  • 20. Ouputs • JPEG 2000 Seminar, held in London in November 2010 > 80 attendees > UK and European speakers and delegates > mostly non-technical audience • Advocacy for practitioner’s needs > discussing and airing the needs and concerns of practitioners has influenced software developers, and even the JPEG Committee > JPEG 2000 at the Wellcome Library blog www.jpeg2000wellcomelibrary.blogspot.com
  • 21. Future plans for JP2K-UK • Guidance for practitioners > Human readable > Focus on practicalities > Enable practitioners to make informed choices > Advice on implementation • Community building > Case studies > Lessons learned > Networking (nationally and internationally)