78. Meeting the Expectations of the
Community : CRL & the Auditing
of Digital Repositories
Marie Waltz
Center for Research Libraries
79. What is the Center for Research Libraries
(CRL)
• A consortium of over 260 College and
University Library’s, primarily in the U.S.
and Canada.
• Our members have an interest in auditing
and certification because they are
investing in digital repositories.
Speaker: Marie Waltz
11/28/2007 Practices & Challenges in Preservation
80. Research libraries are changing...content
is no longer on library shelves
•The collections of 80% of U.S. research libraries are duplicating the contents
of other research libraries. Most of what is owned will be digitized within the
next ten years.
•Google has digitized more than 12 million volumes.
•Born digital material is now the norm for many types of academic materials
(course syllabi, articles, and many manuscripts submitted to publishers.)
Speaker: Marie Waltz
Practices & Challenges in Preservation
81. CRL & Digital Preservation
2002 - 2004 – Political Communications
Web Archiving Project
2005 - 2006 – Test audits of TRAC
2008 - 2010 – NSF Case Studies
Speaker: Marie Waltz
Practices & Challenges in Preservation
82. CRL & Test Audits using TRAC
RLG/CRL Mellon Foundation Project to
audits digital repositories
– ICPSR, Portico, KB, LOCKSS
– Tested the TRAC metrics
– Understanding of the auditing process.
Speaker: Marie Waltz
Practices & Challenges in Preservation
83. CRL & NSF Case Studies
• Looked at eight organizations of various
sizes who house digital content
• Gave us a broader understanding of what
makes a “successful” organization
• Allowed us to see how technology
decisions effect a repository.
Speaker: Marie Waltz
Practices & Challenges in Preservation
84. Current Digital Preservation Projects at
CRL
• Certifying digital repositories of interest to
members.
• Participation in establishment of ISO
16363
• Human Rights Archives & Documentation.
Speaker: Marie Waltz
Practices & Challenges in Preservation
85. NISO Presentation
What is a Trusted Digital Repository?
A trusted digital repository is one whose
mission is to provide reliable, long-term
access to managed digital resources to its
designated community, now and in the
future
– Trusted Digital Repositories : Attributes and
Responsibilities, An RLG-OCLC Report (RLG, 2002)
Speaker: Marie Waltz
Practices & Challenges in Preservation
86. NISO Presentation
Why Auditing?
An audit establishes the soundness and
dependability of a repository.
Speaker: Marie Waltz
Practices & Challenges in Preservation
87. NISO Presentation
Auditing a Digital Repository
• Advisory Panel
• Audit Criteria
• Time table
• Standards in auditing
• Certification and reporting to community
Speaker: Marie Waltz
Practices & Challenges in Preservation
88. Current Advisory Panel
• Martha Brogan (Chair) • Anne Pottier
Director of Collection Development & Associate University Librarian
Management McMaster University
University of Pennsylvania
• Oya Y. Rieger
• Winston Atkins Associate University Librarian for
Preservation Officer Information Technologies
Duke University Cornell University
• William Parod • Perry Willett
Senior Repository Developer Digital Preservation Services Manager
Northwestern University Libraries California Digital Library
• Mark Phillips
Assistant Dean for Digital Libraries
University of North Texas Libraries
Speaker: Marie Waltz
Practices & Challenges in Preservation
89. NISO Presentation
Criteria
• Advisory Panel
• ISO 16363 (TDR) / TRAC
• Community feedback
Speaker: Marie Waltz
Practices & Challenges in Preservation
90. NISO Presentation
Timetable for an Audit
1. Logistics for the audit
2. Request for Documentation
3. Identify activities, policies etc. of key
significance for preservation
4. Evaluation of Repository (includes site visit)
5. Review findings with panel
6. Report findings
Speaker: Marie Waltz
Practices & Challenges in Preservation
91. NISO Presentation
Standards Used in Auditing
• Metadata Standards, Dublin Core, Content
Standard for Digital Geospatial Metadata
(CSDGM)
• Technical Standards, ISO 27001
• Data and format standards, PDF/A, etc.
Speaker: Marie Waltz
Practices & Challenges in Preservation
92. NISO Presentation
Certification
• Means the Repository is “Trusted”
• We report findings to the Research
Community
• Considerations for: Length of Certification
and Re-Certification
Speaker: Marie Waltz
Practices & Challenges in Preservation
93. NISO Presentation
Certification of HathiTrust& Portico
• More information about preservation on
their websites
• Assurance they are adhering to standards
and preservation strategies
• Assurance that changes will take
preservation into account
Speaker: Marie Waltz
Practices & Challenges in Preservation
94. NISO Presentation
ISO Standard 16363
• Work on the standard: PTAB Group
• Test audits
• Future plans for auditing using ISO 16363
Speaker: Marie Waltz
Practices & Challenges in Preservation
95. NISO Presentation
Test Audits of ISO 16363
• May-July 2011. Tested six digital
repositories, three in the U.S. and three in
Europe.
• Future plans for auditing using ISO 16363.
– European Framework for Certification
and Auditing.
Speaker: Marie Waltz
Practices & Challenges in Preservation
96. NISO Presentation
Future Plans for CRL Auditing
• Use ISO 16363
• Audit repositories of interest to academic
and independent researchers in the United
States and Canada.
• Encourage community feedback
• Become a resource for information about
digital repositories
Speaker: Marie Waltz
Practices & Challenges in Preservation
97. NISO Presentation
Community involvement
• Without community feedback we will not
be successful at targeting our audits
• We want to audit repositories of interest to
academic and independent researchers.
Speaker: Marie Waltz
Practices & Challenges in Preservation
98. NISO Presentation
CRL is becoming a Resource for
Information about Digital Repositories
• Global Resources Forum (GRF) Reviews
and Profiles
• Audit reports
• Webinars
Speaker: Marie Waltz
Practices & Challenges in Preservation
99. NISO Presentation
Summary
CRL will continue to audit digital repositories
of interest on behalf of our members using
ISO 16363.
Speaker: Marie Waltz
Practices & Challenges in Preservation
Join 15 minutes early.Close everything that might ding: yammer, AIM, rescue time, Outlook, etc.Undock the Chat and Questions and move them over to the left hand screen.Mute my phone and turn cell to alarm and BB off.Felicity starts the broadcast. Until then, the users will see the Go To Webinar entrance screen and hear silence.The meeting will be recorded. Send Felicity my polls.--Thank you Pamela.Excited to talk to you about some of the research and analysis we’ve done in this area.Questions … happy to take questions. Tricky to keep an eye on both. If I miss one while going through, I’ll catch it at the end. Also happy to take additional questions by email or phone after the fact.
Archive Service product manager for Portico and ITHAKA as a whole.I like to describe what I do as problem solving … from relatively high level archival policy setting to help figuring out what to do with particular files (or rather, teaching folks how to think about what to do with particular files).
General advice …Set your policies first, make design and implementation decisions based upon your strategies. Use standards if they will help you.If you choose not to follow standards, make sure you can accept them both at import and export.
Mind Map …Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
On my white board …I teach classes on this to inculcate new hires …Emphasize long-term before clicking into key goals …
Standards around receiptStandards around metadata to add/subtract/modify during ingestStandards around files during ingestStandards around archive management
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
there must be preservation planningthe archive requires ongoing administrationcontent in the form of files and metadata about the files must come into the archivecontent in the form of files and metadata about the files must go out of the archivethe archive is not the hardware or software—rather the hardware and software are elements of the archive, which includes activities performed by peoplethe original producer of the content and the eventual users of the content must have input into the ongoing management of the archiveat its very base, the preserved information consists of the content to be archived and metadata about the content—both of which are required for long-term preservation.
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
Must describe the packaging, includes abbreviated DMD.The Preservation and Packaging MD file is the thing that will let you manage the content over the long term.
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
Wide World of standards, I’m going to stick with the ones we use or have seriously considered …
General advice …Set your policies first, make design and implementation decisions based upon your strategies. Use standards if they will help you.If you choose not to follow standards, make sure you can accept them both at import and export.
Change presenter to Pamela when done.Stay on afterward.
Community knowledge sharing on technical and business decisions:FormatsRisksPreservation plans and actions
sustainable digital preservation services are possible only when combine with access services.Although access to information tomorrow depends on preservation actions taken today, without access there is no need for preservation.This is why is was crucial to make the distribution model as optimized as possible.