Hot Topics: The DuraSpace Community Webinar Series
Series 12: "Digital Preservation with ArchivesDirect: Ready, Set, Go!"
Webinar 1: “Digital Preservation with ArchivesDirect: Ready, Set...” May 14, 2016
Curated and Presented by Artefactual Systems, Sarah Romkey, Systems Archivist and Courtney Mumma, Consultant, US and International Community Development
5.14.15 Slides, “Digital Preservation with ArchivesDirect: Ready, Set...”
1. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Hot Topics: The DuraSpace
Community Webinar Series
Series Twelve:
“Digital Preservation with
ArchivesDirect:
Ready, Set, Go!”
Curated by Artefactual Systems
2. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Webinar 1:
Digital Preservation with
ArchivesDirect:
Ready, Set...
Presented by:
Sarah Romkey, MAS/MLIS
Systems Archivist
Courtney Mumma, MAS/MLIS
US and International Community Development
3. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Agenda
● What is ArchivesDirect?
● Questioning your content:
○ What types of digital content?
○ Where is your digital content?
○ How much is there?
○ Asking questions of your content
● Prepare your content for ArchivesDirect
○ Case study: Digitized content
○ Case study: Born digital material
● Readiness for digital preservation
● Next time: Go!
4. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
What is ArchivesDirect?
● Hosted service from DuraSpace
● Support, training, and consulting from
Artefactual Systems
● Archivematica + DuraCloud = standards-
based digital preservation packages in
secure long-term storage
5. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
What types of digital content?
born-digital
● government and university records, student artwork, e-theses and
dissertations
● diverse formats: audiovisual, textual, geospatial, websites,
presentations, images, databases
digitized
● books, newspapers, images, video from vendors
● pre-made access and preservation copies
submission documentation & metadata
● permission forms, accession records, pictures of digital media, etc.
● descriptive MD from other systems
6. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Where is your digital content?
• stored locally
• in other systems
● ie CONTENTdm, Dspace, DuraCloud, Islandora
• on detached media
● floppies, hard drives, cds, dvds, usb sticks, etc.
• packaged
● Bagged using Library of Congress BagIt specification
● Forensic images
● Zipped or tarballed
7. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
How much is there?
Size: gigabytes, terabytes, petabytes
● Sum total of all material
● Size of distinct content sets
● Biggest single digital objects
Quantity
● Sum total of all files
● Number of files in distinct content sets
Resource capacity
● Space allocated to processing and storage locations
● Consider ideal transfer, SIP and AIP sizes
8. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Asking questions of your content
descriptive metadata?
● needs preserved? already existent or need to add? complex or
simple objects?
submission documentation?
● donor agreements, pictures of physical media, licenses, etc
access copies?
● already have them? what system to send/store?
generate preservation copies?
● already have them?
service masters?
9. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Asking questions of your content
• directory structure important (Original Order)?
• keep the package AND the content, or just one?
• rights information?
• is content Bagged? in DSpace? a forensic image?
(Transfer type)
• how large should my archival packages be?
• will my archival packages have a 1:1 relationship with my
transferred digital content? will my content be arranged
into multiple packages or combined into one?
(Arrangement workflow)
10. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Prepare your content for
ArchivesDirect
● DuraCloud Sync Tool is the easiest way to make your
content available to process in ArchivesDirect
○ Sync Tool works by “watching” a directory on your computer or
server
● Archivematica understands content in directories
○ It will use one directory to create a “space” to find transfer
content
○ Content for preservation must be inside directories also
1 Sync Tool Directory + 2 Archivematica Directories =
3 directory layers for your content
11. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Prepare your content for
ArchivesDirect
Sync Tool Watched
Directory contains
“transfers” directory,
which contains
material for
preservation within
directories
12. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Case Study 1: digitized content
Scenario:
○ Institution has digitized master images provided by
a vendor and has the following goals:
■ Preserve the digitization masters in an AIP
■ Create a DIP using Archivematica’s
Normalization functionality
■ Include Dublin Core metadata in both the AIP
and DIP
13. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Case Study 1: digitized content
How to prepare this content for ArchivesDirect:
● Prepare a metadata.csv file using Archivematica
instructions
● Place all digital content in directory called “objects”
(subdirectories ok too) and metadata.csv in a directory
called “metadata”
Full directory paths for Sync Tool:
/duraspace/transfers/Collection name/objects/file1.tif, etc
/duraspace/transfers/Collection name/metadata/metadata.csv
14. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Case Study 2: born digital content
Scenario:
○ The institution has some content from a
donor or another department which it
wishes to preserve in AIPs. The access
situation is unclear, but the institution
would like to make access copies, “just in
case”.
15. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Case Study 2: born digital content
● Before “syncing” the content consider:
○ Is the structure as transferred from the creator
important? It can be retained in the AIP.
○ Are there logical groupings within the collection
to make transfers from? Must the whole
collection be one transfer? If so, how large is it?
● Move the content to your watched directory,
or configure the sync tool to watch its
current location
16. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Readiness for digital preservation
Other factors to consider:
1. Has your institution thought about writing a digital
preservation policy?
2. Are you preparing to have a TRAC audit or self-audit?
3. Do you have the hardware/software to make safe
acquisitions from donors (write blocker, digital forensics
tools?)
4. Do you have a donation form that is appropriate for
born-digital content?
17. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Next time: Go!
Next webinar, 5/26/15, will cover:
● How to process content in Archivematica
● Both of our case study workflows
● Helpful Archivematica micro-services and
features
● Accessing/downloading AIPs stored through
ArchivesDirect
18. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
Upcoming Event
IS&T Archiving Conference, Los Angeles
Thursday, May 21, 1:10 - 2:00pm
DIGITAL PRESERVATION II
"Cloud-based Digital Preservation Services for
Small or Mid-sized Institutions: Results of a
Pilot Study of Archivematica + DuraCloud,
Kevin Miller, Pepperdine University"
19. May 14, 2015 Hot Topics: DuraSpace Community Webinar Series
QUESTIONS
For more information please visit:
http://archivesdirect.org/
http://archivematica.org/