SlideShare a Scribd company logo
1 of 33
Download to read offline
Tim Donohue
 DSpace + DuraCloud Integrations

                                                                               DuraSpace


Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
Basis for DSpace Integration

1. DSpace AIP Backup & Restore (1.7 +)
   (Initial DuraCloud use case: Backup & Restore)


2. DSpace Curation Task System (1.7 +)

3. DSpace Replication Task Suite (1.8)



          Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                   To request other use: info@duracloud.org
Intro to Archival Info Pkgs (1.7+)

• Primary Use Cases
  – Backup & Restore of DSpace Content
     • All content or just partial (Community/Collection/Item)
  – Migration/Export of DSpace Content
     • All content or just partial (Community/Collection/Item)
  – DuraCloud Integration




          Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                   To request other use: info@duracloud.org
How to Backup DSpace (pre-1.7)




                  Database                          Assetstore Folder



 Full Database                                                                        Folder
    Backup                                                                            Backup



         Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                  To request other use: info@duracloud.org
How to Restore All (pre-1.7)




                 Database                          Assetstore Folder



Full Database                                                                        Folder
   Backup                                                                            Backup



        Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                 To request other use: info@duracloud.org
How to Restore a Collection (pre-1.7)




                       Database                          Assetstore Folder



Temporary                                                                                                   Temporary
 Database                                                                                                    Folder?

      Full Database                                                                         Folder
         Backup                                                                             Backup
              Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                       To request other use: info@duracloud.org
How to Restore a Collection (pre-1.7)




                       Database                          Assetstore Folder



Temporary                                                                                                   Temporary
 Database                                                                                                    Folder?

      Full Database                                                                         Folder
         Backup                                                                             Backup
              Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                       To request other use: info@duracloud.org
Backup via Archival Info Pkgs




Package for each
  Community,
Collection & Item


        AIP backup
       Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                To request other use: info@duracloud.org
Restore All via Archival Info Pkgs




 Package for each
   Community,
 Collection & Item


         AIP backup
        Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                 To request other use: info@duracloud.org
Restore a Collection via AIPs




                                            1                          2
   Collection AIP

                                                                              Items in Collection

     AIP backup
    Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                             To request other use: info@duracloud.org
What’s in an AIP?


                                                                 Content Files
                License                                           or Logos                           Other Files
                                 METS                                                                in Bundles
                        (DIM / MODS / PREMIS /                                                           (optional)
                             METSRights)


                               *Also a BagIt version in works

Archival Information Package (AIP)

                Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                         To request other use: info@duracloud.org
What’s in an AIP?
Related
Object
 AIPs
                                                           Content Files
          License                                           or Logos                           Other Files
                           METS                                                                in Bundles
                  (DIM / MODS / PREMIS /                                                           (optional)
                       METSRights)

                     Descriptive Metadata: DIM & MODS
                     Tech/Preservation Metadata: PREMIS
                     Rights Metadata: METSRights
          Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                   To request other use: info@duracloud.org
The “Site” AIP
Top-Level
Community
  AIPs
                                              METS
                                     (DIM / MODS / PREMIS /
                                          METSRights)




                   Special AIP for site-wide info/metadata:
                   (e.g. Group Memberships, EPeople)

            Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                     To request other use: info@duracloud.org
What can AIPs restore?
Restore All In-Archive Content (Files + Metadata)
Restore All People & Groups
Restore All Permissions / Access Rights
Restore Community / Collection Logos, Metadata,
  Rights & Item Templates
Restore Community / Collection / Item Hierarchy
Restore In-Process / Incomplete Items
Restore Collection OAI-PMH/ORE Harvest Settings
Restore all configuration files (dspace.cfg, etc.)
        Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                 To request other use: info@duracloud.org
Migrate a Collection


One DSpace Install                                                           Another DSpace Install


           2                 1

                                   Collection AIP



                                                                           Items in Collection

     Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                              To request other use: info@duracloud.org
Migrate Content


One DSpace Install
                                                                                     (Future work)


           2                 1

                                   Collection AIP



                                                                           Items in Collection

     Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                              To request other use: info@duracloud.org
DuraCloud Integration (1.7.x)


                [1]                ./dspace packager -d

                [2]                java -jar synctool.jar

                                               1
Package for each
  Community,
Collection & Item


Local “Watch” Folder
       Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                To request other use: info@duracloud.org
DuraCloud Integration (1.7.x)


                [1]                java -jar retrievaltool.jar

                [2]                ./dspace packager -r


                                               2
Package for each
  Community,
Collection & Item


      Local Folder
       Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                To request other use: info@duracloud.org
New: DSpace Replication Suite in 1.8




Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
DSpace Curation System (1.7+)

• Enables a basic ‘microservices’ approach to
  curating DSpace objects
• Anyone can build a task & share it.
• Currently tasks must be written in Java
  – Working on JRuby & Jython integration (1.8?)
• “Frees” admin tasks from Command Line
  – Can now run from Admin UI or CLI


         Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                  To request other use: info@duracloud.org
DSpace Replication Suite (1.8)

• A set of curation tasks geared towards
  ‘replicating’ (backup/restore/audit) content
• “Wraps” 1.7 DSpace AIP Backup & Restore

  Backup content to AIP (filesystem or DuraCloud)
  Restore/Replace from AIP
  Audit AIP (compare to DSpace content)
  Basic IO Tracking of AIP Upload/Downloads
  All replication tasks can be run via Admin UI

         Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                  To request other use: info@duracloud.org
DuraCloud Integration (1.8.x)
                                                              “Replication Task Suite”:
                                                              • Suite of Curation Tasks
                                                              • One step Sync process
               OR
Command line             Curation Tools
                                                              • Via UI or CLI



 Package for each
   Community,
                                                                               1
 Collection & Item
                                                    1
   Local Temp Folder
        (Cache)
           Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                    To request other use: info@duracloud.org
DuraCloud Integration (1.8.x)
                                                              “Replication Task Suite”:
                                                              • One step Retrieve process
                                                              • Via UI or CLI
               OR
Command line             Curation Tools
                                                              • Also ‘auditing’ tools



 Package for each
   Community,
                                                                               1
 Collection & Item
                                                    1
   Local Temp Folder
        (Cache)
           Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                    To request other use: info@duracloud.org
DSpace Replication Suite Demo




Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                         To request other use: info@duracloud.org
Known Limitations

Cannot yet take advantage of DuraCloud
  streaming capabilities (AIPs are zip files)
Cannot yet take advantage of DuraCloud
  transformation services (AIPs are zip files)




       Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                To request other use: info@duracloud.org
Next Steps

• Working towards “unzipped” AIPs (1.8?)
  – METS file & Content files stored in an AIP ‘folder’
    but NOT zipped up
  – Support for DuraCloud streaming, etc.
• DSpace UI Streaming Integration (@mire)
• ‘Auto-Sync’ options
  – Updates in DSpace -> DuraCloud (queued?)
  – Updates via DuraCloud services -> DSpace?

          Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                   To request other use: info@duracloud.org
In Large Thanks to…

• MIT : Richard Rodgers & Wendy Bossons
  – Developed Curation Task Framework
  – Developed initial Replication Suite tasks
• @mire : Mark Diggory
  – Look for @mire’s “Integrating DuraCloud Services
    in DSpace” talk on Friday at 3:30pm




          Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                   To request other use: info@duracloud.org
For More Information

• Replication Task Suite:
  – https://wiki.duraspace.org/display/DSPACE/Replic
    ationTaskSuite
• AIP Backup & Restore:
  – https://wiki.duraspace.org/display/DSDOC/AIP+Ba
    ckup+and+Restore
• Curation Task System:
  – https://wiki.duraspace.org/display/DSDOC/Curati
    on+System
         Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                  To request other use: info@duracloud.org
Photo/Icon Acknowledgments
•   Package : http://www.flickr.com/photos/halfbisqued/2353845688/
•   Harddrive & Terminal icons: http://tango.freedesktop.org/Tango_Desktop_Project
•   Folder icon: http://www.openclipart.org/detail/13740
•   Database icon: http://www.openclipart.org/detail/68413
•   Zip Pkg icon: http://veryicon.com/icons/system/capital-icon-suite-mac/zip-10.html
•   File icons: http://veryicon.com/icons/system/rhor-v2-part-3/
•   Checkmark & Delete icons: http://veryicon.com/icons/system/on-stage/
•   Tools Icon: http://veryicon.com/icons/system/azullustre/




                 Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0)
                                          To request other use: info@duracloud.org

More Related Content

Similar to DSpace & DuraCloud Integrations

Online storage for the masses and the case of pithos
Online storage for the masses and the case of pithosOnline storage for the masses and the case of pithos
Online storage for the masses and the case of pithosnkoziris
 
CS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSCS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSKathirvel Ayyaswamy
 
storage & file strucure in dbms
storage & file strucure in dbmsstorage & file strucure in dbms
storage & file strucure in dbmssachin2690
 
File system interfacefinal
File system interfacefinalFile system interfacefinal
File system interfacefinalmarangburu42
 
Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12DuraSpace
 
Hadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsHadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsDrPDShebaKeziaMalarc
 
Distributed File System
Distributed File SystemDistributed File System
Distributed File SystemNtu
 
Ch11 OS
Ch11 OSCh11 OS
Ch11 OSC.U
 
Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2ijtsrd
 
Chapter 10 - File System Interface
Chapter 10 - File System InterfaceChapter 10 - File System Interface
Chapter 10 - File System InterfaceWayne Jones Jnr
 
File management
File managementFile management
File managementMohd Arif
 
File system.
File system.File system.
File system.elyza12
 
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...Amazon Web Services
 

Similar to DSpace & DuraCloud Integrations (20)

Online storage for the masses and the case of pithos
Online storage for the masses and the case of pithosOnline storage for the masses and the case of pithos
Online storage for the masses and the case of pithos
 
CS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMSCS9222 ADVANCED OPERATING SYSTEMS
CS9222 ADVANCED OPERATING SYSTEMS
 
storage & file strucure in dbms
storage & file strucure in dbmsstorage & file strucure in dbms
storage & file strucure in dbms
 
File system interfacefinal
File system interfacefinalFile system interfacefinal
File system interfacefinal
 
Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12Integrating Fedora with DuraCloud 1-11-12
Integrating Fedora with DuraCloud 1-11-12
 
Hadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsHadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data Analytics
 
CIT173_Ch15_Mnstr_23.pdf
CIT173_Ch15_Mnstr_23.pdfCIT173_Ch15_Mnstr_23.pdf
CIT173_Ch15_Mnstr_23.pdf
 
009709863.pdf
009709863.pdf009709863.pdf
009709863.pdf
 
Distributed File System
Distributed File SystemDistributed File System
Distributed File System
 
Ch11 OS
Ch11 OSCh11 OS
Ch11 OS
 
OSCh11
OSCh11OSCh11
OSCh11
 
OS_Ch11
OS_Ch11OS_Ch11
OS_Ch11
 
Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2Cloud Data De Duplication in Multiuser Environment DeposM2
Cloud Data De Duplication in Multiuser Environment DeposM2
 
Windows Forensics
Windows ForensicsWindows Forensics
Windows Forensics
 
Chapter 10 - File System Interface
Chapter 10 - File System InterfaceChapter 10 - File System Interface
Chapter 10 - File System Interface
 
File management
File managementFile management
File management
 
File system.
File system.File system.
File system.
 
Personal Digital Archiving 2015 - NYU - Workshop
Personal Digital Archiving 2015 - NYU - WorkshopPersonal Digital Archiving 2015 - NYU - Workshop
Personal Digital Archiving 2015 - NYU - Workshop
 
Posscon2013
Posscon2013Posscon2013
Posscon2013
 
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
(BAC309) Automating Backup and Archiving with AWS and CommVault | AWS re:Inve...
 

More from Tim Donohue

On the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTOn the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTTim Donohue
 
Introducing the New DSpace User Interface
Introducing the New DSpace User InterfaceIntroducing the New DSpace User Interface
Introducing the New DSpace User InterfaceTim Donohue
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafTim Donohue
 
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitDiscussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitTim Donohue
 
How to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityHow to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityTim Donohue
 
DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014Tim Donohue
 
DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)Tim Donohue
 
DSpace RoadMap 2012
DSpace RoadMap 2012DSpace RoadMap 2012
DSpace RoadMap 2012Tim Donohue
 
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)Tim Donohue
 
Future Trends for Repositories
Future Trends for RepositoriesFuture Trends for Repositories
Future Trends for RepositoriesTim Donohue
 
DSpace RoadMap 2011
DSpace RoadMap 2011DSpace RoadMap 2011
DSpace RoadMap 2011Tim Donohue
 
DSpace RoadMap 2010
DSpace RoadMap 2010DSpace RoadMap 2010
DSpace RoadMap 2010Tim Donohue
 
BibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutBibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutTim Donohue
 
Making DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnMaking DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnTim Donohue
 
Digital Preservation in the Wild
Digital Preservation in the WildDigital Preservation in the Wild
Digital Preservation in the WildTim Donohue
 

More from Tim Donohue (15)

On the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + RESTOn the Road to DSpace 7: Angular UI + REST
On the Road to DSpace 7: Angular UI + REST
 
Introducing the New DSpace User Interface
Introducing the New DSpace User InterfaceIntroducing the New DSpace User Interface
Introducing the New DSpace User Interface
 
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + ThymeleafDSpace UI Prototype Challenge: Spring Boot + Thymeleaf
DSpace UI Prototype Challenge: Spring Boot + Thymeleaf
 
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 SummitDiscussion on DSpace's Two UIs : DuraSpace 2015 Summit
Discussion on DSpace's Two UIs : DuraSpace 2015 Summit
 
How to "Hack" the DSpace Community
How to "Hack" the DSpace CommunityHow to "Hack" the DSpace Community
How to "Hack" the DSpace Community
 
DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014DSpace Overview / Roadmap 2014
DSpace Overview / Roadmap 2014
 
DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)DSpace RoadMap & Vision 2013 (OR13)
DSpace RoadMap & Vision 2013 (OR13)
 
DSpace RoadMap 2012
DSpace RoadMap 2012DSpace RoadMap 2012
DSpace RoadMap 2012
 
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
DSpace RoadMap and Vision (at 2013 OAI8 DSpace User Group)
 
Future Trends for Repositories
Future Trends for RepositoriesFuture Trends for Repositories
Future Trends for Repositories
 
DSpace RoadMap 2011
DSpace RoadMap 2011DSpace RoadMap 2011
DSpace RoadMap 2011
 
DSpace RoadMap 2010
DSpace RoadMap 2010DSpace RoadMap 2010
DSpace RoadMap 2010
 
BibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact OutBibApp 1.0 : Information In, Impact Out
BibApp 1.0 : Information In, Impact Out
 
Making DSpace XMLUI Your Own
Making DSpace XMLUI Your OwnMaking DSpace XMLUI Your Own
Making DSpace XMLUI Your Own
 
Digital Preservation in the Wild
Digital Preservation in the WildDigital Preservation in the Wild
Digital Preservation in the Wild
 

Recently uploaded

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

DSpace & DuraCloud Integrations

  • 1. Tim Donohue DSpace + DuraCloud Integrations DuraSpace Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 2. Basis for DSpace Integration 1. DSpace AIP Backup & Restore (1.7 +) (Initial DuraCloud use case: Backup & Restore) 2. DSpace Curation Task System (1.7 +) 3. DSpace Replication Task Suite (1.8) Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 3. Intro to Archival Info Pkgs (1.7+) • Primary Use Cases – Backup & Restore of DSpace Content • All content or just partial (Community/Collection/Item) – Migration/Export of DSpace Content • All content or just partial (Community/Collection/Item) – DuraCloud Integration Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 4. How to Backup DSpace (pre-1.7) Database Assetstore Folder Full Database Folder Backup Backup Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 5. How to Restore All (pre-1.7) Database Assetstore Folder Full Database Folder Backup Backup Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 6. How to Restore a Collection (pre-1.7) Database Assetstore Folder Temporary Temporary Database Folder? Full Database Folder Backup Backup Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 7. How to Restore a Collection (pre-1.7) Database Assetstore Folder Temporary Temporary Database Folder? Full Database Folder Backup Backup Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 8. Backup via Archival Info Pkgs Package for each Community, Collection & Item AIP backup Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 9. Restore All via Archival Info Pkgs Package for each Community, Collection & Item AIP backup Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 10. Restore a Collection via AIPs 1 2 Collection AIP Items in Collection AIP backup Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 11. What’s in an AIP? Content Files License or Logos Other Files METS in Bundles (DIM / MODS / PREMIS / (optional) METSRights) *Also a BagIt version in works Archival Information Package (AIP) Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 12. What’s in an AIP? Related Object AIPs Content Files License or Logos Other Files METS in Bundles (DIM / MODS / PREMIS / (optional) METSRights) Descriptive Metadata: DIM & MODS Tech/Preservation Metadata: PREMIS Rights Metadata: METSRights Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 13. The “Site” AIP Top-Level Community AIPs METS (DIM / MODS / PREMIS / METSRights) Special AIP for site-wide info/metadata: (e.g. Group Memberships, EPeople) Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 14. What can AIPs restore? Restore All In-Archive Content (Files + Metadata) Restore All People & Groups Restore All Permissions / Access Rights Restore Community / Collection Logos, Metadata, Rights & Item Templates Restore Community / Collection / Item Hierarchy Restore In-Process / Incomplete Items Restore Collection OAI-PMH/ORE Harvest Settings Restore all configuration files (dspace.cfg, etc.) Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 15. Migrate a Collection One DSpace Install Another DSpace Install 2 1 Collection AIP Items in Collection Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 16. Migrate Content One DSpace Install (Future work) 2 1 Collection AIP Items in Collection Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 17. DuraCloud Integration (1.7.x) [1] ./dspace packager -d [2] java -jar synctool.jar 1 Package for each Community, Collection & Item Local “Watch” Folder Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 18. DuraCloud Integration (1.7.x) [1] java -jar retrievaltool.jar [2] ./dspace packager -r 2 Package for each Community, Collection & Item Local Folder Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 19. New: DSpace Replication Suite in 1.8 Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 20. DSpace Curation System (1.7+) • Enables a basic ‘microservices’ approach to curating DSpace objects • Anyone can build a task & share it. • Currently tasks must be written in Java – Working on JRuby & Jython integration (1.8?) • “Frees” admin tasks from Command Line – Can now run from Admin UI or CLI Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 21. DSpace Replication Suite (1.8) • A set of curation tasks geared towards ‘replicating’ (backup/restore/audit) content • “Wraps” 1.7 DSpace AIP Backup & Restore Backup content to AIP (filesystem or DuraCloud) Restore/Replace from AIP Audit AIP (compare to DSpace content) Basic IO Tracking of AIP Upload/Downloads All replication tasks can be run via Admin UI Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 22. DuraCloud Integration (1.8.x) “Replication Task Suite”: • Suite of Curation Tasks • One step Sync process OR Command line Curation Tools • Via UI or CLI Package for each Community, 1 Collection & Item 1 Local Temp Folder (Cache) Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 23. DuraCloud Integration (1.8.x) “Replication Task Suite”: • One step Retrieve process • Via UI or CLI OR Command line Curation Tools • Also ‘auditing’ tools Package for each Community, 1 Collection & Item 1 Local Temp Folder (Cache) Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 24. DSpace Replication Suite Demo Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 25. Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 26. Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 27. Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 28. Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 29. Known Limitations Cannot yet take advantage of DuraCloud streaming capabilities (AIPs are zip files) Cannot yet take advantage of DuraCloud transformation services (AIPs are zip files) Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 30. Next Steps • Working towards “unzipped” AIPs (1.8?) – METS file & Content files stored in an AIP ‘folder’ but NOT zipped up – Support for DuraCloud streaming, etc. • DSpace UI Streaming Integration (@mire) • ‘Auto-Sync’ options – Updates in DSpace -> DuraCloud (queued?) – Updates via DuraCloud services -> DSpace? Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 31. In Large Thanks to… • MIT : Richard Rodgers & Wendy Bossons – Developed Curation Task Framework – Developed initial Replication Suite tasks • @mire : Mark Diggory – Look for @mire’s “Integrating DuraCloud Services in DSpace” talk on Friday at 3:30pm Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 32. For More Information • Replication Task Suite: – https://wiki.duraspace.org/display/DSPACE/Replic ationTaskSuite • AIP Backup & Restore: – https://wiki.duraspace.org/display/DSDOC/AIP+Ba ckup+and+Restore • Curation Task System: – https://wiki.duraspace.org/display/DSDOC/Curati on+System Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org
  • 33. Photo/Icon Acknowledgments • Package : http://www.flickr.com/photos/halfbisqued/2353845688/ • Harddrive & Terminal icons: http://tango.freedesktop.org/Tango_Desktop_Project • Folder icon: http://www.openclipart.org/detail/13740 • Database icon: http://www.openclipart.org/detail/68413 • Zip Pkg icon: http://veryicon.com/icons/system/capital-icon-suite-mac/zip-10.html • File icons: http://veryicon.com/icons/system/rhor-v2-part-3/ • Checkmark & Delete icons: http://veryicon.com/icons/system/on-stage/ • Tools Icon: http://veryicon.com/icons/system/azullustre/ Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: info@duracloud.org