SlideShare una empresa de Scribd logo
1 de 27
SCAPE 
Scalable Preservation Environments
• Your collection of digital data is growing rapidly. 
• Your preservation activities must become more 
efficient and more scalable. 
• You need SCAPE! 
• The SCAPE project has developed scalable solutions 
for long-term preservation of large-scale and 
heterogeneous data sets. 
2 
Digital Preservation – What do I need? 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
3 
What is SCAPE? 
Its all about scalability! 
• Scalable services for planning and execution of 
institutional preservation strategies 
• Infrastructure for the execution of digital 
preservation processes on large volumes of data 
• Existing tools have been improved and extended. 
• New tools have been developed where necessary. 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
4 
What is SCAPE? 
SCAPE covers a whole digital preservation life cycle 
• Interconnecting services support 
the preservation of large 
repositories of digital objects 
• Applications support the 
formulation of preservation 
policies, decision making and 
selection of preservation actions 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
5 
What is SCAPE? 
Take your pick – choose what you need! 
• Use the full set of interconnected 
SCAPE components or a selected 
series of SCAPE tools or workflows. 
• Many SCAPE components can be 
individually incorporated. 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
• All SCAPE solutions arise from real-world challenges at 
partner institutions. 
• Each challenge is tested in testbeds at the partner 
institutions. 
6 
Solutions Tested in Real Life 
Web 
Content 
Digital 
Repositories 
This work was partially supported by the SCAPE Project. 
Data 
Centres 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137). 
Research 
Data Sets 
Testbeds
Solutions for Content Holders 
Scalability 
In four dimensions: 
Heterogeneity of collections 
as well as number, size and 
complexity of objects 
Automation 
Through scalable, 
automated and simple to 
design preservation 
workflows 
Planning 
Answering core 
preservation planning 
questions 
Integration 
Through a robust, 
integrated, open source 
preservation system 
7 This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
8 
Overview: SCAPE Architecture 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
9 
Overview: SCAPE Components 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137). 
The SCAPE 
Platform is a 
reference 
architecture 
for scalable 
preservation 
environments
10 
Overview: SCAPE Components 
This work was partially supported by the SCAPE Project. 
The SCAPE Preservation 
Components are tools which 
enhance the functionality of a 
digital preservation system in: 
• Scalability 
• Functional coverage 
• Quality 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
The SCAPE Planning and Watch 
components address the 
bottleneck of decision 
processes and processing 
information required for 
decision making 
11 
Overview: SCAPE Components 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
Examples of tools and services 
12 This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
13 
Scalable Planning and Watch 
Scout – an Automated Preservation Watch System 
• Enables you to monitor your 
collections 
• Lets you access 
community knowledge 
• Collects relevant knowledge 
and enables automated 
notification 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
14 
Scalable Planning and Watch 
C3PO – Content Profiling Tool for Preservation Analysis 
• Analyses characterisation 
metadata for digital collections 
• Aggregates and combines the 
metadata information across 
collections 
• Generates a profile of the 
content set 
• Allows use of different 
metadata formats 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
15 
Scalable Planning and Watch 
Plato – Scalable Preservation Planning 
• Decision-making support tool 
• Guides you through the 
preservation planning 
workflow 
• Provides trust through 
controlled experiments and 
documentation 
• Provides an executable plan 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
16 
Scalable Tools 
ToMaR – let your Preservation Tools Scale 
• Run existing tools against 
large amounts of files 
• Execute tools in a scalable 
fashion on a MapReduce 
cluster 
• Enable scalable workflows 
which chain together a set of 
tools 
• Process payloads too big to be 
computed on a single machine 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
17 
Preservation Components 
Pagelyzer – Monitor your Web Content 
• Detect changes in web pages 
• Compare web page versions 
on a large scale 
• Compare web page rendering 
in different browsers 
• Determine appropriate 
frequency of web harvestings 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
18 
Preservation Components 
Jpylyzer – Easy Validation of JPEG 2000 
• Automated JP2 validation and 
feature extraction 
• Enables you to confirm 
whether an image is a valid, 
intact JP2 file 
• Reports the key technical 
properties of the image 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
19 
Preservation Components 
Matchbox – easy Detection of Nearly Duplicate Images 
• Identify duplicate content, 
even where files are of 
different size, format, 
cropping etc. or scanned from 
different original copy 
• Automate quality assurance 
and reduce manual effort 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
20 
Preservation Components 
xcorrSound – Automate Sound Wave Analysis 
• Compare two audio files and 
output the similarity 
• Detect overlaps in audio files 
• Detect occurrences of a 
smaller audio file (e.g. a jingle) 
within a larger audio file or an 
index of audio files 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
SCAPE tools are published as open source software. 
Tools and services from SCAPE are sustained by 
• Open Planets Foundation - 
address core digital preservation 
challenges and engage with the community 
• COPTR - 
Community Owned digital Preservation 
Tool Registry 
21 
Sustainability of Tools and Services 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
Sustainability of SCAPE results 
Ultimate Sustainability goal: 
• Supporting communities of practice by enabling 
efficient collaboration during the project and 
beyond. 
Open Planets Foundation will take post-project 
ownership of the outputs, supported by other 
partners providing specific capabilities. 
22 This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
Sustainability of SCAPE results 
Five complementary approaches: 
• Visibility 
Providing integrated outreach to multiple audiences to maximise 
discoverability. 
• Quality 
Ensuring that project outputs conform to standards-driven quality 
assurance. 
• Training 
Supporting skills development to further institutional capacity building. 
• Open licensing 
Using open licences to encourage the adoption and reuse of project 
outputs. 
• Community integration 
Integrating project outputs into commercial and non-commercial 
systems and services. 
23 This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
• EU-funded project under FP7 (Research and 
Technological Development) 
• Project runtime: February 2011 to September 2014 
• 20 partners from 10 countries - from memory 
institutions, data centres, research labs, universities, 
and industrial firms 
• Public Project materials are licensed under a 
CC-BY-SA International License 
24 
About SCAPE 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
25 
SCAPE Consortium 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
26 
Additional Sources of Interest 
• Development Infrastructure 
• Code repository hosted by the Open Planets Foundation and GitHub 
• https://github.com/openplanets/scape/ 
• Development Wiki 
• http://wiki.opf-labs.org/display/SP/Home 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137). 
• Tools 
• http://www.scape-project.eu/tools 
• Experimental Workflows 
• http://www.myexperiment.org/search?query=SCAPE&type=all&commit=Search 
• Publications 
• http://www.scape-project.eu/category/publication 
• Public Deliverables 
• http://www.scape-project.eu/category/deliverable
27 
More Information 
• SCAPE website: www.scape-project.eu 
• Blog posts and more: 
www.openplanetsfoundation.com/projects/scape 
• Tools and Services: 
https://github.com/openplanets/scape 
• SCAPE Twitter: @SCAPEProject, #SCAPEProject 
• SCAPE Newsletter: Sign up via www.scape-project.eu 
All images © the SCAPE Project or its partners, 
except images on slides 3, 6 and 26 © www.digitalbevaring.dk 
This work was partially supported by the SCAPE Project. 
The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).

Más contenido relacionado

La actualidad más candente

ESCAPE Kick-off meeting - WP5 (Feb 2019)
ESCAPE Kick-off meeting - WP5 (Feb 2019)ESCAPE Kick-off meeting - WP5 (Feb 2019)
ESCAPE Kick-off meeting - WP5 (Feb 2019)ESCAPE EU
 
Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013MediaMixerCommunity
 
OpenAIRE for SPARC and SPARC Europe
OpenAIRE for SPARC and SPARC EuropeOpenAIRE for SPARC and SPARC Europe
OpenAIRE for SPARC and SPARC EuropeOpenAIRE
 
Experience in managing service portfolio by Pasquale Pagano
Experience in managing service portfolio by Pasquale PaganoExperience in managing service portfolio by Pasquale Pagano
Experience in managing service portfolio by Pasquale PaganoBlue BRIDGE
 
OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"
OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"
OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"OpenAIRE
 
OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...
OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...
OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...NETWAYS
 
OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"
OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"
OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"OpenAIRE
 
ENP Belgrade WS refinement introduction
ENP Belgrade WS refinement introductionENP Belgrade WS refinement introduction
ENP Belgrade WS refinement introductionEuropeana Newspapers
 
Science Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth SciencesScience Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth SciencesEOSCpilot .eu
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyArchiver
 
OpenAIRE NOADs
OpenAIRE NOADsOpenAIRE NOADs
OpenAIRE NOADsOpenAIRE
 
The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...Rafael C. Jimenez
 
Europeana Newspapers - the Gateway to European Newspapers Online
Europeana Newspapers - the Gateway to European Newspapers OnlineEuropeana Newspapers - the Gateway to European Newspapers Online
Europeana Newspapers - the Gateway to European Newspapers Onlinecneudecker
 
Archiver 3rd omc_project_overview
Archiver 3rd omc_project_overviewArchiver 3rd omc_project_overview
Archiver 3rd omc_project_overviewArchiver
 
Design phase kick-off event and Ceremony
Design phase kick-off event and CeremonyDesign phase kick-off event and Ceremony
Design phase kick-off event and CeremonyArchiver
 
OpenAIRE at WeNMR Kick-off meeting
OpenAIRE at WeNMR Kick-off meetingOpenAIRE at WeNMR Kick-off meeting
OpenAIRE at WeNMR Kick-off meetingOpenAIRE
 
Linked Data with hybrid services in Agriculture
Linked Data with hybrid services in AgricultureLinked Data with hybrid services in Agriculture
Linked Data with hybrid services in AgricultureRaul Palma
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver
 
How to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliantHow to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliantPedro Príncipe
 
Webinar on OpenAIRE compatibility for repositories: EPrints repository platform
Webinar on OpenAIRE compatibility for repositories: EPrints repository platform Webinar on OpenAIRE compatibility for repositories: EPrints repository platform
Webinar on OpenAIRE compatibility for repositories: EPrints repository platform OpenAIRE
 

La actualidad más candente (20)

ESCAPE Kick-off meeting - WP5 (Feb 2019)
ESCAPE Kick-off meeting - WP5 (Feb 2019)ESCAPE Kick-off meeting - WP5 (Feb 2019)
ESCAPE Kick-off meeting - WP5 (Feb 2019)
 
Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013
 
OpenAIRE for SPARC and SPARC Europe
OpenAIRE for SPARC and SPARC EuropeOpenAIRE for SPARC and SPARC Europe
OpenAIRE for SPARC and SPARC Europe
 
Experience in managing service portfolio by Pasquale Pagano
Experience in managing service portfolio by Pasquale PaganoExperience in managing service portfolio by Pasquale Pagano
Experience in managing service portfolio by Pasquale Pagano
 
OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"
OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"
OpenAIRE "How to make your repository OpenAIRE compliant: proprietary platforms"
 
OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...
OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...
OSDC 2011 | Open Source Software for Public Administration by Dr. George Kolo...
 
OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"
OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"
OpenAIRE "How to make your repository OpenAIRE compliant: EPrints"
 
ENP Belgrade WS refinement introduction
ENP Belgrade WS refinement introductionENP Belgrade WS refinement introduction
ENP Belgrade WS refinement introduction
 
Science Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth SciencesScience Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth Sciences
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and Ceremony
 
OpenAIRE NOADs
OpenAIRE NOADsOpenAIRE NOADs
OpenAIRE NOADs
 
The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...
 
Europeana Newspapers - the Gateway to European Newspapers Online
Europeana Newspapers - the Gateway to European Newspapers OnlineEuropeana Newspapers - the Gateway to European Newspapers Online
Europeana Newspapers - the Gateway to European Newspapers Online
 
Archiver 3rd omc_project_overview
Archiver 3rd omc_project_overviewArchiver 3rd omc_project_overview
Archiver 3rd omc_project_overview
 
Design phase kick-off event and Ceremony
Design phase kick-off event and CeremonyDesign phase kick-off event and Ceremony
Design phase kick-off event and Ceremony
 
OpenAIRE at WeNMR Kick-off meeting
OpenAIRE at WeNMR Kick-off meetingOpenAIRE at WeNMR Kick-off meeting
OpenAIRE at WeNMR Kick-off meeting
 
Linked Data with hybrid services in Agriculture
Linked Data with hybrid services in AgricultureLinked Data with hybrid services in Agriculture
Linked Data with hybrid services in Agriculture
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award Ceremony
 
How to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliantHow to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliant
 
Webinar on OpenAIRE compatibility for repositories: EPrints repository platform
Webinar on OpenAIRE compatibility for repositories: EPrints repository platform Webinar on OpenAIRE compatibility for repositories: EPrints repository platform
Webinar on OpenAIRE compatibility for repositories: EPrints repository platform
 

Similar a Scape project presentation - Scalable Preservation Environments

Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...
Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...
Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...SCAPE Project
 
LIBER Satellite Event, SCAPE by Sven Schlarb
LIBER Satellite Event, SCAPE by Sven SchlarbLIBER Satellite Event, SCAPE by Sven Schlarb
LIBER Satellite Event, SCAPE by Sven SchlarbSCAPE Project
 
Application scenarios of the SCAPE project at the Austrian National Library
Application scenarios of the SCAPE project at the Austrian National LibraryApplication scenarios of the SCAPE project at the Austrian National Library
Application scenarios of the SCAPE project at the Austrian National LibrarySven Schlarb
 
SCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositoriesSCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositoriesSCAPE Project
 
Hadoop and its applications at the State and University Library, SCAPE Inform...
Hadoop and its applications at the State and University Library, SCAPE Inform...Hadoop and its applications at the State and University Library, SCAPE Inform...
Hadoop and its applications at the State and University Library, SCAPE Inform...SCAPE Project
 
SCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs AvailableSCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs AvailableSCAPE Project
 
Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...
Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...
Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...SCAPE Project
 
Content profiling and C3PO
Content profiling and C3POContent profiling and C3PO
Content profiling and C3POSCAPE Project
 
SCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation EnvironmentsSCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation EnvironmentsSCAPE Project
 
Scape information day at BL - Using Jpylyzer and Schematron for validating JP...
Scape information day at BL - Using Jpylyzer and Schematron for validating JP...Scape information day at BL - Using Jpylyzer and Schematron for validating JP...
Scape information day at BL - Using Jpylyzer and Schematron for validating JP...SCAPE Project
 
IMPACT at OCR Summit
IMPACT at OCR SummitIMPACT at OCR Summit
IMPACT at OCR Summitcneudecker
 
Taverna Components: The Basics
Taverna Components: The BasicsTaverna Components: The Basics
Taverna Components: The BasicsDonal Fellows
 
OCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUS
OCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUSOCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUS
OCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUSOW2
 
Refinement of Digitised Newspapers
Refinement of Digitised NewspapersRefinement of Digitised Newspapers
Refinement of Digitised Newspaperscneudecker
 
What is Hadoop?
What is Hadoop?What is Hadoop?
What is Hadoop?cneudecker
 
SCAPE Information Day at BL - Large Scale Processing with Hadoop
SCAPE Information Day at BL - Large Scale Processing with HadoopSCAPE Information Day at BL - Large Scale Processing with Hadoop
SCAPE Information Day at BL - Large Scale Processing with HadoopSCAPE Project
 
An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...cneudecker
 
Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013Europeana Newspapers
 
20110712.we nmr.utrecht
20110712.we nmr.utrecht20110712.we nmr.utrecht
20110712.we nmr.utrechtNuno Ferreira
 

Similar a Scape project presentation - Scalable Preservation Environments (20)

Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...
Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...
Policy driven validation of JPEG 2000 files based on Jpylyzer, SCAPE Informat...
 
LIBER Satellite Event, SCAPE by Sven Schlarb
LIBER Satellite Event, SCAPE by Sven SchlarbLIBER Satellite Event, SCAPE by Sven Schlarb
LIBER Satellite Event, SCAPE by Sven Schlarb
 
Application scenarios of the SCAPE project at the Austrian National Library
Application scenarios of the SCAPE project at the Austrian National LibraryApplication scenarios of the SCAPE project at the Austrian National Library
Application scenarios of the SCAPE project at the Austrian National Library
 
SCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositoriesSCAPE Webinar: Tools for uncovering preservation risks in large repositories
SCAPE Webinar: Tools for uncovering preservation risks in large repositories
 
Hadoop and its applications at the State and University Library, SCAPE Inform...
Hadoop and its applications at the State and University Library, SCAPE Inform...Hadoop and its applications at the State and University Library, SCAPE Inform...
Hadoop and its applications at the State and University Library, SCAPE Inform...
 
SCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs AvailableSCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs Available
 
Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...
Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...
Integrating the Fedora based DOMS repository with Hadoop, SCAPE Information D...
 
Content profiling and C3PO
Content profiling and C3POContent profiling and C3PO
Content profiling and C3PO
 
SCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation EnvironmentsSCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation Environments
 
Scape information day at BL - Using Jpylyzer and Schematron for validating JP...
Scape information day at BL - Using Jpylyzer and Schematron for validating JP...Scape information day at BL - Using Jpylyzer and Schematron for validating JP...
Scape information day at BL - Using Jpylyzer and Schematron for validating JP...
 
IMPACT at OCR Summit
IMPACT at OCR SummitIMPACT at OCR Summit
IMPACT at OCR Summit
 
Taverna Components: The Basics
Taverna Components: The BasicsTaverna Components: The Basics
Taverna Components: The Basics
 
OCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUS
OCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUSOCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUS
OCEAN - support for Open Cloud projects, Yuri Glikman, Fraunhofer FOKUS
 
ExPaNDS
ExPaNDSExPaNDS
ExPaNDS
 
Refinement of Digitised Newspapers
Refinement of Digitised NewspapersRefinement of Digitised Newspapers
Refinement of Digitised Newspapers
 
What is Hadoop?
What is Hadoop?What is Hadoop?
What is Hadoop?
 
SCAPE Information Day at BL - Large Scale Processing with Hadoop
SCAPE Information Day at BL - Large Scale Processing with HadoopSCAPE Information Day at BL - Large Scale Processing with Hadoop
SCAPE Information Day at BL - Large Scale Processing with Hadoop
 
An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...
 
Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013
 
20110712.we nmr.utrecht
20110712.we nmr.utrecht20110712.we nmr.utrecht
20110712.we nmr.utrecht
 

Más de SCAPE Project

SCAPE Information Day at BL - Characterising content in web archives with Nanite
SCAPE Information Day at BL - Characterising content in web archives with NaniteSCAPE Information Day at BL - Characterising content in web archives with Nanite
SCAPE Information Day at BL - Characterising content in web archives with NaniteSCAPE Project
 
SCAPE Information day at BL - Flint, a Format and File Validation Tool
SCAPE Information day at BL - Flint, a Format and File Validation ToolSCAPE Information day at BL - Flint, a Format and File Validation Tool
SCAPE Information day at BL - Flint, a Format and File Validation ToolSCAPE Project
 
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014SCAPE Project
 
Control policy formulation
Control policy formulationControl policy formulation
Control policy formulationSCAPE Project
 
An image based approach for content analysis in document collections
An image based approach for content analysis in document collectionsAn image based approach for content analysis in document collections
An image based approach for content analysis in document collectionsSCAPE Project
 
SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...
SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...
SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...SCAPE Project
 
TAVERNA Components - Semantically annotated and sharable units of functionality
TAVERNA Components - Semantically annotated and sharable units of functionalityTAVERNA Components - Semantically annotated and sharable units of functionality
TAVERNA Components - Semantically annotated and sharable units of functionalitySCAPE Project
 
PDF/A-3 for preservation. Notes on embedded files and JPEG2000
PDF/A-3 for preservation. Notes on embedded files and JPEG2000PDF/A-3 for preservation. Notes on embedded files and JPEG2000
PDF/A-3 for preservation. Notes on embedded files and JPEG2000SCAPE Project
 
Scalable Preservation Workflows
Scalable Preservation WorkflowsScalable Preservation Workflows
Scalable Preservation WorkflowsSCAPE Project
 
Quality assurance for document image collections in digital preservation
Quality assurance for document image collections in digital preservation Quality assurance for document image collections in digital preservation
Quality assurance for document image collections in digital preservation SCAPE Project
 
Digital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPEDigital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPESCAPE Project
 
Large scale preservation workflows with Taverna – SCAPE Training event, Guima...
Large scale preservation workflows with Taverna – SCAPE Training event, Guima...Large scale preservation workflows with Taverna – SCAPE Training event, Guima...
Large scale preservation workflows with Taverna – SCAPE Training event, Guima...SCAPE Project
 
Matchbox tool. Quality control for digital collections – SCAPE Training event...
Matchbox tool. Quality control for digital collections – SCAPE Training event...Matchbox tool. Quality control for digital collections – SCAPE Training event...
Matchbox tool. Quality control for digital collections – SCAPE Training event...SCAPE Project
 
Characterisation - 101. An introduction to the identification and characteris...
Characterisation - 101. An introduction to the identification and characteris...Characterisation - 101. An introduction to the identification and characteris...
Characterisation - 101. An introduction to the identification and characteris...SCAPE Project
 
Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012
Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012
Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012SCAPE Project
 

Más de SCAPE Project (16)

C sz z6
C sz z6C sz z6
C sz z6
 
SCAPE Information Day at BL - Characterising content in web archives with Nanite
SCAPE Information Day at BL - Characterising content in web archives with NaniteSCAPE Information Day at BL - Characterising content in web archives with Nanite
SCAPE Information Day at BL - Characterising content in web archives with Nanite
 
SCAPE Information day at BL - Flint, a Format and File Validation Tool
SCAPE Information day at BL - Flint, a Format and File Validation ToolSCAPE Information day at BL - Flint, a Format and File Validation Tool
SCAPE Information day at BL - Flint, a Format and File Validation Tool
 
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014
 
Control policy formulation
Control policy formulationControl policy formulation
Control policy formulation
 
An image based approach for content analysis in document collections
An image based approach for content analysis in document collectionsAn image based approach for content analysis in document collections
An image based approach for content analysis in document collections
 
SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...
SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...
SCAPE - Skalierbare Langzeitarchivierung (SCAPE - scalable longterm digital p...
 
TAVERNA Components - Semantically annotated and sharable units of functionality
TAVERNA Components - Semantically annotated and sharable units of functionalityTAVERNA Components - Semantically annotated and sharable units of functionality
TAVERNA Components - Semantically annotated and sharable units of functionality
 
PDF/A-3 for preservation. Notes on embedded files and JPEG2000
PDF/A-3 for preservation. Notes on embedded files and JPEG2000PDF/A-3 for preservation. Notes on embedded files and JPEG2000
PDF/A-3 for preservation. Notes on embedded files and JPEG2000
 
Scalable Preservation Workflows
Scalable Preservation WorkflowsScalable Preservation Workflows
Scalable Preservation Workflows
 
Quality assurance for document image collections in digital preservation
Quality assurance for document image collections in digital preservation Quality assurance for document image collections in digital preservation
Quality assurance for document image collections in digital preservation
 
Digital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPEDigital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPE
 
Large scale preservation workflows with Taverna – SCAPE Training event, Guima...
Large scale preservation workflows with Taverna – SCAPE Training event, Guima...Large scale preservation workflows with Taverna – SCAPE Training event, Guima...
Large scale preservation workflows with Taverna – SCAPE Training event, Guima...
 
Matchbox tool. Quality control for digital collections – SCAPE Training event...
Matchbox tool. Quality control for digital collections – SCAPE Training event...Matchbox tool. Quality control for digital collections – SCAPE Training event...
Matchbox tool. Quality control for digital collections – SCAPE Training event...
 
Characterisation - 101. An introduction to the identification and characteris...
Characterisation - 101. An introduction to the identification and characteris...Characterisation - 101. An introduction to the identification and characteris...
Characterisation - 101. An introduction to the identification and characteris...
 
Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012
Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012
Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012
 

Último

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 

Último (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 

Scape project presentation - Scalable Preservation Environments

  • 2. • Your collection of digital data is growing rapidly. • Your preservation activities must become more efficient and more scalable. • You need SCAPE! • The SCAPE project has developed scalable solutions for long-term preservation of large-scale and heterogeneous data sets. 2 Digital Preservation – What do I need? This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 3. 3 What is SCAPE? Its all about scalability! • Scalable services for planning and execution of institutional preservation strategies • Infrastructure for the execution of digital preservation processes on large volumes of data • Existing tools have been improved and extended. • New tools have been developed where necessary. This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 4. 4 What is SCAPE? SCAPE covers a whole digital preservation life cycle • Interconnecting services support the preservation of large repositories of digital objects • Applications support the formulation of preservation policies, decision making and selection of preservation actions This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 5. 5 What is SCAPE? Take your pick – choose what you need! • Use the full set of interconnected SCAPE components or a selected series of SCAPE tools or workflows. • Many SCAPE components can be individually incorporated. This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 6. • All SCAPE solutions arise from real-world challenges at partner institutions. • Each challenge is tested in testbeds at the partner institutions. 6 Solutions Tested in Real Life Web Content Digital Repositories This work was partially supported by the SCAPE Project. Data Centres The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137). Research Data Sets Testbeds
  • 7. Solutions for Content Holders Scalability In four dimensions: Heterogeneity of collections as well as number, size and complexity of objects Automation Through scalable, automated and simple to design preservation workflows Planning Answering core preservation planning questions Integration Through a robust, integrated, open source preservation system 7 This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 8. 8 Overview: SCAPE Architecture This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 9. 9 Overview: SCAPE Components This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137). The SCAPE Platform is a reference architecture for scalable preservation environments
  • 10. 10 Overview: SCAPE Components This work was partially supported by the SCAPE Project. The SCAPE Preservation Components are tools which enhance the functionality of a digital preservation system in: • Scalability • Functional coverage • Quality The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 11. The SCAPE Planning and Watch components address the bottleneck of decision processes and processing information required for decision making 11 Overview: SCAPE Components This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 12. Examples of tools and services 12 This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 13. 13 Scalable Planning and Watch Scout – an Automated Preservation Watch System • Enables you to monitor your collections • Lets you access community knowledge • Collects relevant knowledge and enables automated notification This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 14. 14 Scalable Planning and Watch C3PO – Content Profiling Tool for Preservation Analysis • Analyses characterisation metadata for digital collections • Aggregates and combines the metadata information across collections • Generates a profile of the content set • Allows use of different metadata formats This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 15. 15 Scalable Planning and Watch Plato – Scalable Preservation Planning • Decision-making support tool • Guides you through the preservation planning workflow • Provides trust through controlled experiments and documentation • Provides an executable plan This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 16. 16 Scalable Tools ToMaR – let your Preservation Tools Scale • Run existing tools against large amounts of files • Execute tools in a scalable fashion on a MapReduce cluster • Enable scalable workflows which chain together a set of tools • Process payloads too big to be computed on a single machine This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 17. 17 Preservation Components Pagelyzer – Monitor your Web Content • Detect changes in web pages • Compare web page versions on a large scale • Compare web page rendering in different browsers • Determine appropriate frequency of web harvestings This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 18. 18 Preservation Components Jpylyzer – Easy Validation of JPEG 2000 • Automated JP2 validation and feature extraction • Enables you to confirm whether an image is a valid, intact JP2 file • Reports the key technical properties of the image This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 19. 19 Preservation Components Matchbox – easy Detection of Nearly Duplicate Images • Identify duplicate content, even where files are of different size, format, cropping etc. or scanned from different original copy • Automate quality assurance and reduce manual effort This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 20. 20 Preservation Components xcorrSound – Automate Sound Wave Analysis • Compare two audio files and output the similarity • Detect overlaps in audio files • Detect occurrences of a smaller audio file (e.g. a jingle) within a larger audio file or an index of audio files This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 21. SCAPE tools are published as open source software. Tools and services from SCAPE are sustained by • Open Planets Foundation - address core digital preservation challenges and engage with the community • COPTR - Community Owned digital Preservation Tool Registry 21 Sustainability of Tools and Services This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 22. Sustainability of SCAPE results Ultimate Sustainability goal: • Supporting communities of practice by enabling efficient collaboration during the project and beyond. Open Planets Foundation will take post-project ownership of the outputs, supported by other partners providing specific capabilities. 22 This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 23. Sustainability of SCAPE results Five complementary approaches: • Visibility Providing integrated outreach to multiple audiences to maximise discoverability. • Quality Ensuring that project outputs conform to standards-driven quality assurance. • Training Supporting skills development to further institutional capacity building. • Open licensing Using open licences to encourage the adoption and reuse of project outputs. • Community integration Integrating project outputs into commercial and non-commercial systems and services. 23 This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 24. • EU-funded project under FP7 (Research and Technological Development) • Project runtime: February 2011 to September 2014 • 20 partners from 10 countries - from memory institutions, data centres, research labs, universities, and industrial firms • Public Project materials are licensed under a CC-BY-SA International License 24 About SCAPE This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 25. 25 SCAPE Consortium This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).
  • 26. 26 Additional Sources of Interest • Development Infrastructure • Code repository hosted by the Open Planets Foundation and GitHub • https://github.com/openplanets/scape/ • Development Wiki • http://wiki.opf-labs.org/display/SP/Home This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137). • Tools • http://www.scape-project.eu/tools • Experimental Workflows • http://www.myexperiment.org/search?query=SCAPE&type=all&commit=Search • Publications • http://www.scape-project.eu/category/publication • Public Deliverables • http://www.scape-project.eu/category/deliverable
  • 27. 27 More Information • SCAPE website: www.scape-project.eu • Blog posts and more: www.openplanetsfoundation.com/projects/scape • Tools and Services: https://github.com/openplanets/scape • SCAPE Twitter: @SCAPEProject, #SCAPEProject • SCAPE Newsletter: Sign up via www.scape-project.eu All images © the SCAPE Project or its partners, except images on slides 3, 6 and 26 © www.digitalbevaring.dk This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).