SlideShare una empresa de Scribd logo
1 de 40
Descargar para leer sin conexión
Update on Memento
                         http://www.mementoweb.org/


                                  Herbert Van de Sompel
                                     Robert Sanderson
                                      Michael L. Nelson

                                       This research funded by
                                        the Library of Congress


Towards Seamless Navigation
   of the Web of the Past

              Memento Update
  2011 IIPC General Assembly, Den Hague 1
Overview of Memento Framework

Deployment Progress

Memento and Discovery

Memento and Branding

Alternative Web Archiving Strategies



                       Memento Update
           2011 IIPC General Assembly, Den Hague 2
Overview of Memento Framework

Deployment Progress

Memento and Discovery

Memento and Branding

Alternative Web Archiving Strategies



                       Memento Update
           2011 IIPC General Assembly, Den Hague 3
Memento wants to make it easy

to navigate the Web of the Past.




             Memento Update
 2011 IIPC General Assembly, Den Hague 4
Tate Online              Select Date                      Tate Online
  Today                 March 16 2008                    March 16 2008




                                                              From
                                                        National Archives


                          Memento Update
              2011 IIPC General Assembly, Den Hague 5
Versions: Web vs CMS

      World Wide Web                      Content Management Systems

•  Designed to forget about               •  Designed to be aware of all
   prior versions of a resource              versions of a resource

•  Highly Distributed                     •  Self-contained

•  No standard version                    •  Variety of proprietary version
   mechanisms                                mechanisms

•  Standardized interlinking              •  Versions interlinked using
   mechanisms                                proprietary mechanisms


                               Memento Update
                   2011 IIPC General Assembly, Den Hague 6
Versions are not Integrated

                       The Web Architecture has a
                         hard time dealing with the
                         versions that do exist:

                       •  Cannot talk about a resource
                          as it used to exist

                       •  Cannot access a prior version
                          given the current one

                       •  Cannot access the current
                          version given a prior one


            Memento Update
2011 IIPC General Assembly, Den Hague 7
Memento Framework



                       •  Regards the Web as a big
                          Content Management System

                       •  Introduces a uniform
                          capability to access versions
                          on the Web

                       •  Does not build new archives
                          but leverages all systems that
                          host versions



            Memento Update
2011 IIPC General Assembly, Den Hague 8
Memento Framework


                       •  Is Distributed: versions may
                          exist on several servers

                       •  Uses Time as a global
                          version indicator

                       •  Is based on the primitives of
                          the Web: resource, resource
                          state, representation, content
                          negotiation, link




            Memento Update
2011 IIPC General Assembly, Den Hague 9
Memento Interaction Overview




             Memento Update
2011 IIPC General Assembly, Den Hague 10
Original Resource and Versions




             Memento Update
 2011 IIPC General Assembly, Den Hague 11
Bridge from Present to Past




             Memento Update
2011 IIPC General Assembly, Den Hague 12
Bridge from Past to Present




             Memento Update
2011 IIPC General Assembly, Den Hague 13
Memento Framework




             Memento Update
2011 IIPC General Assembly, Den Hague 14
Framework with Multiple Archives




                        Memento Update
           2011 IIPC General Assembly, Den Hague 15
Overview of Memento Framework

Deployment Progress

Memento and Discovery

Memento and Branding

Alternative Web Archiving Strategies



                        Memento Update
           2011 IIPC General Assembly, Den Hague 16
Significant progress has been made towards

seamless navigation of the Web of the Past.




                   Memento Update
      2011 IIPC General Assembly, Den Hague 17
Standardization



                                •  Standardization process started
                                   via the IETF

                                •  Interest from IETF and W3C

                                •  Encouraged by major Web
                                   architects, including: Tim
                                   Berners-Lee, Mark Nottingham,
                                   Michael Hausenblas



https://datatracker.ietf.org/doc/draft-vandesompel-memento/

                      Memento Update
         2011 IIPC General Assembly, Den Hague 18
Memento Clients

                       •  Several client tools developed
                          by us and others

                       •  Add-ons for FireFox
                          (operational) and Internet
                          Explorer (experimental)

                       •  Applications for Android
                          (operational) and iPhone/iPad
                          (in development)

                       •  Paper in current Issue of
                          Code4Lib Journal

   http://www.mementoweb.org/tools/

             Memento Update
2011 IIPC General Assembly, Den Hague 19
Memento Server Support



                       •  Memento-compliant Wayback
                          software:

                            •  In use by Internet Archive

                            •  Available to Web archives,
                               worldwide

                            •  Please experiment with this
                               new 1.6 version!



   http://www.mementoweb.org/tools/

             Memento Update
2011 IIPC General Assembly, Den Hague 20
Memento Server Support (2)




                       •  Plug-in for MediaWiki
                          (operational)

                            •  Used on W3C’s main wiki

                       •  Please install it for your
                          MediaWiki!




   http://www.mementoweb.org/tools/


             Memento Update
2011 IIPC General Assembly, Den Hague 21
Memento Server Validator


                        •  Server side client:
                             •  Attempts to perform all
                                Memento actions against a
                                given URI
                             •  Reports success/failure of
                                the interactions and
                                warnings for optional
                                aspects
                             •  Kept up to date with IETF
                                Internet Draft

http://www.mementoweb.org/tools/validator/


              Memento Update
 2011 IIPC General Assembly, Den Hague 22
Memento Proxy Support

                       •  Several systems that host
                          Mementos made Memento-
                          compliant “by proxy”:

                            •  Many Web Archives that do
                               not yet run Memento-
                               compliant software

                            •  3,000+ MediaWiki systems,
                               including Wikipedia, Wikia

                       •  We would love all of these to
                          become natively Memento
                          compliant!

             Memento Update
2011 IIPC General Assembly, Den Hague 23
Memento Web Site


                       •  Ongoing effort to add materials
                          that support understanding and
                          adoption:

                            •  Introduction to Memento
                            •  How to recognize
                               Mementos, TimeGates,
                               Original Resources?
                            •  Guidelines for servers that
                               host Mementos (Web
                               Archives, CMS, snapshot
                               archives, etc.)

  http://www.mementoweb.org/guide/

             Memento Update
2011 IIPC General Assembly, Den Hague 24
Funding

                       •  2007-2010: US $250K grant
                          from Library of Congress

                            •  Approx. $50K on Memento


                       •  2010-2011: US $1 Million
                          follow-up grant from Library of
                          Congress

                            •  For: Specification, outreach,
                               tool development, further
                               research



             Memento Update
2011 IIPC General Assembly, Den Hague 25
Overview of Memento Framework

Deployment Progress

Memento and Discovery

Memento and Branding

Alternative Web Archiving Strategies



                        Memento Update
           2011 IIPC General Assembly, Den Hague 26
Very few Web sites provide a “timegate” link.

Need additional mechanisms to support Discovery.




                        Memento Update
           2011 IIPC General Assembly, Den Hague 27
Batch Discovery: TimeMaps




                        A TimeMap minimally lists:

•  URI and datetime of Mementos known to an archive
•  URI of Original Resource

    TimeMaps can be aggregated across systems that host Mementos

                               Memento Update
                  2011 IIPC General Assembly, Den Hague 28
Batch Discovery: Feed of TimeMaps

System that hosts Mementos exposes Feed of TimeMaps to
allow applications to remain in sync with its collection:

   •  One Atom entry per Original Resource
   •  The entry links to or includes a TimeMap
   •  The entry's updated changes when additional
       Mementos become available
   •  The ID of the entry is a tag URI based on URI of
       Original Resource
   •  Can be protected, and include license information
   •  Could be anonymized by aggregating service



                          Memento Update
             2011 IIPC General Assembly, Den Hague 29
Batch Discovery: robots.txt

•  robots.txt file is used by Web servers to convey
crawling policies

•  Add a directives to support discovery of TimeGates and
Feeds of TimeMaps


TimeGate: http://dutch.archive.org/timegate/
  Archived: .nl

TimeGate: http://all.archive.org/timegate/
  Archived: *

TimeMapFeed: http://dutch.archive.org/feed/feed1.xml




                           Memento Update
              2011 IIPC General Assembly, Den Hague 30
Overview of Memento Framework

Deployment Progress

Memento and Discovery

Memento and Branding

Alternative Web Archiving Strategies



                        Memento Update
           2011 IIPC General Assembly, Den Hague 31
Memento can recreate pages using
 resources from different archives.

 This poses a branding challenge.




                  Memento Update
     2011 IIPC General Assembly, Den Hague 32
Current Branding Practice for Web Archives

          Page and embedded resources from same Web Archive




  Branding
     for
    page
     and
embedded
 resources
from single
   archive




                                 Memento Update
                    2011 IIPC General Assembly, Den Hague 33
Branding for Web Archives in Memento Mode

       Page and embedded resources from various Web Archives

HTML's
branding



   No
branding



   No
branding


                             Will be researched

                               Memento Update
                  2011 IIPC General Assembly, Den Hague 34
Overview of Memento Framework

Deployment Progress

Memento and Discovery

Memento and Branding

Alternative Web Archiving Strategies



                        Memento Update
           2011 IIPC General Assembly, Den Hague 35
Crawl-based Archives host distinct observations.

 Transactional Archives never miss an update.




                        Memento Update
           2011 IIPC General Assembly, Den Hague 36
Crawl-Based Web Archives




Distinct Observations are Archived for Many Servers



                    Memento Update
       2011 IIPC General Assembly, Den Hague 37
Server-Side Transactional Web Archives




Entire Change History is Archived for a Single Server



                     Memento Update
        2011 IIPC General Assembly, Den Hague 38
Development of Transactional Web Archive Software
Capture:
   •  Apache connection filter module captures URI, headers, body
   •  POSTs in real-time to transactional archive




Access:
   •  Online, real time access via Memento TimeGates
   •  Batch Export via WARC files for long term preservation


                               Memento Update
                  2011 IIPC General Assembly, Den Hague 39
Update on Memento
                             http://mementoweb.org/


                              Herbert Van de Sompel
                                  Robert Sanderson
                                  Michael L. Nelson


Towards Seamless Navigation of
     the Web of the Past

                Memento Update
   2011 IIPC General Assembly, Den Hague 40

Más contenido relacionado

Destacado

NISO/Internet Archive Meeting on Social Bookmarking and Annotation
NISO/Internet Archive Meeting on Social Bookmarking and AnnotationNISO/Internet Archive Meeting on Social Bookmarking and Annotation
NISO/Internet Archive Meeting on Social Bookmarking and AnnotationRobert Sanderson
 
British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)Robert Sanderson
 
Linked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural HeritageLinked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural HeritageRobert Sanderson
 
Open Annotation Core Data Model (tutorial)
Open Annotation Core Data Model (tutorial)Open Annotation Core Data Model (tutorial)
Open Annotation Core Data Model (tutorial)Robert Sanderson
 
Transactional Archiving (Web Archive Globalization Workshop)
Transactional Archiving (Web Archive Globalization Workshop)Transactional Archiving (Web Archive Globalization Workshop)
Transactional Archiving (Web Archive Globalization Workshop)Robert Sanderson
 
Parker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkParker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkRobert Sanderson
 
Open Repositories 2014: Crowdsourced Transcription via IIIF
Open Repositories 2014: Crowdsourced Transcription via IIIFOpen Repositories 2014: Crowdsourced Transcription via IIIF
Open Repositories 2014: Crowdsourced Transcription via IIIFRobert Sanderson
 
W3C Web Annotation WG Update (I Annotate 2016)
W3C Web Annotation WG Update (I Annotate 2016)W3C Web Annotation WG Update (I Annotate 2016)
W3C Web Annotation WG Update (I Annotate 2016)Robert Sanderson
 
IIIF and JSON-LD: LODLAM Training Day
IIIF and JSON-LD: LODLAM Training DayIIIF and JSON-LD: LODLAM Training Day
IIIF and JSON-LD: LODLAM Training DayRobert Sanderson
 
IIIF: The Advantages of APIs
IIIF: The Advantages of APIsIIIF: The Advantages of APIs
IIIF: The Advantages of APIsRobert Sanderson
 
Managing Annotations (OR2016)
Managing Annotations (OR2016)Managing Annotations (OR2016)
Managing Annotations (OR2016)Robert Sanderson
 
Annotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and TriannonAnnotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and TriannonRobert Sanderson
 
IIIF: Discovery of Resources
IIIF: Discovery of ResourcesIIIF: Discovery of Resources
IIIF: Discovery of ResourcesRobert Sanderson
 
Annotating Scholarly Works - the W3C Open Annotation Model
Annotating Scholarly Works - the W3C Open Annotation ModelAnnotating Scholarly Works - the W3C Open Annotation Model
Annotating Scholarly Works - the W3C Open Annotation ModelRobert Sanderson
 
IIIF Foundational Specifications
IIIF Foundational SpecificationsIIIF Foundational Specifications
IIIF Foundational SpecificationsRobert Sanderson
 

Destacado (20)

NISO/Internet Archive Meeting on Social Bookmarking and Annotation
NISO/Internet Archive Meeting on Social Bookmarking and AnnotationNISO/Internet Archive Meeting on Social Bookmarking and Annotation
NISO/Internet Archive Meeting on Social Bookmarking and Annotation
 
British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)
 
OAC Technical Summary
OAC Technical SummaryOAC Technical Summary
OAC Technical Summary
 
Linked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural HeritageLinked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural Heritage
 
Open Annotation Core Data Model (tutorial)
Open Annotation Core Data Model (tutorial)Open Annotation Core Data Model (tutorial)
Open Annotation Core Data Model (tutorial)
 
Transactional Archiving (Web Archive Globalization Workshop)
Transactional Archiving (Web Archive Globalization Workshop)Transactional Archiving (Web Archive Globalization Workshop)
Transactional Archiving (Web Archive Globalization Workshop)
 
Parker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkParker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript Framework
 
8 Panduan Silabus
8 Panduan Silabus8 Panduan Silabus
8 Panduan Silabus
 
Open Repositories 2014: Crowdsourced Transcription via IIIF
Open Repositories 2014: Crowdsourced Transcription via IIIFOpen Repositories 2014: Crowdsourced Transcription via IIIF
Open Repositories 2014: Crowdsourced Transcription via IIIF
 
Espolon Oeste
Espolon OesteEspolon Oeste
Espolon Oeste
 
IIIF: Shared Canvas 2.0
IIIF: Shared Canvas 2.0IIIF: Shared Canvas 2.0
IIIF: Shared Canvas 2.0
 
W3C Web Annotation WG Update (I Annotate 2016)
W3C Web Annotation WG Update (I Annotate 2016)W3C Web Annotation WG Update (I Annotate 2016)
W3C Web Annotation WG Update (I Annotate 2016)
 
IIIF and JSON-LD: LODLAM Training Day
IIIF and JSON-LD: LODLAM Training DayIIIF and JSON-LD: LODLAM Training Day
IIIF and JSON-LD: LODLAM Training Day
 
IIIF: The Advantages of APIs
IIIF: The Advantages of APIsIIIF: The Advantages of APIs
IIIF: The Advantages of APIs
 
Managing Annotations (OR2016)
Managing Annotations (OR2016)Managing Annotations (OR2016)
Managing Annotations (OR2016)
 
Annotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and TriannonAnnotations as Linked Data with Fedora4 and Triannon
Annotations as Linked Data with Fedora4 and Triannon
 
IIIF: Discovery of Resources
IIIF: Discovery of ResourcesIIIF: Discovery of Resources
IIIF: Discovery of Resources
 
Annotating Scholarly Works - the W3C Open Annotation Model
Annotating Scholarly Works - the W3C Open Annotation ModelAnnotating Scholarly Works - the W3C Open Annotation Model
Annotating Scholarly Works - the W3C Open Annotation Model
 
IIIF Foundational Specifications
IIIF Foundational SpecificationsIIIF Foundational Specifications
IIIF Foundational Specifications
 
Introduction to IIIF
Introduction to IIIFIntroduction to IIIF
Introduction to IIIF
 

Similar a Memento Framework Update

Memento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the PastMemento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the PastHerbert Van de Sompel
 
Memento: Updated technical details (May 2011)
Memento: Updated technical details (May 2011)Memento: Updated technical details (May 2011)
Memento: Updated technical details (May 2011)Herbert Van de Sompel
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M usersJongyoon Choi
 
An introduction to honeyclient technology
An introduction to honeyclient technologyAn introduction to honeyclient technology
An introduction to honeyclient technologyAngelo Dell'Aera
 
VA Smalltalk Update ESUG2014
VA Smalltalk Update ESUG2014VA Smalltalk Update ESUG2014
VA Smalltalk Update ESUG2014ESUG
 
#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.
#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.
#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.Paris Open Source Summit
 
VA Smalltalk Update
VA Smalltalk UpdateVA Smalltalk Update
VA Smalltalk UpdateESUG
 
Linux field-update-2015
Linux field-update-2015Linux field-update-2015
Linux field-update-2015Chris Simmonds
 
Open MPI SC'15 State of the Union BOF
Open MPI SC'15 State of the Union BOFOpen MPI SC'15 State of the Union BOF
Open MPI SC'15 State of the Union BOFJeff Squyres
 
IWMW 2002: Web standards briefing (session C2)
IWMW 2002: Web standards briefing (session C2)IWMW 2002: Web standards briefing (session C2)
IWMW 2002: Web standards briefing (session C2)IWMW
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityAccumulo Summit
 
The adoption of FOSS workfows in commercial software development: the case of...
The adoption of FOSS workfows in commercial software development: the case of...The adoption of FOSS workfows in commercial software development: the case of...
The adoption of FOSS workfows in commercial software development: the case of...dmgerman
 
The Source Control Landscape
The Source Control LandscapeThe Source Control Landscape
The Source Control LandscapeLorna Mitchell
 
Tycho - Building plug-ins with Maven
Tycho - Building plug-ins with MavenTycho - Building plug-ins with Maven
Tycho - Building plug-ins with MavenPascal Rapicault
 
DotNetNuke – CMS redefined
DotNetNuke – CMS redefinedDotNetNuke – CMS redefined
DotNetNuke – CMS redefinedCharles Nurse
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataWorks Summit
 
Chicago HUG Presentation Oct 2011
Chicago HUG Presentation Oct 2011Chicago HUG Presentation Oct 2011
Chicago HUG Presentation Oct 2011Abe Taha
 
Content Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ PublishContent Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ PublishJani Tarvainen
 
Vimeo and Open Source (SMPTE Forum 2015)
Vimeo and Open Source (SMPTE Forum 2015)Vimeo and Open Source (SMPTE Forum 2015)
Vimeo and Open Source (SMPTE Forum 2015)Derek Buitenhuis
 

Similar a Memento Framework Update (20)

Memento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the PastMemento: Big Leaps Towards Seamless Navigation of the Web of the Past
Memento: Big Leaps Towards Seamless Navigation of the Web of the Past
 
Memento: Updated technical details (May 2011)
Memento: Updated technical details (May 2011)Memento: Updated technical details (May 2011)
Memento: Updated technical details (May 2011)
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M users
 
Preserving access
Preserving accessPreserving access
Preserving access
 
An introduction to honeyclient technology
An introduction to honeyclient technologyAn introduction to honeyclient technology
An introduction to honeyclient technology
 
VA Smalltalk Update ESUG2014
VA Smalltalk Update ESUG2014VA Smalltalk Update ESUG2014
VA Smalltalk Update ESUG2014
 
#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.
#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.
#OSSPARIS19 - Do not be afraid to be forked ! - YOAV KUTNER, Oro Inc.
 
VA Smalltalk Update
VA Smalltalk UpdateVA Smalltalk Update
VA Smalltalk Update
 
Linux field-update-2015
Linux field-update-2015Linux field-update-2015
Linux field-update-2015
 
Open MPI SC'15 State of the Union BOF
Open MPI SC'15 State of the Union BOFOpen MPI SC'15 State of the Union BOF
Open MPI SC'15 State of the Union BOF
 
IWMW 2002: Web standards briefing (session C2)
IWMW 2002: Web standards briefing (session C2)IWMW 2002: Web standards briefing (session C2)
IWMW 2002: Web standards briefing (session C2)
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & Community
 
The adoption of FOSS workfows in commercial software development: the case of...
The adoption of FOSS workfows in commercial software development: the case of...The adoption of FOSS workfows in commercial software development: the case of...
The adoption of FOSS workfows in commercial software development: the case of...
 
The Source Control Landscape
The Source Control LandscapeThe Source Control Landscape
The Source Control Landscape
 
Tycho - Building plug-ins with Maven
Tycho - Building plug-ins with MavenTycho - Building plug-ins with Maven
Tycho - Building plug-ins with Maven
 
DotNetNuke – CMS redefined
DotNetNuke – CMS redefinedDotNetNuke – CMS redefined
DotNetNuke – CMS redefined
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFi
 
Chicago HUG Presentation Oct 2011
Chicago HUG Presentation Oct 2011Chicago HUG Presentation Oct 2011
Chicago HUG Presentation Oct 2011
 
Content Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ PublishContent Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ Publish
 
Vimeo and Open Source (SMPTE Forum 2015)
Vimeo and Open Source (SMPTE Forum 2015)Vimeo and Open Source (SMPTE Forum 2015)
Vimeo and Open Source (SMPTE Forum 2015)
 

Más de Robert Sanderson

LUX - Cross Collections Cultural Heritage at Yale
LUX - Cross Collections Cultural Heritage at YaleLUX - Cross Collections Cultural Heritage at Yale
LUX - Cross Collections Cultural Heritage at YaleRobert Sanderson
 
Zoom as a Paradigm for Linked Open Usable Data
Zoom as a Paradigm for Linked Open Usable DataZoom as a Paradigm for Linked Open Usable Data
Zoom as a Paradigm for Linked Open Usable DataRobert Sanderson
 
Provenance and Uncertainty in Linked Art
Provenance and Uncertainty in Linked ArtProvenance and Uncertainty in Linked Art
Provenance and Uncertainty in Linked ArtRobert Sanderson
 
Data is our Product: Thoughts on LOD Sustainability
Data is our Product: Thoughts on LOD SustainabilityData is our Product: Thoughts on LOD Sustainability
Data is our Product: Thoughts on LOD SustainabilityRobert Sanderson
 
A Perspective on Wikidata: Ecosystems, Trust, and Usability
A Perspective on Wikidata: Ecosystems, Trust, and UsabilityA Perspective on Wikidata: Ecosystems, Trust, and Usability
A Perspective on Wikidata: Ecosystems, Trust, and UsabilityRobert Sanderson
 
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable Data
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable DataLinked Art: Sustainable Cultural Knowledge through Linked Open Usable Data
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable DataRobert Sanderson
 
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open Data
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open DataIllusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open Data
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open DataRobert Sanderson
 
Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)Robert Sanderson
 
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data Ecosystem
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data EcosystemSanderson CNI 2020 Keynote - Cultural Heritage Research Data Ecosystem
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data EcosystemRobert Sanderson
 
Tiers of Abstraction and Audience in Cultural Heritage Data Modeling
Tiers of Abstraction and Audience in Cultural Heritage Data ModelingTiers of Abstraction and Audience in Cultural Heritage Data Modeling
Tiers of Abstraction and Audience in Cultural Heritage Data ModelingRobert Sanderson
 
The Importance of being LOUD
The Importance of being LOUDThe Importance of being LOUD
The Importance of being LOUDRobert Sanderson
 
Introduction to Linked Art Model
Introduction to Linked Art ModelIntroduction to Linked Art Model
Introduction to Linked Art ModelRobert Sanderson
 
Standards and Communities: Connected People, Consistent Data, Usable Applicat...
Standards and Communities: Connected People, Consistent Data, Usable Applicat...Standards and Communities: Connected People, Consistent Data, Usable Applicat...
Standards and Communities: Connected People, Consistent Data, Usable Applicat...Robert Sanderson
 
Strong Opinions, Weakly Held
Strong Opinions, Weakly HeldStrong Opinions, Weakly Held
Strong Opinions, Weakly HeldRobert Sanderson
 
IIIF Discovery Walkthrough
IIIF Discovery WalkthroughIIIF Discovery Walkthrough
IIIF Discovery WalkthroughRobert Sanderson
 
Linked Art: An Art Museum Profile for CIDOC-CRM
Linked Art: An Art Museum Profile for CIDOC-CRMLinked Art: An Art Museum Profile for CIDOC-CRM
Linked Art: An Art Museum Profile for CIDOC-CRMRobert Sanderson
 
Euromed2018 Keynote: Usability over Completeness, Community over Committee
Euromed2018 Keynote: Usability over Completeness, Community over CommitteeEuromed2018 Keynote: Usability over Completeness, Community over Committee
Euromed2018 Keynote: Usability over Completeness, Community over CommitteeRobert Sanderson
 
Linked Art - Our Linked Open Usable Data Model
Linked Art - Our Linked Open Usable Data ModelLinked Art - Our Linked Open Usable Data Model
Linked Art - Our Linked Open Usable Data ModelRobert Sanderson
 
EuropeanaTech Keynote: Shout it out LOUD
EuropeanaTech Keynote: Shout it out LOUDEuropeanaTech Keynote: Shout it out LOUD
EuropeanaTech Keynote: Shout it out LOUDRobert Sanderson
 

Más de Robert Sanderson (20)

Understanding Linked Art
Understanding Linked ArtUnderstanding Linked Art
Understanding Linked Art
 
LUX - Cross Collections Cultural Heritage at Yale
LUX - Cross Collections Cultural Heritage at YaleLUX - Cross Collections Cultural Heritage at Yale
LUX - Cross Collections Cultural Heritage at Yale
 
Zoom as a Paradigm for Linked Open Usable Data
Zoom as a Paradigm for Linked Open Usable DataZoom as a Paradigm for Linked Open Usable Data
Zoom as a Paradigm for Linked Open Usable Data
 
Provenance and Uncertainty in Linked Art
Provenance and Uncertainty in Linked ArtProvenance and Uncertainty in Linked Art
Provenance and Uncertainty in Linked Art
 
Data is our Product: Thoughts on LOD Sustainability
Data is our Product: Thoughts on LOD SustainabilityData is our Product: Thoughts on LOD Sustainability
Data is our Product: Thoughts on LOD Sustainability
 
A Perspective on Wikidata: Ecosystems, Trust, and Usability
A Perspective on Wikidata: Ecosystems, Trust, and UsabilityA Perspective on Wikidata: Ecosystems, Trust, and Usability
A Perspective on Wikidata: Ecosystems, Trust, and Usability
 
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable Data
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable DataLinked Art: Sustainable Cultural Knowledge through Linked Open Usable Data
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable Data
 
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open Data
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open DataIllusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open Data
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open Data
 
Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)Structural Metadata in RDF (IS575)
Structural Metadata in RDF (IS575)
 
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data Ecosystem
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data EcosystemSanderson CNI 2020 Keynote - Cultural Heritage Research Data Ecosystem
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data Ecosystem
 
Tiers of Abstraction and Audience in Cultural Heritage Data Modeling
Tiers of Abstraction and Audience in Cultural Heritage Data ModelingTiers of Abstraction and Audience in Cultural Heritage Data Modeling
Tiers of Abstraction and Audience in Cultural Heritage Data Modeling
 
The Importance of being LOUD
The Importance of being LOUDThe Importance of being LOUD
The Importance of being LOUD
 
Introduction to Linked Art Model
Introduction to Linked Art ModelIntroduction to Linked Art Model
Introduction to Linked Art Model
 
Standards and Communities: Connected People, Consistent Data, Usable Applicat...
Standards and Communities: Connected People, Consistent Data, Usable Applicat...Standards and Communities: Connected People, Consistent Data, Usable Applicat...
Standards and Communities: Connected People, Consistent Data, Usable Applicat...
 
Strong Opinions, Weakly Held
Strong Opinions, Weakly HeldStrong Opinions, Weakly Held
Strong Opinions, Weakly Held
 
IIIF Discovery Walkthrough
IIIF Discovery WalkthroughIIIF Discovery Walkthrough
IIIF Discovery Walkthrough
 
Linked Art: An Art Museum Profile for CIDOC-CRM
Linked Art: An Art Museum Profile for CIDOC-CRMLinked Art: An Art Museum Profile for CIDOC-CRM
Linked Art: An Art Museum Profile for CIDOC-CRM
 
Euromed2018 Keynote: Usability over Completeness, Community over Committee
Euromed2018 Keynote: Usability over Completeness, Community over CommitteeEuromed2018 Keynote: Usability over Completeness, Community over Committee
Euromed2018 Keynote: Usability over Completeness, Community over Committee
 
Linked Art - Our Linked Open Usable Data Model
Linked Art - Our Linked Open Usable Data ModelLinked Art - Our Linked Open Usable Data Model
Linked Art - Our Linked Open Usable Data Model
 
EuropeanaTech Keynote: Shout it out LOUD
EuropeanaTech Keynote: Shout it out LOUDEuropeanaTech Keynote: Shout it out LOUD
EuropeanaTech Keynote: Shout it out LOUD
 

Memento Framework Update

  • 1. Update on Memento http://www.mementoweb.org/ Herbert Van de Sompel Robert Sanderson Michael L. Nelson This research funded by the Library of Congress Towards Seamless Navigation of the Web of the Past Memento Update 2011 IIPC General Assembly, Den Hague 1
  • 2. Overview of Memento Framework Deployment Progress Memento and Discovery Memento and Branding Alternative Web Archiving Strategies Memento Update 2011 IIPC General Assembly, Den Hague 2
  • 3. Overview of Memento Framework Deployment Progress Memento and Discovery Memento and Branding Alternative Web Archiving Strategies Memento Update 2011 IIPC General Assembly, Den Hague 3
  • 4. Memento wants to make it easy to navigate the Web of the Past. Memento Update 2011 IIPC General Assembly, Den Hague 4
  • 5. Tate Online Select Date Tate Online Today March 16 2008 March 16 2008 From National Archives Memento Update 2011 IIPC General Assembly, Den Hague 5
  • 6. Versions: Web vs CMS World Wide Web Content Management Systems •  Designed to forget about •  Designed to be aware of all prior versions of a resource versions of a resource •  Highly Distributed •  Self-contained •  No standard version •  Variety of proprietary version mechanisms mechanisms •  Standardized interlinking •  Versions interlinked using mechanisms proprietary mechanisms Memento Update 2011 IIPC General Assembly, Den Hague 6
  • 7. Versions are not Integrated The Web Architecture has a hard time dealing with the versions that do exist: •  Cannot talk about a resource as it used to exist •  Cannot access a prior version given the current one •  Cannot access the current version given a prior one Memento Update 2011 IIPC General Assembly, Den Hague 7
  • 8. Memento Framework •  Regards the Web as a big Content Management System •  Introduces a uniform capability to access versions on the Web •  Does not build new archives but leverages all systems that host versions Memento Update 2011 IIPC General Assembly, Den Hague 8
  • 9. Memento Framework •  Is Distributed: versions may exist on several servers •  Uses Time as a global version indicator •  Is based on the primitives of the Web: resource, resource state, representation, content negotiation, link Memento Update 2011 IIPC General Assembly, Den Hague 9
  • 10. Memento Interaction Overview Memento Update 2011 IIPC General Assembly, Den Hague 10
  • 11. Original Resource and Versions Memento Update 2011 IIPC General Assembly, Den Hague 11
  • 12. Bridge from Present to Past Memento Update 2011 IIPC General Assembly, Den Hague 12
  • 13. Bridge from Past to Present Memento Update 2011 IIPC General Assembly, Den Hague 13
  • 14. Memento Framework Memento Update 2011 IIPC General Assembly, Den Hague 14
  • 15. Framework with Multiple Archives Memento Update 2011 IIPC General Assembly, Den Hague 15
  • 16. Overview of Memento Framework Deployment Progress Memento and Discovery Memento and Branding Alternative Web Archiving Strategies Memento Update 2011 IIPC General Assembly, Den Hague 16
  • 17. Significant progress has been made towards seamless navigation of the Web of the Past. Memento Update 2011 IIPC General Assembly, Den Hague 17
  • 18. Standardization •  Standardization process started via the IETF •  Interest from IETF and W3C •  Encouraged by major Web architects, including: Tim Berners-Lee, Mark Nottingham, Michael Hausenblas https://datatracker.ietf.org/doc/draft-vandesompel-memento/ Memento Update 2011 IIPC General Assembly, Den Hague 18
  • 19. Memento Clients •  Several client tools developed by us and others •  Add-ons for FireFox (operational) and Internet Explorer (experimental) •  Applications for Android (operational) and iPhone/iPad (in development) •  Paper in current Issue of Code4Lib Journal http://www.mementoweb.org/tools/ Memento Update 2011 IIPC General Assembly, Den Hague 19
  • 20. Memento Server Support •  Memento-compliant Wayback software: •  In use by Internet Archive •  Available to Web archives, worldwide •  Please experiment with this new 1.6 version! http://www.mementoweb.org/tools/ Memento Update 2011 IIPC General Assembly, Den Hague 20
  • 21. Memento Server Support (2) •  Plug-in for MediaWiki (operational) •  Used on W3C’s main wiki •  Please install it for your MediaWiki! http://www.mementoweb.org/tools/ Memento Update 2011 IIPC General Assembly, Den Hague 21
  • 22. Memento Server Validator •  Server side client: •  Attempts to perform all Memento actions against a given URI •  Reports success/failure of the interactions and warnings for optional aspects •  Kept up to date with IETF Internet Draft http://www.mementoweb.org/tools/validator/ Memento Update 2011 IIPC General Assembly, Den Hague 22
  • 23. Memento Proxy Support •  Several systems that host Mementos made Memento- compliant “by proxy”: •  Many Web Archives that do not yet run Memento- compliant software •  3,000+ MediaWiki systems, including Wikipedia, Wikia •  We would love all of these to become natively Memento compliant! Memento Update 2011 IIPC General Assembly, Den Hague 23
  • 24. Memento Web Site •  Ongoing effort to add materials that support understanding and adoption: •  Introduction to Memento •  How to recognize Mementos, TimeGates, Original Resources? •  Guidelines for servers that host Mementos (Web Archives, CMS, snapshot archives, etc.) http://www.mementoweb.org/guide/ Memento Update 2011 IIPC General Assembly, Den Hague 24
  • 25. Funding •  2007-2010: US $250K grant from Library of Congress •  Approx. $50K on Memento •  2010-2011: US $1 Million follow-up grant from Library of Congress •  For: Specification, outreach, tool development, further research Memento Update 2011 IIPC General Assembly, Den Hague 25
  • 26. Overview of Memento Framework Deployment Progress Memento and Discovery Memento and Branding Alternative Web Archiving Strategies Memento Update 2011 IIPC General Assembly, Den Hague 26
  • 27. Very few Web sites provide a “timegate” link. Need additional mechanisms to support Discovery. Memento Update 2011 IIPC General Assembly, Den Hague 27
  • 28. Batch Discovery: TimeMaps A TimeMap minimally lists: •  URI and datetime of Mementos known to an archive •  URI of Original Resource TimeMaps can be aggregated across systems that host Mementos Memento Update 2011 IIPC General Assembly, Den Hague 28
  • 29. Batch Discovery: Feed of TimeMaps System that hosts Mementos exposes Feed of TimeMaps to allow applications to remain in sync with its collection: •  One Atom entry per Original Resource •  The entry links to or includes a TimeMap •  The entry's updated changes when additional Mementos become available •  The ID of the entry is a tag URI based on URI of Original Resource •  Can be protected, and include license information •  Could be anonymized by aggregating service Memento Update 2011 IIPC General Assembly, Den Hague 29
  • 30. Batch Discovery: robots.txt •  robots.txt file is used by Web servers to convey crawling policies •  Add a directives to support discovery of TimeGates and Feeds of TimeMaps TimeGate: http://dutch.archive.org/timegate/ Archived: .nl TimeGate: http://all.archive.org/timegate/ Archived: * TimeMapFeed: http://dutch.archive.org/feed/feed1.xml Memento Update 2011 IIPC General Assembly, Den Hague 30
  • 31. Overview of Memento Framework Deployment Progress Memento and Discovery Memento and Branding Alternative Web Archiving Strategies Memento Update 2011 IIPC General Assembly, Den Hague 31
  • 32. Memento can recreate pages using resources from different archives. This poses a branding challenge. Memento Update 2011 IIPC General Assembly, Den Hague 32
  • 33. Current Branding Practice for Web Archives Page and embedded resources from same Web Archive Branding for page and embedded resources from single archive Memento Update 2011 IIPC General Assembly, Den Hague 33
  • 34. Branding for Web Archives in Memento Mode Page and embedded resources from various Web Archives HTML's branding No branding No branding Will be researched Memento Update 2011 IIPC General Assembly, Den Hague 34
  • 35. Overview of Memento Framework Deployment Progress Memento and Discovery Memento and Branding Alternative Web Archiving Strategies Memento Update 2011 IIPC General Assembly, Den Hague 35
  • 36. Crawl-based Archives host distinct observations. Transactional Archives never miss an update. Memento Update 2011 IIPC General Assembly, Den Hague 36
  • 37. Crawl-Based Web Archives Distinct Observations are Archived for Many Servers Memento Update 2011 IIPC General Assembly, Den Hague 37
  • 38. Server-Side Transactional Web Archives Entire Change History is Archived for a Single Server Memento Update 2011 IIPC General Assembly, Den Hague 38
  • 39. Development of Transactional Web Archive Software Capture: •  Apache connection filter module captures URI, headers, body •  POSTs in real-time to transactional archive Access: •  Online, real time access via Memento TimeGates •  Batch Export via WARC files for long term preservation Memento Update 2011 IIPC General Assembly, Den Hague 39
  • 40. Update on Memento http://mementoweb.org/ Herbert Van de Sompel Robert Sanderson Michael L. Nelson Towards Seamless Navigation of the Web of the Past Memento Update 2011 IIPC General Assembly, Den Hague 40