NLLC 2011: Memento, Open Annotation, SharedCanvas
1. Memento, Open Annotation, SharedCanvas
Robert Sanderson
rsanderson@lanl.gov
Los Alamos National Laboratory
Herbert Van de Sompel
herbertv@lanl.gov
Los Alamos National Laboratory
With thanks to our collaborators:
• Benjamin Albritton, at Stanford
• Tim Cole, at UIUC
• Jane Hunter, at Queensland
• Michael L Nelson, at Old Dominion
• Many others!
This research is funded in part by the Andrew W.
Mellon Foundation and the Library of Congress.
LANL Proto Team: Memento, Open Annotation, SharedCanvas 1
NLLC 2011, LANL, New Mexico, 6th October 2011
2. Overview … for the Web
• Memento
• Time Travel for the Web
• Uses
• Open Annotation
• Interoperable Annotation for the Web
• Use Case: SharedCanvas
• SharedCanvas
• Distributed Digital Facsimiles for the Web
• Uses
3. Memento: Time Travel for the Web
Memento wants to make it Easy
to Navigate the Web of the Past
Website: http://www.mementoweb.org/
4. Tate Online
[Figure: Tate Online today, a "Select Date" step, and Tate Online as of March 16 2008 from the UK National Archives]
5. Old Versions of Web Pages Exist …
• http://www.cnn.com/ at 2001-09-11: From the Internet Archive
• http://en.wikipedia.org/wiki/September_11_attacks at 2001-12-20: From the Wikipedia history
6. … But are not Integrated with the Web
• We note three issues:
• Cannot talk about a resource as it used to exist
• Cannot navigate to a prior version of a resource, knowing its current version
• Cannot navigate to the current version of a resource, knowing a prior version
7. Memento Framework
• Memento:
• Introduces a uniform version mechanism to the Web: Time
• Does not build new archives, but leverages all existing ones
• Is based on the Architecture of the World Wide Web: resources, representations and links
8. Original Resources and Mementos
9. TimeGate: A Bridge from the Present to the Past
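The TimeGate idea can be sketched in a few lines. This is only an illustration, not the project's implementation: it assumes the client states its desired date in an Accept-Datetime request header, and that the TimeGate answers with the archived copy nearest in time. The archive URIs and the helper names accept_datetime_header and select_memento are hypothetical.

```python
from datetime import datetime, timezone
from email.utils import format_datetime


def accept_datetime_header(when):
    """Build the request header a client could send to a TimeGate.

    `when` must be a timezone-aware datetime; it is converted to GMT.
    """
    return {"Accept-Datetime": format_datetime(when.astimezone(timezone.utc), usegmt=True)}


def select_memento(mementos, when):
    """Pick the archived copy whose capture time is closest to `when`.

    `mementos` maps capture datetimes to Memento URIs (hypothetical data).
    Nearest-in-time selection is an assumption of this sketch.
    """
    closest = min(mementos, key=lambda captured: abs(captured - when))
    return mementos[closest]


# Hypothetical archive holdings for one Original Resource.
holdings = {
    datetime(2007, 1, 1, tzinfo=timezone.utc): "http://archive.example/2007/tate",
    datetime(2008, 3, 10, tzinfo=timezone.utc): "http://archive.example/2008/tate",
    datetime(2010, 6, 1, tzinfo=timezone.utc): "http://archive.example/2010/tate",
}

wanted = datetime(2008, 3, 16, tzinfo=timezone.utc)
headers = accept_datetime_header(wanted)
chosen = select_memento(holdings, wanted)
```

The client only needs the URI of the current resource and a date; which archive holds the best copy is the TimeGate's concern.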
10. Memento Framework in Practice
11. Memento Uses: Browsing with MementoFox
Get it from:
http://bit.ly/memfox
Firefox 3.6+
4000 downloads
400 daily users
12. Memento Uses: Time Series Analysis
Historical data retrieved via Memento from DBPedia archive
13. Memento Uses: Archiving Extent for Scholarship
UNT:
• 72% in archives and/or still exist
• High proportion of archived URLs, possibly due to academic level and general disciplines
arXiv:
• 78% in archives and/or still exist
• 45% still exist, but not archived! Possibly due to high value, but very discipline specific references
14. Web-Centric Annotation: No Persistence
Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14
15. Web-Centric Annotation: No Annotations
Archived page from:
http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html
16. Web-Centric Annotation: Cross-Linking via Memento
17. Scholarly Communication and Annotation
Scholarly Communication is increasingly:
• Online
• Open
• Distributed
• Collaborative
• Data-Oriented
Annotation is a scholarly primitive, spanning discipline and level.
Need to ensure that Digital Annotations fall under these headings!
• Apply the standards and architecture of the World Wide Web to the Annotation use case.
• Even if a scholar doesn't share annotations with others, she will want to access them from different tools and environments.
18. Open Annotation
• Website: http://www.openannotation.org/
• Focus on interoperable sharing of annotations
• Web-centric and open, not locked down silos
• Create, consume and interact in different environments
• Build from a simple model for simple cases, to more detailed for complex scholarly annotation requirements
• Status: Beta, with 9 ongoing funded experiments to inform 1.0
19. Basic Model
The basic model has three resources:
• Annotation (an RDF document)
• Body (the ‘comment’ of the annotation)
• Target (the resource the Body is ‘about’)
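As a rough illustration of the three-resource model, the sketch below holds an Annotation as three URIs and lists the statements that connect them. The property labels "hasBody" and "hasTarget", the class layout, and all example.org URIs are assumptions made for this example, not the published vocabulary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """An annotation document linking a Body to a Target (illustrative only)."""
    uri: str     # URI of the Annotation itself
    body: str    # URI of the comment
    target: str  # URI of the resource the Body is about

    def triples(self):
        """Return subject/predicate/object statements describing the annotation."""
        return [
            (self.uri, "type", "Annotation"),
            (self.uri, "hasBody", self.body),
            (self.uri, "hasTarget", self.target),
        ]


# Hypothetical URIs: each of the three resources has its own identity on the Web.
note = Annotation(
    uri="http://example.org/annotations/1",
    body="http://example.org/comments/1",
    target="http://news.bbc.co.uk/",
)
statements = note.triples()
```

Because Body and Target are identified by URI, either one can live on a different server from the Annotation that links them.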
20. Basic Model Example
21. Segments of Resources
Most annotations are about part of a resource
Different segments for different media types:
• Text: paragraph, arbitrary span of words
• Image: rectangular or arbitrary shaped area
• Audio: start and end time points, track name/number
• Video: area and time points
• Other: slice of a data set, volume in a 3d object, …
22. Constraints
• Introduce a Constraint that describes the segment of interest
• And a ConstrainedTarget that identifies the segment of interest
• Constraints are resources, so can be expressive and detailed
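One way to picture the Constraint and ConstrainedTarget pair is the sketch below, which assumes a rectangular image region. The field names (constrains, constrained_by, x, y, width, height), the contains helper, and the example.org URIs are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Constraint:
    """A resource describing the segment of interest (here: a rectangle)."""
    uri: str
    x: int
    y: int
    width: int
    height: int


@dataclass(frozen=True)
class ConstrainedTarget:
    """Identifies the segment: a full resource plus the Constraint applied to it."""
    uri: str
    constrains: str             # URI of the full resource
    constrained_by: Constraint  # description of the part that is meant


def contains(target, px, py):
    """Check whether a pixel of the full resource lies inside the segment."""
    c = target.constrained_by
    return c.x <= px < c.x + c.width and c.y <= py < c.y + c.height


region = Constraint("http://example.org/constraints/1", x=100, y=50, width=200, height=80)
segment = ConstrainedTarget(
    uri="http://example.org/targets/1",
    constrains="http://example.org/images/page.jpg",
    constrained_by=region,
)
```

An annotation would then point at the ConstrainedTarget URI rather than at the whole image.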
23. Constraint Example
24. Annotation Protocols
Unlike previous systems, Open Annotation does not mandate a protocol.
No reliance on a client/server combination gives the client autonomy.
Instead we promote a publish/subscribe methodology, where annotations may be stored and consumed from anywhere.
Protocol: publish, subscribe, consume tied together
25. Publish/Subscribe Method
We don’t specify how this transfer should occur
publish
26. Publish/Subscribe Method
Nor this.
publish subscribe
27. Publish/Subscribe Method
Nor this.
publish subscribe consume
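Since none of the three transfers is specified, any arrangement is one choice among many. The sketch below shows one possible in-memory arrangement, purely to make the publish, subscribe and consume roles concrete. The AnnotationStore and Consumer classes, the polling approach, and the URIs are all assumptions of this example.

```python
class AnnotationStore:
    """Any web-accessible location where annotations have been published."""

    def __init__(self):
        self._documents = {}

    def publish(self, uri, annotation):
        """Publish: make an annotation document available at a URI."""
        self._documents[uri] = annotation

    def listing(self):
        """Expose the URIs of all published annotations."""
        return sorted(self._documents)

    def fetch(self, uri):
        return self._documents[uri]


class Consumer:
    """A tool that discovers annotations and indexes them by target."""

    def __init__(self):
        self.seen = set()
        self.by_target = {}

    def poll(self, store):
        """Subscribe and consume: pick up annotations not seen before."""
        new = [uri for uri in store.listing() if uri not in self.seen]
        for uri in new:
            annotation = store.fetch(uri)
            self.by_target.setdefault(annotation["target"], []).append(uri)
            self.seen.add(uri)
        return new


store = AnnotationStore()
store.publish(
    "http://example.org/annotations/1",
    {"body": "http://example.org/comments/1", "target": "http://news.bbc.co.uk/"},
)
reader = Consumer()
first_poll = reader.poll(store)
second_poll = reader.poll(store)
```

The publisher and the consumer never talk to each other directly, which is the autonomy the slides describe.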
28. Open Annotation Usage: SharedCanvas
BNF f.fr 113, folio 1 recto
http://www.shared-canvas.org/
Digital surrogates for digitized documents:
• Initial use case: Medieval Manuscripts
A collaborative future:
• Rich landscape of interconnected repositories, with seamless user interfaces
We need:
• Shared description to act as input to shared viewing environments
29. Naïve Approach: Transcribe Images Directly
Align multiple images, pages without images, fragments of pages… ?!
30. Canvas Paradigm
A Canvas is an empty space in which to build up a display
• HTML5, SVG, PDF, … even Powerpoint!
• Can "paint" many different resources, including text, images and
audio, on to a Canvas
We can use a Canvas to represent a page of a manuscript.
Distributed nature is fundamental in the requirements
• Idea: Use Annotations to paint resources and comments
• Annotations target the Canvas instead of the Images
31. Canvas to Page Relationship
The Canvas's top left and bottom right corners
correspond to the corners of the page
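If the Canvas corners stand for the page corners, then any image that depicts the whole page can be placed on the Canvas by simple scaling, whatever its resolution. The sketch below assumes exactly that (an image covering the full page, no cropping or rotation). The function name image_to_canvas and the sizes are hypothetical.

```python
def image_to_canvas(point, image_size, canvas_size):
    """Map a pixel position in a full-page image to Canvas coordinates.

    Assumes the image's corners coincide with the page corners, so the
    mapping is a plain per-axis scale.
    """
    x, y = point
    image_w, image_h = image_size
    canvas_w, canvas_h = canvas_size
    return (x * canvas_w / image_w, y * canvas_h / image_h)


# A 2000x3000 pixel scan placed on a 1000x1500 Canvas (made-up sizes).
top_left = image_to_canvas((0, 0), (2000, 3000), (1000, 1500))
bottom_right = image_to_canvas((2000, 3000), (2000, 3000), (1000, 1500))
middle = image_to_canvas((500, 750), (2000, 3000), (1000, 1500))
```

Two scans of the same page at different resolutions land on the same Canvas coordinates, which is what lets them be aligned.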
32. Annotate Image to Canvas
We can paint the canvas by annotating it with resources.
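Painting by annotation can be pictured as follows: each annotation has an image as its Body and the Canvas as its Target, and a viewer collects every Body aimed at the Canvas it is rendering. This is a simplified sketch. The dictionary layout, the painted_on helper, and all URIs are assumptions for illustration.

```python
def painted_on(canvas_uri, annotations):
    """Return the Body URIs of annotations that target the given Canvas."""
    return [a["body"] for a in annotations if a["target"] == canvas_uri]


canvas = "http://example.org/canvas/f1r"

# Hypothetical annotations, possibly published by different institutions.
annotations = [
    {"body": "http://library-a.example/images/f1r.jpg", "target": canvas},
    {"body": "http://library-b.example/scans/f1r-uv.jpg", "target": canvas},
    {"body": "http://example.org/comments/1", "target": "http://example.org/canvas/f1v"},
]

layers = painted_on(canvas, annotations)
```

Because the images are related only through the Canvas, a new scan from another repository can be added without touching the existing ones.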
33. Annotate Text to Canvas Segment
34. Annotate Choice of Image
35. Annotate Many Texts
40. … of Lab Notebooks and Dissertations?
41. Summary
Memento wants to make it Easy to Navigate the Web of the Past
Open Annotation enables sharing Annotations
between users, tools and environments.
SharedCanvas brings primary research objects to the desktop
in a powerful, extensible and interoperable fashion
42. Thank You!
Me: Robert Sanderson
rsanderson@lanl.gov
azaroth42@gmail.com
@azaroth42
Memento:
Web: http://www.mementoweb.org/
Open Annotation:
Web: http://www.openannotation.org/
SharedCanvas:
Web: http://www.shared-canvas.org/
These Slides: http://slidesha.re/…