The document introduces the SemLib project, which aims to develop semantic web tools for digital libraries. Specifically, it will develop a semantic annotation system and recommender system. The semantic annotation system will allow users to generate and share structured annotations on digital objects in a way that is published as linked data. This supports engagement of both expert and non-expert users in enriching digital library collections. The document outlines the project details, expected outputs, use cases, system requirements, and proposed annotation and sharing models.
1. Introducing
the
SemLib
Project:
Seman6c
Web
Tools
for
Digital
Libraries
Chris6an
Morbidoni,
Marco
Grassi
and
Michele
Nucci
DII
-‐
Department
of
Informa6on
Engeneering
Università
Politecnica
delle
Marche,
Ancona,
ITALY
2. Outlines
1. DL
Evolu6on
2. Semlib
Project
3. Semlib
Annota6on
System
– Use
cases
– System
Requirements
– Annota6on
Model
– Sharing
Model
– System
Architecture
– Technical
Implementa6on
4. Demo
5. Conclusions
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
3. DL
Evolu6on
Experts
Create Contents
Add Content Add Annotations
Digital Library
Experts Consume Commenting
Contents
Tagging Linking
Consume
Contents
Create Contents
Digital Library
Users
Crowdsourcing
Consume Commenting
Contents
Experts Tagging Linking
Consume
Contents
• Users
engagement
is
employed
Create Contents
Users
in
achieving
a
common
goal
• Enrich
and
create
contents:
Social
Engagement
Digital Library
§ curate
documents
Engagement
Consume Contents
§ transcribe
manuscripts
• Tags,
links,
comments
• Great
poten6al
Users
• I m p r o v e
r e s o u r c e
r a n k i n g ,
§ Ex.
Wikipedia
vs
MS
Encarta
classifica6on
Expert
Model
• Users
annotate
contents
for
their
• Experts
create
contents
and
own
purposes
users
consume
them
Properly
structured
and
machine-‐processable
annota6ons:
Time
-‐ enhance
search
and
browsing
capabili6es
-‐ provide
users
a
more
engaging
experience
-‐ improve
DLs
metadata,
enrich
contents
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
4. Seman6c
Web
Tools
For
DLs
h"p://www.semlibproject.eu/
• R&D
project
supported
by
EU
FP7
Theme:
Research
for
SMEs
(no.
FP7-‐SME-‐2010-‐01-‐262301-‐SEMLIB)
• 24
months
• commenced
in
January
2011.
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
5. SemLib
Expected
Outputs
Seman6c
Seman6c
Recommender
System
Annota6on
System
use
Linked
Data
to
improve
searching
exploit
user-‐generated
RDF
metadata
and
browsing
in
the
DLs
and
publish
it
as
Linked
Data.
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
6. Use
cases
1/2
SMEs’
use-‐cases:
• Different
types
of
users
– Different
technical
exper6ses
– Different
roles
(DL
employers
or
end-‐users)
• Fragments
Annota6on
– Text
fragments
/
Image
regions
/
Video
segments
and
regions
• Seman6c
Tags
– Unambiguous
/
Hierarchical
Vocabularies
– Support
user
in
tag
crea6on
(ex.
sugges6ng
tags)
– Exploi6ng
Similari6es
Among
Digital
Objects
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
7. Use
Case
2/2
• Seman6c
Rela6ons
– express
more
seman6cs
specifying
the
type
of
rela6on
(cite,
define,
etc.)
– answer
more
precise
queries
(“show
me
all
sentences
that
cites
a
given
author)
• Annota6on
Bookmarking
and
Sharing
– organize
annota6ons
in
different
collec6ons
– share
with
other
users
or
user
communi6es
– Support
collabora6ve
annota6on
crea6on
• Annota6ons
as
enrichment
of
DLs
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
8. System
Requirements
1/2
• Modularity
of
the
system
– Customizability
and
Extensibility
– Custom
configura6ons
deliver
domain
specific
annota6on
tools
to
the
users
(including
Javascript
libreries
or
shortcut
to
bookmarklets)
• Seman6cally
structured
annota6ons
• Pluggable
vocabularies
and
ontologies
– Flexibility
and
interoperability
– Different
applica6on
domains
• Annota6on
at
different
level
of
granularity
– Whole
resource
– Resource
Fragments
(Xpointer,
Media
Fragment
URI)
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
9. System
Requirements
2/2
• Annota6on
at
different
level
of
complexity
– Text
Comments
/
Seman6c
Tagging
/
Full
Statement
(S-‐P-‐P)
– Automa6c
named
en66es
extrac6on(ex.
DBpedia
Spotlight)
• Organize
annota6ons
in
Notebook
• Social
Model
for
Annota6on
Sharing
• Support
for
crowdsourced
annota6on
collec6ons
– Annota6on
selec6on
and
cura6on
• Highly
Dynamic
and
Interac6ve
/
Easy
to
use
GUIs
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
10. Annota6on
Model
2011-01-27 10:30:56
ex:MarcoGrassi Annotation 1
dcterms:creator
dcterms:created
Pluggable
Vocabularies
rdfs:label (Ontologies,
SKOS)
An example annotation showing the
annotation model
oac:Annotation
a rdfs:comment
ex:ANNOTATION-ID-1
ex:ANNOTATION-GRAPH-ID-1
oac:hasBody
http://example.com/ http://example.com/
oac:hasTarget mypage.htm#textFragment 1.htm
rdfs:label
http://example.com/ semlib:mentionsPeriod
1.htm
Fragment: Durante gli semlib:Renassance
Alighieri...
oac:hasTarget
Named
oac:hasTarget semlib:hasContent semlib:mentionsAuthor
http://example.com/
mypage.htm#textFragment
"Durante degli Alighieri, commonly
known as Dante, was a major Italian Graph
poet of the Middle Ages."
semlib:DanteAlighieri
semlib:depicts
http://example.com/
img1.jpeg http://example.com/
img1.jpeg
Annota6on
Container
Informa6ve
Content
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
11. Notebooks
2011-01-27 10:30:56
dcterms:creator SINGLE USER
My Example Notebook A RE
dcterms:created SH
U RI
ok
rdfs:label
te bo
An Example Notebook No
used to show the model
WIKI
SHARE
rdfs:comment
NotebookURI
NotebookURI SH COMMUNITIES
AR
No E
te
bo
ok
U RI
PUBLIC
• Sharing
a
notebook
is
as
easy
as
sharing
its
URL
on
the
web
(similarly
to
popular
file
sharing
plalorms)
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
12. Notebook
access
semlib:owns semlib:owns
semlib:owns
semlib:owns
semlib:owns
semlib:owns
semlib:owns
Every
user
can
organize
its
annota6ons
in
different
notebooks
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
13. Notebook
access
semlib:canRead semlib:canRead
semlib:owns semlib:owns
semlib:owns
semlib:owns
semlib:canRead
semlib:CanWrite
semlib:CanWrite
semlib:owns
semlib:owns
semlib:owns
Every
user
can
provide
different
access
rights
for
different
users
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
14. Annota6on
Sharing
Model
Annotation Annotation Annotation
Client Client Client
structured annotations structured annotations
Annotation
trusted/ufficial
Authoring API
annotations
Annotation
Annotation Server Client
Annotation selected
COLLECTIVE KB Consuming API annotations
Third Party
Application
Annotation Annotation Annotation Annotation
Client Client Client Client
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
15. Atomic
Content
• DLs
change
over
6me
2011-01-27 10:30:56 – Presenta6on
restyled
and
content
ex:MarcoGrassi Atomic Content can
be
re-‐organized
Example
dcterms:creator
dcterms:created
rdfs:label • Same
content
can
appear
in
AthomicContentURI different
pages
Necessary
to
unambiguously
dcterms:isPartOf iden6fy
atomic
contents.
Addi6onal
requirement
DLs
include
RDFa
tags
to
wrap
ex:MarcoGrassi
dcterms:creator
atomic
content
TextFragmentURI
dcterms:created
semlib:hasContent
Atomic
contents
have
a
resolvable
2011-01-27 10:30:56 rdfs:label
His Divine Comedy, originally called Commedia and later
Atomic Content
Example
called Divina by Boccaccio, is considered the greatest
literary work composed in the Italian language and a
masterpiece of world literature.
U R I
a s s o c i a t e d
t o
w h i c h
annota6ons
are
amached
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
16. System
General
Architecture
Fragment Handlers Selectors
Text Reconciliation
Pundit
Image Vocabulary
Client
Video Predicate
... ... ... ... ... ...
Annotation Annotation
Viewer Writer
Annotations Users Annotations
Consuming API Management API Authoring API
Server
Storage System
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it
17. Annota6on
Server
HTTP
Annotation Server • Open
Source
Request
HTTP
• RESTful
Web
Service
Client Filters RESTful APIs
Java
Jersey
framework
Response
message
•
• Cross
origin
request
CORE APIs and Managers – CORS
(Cross-‐Origin
Resource
Sharing)
[Filter]
– JSONP
Repository APIs • Sesame
triple
store
Response – SPARQL
and
inference
– Different
sail
are
provided
to
Query
HTTP
implement
different
storages
Storage Server
(OWLIM,MySQL,
PostgreeSQL)
(Triplestore, etc.) – repository
APIs
can
be
easily
extended
for
other
storages
29/09/2011
SDA2011
Introducing
SemLib
Project
m.grassi@univpm.it