Linked Open Data and innovation: libraries and the Semantic Web
Daniel Vila Suero
dvila@fi.upm.es
03/11/2011

 Ontology Engineering Group, Universidad Politécnica de Madrid
Acknowledgements: to the OEG members who helped prepare these slides
Contents

• Linked Data
• Library Linked Data
   -   W3C Incubator Group
   -   IFLA
   -   Stanford Manifesto
   -   A Bibliographic Framework for the Digital Age
• Use cases, tools and demos




Linked Data



World Wide Web (original vision)




Smart Web, Dumb Web

• The Web is full of "smart" applications
  (search engines, recommenders,
  geolocation, etc.)

• However, there are also situations in which
  the Web's response does not seem to match the
  state of the technology




Smart Web, Dumb Web

• Common problems (user):
   - Inconsistent information across apparently related
     services
   - Need to visit several applications to complete a simple
     task
   - Difficulty finding very specific information
• Common problems (developer):
   -   Heterogeneous formats
   -   Proprietary or hard-to-process formats
   -   Poorly documented APIs
   -   One API, one function, one access method => disconnected
       APIs

KEY: integrating data on the Web in a standard format


What is the Web of Linked Data?

• Ten years have passed since the original vision of the
  Semantic Web.
• So far, few examples with real impact
• The technologies were too complex (they are mature
  today)
• The Linked Data initiative appears in 2006

   - An extension of the current Web where data is published and
     consumed according to 4 principles
     (http://www.w3.org/DesignIssues/LinkedData.html)
Linked Data principles

1. Use URIs to refer to things (resources)
2. Use the HTTP protocol to publish and retrieve resources
      http://dbpedia.org/resource/Tim_Berners-Lee

      http://geo.linkeddata.es/resource/Provincia/Navarra


3. Describe data in a standard format (RDF)
         dbpedia:Tim_Berners-Lee   rdf:type         foaf:Person ;
                                   foaf:surname     "Berners-Lee"@en ;
                                   foaf:givenName   "Tim"@en .


4. Link to other resources through URIs
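
The four principles can be illustrated with a minimal Python sketch. It is an illustration only, not how DBpedia publishes its data: the helper names `expand` and `to_ntriples`, the `ex:` namespace and the `owl:sameAs` link target are assumptions invented for the example, and literal escaping is deliberately left out.

```python
# Prefix table. The rdf, foaf, owl and dbpedia namespaces are the standard ones;
# "ex" is a placeholder dataset invented for this example.
PREFIXES = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "owl": "http://www.w3.org/2002/07/owl#",
    "dbpedia": "http://dbpedia.org/resource/",
    "ex": "http://example.org/people/",
}


def expand(curie):
    """Turn a prefixed name such as 'foaf:Person' into a full HTTP URI."""
    prefix, local = curie.split(":", 1)
    return PREFIXES[prefix] + local


def to_ntriples(triples):
    """Serialize (subject, predicate, object) tuples as N-Triples lines.

    Objects given as (text, lang) tuples become language-tagged literals;
    plain strings are treated as prefixed names and expanded to URIs.
    Literal escaping is not handled in this sketch.
    """
    lines = []
    for s, p, o in triples:
        if isinstance(o, tuple):
            text, lang = o
            obj = '"%s"@%s' % (text, lang)
        else:
            obj = "<%s>" % expand(o)
        lines.append("<%s> <%s> %s ." % (expand(s), expand(p), obj))
    return "\n".join(lines)


TRIPLES = [
    # Principles 1-3: an HTTP URI names the resource, RDF describes it.
    ("dbpedia:Tim_Berners-Lee", "rdf:type", "foaf:Person"),
    ("dbpedia:Tim_Berners-Lee", "foaf:surname", ("Berners-Lee", "en")),
    ("dbpedia:Tim_Berners-Lee", "foaf:givenName", ("Tim", "en")),
    # Principle 4: a link to a resource in another (here invented) dataset.
    ("dbpedia:Tim_Berners-Lee", "owl:sameAs", "ex:tbl"),
]

if __name__ == "__main__":
    print(to_ntriples(TRIPLES))
```

Running the script prints one N-Triples statement per line, each made only of full HTTP URIs and language-tagged literals.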
Traditional Web (documents)


Links to




Linked Data (Web of data)


leads




RDF Book Mashup




Linked Data cloud evolution




Credits: Richard Cyganiak
Linked Open Data 2011




"Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/"
Credits: Frank Van Harmelen
Tools on the Web of Data

• Representing resources: RDF (Resource Description
  Framework)

• Modelling/describing resources: RDFS/OWL

• Querying/retrieving resources: SPARQL and HTTP

• Transforming resources to RDF:
   - Databases: RDB2RML, ODEMapster, R2O
   - MARC21: MARC2LOD
   - Any23
   - XML: GRDDL
   - Etc.
• Methodology: Linked Data lifecycles
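
As a rough illustration of querying over SPARQL and HTTP, the sketch below encodes a SPARQL query as an HTTP GET URL using the `query` parameter of the SPARQL protocol. The endpoint URL, the query itself and the helper name `build_query_url` are assumptions chosen for the example; nothing is sent over the network.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumed endpoint, used only as an example value.
ENDPOINT = "https://dbpedia.org/sparql"

# Example query: ask for the surname of the resource used earlier in the slides.
QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?surname WHERE {
  <http://dbpedia.org/resource/Tim_Berners-Lee> foaf:surname ?surname .
}
""".strip()


def build_query_url(endpoint, query):
    """Encode a SPARQL query as an HTTP GET URL via the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


if __name__ == "__main__":
    url = build_query_url(ENDPOINT, QUERY)
    print(url)
    # Decoding the URL gives back the original query text.
    print(parse_qs(urlsplit(url).query)["query"][0])
```

A client would then fetch this URL with any HTTP library and ask for a result format through the Accept header.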

Library Linked Data
Creating a “knowledge-generating engine”




Library Linked Data is here

• Growing interest in Linked Data:

   - Stanford Manifesto
   - IFLA Semantic Web Special Interest Group and RDFS/OWL
     models
   - W3C Incubator Group
   - RDA vocabularies
   - European Librarians supporting Open Licensing
     announcement
   - LOC Bibliographic Framework Initiative




European national libraries: Open Data

• CENL (Conference of European National Librarians)
• 46 National Libraries voted to support open
  licensing
• Data more accessible and reusable
• Keys:
   - Innovation for app development
   - Enrichment of services like Wikipedia with highly curated
     data
   - Generation of relationships across datasets through LOD




Stanford Manifesto
•   Manifesto for Linked Libraries - http://bit.ly/sldw-mf


    1. Publishing data on the Web for discovery over
       preserving it in dark archives.
    2. Continuous improvement of data over waiting to
       publish perfect data.
    3. Semantically structured data over flat
       unstructured data.
    4. Use common vocabularies over rolling your own.
    5. Collaboration over working alone.
    6. Web standards over domain-specific standards.
    7. Use of open, commonly understood licenses over
       closed, local licenses.

LOC: A Bibliographic Framework for the Digital Age

• Bibliographic Framework Initiative

• 31st of October 2011

• APPROACH: Embrace the Web and Linked Data and
  broadly adopted data models (RDF)

• GOAL: move the current library-technological
  environment away from being a niche market unto
  itself to one more readily understandable by present
  and future
   - data creators,
   - data modelers,
   - and software developers.

W3C incubator (XG) activity


• Short-lived working groups: around 1 year



• No delivery of W3C Recommendations, but "innovative
  ideas for specifications, guidelines, and applications that
  are not (or not yet) clear candidates as Web standards"




                                           http://www.w3.org/2005/Incubator/
Library Linked Data incubator

• May 2010 – August 2011
• 51 participants
• 23 W3C member organizations
   VU Amsterdam, INRIA, Library of Congress, JISC, Deutsche
     Nationalbibliothek, DERI Galway, OCLC, Talis, LANL,
     Helsinki University of Technology, University of Edinburgh,
     Universidad Politécnica de Madrid, etc.
• Invited experts from other organizations
   BnF, National Library of Latvia, German National Library of
     Economics, etc.
W3C XG Participants
Alexander Haffner        Guenther Neher                           Marcia Zeng
András Micsik            Herbert Van De Sompel                    Mark van Assem
Andrew Houghton          Hideaki Takeda                           Martin Malmsten
Anette Seiler            Ikki Ohmukai                             Michael Hausenblas
Antoine Isaac            Jeff Young                               Michael Panzer
Asaf Bartov              Joachim Neubert                          Monica Duke
Bernard Vatant           Jodi Schneider                           Nicolas Delaforge
Carlo Meghini            Jon Phipps                               Oreste Signore
Dan Brickley             Jonathan Rees                            Peter Murray
Daniel Vila Suero        Kai Eckert                               Ray Denenberg
Dickson Lukose           Karen Coyle                              Ross Singer
Ed Summers               Kevin Ford                               Stu Weibel
Emmanuelle Bermes        Kim Viljanen                             Thomas Baker
Felix Sasaki             Kosuke Tanabe                            Tod Matola
Fumihiro Kato            Lars Svensson                            Uldis Bojars
Glen Newton              Laszlo Kovacs                            William Waites
Gordon Dunsire           Marcel Ruhl                              Wolfgang Halb

                    Up-to-date list at http://www.w3.org/2000/09/dbwg/details?group=44833
W3C XG Mission

• To help increase global interoperability of library data
  on the Web, by

   - bringing together people involved in Semantic Web
     activities—focusing on Linked Data—in the library
     community and beyond,

   - building on existing initiatives, and

   - identifying collaboration tracks for the future.




W3C XG Results

• Loads of interesting discussions! See public mailing
  list archive: http://lists.w3.org/Archives/Public/public-lld/

• Final report (3 separate documents) 25/10/2011
    1. Final report
    2. Datasets, Value Vocabularies, and Metadata Element Sets
    3. Use Cases report


•   Translation into Spanish coming soon…




W3C XG Final report

• Available at
  http://www.w3.org/2005/Incubator/lld/XGR-lld-20111025/
               BENEFITS

              CURRENT
              SITUATION

      RECOMMENDATIONS

W3C XG Final report: Benefits

BENEFITS for:
• Researchers, students, patrons
• Organizations
• Librarians, archivists and curators
• Developers and vendors




W3C XG Final report: Benefits

• Improved discovery and browsing of data
• Better visibility of library resources (SEO)
• Enriched (scientific) publications




W3C XG Final report: Benefits

• Bottom-up approach to data publication → more actors,
  different views
• Wider choice of vendors and technologies, not only ILS
• More visibility and connectivity → lower infrastructure costs
• "The coolest thing to do with your data will be thought of by
  someone else"




W3C XG Final report: Benefits

• Up-to-date resource descriptions directly citable by
  catalogers, thanks to URIs + RDF
• Reduced redundancy and duplication
• Catalogers' efforts focused on their domain of expertise




W3C XG Final report: Benefits

• Use of well-known Web standards and protocols
• More, and more generic, tools not tied to library-specific
  formats
• Welcomes a much larger developer community




W3C XG Final report: Current situation

      Issues with traditional library data

1. Library data is not integrated with Web resources
2. Library standards are designed only for the library
      community
3. Library data is expressed primarily as natural-language text
4. Library and SemWeb communities use different
      terminology for similar metadata concepts
5. Library technology changes depend on vendor systems
      development
W3C XG Final report: Current situation

     Library Linked Data available today

1. Fewer bibliographic datasets than value vocabularies and element sets

2. Variable quality and support

3. Cross-linking requires further effort and coordination

W3C XG Final report: Current situation

                  Rights issues

1. Rights ownership is complex

2. Data rights may be considered business assets



W3C XG Final report: Recommendations

   Recommendations: Library leadership

1. Identify candidate data sets for early exposure

2. Foster discussion about Open Data and rights



W3C XG Final report: Recommendations

   Recommendations: data and systems designers

1. Design/Test user services based on LD capabilities
2. Develop policies for managing vocabs and URIs
3. Create URIs for the items in library datasets
4. Reuse and Map to existing LD vocabularies
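
One possible way to create URIs for the items in a library dataset is to derive them from a resource type and the local record identifier. The sketch below is only an assumption about how that could look: the base domain, the `/resource/<type>/<id>` pattern and the function name `mint_uri` are all hypothetical, and a real URI policy would be set by the publishing institution.

```python
from urllib.parse import quote

# Placeholder base; a real library would use a domain it controls long term.
BASE = "http://data.example.org"


def mint_uri(kind, local_id, base=BASE):
    """Build an HTTP URI from a resource type and a local record identifier.

    Both parts are percent-encoded so that spaces or slashes in local
    identifiers cannot change the path structure of the URI.
    """
    return "%s/resource/%s/%s" % (
        base.rstrip("/"),
        quote(kind, safe=""),
        quote(str(local_id), safe=""),
    )


if __name__ == "__main__":
    print(mint_uri("person", "XX1234"))
    print(mint_uri("work", "a b/c"))
```

Keeping the pattern in one function makes it easier to apply a single, documented URI policy across a whole dataset.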

W3C XG Final report: Recommendations

   Recommendations: librarians and archivists

1. Preserve LD element sets and value vocabularies
2. Apply library experience in curation and long-term
      preservation to LD datasets




W3C XG Vocabs and Datasets report

• Available at
  http://www.w3.org/2005/Incubator/lld/XGR-lld-vocabdataset-20111025/
• Datasets: British National Bibliography, Europeana LOD, data.bnf.fr ...
• Value vocabularies: LCSH, VIAF, AGROVOC ...
• Element sets
W3C XG Use cases report

• Available at
  http://www.w3.org/2005/Incubator/lld/XGR-lld-usecase-20111025/




W3C XG Use cases report

• 8 Clusters

• 60 Individual use cases from XG participants and
  community

• Generalized (Extracted) use cases for each cluster

• Good place to look for examples, fresh ideas, space
  of innovation and research topics!




W3C XG Use cases report




Generated with TagCrowd
Use cases, tools and applications


Chronicling America




chroniclingamerica.loc.gov
Chronicling America

• Historic newspapers and select digitized newspaper pages
  (+2.5 million), produced by the National Digital Newspaper
  Program

• From 1690 to the present

• Nice example of Linked Data best practices and transparent
  integration

• Linking and describing:
   -   DBpedia
   -   Dublin Core and DCMI Terms
   -   FRBR concepts in RDF
   -   GeoNames
   -   OAI-ORE (more about aggregations below)
   -   OWL
   -   RDA
   -   WorldCat


Chronicling America

Text/html




Chronicling America
         rdf
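
The two views above (text/html and rdf) of the same resource are commonly served through HTTP content negotiation, where the client states the format it wants in the Accept header. The sketch below only builds such requests without sending them; the resource URI and the helper name `build_request` are placeholders, and the slides do not state how Chronicling America implements this.

```python
from urllib.request import Request

# Placeholder URI; substitute the URI of an actual Linked Data resource.
RESOURCE = "http://example.org/resource/some-newspaper"


def build_request(uri, want_rdf=True):
    """Prepare an HTTP request asking for RDF/XML or for HTML."""
    accept = "application/rdf+xml" if want_rdf else "text/html"
    return Request(uri, headers={"Accept": accept})


if __name__ == "__main__":
    for want_rdf in (False, True):
        req = build_request(RESOURCE, want_rdf)
        print(req.full_url, "->", req.get_header("Accept"))
    # Passing a request to urllib.request.urlopen(req) would send it.
```

The same URI therefore serves people (HTML) and programs (RDF), which is what makes the integration transparent.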




Datos.bne.es

• December 2011
• Catalog data from BNE MARC21 to RDF using IFLA
  models
   - Authority records: +5 million
   - Bibliographic records: +8 million


• Release of the MARC2LOD tool (Open Source)

• Public announcement at the BNE: 14th December



BNE data Modelling
MARC2LOD Tool
Flexible tool for transforming MARC21 records to RDF

Allows free selection of any RDFS/OWL set of terms

Easy to handle mappings

Composed of two modules:

         MODULE 1: Mapping templates and report generation
         MODULE 2: RDF Generation and linkage


Three main steps:

         1) Mapping template generation
         2) Mapping assignment by domain experts
         3) RDF generation and linkage
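
The three steps can be sketched in a few lines of Python. This is not the MARC2LOD implementation: the record layout, the function names and the mapping of MARC 100$a and 245$a to Dublin Core terms are hypothetical choices made for the example (the BNE work uses IFLA models), and the linkage part of step 3 is left out.

```python
# Step 2 (hypothetical result): (MARC tag, subfield) -> RDF property,
# as a domain expert might fill in the mapping template.
MAPPING = {
    ("100", "a"): "http://purl.org/dc/terms/creator",
    ("245", "a"): "http://purl.org/dc/terms/title",
}


def template_from_records(records):
    """Step 1: list every (tag, subfield) pair that occurs, for experts to map."""
    return sorted({(tag, code) for rec in records for tag, code, _ in rec["fields"]})


def generate_triples(records, mapping, base="http://data.example.org/resource/"):
    """Step 3: emit (subject, predicate, literal) triples for mapped fields."""
    triples = []
    for rec in records:
        subject = base + rec["id"]  # placeholder URI pattern
        for tag, code, value in rec["fields"]:
            predicate = mapping.get((tag, code))
            if predicate is not None:  # unmapped fields are skipped
                triples.append((subject, predicate, value))
    return triples


# Toy record in a simplified layout (not real MARC21 serialization).
RECORDS = [
    {
        "id": "rec1",
        "fields": [
            ("100", "a", "Cervantes Saavedra, Miguel de"),
            ("245", "a", "Don Quijote de la Mancha"),
            ("260", "b", "Some Publisher"),
        ],
    },
]

if __name__ == "__main__":
    print(template_from_records(RECORDS))
    for triple in generate_triples(RECORDS, MAPPING):
        print(triple)
```

Keeping the mapping as plain data, separate from the generation code, is what lets domain experts change it without touching the program.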
MARC2LOD: Framework overview
MARC2LOD: Mapping templates
Linked Data architecture




DEMO
http://cultura.linkeddata.es/visualizer




map4rdf

                   http://oegdev.dia.fi.upm.es/projects/map4rdf/


• Google Maps viewer for RDF resources
• Resources with spatial information
• Extensible with Google plugins
• Used in other applications such as AEMET and GoodRelations

Architecture: map4rdf -> SPARQL -> Triplestore
DEMO
http://geo.linkeddata.es/browser




Provinces




Capital of Province




Provinces – Industry Production Index




Beaches




Visor: a tool for end-user data exploration
                   VISOR alpha v0.11

•   http://visor.psi.enakting.org/
•   Linked data browser from University of Southampton
•   Multifaceted browsing
•   Configurable for any SPARQL endpoint

• DEMO: http://visor.psi.enakting.org/visor




Aemoo




• http://wit.istc.cnr.it/aemoo/
• Explore knowledge using knowledge patterns
• Uses:
   -   DBpedia
   -   Wikipedia links
   -   Twitter
   -   Google News feed





Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 

Vila LOD-innovacion- bib-semweb-redux

  • 1. Linked Open Data and innovation: libraries and the Semantic Web Daniel Vila Suero dvila@fi.upm.es 03/11/2011 Ontology Engineering Group, Universidad Politécnica de Madrid Acknowledgements: to the OEG members who took part in preparing these slides
  • 2. Contents • Linked Data • Library Linked Data - W3C Incubator Group - IFLA - Stanford Manifesto - A Bibliographic Framework for the Digital Age • Use cases, tools and demos
  • 4. World Wide Web (original vision)
  • 5. Smart Web, Dumb Web • The Web is full of "smart" applications (search engines, recommenders, geolocation, etc.) • However, there are also situations in which the Web's response does not seem to match the state of the technology
  • 6. Smart Web, Dumb Web • Frequent problems (user): - Inconsistent information across apparently related services - Need to visit multiple applications for a simple task - Difficulty finding very specific information KEY: integration of data on the Web in a standard format • Frequent problems (developer): - Heterogeneous formats - Proprietary or hard-to-process formats - Lack of API documentation - 1 API, 1 functionality, 1 access method => disconnected APIs
  • 7. What is the Web of Linked Data? • 10 years have passed since the original vision of the Semantic Web • So far, few examples of real impact • The technologies were too complex (they are mature today) • In 2006 the Linked Data initiative appears - An extension of the current Web where data is published and consumed according to 4 principles (http://www.w3.org/DesignIssues/LinkedData.html)
  • 8. Linked Data principles 1. Use URIs to refer to things (resources) 2. Use the HTTP protocol to publish/retrieve resources http://dbpedia.org/resource/Tim_Berners-Lee http://geo.linkeddata.es/resource/Provincia/Navarra 3. Describe data in a standard format (RDF) dbpedia:Tim_Berners-Lee rdf:type foaf:Person ; foaf:surname "Berners-Lee"@en ; foaf:givenName "Tim"@en . 4. Link to other resources through URIs
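
As a rough illustration of principles 1 to 3, the description of Tim Berners-Lee on the slide can be written as subject/predicate/object triples and serialized as N-Triples. This is a minimal sketch that uses plain Python tuples rather than an RDF library; the helper name `to_ntriples` and the tuple layout are assumptions made for this example, and only the URIs and values come from the slide.

```python
# Illustrative only: plain tuples stand in for an RDF library.
DBPEDIA = "http://dbpedia.org/resource/"
FOAF = "http://xmlns.com/foaf/0.1/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

tim = DBPEDIA + "Tim_Berners-Lee"

# Each triple is (subject URI, predicate URI, object).
# An object is either a URI string or a (text, language) literal.
triples = [
    (tim, RDF_TYPE, FOAF + "Person"),
    (tim, FOAF + "surname", ("Berners-Lee", "en")),
    (tim, FOAF + "givenName", ("Tim", "en")),
]


def to_ntriples(triples):
    """Serialize the triples as N-Triples, one statement per line."""
    lines = []
    for s, p, o in triples:
        if isinstance(o, tuple):
            text, lang = o
            # Escape backslashes and double quotes inside the literal.
            escaped = text.replace("\\", "\\\\").replace('"', '\\"')
            obj = '"%s"@%s' % (escaped, lang)
        else:
            obj = "<%s>" % o
        lines.append("<%s> <%s> %s ." % (s, p, obj))
    return "\n".join(lines)


print(to_ntriples(triples))
```

Because every subject and predicate is a full HTTP URI, any of them can in principle be looked up on the Web, which is what makes the fourth principle (linking to other resources) possible.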
  • 10. Linked Data (Web of Data) (diagram labels: leads, RDF Book Mashup)
  • 11. Linked Data cloud evolution (credits: Richard Cyganiak)
  • 12. Linked Data cloud evolution (credits: Richard Cyganiak)
  • 13. Linked Data cloud evolution (credits: Richard Cyganiak)
  • 14. Linked Data cloud evolution (credits: Richard Cyganiak)
  • 15. Linked Data cloud evolution (credits: Richard Cyganiak)
  • 16. Linked Open Data 2011 "Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/"
  • 17. Credits: Frank Van Harmelen
  • 18. Credits: Frank Van Harmelen
  • 19. Tools in the Web of Data • Representing resources: RDF (Resource Description Framework) • Modeling/describing resources: RDFS/OWL • Querying/retrieving resources: SPARQL and HTTP • Transforming resources to RDF: - Databases: RDB2RML, ODEMapster, R2O - MARC21: MARC2LOD - Any23 - XML: GRDDL - Etc. • Methodology: Linked Data lifecycles
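
To make "querying/retrieving resources: SPARQL and HTTP" concrete, the sketch below builds the GET URL for a SPARQL query, which the SPARQL protocol carries in a `query` parameter. The endpoint URL (DBpedia's public endpoint) and the helper name `build_query_url` are assumptions chosen for illustration; the slide does not name a specific endpoint. The code only builds the URL and does not contact the network.

```python
from urllib.parse import urlencode

# Assumption: DBpedia's public endpoint; any SPARQL endpoint URL works the same way.
ENDPOINT = "http://dbpedia.org/sparql"

QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?surname WHERE {
  <http://dbpedia.org/resource/Tim_Berners-Lee> foaf:surname ?surname .
}
"""


def build_query_url(endpoint, query):
    """Return a GET URL that carries the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = build_query_url(ENDPOINT, QUERY)
print(url)
```

Fetching that URL with an `Accept` header such as `application/sparql-results+json` would normally return the result bindings, although the formats offered depend on the endpoint.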
  • 20. Library Linked Data: creating a “knowledge-generating engine”
  • 21. Library Linked Data is here • Growing interest in Linked Data: - Stanford Manifesto - IFLA Semantic Web Special Interest Group and RDFS/OWL models - W3C Incubator Group - RDA vocabularies - European Librarians supporting Open Licensing announcement - LOC Bibliographic Framework Initiative
  • 22. European national libraries: Open Data • CENL (Conference of European National Librarians) • 46 national libraries voted to support open licensing • Data more accessible and reusable • Keys: - Innovation for app development - Enrichment of services like Wikipedia with highly curated data - Generation of relationships across datasets through LOD
  • 23. Stanford Manifesto • Manifesto for Linked Libraries - http://bit.ly/sldw-mf 1. Publishing data on the Web for discovery over preserving it in dark archives. 2. Continuous improvement of data over waiting to publish perfect data. 3. Semantically structured data over flat unstructured data. 4. Use common vocabularies over rolling your own. 5. Collaboration over working alone. 6. Web standards over domain-specific standards. 7. Use of open, commonly understood licenses over closed, local licenses.
  • 24. LOC: A Bibliographic Framework for the Digital Age • Bibliographic Framework Initiative • 31st of October 2011 • APPROACH: Embrace the Web and Linked Data and broadly adopted data models (RDF) • GOAL: move the current library-technological environment away from being a niche market unto itself to one more readily understandable by present and future - data creators, - data modelers, - and software developers.
  • 25. W3C incubator (XG) activity • Short-lived working groups: around 1 year • No delivery of W3C Recommendations, but "innovative ideas for specifications, guidelines, and applications that are not (or not yet) clear candidates as Web standards" http://www.w3.org/2005/Incubator/
  • 26. Library Linked Data incubator • May 2010 – August 2011 • 51 participants • 23 W3C member organizations: VU Amsterdam, INRIA, Library of Congress, JISC, Deutsche Nationalbibliothek, DERI Galway, OCLC, Talis, LANL, Helsinki University of Technology, University of Edinburgh, Universidad Politécnica de Madrid, etc. • Invited experts from other organizations: BnF, National Library of Latvia, German National Library of Economics, etc.
  • 27. W3C XG Participants: Alexander Haffner, Guenther Neher, Marcia Zeng, András Micsik, Herbert Van De Sompel, Mark van Assem, Andrew Houghton, Hideaki Takeda, Martin Malmsten, Anette Seiler, Ikki Ohmukai, Michael Hausenblas, Antoine Isaac, Jeff Young, Michael Panzer, Asaf Bartov, Joachim Neubert, Monica Duke, Bernard Vatant, Jodi Schneider, Nicolas Delaforge, Carlo Meghini, Jon Phipps, Oreste Signore, Dan Brickley, Jonathan Rees, Peter Murray, Daniel Vila Suero, Kai Eckert, Ray Denenberg, Dickson Lukose, Karen Coyle, Ross Singer, Ed Summers, Kevin Ford, Stu Weibel, Emmanuelle Bermes, Kim Viljanen, Thomas Baker, Felix Sasaki, Kosuke Tanabe, Tod Matola, Fumihiro Kato, Lars Svensson, Uldis Bojars, Glen Newton, Laszlo Kovacs, William Waites, Gordon Dunsire, Marcel Ruhl, Wolfgang Halb. Up-to-date list at http://www.w3.org/2000/09/dbwg/details?group=44833
  • 28. W3C XG Mission • To help increase global interoperability of library data on the Web, by - bringing together people involved in Semantic Web activities—focusing on Linked Data—in the library community and beyond, - building on existing initiatives, and - identifying collaboration tracks for the future.
  • 29. W3C XG Results • Loads of interesting discussions! See public mailing list archive: http://lists.w3.org/Archives/Public/public-lld/ • Final report (3 separate documents) 25/10/2011 1. Final report 2. Datasets, Value Vocabularies, and Metadata Element Sets 3. Use Cases report • Translation into Spanish coming soon…
  • 30. W3C XG Final report • Available at http://www.w3.org/2005/Incubator/lld/XGR-lld-20111025/ • Sections: Benefits, Current situation, Recommendations
  • 31. W3C XG Final report: Benefits for four groups: researchers, students, patrons; organizations; librarians, archivists and curators; developers and vendors
  • 32. W3C XG Final report: Benefits (researchers, students, patrons) • Improved discovery and browsing of data • Better visibility of library resources (SEO) • Enriched (scientific) publications
  • 33. W3C XG Final report: Benefits (organizations) • Bottom-up approach to data publication => more actors, different views • Wider choice of vendors and technologies, not only ILS • More visibility and connectivity => lower infrastructure costs • "The coolest thing to do to your data will be thought by someone else"
  • 34. W3C XG Final report: Benefits (librarians, archivists and curators) • Up-to-date resource descriptions directly citable by catalogers, thanks to URIs + RDF • Reduced redundancy and duplication • Catalogers' efforts focused on their domain of expertise
  • 35. W3C XG Final report: Benefits (developers and vendors) • Use of well-known Web standards and protocols • More and more generic tools, not tied to library-specific formats • Welcomes a much larger developer community
  • 36. W3C XG Final report: Current situation • Issues with traditional library data 1. Library data is not integrated with Web resources 2. Library standards are designed only for the library community 3. Library data is expressed primarily in natural-language text 4. Library and Semantic Web communities use different terminology for similar metadata concepts 5. Library technology changes depend on vendor systems development
  • 37. W3C XG Final report: Current situation • Library Linked Data available today 1. Fewer bibliographic datasets than value vocabularies and element sets 2. Variable quality and support 3. Cross-linking requires further effort and coordination
  • 38. W3C XG Final report: Current situation • Rights issues 1. Rights ownership is complex 2. Data rights may be considered business assets
  • 39. W3C XG Final report: Current situation • Recommendations: library leadership 1. Identify candidate data sets for early exposure 2. Foster discussion about Open Data and rights
  • 40. W3C XG Final report: Current situation • Recommendations: data and systems designers 1. Design and test user services based on Linked Data capabilities 2. Develop policies for managing vocabularies and URIs 3. Create URIs for the items in library datasets 4. Reuse and map to existing Linked Data vocabularies
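
The recommendation to "create URIs for the items in library datasets" can be sketched as a small minting function that turns a resource type and a local record identifier into a stable HTTP URI, following the `/resource/<type>/<id>` pattern of the geo.linkeddata.es example earlier in the deck. The base URI, the function name `mint_uri` and the sample identifier are hypothetical and not taken from the report.

```python
from urllib.parse import quote

# Hypothetical base URI; a real library would use a domain it controls.
BASE = "http://example.org/resource/"


def mint_uri(kind, local_id):
    """Build an HTTP URI from a resource type and a local record identifier."""
    # Percent-encode both parts so spaces or slashes in an identifier
    # cannot change the path structure of the URI.
    return BASE + quote(kind, safe="") + "/" + quote(str(local_id), safe="")


# The identifier below is made up for the example.
print(mint_uri("authority", "XX 17/18"))
```

Keeping the minting rule in one function is one way to support a URI management policy: the same record always yields the same URI, so links from other datasets remain valid.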
  • 41. W3C XG Final report: Current situation • Recommendations: librarians and archivists 1. Preserve Linked Data element sets and value vocabularies 2. Apply library experience in curation and long-term preservation to Linked Data datasets
  • 42. W3C XG Vocabs and Datasets report • Available at http://www.w3.org/2005/Incubator/lld/XGR-lld-vocabdataset-20111025/ • Datasets: British National Bibliography, Europeana LOD, data.bnf.fr ... • Value vocabularies: LCSH, VIAF, AGROVOC … • Element sets
  • 43. W3C XG Use cases report • Available at http://www.w3.org/2005/Incubator/lld/XGR-lld-usecase-20111025/
  • 44. W3C XG Use cases report • 8 clusters • 60 individual use cases from XG participants and community • Generalized (extracted) use cases for each cluster • Good place to look for examples, fresh ideas, space for innovation and research topics!
  • 45. W3C XG Use cases report (word cloud generated with TagCrowd)
  • 46. Use cases, tools and applications
  • 48. Chronicling America • Historic newspapers and select digitized newspaper pages (+2.5 million), produced by the National Digital Newspaper Program • From 1690 to the present • Nice example of Linked Data best practices and transparent integration • Linking and describing: - DBpedia - Dublin Core and DCMI Terms - FRBR concepts in RDF - GeoNames - OAI-ORE (more about aggregations below) - OWL - RDA - WorldCat
  • 51. Datos.bne.es • December 2011 • Catalog data from BNE MARC21 to RDF using IFLA models - Authority records: +5 million - Bibliographic records: +8 million • Release of the MARC2LOD tool (Open Source) • Public announcement at the BNE: 14th December
  • 53. MARC2LOD Tool • Flexible tool for transforming MARC21 records to RDF • Allows free selection of any RDFS/OWL set of terms • Easy-to-handle mappings • Composed of two modules: - MODULE 1: Mapping templates and report generation - MODULE 2: RDF generation and linkage • Three main steps: 1) Mapping template generation 2) Mapping assignment by domain experts 3) RDF generation and linkage
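
The three steps can be pictured with a toy mapping from MARC21 fields to RDF properties. This is not MARC2LOD's actual code or file format: the dictionary layout, the function `record_to_triples`, the sample record and the choice of Dublin Core terms are all assumptions made for illustration (the slides state that the BNE data was mapped to IFLA models).

```python
# Hypothetical mapping template: (MARC tag, subfield code) -> RDF property URI.
# In the workflow on the slide, domain experts would fill in these assignments.
DCT = "http://purl.org/dc/terms/"
MAPPING = {
    ("245", "a"): DCT + "title",    # title statement
    ("100", "a"): DCT + "creator",  # main entry, personal name
}


def record_to_triples(record_uri, fields, mapping):
    """Turn (tag, subfield, value) fields into triples and list unmapped fields."""
    triples, unmapped = [], []
    for tag, code, value in fields:
        prop = mapping.get((tag, code))
        if prop is None:
            # Reported back so that a mapping can be assigned later.
            unmapped.append((tag, code))
        else:
            triples.append((record_uri, prop, value))
    return triples, unmapped


# Made-up sample record, flattened to (tag, subfield, value) tuples.
fields = [
    ("100", "a", "Cervantes Saavedra, Miguel de"),
    ("245", "a", "Don Quijote de la Mancha"),
    ("260", "b", "Example publisher"),  # no mapping defined for this field
]
triples, unmapped = record_to_triples(
    "http://example.org/resource/record/1", fields, MAPPING
)
print(triples)
print(unmapped)
```

Separating the mapping table from the generation code mirrors the two-module split on the slide: the table can be edited by catalogers without touching the code that emits RDF.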
  • 58. map4rdf http://oegdev.dia.fi.upm.es/projects/map4rdf/ • Google Maps viewer of RDF resources • Resources with spatial information • Extensible with Google plugins • Used in other applications like Aemet, GoodRelations • Architecture diagram labels: map4rdf, SPARQL, triplestore
  • 62. Provinces – Industry Production Index
  • 64. Visor: a tool for end user data exploration (VISOR alpha v0.11) • http://visor.psi.enakting.org/ • Linked data browser from University of Southampton • Multifaceted browsing • Configurable for any SPARQL endpoint • DEMO: http://visor.psi.enakting.org/visor
  • 65. Aemoo • http://wit.istc.cnr.it/aemoo/ • Explore knowledge using knowledge patterns • Uses: - DBpedia - Wikipedia links - Twitter - Google News feed