Building Bridges: from Europeana Libraries to Europeana Newspapers
Susan Reilly, LIBER
Twitter: @skreilly
IFLA Newspapers/GENLOC, Helsinki, 13th Aug 2012
Overview

About LIBER
Introduction to Europeana Newspapers
The foundation stone: Europeana Libraries




      This project is partially funded under the ICT Policy Support Programme (ICT PSP)
      as part of the Competitiveness and Innovation Framework Programme by the
      European Community http://ec.europa.eu/ict_psp
LIBER & the European Digital Agenda

Association of European Research Libraries
    Our projects:
  Content
       Europeana Libraries
       Europeana Newspapers
  Policy
       MEDOANET
  Infrastructure
       APARSEN
       AAA Study
       ODE
Europeana Newspapers

• 17 partner institutions
• 3 years (2012-2015)
• Aggregation of more than 18 million newspapers
• Will use refinement methods for OCR, OLR (article
  segmentation), named entity recognition (NER) and page class
  recognition
• Survey existing collections in Europe
• Make content accessible


Why newspapers?

“The museum (and the
 newspaper) today seeks
 whatever represents normal life
 in its own native locality and
 with infinite pains its collections
 are arranged in a manner which
 is natural to them in their own
 habitat”
                      Lucy Maynard Salmon (1976) in The Newspaper and the Historian

Europeana Newspapers: where the content comes from…

We are looking for more libraries!

[Map of partner institutions: NLE, LIBER, NLF, SUB HH, NLL, CCS, USAL, NLP, BL, KB, SBB, ONB, UIBK, NLT, BnF, UB, LFT]
What we do with the content

• Select 10 million items to be OCR’d
  • Structural information by UIBK, e.g. headings, table of contents
• Select 2 million items for OCR and OLR
  • Article segmentation and page class recognition by CCS
• Libraries carry out manual correction of recognition and
  segmentation results
• Named entity recognition applied to English, Dutch and
  German material




Making the content accessible

• OCR enables full text searching
• OLR enables more targeted searching (titles and sections)
• NER enables searching by person and place, and the discovery of
  new relationships between entities
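
The three bullets above can be illustrated with a toy index. This is only a minimal sketch, not the project's actual search implementation: the article records, field names (`title`, `section`, `text`, `entities`) and the `build_indexes` helper are all hypothetical, and the data is hand-made. It assumes that OCR yields the article text, OLR yields titles and sections, and NER yields typed entities.

```python
from collections import defaultdict

# Hypothetical, hand-made records standing in for processed newspaper articles.
# All field names and values are illustrative assumptions.
ARTICLES = [
    {
        "id": "a1",
        "title": "Harbour opens in Helsinki",
        "section": "News",
        "text": "The new harbour in Helsinki was opened by Anna Berg.",
        "entities": {"person": ["Anna Berg"], "place": ["Helsinki"]},
    },
    {
        "id": "a2",
        "title": "Market prices",
        "section": "Trade",
        "text": "Grain prices rose in Vienna while the harbour trade slowed.",
        "entities": {"person": [], "place": ["Vienna"]},
    },
]


def tokens(s):
    """Split on whitespace, lower-case, and strip trailing punctuation."""
    return [w.strip(".,").lower() for w in s.split()]


def build_indexes(articles):
    """Build one lookup table per kind of search the slide mentions."""
    idx = {
        "full_text": defaultdict(set),  # word -> ids (needs OCR text)
        "title": defaultdict(set),      # title word -> ids (needs OLR)
        "section": defaultdict(set),    # section name -> ids (needs OLR)
        "entity": defaultdict(set),     # (type, name) -> ids (needs NER)
    }
    for art in articles:
        for word in tokens(art["text"]):
            idx["full_text"][word].add(art["id"])
        for word in tokens(art["title"]):
            idx["title"][word].add(art["id"])
        idx["section"][art["section"].lower()].add(art["id"])
        for etype, names in art["entities"].items():
            for name in names:
                idx["entity"][(etype, name.lower())].add(art["id"])
    return idx


idx = build_indexes(ARTICLES)
print(sorted(idx["full_text"]["harbour"]))            # any article mentioning the word
print(sorted(idx["title"]["harbour"]))                # only articles with it in the title
print(sorted(idx["entity"][("place", "helsinki")]))   # articles about a named place
```

The full-text lookup finds every article that mentions a word, while the title and entity lookups narrow the result to articles where the word is a heading or a recognised place.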




No access without aggregation

• Europeana Libraries
  •   A single library domain aggregator
  •   Content from European research libraries
  •   Full-text search capabilities
  •   Portal for researchers
Access = Sustainability
Access = Visibility




Go to www.theeuropeanlibrary.org
Thank you for your attention!
http://www.libereurope.eu
http://www.europeana-newspapers.eu/
http://www.europeana-libraries.eu/
Hall 4/5, stand H104


Editor's notes

  1. Before we get into the drivers and barriers for data sharing I would like to ‘share’ 2 things about me with you. First of all, I am a librarian. I work as project officer for LIBER, which is the Association of European Research Libraries. We have 380 member libraries from all over Europe. Our projects really focus on developing the role of the library as part of the Europeana Research Infrastructure and they fall into 3 main categories.
  2. To this. How do we get from the image of the research we have built up to a dedicated pan-European research portal with content from practically all the research libraries in Europe, including bibliographic records, full text and special tools for researchers: all the things that we know researchers want? Well, of course I’m going to say through partnership, through enabling national, university and other research libraries to work together to build this service and provide research content in a sustainable manner. Which is what the Europeana Libraries project sets out to do…