LinkedIn emplea cookies para mejorar la funcionalidad y el rendimiento de nuestro sitio web, así como para ofrecer publicidad relevante. Si continúas navegando por ese sitio web, aceptas el uso de cookies. Consulta nuestras Condiciones de uso y nuestra Política de privacidad para más información.
LinkedIn emplea cookies para mejorar la funcionalidad y el rendimiento de nuestro sitio web, así como para ofrecer publicidad relevante. Si continúas navegando por ese sitio web, aceptas el uso de cookies. Consulta nuestra Política de privacidad y nuestras Condiciones de uso para más información.
Apologies if the colour is a bit blinding. Metadata is one of those things that people take for granted until it goes wrong, so we wanted something eye catching. And the future is bright for metadata. My purpose today is not to talk about linked data – there are many speakers who can do so much more knowledgably than I can. It is really to reflect on the place of linked data in the library’s metadata strategy.
First: a quick introduction to the Library The British Library is the U.K.’s national library and also a legal deposit library. The Library currently has around 1,500 staff operating on three sites in Stockton on Tees, Boston Spa and London. The main cataloguing department is based in Boston Spa, but there are many specialist cataloguers in London
The Library was created in 1973 as the result of an Act of Parliament to amalgamate many existing institutions. Under the Act, the Library is “the national centre for bibliographical & other information services”
From an external perspective, the Library has charged for many of its bibliographic services, but over recent years in line with government policy, barriers to access have been removed. We began to offer open data in 210 and Linked Open Data form 2011.
In 2015 we published our first Collection Metadata Strategy. I’ll come to what the strategy actually is, a bit later, but first, why did we decide that we needed a strategy.
“Why” was the hardest question to answer. In fact, it needed a separate document to justify the need for a strategy.
The Library faces new challenges. In particular those posed by the extension of legal deposit to non-print media. In 2013 the scope of legal deposit was extended to non-print material. This means that publishers can be switched from deposit of printed books and serials to deposit of the e-book or e-journal. The rate of transition is under the Library’s control, which enables us to develop our infrastructure. However, we are already seeing interesting challenges. Based on the contribution from a very small subset of publishers over May 2015 to January 2016 we received 50,000 e books. This corresponds to about 50% of annual intake of printed books. Closer analysis of what has been deposited shows that we are receiving materials which are in effect re-issues of printed books and because of the international scope of publishing, we are receiving materials which have no UK imprint, but are distributed in UK. For much of this additional material we may already have records for the print manifestation, which we should be able to reuse and adapt.
There are also long standing challenges. We still have printed catalogues. Around 2 million items can only be found by consulting a printed catalogue in the reading room. There is a smaller but unquantified number of catalogues and finding aids on electronic media that need to be migrated to strategic repositories. There are some collections that remain uncatalogued.
A pervasive problem is that a lot of legacy metadata, particularly retrospectively converted metadata falls far short of current standards.
Here is legacy metadata: this record was retrospectively converted from the British Museum catalogue. As the cataloguers among you will recognise, there are some omissions. Where is the language of content? Where is the country of publication? Where is the subject data? This information wasn’t recorded in the original catalogue. Taking language as an example, the orange line shows that for current processing 100% of published resources are coded for language of content. However, if we look at the foundation or legacy catalogues, this drops to under 30%, which has implications for services and for the metadata to inform research and collection management or to visualise the collection. How can you build a linguistic picture of the collection, if the language is not explicitly recorded. This metadata is not capable of answering the questions posed by users and staff.
Metadata is increasing in importance but it was recognised that within the organization responsibility for metadata was divided and the boundaries were unclear. Addressing the challenges would be impossible unless we could also break down silos.
In effect each content stream has its own input format and its own system. Each content stream is therefore a distinct silo, usually with its own workflow and specialist staff.
The persistence of silos at the discovery layer prevent cross searching of the whole collection but the complexity they create at every level is a barrier to efficiency.
In 2004 we implemented Aleph, which removed significant long standing silos based around printed collections. This has created much more uniform metadata and more efficient processes, but many content streams and processes were out of scope and thus silos have been perpetuated.
To some extent these silos reflect historic divisions between different components of the Library and responsibility has been distributed between different services and departments.
How did we do it?
It helps to scare people a bit at first – look at all the stuff we couldn’t do without metadata. But you also have to offer solutions and hope!
The following diagram gives an overview of the range of external stakeholders
And internal stakeholders who are increasingly dependent on metadata to deliver services, and the Library’s strategic objectives.
The fundamental idea behind the evolving strategy was to treat Collection Metadata as a strategic asset of the Library, on a par with the collection, the staff and the estate. We deliberately limited the scope to metadata about the collection. Collection metadata identifies attributes and relationships of collection resources; location and availability of collection resources and status and rights associate with them
Like the collection or any other asset collection metadata needs careful stewarship over time and this in turn needs clear leadership and adequate resourcing.
Like any other asset, investment should be rewarded by improvements in efficiency, better services and more value for money. An important contention is that the metadata should be made to work much harder.
What’s in the strateggy?
The vision provides a view of where we want to be in 2020, which aligns closely with the Library’s Strategy, Living Knowledge and its core programmes.
And a plan…
If you search the document for the phrase “linked data”, you won’t find it. Nor will you find an explicit mention of RDF. There are lots of references to opening but none to linking. We see opening data as a strategic objective, whereas linked data is a technology whose potential we can exploit.
We will continue to develop the BNB Linked data offering
We have been doing some work on linked data. We’ve added URIs for ISNIs to the LOD BNB. We’ve been working with our linked data service provider TSO to improve our understanding of demand for the linked data service and we’ve also worked with Fujitsu Ireland to analyse the statistics in more depth to discover who is using the service and how. A lesson is that although the linked data offering is being used, it is very difficult to analyse by whom and for what reason. It is clear that for many researchers who may be interested in our data, linked data poses a significant technical barrier.
So linked data is just one of the channels through which we will publish our metadata.
Our free Z39.50 service remains very heavily used and continues to grow. We have more than 1500 users in 177 countries, many in singleton posts or small libraries, who reuse our bibliographic records in their own institutions. More recently we have begun to make the data available to researchers as csv, which allows them to work with desktop tools such as Excel or Open Refine. Datasets are available from our downloads page. New data sets are added to complement exhibitions, such as Shakespeare, Punk or in relation to landmark anniversaries, such as Great Fire of London. These include archival and mss resources as well as published books, maps and scores and are certainly being used. We work with government and professional bodies to promote these offerings. We also create bespoke datasets to meet specific needs and for collaborations with students and even our artist in residence.
To realise the strategic vision and deliver the objectives we have set, we have to break down the silos. The strategy provides a framework within which we can encourage the convergence of standards where appropriate and improve our metadata to make it more actionable.
New synergies will emerge offering the potential for better services and greater efficiency.
There is an opportunity arising from end of life of out major systems Aleph, Symphony and IAMs which will all need to be replace within the next few years. We can influence but not determine the future of the library’s major systems, which all need to be replaced.
We have raised the awareness of collection metadata within the institution. We are also better informed of new requirements and receive earlier notice which allows us to influence as well as respond.
It is clear that metadata is central to the delivery of many corporate strategic objectives. This has increased demand, but also substantiates bids for additional resource.
Convergence creates new synergies. The vision to bring our metadata together informs decisions about system architecture.
Unlocking the value : metadata and linked data at the British Library / Alan Danskin (British Library)
Unlocking the Value
metadata strategy and linked data
at the British Library
Collection Metadata Standards Manager
CIGS Metadata and Linked Data Seminar, Edinburgh 12/9/2016
The National Library of the United Kingdom
Stockton on Tees
British Library Metadata Services
The British Library Act records
our role as “national centre for…
bibliographical & other information services”
BL Metadata Services:
• Originally offered priced services &
evolved through many technologies
• Began to offer open data in 2010 &
Linked Open Data in 2011
• Collection Metadata Strategy published
Legacy Metadata Challenges
% Records containing Language of Content
Foundation catalogues Integrated Catalogue Annual Production
Collection Metadata - Uses
Resource description & availability for
discovery & access
Licensed content rights
Legal deposit & purchased claims
Representation of BL holdings in shared
Re-use in 3rd party commercial services
Derived cataloguing by other libraries
Identification of candidate material for
Responding to FOI requests
Content identification for collaborative
Confirmation of UK publication status
Identification of last resort copy for
collection disposal initiatives
Open data contribution
Collection Metadata = Strategic Asset
• Improved services
• More value for money
• Strategy published externally
• Established & communicated
best practice via new CM Wiki
• Centralised support mailbox
• Horizon scanning function
• Workshop on 2016/17 plans
“Our vision is that by 2020
the Library’s collection
metadata assets will be
sustainable, enabling their
full value to be unlocked for
collaboration and wider use
of the collection.”
Drive efficiencies in the creation, management and
exploitation of collection metadata to support delivery of
the Library’s strategic priorities and programmes
Improve the Library’s return on investment in its
collection metadata assets by ensuring their long term
value is maintained for future activities
Open up more of the Library’s collection metadata to
improve access to Library content and promote wider re-use
Collection Metadata Strategy
2015-18 Implementation Roadmap
Preservation, Maintenance & Enhancement Open Metadata Discovery & Delivery
2015 2016 2017 2018 2019 2020
ISNI integration into key
Collection metadata available via
global cross domain delivery channels
Establish horizon scanning
function for metadata
Undertake Options appraisal for linked
metadata platform replacement
Development of new
Redevelopment of AMED
Undertake investigation of
DRM metadata solutions
Convergence on agreed metadata standards portfolio
for derived collection
Large scale batch
IAMS data release
Create metadata based
collection analysis tools
Undertake review of national library
metadata systems options
Non standard systems
Assessment and prioritisation
of metadata systems
All collection metadata held in
centralised master metadata
infrastructure for metadata and
Implementation of comprehensive
DRM metadata solution
Promotion of agreed
Preparatory work for ILS
Improved user access
PSI directives for
Complex rights & license metadata
New staff intranet
and wiki resources on
Optimised metadata management
Efficient international standards
Undertake review of new
metadata creation &
Collaborate with international partners on core metadata standards
BNB Linked data service
Printed music metadata
New open researcher format
Fully representative range
of metadata sets available
Hidden collection metadata
assets exposed for re-use
Undertake audits of
metadata assets & licenses
options for sound
best practice resource
available for staff
New linked data platform
JISC national shared
metadata service platform
External user support
Preparatory work for
new external sources
Linked data exploitation
Undertake review of metadata
standards & develop engagement plan
Investigate BL on
metadata service offering
Collection Metadata Strategy
The British National Bibliography
3.7m entries for UK books &
journals on all subjects in all
languages 1950- to date
Reusable publication dataset - not a
unique institutional catalogue
Permissive License – CC0
Includes: People, Places, Dates, Subjects
Consistent - over 60 years
Linked Open Data
Linked Data Analytics Project
• Who is using our data?
• Which data?
• How to optimise publication?
Visitors - Government
Visitors - Academia
Visitors - Libraries
745K ISNIs added to Linked Data BNB
Increasing Access, Reuse & Relevance
177 countries use services
New open ‘researcher format’ option
Created open metadata sets
for printed music, manuscripts &
Events – CILIP,
Cabinet Office Data Science…
Breaking down silos
Lots to be done
Linked data potential solution not an objective
Influencing and delivering corporate strategic objectives
Increasing Demand for our help!
Unlocking the value: the British Library’s Collection Metadata Strategy,
Collection Metadata Strategy Roadmap, 2015-2018
Living knowledge: the British Library 2015-2023
Free data services
Downloads: linked data; open data; researcher format