The British Library's Digital Research Team supports new ways of exploring and accessing the Library's digital collections through computational methods like machine learning, data visualization, and text mining. The Team collaborates on projects that make more content available digitally, and provides training and guidance to researchers. Examples of projects include crowdsourcing accent maps, analyzing patterns in music history from bibliographic data, and transcribing and georeferencing placenames from historical texts.
1. Digital Scholarship, Digital Research:
widening access to BL collections and
services
Dr Mia Ridge, @mia_out
Digital Curator, British Library
BL Labs Roadshow, Cambridge, February 2016
2. Defining Digital Research
Using computational methods either
to answer existing research questions
or to challenge existing theoretical
paradigms….
Machine learning
Data Visualisation
Data Mining
Georeferencing
Digital Mapping
Crowdsourcing
Text mining
Collaboration
4. The Digital Research Team is a cross-
disciplinary mix of curators, researchers,
librarians and programmers supporting the
creation and innovative use of British
Library's digital collections.
Meet the Team
5. How we help researchers
Support new ways of exploring and accessing
our collections with those at the intersection of
academic research / cultural heritage /
technology through:
• Getting content in digital form and online
• Collaborative projects
• Offering digital research support and guidance
• Events, competitions, and awards (BL Labs)
6. Main Activities
• Staff training
• Promoting Digital Scholarship within BL
• Curating digital research data
• Project management
• Engagement with users
• Creating and sharing online content with other
libraries and research centres
• Communication: events, blogging, social media
7. Social Media; Working collaboratively:
Using the BL Wiki; Presentation skills;
Foundations in working with Digital
Objects: From Images to A/V; Behind
the Screen: Basics of the Web;
Metadata for Electronic Resources:
Dublin Core, METS, MODS, RDF, XML;
Digital Collections at British Library;
Communicating our collections online:
Access & Reuse Policy; Crowdsourcing
in Libraries, Museums and Cultural
Heritage Institutions; Text Encoding
Initiative (TEI); Data Visualisation for
Analysis in Scholarly Research; Geo-
referencing and Digital Mapping;
Information Integration: Mash-ups,
APIs and The Semantic Web
Digital Scholarship Training Programme
8. Digital Conversations
Public evening talks on specific themes around ideas, tools and projects
around Digital Scholarship. Contributors have included entrepreneurs,
technologists, librarians, academics and analysts.
Topics include:
• Search and Discovery
• Sharing and Annotation
• Profiling and Privacy
• Open for Re-use
• Future of Text
• Digital Narratives
• Using the Cloud
• Games and literature
• Computational creativity
Videos are shared afterwards: http://bit.ly/XFJrcI
12. Big Data History of Music
How can vast amounts of bibliographic data held by research libraries be
unlocked for music researchers to analyse?
Can this data be interrogated in ways that challenge the traditional
narratives of music history?
Analyses and visualisations
exposed previously
uncharted patterns in the
history of music, for
instance the rise and fall of
music printing in 16th- and
17th-century Europe (huge
dips in output in Venice
were down to plague and
war).
13. Pelagios: Enabling Linked Ancient Geodata
Collaborative project to transcribe & map Classical and Medieval
placenames from digital texts and manuscript images. Allows for
visualisation of place and space in historical documents.
BL contributed >350 digitised
images of medieval materials,
plus one digital paleographer!
16. Getting in touch
Web: http://bl.uk/digital
Blog: http://britishlibrary.typepad.co.uk/digital-scholarship/
Email: digitalresearch@bl.uk
Twitter: #BLdigital
BL Labs Roadshow, Cambridge, February 2016
Dr Mia Ridge @mia_out Digital Curator, British Library
Notas del editor
20 mins on 'doing digital research at the British Library'. Supporting creation and innovative uses of BL digital collections.
https://www.youtube.com/watch?v=tp4y-_VoXdA
The Digital Scholarship Training Programme was a two-year internal training initiative dreamed up by the Digital Curator team that launched in November 2012.
Now three years in and have created nineteen bespoke one-day courses for staff covering the basics of Digital Scholarship which we deliver on a rolling basis.
Deliver 88 courses to nearly 400 staff members so far!
Convert-a-Card is an experiment with a new method for transforming printed card catalogues into electronic records for inclusion in our online catalogue Explore. Catalogues pre-date computers. The library's online catalogue, Explore, provides public access to nearly 57 million records, but c750,000 printed books held in Asian & African Collections are listed only in hard-copy catalogues. Records matched, located, transcribed or translated as part of Convert-a-Card will be uploaded to the British Library's Explore catalogue for anyone to search online. By participating you will have a direct impact on the availability of research material to anyone interested in the diverse collections available at the British Library.
Began as Chinese Books Card Catalogue: project supported by the Chinese embassy aiming to identify OCLC records that match 28,000 printed card catalogues of Chinese publications held at the BL through crowdsourcing. Matching records improves the catalogue: http://www.libcrowds.com/
Research Question:
Brought together for the first time the world's biggest datasets about published sheet music, music manuscripts and classical concerts (in excess of 5 million records) for statistical analysis, manipulation and visualisation. Aim was to unlock musical-bibliographical data held by libraries in order to create new research opportunities. The project cleaned and enhanced aspects of the British Library catalogues of printed and manuscript music, which are now available as open data from www.bl.uk/bibliographic/download.html and piloted big data research techniques on these and five other datasets.
Source Collections:
Data from seven existing databases and catalogues were used as the basis of this project: the British Library's catalogues of printed and manuscript music; the bibliographies created by Répertoire International des Sources Musicales (RISM) that list European music printed 1500-1800 and music manuscripts in European libraries; and the RISM UK Music Manuscripts Database and the Concert Programmes Project database.
Digital/Computational Techniques:
Data wrangling using Open Refine and MARCedit. Data visualisation using: Google Fusion Tables and PalladioProject slides: http://www.slideshare.net/historyspot/ihr-big-data-history-of-music-9-june15
Outcome: Analyses and visualisations of these datasets exposed previously uncharted patterns in the history of music, for instance involving the rise and fall of music printing in 16th- and 17th-century Europe (huge dips in output in Venice were down to plague and war!), or the rise of nationalist colourings in music of the late 18th and early 19th centuries. The detection of these long-term trends permits new ways of linking music history to wider histories of culture, economics, society and politics