SlideShare una empresa de Scribd logo
1 de 38
Welcome and introduction to
             British Library digital resources
             for social scientists

             John Kaye – Lead Curator Digital Social Science

             Peter Webster - Web Archiving Engagement and Liaison
             Manager

             7th December 2012




www.slideshare.net/johnkayebl
What kind of library are we?


 “We exist for everyone who wants to
  do research – for academic, personal
  or commercial purposes”

 Our collections cover all known
  subject areas; sciences, technology,
  medicine, arts & humanities, social
  sciences…

 We have a copy of every item
  published in the UK

 Our collections cover all formats;
  sound, images, video, newspapers,
  maps, manuscripts, databases,
  books and journals, much more…


                                         2
News, newspapers and magazines




                                 3
News and current events




                Broadcast news, recording from May 2010
                Political change in Middle East
                Olympic Games
                Occupy movement
                                                          4
Images and photographs




 Images online
 Online gallery
 Photographically illustrated books


                                       5
Online Services




                  6
Social Science online resources for researchers




 ESRC online resource

 Management and Business Studies Portal

 Social Welfare Portal

 www.bl.uk/oralhistory

 Social Science blog




                                                  7
8
Management and Business Studies portal




                                         9
10
http://britishlibrary.typepad.co.uk/socialscience/




                                                     11
Oral history at a glance
www.bl.uk/oralhistory


   370 collections from 1 tape to 5,500 (Millennium Memory Bank)

   100-150 hours of new digital fieldwork recordings per month

   2200 catalogue records added or updated per year

   4000 public enquiries per year

   40 talks and lectures per year

   60 training sessions per year with OHS (500+ people)

                                                                    12
Guides and support

 Reference services: reading room, telephone, email

 Help for Researchers web pages

 Collection guides, eg for government publications:
  http://www.bl.uk/reshelp/findhelprestype/offpubs/guides/govtgu

 Topical bibliographies, eg Globalisation and
  employment, Gang culture and knife crime, Corporate
  Social Responsibility, Far Right in Britain …

 Welfare Reform on the Web


                                                           13
Exhibitions and events
www.bl.uk/whatson




                         14
Doctoral Open Days 2013




11 February – Social Sciences

18 February – Media, Cultural Studies and Journalism

http://www.bl.uk/whatson/events/docopendays/index.html




                                                         15
Web archives and digital method




  Dr Peter Webster
  Web Archiving Engagement and Liaison Officer

  @UKWebArchive / @pj_webster

  Peter.Webster@bl.uk

  http://www.webarchive.org.uk

  December 7th 2012
The lost web: people




[votedavidcameron.com, (archived 24/5/05)]
                                             17
The lost web: people




[robincook.org.uk (archived 8/8/05)]

                                       18
The lost web: organisations




[tvpa.police.uk (archived 21/11/12)]   19
The lost web: organisations




[woolworthsgroupplc.com (archived 12/12/08)]   20
Our mission:




      Collect, preserve, and
         make accessible
           web sites of
      cultural and scholarly
           importance
       from the UK domain
UK Web Archive http://www.webarchive.org.uk


 Selective Web Archive
       over 11,000 websites collected since
        2004
       over 50,000 instances
       Over 16TB of compressed data

 British Library, National Library
of Wales, JISC
     Also National Library of Scotland,
      the National Archives, Wellcome
      Library

 Many collaborators
     eg Women’s Library, Live Arts
      Development Agency, Quakers in
      Britain

                                                  22
A typical event-based special collection




       Collect, preserve, and
          make accessible
             eb sites of
       cultural and scholarly
            importance
        from the UK domain
The orphaned web




                   24
A comprehensive special collection




      Collect, preserve, and
         make accessible
            eb sites of
      cultural and scholarly
           importance
       from the UK domain
Web archiving: the basics

   What
          Selecting, capturing, storing, preserving and managing access to snapshots of websites over time
   How
          Use crawler software to download websites automatically
          Selective or domain archiving
          Provide access in a Web Archive
When
          Since mid 1990s
Who
          Heritage and memory organisations, eg BL, The National Archives
          University libraries
          Not-for-profit and commercial organisations, eg Internet Archive
          Individual researchers
   Why
          Global information resource
          Artefact of cultural and technology change
          Representative sample of the web: historical and sociological data that may not be found
           elsewhere
          Part of national digital heritage - legal requirements




                                                                                                              26
Selective versus domain archiving

   Two complementary approaches: selective and domain archiving
                            Width



    D
    e                                           Domain harvesting:
    p
                                                - Typically once/twice a year
    t                                           - Domain wide snapshot

    h                                           - Supported by national legislative
                                                framework
           Selective archiving:                 -- automated & cost-effective
           - More frequent gathers; manual QA
           - Guided by collection policy
           - Can be based on events or themes
           e.g. credit crunch
           -- manual & expensive
                                                                                      27
Non-print Legal Deposit 2013: what will we collect ?




A deposit library is entitled to copy UK publications from the
   open web.

A deposit library is entitled to collect other password-protected
   material by harvesting, subject to giving at least 1 month’s
   written notice for the publisher to provide a password or
   access credentials.




                                                                    28
What will we be collecting ?




Includes resources:

• that are issued from a .uk or other UK geographic top-level
  domain, or

• where part of the publishing process takes place in the UK;

• but excluding any which are only accessible to audiences
  outside the UK.




                                                                29
What will we NOT be collecting ?




Film and recorded sound where the audio-visual content
   predominates

Private intranets and emails

Personal data in social networking sites or that are only
  available to restricted groups.




                                                            30
What will users be able to do with it ?




Users may:

• access deposited material while on “library premises
  controlled by a deposit library”.

• print one copy of a restricted amount of any deposited
  material, for non-commercial research or other defined ‘fair
  dealing’ purposes such as court proceedings, statutory
  enquiry, criticism and review or journalism.




                                                                 31
What will users NOT be able to do with it ?




Users may NOT:

• use an item simultaneously with another user;

• make any digital copies, except by specific and explicit
  licence of the publisher.




                                                             32
A web archiving strategy based on prioritisation


                  Domain Crawl


    Event             Event                Event
  Domain             Events:            Special
  harvesting:        •Political,        Collection:
  •Broad sweep       cultural, social   •Focused,
  of .uk domain      and economic       thematic
  •Survey and        events of          collections
  discovery          national           •Support
  •Implement         interest, eg       priority
  Legal Deposit      Olympics           subjects
                     2012




                                                      33
JISC UK Web Domain Dataset (1996-2010)


Funded by JISC to create a research collection of UK
websites
 Collaboration between the Internet Archive, JISC and the
British Library
 Copy of subset of the Internet Archive’s web collection that
relates to the UK
   470466 files, mostly arc.gz, with 4494 warc.gz.
   Total size: 32TB

   No local access – possible through the Internet Archive
 Can be used to generate secondary datasets and make
these available
   Analytical access the main route
                                                                 34
Historical Archive – HTML Version Analysis
N-Gram Search: Prime Ministers
N-Gram Search: Social Media
Questions ?




John.Kaye@bl.uk

Twitter: @johnkayeBL



Peter.Webster@bl.uk

Twitter: @UKWebArchive / @pj_webster

UK Web Archive: http://www.webarchive.org.uk



                                               38

Más contenido relacionado

La actualidad más candente

Preserving our past together: reflections on the Easter Rising 1916 Web Archi...
Preserving our past together: reflections on the Easter Rising 1916 Web Archi...Preserving our past together: reflections on the Easter Rising 1916 Web Archi...
Preserving our past together: reflections on the Easter Rising 1916 Web Archi...CONUL Conference
 
Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...Torsten Reimer
 
Libraries during and after Covid-2019
Libraries during and after Covid-2019Libraries during and after Covid-2019
Libraries during and after Covid-2019Dr Trivedi
 
Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...Michael Day
 
John Scally: The National Library of Scotland: A future vision for all
John Scally: The National Library of Scotland: A future vision for allJohn Scally: The National Library of Scotland: A future vision for all
John Scally: The National Library of Scotland: A future vision for allCILIPScotland
 
BL Labs CityLIS Talk
BL Labs CityLIS TalkBL Labs CityLIS Talk
BL Labs CityLIS Talklabsbl
 
Lockss usdocs-dl cfall10
Lockss usdocs-dl cfall10Lockss usdocs-dl cfall10
Lockss usdocs-dl cfall10James Jacobs
 
Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...Mia
 
Digitisation in the UK and the JISC Content programme
Digitisation in the UK and the JISC Content programmeDigitisation in the UK and the JISC Content programme
Digitisation in the UK and the JISC Content programmePaolaMarchionni
 
(W) introduction to copyright (nov 2016) (1)
(W) introduction to copyright (nov 2016) (1)(W) introduction to copyright (nov 2016) (1)
(W) introduction to copyright (nov 2016) (1)JeremyOHare1
 
Cb publiclibrariesinthe21stcentury
Cb publiclibrariesinthe21stcenturyCb publiclibrariesinthe21stcentury
Cb publiclibrariesinthe21stcenturyJoyzel De Leon
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
 
SCA Scotland Forum 210508 Paul Ell
SCA Scotland Forum 210508 Paul EllSCA Scotland Forum 210508 Paul Ell
SCA Scotland Forum 210508 Paul Ellmichellep
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
 

La actualidad más candente (20)

Preserving our past together: reflections on the Easter Rising 1916 Web Archi...
Preserving our past together: reflections on the Easter Rising 1916 Web Archi...Preserving our past together: reflections on the Easter Rising 1916 Web Archi...
Preserving our past together: reflections on the Easter Rising 1916 Web Archi...
 
Jisc MediaHub webinar
Jisc MediaHub webinarJisc MediaHub webinar
Jisc MediaHub webinar
 
Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...
 
Digital Scholarship at the British Library
Digital Scholarship at the British LibraryDigital Scholarship at the British Library
Digital Scholarship at the British Library
 
Libraries during and after Covid-2019
Libraries during and after Covid-2019Libraries during and after Covid-2019
Libraries during and after Covid-2019
 
Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...
 
John Scally: The National Library of Scotland: A future vision for all
John Scally: The National Library of Scotland: A future vision for allJohn Scally: The National Library of Scotland: A future vision for all
John Scally: The National Library of Scotland: A future vision for all
 
BL Labs CityLIS Talk
BL Labs CityLIS TalkBL Labs CityLIS Talk
BL Labs CityLIS Talk
 
The role public libraries play in supporting digital literacy
The role public libraries play in supporting digital literacyThe role public libraries play in supporting digital literacy
The role public libraries play in supporting digital literacy
 
Lockss usdocs-dl cfall10
Lockss usdocs-dl cfall10Lockss usdocs-dl cfall10
Lockss usdocs-dl cfall10
 
Ukla uksg 2013_final
Ukla uksg 2013_finalUkla uksg 2013_final
Ukla uksg 2013_final
 
Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...Enabling digital scholarship through staff training: the British Library's ex...
Enabling digital scholarship through staff training: the British Library's ex...
 
Researching Archives and Documents
Researching Archives and DocumentsResearching Archives and Documents
Researching Archives and Documents
 
Digitisation in the UK and the JISC Content programme
Digitisation in the UK and the JISC Content programmeDigitisation in the UK and the JISC Content programme
Digitisation in the UK and the JISC Content programme
 
(W) introduction to copyright (nov 2016) (1)
(W) introduction to copyright (nov 2016) (1)(W) introduction to copyright (nov 2016) (1)
(W) introduction to copyright (nov 2016) (1)
 
Cb publiclibrariesinthe21stcentury
Cb publiclibrariesinthe21stcenturyCb publiclibrariesinthe21stcentury
Cb publiclibrariesinthe21stcentury
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...
 
Jisc MediaHub 2014/2015 Update
Jisc MediaHub 2014/2015 UpdateJisc MediaHub 2014/2015 Update
Jisc MediaHub 2014/2015 Update
 
SCA Scotland Forum 210508 Paul Ell
SCA Scotland Forum 210508 Paul EllSCA Scotland Forum 210508 Paul Ell
SCA Scotland Forum 210508 Paul Ell
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...
 

Destacado

ODIN Project Presentation to CLOSER Leadership Team
ODIN Project Presentation to CLOSER Leadership TeamODIN Project Presentation to CLOSER Leadership Team
ODIN Project Presentation to CLOSER Leadership Teamjohnkayebl
 
What is DataCite-screenshots
What is DataCite-screenshotsWhat is DataCite-screenshots
What is DataCite-screenshotsdatacite
 
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...University of California Curation Center
 
Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...Robin Rice
 
Summary of data citation synthesis activity & Review
Summary of data citation synthesis activity & ReviewSummary of data citation synthesis activity & Review
Summary of data citation synthesis activity & ReviewMicah Altman
 
IEDA Data Publication Workshop @AGU
IEDA Data Publication Workshop @AGUIEDA Data Publication Workshop @AGU
IEDA Data Publication Workshop @AGUKerstin Lehnert
 
BL Doctoral Open Days Feb 2012 - Social Science Data and Digital Resources
BL Doctoral Open Days Feb 2012 - Social Science Data and Digital ResourcesBL Doctoral Open Days Feb 2012 - Social Science Data and Digital Resources
BL Doctoral Open Days Feb 2012 - Social Science Data and Digital Resourcesjohnkayebl
 
Mobile access to educational resources in humanities and social sciences
Mobile access to educational resources in humanities and social sciencesMobile access to educational resources in humanities and social sciences
Mobile access to educational resources in humanities and social sciencesIreland & UK Moodlemoot 2012
 
NCompass Live: Digital Resources of the National Library of Medicine
NCompass Live: Digital Resources of the National Library of MedicineNCompass Live: Digital Resources of the National Library of Medicine
NCompass Live: Digital Resources of the National Library of MedicineNebraska Library Commission
 

Destacado (9)

ODIN Project Presentation to CLOSER Leadership Team
ODIN Project Presentation to CLOSER Leadership TeamODIN Project Presentation to CLOSER Leadership Team
ODIN Project Presentation to CLOSER Leadership Team
 
What is DataCite-screenshots
What is DataCite-screenshotsWhat is DataCite-screenshots
What is DataCite-screenshots
 
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
 
Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...
 
Summary of data citation synthesis activity & Review
Summary of data citation synthesis activity & ReviewSummary of data citation synthesis activity & Review
Summary of data citation synthesis activity & Review
 
IEDA Data Publication Workshop @AGU
IEDA Data Publication Workshop @AGUIEDA Data Publication Workshop @AGU
IEDA Data Publication Workshop @AGU
 
BL Doctoral Open Days Feb 2012 - Social Science Data and Digital Resources
BL Doctoral Open Days Feb 2012 - Social Science Data and Digital ResourcesBL Doctoral Open Days Feb 2012 - Social Science Data and Digital Resources
BL Doctoral Open Days Feb 2012 - Social Science Data and Digital Resources
 
Mobile access to educational resources in humanities and social sciences
Mobile access to educational resources in humanities and social sciencesMobile access to educational resources in humanities and social sciences
Mobile access to educational resources in humanities and social sciences
 
NCompass Live: Digital Resources of the National Library of Medicine
NCompass Live: Digital Resources of the National Library of MedicineNCompass Live: Digital Resources of the National Library of Medicine
NCompass Live: Digital Resources of the National Library of Medicine
 

Similar a Introduction to British Library digital resources for social scientists

Digital Cultural Heritage: Experiences from British Library
Digital Cultural Heritage: Experiences from British LibraryDigital Cultural Heritage: Experiences from British Library
Digital Cultural Heritage: Experiences from British LibraryNora McGregor
 
Web@rchive Austria (Archiving Online Media)
Web@rchive Austria (Archiving Online Media)Web@rchive Austria (Archiving Online Media)
Web@rchive Austria (Archiving Online Media)Web@rchive Austria
 
Digital Activities at the British Library (11-12-08)
Digital Activities at the British Library  (11-12-08)Digital Activities at the British Library  (11-12-08)
Digital Activities at the British Library (11-12-08)Richard Davies
 
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...Neil Beagrie
 
WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...
WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...
WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...wnradmin
 
Stewardship of the Digital Scholarly Record and Digital Published Heritage
Stewardship of the Digital Scholarly Record and Digital Published HeritageStewardship of the Digital Scholarly Record and Digital Published Heritage
Stewardship of the Digital Scholarly Record and Digital Published HeritageNASIG
 
20221018_Panel_Covid_WARCnet_closing_conference.pdf
20221018_Panel_Covid_WARCnet_closing_conference.pdf20221018_Panel_Covid_WARCnet_closing_conference.pdf
20221018_Panel_Covid_WARCnet_closing_conference.pdfWARCnet
 
NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three dri_ireland
 
Valerie Johnson: Supporting the Archives Sector via Collaboration
Valerie Johnson: Supporting the Archives Sector via CollaborationValerie Johnson: Supporting the Archives Sector via Collaboration
Valerie Johnson: Supporting the Archives Sector via CollaborationNetwerk Digitaal Erfgoed
 
Digitised Content: What universities can learn from publishers and what publi...
Digitised Content: What universities can learn from publishers and what publi...Digitised Content: What universities can learn from publishers and what publi...
Digitised Content: What universities can learn from publishers and what publi...Alastair Dunning
 
Doctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BLDoctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BLAquiles Alencar Brayner
 
Investigating the PROMISE of a Belgian web archive
Investigating the PROMISE of a Belgian web archive Investigating the PROMISE of a Belgian web archive
Investigating the PROMISE of a Belgian web archive Sally Chambers
 

Similar a Introduction to British Library digital resources for social scientists (20)

Digital Cultural Heritage: Experiences from British Library
Digital Cultural Heritage: Experiences from British LibraryDigital Cultural Heritage: Experiences from British Library
Digital Cultural Heritage: Experiences from British Library
 
Digital Cultural Heritage: Experiences from British Library
Digital Cultural Heritage: Experiences from British LibraryDigital Cultural Heritage: Experiences from British Library
Digital Cultural Heritage: Experiences from British Library
 
Aquiles imlr seminar
Aquiles imlr seminarAquiles imlr seminar
Aquiles imlr seminar
 
BL Digital Scholarship
BL Digital Scholarship BL Digital Scholarship
BL Digital Scholarship
 
Web@rchive Austria (Archiving Online Media)
Web@rchive Austria (Archiving Online Media)Web@rchive Austria (Archiving Online Media)
Web@rchive Austria (Archiving Online Media)
 
Webarchiv - Curatorial approaches, topic collections and cooperation with the...
Webarchiv - Curatorial approaches, topic collections and cooperation with the...Webarchiv - Curatorial approaches, topic collections and cooperation with the...
Webarchiv - Curatorial approaches, topic collections and cooperation with the...
 
Digital Activities at the British Library (11-12-08)
Digital Activities at the British Library  (11-12-08)Digital Activities at the British Library  (11-12-08)
Digital Activities at the British Library (11-12-08)
 
OpenGLAM presentation at EOD conference, 11 April 2014
OpenGLAM presentation at EOD conference, 11 April 2014OpenGLAM presentation at EOD conference, 11 April 2014
OpenGLAM presentation at EOD conference, 11 April 2014
 
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
20yrs: 2007 Brussels Digital Preservation: Setting the Course for a Decade of...
 
Cpd25_Aquiles Alencar Brayner
Cpd25_Aquiles Alencar BraynerCpd25_Aquiles Alencar Brayner
Cpd25_Aquiles Alencar Brayner
 
BL_English doctoral_open_day_session
BL_English doctoral_open_day_sessionBL_English doctoral_open_day_session
BL_English doctoral_open_day_session
 
WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...
WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...
WNR.sg - Keynote Address by Mr John van Oudenaren, Director, World Digital Li...
 
Stewardship of the Digital Scholarly Record and Digital Published Heritage
Stewardship of the Digital Scholarly Record and Digital Published HeritageStewardship of the Digital Scholarly Record and Digital Published Heritage
Stewardship of the Digital Scholarly Record and Digital Published Heritage
 
Dh2016 dstp
Dh2016 dstpDh2016 dstp
Dh2016 dstp
 
20221018_Panel_Covid_WARCnet_closing_conference.pdf
20221018_Panel_Covid_WARCnet_closing_conference.pdf20221018_Panel_Covid_WARCnet_closing_conference.pdf
20221018_Panel_Covid_WARCnet_closing_conference.pdf
 
NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three NORFest 2023 Lightning Talks Session Three
NORFest 2023 Lightning Talks Session Three
 
Valerie Johnson: Supporting the Archives Sector via Collaboration
Valerie Johnson: Supporting the Archives Sector via CollaborationValerie Johnson: Supporting the Archives Sector via Collaboration
Valerie Johnson: Supporting the Archives Sector via Collaboration
 
Digitised Content: What universities can learn from publishers and what publi...
Digitised Content: What universities can learn from publishers and what publi...Digitised Content: What universities can learn from publishers and what publi...
Digitised Content: What universities can learn from publishers and what publi...
 
Doctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BLDoctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BL
 
Investigating the PROMISE of a Belgian web archive
Investigating the PROMISE of a Belgian web archive Investigating the PROMISE of a Belgian web archive
Investigating the PROMISE of a Belgian web archive
 

Introduction to British Library digital resources for social scientists

  • 1. Welcome and introduction to British Library digital resources for social scientists John Kaye – Lead Curator Digital Social Science Peter Webster - Web Archiving Engagement and Liaison Manager 7th December 2012 www.slideshare.net/johnkayebl
  • 2. What kind of library are we?  “We exist for everyone who wants to do research – for academic, personal or commercial purposes”  Our collections cover all known subject areas; sciences, technology, medicine, arts & humanities, social sciences…  We have a copy of every item published in the UK  Our collections cover all formats; sound, images, video, newspapers, maps, manuscripts, databases, books and journals, much more… 2
  • 3. News, newspapers and magazines 3
  • 4. News and current events Broadcast news, recording from May 2010 Political change in Middle East Olympic Games Occupy movement 4
  • 5. Images and photographs  Images online  Online gallery  Photographically illustrated books 5
  • 7. Social Science online resources for researchers  ESRC online resource  Management and Business Studies Portal  Social Welfare Portal  www.bl.uk/oralhistory  Social Science blog 7
  • 8. 8
  • 9. Management and Business Studies portal 9
  • 10. 10
  • 12. Oral history at a glance www.bl.uk/oralhistory  370 collections from 1 tape to 5,500 (Millennium Memory Bank)  100-150 hours of new digital fieldwork recordings per month  2200 catalogue records added or updated per year  4000 public enquiries per year  40 talks and lectures per year  60 training sessions per year with OHS (500+ people) 12
  • 13. Guides and support  Reference services: reading room, telephone, email  Help for Researchers web pages  Collection guides, eg for government publications: http://www.bl.uk/reshelp/findhelprestype/offpubs/guides/govtgu  Topical bibliographies, eg Globalisation and employment, Gang culture and knife crime, Corporate Social Responsibility, Far Right in Britain …  Welfare Reform on the Web 13
  • 15. Doctoral Open Days 2013 11 February – Social Sciences 18 February – Media, Cultural Studies and Journalism http://www.bl.uk/whatson/events/docopendays/index.html 15
  • 16. Web archives and digital method Dr Peter Webster Web Archiving Engagement and Liaison Officer @UKWebArchive / @pj_webster Peter.Webster@bl.uk http://www.webarchive.org.uk December 7th 2012
  • 17. The lost web: people [votedavidcameron.com, (archived 24/5/05)] 17
  • 18. The lost web: people [robincook.org.uk (archived 8/8/05)] 18
  • 19. The lost web: organisations [tvpa.police.uk (archived 21/11/12)] 19
  • 20. The lost web: organisations [woolworthsgroupplc.com (archived 12/12/08)] 20
  • 21. Our mission: Collect, preserve, and make accessible web sites of cultural and scholarly importance from the UK domain
  • 22. UK Web Archive http://www.webarchive.org.uk  Selective Web Archive  over 11,000 websites collected since 2004  over 50,000 instances  Over 16TB of compressed data  British Library, National Library of Wales, JISC  Also National Library of Scotland, the National Archives, Wellcome Library  Many collaborators  eg Women’s Library, Live Arts Development Agency, Quakers in Britain 22
  • 23. A typical event-based special collection Collect, preserve, and make accessible eb sites of cultural and scholarly importance from the UK domain
  • 25. A comprehensive special collection Collect, preserve, and make accessible eb sites of cultural and scholarly importance from the UK domain
  • 26. Web archiving: the basics  What  Selecting, capturing, storing, preserving and managing access to snapshots of websites over time  How  Use crawler software to download websites automatically  Selective or domain archiving  Provide access in a Web Archive When  Since mid 1990s Who  Heritage and memory organisations, eg BL, The National Archives  University libraries  Not-for-profit and commercial organisations, eg Internet Archive  Individual researchers  Why  Global information resource  Artefact of cultural and technology change  Representative sample of the web: historical and sociological data that may not be found elsewhere  Part of national digital heritage - legal requirements 26
  • 27. Selective versus domain archiving  Two complementary approaches: selective and domain archiving Width D e Domain harvesting: p - Typically once/twice a year t - Domain wide snapshot h - Supported by national legislative framework Selective archiving: -- automated & cost-effective - More frequent gathers; manual QA - Guided by collection policy - Can be based on events or themes e.g. credit crunch -- manual & expensive 27
  • 28. Non-print Legal Deposit 2013: what will we collect ? A deposit library is entitled to copy UK publications from the open web. A deposit library is entitled to collect other password-protected material by harvesting, subject to giving at least 1 month’s written notice for the publisher to provide a password or access credentials. 28
  • 29. What will we be collecting ? Includes resources: • that are issued from a .uk or other UK geographic top-level domain, or • where part of the publishing process takes place in the UK; • but excluding any which are only accessible to audiences outside the UK. 29
  • 30. What will we NOT be collecting ? Film and recorded sound where the audio-visual content predominates Private intranets and emails Personal data in social networking sites or that are only available to restricted groups. 30
  • 31. What will users be able to do with it ? Users may: • access deposited material while on “library premises controlled by a deposit library”. • print one copy of a restricted amount of any deposited material, for non-commercial research or other defined ‘fair dealing’ purposes such as court proceedings, statutory enquiry, criticism and review or journalism. 31
  • 32. What will users NOT be able to do with it ? Users may NOT: • use an item simultaneously with another user; • make any digital copies, except by specific and explicit licence of the publisher. 32
  • 33. A web archiving strategy based on prioritisation Domain Crawl Event Event Event Domain Events: Special harvesting: •Political, Collection: •Broad sweep cultural, social •Focused, of .uk domain and economic thematic •Survey and events of collections discovery national •Support •Implement interest, eg priority Legal Deposit Olympics subjects 2012 33
  • 34. JISC UK Web Domain Dataset (1996-2010) Funded by JISC to create a research collection of UK websites  Collaboration between the Internet Archive, JISC and the British Library  Copy of subset of the Internet Archive’s web collection that relates to the UK  470466 files, mostly arc.gz, with 4494 warc.gz.  Total size: 32TB  No local access – possible through the Internet Archive  Can be used to generate secondary datasets and make these available  Analytical access the main route 34
  • 35. Historical Archive – HTML Version Analysis
  • 36. N-Gram Search: Prime Ministers
  • 38. Questions ? John.Kaye@bl.uk Twitter: @johnkayeBL Peter.Webster@bl.uk Twitter: @UKWebArchive / @pj_webster UK Web Archive: http://www.webarchive.org.uk 38

Notas del editor

  1. Welcome to the British Library, My name is John Kaye, I am Lead Curator for Digital Social Sciences. I ’ m going to take this opportunity to very briefly outline some of the digital resources that our team and others have produced that could be useful to social scientists in your field. I will then hand over to Peter Webster Web Archiving Engagement and Liaison Manager who will go into more of detail about one of our best digital assets, the UK Web Archive. So you can easily find these resources I have uploaded my slides to slideshare and have also placed some leaflets
  2. National library of the UK, legal deposit library (we receive copy of all published material in UK). One of 5 largest research reference libraries in world Two main functions: the role of the BL is to provide access to these collections to whoever has a need to use them. Also to preserve for future generations (including material in printed and electronic format) Who uses the BL? Diverse audiences from students and academics from around the world - Currently over 60% of users of our collections and services are from UK higher education - includes university libraries, academic staff, postgraduate students, etc. Also the general public for exhibitions and events, to visit the building. Also run large engagement programme with schools which includes online resources and workshops. Includes a wide range of formats of material, not just books & journals, but sound, maps, photographs, illustrations, electronic databases. For example our Sound Archive is a national resource of audio history, recordings, music, including a large collection of wildlife sounds to regional dialects.
  3. The American Vogue archive is now available digitally in our British Library Reading Rooms, including the Business & IP Centre. It features every issues of American Vogue from 1892 to the present day, spanning over 400,000 pages. You can find inspiration from style icons from past and present, from Suzy Parker and Jean Shrimpton to Kate Moss.  Explore the history of fashion brands such as Chanel, Elizabeth Arden and Revlon over 120 years. The archive allows you to search across the issues by designer, contributor, type of garment or even fabric. Individual covers, advertisements, photo shoots and fold-outs have been pulled out as separate reports for you to search. Jack the Ripper, Illustrated Police News, October 27 1888. Digitised as part of the digitsation of 19 th century newspapers. Which is available to academic users via institutional login.
  4. Broadcast news service! Broadcast News This service provides access to daily television and radio news and current affairs programmes from seventeen channels (fifteen TV, two radio) broadcast in the UK since May 2010, recorded off-air by the British Library. The programmes will be almost instantly available, with new programmes available in our Reading Rooms within hours of broadcast. We currently record forty-six hours per day, including television services of the BBC, ITV, Channel 4, Sky News, Al-Jazeera English, NHK World, CNN, France 24, Bloomberg, Russia Today and China's CCTV News, plus key news and current affairs programmes from BBC Radio 4 and the BBC World Service. Many of the television programmes come with subtitles, which we have made word-searchable, greatly enhancing Broadcast News as a research resource.
  5. Working with photographs and other visual sources at the British Library is complex but offers great opportunities. Important to note that the examples provided above can be used for other sorts of visual material (maps, illustrations, etc) Working with visual materials will continue to evolve as the Library and researcher ‘go digital’: creating new opportunities and challenges.
  6. For some types of material, there are services that can provide digital copies remotely. UK Theses can now be searched using Ethos, with some 59 institutions offering some or all of their theses. BL Sounds provides digital copies from our sound recordings holdings (by no means all though). Digitsied collections include ethnographic recordings, dialect, oral histories, wildlife sounds and folk songs. The UK Web Archive, mentioned earlier, is also freely accessible from any computer.
  7. A major part of the work we do revolves around finding ways to improve access to our collections and support researchers (both inside academia and beyond – for example in the third sector). We have a new resource on the ESRC website which introduces PhD researchers to our collections. We have two portals which enable access to numerous articles and reports online – both of which are absolutely free. Our Sports and Society website has explored the Olympics and Paralympics through the lens of social science and includes numerous original pieces by academics and other researchers. Our new social science blog is a place to find out more about interesting and unusual collections and the work that we do with our different audiences.
  8. We ’ ve recently produced our new ESRC guide to using the British Library, written specifically with researchers in the social sciences in mind. This is found on the ESRC website – just google ESRC British Library. This provides all the information you need to get started, with a rich collection of case studies from researchers who have used the Library collections, providing inspiration and practical advice. Case studies cover, amongst other topics,: market research; government publications and United Nations documentation; political pamphlets; and cookery and fashion publications.
  9. Other services have been developed with a particular audience in mind. The Management and Business Studies portal is a free service for practitioners and researchers. It brings together information on the Library ’s content, with access to a curated set of research papers, policy documents, briefings and other material. Includes articles on key management thinkers, produced in association with the Chartered Management Institute. You need to register to use some of the features and content on the portal – once registered you can get regular updates on research published and content added.
  10. The MBS portal has proved to be a big success, so we have followed this up with Social Welfare at the British Library – which is our latest service. Like MBS portal, it ’s aimed at practitioners as well as researchers on social policy and social welfare. In addition to access to research and policy documents, you can also find the Welfare Reform Digest, which abstracts news articles, government publications, research reports etc on social policy around the world.
  11. We have recently launched our social sciences team blog focusing on research methods and resources, it has posts from our curators about projects we are working on, but we are also keen to hear what members of the research community are doing so we gladly accept guest posts and contributions, so if any of you would like a place to talk about your work then please get in touch.
  12. I'm now going to go talk more in depth about some of the formats and materials we collect which may be of interest to you. I’m going to try to say something about the materials which aren’t books or journals i.e. the things you would be less likely to find in your local or university library. One such collection area for the British Library is oral histories. We collect and commission oral history recordings on subjects of national interest. Many of these are funded by a charitable trust called National Life Stories which has its home at the BL. Examples include: The Irish Women Travellers (catalogue no: C1106) is a collection of life story interviews with women from the Irish Traveller community. The recordings are part of an oral history research project undertaken by Sue Beck for an MSc in Public Health and Health Promotion at South Bank University, which explores the health of these women across generations and across the life span. The HIV/AIDS Testimonies (catalogue no: C743) is a collection of life story interviews with people with the HIV and AIDS virus. This project has been recorded in two stages. Interviewees from the original set of interviews recorded between 1995-2000 were re-approached and interviewed again between 2005-2008. This project, led by Dr Wendy Rickard, was conducted in conjunction with the University of East London and then London South Bank University. The Socialist Workers' Party Collection (catalogue no: C797) includes recordings made between 1992 and 2000 at the annual 'Marxism' event held in July each year. Speakers include Arthur Scargill, Tony Benn, Terry Eagleton, Tony Cliff, Chris Mullin, John Pilger, Patricia Hewitt, Michael Bogdanov, Christopher Hill, and George Galloway.
  13. On site we also hold two major exhibitions every year. Our new exhibition has just launched: Mughal India: Art, Culture and Empire and our exhibition of On the Road has been on since October and is here until Christmas Our next exhibition will be on Propaganda (May 2013) Myths and Realities events, evening events run by the social sciences team are public events and we have three coming up in the spring around family, work and addiction.
  14. Doctoral Open Days are designed for new postgraduate students (eg in first year) and are a more-detailed introduction to the Library. There is a more-detailed introduction to the Library, with curator-led workshops and talks relating to specific parts of the collections. They are free to attend but booking is essential as they fill up quickly. You can book at what ’ s on section of our website Lastly, if you are not able to attend these days, but would like know more, then reference teams and curatorial staff are very happy to discuss your research with you in more detail.
  15. Footer text here...
  16. Search by URL, title, full-text Browse by
  17. Footer text here...
  18. Footer text here...
  19. Broadly there are 2 complementary main approaches to web archiving. Selective archiving is in general driven by an institution’s collection policy, which focuses on a selected, small portion of the national domain. The gathers tend to be more in-depth and drills into the structure of a website beyond the top level pages. It is a labour intensive archiving procedure involving detailed manual QA. Sites are gathered more frequently as well. In our case, we also ask permissions from the site owners. Web resources deemed appropriate for inclusion were selected and copyright holders of the sites were then contacted and sent requests to grant us a licence to archive and preserve the sites over time and make them available for public access.