SlideShare una empresa de Scribd logo
1 de 40
WE HAVE
INTERESTING
PROBLEMSSOME APPLIED GRAND CHALLENGES FROM
DIGITAL LIBRARIES, ARCHIVES AND
MUSEUMS
TALK ROADMAP
- Context on where I’m coming from
- The New ABCs of Research as Framework
- Examples from IMLS National Digital Platform Projects
- Examples of initiatives from LC Labs
- Some Jumping off Points and Applied Grand Challenges
CONTEXT ON
WHERE I’M
COMING
FROM
https://lj.libraryjournal.com/2017/12/people/qa-with-trevor-
owens-lc-head-of-digital-content-management/#_
The Theory and Craft of Digital Preservation
https://osf.io/preprints/lissa/5cpjt/
THE NEW
ABCS OF
RESEARCH AS
A FRAMEWORK
EXAMPLES FROM
IMLS NATIONAL
DIGITAL
PLATFORM
PROJECTS
NDP@3
REPORT
DETAILS
RESULTS
AND
TRENDS
Extending Intelligent Computational Image Analysis for
Archival Discovery (LG-71-16-0152-16), Board of Regents
of the University of Nebraska, $462,317 The Image
Analysis for Archival Discovery (Aida) research team at the
University of Nebraska-Lincoln will investigate the use of
image analysis as a methodology for content identification,
description, and information retrieval in digital libraries and
other digitized collections. The project will focus on identifying
poetic and advertising content in digitized historic
newspapers. Using a machine learning approach, the project
will result in an intelligent computational system that can
process digital images and identify these specific types of
content. https://www.imls.gov/grants/awarded/LG-71-16-
0152-16
Improving Access to Time-Based Media through
Crowdsourcing and Machine Learning (LG-71-15-0208-
15), WGBH Educational Foundation, $898,474 WGBH, in
partnership with Pop-Up Archive, will address the challenges
faced by many libraries and archives trying to provide better
online access to their media collections. This 30-month
research project will explore and test technological and social
approaches for metadata creation by leveraging scalable
computation and engaging the public to improve access
through crowdsourcing games for time-based media.
https://www.imls.gov/grants/awarded/LG-71-15-0208-15
Systems Interoperability and Collaborative Development
for Web Archiving $353,221 and $98,460 in cost share:
The Internet Archive, with the University of North Texas,
Rutgers University, and Stanford University Library will build
a foundation for collaborative technology development,
improved systems interoperability, and an Application
Programming Interface (API) based model for enhanced
access to, and research use of, web archives. In working with
the Archive-It platform, used by more than 350 partner
institutions, results of this research will be directly applicable
to libraries, archives, and museums around the country and
the world. https://www.imls.gov/grants/awarded/LG-71-15-
0174-15
Transforming Libraries and Archives through
Crowdsourcing (LG-71-16-0028-16), Adler Planetarium,
$1,214,780 This research partnership between Adler
Planetarium’s Library and researchers at Oxford University,
will expand the capacity for libraries and archives across the
country to use crowdsourcing techniques to engage with
audiences and improve access to digital collections. Through
this effort, the team will develop a series of library/archive
Zooniverse projects that explore improvements to full text
and audio transcription and image annotation crowdsourcing
tools and research differences between transcribing in
isolation versus with knowledge of others’ transcription.
Lessons learned from these projects will be incorporated into
the Project Builder. https://www.imls.gov/grants/awarded/LG-
71-16-0028-16
Programmatic Extraction of “Documents” from Web
Archives (LG-71-17-0202-17), University of North Texas,
$318,988 The University of North Texas Libraries and the
Computer Science and Engineering Department will research
the efficacy of using machine-learning algorithms to identify
and extract publications contained in web archives. The
overarching goal of this project is to understand if machine-
learning models can successfully identify content-rich PDF
and Word documents from web archives that align with
library and archives collecting plans.
https://www.imls.gov/grants/awarded/LG-71-17-0202-17
EXAMPLES OF
INITIATIVES
FROM LC LABS
Jer Thorp, Innovator in Residence
• Overview https://labs.loc.gov/experiments/innovator-in-residence-jer-thorp/
• Research materials https://osf.io/b7e6w/
• Code https://github.com/blprnt/loc
• Podcast https://artistinthearchive.podbean.com/
Laura Wrubel’s Library of Congress Colors
• Application https://loc-colors.glitch.me/
• Code https://github.com/lwrubel/loc-colors
• Blog post https://blogs.loc.gov/thesignal/2018/01/from-code-to-colors-working-with-the-
loc-gov-json-api/
Tahir Hemphill, Papamarkou Chair in Education at the John W. Kluge Center
• About Hip Hop Word Count https://www.newyorker.com/magazine/2013/04/01/rap-
sheet-2
• Studio https://www.tahirhemphill.com/
• Past chairs https://www.loc.gov/loc/kluge/fellowships/hpeducation.html
Reports
• Gallinger, M. & Chudnov, D. Recommendations for a Digital Scholarship Lab at the
Library of Congress
• Herron, S. Digital Scholarship Resource Guide.
• Access the reports https://labs.loc.gov/meta/reports/
LC LABS RESOURCES
SOME JUMPING
OFF POINTS AND
APPLIED GRAND
CHALLENGES
SOME ESSENTIAL APPLIED
RESEARCH AREAS
- How can various new technologies be implemented to
scale the ability to acquire, describe, organize and make
available digital collections?
- How can we best integrate various automated methods for
working with digital collections with the work of subject
catalogers/subject matter experts?
- What ways can we best connect and build relationships
with various user communities through crowdsourcing
initiatives?
- What do all of these technologies look like in ongoing
production workflows?
SOME MORE SPECIFIC
EXAMPLES
- Working models for content addressable storage in digital
repository storage architectures
- Reconciling data warehousing approaches with library
approaches to content and metadata management
- Weaving together structured cataloging workflows with
metadata generating mechanisms (crowdsourcing, NLP,
Computer Vision, etc.)
- Virtual machines general purpose policy based restricted
access infrastructure
- Enabling data mining and computational scholarship on
arbitrary restricted access collections
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libraries, Archives and Museums

Más contenido relacionado

La actualidad más candente

Digital Medieval Data Curation
Digital Medieval Data CurationDigital Medieval Data Curation
Digital Medieval Data Curationblalbritton
 
Discovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDDiscovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDMartin Klein
 
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...Mathieu d'Aquin
 
How much does $1.7 billion buy?
How much does $1.7 billion buy?How much does $1.7 billion buy?
How much does $1.7 billion buy?Martin Klein
 
Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsCarole Goble
 
Making social science more reproducible by encapsulating access to linked data
Making social science more reproducible by encapsulating access to linked dataMaking social science more reproducible by encapsulating access to linked data
Making social science more reproducible by encapsulating access to linked dataAlbert Meroño-Peñuela
 
WebART in 10 minutes
WebART in 10 minutesWebART in 10 minutes
WebART in 10 minutesJaap Kamps
 
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)TimelessFuture
 

La actualidad más candente (15)

Digital Medieval Data Curation
Digital Medieval Data CurationDigital Medieval Data Curation
Digital Medieval Data Curation
 
ROHub
ROHubROHub
ROHub
 
Discovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDDiscovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCID
 
Semantic Web in the Digital Humanities
Semantic Web in the Digital HumanitiesSemantic Web in the Digital Humanities
Semantic Web in the Digital Humanities
 
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...
 
How much does $1.7 billion buy?
How much does $1.7 billion buy?How much does $1.7 billion buy?
How much does $1.7 billion buy?
 
Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow Environments
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
 
Making social science more reproducible by encapsulating access to linked data
Making social science more reproducible by encapsulating access to linked dataMaking social science more reproducible by encapsulating access to linked data
Making social science more reproducible by encapsulating access to linked data
 
Analyzing poetry databases to develop a metadata application profile. Why eac...
Analyzing poetry databases to develop a metadata application profile. Why eac...Analyzing poetry databases to develop a metadata application profile. Why eac...
Analyzing poetry databases to develop a metadata application profile. Why eac...
 
WebART in 10 minutes
WebART in 10 minutesWebART in 10 minutes
WebART in 10 minutes
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
Csdh sbg clariah_intr01
Csdh sbg clariah_intr01Csdh sbg clariah_intr01
Csdh sbg clariah_intr01
 
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
WebART: Facilitating Scholarly Use of Web Archives (IIPC, Apr. 2013)
 
Creating Pockets of Persistence
Creating Pockets of PersistenceCreating Pockets of Persistence
Creating Pockets of Persistence
 

Similar a We Have Interesting Problems: Some Applied Grand Challenges from Digital Libraries, Archives and Museums

Digital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesDigital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesAshley Sanders, Ph.D.
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...The European Library
 
Uk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseUk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseRDTF-Discovery
 
Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...
Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...
Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...James Baker
 
Mahendra Mahey, British Library Labs
Mahendra Mahey, British Library LabsMahendra Mahey, British Library Labs
Mahendra Mahey, British Library LabsResearchLibrariesUK
 
EPrints Update, Les Carr, University of Southampton
EPrints  Update, Les Carr, University of SouthamptonEPrints  Update, Les Carr, University of Southampton
EPrints Update, Les Carr, University of SouthamptonRepository Fringe
 
BL Labs at Bloomsbury Digital Humanities Group
BL Labs at Bloomsbury Digital Humanities Group BL Labs at Bloomsbury Digital Humanities Group
BL Labs at Bloomsbury Digital Humanities Group labsbl
 
Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformTrevor Owens
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collectionslljohnston
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC
 
How to read a million books?
How to read a million books?How to read a million books?
How to read a million books?cneudecker
 
Open sciencerefresher2019
Open sciencerefresher2019Open sciencerefresher2019
Open sciencerefresher2019heila1
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesOCLC
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012lljohnston
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 

Similar a We Have Interesting Problems: Some Applied Grand Challenges from Digital Libraries, Archives and Museums (20)

Final Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational ResearchFinal Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational Research
 
Digital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesDigital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont Colleges
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
 
Uk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseUk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcase
 
Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...
Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...
Enabling Complex Analysis of Large-Scale Digital Collections: Humanities Rese...
 
Mahendra Mahey, British Library Labs
Mahendra Mahey, British Library LabsMahendra Mahey, British Library Labs
Mahendra Mahey, British Library Labs
 
EPrints Update, Les Carr, University of Southampton
EPrints  Update, Les Carr, University of SouthamptonEPrints  Update, Les Carr, University of Southampton
EPrints Update, Les Carr, University of Southampton
 
Mahendra Mahay's slides from the Bloomsbury DH Meeting 30/09/2013
Mahendra Mahay's slides from the Bloomsbury DH Meeting 30/09/2013Mahendra Mahay's slides from the Bloomsbury DH Meeting 30/09/2013
Mahendra Mahay's slides from the Bloomsbury DH Meeting 30/09/2013
 
BL Labs at Bloomsbury Digital Humanities Group
BL Labs at Bloomsbury Digital Humanities Group BL Labs at Bloomsbury Digital Humanities Group
BL Labs at Bloomsbury Digital Humanities Group
 
Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital Platform
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collections
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
 
Ji cv6n1
Ji cv6n1Ji cv6n1
Ji cv6n1
 
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
 
How to read a million books?
How to read a million books?How to read a million books?
How to read a million books?
 
Open sciencerefresher2019
Open sciencerefresher2019Open sciencerefresher2019
Open sciencerefresher2019
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter Libraries
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 

Más de Trevor Owens

Caring for Digital Collections in the Anthropocene
Caring for Digital Collections in the AnthropoceneCaring for Digital Collections in the Anthropocene
Caring for Digital Collections in the AnthropoceneTrevor Owens
 
Theory and Craft of Digital Preservation Lightning Talk
Theory and Craft of Digital Preservation Lightning TalkTheory and Craft of Digital Preservation Lightning Talk
Theory and Craft of Digital Preservation Lightning TalkTrevor Owens
 
Planning for Digital Preservation in Organizations
Planning for Digital Preservation in OrganizationsPlanning for Digital Preservation in Organizations
Planning for Digital Preservation in OrganizationsTrevor Owens
 
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...Trevor Owens
 
Make it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and ConservationMake it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and ConservationTrevor Owens
 
Digital Preservation: Understanding the Risks
Digital Preservation: Understanding the RisksDigital Preservation: Understanding the Risks
Digital Preservation: Understanding the RisksTrevor Owens
 
Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...
Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...
Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...Trevor Owens
 
Start Today: Digital Stewardship Communities & Collaborations
Start Today: Digital Stewardship  Communities & CollaborationsStart Today: Digital Stewardship  Communities & Collaborations
Start Today: Digital Stewardship Communities & CollaborationsTrevor Owens
 
Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...
Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...
Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...Trevor Owens
 
Platform Thinking: Frameworks for a National Digital Platform State of Mind
Platform Thinking: Frameworks for a National Digital Platform State of MindPlatform Thinking: Frameworks for a National Digital Platform State of Mind
Platform Thinking: Frameworks for a National Digital Platform State of MindTrevor Owens
 
Digital Infrastructures that Embody Library Principles: The IMLS national dig...
Digital Infrastructures that Embody Library Principles: The IMLS national dig...Digital Infrastructures that Embody Library Principles: The IMLS national dig...
Digital Infrastructures that Embody Library Principles: The IMLS national dig...Trevor Owens
 
The IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can UseThe IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can UseTrevor Owens
 
Update on IMLS National Digital Platform
Update on IMLS National Digital Platform Update on IMLS National Digital Platform
Update on IMLS National Digital Platform Trevor Owens
 
Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformTrevor Owens
 
People, Communities and Platforms: Digital Cultural Heritage and the Web
People, Communities and Platforms: Digital Cultural Heritage and the WebPeople, Communities and Platforms: Digital Cultural Heritage and the Web
People, Communities and Platforms: Digital Cultural Heritage and the WebTrevor Owens
 
Macroscopes and Distant Reading: Implications for Infrastructures to Support ...
Macroscopes and Distant Reading: Implications for Infrastructures to Support ...Macroscopes and Distant Reading: Implications for Infrastructures to Support ...
Macroscopes and Distant Reading: Implications for Infrastructures to Support ...Trevor Owens
 
Digital Preservation's Role in the Future of the Digital Humanities
Digital Preservation's Role in the Future of the Digital HumanitiesDigital Preservation's Role in the Future of the Digital Humanities
Digital Preservation's Role in the Future of the Digital HumanitiesTrevor Owens
 
Cultural Heritage and the Crowd
Cultural Heritage and the CrowdCultural Heritage and the Crowd
Cultural Heritage and the CrowdTrevor Owens
 
Signifying and significance
Signifying and significanceSignifying and significance
Signifying and significanceTrevor Owens
 
Viewshare Curategear 2013
Viewshare Curategear 2013Viewshare Curategear 2013
Viewshare Curategear 2013Trevor Owens
 

Más de Trevor Owens (20)

Caring for Digital Collections in the Anthropocene
Caring for Digital Collections in the AnthropoceneCaring for Digital Collections in the Anthropocene
Caring for Digital Collections in the Anthropocene
 
Theory and Craft of Digital Preservation Lightning Talk
Theory and Craft of Digital Preservation Lightning TalkTheory and Craft of Digital Preservation Lightning Talk
Theory and Craft of Digital Preservation Lightning Talk
 
Planning for Digital Preservation in Organizations
Planning for Digital Preservation in OrganizationsPlanning for Digital Preservation in Organizations
Planning for Digital Preservation in Organizations
 
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
 
Make it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and ConservationMake it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and Conservation
 
Digital Preservation: Understanding the Risks
Digital Preservation: Understanding the RisksDigital Preservation: Understanding the Risks
Digital Preservation: Understanding the Risks
 
Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...
Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...
Testing Our Assumptions: The Centrality of Design Thinking and Scholarship fo...
 
Start Today: Digital Stewardship Communities & Collaborations
Start Today: Digital Stewardship  Communities & CollaborationsStart Today: Digital Stewardship  Communities & Collaborations
Start Today: Digital Stewardship Communities & Collaborations
 
Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...
Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...
Scientists’ Hard Drives, Databases, and Blogs: Preservation Intent and Source...
 
Platform Thinking: Frameworks for a National Digital Platform State of Mind
Platform Thinking: Frameworks for a National Digital Platform State of MindPlatform Thinking: Frameworks for a National Digital Platform State of Mind
Platform Thinking: Frameworks for a National Digital Platform State of Mind
 
Digital Infrastructures that Embody Library Principles: The IMLS national dig...
Digital Infrastructures that Embody Library Principles: The IMLS national dig...Digital Infrastructures that Embody Library Principles: The IMLS national dig...
Digital Infrastructures that Embody Library Principles: The IMLS national dig...
 
The IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can UseThe IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can Use
 
Update on IMLS National Digital Platform
Update on IMLS National Digital Platform Update on IMLS National Digital Platform
Update on IMLS National Digital Platform
 
Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital Platform
 
People, Communities and Platforms: Digital Cultural Heritage and the Web
People, Communities and Platforms: Digital Cultural Heritage and the WebPeople, Communities and Platforms: Digital Cultural Heritage and the Web
People, Communities and Platforms: Digital Cultural Heritage and the Web
 
Macroscopes and Distant Reading: Implications for Infrastructures to Support ...
Macroscopes and Distant Reading: Implications for Infrastructures to Support ...Macroscopes and Distant Reading: Implications for Infrastructures to Support ...
Macroscopes and Distant Reading: Implications for Infrastructures to Support ...
 
Digital Preservation's Role in the Future of the Digital Humanities
Digital Preservation's Role in the Future of the Digital HumanitiesDigital Preservation's Role in the Future of the Digital Humanities
Digital Preservation's Role in the Future of the Digital Humanities
 
Cultural Heritage and the Crowd
Cultural Heritage and the CrowdCultural Heritage and the Crowd
Cultural Heritage and the Crowd
 
Signifying and significance
Signifying and significanceSignifying and significance
Signifying and significance
 
Viewshare Curategear 2013
Viewshare Curategear 2013Viewshare Curategear 2013
Viewshare Curategear 2013
 

Último

modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 

Último (20)

modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 

We Have Interesting Problems: Some Applied Grand Challenges from Digital Libraries, Archives and Museums

  • 1. WE HAVE INTERESTING PROBLEMSSOME APPLIED GRAND CHALLENGES FROM DIGITAL LIBRARIES, ARCHIVES AND MUSEUMS
  • 2. TALK ROADMAP - Context on where I’m coming from - The New ABCs of Research as Framework - Examples from IMLS National Digital Platform Projects - Examples of initiatives from LC Labs - Some Jumping off Points and Applied Grand Challenges
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 10. The Theory and Craft of Digital Preservation https://osf.io/preprints/lissa/5cpjt/
  • 11.
  • 12.
  • 13. THE NEW ABCS OF RESEARCH AS A FRAMEWORK
  • 14.
  • 17.
  • 18. Extending Intelligent Computational Image Analysis for Archival Discovery (LG-71-16-0152-16), Board of Regents of the University of Nebraska, $462,317 The Image Analysis for Archival Discovery (Aida) research team at the University of Nebraska-Lincoln will investigate the use of image analysis as a methodology for content identification, description, and information retrieval in digital libraries and other digitized collections. The project will focus on identifying poetic and advertising content in digitized historic newspapers. Using a machine learning approach, the project will result in an intelligent computational system that can process digital images and identify these specific types of content. https://www.imls.gov/grants/awarded/LG-71-16- 0152-16
  • 19. Improving Access to Time-Based Media through Crowdsourcing and Machine Learning (LG-71-15-0208- 15), WGBH Educational Foundation, $898,474 WGBH, in partnership with Pop-Up Archive, will address the challenges faced by many libraries and archives trying to provide better online access to their media collections. This 30-month research project will explore and test technological and social approaches for metadata creation by leveraging scalable computation and engaging the public to improve access through crowdsourcing games for time-based media. https://www.imls.gov/grants/awarded/LG-71-15-0208-15
  • 20. Systems Interoperability and Collaborative Development for Web Archiving $353,221 and $98,460 in cost share: The Internet Archive, with the University of North Texas, Rutgers University, and Stanford University Library will build a foundation for collaborative technology development, improved systems interoperability, and an Application Programming Interface (API) based model for enhanced access to, and research use of, web archives. In working with the Archive-It platform, used by more than 350 partner institutions, results of this research will be directly applicable to libraries, archives, and museums around the country and the world. https://www.imls.gov/grants/awarded/LG-71-15- 0174-15
  • 21. Transforming Libraries and Archives through Crowdsourcing (LG-71-16-0028-16), Adler Planetarium, $1,214,780 This research partnership between Adler Planetarium’s Library and researchers at Oxford University, will expand the capacity for libraries and archives across the country to use crowdsourcing techniques to engage with audiences and improve access to digital collections. Through this effort, the team will develop a series of library/archive Zooniverse projects that explore improvements to full text and audio transcription and image annotation crowdsourcing tools and research differences between transcribing in isolation versus with knowledge of others’ transcription. Lessons learned from these projects will be incorporated into the Project Builder. https://www.imls.gov/grants/awarded/LG- 71-16-0028-16
  • 22. Programmatic Extraction of “Documents” from Web Archives (LG-71-17-0202-17), University of North Texas, $318,988 The University of North Texas Libraries and the Computer Science and Engineering Department will research the efficacy of using machine-learning algorithms to identify and extract publications contained in web archives. The overarching goal of this project is to understand if machine- learning models can successfully identify content-rich PDF and Word documents from web archives that align with library and archives collecting plans. https://www.imls.gov/grants/awarded/LG-71-17-0202-17
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36. Jer Thorp, Innovator in Residence • Overview https://labs.loc.gov/experiments/innovator-in-residence-jer-thorp/ • Research materials https://osf.io/b7e6w/ • Code https://github.com/blprnt/loc • Podcast https://artistinthearchive.podbean.com/ Laura Wrubel’s Library of Congress Colors • Application https://loc-colors.glitch.me/ • Code https://github.com/lwrubel/loc-colors • Blog post https://blogs.loc.gov/thesignal/2018/01/from-code-to-colors-working-with-the- loc-gov-json-api/ Tahir Hemphill, Papamarkou Chair in Education at the John W. Kluge Center • About Hip Hop Word Count https://www.newyorker.com/magazine/2013/04/01/rap- sheet-2 • Studio https://www.tahirhemphill.com/ • Past chairs https://www.loc.gov/loc/kluge/fellowships/hpeducation.html Reports • Gallinger, M. & Chudnov, D. Recommendations for a Digital Scholarship Lab at the Library of Congress • Herron, S. Digital Scholarship Resource Guide. • Access the reports https://labs.loc.gov/meta/reports/ LC LABS RESOURCES
  • 37. SOME JUMPING OFF POINTS AND APPLIED GRAND CHALLENGES
  • 38. SOME ESSENTIAL APPLIED RESEARCH AREAS - How can various new technologies be implemented to scale the ability to acquire, describe, organize and make available digital collections? - How can we best integrate various automated methods for working with digital collections with the work of subject catalogers/subject matter experts? - What ways can we best connect and build relationships with various user communities through crowdsourcing initiatives? - What do all of these technologies look like in ongoing production workflows?
  • 39. SOME MORE SPECIFIC EXAMPLES - Working models for content addressable storage in digital repository storage architectures - Reconciling data warehousing approaches with library approaches to content and metadata management - Weaving together structured cataloging workflows with metadata generating mechanisms (crowdsourcing, NLP, Computer Vision, etc.) - Virtual machines general purpose policy based restricted access infrastructure - Enabling data mining and computational scholarship on arbitrary restricted access collections

Notas del editor

  1. https://artistinthearchive.podbean.com/
  2. https://www.loc.gov/item/98518700/
  3. https://www.loc.gov/item/2008676434/