@mart1nkle1n @hvdsomp
CNI Spring 2019, April 8, 2019, St. Louis, MO
Martin Klein
LANL
@mart1nkle1n
https://orcid.org/0000-0003-0130-2097
Herbert Van de Sompel
DANS
@hvdsomp
https://orcid.org/0000-0002-0715-6126
An Institutional Perspective to Rescue Scholarly Orphans
The Scholarly Orphans project
is funded by the Andrew W. Mellon Foundation
Scholarly Orphans Team
• Los Alamos National Laboratory:
• Lyudmila Balakireva
• Martin Klein
• James Powell
• Harihar Shankar
• Herbert Van de Sompel
• Old Dominion University:
• Sawood Alam
• Grant Atkins
• Shawn Jones
• Mat Kelly
• Michael L. Nelson
Scholarly Orphans – Project Motivation
• Consideration
• Researchers are increasingly using a variety of web platforms for
collaboration and communication
• Why?
• Many of these platforms have desirable characteristics
• Versioning
• Time stamping
• Social embedding
• Their institutions do not provide platforms that have global reach
• Collaboration, cf. GitHub ~ productivity
• Communication, cf. SlideShare ~ visibility
Research and Research Communication on the Web
Emma Schymanski
https://orcid.org/0000-0001-6868-8145
https://github.com/schymane
https://www.slideshare.net/EmmaSchymanski
https://figshare.com/authors/Emma_Schymanski/5087039
https://publons.com/author/1538491/emma-schymanski#profile
https://www.eawag.ch/en/aboutus/portrait/organisation/staff/profile/emma-schymanski/
Shawn Jones
https://orcid.org/0000-0002-4372-870X
http://www.shawnmjones.org/
https://github.com/shawnmjones
https://www.slideshare.net/shawnmjones
https://en.wikipedia.org/wiki/User:Shawnmjones
https://www.blogger.com/profile/17827543974149663194
• Consideration
• Researchers deposit artifacts in web platforms
• Web Platforms:
• Dedicated to scholarship:
• Commercial: e.g., FigShare, Publons
• Not for profit: e.g., OSF, Zenodo
• General purpose:
• Commercial: e.g., GitHub, SlideShare
• Not for profit: e.g., Wikipedia, Wikidata
Research and Research Communication on the Web
• Consideration
• Researchers deposit artifacts in web platforms
• Status quo - The researchers’ institutions are in the dark
• Do not know about the existence of these artifacts
• Do not have a copy of these artifacts
Research and Research Communication on the Web
• Consideration
• Researchers deposit artifacts in web platforms
• Status quo – Uncertainty regarding long-term access
• Commercial: changing business model, no preservation commitment
• Not for profit: unpredictable funding stream
Research and Research Communication on the Web
• Consideration
• Researchers deposit artifacts in web platforms
• Status quo - Not systematically archived
• No frameworks like LOCKSS/Portico exist for these artifacts
• Researchers deposit artifacts only selectively in portals that
provide archival guarantees, typically to obtain a citable DOI
• Can’t expect researchers to (also) upload all artifacts to
institutional repositories (IRs)
• Web archives only incidentally archive these artifacts, cf.
anecdotal & Hiberlink project evidence
Research and Research Communication on the Web
Martin Klein, Herbert Van de Sompel, et al. (2014) Scholarly context not found. In: PLOS ONE
https://doi.org/10.1371/journal.pone.0115253
Emma’s SlideShare Artifact: 0 Mementos
https://www.slideshare.net/EmmaSchymanski/dmcm2018-community-resources-connecting-chemistry-and-toxicity-knowledge
http://timetravel.mementoweb.org/
Shawn’s GitHub Artifact: 1 Memento
https://github.com/shawnmjones/mediawiki
https://web.archive.org/web/*/https://github.com/shawnmjones/mediawiki
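Memento counts like the ones on these two slides can be derived from a TimeMap, the list of known captures for a URI defined by the Memento protocol (RFC 7089). The sketch below counts the mementos in a link-format TimeMap. It is an illustration only: `count_mementos` is a hypothetical helper, the sample TimeMap is made up, and the regular expression assumes that `rel` is the first parameter of every entry, as it is in the RFC's examples.

```python
import re

# Matches one TimeMap entry of the form <URI>; rel="..."
# Assumption: rel is the first parameter after the URI.
LINK_RE = re.compile(r'<([^>]*)>\s*;\s*rel="([^"]*)"')


def count_mementos(timemap_text):
    """Count entries whose rel value contains the 'memento' relation type."""
    return sum(
        1
        for _uri, rel in LINK_RE.findall(timemap_text)
        if "memento" in rel.split()
    )


# Made-up TimeMap for illustration; archive.example is a placeholder host.
SAMPLE_TIMEMAP = """\
<https://portal.example/alice/new>; rel="original",
<https://archive.example/timemap/link/https://portal.example/alice/new>; rel="self"; type="application/link-format",
<https://archive.example/timegate/https://portal.example/alice/new>; rel="timegate",
<https://archive.example/20180820120000/https://portal.example/alice/new>; rel="first memento"; datetime="Mon, 20 Aug 2018 12:00:00 GMT",
<https://archive.example/20190101000000/https://portal.example/alice/new>; rel="last memento"; datetime="Tue, 01 Jan 2019 00:00:00 GMT"
"""

if __name__ == "__main__":
    print(count_mementos(SAMPLE_TIMEMAP))
```

An artifact with zero mementos, like the SlideShare example above, would yield a count of 0 here.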
Scholarly Orphans – Project Overview
How to capture Scholarly Orphans for long-term archiving?
The Scholarly Orphans Project
• Explores an institution-driven paradigm
• Academic institutions typically have a long shelf life
• A basic premise underlying e.g., LOCKSS, perma.cc
• An academic institution should be interested in capturing the
artifacts (intellectual property) its scholars deposit on the web
• Collecting and archiving such artifacts aligns with the
mission of academic libraries
An Institutional Perspective
The Scholarly Orphans Project
• Explores a paradigm inspired by web archiving
• Scale of the problem
• Can’t expect researchers to upload all artifacts in an institutional
repository
• Bilateral agreements for archival purposes with most web
portals unlikely
A Web Archiving Perspective
Scholarly Orphans – Prototype Pipeline Overview
Prototype Pipeline
Tracking Artifacts
Tracking Artifacts - Description
• In order to track artifacts that were recently deposited by an
institutional researcher in a portal, one reasonably needs:
• The web identity of the researcher in the portal
• Algorithmic discovery
• Discovery via a registry
• Manual collection
• A portal API that supports:
• Access by web identity
• Access to contributions “since …” for the web identity
• Result of tracking:
• URI(s) of new artifact(s) discovered in the portal
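As a rough illustration of the tracking step described above, the following Python sketch selects a researcher's contributions by web identity and a "since" timestamp and returns the URIs of the new artifacts. The `Contribution` record, the `new_artifacts` helper, and the sample data are hypothetical; a real tracker would obtain the contribution list from the portal's API rather than from a hard-coded list.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Contribution:
    """Hypothetical record for one item a portal reports for a user."""
    web_identity: str   # the researcher's account name in the portal
    uri: str            # URI of the deposited artifact
    created: datetime   # deposit time reported by the portal


def new_artifacts(contributions, web_identity, since):
    """Return URIs deposited by web_identity after `since`, oldest first."""
    hits = [
        c for c in contributions
        if c.web_identity == web_identity and c.created > since
    ]
    return [c.uri for c in sorted(hits, key=lambda c: c.created)]


# Made-up sample data standing in for a portal API response.
SAMPLE = [
    Contribution("alice", "https://portal.example/alice/old",
                 datetime(2018, 7, 15, tzinfo=timezone.utc)),
    Contribution("alice", "https://portal.example/alice/new",
                 datetime(2018, 8, 20, tzinfo=timezone.utc)),
    Contribution("bob", "https://portal.example/bob/new",
                 datetime(2018, 8, 21, tzinfo=timezone.utc)),
]

if __name__ == "__main__":
    cutoff = datetime(2018, 8, 1, tzinfo=timezone.utc)
    print(new_artifacts(SAMPLE, "alice", cutoff))
```

Running the tracker repeatedly with the time of the previous run as `since` would yield only artifacts that have not been seen before.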
Tracking Artifacts - Challenges
• Portal API access by web identity
• Broadly supported by general purpose portals
• Typically not supported by scholarly portals
• Some lack an API altogether
• Should add ORCID access to APIs
• OAI-PMH and ResourceSync need sets per web identity
• Professional versus personal contributions
• Tracking frequency/scale
Capturing Artifacts
Capturing Artifacts - Description
• The capture process takes as input the URI of a new artifact
discovered in a portal
• Its task is to create a representative institutional capture of the
artifact
• Result of capture:
• WARC file for new artifact in an institutional archive
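To make the result of the capture step more concrete, the sketch below serializes a single WARC/1.0 `resource` record by hand. It is not how the project's capture tooling works; `build_warc_resource_record` is a hypothetical helper, the URI and payload are placeholders, and a real capture would normally store full HTTP `response` records for every resource within the artifact's web boundary.

```python
import uuid
from datetime import datetime, timezone


def build_warc_resource_record(target_uri, payload, content_type, captured_at):
    """Serialize one WARC/1.0 'resource' record for the given payload bytes.

    Assumption: captured_at is already expressed in UTC.
    """
    headers = [
        "WARC/1.0",
        "WARC-Type: resource",
        f"WARC-Target-URI: {target_uri}",
        f"WARC-Date: {captured_at.strftime('%Y-%m-%dT%H:%M:%SZ')}",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",
        f"Content-Type: {content_type}",
        f"Content-Length: {len(payload)}",
    ]
    head = ("\r\n".join(headers) + "\r\n\r\n").encode("utf-8")
    # A record is: header block, blank line, content block, two CRLFs.
    return head + payload + b"\r\n\r\n"


if __name__ == "__main__":
    record = build_warc_resource_record(
        "https://portal.example/alice/new",          # placeholder artifact URI
        b"<html><body>captured page</body></html>",  # placeholder payload
        "text/html",
        datetime(2018, 8, 20, 12, 0, 0, tzinfo=timezone.utc),
    )
    print(record.decode("utf-8"))
```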
Capturing Artifacts - Challenges
• Delineate the web boundary of the artifact
• More than the input artifact URI
• The boundary is in the eye of the beholder
• Create a high-fidelity capture using an approach that scales for a
steady stream of new artifacts
• Determine the web boundary of the artifact
• Handle dynamic content & interactive features of web pages
• We made a significant breakthrough with the Memento Tracer
framework
Memento Tracer: http://tracer.mementoweb.org
Archiving Artifacts
Archiving Artifacts - Description
• The archiving process takes as input the URI of a WARC file
generated by the capture process
• Its task is to ingest the WARC file in a cross-institutional web archive
• This can be achieved using off-the-shelf web archiving software,
e.g., pywb, OpenWayback
• Result of archiving:
• Mementos pertaining to newly discovered artifact in a cross-
institutional, Memento-compliant web archive
• Possibility to link to artifacts using Robust Links:
<a href="URI-A"
data-versionurl="URI-M"
data-versiondate="date-of-capture">link text</a>
Robust Links: http://robustlinks.mementoweb.org/about/
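The Robust Links markup above can be generated programmatically once the archiving step has produced a memento. The following sketch builds such an anchor from the original URI (URI-A), the memento URI (URI-M), and the capture date. The `robust_link` helper and the archive host in the example are hypothetical; only the `href`, `data-versionurl`, and `data-versiondate` attributes come from the slide.

```python
from html import escape


def robust_link(uri_a, uri_m, version_date, text):
    """Build an HTML anchor carrying Robust Links attributes.

    uri_a:        URI of the original artifact (URI-A)
    uri_m:        URI of the archived snapshot (URI-M)
    version_date: capture date as a string, e.g. "2018-08-20"
    text:         visible link text
    """
    return (
        '<a href="{}" data-versionurl="{}" data-versiondate="{}">{}</a>'
    ).format(
        escape(uri_a, quote=True),
        escape(uri_m, quote=True),
        escape(version_date, quote=True),
        escape(text),
    )


if __name__ == "__main__":
    print(robust_link(
        "https://github.com/shawnmjones/mediawiki",
        # Placeholder memento URI on a made-up archive host.
        "https://archive.example/20180820120000/https://github.com/shawnmjones/mediawiki",
        "2018-08-20",
        "mediawiki repository",
    ))
```

Escaping the attribute values keeps the markup well-formed even when a URI contains characters such as `&` or `"`.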
Archiving Artifacts - Challenges
• Attempted to use ipwb, a pywb version that uses IPFS
• Cross-institutional distributed file system with redundancy
• Ran out of time to get it operationally stable
Sawood Alam, Mat Kelly, and Michael L. Nelson (2016) InterPlanetary Wayback: The Permanent Web Archive
https://doi.org/10.1145/2910896.2925467
Pipeline Demo
https://myresearch.institute
myresearch.institute - Researchers
• Uniquely identified by ORCIDs
• Web identities in multiple portals
• Create various types of artifacts
myresearch.institute - Portals
• Tracking started August 27 2018
• Tracking artifacts created starting
August 1 2018
myresearch.institute – Statistics
Productivity Portal Distribution
Researcher Contributions
Researcher Contributions
Researcher Contributions
Artifact Frequency
Artifact Frequency per Portal
Scholarly Orphans – Pipeline
• 10,187 unique artifacts tracked, captured, and archived since
August 1, 2018
• 41MB event database
• 61GB of WARC files
• 2.3GB of web archive index
Scholarly Orphans – Pipeline
• Capture process, post tracking
• Within 9 minutes 50% of artifacts captured
• Within 1 hour 21 minutes 75% of artifacts captured
• Archiver process, post capture
• Within 10 minutes 50% of artifacts archived
• Within 57 minutes 75% of artifacts archived
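The 50% and 75% figures above are the median and the third quartile of the delay between pipeline stages. The sketch below shows one way such values could be computed from per-artifact delays with Python's standard library. The `latency_quartiles` helper and the sample delays are made up for illustration and are not the project's data or code.

```python
import statistics


def latency_quartiles(latencies_minutes):
    """Return (median, third quartile) of a list of delays in minutes.

    Half of the artifacts are processed within the median delay and
    three quarters within the third quartile.
    """
    _q1, median, q3 = statistics.quantiles(
        latencies_minutes, n=4, method="inclusive"
    )
    return median, q3


if __name__ == "__main__":
    # Made-up delays (in minutes) between tracking and capture.
    sample_delays = [4, 7, 9, 12, 30, 81, 240]
    median, q3 = latency_quartiles(sample_delays)
    print(f"50% captured within {median} min, 75% within {q3} min")
```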
Summary
• The Scholarly Orphans project explores an institution-driven
approach to capture scholarly artifacts deposited in web portals
• Artifacts out of scope of existing archival approaches such as
LOCKSS, Portico, web archives
• Institutions have a long shelf life, should be interested in
collecting these artifacts, and have feasible scale for
identity/artifact discovery
• Prototype at myresearch.institute illustrates feasibility, opportunities,
and challenges of this institutional perspective
“Ha, this is awesome! Thanks for letting me know - carry on as usual, and feel
free to monitor away. I'll try not to change my behaviour or anything now with
this new knowledge :)”
“This is fine, since everything you are capturing is public to start with. I also
wonder if you know about Software Heritage?”
“I’m very comfortable with being part of this (very important) research project”
“I'm cool with it :-)”
“Interesting project! I’m happy to participate.”
“One more thing, is it possible to get a copy of the URI-Rs that you guys
detected so that I can feed them into an archive of my choice?...”
What Our Researchers Say…

Más contenido relacionado

La actualidad más candente

1818 societypresentation revised2013
1818 societypresentation revised20131818 societypresentation revised2013
1818 societypresentation revised2013
Eliza McLeod
 

La actualidad más candente (20)

Blogs
BlogsBlogs
Blogs
 
The State of Open Education (#OpenCon2014)
The State of Open Education (#OpenCon2014)The State of Open Education (#OpenCon2014)
The State of Open Education (#OpenCon2014)
 
A Framework for Verifying the Fixity of Archived Web Resources
A Framework for Verifying the Fixity of Archived Web ResourcesA Framework for Verifying the Fixity of Archived Web Resources
A Framework for Verifying the Fixity of Archived Web Resources
 
Storytelling for Summarizing Collections in Web Archives
Storytelling for Summarizing Collections in Web ArchivesStorytelling for Summarizing Collections in Web Archives
Storytelling for Summarizing Collections in Web Archives
 
csvconfyasmin2017_05_03
csvconfyasmin2017_05_03csvconfyasmin2017_05_03
csvconfyasmin2017_05_03
 
Genealogical Deeds Done Dirt Cheap: No Apologies to AC/DC
Genealogical Deeds Done Dirt Cheap: No Apologies to AC/DCGenealogical Deeds Done Dirt Cheap: No Apologies to AC/DC
Genealogical Deeds Done Dirt Cheap: No Apologies to AC/DC
 
AL Live—Libraries and COVID-19: Considering Copyright During a Crisis, Part 2...
AL Live—Libraries and COVID-19: Considering Copyright During a Crisis, Part 2...AL Live—Libraries and COVID-19: Considering Copyright During a Crisis, Part 2...
AL Live—Libraries and COVID-19: Considering Copyright During a Crisis, Part 2...
 
Enabling Personal Use of Web Archives
Enabling Personal Use of Web ArchivesEnabling Personal Use of Web Archives
Enabling Personal Use of Web Archives
 
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
 
#OERde14 Keynote: "Generation Open: An International Look at the Coming Revol...
#OERde14 Keynote: "Generation Open: An International Look at the Coming Revol...#OERde14 Keynote: "Generation Open: An International Look at the Coming Revol...
#OERde14 Keynote: "Generation Open: An International Look at the Coming Revol...
 
New Life to Old Serials:
New Life to Old Serials: New Life to Old Serials:
New Life to Old Serials:
 
1818 societypresentation revised2013
1818 societypresentation revised20131818 societypresentation revised2013
1818 societypresentation revised2013
 
Social Networking & Libraries: Best Practices & Challenges
Social Networking & Libraries: Best Practices & ChallengesSocial Networking & Libraries: Best Practices & Challenges
Social Networking & Libraries: Best Practices & Challenges
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve Meyer
 
Why We Need Multiple Archives
Why We Need Multiple ArchivesWhy We Need Multiple Archives
Why We Need Multiple Archives
 
Intro to Wikisource
Intro to WikisourceIntro to Wikisource
Intro to Wikisource
 
Improving research skills
Improving research skillsImproving research skills
Improving research skills
 
American Libraries Live—Libraries and COVID-19: Providing Virtual Services, L...
American Libraries Live—Libraries and COVID-19: Providing Virtual Services, L...American Libraries Live—Libraries and COVID-19: Providing Virtual Services, L...
American Libraries Live—Libraries and COVID-19: Providing Virtual Services, L...
 
Wikipedia: Why? Who? and How?
Wikipedia: Why? Who? and How?Wikipedia: Why? Who? and How?
Wikipedia: Why? Who? and How?
 
Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...
 

Similar a An Institutional Perspective to Rescue Scholarly Orphans

Understanding information sources (online) library course (Updated August 2012)
Understanding information sources (online) library course (Updated August 2012)Understanding information sources (online) library course (Updated August 2012)
Understanding information sources (online) library course (Updated August 2012)
Joanne4
 

Similar a An Institutional Perspective to Rescue Scholarly Orphans (20)

To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
 
To the Rescue of the Orphans of Scholarly Communication
To the Rescue of the Orphans of Scholarly CommunicationTo the Rescue of the Orphans of Scholarly Communication
To the Rescue of the Orphans of Scholarly Communication
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
 
The Journal of Open Archaeology Data and PRIME: Incentivising Open Data Archi...
The Journal of Open Archaeology Data and PRIME: Incentivising Open Data Archi...The Journal of Open Archaeology Data and PRIME: Incentivising Open Data Archi...
The Journal of Open Archaeology Data and PRIME: Incentivising Open Data Archi...
 
PRIME: Publisher, Repository & Institutional Metadata Exchange
PRIME: Publisher, Repository & Institutional Metadata ExchangePRIME: Publisher, Repository & Institutional Metadata Exchange
PRIME: Publisher, Repository & Institutional Metadata Exchange
 
Open Access: Advantages, Funding, Opportunities
Open Access: Advantages, Funding, Opportunities Open Access: Advantages, Funding, Opportunities
Open Access: Advantages, Funding, Opportunities
 
PA Digital and the DPLA
PA Digital and the DPLAPA Digital and the DPLA
PA Digital and the DPLA
 
Introducing PRIME:Publisher, Repository and Institutional Metadata Exchange
Introducing PRIME:Publisher, Repository and Institutional Metadata ExchangeIntroducing PRIME:Publisher, Repository and Institutional Metadata Exchange
Introducing PRIME:Publisher, Repository and Institutional Metadata Exchange
 
OCLC Research update: Active engagement.
OCLC Research update: Active engagement.OCLC Research update: Active engagement.
OCLC Research update: Active engagement.
 
Questions to Ask Across the Ethnographic Lifecycle
Questions to Ask Across the Ethnographic LifecycleQuestions to Ask Across the Ethnographic Lifecycle
Questions to Ask Across the Ethnographic Lifecycle
 
Advocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCEAdvocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCE
 
CoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An IntroductionCoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An Introduction
 
CoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An IntroductionCoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An Introduction
 
Libraries, collections, technology: presented at Pennylvania State University...
Libraries, collections, technology: presented at Pennylvania State University...Libraries, collections, technology: presented at Pennylvania State University...
Libraries, collections, technology: presented at Pennylvania State University...
 
@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015
 
Aggregating Private and Public Web Archives Using the Mementity Framework
Aggregating Private and Public Web Archives Using the Mementity FrameworkAggregating Private and Public Web Archives Using the Mementity Framework
Aggregating Private and Public Web Archives Using the Mementity Framework
 
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
 
New Identities: Adapting the Academic Library
New Identities: Adapting the Academic LibraryNew Identities: Adapting the Academic Library
New Identities: Adapting the Academic Library
 
Adapting the Academic Library MD ACRL-MILEX
Adapting the Academic Library MD ACRL-MILEXAdapting the Academic Library MD ACRL-MILEX
Adapting the Academic Library MD ACRL-MILEX
 
Understanding information sources (online) library course (Updated August 2012)
Understanding information sources (online) library course (Updated August 2012)Understanding information sources (online) library course (Updated August 2012)
Understanding information sources (online) library course (Updated August 2012)
 

Más de Martin Klein

Más de Martin Klein (20)

On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly WebOn the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
 
On the Persistence of Persistent Identifiers of the Scholarly Web
 On the Persistence of Persistent Identifiers of the Scholarly Web On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
 
An Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly OrphansAn Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly Orphans
 
Who is Asking - Humans and Machines Experience a Different Scholarly Web
Who is Asking - Humans and Machines  Experience a Different Scholarly WebWho is Asking - Humans and Machines  Experience a Different Scholarly Web
Who is Asking - Humans and Machines Experience a Different Scholarly Web
 
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
 
Comparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSyncComparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSync
 
Evaluating Memento Service Optimizations
Evaluating Memento Service OptimizationsEvaluating Memento Service Optimizations
Evaluating Memento Service Optimizations
 
First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...
 
Smart Routing of Memento Requests
Smart Routing of Memento RequestsSmart Routing of Memento Requests
Smart Routing of Memento Requests
 
Building Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web ArchivesBuilding Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web Archives
 
A Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly ArtifactsA Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly Artifacts
 
Focused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event CollectionsFocused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event Collections
 
Creating Topical Collections: Web Archives vs. Live Web
Creating Topical Collections:Web Archives vs. Live WebCreating Topical Collections:Web Archives vs. Live Web
Creating Topical Collections: Web Archives vs. Live Web
 
Robust Linking to Web Resources
Robust Linking to Web ResourcesRobust Linking to Web Resources
Robust Linking to Web Resources
 
Signposting for Repositories
Signposting for RepositoriesSignposting for Repositories
Signposting for Repositories
 
Discovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDDiscovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCID
 
Using the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly CommunicationUsing the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly Communication
 
Uniform Access to Raw Mementos
Uniform Access to Raw MementosUniform Access to Raw Mementos
Uniform Access to Raw Mementos
 
Robust Links - a proposed solution to reference rot in scholarly communication
Robust Links - a proposed solution to reference rot in scholarly communicationRobust Links - a proposed solution to reference rot in scholarly communication
Robust Links - a proposed solution to reference rot in scholarly communication
 
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
 

Último

一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
ayvbos
 
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
ydyuyu
 
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
ayvbos
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
JOHNBEBONYAP1
 
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
ydyuyu
 
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
pxcywzqs
 
Russian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
Russian Escort Abu Dhabi 0503464457 Abu DHabi EscortsRussian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
Russian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
Monica Sydney
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Monica Sydney
 
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiAbu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Monica Sydney
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理
F
 

Último (20)

一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
 
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac RoomVip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
 
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
 
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime BalliaBallia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirt
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirt
 
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样

An Institutional Perspective to Rescue Scholarly Orphans

  • 1. @mart1nkle1n @hvdsomp CNI Spring 2019, April 8 2019, St. Louis, MO Martin Klein LANL @mart1nkle1n https://orcid.org/0000-0003-0130-2097 Herbert Van de Sompel DANS @hvdsomp https://orcid.org/0000-0002-0715-6126 An Institutional Perspective to Rescue Scholarly Orphans The Scholarly Orphans project is funded by the Andrew W. Mellon Foundation
  • 2. Scholarly Orphans Team • Los Alamos National Laboratory: • Lyudmila Balakireva • Martin Klein • James Powell • Harihar Shankar • Herbert Van de Sompel • Old Dominion University: • Sawood Alam • Grant Atkins • Shawn Jones • Mat Kelly • Michael L. Nelson
  • 3. Scholarly Orphans – Project Motivation
  • 4. Research and Research Communication on the Web • Consideration • Researchers are increasingly using a variety of web platforms for collaboration and communication • Why? • Many of these platforms have desirable characteristics • Versioning • Time stamping • Social embedding • Their institutions do not provide platforms that have global reach • Collaboration, cf. GitHub ~ productivity • Communication, cf. SlideShare ~ visibility
  • 5. Emma Schymanski https://orcid.org/0000-0001-6868-8145 https://github.com/schymane https://www.slideshare.net/EmmaSchymanski https://figshare.com/authors/Emma_Schymanski/5087039 https://publons.com/author/1538491/emma-schymanski#profile https://www.eawag.ch/en/aboutus/portrait/organisation/staff/profile/emma-schymanski/
  • 6. Shawn Jones https://orcid.org/0000-0002-4372-870X http://www.shawnmjones.org/ https://github.com/shawnmjones https://www.slideshare.net/shawnmjones https://en.wikipedia.org/wiki/User:Shawnmjones https://www.blogger.com/profile/17827543974149663194
  • 7. Research and Research Communication on the Web • Consideration • Researchers deposit artifacts in web platforms • Web Platforms: • Dedicated to scholarship: • Commercial: e.g., FigShare, Publons • Not for profit: e.g., OSF, Zenodo • General purpose: • Commercial: e.g., GitHub, SlideShare • Not for profit: e.g., Wikipedia, Wikidata
  • 8. Research and Research Communication on the Web • Consideration • Researchers deposit artifacts in web platforms • Status quo – The researchers’ institutions are in the dark • Do not know about the existence of these artifacts • Do not have a copy of these artifacts
  • 9. Research and Research Communication on the Web • Consideration • Researchers deposit artifacts in web platforms • Status quo – Uncertainty regarding long-term access • Commercial: changing business model, no preservation commitment • Not for profit: unpredictable funding stream
  • 10. Research and Research Communication on the Web • Consideration • Researchers deposit artifacts in web platforms • Status quo – Not systematically archived • No frameworks like LOCKSS/Portico exist for these artifacts • Researchers only selectively deposit artifacts in portals that provide archival guarantees, e.g., to obtain a citable DOI • Can’t expect researchers to (also) upload all artifacts in IRs • Web archives only incidentally archive these artifacts, cf. anecdotal & Hiberlink project evidence • Reference: Martin Klein, Herbert Van de Sompel, et al. (2014) Scholarly context not found. In: PLOS ONE https://doi.org/10.1371/journal.pone.0115253
  • 11. Emma’s SlideShare Artifact: 0 Mementos https://www.slideshare.net/EmmaSchymanski/dmcm2018-community-resources-connecting-chemistry-and-toxicity-knowledge http://timetravel.mementoweb.org/
  • 12. Shawn’s GitHub Artifact: 1 Memento https://github.com/shawnmjones/mediawiki https://web.archive.org/web/*/https://github.com/shawnmjones/mediawiki
  • 13. Scholarly Orphans – Project Overview How to capture Scholarly Orphans for long-term archiving?
  • 14. The Scholarly Orphans Project • Explores an institution-driven paradigm • Academic institutions typically have a long shelf life • A basic premise underlying e.g., LOCKSS, perma.cc • An academic institution should be interested in capturing the artifacts (intellectual property) its scholars deposit on the web • Collecting and archiving such artifacts aligns with the mission of academic libraries
  • 15. An Institutional Perspective
  • 16. The Scholarly Orphans Project • Explores a paradigm inspired by web archiving • Scale of the problem • Can’t expect researchers to upload all artifacts in an institutional repository • Bilateral agreements for archival purposes with most web portals unlikely
  • 17. A Web Archiving Perspective
  • 18. Scholarly Orphans – Prototype Pipeline Overview
  • 19. Prototype Pipeline
  • 20. Tracking Artifacts
  • 21. Tracking Artifacts - Description • In order to track artifacts that were recently deposited by an institutional researcher in a portal, one reasonably needs: • The web identity of the researcher in the portal • Algorithmic discovery • Discovery via a registry • Manual collection • A portal API that supports: • Access by web identity • Access to contributions “since …” for the web identity • Result of tracking: • URI(s) of new artifact(s) discovered in the portal
  • 22. Tracking Artifacts - Challenges • Portal API access by web identity • Broadly supported by general purpose portals • Typically not supported by scholarly portals • Some lack an API altogether • Should add ORCID access to APIs • OAI-PMH and ResourceSync need sets per web identity • Professional versus personal contributions • Tracking frequency/scale
  • 23. Capturing Artifacts
  • 24. Capturing Artifacts - Description • The capture process takes as input the URI of a new artifact discovered in a portal • Its task is to create a representative institutional capture of the artifact • Result of capture: • WARC file for new artifact in an institutional archive
  • 25. Capturing Artifacts - Challenges • Delineate the web boundary of the artifact • More than the input artifact URI • The boundary is in the eye of the beholder • Create a high-fidelity capture using an approach that scales for a steady stream of new artifacts • Determine the web boundary of the artifact • Handle dynamic content & interactive features of web pages • We made a significant breakthrough with the Memento Tracer framework • Memento Tracer: http://tracer.mementoweb.org
  • 26. Archiving Artifacts
  • 27. Archiving Artifacts - Description • The archiving process takes as input the URI of a WARC file generated by the capture process • Its task is to ingest the WARC file in a cross-institutional web archive • This can be achieved using off-the-shelf web archiving software, e.g., pywb, OpenWayback • Result of archiving: • Mementos pertaining to newly discovered artifact in a cross-institutional, Memento-compliant web archive • Possibility to link to artifacts using Robust Links: <a href="URI-A" data-versionurl="URI-M" data-versiondate="date-of-capture"> • Robust Links: http://robustlinks.mementoweb.org/about/
  • 28. Archiving Artifacts - Challenges • Attempted to use ipwb, a pywb version that uses IPFS • Cross-institutional distributed file system with redundancy • Ran out of time to get it operationally stable • Reference: Sawood Alam, Mat Kelly, and Michael L. Nelson (2016) InterPlanetary Wayback: The Permanent Web Archive https://doi.org/10.1145/2910896.2925467
  • 29. Pipeline Demo https://myresearch.institute
  • 30. myresearch.institute - Researchers • Uniquely identified by ORCIDs • Web identities in multiple portals • Create various types of artifacts
  • 31. myresearch.institute - Portals • Tracking started August 27 2018 • Tracking artifacts created starting August 1 2018
  • 32. myresearch.institute – Statistics
  • 33. Productivity Portal Distribution
  • 34. Researcher Contributions
  • 35. Researcher Contributions
  • 36. Researcher Contributions
  • 37. Artifact Frequency
  • 38. Artifact Frequency per Portal
  • 39. Scholarly Orphans – Pipeline • 10,187 unique artifacts tracked, captured, and archived since 08/01/2018 • 41MB event database • 61GB of WARC files • 2.3GB of web archive index
  • 40. Scholarly Orphans – Pipeline • Capture process, post tracking • Within 9 minutes 50% of artifacts captured • Within 1 hour 21 minutes 75% of artifacts captured • Archiver process, post capture • Within 10 minutes 50% of artifacts archived • Within 57 minutes 75% of artifacts archived
  • 41. Summary • The Scholarly Orphans project explores an institution-driven approach to capture scholarly artifacts deposited in web portals • Artifacts out of scope of existing archival approaches such as LOCKSS, Portico, web archives • Institutions have a long shelf life, should be interested in collecting these artifacts, and have feasible scale for identity/artifact discovery • Prototype at myresearch.institute illustrates feasibility, opportunities, and challenges of this institutional perspective
  • 42. What Our Researchers Say… “Ha, this is awesome! Thanks for letting me know - carry on as usual, and feel free to monitor away. I'll try not to change my behaviour or anything now with this new knowledge :)” “This is fine, since everything you are capturing is public to start with. I also wonder if you know about Software Heritage?” “I’m very comfortable with being part of this (very important) research project” “I'm cool with it :-)” “Interesting project! I’m happy to participate.” “One more thing, is it possible to get a copy of the URI-Rs that you guys detected so that I can feed them into an archive of my choice?...”
  • 43. Martin Klein LANL @mart1nkle1n https://orcid.org/0000-0003-0130-2097 Herbert Van de Sompel DANS @hvdsomp https://orcid.org/0000-0002-0715-6126 An Institutional Perspective to Rescue Scholarly Orphans The Scholarly Orphans project is funded by the Andrew W. Mellon Foundation
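
As a rough illustration of two steps from the pipeline slides, the sketch below shows (a) the tracker's selection of contributions made "since" a given time for one web identity, and (b) the Robust Links markup from the archiving slide. It is a minimal sketch, not the project's implementation: the function names, the dictionary layout of a contribution, and all example.org URIs are assumptions made up for this example. Only the three attribute names (href, data-versionurl, data-versiondate) come from the slides.

```python
from datetime import datetime, timezone
from html import escape


def new_artifacts(contributions, since):
    """Return the URIs of contributions created strictly after `since`.

    `contributions` is a hypothetical list of dicts with "uri" and "created"
    keys, standing in for whatever a portal API returns for a web identity.
    """
    return [c["uri"] for c in contributions if c["created"] > since]


def robust_link(uri_a, uri_m, captured, text):
    """Build an HTML anchor carrying the Robust Links attributes.

    uri_a    -- original artifact URI (the href)
    uri_m    -- URI of the archived snapshot (data-versionurl)
    captured -- datetime of the capture (data-versiondate, as YYYY-MM-DD)
    text     -- visible link text
    """
    return (
        f'<a href="{escape(uri_a, quote=True)}" '
        f'data-versionurl="{escape(uri_m, quote=True)}" '
        f'data-versiondate="{captured.date().isoformat()}">'
        f"{escape(text)}</a>"
    )


if __name__ == "__main__":
    # Hypothetical data: one contribution before and one after the cutoff.
    since = datetime(2018, 8, 1, tzinfo=timezone.utc)
    contributions = [
        {"uri": "https://portal.example.org/alice/repo-1",
         "created": datetime(2018, 7, 15, tzinfo=timezone.utc)},
        {"uri": "https://portal.example.org/alice/slides-1",
         "created": datetime(2018, 9, 2, tzinfo=timezone.utc)},
    ]
    for uri in new_artifacts(contributions, since):
        # The snapshot URI layout here is an assumption for illustration only.
        snapshot = "https://archive.example.org/20180902120000/" + uri
        captured = datetime(2018, 9, 2, 12, 0, tzinfo=timezone.utc)
        print(robust_link(uri, snapshot, captured, "Slides"))
```

A real tracker would obtain the contribution list from each portal's API and persist the last-seen timestamp per web identity; both are left out here to keep the sketch self-contained.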