SlideShare a Scribd company logo
1 of 43
Where data and journal content collide
what does it mean to ‘publish your data’?
Peter Burnhill,
EDINA, Information Services
University of Edinburgh
09:40 – 10:00
#ReCon_15 : Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
Overview
Time-served data person reverts to being a PI-cum-
researcher, & having to ask: What data should
be shared, when and how?
1. Propose 3 categories of data
A: Databases used [how the data came about]
B: The assembled Datasets [what I analysed]
C: Data behind the graph [what is part of my statement]
2. Report on 2 case studies to illustrate this
• Each of relevance to scholarly communication
1. Scottish Education Data Archive, 1979 - mid ‘80s
– Survey statistician: school leavers, YTS & 16-19 cohort surveys
• In Centre for Educational Sociology
2. Edinburgh University Data Library,1984 & on
– Manager: set-up and development
– President of IASSIST, 2000 – 2004 : social science data professionals
3. Graduate School, Faculty of Social Science, 1987 – 1997
– Senior Lecturer, teaching quantitative/survey methods
• In Research Centre for Social Sciences
4. ESRC Regional Research Laboratory for Scotland, 1986/90
– Co-director: early days of Geographical Information Systems (GIS)
• With University’s Department of Geography; Honorary Fellow, Royal Scottish Geographical Society, 2015
5. EDINA, 1995/6 to present - main focus as day job
– Director: set-up and continuous development
– Jisc-designated centre for service delivery & digital expertise
6. Digital Curation Centre, 2004/05
– Director for set-up & definition of ‘data curation + digital preservation’
• With University’s School of Informatics
a time-served data person (at U of Ed)
Two ‘case studies’ to illustrate
① Project funded by Andrew Mellon Foundation
• No mandate on data deposit but encourage OA for
tools/application developed as part of the project
② ‘Ongoing project’: statistical statement using
data from operation of two Jisc services
• with no direct mandate (& could have passed
undetected)
Both case studies have findings about threats to the
integrity of the scholarly record.
① Reference Rot ② E-Journal
Archiving
Study Measure the extent of what we now call
Reference Rot = Link Rot + Content Drift
• Identify intervention opportunities to stop the rot
• Devise sustainable solutions with maximal reach
Project
Hiberlink
Andrew Mellon Foundation
EDINA & Language Technology Group, School of
Informatics (Claire Grover & colleagues )
jointly with the
Research Library, Los Alamos National Laboratory
(Herbert Van de Sompel & colleagues).
hiberlink.org
Link Rot
‘Link Rot’
+ Content Drift: What is at end of URI has changed, or gone!
http://dl00.org
2000
http://dl00.org
2004
http://dl00.org
2005
http://dl00.org
2008
(a) Dynamic content
as values on webpage
changes over time
(b) Static content
but very different (often
unrelated) web pages
① Reference Rot ② E-Journal
Archiving
Study status of references to the web-at-large
(in e-theses)
Project Hiberlink
Findings
Empirical statements
Made as:
i) WORK-IN-PROGRESS
in preparation for
ii) PUBLICATION
Analysis of of 7,000 e-theses revealed that
Reference Rot occurs in over 36% of the
embedded URIs
Routine web archiving delivers less than a
50:50 chance that content is being kept safe
circa 1 in 5 of referenced content is probably
lost for ever
+ Use of 3 very large corpus of journal
articles demonstrated very significant
reference rot => ‘rotten articles for sale’
‘
Scholarly Articles increasingly link to
Web Resources, not just back to other Articles
Findings: Status of Referenced URIs, PMC corpus
Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One
in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253
http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253
6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today),
Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One
in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253
http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253
Findings: Status of Referenced URIs, Elsevier corpus
6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today),
Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
Remedy: Create Snapshots of Referenced Resources
Snapshots can be created at various stages. The closer to
the moment of referencing, the better the image captured.
Stage Actor Snapshot Quality
Preparation Author/reference tool best
Submission
/Issue
Editor/manuscript
system
good
Access
(post-publication)
Aggregator/
publisher platform
so-so
Shelving Librarian/IR,
journal archive
better than nothing
Prototypes of pro-active approaches to support the
archiving of web references for scholarly
communications
Richard Wincewicz1, Peter Burnhill1
& Herbert Van de Sompel2
1EDINA, University of Edinburgh, 2Los Alamos National Laboratory
http://hiberlink.org #hiberlink
Authoring - Zotero Plugin Demonstrator
Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero for pro-active archiving and temporal references
https://www.youtube.com/v/ZYmi_Ydr65M%26vq
①
Reference
Rot
②E-Journal Archiving
Study Extent to which scholarly record is at risk of loss:
who is looking after your e-journal content?
Project] Keepers+
‘Unfunded’ (Jisc / UoEd)
EDINA in collaboration internationally with archiving
organisations & research libraries
thekeepers.org
http://thekeepers.blogs.edina.ac.uk
That Article in the Scholarly Record is not in the
custody of Libraries, nor yet on their digital shelves.
Picture credit: http://somanybooksblog.com/2009/03/27/library-tour/
thekeepers.org as Global Monitor
… to discover who is looking after what
① Reference
Rot
② E-Journal Archiving
Study status of
references to the
web-at-large in e-
theses.
scholarly record at risk of loss: who is looking after
e-journal content?
Project Hiberlink Keepers+
Key Findings
Empirical statements
Made as:
i) WORK-IN-PROGRESS
in preparation for
ii) PUBLICATION
Two thirds (68%) of what was consulted
online (108 UK universities) in 2012 is at risk
of loss.
Missing Volumes & Issues
Only 22% to 28% of Title Lists of 3 US
research libraries (Columbia, Cornell & Duke) were
being archived when checked in 2011/12
We need to update these findings annually
 Libraries don’t have e-collections of serials
(only e-connections)
 So we all need to know what scholarly
content is being kept safe somewhere!
Two Key Statistics
‘Ingest Ratio’ = titles ingested by one or more Keeper
/ ‘online serials’ in ISSN Register
= 28,103 / 165,949 [as of June 2015]
=> 17%
‘KeepSafe Ratio’ = titles being ingested by 3+ Keepers
/ ‘online serials’ in ISSN Register
= 9,836 / 165,949
=> 6%
with usage logs for the UK OpenURL Router*
• 8.5m full text requests in UK during 2012
=> 53,311 online titles requested
Analysis in 2013:
‘Ingest Ratio’ = 32% (16,985/53,311)
=> over two thirds 68% (36,326 titles) held by none!
Archival Status of e-Serials Requested
* As reported in Keepers Registry Blog, OpenURL Router passes ‘discovery’ requests to commercial OpenURL
resolver services; developed & delivered by EDINA as part of Jisc support for UK universities & colleges
with usage logs for the UK OpenURL Router*
• 8.5m full text requests in UK during 2012
 53,311 online titles requested
Analysis carried out again in 2015:
‘Ingest Ratio’ = 36% (19,231/53,311) ; up by 2,246 (4%)
=> but still, 64% (34,080 titles) held by none!
‘KeepSafe Ratio’ = 20% (10,847/53,311) ; up by 2,985 (5%)
Archival Status of Requested e-Serials: Update
Archival Status of Online Continuing Resources
assigned ISSN, by Country, June 2015
very many ‘at risk’ e-journals from many small publishers
BIG
publishers
act early but
incompletely
Priority:
find economic way to
archive content from …
Cannot ignore the focus on Publication
re-visiting an article now being cited again:
On measuring the relation between
social science research activity and
research publication.
Research Evaluation 4.3 130-152
doi: 10.1093/rev/4.3.130
P. Burnhill & M. Tubby-Hille (1994)
& What the Funder sees
STUDY
DATA, other working capital
& references to work of others
FINDINGS
Taken from: Figure 1 in P. Burnhill & M. Tubby-Hille (1994)
On measuring the relation between social science research activity and research publication.
Research Evaluation 4.3 130-152. doi: 10.1093/rev/4.3.130
Study / Project / Data / Findings / Publication
STUDY/ Activity [Purpose] Large-scale experiment /
Exploratory investigation
PROJECT [Grant] FunderRef ; GrantID
Databases consulted / used
Source / Origination
Using extant databases
(Generating new data)
Dataset(s)
Assembled & Analysed
Extracted data ; derived
variables; multiple versions
FINDINGS
i) Work-in-progress
ii) PUBLICATION
Empirical Statement(s)
i) Presentations etc
ii) Formal report of the
results of research
DATA as results
to be shared?
DATA as
working capital
Study / Project / Data / Findings / Publication
Study Large-scale experiment /
Exploratory investigation
Project
Data Source / Origination
‘database(s)’
Using extant databases
(Generating new data)
Who has custody of new
data?
‘Assembled datasets’
’Dataset(s)’ Analysed
Extracted data; derived
variables; multiple
versions
‘Data behind the graph’ Supplementary data which
enhance the publication of
the results reported.
Do publishers want to
hand responsibility to
subject & institutional
repositories?
Key Findings
i) Work-in-progress
ii) Publication
Empirical Statement(s)
What Data
should be
shared?
DataType C
DataType B
DataType A
Study / Project / Data / Findings / Publication
Study
Project
Data Source / Origination
‘database(s)’
External to Project
Generating new data Using extant databases
Assembled Datasets
’Dataset(s)’ Analysed
Product of Project
multiple versions
‘Data behind the graph’ Supplementary data
Key Findings
i) Work-in-progress
ii) Publication
Empirical Statement(s)
DataType C: Should be made available & preserved as multi- part work
But do publishers want the responsibility; role of subject & institutional repositories?
DataType B: Choices: which of these exactly?
For your future use? For others? Required for
reproducibility?
DataType A: These sources should be cited
But when are preservation & ‘continuity of access’ proper
tasks for the University?
Study / Project / Data / Findings / Publication
① Reference Rot Study ② E-Journal Archiving
Study status of references to the
web-at-large [in e-theses]
scholarly record at risk of loss: who
is looking after e-journal content?
Project Hiberlink Keepers+
‘database(s)’
Data Source / Origination
DataType A
External to Project
• Full text of c.7,500 doctoral
theses, as downloaded from
5 university repositories
• Networked Digital Library of
Theses and Dissertations
metadata
•Logs of requests from UK
universities (c.10m pa) via Jisc
OpenURL Router
• Aggregation of archival actions’
for online serials via the Keepers
Registry
‘Assembled datasets’
’Dataset(s)’ Analysed
‘Data behind the graph’
Study / Project / Data = Findings / Publication
① Reference Rot Study ② E-Journal Archiving
Study status of references to the web-at-
large (in e-theses)
scholarly record at risk of loss: who is
looking after e-journal content?
Project Hiberlink Keepers+
‘database(s)’
Data Source / Origination
DataType A
• Full text of c.7,500 doctoral
theses, as downloaded from
5 university repositories
• Networked Digital Library of
Theses and Dissertations
metadata
•Logs of requests from UK universities
(c.10m pa) via Jisc OpenURL Router
• Aggregation of archival actions’ for
online serials via the Keepers Registry
Datasets Assembled
Dataset(s) Analysed
DataType B
Product of Project
c.46,000 URIs extracted
from 7,000 eTheses
&
3 other very large corpus
tested for status, recording
live/not, archived/not &
other attributes
c.53,000 online serial titles
cross checked against the
reports in Keepers Registry
* This could be the first of a
regular (annual) series of
datasets recording what is
being archived and what is not
• why should we publish our data?
• what data should be shared, when and how?
& what about the new Web-resident research statements?
Data as scholarship: a cultural shift?
Preserve or Perish
“You are not finished until you have done the
research, published the results, and published
the data, receiving formal credit for everything.”
Mark A. Parsons (2006)
International Polar Year
“A scholar’s positive contribution is measured by the sum of
the original data that he contributes. Hypotheses come and
go but data remain.”
in Advice to a Young Investigator (1897) Santiago Ramón y Cajal
(Nobel Prize winner, 1906)
A more practical set of questions?
• why should we publish our data?
• what data should be shared,
when &
how?
The What
• why should we publish our data?
• what data should be shared, when and how?
DataType B: Data = Findings
• The dataset(s) on which we based our research
statements, or …
• The dataset(s) that were assembled, upon which
others can base their research
STUDY
DATA, other working capital
& references to work of others
FINDINGS
Taken from: Figure 1 in P. Burnhill & M. Tubby-Hille (1994)
On measuring the relation between social science research activity and research publication.
Research Evaluation 4.3 130-152. doi: 10.1093/rev/4.3.130
DATA as FINDINGS
http://www.restfulliving.com/wp-content/uploads/2013/12/Time-1024x861.jpg
Preserving the integrity
of the scholarly
record
When?
STUDY
DATA, other working capital
& references to work of others
FINDINGS
When Findings are reported in Publications?
STUDY
DATA, other working capital
& references to work of others
FINDINGS
This last stage can take a very long time!
Temporal
Rot
• why should we publish our data?
• what data should be shared, when and how?
– What?
• The dataset(s) on which we based our research statements, or better still the datasets we
assembled
– When?: Start early … with documentation &
deposit (with embargo?)
– How?
• We are about to learn that first-hand
– with a little help from a friend in the Data Library
• maybe we might publish one of those new
Web-resident research statements
 Time to use Datashare …
The When & How
Jisc-funded DataShare Project: Edinburgh, LSE, Oxford, Southampton (DISC-UK)
from informal
storage and
sharing
to formal
institutional
arrangement
Side Note on Web-resident research objects
Web as dominant means to make & access scholarly statement
• The Web enables rich aggregations of linked content, with
data intrinsic to the statement
– research objects, composite digital objects, ‘multi-part works’
• As scholarly statement has become digital, it becomes
malleable & lacking in ‘fixity’
• Notions of fixity may conflict with demands for usability:
– a record of activity, and thus be immutable?
– made available with secondary analysis by a third party in mind?
• What should it be cited? Role of Linked Data?
• Need to avoid Reference Rot for this ‘rich content’
DataShare2
from formal
institutional
arrangement
formal publishing into
In Llinked) Data
infrastructure
① Reference Rot ② E-Journal Archiving
Study Investigation into status of
references in scholarly
statement to the web-at-large
Monitoring extent the scholarly
record is at risk of loss: who is
looking after e-journal content?
Project Hiberlink
Andrew Mellon Foundation
with Language Technology Group & the
Research Library at Los Alamos
National Laboratory
Keepers+
‘Unfunded’ (Jisc / UoEd)
in collaboration internationally with
archiving organisations & research libraries
http://thekeepers.blogs.edina.ac.uk
hiberlink.org thekeepers.org
Thank You!
edina@ed.ac.uk

More Related Content

What's hot

Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...EDINA, University of Edinburgh
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEDINA, University of Edinburgh
 
Data Library Services at the University of Edinburgh
Data Library Services at the University of EdinburghData Library Services at the University of Edinburgh
Data Library Services at the University of EdinburghRobin Rice
 
Introduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data AnalysisIntroduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data AnalysisEDINA, University of Edinburgh
 
Tales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly RecordTales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly RecordEDINA, University of Edinburgh
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...EDINA, University of Edinburgh
 
University of Edinburgh RDM Training: MANTRA & beyond
University of Edinburgh RDM Training: MANTRA & beyondUniversity of Edinburgh RDM Training: MANTRA & beyond
University of Edinburgh RDM Training: MANTRA & beyondRobin Rice
 
Research Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeResearch Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeEDINA, University of Edinburgh
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...EDINA, University of Edinburgh
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareRobin Rice
 
Digital Preservation Case Study: Community Action via UK LOCKSS Alliance
Digital Preservation Case Study: Community Action via UK LOCKSS AllianceDigital Preservation Case Study: Community Action via UK LOCKSS Alliance
Digital Preservation Case Study: Community Action via UK LOCKSS AllianceEDINA, University of Edinburgh
 
Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Robin Rice
 

What's hot (20)

What's So Special about the Social Sciences
What's So Special about the Social SciencesWhat's So Special about the Social Sciences
What's So Special about the Social Sciences
 
Research Data MANTRA Project at Edinburgh
Research Data MANTRA Project at EdinburghResearch Data MANTRA Project at Edinburgh
Research Data MANTRA Project at Edinburgh
 
Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly Resources
 
Roles & Skills for RDM
Roles & Skills for RDMRoles & Skills for RDM
Roles & Skills for RDM
 
Data Library Services at the University of Edinburgh
Data Library Services at the University of EdinburghData Library Services at the University of Edinburgh
Data Library Services at the University of Edinburgh
 
Introduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data AnalysisIntroduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data Analysis
 
Tales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly RecordTales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly Record
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
 
University of Edinburgh RDM Training: MANTRA & beyond
University of Edinburgh RDM Training: MANTRA & beyondUniversity of Edinburgh RDM Training: MANTRA & beyond
University of Edinburgh RDM Training: MANTRA & beyond
 
Research Data Management and Spatial Data
Research Data Management and Spatial DataResearch Data Management and Spatial Data
Research Data Management and Spatial Data
 
Research Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeResearch Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture Change
 
RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians? RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians?
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
Edinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for DataEdinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for Data
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
RDM Programme@Edinburgh
RDM Programme@EdinburghRDM Programme@Edinburgh
RDM Programme@Edinburgh
 
Digital Preservation Case Study: Community Action via UK LOCKSS Alliance
Digital Preservation Case Study: Community Action via UK LOCKSS AllianceDigital Preservation Case Study: Community Action via UK LOCKSS Alliance
Digital Preservation Case Study: Community Action via UK LOCKSS Alliance
 
Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 

Similar to Where data and journal content collide: what does it mean to ‘publish your data’?

HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyPRELIDA Project
 
Stronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementStronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementJisc
 
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]Peter Burnhill
 
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of ScholarshipEnsuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of ScholarshipEDINA, University of Edinburgh
 
Web Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentWeb Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentPeter Burnhill
 
Digital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesDigital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesAshley Sanders, Ph.D.
 
We need to solve more that just our access problems
We need to solve more that just our access problemsWe need to solve more that just our access problems
We need to solve more that just our access problemsBjörn Brembs
 
Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Paul Royster
 
Ensuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published HeritageEnsuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published HeritageEDINA, University of Edinburgh
 
Gtm2014 poster-palop-et-al
Gtm2014 poster-palop-et-alGtm2014 poster-palop-et-al
Gtm2014 poster-palop-et-alsfausto
 
Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...
Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...
Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...CTLes
 
Revitalizing the Library in the University Knowledge Community
Revitalizing the Library in the University Knowledge CommunityRevitalizing the Library in the University Knowledge Community
Revitalizing the Library in the University Knowledge CommunityKaren S Calhoun
 
Introduction to information literacy part 1
Introduction to information literacy part 1Introduction to information literacy part 1
Introduction to information literacy part 1mhayes2006
 
Presentation of LUCERO at EURECOM
Presentation of LUCERO at EURECOMPresentation of LUCERO at EURECOM
Presentation of LUCERO at EURECOMMathieu d'Aquin
 

Similar to Where data and journal content collide: what does it mean to ‘publish your data’? (20)

HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
 
Reference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and RemedyReference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and Remedy
 
Stronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementStronger together: community initiatives in journal management
Stronger together: community initiatives in journal management
 
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
 
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of ScholarshipEnsuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
 
Reference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and RemedyReference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and Remedy
 
Web Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentWeb Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web content
 
Digital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont CollegesDigital Scholarly Communication @Claremont Colleges
Digital Scholarly Communication @Claremont Colleges
 
We need to solve more that just our access problems
We need to solve more that just our access problemsWe need to solve more that just our access problems
We need to solve more that just our access problems
 
Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)
 
Ensuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published HeritageEnsuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published Heritage
 
Gtm2014 poster-palop-et-al
Gtm2014 poster-palop-et-alGtm2014 poster-palop-et-al
Gtm2014 poster-palop-et-al
 
Mapping dh through heterogeneous communicative practices
Mapping dh through heterogeneous communicative practicesMapping dh through heterogeneous communicative practices
Mapping dh through heterogeneous communicative practices
 
Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...
Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...
Floriane Muller, Pablo Iriarte, University of Geneva Library, Switzerland Mea...
 
Preserving Streams of Issued Content
Preserving Streams of Issued ContentPreserving Streams of Issued Content
Preserving Streams of Issued Content
 
Preserving the Integrity of the Scholarly Record
Preserving the Integrity of the Scholarly RecordPreserving the Integrity of the Scholarly Record
Preserving the Integrity of the Scholarly Record
 
Reference Rot: Threat and Remedy
Reference Rot: Threat and RemedyReference Rot: Threat and Remedy
Reference Rot: Threat and Remedy
 
Revitalizing the Library in the University Knowledge Community
Revitalizing the Library in the University Knowledge CommunityRevitalizing the Library in the University Knowledge Community
Revitalizing the Library in the University Knowledge Community
 
Introduction to information literacy part 1
Introduction to information literacy part 1Introduction to information literacy part 1
Introduction to information literacy part 1
 
Presentation of LUCERO at EURECOM
Presentation of LUCERO at EURECOMPresentation of LUCERO at EURECOM
Presentation of LUCERO at EURECOM
 

More from EDINA, University of Edinburgh

We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?EDINA, University of Edinburgh
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...EDINA, University of Edinburgh
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...EDINA, University of Edinburgh
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...EDINA, University of Edinburgh
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...EDINA, University of Edinburgh
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEDINA, University of Edinburgh
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneEDINA, University of Edinburgh
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneEDINA, University of Edinburgh
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesEDINA, University of Edinburgh
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...EDINA, University of Edinburgh
 

More from EDINA, University of Edinburgh (20)

The Making of the English Landscape:
The Making of the English Landscape: The Making of the English Landscape:
The Making of the English Landscape:
 
Spatial Data, Spatial Humanities
Spatial Data, Spatial HumanitiesSpatial Data, Spatial Humanities
Spatial Data, Spatial Humanities
 
Land Cover Map 2015
Land Cover Map 2015Land Cover Map 2015
Land Cover Map 2015
 
We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
 
GeoForum EDINA report 2017
GeoForum EDINA report 2017GeoForum EDINA report 2017
GeoForum EDINA report 2017
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
 
Moray housemarch2017
Moray housemarch2017Moray housemarch2017
Moray housemarch2017
 
Uniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondaryUniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondary
 
Uniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondaryUniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondary
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola Osborne
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison service
 
Big data in Digimap
Big data in DigimapBig data in Digimap
Big data in Digimap
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data services
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
 
Digimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarvaDigimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarva
 

Recently uploaded

Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作ys8omjxb
 
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja VipCall Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja VipCall Girls Lucknow
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Sonam Pathan
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一Fs
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一Fs
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一Fs
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMartaLoveguard
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Excelmac1
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationLinaWolf1
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Paul Calvano
 
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Dana Luther
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012rehmti665
 
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Sonam Pathan
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITMgdsc13
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)Christopher H Felton
 

Recently uploaded (20)

Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
 
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja VipCall Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptx
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 Documentation
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24
 
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
 
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITM
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
 

Where data and journal content collide: what does it mean to ‘publish your data’?

  • 1. Where data and journal content collide what does it mean to ‘publish your data’? Peter Burnhill, EDINA, Information Services University of Edinburgh 09:40 – 10:00 #ReCon_15 : Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
  • 2. Overview Time-served data person reverts to being a PI-cum- researcher, & having to ask: What data should be shared, when and how? 1. Propose 3 categories of data A: Databases used [how the data came about] B: The assembled Datasets [what I analysed] C: Data behind the graph [what is part of my statement] 2. Report on 2 case studies to illustrate this • Each of relevance to scholarly communication
  • 3. 1. Scottish Education Data Archive, 1979 - mid ‘80s – Survey statistician: school leavers, YTS & 16-19 cohort surveys • In Centre for Educational Sociology 2. Edinburgh University Data Library,1984 & on – Manager: set-up and development – President of IASSIST, 2000 – 2004 : social science data professionals 3. Graduate School, Faculty of Social Science, 1987 – 1997 – Senior Lecturer, teaching quantitative/survey methods • In Research Centre for Social Sciences 4. ESRC Regional Research Laboratory for Scotland, 1986/90 – Co-director: early days of Geographical Information Systems (GIS) • With University’s Department of Geography; Honorary Fellow, Royal Scottish Geographical Society, 2015 5. EDINA, 1995/6 to present - main focus as day job – Director: set-up and continuous development – Jisc-designated centre for service delivery & digital expertise 6. Digital Curation Centre, 2004/05 – Director for set-up & definition of ‘data curation + digital preservation’ • With University’s School of Informatics a time-served data person (at U of Ed)
  • 4. Two ‘case studies’ to illustrate ① Project funded by Andrew Mellon Foundation • No mandate on data deposit but encourage OA for tools/application developed as part of the project ② ‘Ongoing project’: statistical statement using data from operation of two Jisc services • with no direct mandate (& could have passed undetected) Both case studies have findings about threats to the integrity of the scholarly record.
  • 5. ① Reference Rot ② E-Journal Archiving Study Measure the extent of what we now call Reference Rot = Link Rot + Content Drift • Identify intervention opportunities to stop the rot • Devise sustainable solutions with maximal reach Project Hiberlink Andrew Mellon Foundation EDINA & Language Technology Group, School of Informatics (Claire Grover & colleagues ) jointly with the Research Library, Los Alamos National Laboratory (Herbert Van de Sompel & colleagues). hiberlink.org
  • 7. + Content Drift: What is at end of URI has changed, or gone! http://dl00.org 2000 http://dl00.org 2004 http://dl00.org 2005 http://dl00.org 2008 (a) Dynamic content as values on webpage changes over time (b) Static content but very different (often unrelated) web pages
  • 8. ① Reference Rot ② E-Journal Archiving Study status of references to the web-at-large (in e-theses) Project Hiberlink Findings Empirical statements Made as: i) WORK-IN-PROGRESS in preparation for ii) PUBLICATION Analysis of of 7,000 e-theses revealed that Reference Rot occurs in over 36% of the embedded URIs Routine web archiving delivers less than a 50:50 chance that content is being kept safe circa 1 in 5 of referenced content is probably lost for ever + Use of 3 very large corpus of journal articles demonstrated very significant reference rot => ‘rotten articles for sale’ ‘
  • 9. Scholarly Articles increasingly link to Web Resources, not just back to other Articles
  • 10. Findings: Status of Referenced URIs, PMC corpus Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253 http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253 6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today), Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
  • 11. Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253 http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253 Findings: Status of Referenced URIs, Elsevier corpus 6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today), Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
  • 12. Remedy: Create Snapshots of Referenced Resources Snapshots can be created at various stages. The closer to the moment of referencing, the better the image captured. Stage Actor Snapshot Quality Preparation Author/reference tool best Submission /Issue Editor/manuscript system good Access (post-publication) Aggregator/ publisher platform so-so Shelving Librarian/IR, journal archive better than nothing
  • 13. Prototypes of pro-active approaches to support the archiving of web references for scholarly communications Richard Wincewicz1, Peter Burnhill1 & Herbert Van de Sompel2 1EDINA, University of Edinburgh, 2Los Alamos National Laboratory http://hiberlink.org #hiberlink
  • 14. Authoring - Zotero Plugin Demonstrator Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero for pro-active archiving and temporal references https://www.youtube.com/v/ZYmi_Ydr65M%26vq
  • 15. ① Reference Rot ②E-Journal Archiving Study Extent to which scholarly record is at risk of loss: who is looking after your e-journal content? Project] Keepers+ ‘Unfunded’ (Jisc / UoEd) EDINA in collaboration internationally with archiving organisations & research libraries thekeepers.org http://thekeepers.blogs.edina.ac.uk
  • 16. That Article in the Scholarly Record is not in the custody of Libraries, nor yet on their digital shelves. Picture credit: http://somanybooksblog.com/2009/03/27/library-tour/
  • 17. thekeepers.org as Global Monitor … to discover who is looking after what
  • 18. ① Reference Rot ② E-Journal Archiving Study status of references to the web-at-large in e- theses. scholarly record at risk of loss: who is looking after e-journal content? Project Hiberlink Keepers+ Key Findings Empirical statements Made as: i) WORK-IN-PROGRESS in preparation for ii) PUBLICATION Two thirds (68%) of what was consulted online (108 UK universities) in 2012 is at risk of loss. Missing Volumes & Issues Only 22% to 28% of Title Lists of 3 US research libraries (Columbia, Cornell & Duke) were being archived when checked in 2011/12 We need to update these findings annually  Libraries don’t have e-collections of serials (only e-connections)  So we all need to know what scholarly content is being kept safe somewhere!
  • 19. Two Key Statistics ‘Ingest Ratio’ = titles ingested by one or more Keeper / ‘online serials’ in ISSN Register = 28,103 / 165,949 [as of June 2015] => 17% ‘KeepSafe Ratio’ = titles being ingested by 3+ Keepers / ‘online serials’ in ISSN Register = 9,836 / 165,949 => 6%
  • 20. with usage logs for the UK OpenURL Router* • 8.5m full text requests in UK during 2012 => 53,311 online titles requested Analysis in 2013: ‘Ingest Ratio’ = 32% (16,985/53,311) => over two thirds 68% (36,326 titles) held by none! Archival Status of e-Serials Requested * As reported in Keepers Registry Blog, OpenURL Router passes ‘discovery’ requests to commercial OpenURL resolver services; developed & delivered by EDINA as part of Jisc support for UK universities & colleges
  • 21. with usage logs for the UK OpenURL Router* • 8.5m full text requests in UK during 2012  53,311 online titles requested Analysis carried out again in 2015: ‘Ingest Ratio’ = 36% (19,231/53,311) ; up by 2,246 (4%) => but still, 64% (34,080 titles) held by none! ‘KeepSafe Ratio’ = 20% (10,847/53,311) ; up by 2,985 (5%) Archival Status of Requested e-Serials: Update
  • 22. Archival Status of Online Continuing Resources assigned ISSN, by Country, June 2015
  • 23. very many ‘at risk’ e-journals from many small publishers BIG publishers act early but incompletely Priority: find economic way to archive content from …
  • 24. Cannot ignore the focus on Publication re-visiting an article now being cited again: On measuring the relation between social science research activity and research publication. Research Evaluation 4.3 130-152 doi: 10.1093/rev/4.3.130 P. Burnhill & M. Tubby-Hille (1994) & What the Funder sees
  • 25. STUDY DATA, other working capital & references to work of others FINDINGS Taken from: Figure 1 in P. Burnhill & M. Tubby-Hille (1994) On measuring the relation between social science research activity and research publication. Research Evaluation 4.3 130-152. doi: 10.1093/rev/4.3.130
  • 26. Study / Project / Data / Findings / Publication STUDY/ Activity [Purpose] Large-scale experiment / Exploratory investigation PROJECT [Grant] FunderRef ; GrantID Databases consulted / used Source / Origination Using extant databases (Generating new data) Dataset(s) Assembled & Analysed Extracted data ; derived variables; multiple versions FINDINGS i) Work-in-progress ii) PUBLICATION Empirical Statement(s) i) Presentations etc ii) Formal report of the results of research DATA as results to be shared? DATA as working capital
  • 27. Study / Project / Data / Findings / Publication Study Large-scale experiment / Exploratory investigation Project Data Source / Origination ‘database(s)’ Using extant databases (Generating new data) Who has custody of new data? ‘Assembled datasets’ ’Dataset(s)’ Analysed Extracted data; derived variables; multiple versions ‘Data behind the graph’ Supplementary data which enhance the publication of the results reported. Do publishers want to hand responsibility to subject & institutional repositories? Key Findings i) Work-in-progress ii) Publication Empirical Statement(s) What Data should be shared? DataType C DataType B DataType A
  • 28. Study / Project / Data / Findings / Publication Study Project Data Source / Origination ‘database(s)’ External to Project Generating new data Using extant databases Assembled Datasets ’Dataset(s)’ Analysed Product of Project multiple versions ‘Data behind the graph’ Supplementary data Key Findings i) Work-in-progress ii) Publication Empirical Statement(s) DataType C: Should be made available & preserved as multi- part work But do publishers want the responsibility; role of subject & institutional repositories? DataType B: Choices: which of these exactly? For your future use? For others? Required for reproducibility? DataType A: These sources should be cited But when are preservation & ‘continuity of access’ proper tasks for the University?
  • 29. Study / Project / Data / Findings / Publication ① Reference Rot Study ② E-Journal Archiving Study status of references to the web-at-large [in e-theses] scholarly record at risk of loss: who is looking after e-journal content? Project Hiberlink Keepers+ ‘database(s)’ Data Source / Origination DataType A External to Project • Full text of c.7,500 doctoral theses, as downloaded from 5 university repositories • Networked Digital Library of Theses and Dissertations metadata •Logs of requests from UK universities (c.10m pa) via Jisc OpenURL Router • Aggregation of archival actions’ for online serials via the Keepers Registry ‘Assembled datasets’ ’Dataset(s)’ Analysed ‘Data behind the graph’
  • 30. Study / Project / Data = Findings / Publication ① Reference Rot Study ② E-Journal Archiving Study status of references to the web-at- large (in e-theses) scholarly record at risk of loss: who is looking after e-journal content? Project Hiberlink Keepers+ ‘database(s)’ Data Source / Origination DataType A • Full text of c.7,500 doctoral theses, as downloaded from 5 university repositories • Networked Digital Library of Theses and Dissertations metadata •Logs of requests from UK universities (c.10m pa) via Jisc OpenURL Router • Aggregation of archival actions’ for online serials via the Keepers Registry Datasets Assembled Dataset(s) Analysed DataType B Product of Project c.46,000 URIs extracted from 7,000 eTheses & 3 other very large corpus tested for status, recording live/not, archived/not & other attributes c.53,000 online serial titles cross checked against the reports in Keepers Registry * This could be the first of a regular (annual) series of datasets recording what is being archived and what is not
  • 31. • why should we publish our data? • what data should be shared, when and how? & what about the new Web-resident research statements?
  • 32. Data as scholarship: a cultural shift? Preserve or Perish “You are not finished until you have done the research, published the results, and published the data, receiving formal credit for everything.” Mark A. Parsons (2006) International Polar Year “A scholar’s positive contribution is measured by the sum of the original data that he contributes. Hypotheses come and go but data remain.” in Advice to a Young Investigator (1897) Santiago Ramón y Cajal (Nobel Prize winner, 1906)
  • 33. A more practical set of questions? • why should we publish our data? • what data should be shared, when & how?
  • 34. The What • why should we publish our data? • what data should be shared, when and how? DataType B: Data = Findings • The dataset(s) on which we based our research statements, or … • The dataset(s) that were assembled, upon which others can base their research
  • 35. STUDY DATA, other working capital & references to work of others FINDINGS Taken from: Figure 1 in P. Burnhill & M. Tubby-Hille (1994) On measuring the relation between social science research activity and research publication. Research Evaluation 4.3 130-152. doi: 10.1093/rev/4.3.130 DATA as FINDINGS
  • 37. STUDY DATA, other working capital & references to work of others FINDINGS When Findings are reported in Publications?
  • 38. STUDY DATA, other working capital & references to work of others FINDINGS This last stage can take a very long time! Temporal Rot
  • 39. • why should we publish our data? • what data should be shared, when and how? – What? • The dataset(s) on which we based our research statements, or better still the datasets we assembled – When?: Start early … with documentation & deposit (with embargo?) – How? • We are about to learn that first-hand – with a little help from a friend in the Data Library • maybe we might publish one of those new Web-resident research statements  Time to use Datashare … The When & How
  • 40. Jisc-funded DataShare Project: Edinburgh, LSE, Oxford, Southampton (DISC-UK) from informal storage and sharing to formal institutional arrangement
  • 41. Side Note on Web-resident research objects Web as dominant means to make & access scholarly statement • The Web enables rich aggregations of linked content, with data intrinsic to the statement – research objects, composite digital objects, ‘multi-part works’ • As scholarly statement has become digital, it becomes malleable & lacking in ‘fixity’ • Notions of fixity may conflict with demands for usability: – a record of activity, and thus be immutable? – made available with secondary analysis by a third party in mind? • What should it be cited? Role of Linked Data? • Need to avoid Reference Rot for this ‘rich content’
  • 43. ① Reference Rot ② E-Journal Archiving Study Investigation into status of references in scholarly statement to the web-at-large Monitoring extent the scholarly record is at risk of loss: who is looking after e-journal content? Project Hiberlink Andrew Mellon Foundation with Language Technology Group & the Research Library at Los Alamos National Laboratory Keepers+ ‘Unfunded’ (Jisc / UoEd) in collaboration internationally with archiving organisations & research libraries http://thekeepers.blogs.edina.ac.uk hiberlink.org thekeepers.org Thank You! edina@ed.ac.uk