Rachael Lammey
Product Manager, CrossRef
28 October 2014
Not-for-profit association of scholarly publishers
All subjects, all business models
4,000+ organizations from all over the world
83 non-publisher affiliates, 2000 library affiliates
68 million content items
10.1098/rstl.1665.0001
User clicks on CrossRef DOI reference link in Journal A:
Tani, N., N. Tomaru, M. Araki, AND K. Ohba. 1996. Genetic diversity and
differentiation in populations of Japanese stone pine (Pinus pumila) in
Japan. Canadian Journal of Forest Research 26: 1454–1462. [CrossRef]
DOI directory returns URL
User accesses cited article in Journal B
90,000,000
Services
• Cross-publisher reference linking
• Cross-publisher Cited-by linking
• Cross-publisher metadata feeds
• Cross-publisher plagiarism screening (powered by iThenticate)
• Cross-publisher update identification
• Cross-publisher funder identification
• Cross-publisher text and data mining
A Text and Data Mining Hub for Researchers
What is text and data mining?
Text Mining is an interdisciplinary field combining
techniques from linguistics, computer science and
statistics to build tools that can efficiently retrieve
and extract information from digital text.
http://blogs.plos.org/everyone/2013/04/17/announcing-the-plos-text-mining-collection/
It uses powerful computers to find links between
drugs and side effects, or genes and diseases, that
are hidden within the vast scientific literature.
These are discoveries that a person scouring
through papers one by one may never notice.
http://www.theguardian.com/science/2012/may/23/text-mining-research-tool-forbidden
http://www.jisc.ac.uk/media/documents/publications/textminingbp_rtf.rtf
Marc Weeber and colleagues used automated text mining tools to infer that the
drug thalidomide could treat several diseases it had not been associated with
before. Thalidomide was taken off the market 40 years ago, but is still the subject of
research because it seems to benefit leprosy patients via their immune systems.
Weeber and Grietje Molema, an immunologist, used text mining tools to search the
literature for papers on thalidomide and then pick out those containing concepts
related to immunology. One concept, concerning thalidomide’s ability to inhibit
Interleukin-12 (IL-12), a chemical involved in the launch of an immune response,
struck Molema as particularly interesting. A second automated search for diseases
that improve when the action of IL-12 is blocked, revealed several not previously
linked with thalidomide, including chronic hepatitis, myasthenia gravis and a type of
gastritis.
“Type in thalidomide and you get 2-3000 hits. Type in disease and you get 40,000
hits. With automated text mining tools we only had to read 100-200 abstracts and
20 or 30 full papers. We’ve created hypotheses for others to follow up” says
Weeber.
Weeber et al. J Am Med Inform Assoc. 2003; 10: 252-259
http://www.forbes.com/sites/stevensalzberg/2014/03/23/why-google-flu-is-a-failure/
Why?
• Researchers find it impractical to negotiate multiple
bilateral agreements with hundreds of subscription-
based publishers in order to authorize TDM of
subscribed content.
• Subscription-based publishers find it impractical to
negotiate multiple bilateral agreements with thousands
of researchers and institutions in order to authorize TDM
of subscribed content.
• All parties would benefit from support of standard APIs
and data representations in order to enable TDM across
both open access and subscription-based publishers.
* Chinese Geoscience Union * Chinese Institute Of
Automation Engineers (Ciae) * Chinese Journal Of
Mechanical Engineering * Chinese Mathematical Society *
Chinese Physical Society * Chinese Physiological Society *
Chinese Society Of Theoretical And Applied Mechanics *
Chonnam National University Medical School (Kamje) *
Christ University Bangalore * Cic Edizioni Internazionali *
Cig Media Group * Cilip Information Literacy Group *
Civil-Comp, Ltd. * Claremont Colleges Library * Classical
Association Of The Middle West And South, Inc. (Camws)
* Clawar Association Limited * Clay Minerals Society *
Cleo Revues.Org * Cleveland Clinic Journal Of Medicine *
Clinical Autonomic Research Society * Clinical Laboratory
Publications * Clinics Cardive Publishing * Clockss Archive
* Cnps * Cnrs France * Cnu Journal Of Agricultural Science
Why use the DOI?
Using the DOI as the basis for a common text and data mining
API provides several benefits. For example, the DOI provides:
• An easy way to de-duplicate documents that may be found on
several sites.
• Persistent provenance information.
• An easy way to document, share and compare corpora without
having to exchange the actual documents.
• A mechanism to ensure the reproducibility of TDM results using
the source documents.
• A mechanism to track the impact of updates, corrections,
retractions and withdrawals on corpora.
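The de-duplication benefit can be illustrated with a short Python sketch. It assumes a corpus is held as (DOI, location) pairs; the prefix list, the function names and the repository URL are illustrative assumptions, not part of any CrossRef API.

```python
from urllib.parse import unquote

# Forms a DOI is commonly written in; this list is an assumption for illustration.
_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(raw: str) -> str:
    """Reduce a DOI string or DOI URL to a bare, lower-cased DOI."""
    doi = unquote(raw.strip())
    lowered = doi.lower()
    for prefix in _PREFIXES:
        if lowered.startswith(prefix):
            doi = doi[len(prefix):]
            break
    # DOIs are case-insensitive, so lower-casing gives a stable key.
    return doi.lower()


def dedupe_by_doi(documents):
    """Keep the first location seen for each DOI.

    `documents` is an iterable of (doi, location) pairs.
    """
    seen = {}
    for doi, location in documents:
        seen.setdefault(normalize_doi(doi), location)
    return seen


# The same article found on two sites, plus one other article.
corpus = [
    ("http://dx.doi.org/10.1155/2014/969265", "http://www.hindawi.com/journals/aaa/2014/969265/"),
    ("doi:10.1155/2014/969265", "http://repository.example.org/copy-of-969265"),
    ("10.5555/12345678", "http://example.org/article"),
]
unique = dedupe_by_doi(corpus)
```

Because the key is the DOI rather than the document itself, two researchers can also compare corpora by exchanging only the list of keys.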
The TDM Workflow
Researchers: Common API
DOI Content Negotiation
http://dx.doi.org/10.5555/12345678
(Accept: text/html)
http://dx.doi.org/10.5555/12345678
(Accept: application/bibjson+json)
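As a rough sketch of content negotiation, the Python below builds two requests for the same DOI URL that differ only in their Accept header. The helper names are hypothetical, the media types are the ones shown on the slide, and `fetch` is defined but not called, so nothing here touches the network.

```python
import urllib.request
from urllib.parse import quote

RESOLVER = "http://dx.doi.org/"  # resolver shown on the slide


def build_request(doi: str, accept: str) -> urllib.request.Request:
    """Build a request asking the DOI resolver for one representation."""
    return urllib.request.Request(
        RESOLVER + quote(doi, safe="/"),
        headers={"Accept": accept},
    )


def fetch(doi: str, accept: str, timeout: float = 30.0) -> bytes:
    """Send the request and return the response body (follows redirects)."""
    with urllib.request.urlopen(build_request(doi, accept), timeout=timeout) as response:
        return response.read()


# Same DOI, two representations.
html_request = build_request("10.5555/12345678", "text/html")
json_request = build_request("10.5555/12345678", "application/bibjson+json")
```

Which media types a given DOI actually supports depends on the registration agency and the publisher, so a real tool should be prepared for a response in a different format or an error status.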
Rate Limiting (Optional)
CrossRef TDM HTTP Headers
CR-TDM-Rate-Limit: 1500
(the rate limit ceiling per window on requests)
CR-TDM-Rate-Limit-Remaining: 1387
(number of requests left for the current window)
CR-TDM-Rate-Limit-Reset: 1378072800
(the time, in UTC epoch seconds, at which the rate limit
resets and a new window is started)
*this is a technique used by many APIs, including Twitter’s
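A mining client might honour these headers along the following lines. This is a minimal Python sketch: the function name is hypothetical, and it assumes the Reset value is an absolute UTC epoch timestamp, as the example value suggests. Since the headers are optional, missing headers are treated as "no limit known".

```python
import time


def seconds_to_wait(headers, now=None):
    """Return how many seconds to pause before sending the next request.

    `headers` is any mapping of response header names to string values.
    """
    now = time.time() if now is None else now
    # Defaults mean "keep going" when a publisher sends no rate-limit headers.
    remaining = int(headers.get("CR-TDM-Rate-Limit-Remaining", 1))
    reset_at = int(headers.get("CR-TDM-Rate-Limit-Reset", 0))
    if remaining > 0:
        return 0.0
    # Window exhausted: wait until the reset time, never a negative amount.
    return max(0.0, reset_at - now)


# Values from the slide: requests are still available, so no wait is needed.
slide_headers = {
    "CR-TDM-Rate-Limit": "1500",
    "CR-TDM-Rate-Limit-Remaining": "1387",
    "CR-TDM-Rate-Limit-Reset": "1378072800",
}
wait = seconds_to_wait(slide_headers, now=1378072000)
```

A plain dict lookup is case-sensitive, whereas HTTP header names are not; the header object returned by `urllib` responses performs case-insensitive lookups, so it can be passed in directly.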
Common API Summary
• Content Negotiation (Required)
• New Metadata (Required)
• Full text URIs
• License URIs
• Rate Limiting Headers (optional)
New Metadata
1. Full Text Link
https://apps.crossref.org/docs/tdm/full-text-uris-technical-details/
2. License Information
https://apps.crossref.org/docs/tdm/license-uris-technical-details/
Example from Hindawi
<ai:program name="AccessIndicators">
  <ai:license_ref>http://creativecommons.org/licenses/by/3.0/</ai:license_ref>
</ai:program>
<doi_data>
  <doi>10.1155/2014/969265</doi>
  <timestamp>20140401090031</timestamp>
  <resource>http://www.hindawi.com/journals/aaa/2014/969265/</resource>
  <collection property="text-mining">
    <item>
      <resource mime_type="application/pdf">
        http://downloads.hindawi.com/journals/aaa/2014/969265.pdf
      </resource>
    </item>
    <item>
      <resource mime_type="application/xml">
        http://downloads.hindawi.com/journals/aaa/2014/969265.xml
      </resource>
    </item>
  </collection>
</doi_data>
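To show how a tool could read these two metadata elements, here is a Python sketch that parses a fragment shaped like the Hindawi example. The `<record>` wrapper is hypothetical and the namespace URI bound to `ai` is an assumption; real deposits also declare a schema namespace on the other elements, which this simplified fragment omits.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI for the AccessIndicators elements.
AI_NS = "http://www.crossref.org/AccessIndicators.xsd"

# Hypothetical <record> wrapper around the fragment from the slide.
SAMPLE = f"""<record xmlns:ai="{AI_NS}">
  <ai:program name="AccessIndicators">
    <ai:license_ref>http://creativecommons.org/licenses/by/3.0/</ai:license_ref>
  </ai:program>
  <doi_data>
    <doi>10.1155/2014/969265</doi>
    <resource>http://www.hindawi.com/journals/aaa/2014/969265/</resource>
    <collection property="text-mining">
      <item>
        <resource mime_type="application/pdf">
          http://downloads.hindawi.com/journals/aaa/2014/969265.pdf
        </resource>
      </item>
      <item>
        <resource mime_type="application/xml">
          http://downloads.hindawi.com/journals/aaa/2014/969265.xml
        </resource>
      </item>
    </collection>
  </doi_data>
</record>"""


def extract_tdm_metadata(xml_text):
    """Return the DOI, license URIs and full-text links keyed by MIME type."""
    root = ET.fromstring(xml_text)
    licenses = [el.text.strip() for el in root.iter(f"{{{AI_NS}}}license_ref")]
    full_text = {}
    for collection in root.iter("collection"):
        # Only the text-mining collection carries the full-text links.
        if collection.get("property") != "text-mining":
            continue
        for resource in collection.iter("resource"):
            full_text[resource.get("mime_type")] = resource.text.strip()
    return {
        "doi": root.findtext("doi_data/doi"),
        "licenses": licenses,
        "full_text": full_text,
    }


metadata = extract_tdm_metadata(SAMPLE)
```

Keying the links by MIME type lets a tool prefer the XML representation when one is deposited and fall back to PDF otherwise.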
Stop here if
• You are an open access publisher
• You include TDM as a part of
your subscription license/T&Cs.
Click-Through Service (Optional)
Extended TDM Workflow
Researcher view:
• Researcher queries DOI using content negotiation (CN) + API token
Publisher view:
• Publisher verifies API token
• If token verified AND access control allows, publisher returns full text
(frequency at publisher discretion)
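Both sides of the extended workflow can be sketched in Python. The slides do not say how the token is transmitted: the header name below is an assumption, as are the function names and the HTTP status codes chosen for the refusal cases.

```python
import urllib.request

# Assumed header name for the click-through API token; not given on the slide.
TOKEN_HEADER = "CR-Clickthrough-Client-Token"


def build_tdm_request(full_text_url: str, api_token: str) -> urllib.request.Request:
    """Researcher view: ask for full text, identifying the client by its token."""
    return urllib.request.Request(full_text_url, headers={TOKEN_HEADER: api_token})


def publisher_decision(api_token, doi, verified_tokens, accessible_dois) -> int:
    """Publisher view: return an HTTP-style status for a full-text request.

    `verified_tokens` stands in for a token check against the click-through
    service; `accessible_dois` stands in for the publisher's access control.
    """
    if api_token not in verified_tokens:
        return 401  # token not recognized
    if doi not in accessible_dois:
        return 403  # token fine, but no access to this item
    return 200  # token verified AND access control allows: return full text


request = build_tdm_request(
    "http://downloads.hindawi.com/journals/aaa/2014/969265.xml",
    "example-token",
)
status = publisher_decision(
    "example-token",
    "10.1155/2014/969265",
    verified_tokens={"example-token"},
    accessible_dois={"10.1155/2014/969265"},
)
```

Keeping the token check separate from the access-control check mirrors the two conditions on the slide: a verified token alone does not grant access to content the researcher's institution does not subscribe to.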
Benefits
• Streamlines researcher access to distributed
full text for TDM
• Enables machine-to-machine, automated
access for recognized TDM (i.e. researchers won’t be
locked out of publisher sites)
• Enables article-level licensing info and easy
mechanism for supplemental T&Cs for text
and data mining (publishers discussing
model license via STM)
What do researchers, publishers and tools developers need to do?
Publishers
There are two additional metadata elements that publishers will
need to deposit to support TDM via CrossRef. These are:
•Full Text URIs: One or more URIs that point to full text
representations of the content identified by your CrossRef DOIs.
•License URIs: One or more URIs pointing at licenses that govern
how the full text content can be used.
•OPTIONAL: Add publisher TDM terms and conditions to the
click-through service
Researchers
• Modify TDM tools to make use of the API token
• Modify TDM tools to look for <license_ref>
elements
• Register with the click-through service and
accept/decline licenses (if applicable)
http://tdmsupport.crossref.org/
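On the researcher side, looking for license elements usually ends in a yes/no decision about whether an item may be mined. A minimal Python sketch, assuming the tool keeps its own allowlist of accepted license URIs (the allowlist contents and function names are illustrative):

```python
# Licenses this hypothetical project has reviewed and accepted.
ACCEPTED_LICENSES = {
    "http://creativecommons.org/licenses/by/3.0/",
}


def normalize(uri: str) -> str:
    """Compare license URIs without regard to case or a trailing slash."""
    return uri.strip().rstrip("/").lower()


def may_mine(license_uris, accepted=ACCEPTED_LICENSES) -> bool:
    """True if at least one license on the item is on the allowlist."""
    accepted_normalized = {normalize(uri) for uri in accepted}
    return any(normalize(uri) in accepted_normalized for uri in license_uris)


allowed = may_mine(["http://creativecommons.org/licenses/by/3.0"])
refused = may_mine(["http://publisher.example.org/tdm-terms"])
```

An item with no license information is refused by default here, which is the conservative choice; licenses accepted through the click-through service could be added to the allowlist as they are agreed.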
Progress to date
• DOI content negotiation
• CrossRef support for recording links to full text
• CrossRef metadata support for:
• ORCIDs
• FundRef
• License information
• CrossRef Metadata Search for Discovery:
http://search.labs.crossref.org/
• Click-through license service
• Publisher API for verifying and managing tokens
• Launched as live service 29th May 2014
Publishers
Articles with full-text links and license information deposited:
998,416
Cost? Free to researchers and the public
No cost for publishers through 2014, 2015 tbc
Register interest at:
http://www.crossref.org/tdm/contact_form.html
Usable as is:
https://blogs.nd.edu/emorgan/
www.crossref.org
http://www.crossref.org/tdm/index.html
tdm@crossref.org

Más contenido relacionado

La actualidad más candente

Crossref Metadata and Metadata Services
Crossref Metadata and Metadata ServicesCrossref Metadata and Metadata Services
Crossref Metadata and Metadata ServicesCrossref
 
Concepts on some benificial research tools
Concepts on some benificial research toolsConcepts on some benificial research tools
Concepts on some benificial research toolsمحمد الرشاح
 
Introduction to Crossref: History, Mission, Members
Introduction to Crossref: History, Mission, MembersIntroduction to Crossref: History, Mission, Members
Introduction to Crossref: History, Mission, MembersCrossref
 
Citation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online LiteratureCitation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online LiteratureBalachandar Radhakrishnan
 
Barcelona 2014: An Introduction to CrossRef by Carol Meyer
Barcelona 2014: An Introduction to CrossRef by Carol MeyerBarcelona 2014: An Introduction to CrossRef by Carol Meyer
Barcelona 2014: An Introduction to CrossRef by Carol MeyerCrossref
 
Understanding Crossref Metadata
Understanding Crossref MetadataUnderstanding Crossref Metadata
Understanding Crossref MetadataCrossref
 
Introduction to CrossRef Basics Webinar
Introduction to CrossRef Basics WebinarIntroduction to CrossRef Basics Webinar
Introduction to CrossRef Basics WebinarCrossref
 
Barcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck KoscherBarcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck KoscherCrossref
 
Update on Crossref Services - Rachael Lammey
Update on Crossref Services - Rachael LammeyUpdate on Crossref Services - Rachael Lammey
Update on Crossref Services - Rachael LammeyCrossref
 
The Crossref/ORCID Auto-Update: all you need to know
The Crossref/ORCID Auto-Update: all you need to knowThe Crossref/ORCID Auto-Update: all you need to know
The Crossref/ORCID Auto-Update: all you need to knowCrossref
 
crossmark update
crossmark updatecrossmark update
crossmark updateCrossref
 
ORCID: An Overview - Alice Meadows
ORCID: An Overview - Alice MeadowsORCID: An Overview - Alice Meadows
ORCID: An Overview - Alice MeadowsCrossref
 
Collecting and Using Funding Data Crossref
Collecting and Using Funding Data CrossrefCollecting and Using Funding Data Crossref
Collecting and Using Funding Data CrossrefRelawan Jurnal Indonesia
 
FundRef Webinar
FundRef WebinarFundRef Webinar
FundRef WebinarCrossref
 
Multiple Resolution and handling content available in multiple places
Multiple Resolution and handling content available in multiple placesMultiple Resolution and handling content available in multiple places
Multiple Resolution and handling content available in multiple placesCrossref
 
Who is using your metadata - Ginny Hendricks
Who is using your metadata - Ginny HendricksWho is using your metadata - Ginny Hendricks
Who is using your metadata - Ginny HendricksCrossref
 
cited by how-to
cited by how-tocited by how-to
cited by how-toCrossref
 
3. Crossref and PLOS, a publisher perspective
3. Crossref and PLOS, a publisher perspective3. Crossref and PLOS, a publisher perspective
3. Crossref and PLOS, a publisher perspectiveCrossref
 

La actualidad más candente (20)

Crossref Metadata and Metadata Services
Crossref Metadata and Metadata ServicesCrossref Metadata and Metadata Services
Crossref Metadata and Metadata Services
 
Concepts on some benificial research tools
Concepts on some benificial research toolsConcepts on some benificial research tools
Concepts on some benificial research tools
 
Introduction to Crossref: History, Mission, Members
Introduction to Crossref: History, Mission, MembersIntroduction to Crossref: History, Mission, Members
Introduction to Crossref: History, Mission, Members
 
Citation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online LiteratureCitation Analysis for the Free, Online Literature
Citation Analysis for the Free, Online Literature
 
Barcelona 2014: An Introduction to CrossRef by Carol Meyer
Barcelona 2014: An Introduction to CrossRef by Carol MeyerBarcelona 2014: An Introduction to CrossRef by Carol Meyer
Barcelona 2014: An Introduction to CrossRef by Carol Meyer
 
Understanding Crossref Metadata
Understanding Crossref MetadataUnderstanding Crossref Metadata
Understanding Crossref Metadata
 
Introduction to CrossRef Basics Webinar
Introduction to CrossRef Basics WebinarIntroduction to CrossRef Basics Webinar
Introduction to CrossRef Basics Webinar
 
Barcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck KoscherBarcelona 2014: CrossRef System and Support Update by Chuck Koscher
Barcelona 2014: CrossRef System and Support Update by Chuck Koscher
 
Update on Crossref Services - Rachael Lammey
Update on Crossref Services - Rachael LammeyUpdate on Crossref Services - Rachael Lammey
Update on Crossref Services - Rachael Lammey
 
The Crossref/ORCID Auto-Update: all you need to know
The Crossref/ORCID Auto-Update: all you need to knowThe Crossref/ORCID Auto-Update: all you need to know
The Crossref/ORCID Auto-Update: all you need to know
 
CEK KEMIRIPAN PADA CROSSREF
CEK KEMIRIPAN PADA CROSSREFCEK KEMIRIPAN PADA CROSSREF
CEK KEMIRIPAN PADA CROSSREF
 
crossmark update
crossmark updatecrossmark update
crossmark update
 
ORCID: An Overview - Alice Meadows
ORCID: An Overview - Alice MeadowsORCID: An Overview - Alice Meadows
ORCID: An Overview - Alice Meadows
 
Collecting and Using Funding Data Crossref
Collecting and Using Funding Data CrossrefCollecting and Using Funding Data Crossref
Collecting and Using Funding Data Crossref
 
CARA MENGELOLA PERUBAHAN PADA NASKAH
CARA MENGELOLA PERUBAHAN PADA NASKAHCARA MENGELOLA PERUBAHAN PADA NASKAH
CARA MENGELOLA PERUBAHAN PADA NASKAH
 
FundRef Webinar
FundRef WebinarFundRef Webinar
FundRef Webinar
 
Multiple Resolution and handling content available in multiple places
Multiple Resolution and handling content available in multiple placesMultiple Resolution and handling content available in multiple places
Multiple Resolution and handling content available in multiple places
 
Who is using your metadata - Ginny Hendricks
Who is using your metadata - Ginny HendricksWho is using your metadata - Ginny Hendricks
Who is using your metadata - Ginny Hendricks
 
cited by how-to
cited by how-tocited by how-to
cited by how-to
 
3. Crossref and PLOS, a publisher perspective
3. Crossref and PLOS, a publisher perspective3. Crossref and PLOS, a publisher perspective
3. Crossref and PLOS, a publisher perspective
 

Destacado

Introduction to CrossRef for Researchers
Introduction to CrossRef for ResearchersIntroduction to CrossRef for Researchers
Introduction to CrossRef for ResearchersCrossref
 
Social Media and Scholarly Communication
Social Media and Scholarly CommunicationSocial Media and Scholarly Communication
Social Media and Scholarly CommunicationCrossref
 
CrossRef DOIs for eBooks: Making it easier for readers to find your stuff
CrossRef DOIs for eBooks: Making it easier for readers to find your stuffCrossRef DOIs for eBooks: Making it easier for readers to find your stuff
CrossRef DOIs for eBooks: Making it easier for readers to find your stuffCrossref
 
CrossRef: Improving Scholarly Communications
CrossRef: Improving Scholarly CommunicationsCrossRef: Improving Scholarly Communications
CrossRef: Improving Scholarly CommunicationsCrossref
 
Orcid auto-update at Crossref
Orcid auto-update at CrossrefOrcid auto-update at Crossref
Orcid auto-update at CrossrefCrossref
 
Managing changes to content: Crossmark
Managing changes to content: CrossmarkManaging changes to content: Crossmark
Managing changes to content: CrossmarkCrossref
 
Help and support from Crossref
Help and support from Crossref Help and support from Crossref
Help and support from Crossref Crossref
 
The Global reach of Crossref metadata
The Global reach of Crossref metadataThe Global reach of Crossref metadata
The Global reach of Crossref metadataCrossref
 
Reference linking and Cited-by
Reference linking and Cited-byReference linking and Cited-by
Reference linking and Cited-byCrossref
 
Working with Crossref and registering content
Working with Crossref and registering contentWorking with Crossref and registering content
Working with Crossref and registering contentCrossref
 
Checking for originality: Crossref Similarity Check
Checking for originality: Crossref Similarity CheckChecking for originality: Crossref Similarity Check
Checking for originality: Crossref Similarity CheckCrossref
 
Collecting and using funding data in your publications
Collecting and using funding data in your publicationsCollecting and using funding data in your publications
Collecting and using funding data in your publicationsCrossref
 
Introduction to Crossref LIVE Yogyakarta
Introduction to Crossref LIVE Yogyakarta Introduction to Crossref LIVE Yogyakarta
Introduction to Crossref LIVE Yogyakarta Crossref
 

Destacado (13)

Introduction to CrossRef for Researchers
Introduction to CrossRef for ResearchersIntroduction to CrossRef for Researchers
Introduction to CrossRef for Researchers
 
Social Media and Scholarly Communication
Social Media and Scholarly CommunicationSocial Media and Scholarly Communication
Social Media and Scholarly Communication
 
CrossRef DOIs for eBooks: Making it easier for readers to find your stuff
CrossRef DOIs for eBooks: Making it easier for readers to find your stuffCrossRef DOIs for eBooks: Making it easier for readers to find your stuff
CrossRef DOIs for eBooks: Making it easier for readers to find your stuff
 
CrossRef: Improving Scholarly Communications
CrossRef: Improving Scholarly CommunicationsCrossRef: Improving Scholarly Communications
CrossRef: Improving Scholarly Communications
 
Orcid auto-update at Crossref
Orcid auto-update at CrossrefOrcid auto-update at Crossref
Orcid auto-update at Crossref
 
Managing changes to content: Crossmark
Managing changes to content: CrossmarkManaging changes to content: Crossmark
Managing changes to content: Crossmark
 
Help and support from Crossref
Help and support from Crossref Help and support from Crossref
Help and support from Crossref
 
The Global reach of Crossref metadata
The Global reach of Crossref metadataThe Global reach of Crossref metadata
The Global reach of Crossref metadata
 
Reference linking and Cited-by
Reference linking and Cited-byReference linking and Cited-by
Reference linking and Cited-by
 
Working with Crossref and registering content
Working with Crossref and registering contentWorking with Crossref and registering content
Working with Crossref and registering content
 
Checking for originality: Crossref Similarity Check
Checking for originality: Crossref Similarity CheckChecking for originality: Crossref Similarity Check
Checking for originality: Crossref Similarity Check
 
Collecting and using funding data in your publications
Collecting and using funding data in your publicationsCollecting and using funding data in your publications
Collecting and using funding data in your publications
 
Introduction to Crossref LIVE Yogyakarta
Introduction to Crossref LIVE Yogyakarta Introduction to Crossref LIVE Yogyakarta
Introduction to Crossref LIVE Yogyakarta
 

Similar a CrossRef Text and Data Mining

Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...Chris Shillum
 
Intro_To_FHIR.pptx
Intro_To_FHIR.pptxIntro_To_FHIR.pptx
Intro_To_FHIR.pptxPierluigi10
 
Biocatalogue Talk Slides
Biocatalogue Talk SlidesBiocatalogue Talk Slides
Biocatalogue Talk SlidesBioCatalogue
 
Text and Data Mining (TDM):Tools to make it easier by Chuck Koscher
Text and Data Mining (TDM):Tools to make it easier by Chuck KoscherText and Data Mining (TDM):Tools to make it easier by Chuck Koscher
Text and Data Mining (TDM):Tools to make it easier by Chuck KoscherCrossref
 
Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.Emma Warren-Jones
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...David Peyruc
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
Who is using your content?
Who is using your content? Who is using your content?
Who is using your content? Crossref
 
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceWebinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceLucidworks
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013Frauke Ziedorn
 
Introduction to Digital Humanities: Metadata standards and ontologies
Introduction to Digital Humanities: Metadata standards and ontologies Introduction to Digital Humanities: Metadata standards and ontologies
Introduction to Digital Humanities: Metadata standards and ontologies LIBIS
 
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline PilotBIOVIA
 
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...Dr. Haxel Consult
 
Multi-agent interactions on the Web through Linked Data Notifications
Multi-agent interactions on the Web through Linked Data NotificationsMulti-agent interactions on the Web through Linked Data Notifications
Multi-agent interactions on the Web through Linked Data NotificationsJean-Paul Calbimonte
 
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...OpenAIRE
 
Revelations about relations in connecting research: content types, data and i...
Revelations about relations in connecting research: content types, data and i...Revelations about relations in connecting research: content types, data and i...
Revelations about relations in connecting research: content types, data and i...Jisc
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptxGetu Tadele
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
OSFair2017 training | Machine accessibility of Open Access scientific publica...
OSFair2017 training | Machine accessibility of Open Access scientific publica...OSFair2017 training | Machine accessibility of Open Access scientific publica...
OSFair2017 training | Machine accessibility of Open Access scientific publica...Open Science Fair
 

Similar a CrossRef Text and Data Mining (20)

Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
Presentation from ALA Midwinter 2014 on Elsevier's new Text and Data Mining P...
 
Intro_To_FHIR.pptx
Intro_To_FHIR.pptxIntro_To_FHIR.pptx
Intro_To_FHIR.pptx
 
Biocatalogue Talk Slides
Biocatalogue Talk SlidesBiocatalogue Talk Slides
Biocatalogue Talk Slides
 
Text and Data Mining (TDM):Tools to make it easier by Chuck Koscher
Text and Data Mining (TDM):Tools to make it easier by Chuck KoscherText and Data Mining (TDM):Tools to make it easier by Chuck Koscher
Text and Data Mining (TDM):Tools to make it easier by Chuck Koscher
 
Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.Text Data Mining: Unlocking the hidden potential from scholarly content.
Text Data Mining: Unlocking the hidden potential from scholarly content.
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
Who is using your content?
Who is using your content? Who is using your content?
Who is using your content?
 
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceWebinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013
 
Introduction to Digital Humanities: Metadata standards and ontologies
Introduction to Digital Humanities: Metadata standards and ontologies Introduction to Digital Humanities: Metadata standards and ontologies
Introduction to Digital Humanities: Metadata standards and ontologies
 
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
 
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
 
Multi-agent interactions on the Web through Linked Data Notifications
Multi-agent interactions on the Web through Linked Data NotificationsMulti-agent interactions on the Web through Linked Data Notifications
Multi-agent interactions on the Web through Linked Data Notifications
 
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
 
Revelations about relations in connecting research: content types, data and i...
Revelations about relations in connecting research: content types, data and i...Revelations about relations in connecting research: content types, data and i...
Revelations about relations in connecting research: content types, data and i...
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptx
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
OSFair2017 training | Machine accessibility of Open Access scientific publica...
OSFair2017 training | Machine accessibility of Open Access scientific publica...OSFair2017 training | Machine accessibility of Open Access scientific publica...
OSFair2017 training | Machine accessibility of Open Access scientific publica...
 

CrossRef Text and Data Mining

  • 1. Rachael Lammey Product Manager, CrossRef 28 October 2014
  • 2. Not-for-profit association of scholarly publishers All subjects, all business models 4,000+ organizations from all over the world 83 non-publisher affiliates, 2000 library affiliates 68 million content items
  • 5. User clicks on CrossRef DOI reference link in Journal A → DOI directory returns URL → User accesses cited article in Journal B. Example reference: Tani, N., N. Tomaru, M. Araki, AND K. Ohba. 1996. Genetic diversity and differentiation in populations of Japanese stone pine (Pinus pumila) in Japan. Canadian Journal of Forest Research 26: 1454–1462. [CrossRef]
  • 7. Services • Cross-publisher reference linking • Cross-publisher Cited-by linking • Cross-publisher metadata feeds • Cross-publisher plagiarism screening • Cross-publisher update identification • Cross-publisher funder identification • Cross-publisher text and data mining Powered by iThenticate
  • 8. A Text and Data Mining Hub for Researchers
  • 9. What is text and data mining? Text Mining is an interdisciplinary field combining techniques from linguistics, computer science and statistics to build tools that can efficiently retrieve and extract information from digital text. http://blogs.plos.org/everyone/2013/04/17/announcing-the-plos-text-mining-collection/ It uses powerful computers to find links between drugs and side effects, or genes and diseases, that are hidden within the vast scientific literature. These are discoveries that a person scouring through papers one by one may never notice. http://www.theguardian.com/science/2012/may/23/text-mining-research-tool-forbidden
  • 10. http://www.jisc.ac.uk/media/documents/publications/textminingbp_rtf.rtf Marc Weeber and colleagues used automated text mining tools to infer that the drug thalidomide could treat several diseases it had not been associated with before. Thalidomide was taken off the market 40 years ago, but is still the subject of research because it seems to benefit leprosy patients via their immune systems. Weeber and Grietje Molema, an immunologist, used text mining tools to search the literature for papers on thalidomide and then pick out those containing concepts related to immunology. One concept, concerning thalidomide’s ability to inhibit Interleukin-12 (IL-12), a chemical involved in the launch of an immune response, struck Molema as particularly interesting. A second automated search for diseases that improve when the action of IL-12 is blocked, revealed several not previously linked with thalidomide, including chronic hepatitis, myasthenia gravis and a type of gastritis. “Type in thalidomide and you get 2-3000 hits. Type in disease and you get 40,000 hits. With automated text mining tools we only had to read 100-200 abstracts and 20 or 30 full papers. We’ve created hypotheses for others to follow up” says Weeber. Weeber et al. J Am Med Inform Assoc. 2003 10 252-259
  • 12. Why? • Researchers find it impractical to negotiate multiple bilateral agreements with hundreds of subscription-based publishers in order to authorize TDM of subscribed content. • Subscription-based publishers find it impractical to negotiate multiple bilateral agreements with thousands of researchers and institutions in order to authorize TDM of subscribed content. • All parties would benefit from support of standard APIs and data representations in order to enable TDM across both open access and subscription-based publishers.
  • 13. * Chinese Geoscience Union * Chinese Institute Of Automation Engineers (Ciae) * Chinese Journal Of Mechanical Engineering * Chinese Mathematical Society * Chinese Physical Society * Chinese Physiological Society * Chinese Society Of Theoretical And Applied Mechanics * Chonnam National University Medical School (Kamje) * Christ University Bangalore * Cic Edizioni Internazionali * Cig Media Group * Cilip Information Literacy Group * Civil-Comp, Ltd. * Claremont Colleges Library * Classical Association Of The Middle West And South, Inc. (Camws) * Clawar Association Limited * Clay Minerals Society * Cleo Revues.Org * Cleveland Clinic Journal Of Medicine * Clinical Autonomic Research Society * Clinical Laboratory Publications * Clinics Cardive Publishing * Clockss Archive * Cnps * Cnrs France * Cnu Journal Of Agricultural Science
  • 15. Why use the DOI? Using the DOI as the basis for a common text and data mining API provides several benefits. For example, the DOI provides: • An easy way to de-duplicate documents that may be found on several sites. • Persistent provenance information. • An easy way to document, share and compare corpora without having to exchange the actual documents. • A mechanism to ensure the reproducibility of TDM results using the source documents. • A mechanism to track the impact of updates, corrections, retractions and withdrawals on corpora.
  • 22. CrossRef TDM HTTP Headers • CR-TDM-Rate-Limit: 1500 (the ceiling on the number of requests per window) • CR-TDM-Rate-Limit-Remaining: 1387 (the number of requests left in the current window) • CR-TDM-Rate-Limit-Reset: 1378072800 (the time, in UTC epoch seconds, at which the rate limit resets and a new window starts) * This is a technique used by many APIs, including Twitter’s.
  • 23. Common API Summary • Content Negotiation (Required) • New Metadata (Required) • Full text URIs • License URIs • Rate Limiting Headers (optional)
  • 25. 1. Full Text Link https://apps.crossref.org/docs/tdm/full-text-uris-technical-details/
  • 27. Example from Hindawi
        <ai:program name="AccessIndicators">
          <ai:license_ref>http://creativecommons.org/licenses/by/3.0/</ai:license_ref>
        </ai:program>
        <doi_data>
          <doi>10.1155/2014/969265</doi>
          <timestamp>20140401090031</timestamp>
          <resource>http://www.hindawi.com/journals/aaa/2014/969265/</resource>
          <collection property="text-mining">
            <item>
              <resource mime_type="application/pdf">http://downloads.hindawi.com/journals/aaa/2014/969265.pdf</resource>
            </item>
            <item>
              <resource mime_type="application/xml">http://downloads.hindawi.com/journals/aaa/2014/969265.xml</resource>
            </item>
          </collection>
        </doi_data>
  • 28. Stop here if • You are an open access publisher • You include TDM as a part of your subscription license/T&Cs.
  • 33. Researcher queries DOI using content negotiation (CN) + API token → Publisher verifies API token → If token verified AND access control allows, publisher returns full text (frequency at publisher discretion)
  • 34. Benefits • Streamlines researcher access to distributed full text for TDM • Enables machine-to-machine, automated access for recognized TDM (i.e. researchers won’t be locked out of publisher sites) • Enables article-level licensing info and easy mechanism for supplemental T&Cs for text and data mining (publishers discussing model license via STM)
  • 36. Publishers There are two additional metadata elements that publishers will need to deposit to support TDM via CrossRef. These are: •Full Text URIs: One or more URIs that point to full text representations of the content identified by your CrossRef DOIs. •License URIs: One or more URIs pointing at licenses that govern how the full text content can be used. •OPTIONAL: Add publisher TDM terms and conditions to the click-through service
  • 37. Researchers • Modify TDM tools to make use of the API token • Modify TDM tools to look for <lic_ref> elements • Register with the click-through service and accept/decline licenses (if applicable)
  • 39. Progress to date • DOI content negotiation • CrossRef support for recording links to full text • CrossRef metadata support for ORCIDs, FundRef and license information • CrossRef Metadata Search for Discovery: http://search.labs.crossref.org/ • Click-through license service • Publisher API for verifying and managing tokens • Launched as live service 29th May 2014
  • 40. Publishers • Articles with full-text links and license information deposited: 998,416 • Cost? Free to researchers and the public. No cost for publishers through 2014; 2015 TBC. • Register interest at: http://www.crossref.org/tdm/contact_form.html
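
The Hindawi example on slide 27 shows the two pieces of metadata a TDM tool needs: license URIs and full-text URIs. The following is a minimal Python sketch of how a tool might pull them out of such a record. It is an illustration under assumptions, not CrossRef's own client code: the `<record>` wrapper element, the `extract_tdm_metadata` function name and the namespace URI bound to the `ai:` prefix are all assumed here, and a real CrossRef record may place these elements in a default namespace that this sketch does not handle.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI for the "ai:" prefix; check it against a real record.
AI_NS = "http://www.crossref.org/AccessIndicators.xsd"

# The slide-27 fragment, wrapped in a hypothetical <record> root so it parses.
SAMPLE = """<record xmlns:ai="http://www.crossref.org/AccessIndicators.xsd">
  <ai:program name="AccessIndicators">
    <ai:license_ref>http://creativecommons.org/licenses/by/3.0/</ai:license_ref>
  </ai:program>
  <doi_data>
    <doi>10.1155/2014/969265</doi>
    <timestamp>20140401090031</timestamp>
    <resource>http://www.hindawi.com/journals/aaa/2014/969265/</resource>
    <collection property="text-mining">
      <item>
        <resource mime_type="application/pdf">
          http://downloads.hindawi.com/journals/aaa/2014/969265.pdf
        </resource>
      </item>
      <item>
        <resource mime_type="application/xml">
          http://downloads.hindawi.com/journals/aaa/2014/969265.xml
        </resource>
      </item>
    </collection>
  </doi_data>
</record>"""


def extract_tdm_metadata(xml_text):
    """Return the DOI, license URIs and full-text URIs found in a record."""
    root = ET.fromstring(xml_text)

    # License URIs: every <ai:license_ref> element, wherever it appears.
    licenses = [el.text.strip() for el in root.iter("{%s}license_ref" % AI_NS)]

    # Full-text URIs: only resources inside the "text-mining" collection,
    # keyed by MIME type; the landing-page <resource> is skipped.
    full_text = {}
    for collection in root.iter("collection"):
        if collection.get("property") != "text-mining":
            continue
        for resource in collection.iter("resource"):
            full_text[resource.get("mime_type")] = resource.text.strip()

    return {
        "doi": root.findtext(".//doi"),
        "licenses": licenses,
        "full_text": full_text,
    }


if __name__ == "__main__":
    print(extract_tdm_metadata(SAMPLE))
```

A tool could check the returned license URIs against a list of licenses it is permitted to mine under before fetching any of the full-text URIs.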

Editor's notes

  1. Questions at end. Talk a little bit about what CrossRef is then move on to talk about our text and data mining service.
  2. First just a few words about CrossRef for anyone who isn’t a member or might not be familiar with us as an organisation. CrossRef is a not-for-profit membership organisation of international scholarly publishers. We have 4000 member publishers, representing all disciplines - not just STM, and comprising commercial publishers, academic societies, open access publishers, university presses. We also have 83 affiliate members and 2000 library affiliates - these libraries and other organisations make use of the CrossRef database to look up DOIs and metadata. We are the largest DOI registration agency and have assigned nearly 63 million DOIs to date.
  3. CrossRef was founded 14 years ago to solve the problem of broken links. The web is all about links, but links break. This is annoying if you’re browsing the web and want to follow an interesting link, but in the context of scholarly publishing it becomes more than annoying - if you can’t follow a citation from one paper to another you’re being hampered in your research. Citation linking is one of the greatest benefits of online publishing, but it really does need to be reliable
  4. and publishers were finding that web sites changed, content moved, and links that they had put into their articles stopped working. So they started a multi-publisher initiative to solve this problem of broken links. This is done using the DOI - the Digital Object Identifier, which I’m sure many of you are familiar with. A CrossRef DOI is simply a unique identifier for a piece of content. Once assigned, it doesn’t change. It is to all intents and purposes a meaningless number, but it allows that piece of content to be located on the web.
  5. And it works like this: publishers use CrossRef DOIs to link to content, usually from the references at the end of articles. Users click on those DOI-based links and are referred via the CrossRef database to the cited article at its correct location on the web. If content moves, the publisher only has to update the CrossRef database once, and all of the publishers that are linking to their content using CrossRef DOIs will be redirected to the content in its new location.
  6. Every month there are around 90 million clicks on CrossRef DOI links, so 90 million citations resolved to content.
  7. The issue of Text and Data Mining has become very important and we feel that CrossRef is in a unique position to expand its current infrastructure (a registry of unique identifiers and metadata for scholarly content and thousands of members) to make TDM easier for researchers and their institutions and publishers. Technical solution - we aren’t addressing the issue of licensing.
  8. Looking at positives. Finding treatments to diseases that may not have been found before.
  9. But urge caution – Google Flu!
  10. Why did CrossRef develop this service? Applies to OA content too. Let’s just illustrate these issues.
  11. Researcher to illustrate that plus some of the publishers we represent. TDM is about scale.
  12. Bilateral agreements aspect - Until now, researchers who wish to text and data mine published literature have had no common or simple way of accessing the full text of the content they wish to mine. This is true of subscription-based content as well as of open access content. Consequently, TDM users access the content in one of two ways: (1) negotiating with publishers to have the content delivered to them, either via physical media or bulk data transfer (e.g. FTP); or (2) “screen-scraping” the publisher’s website. The first option doesn’t scale well across multiple publishers and researchers. It also presents synchronisation problems if the researchers want an ongoing feed of refreshed content. The issue with the second option is that screen scraping is an inefficient, fragile and error-prone mechanism for identifying and downloading full text. Screen scrapers put a large performance burden on web sites and, at the same time, any slight change to the web site can break the tool that is doing the screen scraping. CrossRef Text and Data Mining provides a common solution which works across open access and subscription-based publishers and is free for anyone to use.
  13. Processing the same document on multiple sites could easily skew text and data mining results, and traditional techniques for eliminating duplicates (e.g. hashes) will not work reliably if the document in question exists in several representations (e.g. PDF, HTML, ePub) and/or versions (e.g. accepted manuscript, version of record). Using the DOI as a key will allow researchers to retrieve and verify the provenance of the items in the TDM corpus many years into the future, when traditional HTTP URLs will have already broken.
  14. The CrossRef Common API is the main aspect of this service and is designed to allow researchers to easily harvest full text documents from all participating publishers regardless of their business model (e.g. open access, subscription). It makes use of CrossRef DOI content negotiation to provide researchers with links to the full text of content located on the publisher’s site. The publisher remains responsible for actually delivering the full text of the content requested. Thus, open access publishers can simply deliver the requested content while subscription based publishers continue to support subscriptions using their existing access control systems.
  15. API works with content negotiation – what is content negotiation
  16. Content negotiation allows a user to request a particular representation of a web resource. DOI resolvers use content negotiation to provide different representations of metadata associated with DOIs. A content-negotiated request to a DOI resolver is much like a standard HTTP request, except that server-driven negotiation will take place based on the list of acceptable content types a client provides. Here, they’re asking for text.
  17. Here they’re asking for XML. They can also request PDF, as we know a lot of publishers may only have back content in PDF, and that’s fine.
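As a rough illustration of the content-negotiated request described above, the Python sketch below builds (but does not send) an HTTP request to a DOI resolver with an `Accept` header listing the representations the client is willing to receive. The resolver base URL, the placeholder DOI and the media types are assumptions chosen for illustration; the media types a given resolver or publisher actually honours should be taken from their documentation.

```python
from urllib.parse import quote
from urllib.request import Request

# Assumption: the public DOI resolver is used as the base URL.
DOI_RESOLVER = "https://doi.org/"


def build_negotiated_request(doi, accepted_types):
    """Build a GET request for a DOI whose Accept header lists the wanted types."""
    url = DOI_RESOLVER + quote(doi, safe="/")
    return Request(url, headers={"Accept": ", ".join(accepted_types)})


# Placeholder DOI; ask for XML first and PDF as an alternative.
request = build_negotiated_request(
    "10.5555/12345678", ["application/xml", "application/pdf"]
)
print(request.full_url)
print(request.get_header("Accept"))
```

Passing `request` to `urllib.request.urlopen` would send it; the server then chooses a representation based on the `Accept` list, which is the server-driven negotiation the slide refers to.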
  18. A set of standard HTTP headers that servers can use to convey rate-limiting information to automated TDM tools. Well-behaved TDM tools can simply look for these headers when they query publisher sites in order to understand how best to adjust their behaviour so as not to affect the performance of the site. The headers allow a publisher to define a “rate limit window”, which is basically a time span (e.g. a minute, an hour, a day).
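One way a well-behaved TDM tool might act on such headers is sketched below in Python. The header names (`CR-TDM-Rate-Limit-Remaining`, `CR-TDM-Rate-Limit-Reset`) and the reading of the reset value as a UTC epoch timestamp are assumptions made for this example; the slide does not name the headers, so the support site should be treated as the authority.

```python
import time

# Assumed header names; not confirmed by the slide text.
REMAINING_HEADER = "CR-TDM-Rate-Limit-Remaining"
RESET_HEADER = "CR-TDM-Rate-Limit-Reset"


def seconds_to_wait(headers, now=None):
    """Return how long to pause before the next download.

    `headers` is a dict of response headers. If requests remain in the
    current rate limit window (or the headers are absent), no pause is needed;
    otherwise wait until the assumed reset time (epoch seconds).
    """
    if now is None:
        now = time.time()
    remaining = int(headers.get(REMAINING_HEADER, 1))
    if remaining > 0:
        return 0
    reset_at = int(headers.get(RESET_HEADER, 0))
    return max(0, reset_at - now)


# Hypothetical response: quota exhausted, window resets 60 seconds from "now".
example_headers = {REMAINING_HEADER: "0", RESET_HEADER: "1000"}
print(seconds_to_wait(example_headers, now=940))
```

A harvesting loop would call `time.sleep(seconds_to_wait(response_headers))` between downloads, so the publisher's window, rather than a hard-coded delay, sets the pace.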
  19. In order for researchers to use the CrossRef API, Publishers need to add new metadata to their CrossRef DOI deposits.
  20. One or more URIs pointing at licenses that govern how the full text content can be used.
  21. This needs to be added to the publisher XML – license information at the article-level. Examples on our support site.
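On the researcher's side, the license URIs in the metadata can be checked automatically before any full text is fetched. The Python sketch below compares an item's license URIs against a list the researcher has already reviewed; the accepted list and the function names are hypothetical, and a real tool would use whatever URIs the researcher's institution has actually approved.

```python
# Hypothetical set of license URIs the researcher has reviewed and accepted.
ACCEPTED_LICENSES = {
    "http://creativecommons.org/licenses/by/4.0/",
    "http://creativecommons.org/licenses/by/3.0/",
}


def normalise_uri(uri):
    """Ignore case, scheme and trailing slash when comparing license URIs."""
    uri = uri.strip().lower().rstrip("/")
    for scheme in ("https://", "http://"):
        if uri.startswith(scheme):
            uri = uri[len(scheme):]
            break
    return uri


def may_mine(license_uris, accepted=ACCEPTED_LICENSES):
    """Return True if at least one of the item's license URIs is accepted."""
    accepted_keys = {normalise_uri(uri) for uri in accepted}
    return any(normalise_uri(uri) in accepted_keys for uri in license_uris)


print(may_mine(["https://creativecommons.org/licenses/by/4.0"]))
print(may_mine(["http://example.org/proprietary-license"]))
```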
  22. Publishers who require researchers to agree to a specific set of Terms and Conditions (T&Cs) before they are allowed to text and data mine content that they otherwise have access to (e.g. through an existing subscription) will need to make use of the click-through service.
  23. So to put it all together…
  24. If you are an open access publisher, or if your existing subscription licenses already allow TDM of subscribed full text, then depositing the above metadata is the ONLY thing you need to do in order to enable TDM of your content via the CrossRef Metadata API. Rate limiting.
  25. Rate limiting too
  26. Support site with info. Info on rate limiting on there too.
  27. LAUNCH later this month. More deposits from OA publishers at the moment as it’s easier as their licenses already allow text and data mining. Piloted last year and got approval from the board to move it into Production in November which we’ve done to prepare for launch in March.
  28. Working group, which will migrate to a full CrossRef Committee when the service is officially launched. We have seen over 100,000 deposits of full text links and license information, mainly from Hindawi but some from AIP and IEEE as well.
  29. Eric Lease Morgan
  30. Publishers and researchers in pilot. Launch in May