Connecting Scientific Resources Breakout
Science Online London 2010 - British Library
Session abstract: "Do you have data? Have you decided that you want to publish that data in a friendly way? Then this session is for you. Allowing your data to be linked to other data sets is an obvious way to make your data more useful, and to contribute back to the data community that you are a part of, but the mechanics of how you do that is not always so clear cut. This session will discuss just that. With experts from the publishing world, the liked data community, and scientific data services, this is a unique opportunity to get an insight into how to create linked scientific data, and what you can do with it once you have created it."- http://www.scienceonlinelondon.org/programme.php?tab=abstracts#breakout8
4. …that is not what I will be speaking about today…
4
5. …I am talking about integrating supplementary data in context. In the case of
working with PANGAEA, data is not only in context, but visualized. Publishers need to
work to integrate third-party data repositories into the context of related articles.
How to scale this though?
More information about the PANGAEA repository here:
http://www.pangaea.de/
More information about this collaboration here:
http://www.elsevier.com/wps/find/authored_newsitem.cws_home/companynews05
_01616
5
6. Another example of supplementary data integrated and visualized in context.
6
10. Will not be what I am discussing today (but remember this example for later).
Instead…
10
11. …I will be discussing automated entity extraction which has a number of benefits
including.
•Easily extendable to new / other entities
•Works retrospectively on older content
•Does create recall / precision errors (not actually a benefit, but important to
note)
Furthermore…
11
14. 1. Colours & tags genes, proteins, molecule names
2. Clicking shows a summary of features for the term (ie: sequence or 2D structure)
3. User can click on links in the pop-up leading out to more information
The Reflect integration also links to information in context…
14
15. Reflect integration in ScienceDirect was a result of EMBL winning the Elsevier Grand
Challenge.
“The Elsevier Grand Challenge: Knowledge Enhancement in the Life Sciences is a
contest created to improve the way scientific information is communicated and
used. The contest invites members of the scientific community to describe and
prototype a tool to improve the interpretation and identification of meaning in
(online) journals and text databases relating to the life sciences. Specifically we are
looking for new ways to:
improve the process/methods/results of creating, reviewing and editing scientific
content
interpret, visualize or connect the knowledge more effectively, and/or
provide tools/ideas for measuring the impact of these improvements.” -
http://www.elseviergrandchallenge.com/description.html
On route to a scalable model?
15
16. Remember this example from earlier? Integrating PDB content in context will greatly
enhance the user experience (watch for this change). Things are improving fast, but
how to keep up with it all?
16
18. Q: How can we scale the PANGAEA, NextBio and Reflect model to its full potential?
A: By opening up our platform to third-party developers to develop applications in
context on Scopus and ScienceDirect.
“SciVerse Applications is the place to find and market applications, or collaborate
with the scientific community to create applications.” -
http://www.info.sciverse.com/sciverse-applications
18
19. Platform built on the (open source) Apache Shindig framework for Open Social
gadgets and RESTful APIs. Customer entitlements for ScienceDirect and Scopus
remain the same for users.
19
20. Applications can be developed in context in multiple locations throughout
ScienceDirect, Scopus, and Hub Beta.
20