The ChemSpider database is a resource hosted by the Royal Society of Chemistry. With over 28 million unique chemicals on the database linked out to over 400 data sources the platform provides access to experimental and predicted data (properties, spectra etc.), links to publications, patents and a myriad of other resources. The ChemSpider database has been used as the foundation of a number of other resources for chemists including ChemSpider SyntheticPages, the Learn Chemistry Wiki and the Spectral Game. This presentation will provide an overview of ChemSpider and discuss how chemists can both derive value from and contribute to the content available from the database and its related resources. We will also discuss our view of future platform for managing personal, institutional and public chemistry in a shared environment.
RSC ChemSpider is the online chemistry database where community contributions coun
1. RSC|ChemSpider – The Online
Chemistry Database Where
Community Contributions Count
2. ChemSpider
The RSC’s Online Chemical Database
A central hub for chemists to source information
>28 million unique chemical records
Aggregated from >400 data sources
Chemicals, spectra, CIF files, movies, images,
podcasts, links to patents, publications,
predictions
A central hub for chemists to deposit & curate data
3. Answer Questions with ChemSpider
Questions a chemist might ask…
What is the melting point of n-heptanol?
What is the chemical structure of Xanax?
Chemically, what is phenolphthalein?
What are the stereocenters of cholesterol?
Where can I find publications about xylene?
What are the different trade names for Ketoconazole?
What is the NMR spectrum of Aspirin?
What are the safety handling issues for Thymol Blue?
25. Scientists are measured by…
Impact
Citations
Papers
Patents
Funding
and increasingly by “Alt-Metrics” – what you say,
what you contribute, your data depositions, your
code in repositories, your voice in the network,
your activities on Facebook (be careful!)
27. If it was not just about me…
We might have a community
built encyclopedia
I might know where the best
restaurants are
I might get good advice on
books to read
I might know which movies
to watch
I might know which plumber
to call
Data might just be Open
28. If it was not just about me…
We might have a community
built encyclopedia
I might know where the best
restaurants are
I might get good advice on
books to read
I might know which movies
to watch
I might know which plumber
to call
Data might just be Open
29. The Social Network
Career-wise NOT having a personal presence
online will be a detriment
Self-marketing
Establishing a profile
Getting on the record
Collaborative Science
Demonstrating a skill set
Measured using alternative metrics
Contributing to the public peer review process
30. Social Networking Tools
A growing number of social networking tools:
Facebook
Twitter
Linked-In
Flickr
YouTube
Blogs
Communities
Collaborative environments
31. Chemistry Social Networking
Methods of sharing MY chemistry online include:
Wikis or blogs
Slideshare for presentations
YouTube for videos
Flickr, Wikimedia etc. for images
PubChem for assay data
NMRShiftDB for NMR assignments
GoogleDocs for data
42. The World of Contribution
Times have changed
Immediacy of social networks
Commenting on articles/data is here
The “participating scientist” has high profile
And who can be a scientist now???
45. Share Science!!! Not Just Yourself
If you have time, and the inclination, become a
community contributor
Share your expertise in the new world of openness
Share your Open Source code
Share your data and your model
Share your Figures
Contribute to Wikis – Wikipedia and others
Become an Open Notebook Scientist
48. ChemSpider SyntheticPages
Many syntheses are not published but are of value
A database of synthesis procedures built for the
community, by the community.
Peer-reviewed by the community
Each contribution DOI’ed. Develop online scientific
reputation at a time of “micro-publications”
Integrates semantic mark-up and visualization tools
53. Submission Process
Submissions reviewed by editorial board
Published as is or comments sent to author
Online Peer Review process – engage chemists
in ongoing discussions and feedback loop
Data supported include web movies, images, live
spectra etc.
60. Is it working?
Show of hands…
How many of you know ChemSpider?
How many of you know CSSP?
Have any of you submitted to CSSP?
Low submissions but some dedicated authors
62. Is it working?
Show of hands…
How many of you know CSSP?
Have any of you submitted to CSSP?
Low submissions but some dedicated authors
What reasons are there you would not publish?
Time
Approval from supervisor
Need to keep the science quiet
Publishing on CSSP prevents future publishing?
64. Contributing to The Quality of Data
What is the Structure of Vitamin K?
A lipid cofactor that is required for normal blood
clotting. Several forms of vitamin K have been
identified: VITAMIN K1 (phytomenadione)
derived from plants, VITAMIN K2
(menaquinone) from bacteria & synthetic
naphthoquinone provitamins, VITAMIN K3
(menadione).
77. Deposition, Annotation and
Validation
ANYBODY can annotate a record on ChemSpider
Registered users can deposit new data
Registered users can validate existing data
91. Data Enabling the RSC Archive
An archive going back to 1841. Project underway
to “data enable” the archive:
Extract chemistry – chemicals, reactions,
experimental data points, complex data
Semantic enriching of the articles for interactive
viewing and crowdsourced annotation/curation
Dramatically enables the type of queries
possible across the archive
92. A model for data segregation
Integrate to Institutional repositories
Access to Theses and Dissertations
93. Model Building with Community Data
Community data can be the basis of model
building
Consume data from available databases, RSC
archive, new publications and build predictive
algorithms for the community
Accept research data from the community and
include into predictions
94. An Open Data-Centric Chemistry Hub
Internet Data
Small organic molecules Commercial Software
Undefined materials Pre-competitive Data
Organometallics Open Science
Nanomaterials Open Data
Polymers Publishers
Minerals Educators
Particle bound Open Databases
Links to Biologicals Chemical Vendors
98. ScientistsDB
Write your OWN article about yourself on
ScientistsDB
It is a community-policed site so any comments
you write might be challenged/edited. It is “your”
page but edited by all
An article, once approved by the community, can,
in theory, be moved to Wikipedia
All content is licensed under standard CC-BY-SA
3.0 licensing provided by Wikipedia
99. Acknowledgments
RSC|ChemSpider team
CSSP Editorial Team
All data source providers
Curators and annotators
Service providers:
ACD/Labs
OpenEye
GGA Software Services
Many others….
100. Communicating Science
As scientists one of our primary roles is contribution
The internet enables contribution in different ways,
benefitting the scientist and the community
Share your data and experience – it can enhance
your public profile as a scientist, make you more
discoverable and contribute data to the community
AltMetrics will be a measure of scientists…