SlideShare una empresa de Scribd logo
1 de 44
Feeding and consuming data to
support Open Notebook Science via
          the ChemSpider Platform

Antony Williams, Jean-Claude Bradley, Andrew Lang and
                                      Valery Tkachenko

                               ACS Philadelphia August 2012
Setting the Stage
 Chemists want access to tools and data

     The more capabilities the better
     The more data the better
     And give us an API with that…
     And it should be free…
     And constantly updated…
     And all data should be Open…
     And make it fully Open Source…
     And it needs to be on my mobile…
Setting the Stage
 Chemists have access to tools and data

     The more capabilities the better – we’ll see
     The more data the better – changing daily
     And give us an API with that… - not just one
     And it should be free… - sure
     And constantly updated… - indeed..please help!
     And all data should be Open…- licensing
     And make it fully Open Source… - kinda, sorta
     And it needs to be on my mobile… - sure
Welcome to ChemSpider
 5 years, 28 million chemicals, linking 400 data
  sources and growing daily

 Hosted by the Royal Society of Chemistry
 An important part of our long term strategic vision

 Free to access
 With lots/most/all (?) of the functionality
  necessary to support chemists and Open
  Notebook Science…
Why Use ChemSpider?
Why Use ChemSpider?
Why Use ChemSpider?
Why Use ChemSpider?
Why Use ChemSpider? LINKING OUT
Why Use ChemSpider?
Why Use ChemSpider
Why Use ChemSpider
Why Use ChemSpider
Why Use ChemSpider
What about Syntheses?
ChemSpider SyntheticPages
Work in Progress – 300k Reactions
Storing ONS Reactions
 Working with JC Bradley to host ONS reactions
 Linking directly back to ONS reactions


 What if the links decay?
 Host all related ONS data – benefits of Openness!
 Future applications for RInChIs
What we have been asked for
   “Allow us to grab data”
   “Let us link”
   “Give us web services to integrate”
   “Can we store our data with you?”
   “Can you give us predictions to validate data?”
What we have been asked for
   “Allow us to grab data”
   “Let us link”
   “Give us web services to integrate”
   “Can we store our data with you?”
   “Can you give us predictions to validate data?”



 “Can you build us an ELN?”
Simple Linking to ChemSpider
 Link using ChemSpiderID
 http://www.chemspider.com/1234567
ChemSpider IDs Proliferating Now
Simple Querying Example
 http://
  www.chemspider.com/Search.aspx?q=InChIKey=XXO
Or InChI, or SMILES
 http://www.chemspider.com/Search.aspx?q=InChI=1S
  m1/s1

 http://www.chemspider.com/Search.aspx?
  q=Clc1ccc(cc1)C(O)=C3C(=O)C(=O)N([C@@H]3
  c2cccc(F)c2)CCc5c4ccccc4nc5
Better to provide APIs….
Various Flavors of API
Various Flavors of API
MANY Web Services for integration
Feeding ONS Data into ChemSpider
 ONS data can be deposited into ChemSpider and
  linked out to the ONS pages
 Simply deposit structure(s) and links
Feeding ONS Data into ChemSpider
 ONS Solubility Challenge
Feeding ONS Data into ChemSpider
So isn’t ONS all about ELNs?
 Open Notebook Science is about
   Making records of research publicly available
    online as it is recorded

 ONS is enabled by software tools and platforms
   Keep the notebook of the researcher online
    with all raw and processed data as it is
    generated (close to or near real time)
   Notebooks as Wikis, Commercial or Free ELNs
    published to the web (choose public/private –
    what data to expose)
Feeding ELN Data into ChemSpider
 Integrate e-Notebooks into ChemSpider

   IDBS e-Workbook plug-in allows direct
    deposition of chemical structures
   Can be extended to more ELN content
      Spectra
      Reactions
      Properties etc.

      Integration Video http://tinyurl.com/9xnprqr
Feeding ELN Data into ChemSpider
How much data is lost?
 How many reactions in a thesis never get
  published?
 How many spectra of common materials could be
  shared?
 How many properties are measured and lost?
 What stands in the way of sharing?
    Is it technology?
    Permissions? “The Boss”, Licensing?

 And yes – there are data quality issues but there
  is algorithmic checking and data curation to help
What could the future look like?
 “Publicly funded” research data flows onto the web
 Licensing is clear and NOT a challenge
 Machines are picking up data and depositing

 EXAMPLE project – Any interest?
   Put your spectra/structure in folders (Dropbox)
   ChemSpider robot scoops, processes and
    deposits – opportunity with JC Bradley
   While processing also predicts spectra and
    compares for validation
Leaving the Stage
 Chemists have access to tools and data

     The more capabilities the better – what’s missing?
     The more data the better – anyone want to share?
     And give us an API with that… - ask us for help
     And it should be free… - it is
     And constantly updated… - help annotate/curate
     And all data should be Open…- licensing
     And make it fully Open Source… - book chapter
     And it needs to be on my mobile… - it is
ChemSpider Mobile
New URLs to try out
 ChemSpider Reactions:
  www.chemspider.com/reactions

 ChemSpider Validation and Standardization
  Platform: www.chemspider.com/cvsp

 ChemSpider Google:
  www.chemspider.com/google
ChemSpider Google
ChemSpider Google
Acknowledgments
 RSC Cheminformatics team
 JC Bradley’s lab
 Daniel Lowe – reactions
 Commercial Software – GGA Software,
  ACD/Labs, OpenEye
 Open Source Components
Thank you

Email: williamsa@rsc.org
Blog: www.chemconnector.com
SLIDES: www.slideshare.net/AntonyWilliams

Más contenido relacionado

Similar a Feeding and consuming data to support open notebook science via the chem spider platform

Doing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebDoing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebMathieu d'Aquin
 
Six Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower ScientistsSix Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower ScientistsDavid De Roure
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasMerce Crosas
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioCatalogue
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyCornelius Puschmann
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10Scott Edmunds
 

Similar a Feeding and consuming data to support open notebook science via the chem spider platform (20)

Connecting Chemistry Across the Internet Using ChemSpider
Connecting Chemistry Across the Internet Using ChemSpiderConnecting Chemistry Across the Internet Using ChemSpider
Connecting Chemistry Across the Internet Using ChemSpider
 
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
 
Doing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebDoing Clever Things with the Semantic Web
Doing Clever Things with the Semantic Web
 
Six Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower ScientistsSix Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower Scientists
 
Chemistry in the hand: The delivery of structure databases and spectroscopy g...
Chemistry in the hand: The delivery of structure databases and spectroscopy g...Chemistry in the hand: The delivery of structure databases and spectroscopy g...
Chemistry in the hand: The delivery of structure databases and spectroscopy g...
 
Providing support for JC Bradleys vision of open science using RSC cheminform...
Providing support for JC Bradleys vision of open science using RSC cheminform...Providing support for JC Bradleys vision of open science using RSC cheminform...
Providing support for JC Bradleys vision of open science using RSC cheminform...
 
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
 
RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...
 
Hosting a compound centric community resource for chemistry data
Hosting a compound centric community resource for chemistry dataHosting a compound centric community resource for chemistry data
Hosting a compound centric community resource for chemistry data
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosas
 
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogue
 
Checking, Curating And Qualifying Chemistry
Checking, Curating And Qualifying ChemistryChecking, Curating And Qualifying Chemistry
Checking, Curating And Qualifying Chemistry
 
Qualifying Online Information Resources for Chemists
Qualifying Online Information Resources for ChemistsQualifying Online Information Resources for Chemists
Qualifying Online Information Resources for Chemists
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony Hey
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10
 
ChemSpider Overview Presentation at Special Libraries Association
ChemSpider Overview Presentation at Special Libraries AssociationChemSpider Overview Presentation at Special Libraries Association
ChemSpider Overview Presentation at Special Libraries Association
 

Último

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 

Último (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 

Feeding and consuming data to support open notebook science via the chem spider platform

  • 1. Feeding and consuming data to support Open Notebook Science via the ChemSpider Platform Antony Williams, Jean-Claude Bradley, Andrew Lang and Valery Tkachenko ACS Philadelphia August 2012
  • 2. Setting the Stage  Chemists want access to tools and data  The more capabilities the better  The more data the better  And give us an API with that…  And it should be free…  And constantly updated…  And all data should be Open…  And make it fully Open Source…  And it needs to be on my mobile…
  • 3. Setting the Stage  Chemists have access to tools and data  The more capabilities the better – we’ll see  The more data the better – changing daily  And give us an API with that… - not just one  And it should be free… - sure  And constantly updated… - indeed..please help!  And all data should be Open…- licensing  And make it fully Open Source… - kinda, sorta  And it needs to be on my mobile… - sure
  • 4. Welcome to ChemSpider  5 years, 28 million chemicals, linking 400 data sources and growing daily  Hosted by the Royal Society of Chemistry  An important part of our long term strategic vision  Free to access  With lots/most/all (?) of the functionality necessary to support chemists and Open Notebook Science…
  • 9. Why Use ChemSpider? LINKING OUT
  • 17. Work in Progress – 300k Reactions
  • 18. Storing ONS Reactions  Working with JC Bradley to host ONS reactions  Linking directly back to ONS reactions  What if the links decay?  Host all related ONS data – benefits of Openness!  Future applications for RInChIs
  • 19. What we have been asked for  “Allow us to grab data”  “Let us link”  “Give us web services to integrate”  “Can we store our data with you?”  “Can you give us predictions to validate data?”
  • 20. What we have been asked for  “Allow us to grab data”  “Let us link”  “Give us web services to integrate”  “Can we store our data with you?”  “Can you give us predictions to validate data?”  “Can you build us an ELN?”
  • 21. Simple Linking to ChemSpider  Link using ChemSpiderID  http://www.chemspider.com/1234567
  • 23. Simple Querying Example  http:// www.chemspider.com/Search.aspx?q=InChIKey=XXO
  • 24. Or InChI, or SMILES  http://www.chemspider.com/Search.aspx?q=InChI=1S m1/s1  http://www.chemspider.com/Search.aspx? q=Clc1ccc(cc1)C(O)=C3C(=O)C(=O)N([C@@H]3 c2cccc(F)c2)CCc5c4ccccc4nc5
  • 25. Better to provide APIs….
  • 28. MANY Web Services for integration
  • 29. Feeding ONS Data into ChemSpider  ONS data can be deposited into ChemSpider and linked out to the ONS pages  Simply deposit structure(s) and links
  • 30.
  • 31. Feeding ONS Data into ChemSpider  ONS Solubility Challenge
  • 32. Feeding ONS Data into ChemSpider
  • 33. So isn’t ONS all about ELNs?  Open Notebook Science is about  Making records of research publicly available online as it is recorded  ONS is enabled by software tools and platforms  Keep the notebook of the researcher online with all raw and processed data as it is generated (close to or near real time)  Notebooks as Wikis, Commercial or Free ELNs published to the web (choose public/private – what data to expose)
  • 34. Feeding ELN Data into ChemSpider  Integrate e-Notebooks into ChemSpider  IDBS e-Workbook plug-in allows direct deposition of chemical structures  Can be extended to more ELN content  Spectra  Reactions  Properties etc.  Integration Video http://tinyurl.com/9xnprqr
  • 35. Feeding ELN Data into ChemSpider
  • 36. How much data is lost?  How many reactions in a thesis never get published?  How many spectra of common materials could be shared?  How many properties are measured and lost?  What stands in the way of sharing?  Is it technology?  Permissions? “The Boss”, Licensing?  And yes – there are data quality issues but there is algorithmic checking and data curation to help
  • 37. What could the future look like?  “Publicly funded” research data flows onto the web  Licensing is clear and NOT a challenge  Machines are picking up data and depositing  EXAMPLE project – Any interest?  Put your spectra/structure in folders (Dropbox)  ChemSpider robot scoops, processes and deposits – opportunity with JC Bradley  While processing also predicts spectra and compares for validation
  • 38. Leaving the Stage  Chemists have access to tools and data  The more capabilities the better – what’s missing?  The more data the better – anyone want to share?  And give us an API with that… - ask us for help  And it should be free… - it is  And constantly updated… - help annotate/curate  And all data should be Open…- licensing  And make it fully Open Source… - book chapter  And it needs to be on my mobile… - it is
  • 40. New URLs to try out  ChemSpider Reactions: www.chemspider.com/reactions  ChemSpider Validation and Standardization Platform: www.chemspider.com/cvsp  ChemSpider Google: www.chemspider.com/google
  • 43. Acknowledgments  RSC Cheminformatics team  JC Bradley’s lab  Daniel Lowe – reactions  Commercial Software – GGA Software, ACD/Labs, OpenEye  Open Source Components
  • 44. Thank you Email: williamsa@rsc.org Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams