Eureka Research Workbench: A Semantic Approach to an Open Source Electronic Laboratory Notebook

Scientists are looking for ways to leverage Web 2.0 technologies in the research laboratory, and as a consequence a number of approaches to web-based electronic notebooks are being evaluated. In this presentation I discuss the Eureka Research Workbench, an electronic laboratory notebook built on semantic technology and XML. Using this approach, the context of the information recorded in the laboratory can be captured and searched along with the data itself. A discussion of the current system is presented along with the next planned development of the framework and long-term plans relative to linked open data. Presented at the 246th American Chemical Society Meeting in Indianapolis, IN, USA on September 12th, 2013.

Eureka Research Workbench:
A Semantic Approach
to an Open Source
Electronic Laboratory Notebook
Stuart J. Chalk
Department of Chemistry
University of North Florida
schalk@unf.edu
2013 Fall ACS Meeting – CINF Paper 116

Big Data
Electronic Notebooks
The Eureka Research Workbench
Experiment Markup Language
ExptML Schema and Files
Semantic Data and Ontologies
File Storage
Eureka Interface
Web Interface
Conclusion
Outline

Current buzzword for “bring together lots of data and
build tools on top to extract knowledge”
This is great, except…
How do we do that for science?
Platform, data structures, and exchange protocols to
capture, identify, and disseminate scientific information
Research Data Alliance (https://rd-alliance.org/)
http://www.nytimes.com/2013/08/13/science/how-to-share-scientific-data.html
Big Data

Electronic Notebooks (ELNs) very common in industry
Not appropriate for academics doing science
Expensive
Overly complicated (regulations)
Data sharing not easy
We need an electronic notebook for faculty/students
LabArchives http://www.labarchives.com
eCAT http://www.researchspace.com/electronic-lab-notebook/
LabTrove http://www.labtrove.org/
Dryad data publishing http://datadryad.org/
Electronic Notebooks

Started in 2006 as an offshoot of getting involved in the
Analytical Information Markup Language (AnIML) project
through ASTM
No way to store all research notes in a digital format
No way to capture the workflow of scientists
Realized writing in a lab notebook is equivalent to
“multi-type” blogging in the digital world
How to capture information? Many datatypes -> ExptML
How to store files and make them available through web
interface? (Fedora-Commons)
How to link data together? RDF (in Fedora-Commons)
Eureka Research Workbench

A specification (written in XML) that describes
different types of information recorded during the
scientific process (http://exptml.sourceforge.net)
Many datatypes (will expand…)
Experiment Markup Language (ExptML)
Annotation
Api
Calculation
Chemical
Citation
Communication
Customer
Data
Dataset
Definition
Element
Equipment
Event
Experiment
Group
Project
Protocol
Quote
Report
Result
Sample
Solution
Space
Specimen
Substance
Task
Template
Timeline
User
Vendor

ExptML Chemical Schema

ExptML Chemical Instance
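
The schema and instance slides were images and did not survive in this transcript. As a rough illustration only, the sketch below builds and reads back a small XML record for a chemical. Every element and attribute name in it (chemical, id, name, formula) is hypothetical and is not taken from the actual ExptML schema; only the identifier style (exptml:chm1) appears elsewhere in the talk.

```python
import xml.etree.ElementTree as ET

# NOTE: all element/attribute names here are hypothetical placeholders;
# the real ExptML chemical schema is not reproduced in this transcript.


def build_chemical(pid: str, name: str, formula: str) -> ET.Element:
    """Build a minimal XML record describing one chemical."""
    chemical = ET.Element("chemical", attrib={"id": pid})
    ET.SubElement(chemical, "name").text = name
    ET.SubElement(chemical, "formula").text = formula
    return chemical


def read_chemical(xml_text: str) -> dict:
    """Parse the record back into a plain dictionary."""
    root = ET.fromstring(xml_text)
    return {
        "id": root.get("id"),
        "name": root.findtext("name"),
        "formula": root.findtext("formula"),
    }


# Round trip: build the record, serialize it, then parse it again.
xml_text = ET.tostring(
    build_chemical("exptml:chm1", "Sodium chloride", "NaCl"),
    encoding="unicode",
)
record = read_chemical(xml_text)
```

The point of the round trip is that a record stored as XML stays machine-readable, so a web interface can pull individual fields back out of the file.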

Files that represent the data need to be ‘linked’ together to
allow the user to see the context of the data
The ‘Semantic Web’ is a big push to contextualize data
Proposed storage of ‘relationships’ between data is the
Resource Description Framework (RDF - http://www.w3.org/RDF/)
Semantic Data
From http://www.w3.org/TR/2004/REC-rdf-primer-20040210/

In computer science and information science, an ontology
“formally represents knowledge as a set of concepts within
a domain, and the relationships between those concepts. It
can be used to model a domain and support reasoning about
concepts.”*
In essence, an ontology allows us to define the
relationships and assertions about concepts
For substances represented in ExptML we define
isSubstance (assertion)
hasSubstance
isSubstanceOf
ExptML Ontology
*https://en.wikipedia.org/wiki/Ontology_(information_science)
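
To make the relationship idea concrete, here is a minimal sketch of subject/predicate/object triples using the hasSubstance and isSubstanceOf relationships named above. It uses plain Python tuples rather than an RDF library. Treating the two predicates as inverses of each other is an assumption made for illustration, and the identifiers exptml:sol1, exptml:sub1 and exptml:sub2 are hypothetical.

```python
from typing import Iterable, NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Assumption: hasSubstance and isSubstanceOf are inverse relationships.
INVERSE = {"hasSubstance": "isSubstanceOf", "isSubstanceOf": "hasSubstance"}


def with_inverses(triples: Iterable[Triple]) -> set:
    """Return the triples plus the inverse statement implied by each one."""
    triples = list(triples)
    result = set(triples)
    for t in triples:
        inverse = INVERSE.get(t.predicate)
        if inverse is not None:
            result.add(Triple(t.obj, inverse, t.subject))
    return result


def related(triples: Iterable[Triple], subject: str, predicate: str) -> list:
    """List every object linked to `subject` through `predicate`."""
    return sorted(
        t.obj for t in triples if t.subject == subject and t.predicate == predicate
    )


# Hypothetical identifiers: one solution that contains two substances.
graph = with_inverses(
    [
        Triple("exptml:sol1", "hasSubstance", "exptml:sub1"),
        Triple("exptml:sol1", "hasSubstance", "exptml:sub2"),
    ]
)
solutions_using_sub1 = related(graph, "exptml:sub1", "isSubstanceOf")
```

Recording only one direction and deriving the other keeps the stored relationships small while still letting a search start from either the solution or the substance.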

Digital repository software for creating and managing
online digital libraries
Stores the ExptML files
Stores any other files (PDFs, Images, Word etc.)
Stores relationships as RDF
Version control
Checksumming
Built in search of content and relationships
Fedora Commons
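
Checksumming is what lets a repository detect that a stored file has changed or been corrupted. The sketch below shows the general technique with Python's hashlib. It is not Fedora's own implementation, and the choice of SHA-256 as the default algorithm is an assumption.

```python
import hashlib


def checksum(data: bytes, algorithm: str = "sha256") -> str:
    """Return the hex digest of `data` using the named hash algorithm."""
    return hashlib.new(algorithm, data).hexdigest()


def is_intact(data: bytes, expected: str, algorithm: str = "sha256") -> bool:
    """Compare the current digest of `data` with the digest stored earlier."""
    return checksum(data, algorithm) == expected


# Record a digest when the file is stored, then verify it on retrieval.
original = b"<chemical/>"
stored_digest = checksum(original)
unchanged = is_intact(original, stored_digest)
tampered = is_intact(b"<chemical />", stored_digest)
```

Even a one-character change to the file produces a different digest, so the comparison fails for the altered copy.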

Fedora-Commons treats each ExptML file as an object
In the definition of a Fedora object the file is just one
stream of many. By default each object also has a “DC”
stream of metadata and a “RELS-EXT” stream of
relationships
Each Fedora object can have any number of additional
streams for
Paper PDFs, product/sample pictures, original file formats (if a
conversion has been done)
Video, audio, anything
You can export individual streams or the whole Fedora
object with streams binary encoded (Sharing/archiving)
File Storage
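
The object-with-streams idea can be sketched as a small data structure: one object identifier, the two default streams (DC and RELS-EXT), and any number of extra streams. This is an illustrative model only, not the Fedora-Commons API. The class names, the EXPTML stream id and the MIME types are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Stream:
    stream_id: str
    mime_type: str
    content: bytes = b""


def default_streams() -> dict:
    """Streams every object starts with (MIME types are assumed here)."""
    return {
        "DC": Stream("DC", "text/xml"),
        "RELS-EXT": Stream("RELS-EXT", "application/rdf+xml"),
    }


@dataclass
class RepositoryObject:
    pid: str
    streams: dict = field(default_factory=default_streams)

    def add_stream(self, stream: Stream) -> None:
        """Attach an additional stream; ids must be unique within the object."""
        if stream.stream_id in self.streams:
            raise ValueError(f"stream {stream.stream_id!r} already exists")
        self.streams[stream.stream_id] = stream

    def export(self, stream_id: str) -> bytes:
        """Return the raw content of a single stream."""
        return self.streams[stream_id].content


# Hypothetical usage: the ExptML file is one stream among several.
obj = RepositoryObject("exptml:chm1")
obj.add_stream(Stream("EXPTML", "text/xml", b"<chemical/>"))
obj.add_stream(Stream("PAPER", "application/pdf", b"%PDF-"))
stream_ids = sorted(obj.streams)
```

Keeping the original file, converted formats and supporting media under one identifier is what makes exporting a single stream or the whole object straightforward.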

So, finally to the Eureka Research Workbench!
Web interface written in PHP using the CakePHP Framework
Communicates with Fedora-Commons API to create,
retrieve, update and delete (CRUD) ExptML and other files
Representational State Transfer (REST) format for URLs
E.g. http://web.server/chemicals/view/exptml:chm1
Allows for searching of all files in Fedora
Can also search based on relationships
Can extract data out of XML files
Can gather data from other websites (via API controller) and
add it to ExptML files
Eureka Interface
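
The example URL above follows a controller/action/identifier pattern. As a minimal sketch of how such a URL breaks down, the function below splits the path into those three parts. The function name and the returned dictionary keys are hypothetical, and the sketch is in Python rather than the PHP used by the application itself.

```python
from urllib.parse import urlparse


def parse_route(url: str) -> dict:
    """Split a /controller/action/pid style URL into its three parts."""
    parts = urlparse(url).path.strip("/").split("/")
    if len(parts) != 3:
        raise ValueError(f"expected /controller/action/pid, got {url!r}")
    controller, action, pid = parts
    return {"controller": controller, "action": action, "pid": pid}


# The example URL from the slide: view the chemical with id exptml:chm1.
route = parse_route("http://web.server/chemicals/view/exptml:chm1")
```

Because the datatype, the operation and the object identifier are all visible in the URL, each record in the notebook has a stable address that can be linked or shared.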

Eureka Website - Group
Only datatypes related to the
research group show up on the left


Eureka Website – Lab Bench
Types of information that are things you
would have on your lab bench are on the left

Clicking on the “Add” menu on the right
allows you to add a comment to this solution

Eureka Website – Notebook
Typical things we record
in our notebook


Eureka Website - Laboratory
Information about resources that
you use in your laboratory

The “Rel” menu shows you the information related to this instrument

Eureka Website - Library
Papers and protocols
related to your work

You can add the PDF of the paper to the citation.
The contents of the PDF are searchable in the system

Eureka Website - Stockroom
Chemical and Substance information is related together


Robust markup language for representing science data
(ExptML)
Reliable storage system for ExptML files (Fedora)
Method for storage of relationships (RDF in Fedora)
Web application to create ExptML files (Eureka)
TODO
Provide web functionality to process data
Provide mechanism for sharing of data (different levels)
Integration into the RDA model for sharing research data
Get the word out and test system with many users
Conclusion

References
Eureka – http://sourceforge.net/projects/eureka
Fedora-Commons – http://fedora-commons.org
XML – http://www.w3.org/standards/xml
ExptML – http://exptml.sourceforge.net/
JSON – http://www.json.org/
UnitsML – http://unitsml.nist.gov/
RDF – http://www.w3.org/RDF/
CIR – http://cactus.nci.nih.gov/chemical/structure
RDA – http://rd-alliance.org


The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3

The Ultimate Guide to Choosing WordPress Pros and Cons

The Ultimate Guide to Choosing WordPress Pros and Cons

The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech

Advanced Computer Architecture – An Introduction

Advanced Computer Architecture – An Introduction

Advanced Computer Architecture – An IntroductionDilum Bandara

SIP trunking in Janus @ Kamailio World 2024

SIP trunking in Janus @ Kamailio World 2024

SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3

How to write a Business Continuity Plan

How to write a Business Continuity Plan

How to write a Business Continuity PlanDatabarracks

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES

SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos

"Debugging python applications inside k8s environment", Andrii Soldatenko

"Debugging python applications inside k8s environment", Andrii Soldatenko

"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays

unit 4 immunoblotting technique complete.pptx

unit 4 immunoblotting technique complete.pptx

unit 4 immunoblotting technique complete.pptxBkGupta21

The State of Passkeys with FIDO Alliance.pptx

The State of Passkeys with FIDO Alliance.pptx

The State of Passkeys with FIDO Alliance.pptxLoriGlavin3

Unleash Your Potential - Namagunga Girls Coding Club

Unleash Your Potential - Namagunga Girls Coding Club

Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar

Unraveling Multimodality with Large Language Models.pdf

Unraveling Multimodality with Large Language Models.pdf

Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro

Último (20)

Commit 2024 - Secret Management made easy

Commit 2024 - Secret Management made easy

Commit 2024 - Secret Management made easy

"ML in Production",Oleksandr Bagan

"ML in Production",Oleksandr Bagan

"ML in Production",Oleksandr Bagan

Digital Identity is Under Attack: FIDO Paris Seminar.pptx

Digital Identity is Under Attack: FIDO Paris Seminar.pptx

Digital Identity is Under Attack: FIDO Paris Seminar.pptx

Generative AI for Technical Writer or Information Developers

Generative AI for Technical Writer or Information Developers

Generative AI for Technical Writer or Information Developers

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

The Ultimate Guide to Choosing WordPress Pros and Cons

The Ultimate Guide to Choosing WordPress Pros and Cons

The Ultimate Guide to Choosing WordPress Pros and Cons

Advanced Computer Architecture – An Introduction

Advanced Computer Architecture – An Introduction

Advanced Computer Architecture – An Introduction

SIP trunking in Janus @ Kamailio World 2024

SIP trunking in Janus @ Kamailio World 2024

SIP trunking in Janus @ Kamailio World 2024

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

How to write a Business Continuity Plan

How to write a Business Continuity Plan

How to write a Business Continuity Plan

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

"Debugging python applications inside k8s environment", Andrii Soldatenko

"Debugging python applications inside k8s environment", Andrii Soldatenko

"Debugging python applications inside k8s environment", Andrii Soldatenko

unit 4 immunoblotting technique complete.pptx

unit 4 immunoblotting technique complete.pptx

unit 4 immunoblotting technique complete.pptx

The State of Passkeys with FIDO Alliance.pptx

The State of Passkeys with FIDO Alliance.pptx

The State of Passkeys with FIDO Alliance.pptx

Unleash Your Potential - Namagunga Girls Coding Club

Unleash Your Potential - Namagunga Girls Coding Club

Unleash Your Potential - Namagunga Girls Coding Club

Unraveling Multimodality with Large Language Models.pdf

Unraveling Multimodality with Large Language Models.pdf

Unraveling Multimodality with Large Language Models.pdf

Eureka Research Workbench: A Semantic Approach to an Open Source Electronic Laboratory Notebook

1. Eureka Research Workbench: A Semantic Approach to an Open Source Electronic Laboratory Notebook Stuart J. Chalk Department of Chemistry University of North Florida schalk@unf.edu 2013 Fall ACS Meeting – CINF Paper 116

2. Outline: Big Data; Electronic Notebooks; The Eureka Research Workbench; Experiment Markup Language; ExptML Schema and Files; Semantic Data and Ontologies; File Storage; Eureka Interface; Web Interface; Conclusion

3. Big Data: Current buzzword for “bring together lots of data and build tools on top to extract knowledge.” This is great, except: how do we do that for science? That requires a platform, data structures, and exchange protocols to capture, identify, and disseminate scientific information. See the Research Data Alliance (https://rd-alliance.org/) and http://www.nytimes.com/2013/08/13/science/how-to-share-scientific-data.html

4. Electronic Notebooks: Electronic notebooks (ELNs) are very common in industry, but they are not appropriate for academics doing science: they are expensive, overly complicated (regulations), and data sharing is not easy. We need an electronic notebook for faculty and students. Existing options: LabArchives (http://www.labarchives.com); eCAT (http://www.researchspace.com/electronic-lab-notebook/); LabTrove (http://www.labtrove.org/); Dryad data publishing (http://datadryad.org/)

5. Eureka Research Workbench: Started in 2006 as an offshoot of getting involved in the Analytical Information Markup Language (AnIML) project through ASTM. There was no way to store all research notes in a digital format and no way to capture the workflow of scientists. Realized that writing in a lab notebook is equivalent to “multi-type” blogging in the digital world. How to capture information? Many datatypes -> ExptML. How to store files and make them available through a web interface? Fedora-Commons. How to link data together? RDF (in Fedora-Commons).

6. Experiment Markup Language (ExptML): A specification (written in XML) that describes different types of information recorded during the scientific process (http://exptml.sourceforge.net). Many datatypes (will expand…): Annotation, Api, Calculation, Chemical, Citation, Communication, Customer, Data, Dataset, Definition, Element, Equipment, Event, Experiment, Group, Project, Protocol, Quote, Report, Result, Sample, Solution, Space, Specimen, Substance, Task, Template, Timeline, User, Vendor

7. ExptML Chemical Schema

8. ExptML Chemical Schema

9. ExptML Chemical Instance

10. ExptML Chemical Instance

11. Semantic Data: Files that represent the data need to be ‘linked’ together to allow the user to see the context of the data. The ‘Semantic Web’ is a big push to contextualize data. The proposed storage format for ‘relationships’ between data is the Resource Description Framework (RDF - http://www.w3.org/RDF/). Figure from http://www.w3.org/TR/2004/REC-rdf-primer-20040210/

12. ExptML Ontology: In computer science, an ontology “formally represents knowledge as a set of concepts within a domain, and the relationships between those concepts. It can be used to model a domain and support reasoning about concepts.”* In essence, an ontology allows us to define the relationships between, and assertions about, concepts. For substances represented in ExptML we define: isSubstance (assertion), hasSubstance, isSubstanceOf. *https://en.wikipedia.org/wiki/Ontology_(information_science)
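As a rough illustration of how such relationships can be held as subject-predicate-object statements, here is a minimal Python sketch. It does not use the ExptML ontology files or Fedora; the identifier `exptml:sub1` is hypothetical, and treating `hasSubstance` and `isSubstanceOf` as inverses of each other is an assumption made for the example, not something the slides state.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF-style statement: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str


# Assumption for this sketch: the two predicates are inverses of each other.
INVERSE = {"hasSubstance": "isSubstanceOf", "isSubstanceOf": "hasSubstance"}


def with_inverses(triples):
    """Return the stated triples plus the inverse of each one that has a known inverse."""
    result = set(triples)
    for t in triples:
        inverse_predicate = INVERSE.get(t.predicate)
        if inverse_predicate is not None:
            result.add(Triple(t.obj, inverse_predicate, t.subject))
    return result


def related(triples, subject, predicate):
    """List the objects linked to `subject` by `predicate`, sorted for stable output."""
    return sorted(t.obj for t in triples if t.subject == subject and t.predicate == predicate)


# Only one direction is stated; exptml:sub1 is a hypothetical substance record.
stated = [Triple("exptml:chm1", "hasSubstance", "exptml:sub1")]
graph = with_inverses(stated)

print(related(graph, "exptml:sub1", "isSubstanceOf"))
```

Storing only one direction and deriving the other keeps the recorded data small while still letting a search start from either end of the relationship.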

13. ExptML Ontology

14. Fedora Commons: Digital repository software for creating and managing online digital libraries. Stores the ExptML files. Stores any other files (PDFs, images, Word documents, etc.). Stores relationships as RDF. Provides version control, checksumming, and built-in search of content and relationships.

15. File Storage: Fedora-Commons treats each ExptML file as an object. In the definition of a Fedora object, the file is just one stream of many. By default each object also has a “DC” stream of metadata and a “RELS-EXT” stream of relationships. Each Fedora object can have any number of additional streams: paper PDFs, product/sample pictures, original file formats (if a conversion has been done), video, audio, anything. You can export individual streams or the whole Fedora object with its streams binary encoded (for sharing/archiving).
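The object-with-streams idea can be modelled in a few lines of Python. This is a sketch of the concept only, not the Fedora-Commons API: the class names, the `EXPTML` stream identifier, the placeholder content, and the use of base64 for the "binary encoded" export are all assumptions made for illustration.

```python
import base64
from dataclasses import dataclass, field


@dataclass
class Datastream:
    """One stream inside a repository object."""
    ds_id: str
    mime_type: str
    content: bytes


@dataclass
class RepositoryObject:
    """Hypothetical stand-in for a Fedora object: an identifier plus named streams."""
    pid: str
    streams: dict = field(default_factory=dict)

    def __post_init__(self):
        # Every object starts with a metadata stream and a relationships stream.
        self.streams.setdefault("DC", Datastream("DC", "text/xml", b""))
        self.streams.setdefault("RELS-EXT", Datastream("RELS-EXT", "application/rdf+xml", b""))

    def add_stream(self, ds_id, mime_type, content):
        """Attach or overwrite a stream (ExptML file, PDF, image, ...)."""
        self.streams[ds_id] = Datastream(ds_id, mime_type, content)

    def export(self):
        """Export the whole object with stream content base64-encoded (assumed encoding)."""
        return {
            "pid": self.pid,
            "streams": {
                ds_id: base64.b64encode(ds.content).decode("ascii")
                for ds_id, ds in self.streams.items()
            },
        }


obj = RepositoryObject("exptml:chm1")
obj.add_stream("EXPTML", "text/xml", b"<chemical/>")    # placeholder XML, not real ExptML
obj.add_stream("PDF", "application/pdf", b"%PDF-...")   # placeholder bytes
exported = obj.export()
print(sorted(exported["streams"]))
```

Keeping every related file as a stream on one object means the ExptML record, its metadata, its relationships, and any attachments travel together when the object is exported.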

16. File Storage

17. Eureka Interface: So, finally, to the Eureka Research Workbench! Web interface written in PHP using the CakePHP framework. Communicates with the Fedora-Commons API to create, retrieve, update, and delete (CRUD) ExptML and other files. Uses Representational State Transfer (REST) style URLs (e.g. http://web.server/chemicals/view/exptml:chm1). Allows searching of all files in Fedora. Can also search based on relationships. Can extract data out of XML files. Can gather data from other websites (via the API controller) and add it to ExptML files.
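The example URL follows the usual CakePHP pattern of controller, then action, then parameters. The sketch below splits such a URL into those parts in Python; it is an illustration of the URL layout only, not Eureka's routing code, and the function name `parse_route` is hypothetical.

```python
from urllib.parse import urlparse


def parse_route(url):
    """Split a /controller/action/param... URL path into its parts."""
    path = urlparse(url).path
    parts = [segment for segment in path.split("/") if segment]
    if len(parts) < 2:
        raise ValueError(f"expected /controller/action[/params] in {url!r}")
    controller, action, *params = parts
    return {"controller": controller, "action": action, "params": params}


route = parse_route("http://web.server/chemicals/view/exptml:chm1")
print(route)
```

Here `chemicals` names the datatype, `view` the operation, and `exptml:chm1` the identifier of the record in the repository, so the same layout can address any datatype and any CRUD action.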

18. Eureka Website - Group: Only datatypes related to the research group show up on the left.

19. Eureka Website - Lab Bench: Types of information that are things you would have on your lab bench are on the left. Clicking on the “Add” menu on the right allows you to add a comment to this solution.

20. Eureka Website - Notebook: Typical things we record in our notebook.

21. Eureka Website - Laboratory: Information about resources that you use in your laboratory. The “Rel” menu shows you the information related to this instrument.

22. Eureka Website - Library: Papers and protocols related to your work. You can add the PDF of the paper to the citation. The contents of the PDF are searchable in the system.

23. Eureka Website - Stockroom: Chemical and Substance information is related together.

24. Conclusion: Robust markup language for representing science data (ExptML). Reliable storage system for ExptML files (Fedora). Method for storage of relationships (RDF in Fedora). Web application to create ExptML files (Eureka). TODO: Provide web functionality to process data. Provide a mechanism for sharing data (at different levels). Integrate into the RDA model for sharing research data. Get the word out and test the system with many users.

25. References: Eureka - http://sourceforge.net/projects/eureka; Fedora-Commons - http://fedora-commons.org; XML - http://www.w3.org/standards/xml; ExptML - http://exptml.sourceforge.net/; JSON - http://www.json.org/; UnitsML - http://unitsml.nist.gov/; RDF - http://www.w3.org/RDF/; CIR - http://cactus.nci.nih.gov/chemical/structure; RDA - http://rd-alliance.org