SlideShare una empresa de Scribd logo
1 de 22
A Workflow System for Virtual
    Screening in Cancer
     Chemoprevention
 Kannas C. C., Achilleos K. G., Antoniou Z., Kalvari I., Kirmitzoglou
        I., Nicolaou C. A., Promponas V. J., Pattichis C. S.

                         13th November 2012
      IEEE 12th International Conference on BioInformatics &
                           BioEngineering
Outline
       About LiSIs platform
             Objectives
             What is LiSIs?
             Virtual Screening & Scientific Workflow Management Systems
       LiSIs platform
             Virtual Screening Process Template
             Docking Models
             Predictive Models
             3rd Party Tools
             The GRANATUM-LiSIs Platform
       Concluding Remarks
       Acknowledgement
       Questions?
IEEE 12th International Conference on BioInformatics &
                                                            2             13th November 2012
BioEngineering
About LiSIs platform




IEEE 12th International Conference on BioInformatics &
                                                         3   13th November 2012
BioEngineering
Objectives

       Provide a set of tools to create virtual screening process for
       the discovery of novel agents with desired properties.

       Provide a set of tools to create data-driven models designed
       to predict biochemical properties of interest.

       Provide an environment to create, update, store and share
       Scientific Workflows, that is accessible via a web interface.



IEEE 12th International Conference on BioInformatics &
                                                             4        13th November 2012
BioEngineering
What is LiSIs?

          Life Science Informatics
          Scientific Workflow Management System
               Computational Environment for Virtual Screening.
          Cancer Chemoprevention Research
               Computational Tools borrowed from Drug Discovery Process.
          GRANATUM project (http://www.granatum.org)
               Partially funded by the European Commission under the
               Seventh Framework Programme in the area of Virtual
               Physiological Human (ICT-2009.5.3).

IEEE 12th International Conference on BioInformatics &
                                                         5             13th November 2012
BioEngineering
Virtual Screening & Scientific
              Workflow Management Systems
        Virtual Screening
              Computational counterpart of biological screening.
              Goal: decrease the number of compounds physically screened.
        Scientific Workflow Management Systems (SWMS)
              Computational environments which facilitate the design and
              execution of computational experiments (workflows).
              Known SWMS:
                   Taverna (http://www.taverna.org.uk/)
                   KNIME (http://www.knime.org/ )
                   Galaxy (http://galaxy.psu.edu/)

IEEE 12th International Conference on BioInformatics &
                                                         6           13th November 2012
BioEngineering
Available @ http://lisis.cs.ucy.ac.cy


                                                LiSIs platform




IEEE 12th International Conference on BioInformatics &
                                                              7                     13th November 2012
BioEngineering
Virtual Screening Process
                                 Template

                                                                                                 Output
                                                                                Postprocessing   •Storage
                                                                                                 •Visualization
                                                                                •Cleaning
                                                                                •Formatting
                                                          Processing
                                                                                •Merging
                                                          •Descriptor Filters
                                                          •Similarity Search
                                          Preprocessing   •Substructure
                                                           Search
                                      •File format        •Docking Models
                                       transformations
                                                          •Predictive Models
                                      •Property
                                       Normalizer
                      Input           •Descriptors
                      •GRANATUM        Calculation
                        platform File •Fragmentation
                        Loader        •Coordinates
                      •File Readers    Calculation
                                      •Protein Cleaner
IEEE 12th International Conference on BioInformatics &
                                                                 8                                       13th November 2012
BioEngineering
Docking Model Preparation
                                                             Protein (pdb file)



                                                         Add Hydrogens




       Protein Preparation:                              Remove H2O               Docking
             Input                                                                 Model
                  Protein.pdb file                       Calculate Pocket
             Process                                     Coordinates

                  Remove water molecules         Separate reference
                  Add hydrogen atoms             ligand

                  Calculate binding domain coordinates
                  Clean protein from co-crystalized molecule
             Output
                  Modified protein.pdb file

IEEE 12th International Conference on BioInformatics &
                                                               9                            13th November 2012
BioEngineering
Predictive Model Preparation
                                                         Chemical
                                                           data
                                                                      Algorithm
                Biological
                                                                      •Algorithm
                   data                                                parameters


                                                         Predictive
                                                           Model

IEEE 12th International Conference on BioInformatics &
                                                              10                    13th November 2012
BioEngineering
Predictive Model Preparation



                                                          Model
                                                                                                       Predictions
 •Chembl                                                 Training         •Cross-Validation
 •Pubchem                                                                 •Independent
                                          •Feature extraction /            validation using    •Predict biological
 •Literature
                                           selection                       separate datasets    properties of
                                          •Various algorithms                                   compounds
                                           (kNN, SVM, Random                                   •E.g. Toxicity, ER-
                                           Forest, Decision Trees)                              binding activity
               Data                                                                Model
             gathering                                                            Validation




IEEE 12th International Conference on BioInformatics &
                                                                     11                                   13th November 2012
BioEngineering
3rd Party Tools




IEEE 12th International Conference on BioInformatics &
                                                         12       13th November 2012
BioEngineering
Galaxy

       Galaxy is an open, web-based platform for data intensive
       biomedical research.
       Free public server.
       Deploy locally.
       Deploy in a Cluster/Grid environment
       Deploy in the cloud (CloudMan Amazon EC2).

       http://galaxy.psu.edu/


IEEE 12th International Conference on BioInformatics &
                                                           13     13th November 2012
BioEngineering
Galaxy Integration Methodology

           Galaxy configuration:
                Web Server(s) (Handle Users Requests)
                Job Manager (Job Management)
                Job Handler(s) (Job Execution)
           Galaxy Tools are wrappers around command line
           applications.
           Galaxy runs command line jobs at the background.




IEEE 12th International Conference on BioInformatics &
                                                         14   13th November 2012
BioEngineering
RDKit

       A collection of cheminformatics and machine-learning
       software written in C++ and Python.
       The core algorithms and data structures are written in C++.
       Wrappers are provided to use the toolkit from either Python
       or Java.
       Additionally, the RDKit distribution includes a PostgreSQL-
       based cartridge that allows molecules to be stored in
       relational database and retrieved via substructure and
       similarity searches.
       Features Overview
       RDKit Home Page
IEEE 12th International Conference on BioInformatics &
                                                           15    13th November 2012
BioEngineering
The R Project
       R is a language and environment for statistical computing and
       graphics.
       R provides a wide variety of statistical (linear and nonlinear
       modelling, classical statistical tests, time-series
       analysis, classification, clustering, ...) and graphical techniques, and
       is highly extensible.
       One of R's strengths is the ease with which well-designed
       publication-quality plots can be produced, including mathematical
       symbols and formulae where needed. Great care has been taken
       over the defaults for the minor design choices in graphics, but the
       user retains full control.

       The R Project

IEEE 12th International Conference on BioInformatics &
                                                         16             13th November 2012
BioEngineering
AutoDock Vina
       AutoDock Vina is a new open-source program for drug
       discovery, molecular docking and virtual screening, offering multi-
       core capability, high performance and enhanced accuracy and ease
       of use.
       Features:
             Accuracy
             Compatibility with AutoDock Tools
             Ease of Use
             Flexible Side Chains
                  Some receptor side chains can be chosen to be treated as flexible during
                  docking.
             Speed
       AutoDock Vina

IEEE 12th International Conference on BioInformatics &
                                                         17                      13th November 2012
BioEngineering
Available @ http://lisis.cs.ucy.ac.cy


                                  LiSIs platform Demo




IEEE 12th International Conference on BioInformatics &
                                                             18                     13th November 2012
BioEngineering
LiSIs platform Live Demo!!!




                                                http://lisis.cs.ucy.ac.cy




IEEE 12th International Conference on BioInformatics &
                                                            19              13th November 2012
BioEngineering
Concluding Remarks

       Fill a current void in chemoprevention, and in general life
       sciences, research.

       Enabling researchers to utilize state of the art computational
       techniques to search for the discovery of novel agents with
       desired properties.




IEEE 12th International Conference on BioInformatics &
                                                         20     13th November 2012
BioEngineering
Acknowledgement

  Computer Science, UCY                                       Cancer Biology and
  Prof. C. S. Pattichis                                       Chemoprevention
  C. A. Nicolaou (PhD)                                        laboratory at Department of
                                                              Biological Sciences, UCY.
  Z. Antoniou
                                                              Cancer Chemoprevention
  K. G. Achilleos                                             and Epigenomics Workgroup
  Biological Sciences, UCY                                    at German Cancer Research
  Dr. V. J. Promponas                                         Center.
  I. Kirmitzoglou                                             Members of the GRANATUM
  I. Kalvari                                                  consortium.

IEEE 12th International Conference on BioInformatics &
                                                         21                       13th November 2012
BioEngineering
Questions?




IEEE 12th International Conference on BioInformatics &
                                                             22       13th November 2012
BioEngineering

Más contenido relacionado

Similar a Granatum_LiSIs_BIBE_2012_presentation_v4.0

Pistoia presentation bio it-worldexpo 21april2010
Pistoia presentation   bio it-worldexpo 21april2010Pistoia presentation   bio it-worldexpo 21april2010
Pistoia presentation bio it-worldexpo 21april2010Nick Lynch
 
Development, Confusion and Exploration of Honeypot Technology
Development, Confusion and Exploration of Honeypot TechnologyDevelopment, Confusion and Exploration of Honeypot Technology
Development, Confusion and Exploration of Honeypot TechnologyAntiy Labs
 
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...SEAD
 
Clincial Data Management
Clincial Data ManagementClincial Data Management
Clincial Data ManagementDeepak Yadav
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and SharingJisc
 
Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012
Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012
Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012Shirshanka Das
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureOdinot Stanislas
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...OSTHUS
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarshiptsbbbu
 
Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...
Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...
Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...Brian Bissett
 
All Aboard the Databus
All Aboard the DatabusAll Aboard the Databus
All Aboard the DatabusAmy W. Tang
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8dallemang
 
Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack
Sequence Services Phase 2 Webinar Series: Constellation Technology and GenestackSequence Services Phase 2 Webinar Series: Constellation Technology and Genestack
Sequence Services Phase 2 Webinar Series: Constellation Technology and GenestackPistoia Alliance
 
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and ActionAlbert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and ActionInstitute for Knowledge Mobilization
 
Leadership Symposium on Digital Media in Healthcare
Leadership Symposium on Digital Media in HealthcareLeadership Symposium on Digital Media in Healthcare
Leadership Symposium on Digital Media in Healthcaresetstanford
 

Similar a Granatum_LiSIs_BIBE_2012_presentation_v4.0 (20)

Pistoia presentation bio it-worldexpo 21april2010
Pistoia presentation   bio it-worldexpo 21april2010Pistoia presentation   bio it-worldexpo 21april2010
Pistoia presentation bio it-worldexpo 21april2010
 
Development, Confusion and Exploration of Honeypot Technology
Development, Confusion and Exploration of Honeypot TechnologyDevelopment, Confusion and Exploration of Honeypot Technology
Development, Confusion and Exploration of Honeypot Technology
 
iRODS
iRODSiRODS
iRODS
 
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
 
Clincial Data Management
Clincial Data ManagementClincial Data Management
Clincial Data Management
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and Sharing
 
Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012
Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012
Databus: LinkedIn's Change Data Capture Pipeline SOCC 2012
 
SEALS @ WWW2012
SEALS @ WWW2012SEALS @ WWW2012
SEALS @ WWW2012
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 
Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...
Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...
Bio-IT World 2009: Adjusting Information Flow from In-house HTS to Global Out...
 
All Aboard the Databus
All Aboard the DatabusAll Aboard the Databus
All Aboard the Databus
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack
Sequence Services Phase 2 Webinar Series: Constellation Technology and GenestackSequence Services Phase 2 Webinar Series: Constellation Technology and Genestack
Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack
 
Knowledge mobilization
Knowledge mobilization Knowledge mobilization
Knowledge mobilization
 
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and ActionAlbert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
 
Trip Report Seattle
Trip Report SeattleTrip Report Seattle
Trip Report Seattle
 
2013-01-17 Research Object
2013-01-17 Research Object2013-01-17 Research Object
2013-01-17 Research Object
 
Leadership Symposium on Digital Media in Healthcare
Leadership Symposium on Digital Media in HealthcareLeadership Symposium on Digital Media in Healthcare
Leadership Symposium on Digital Media in Healthcare
 

Más de Christos Kannas

CKannas PhD Thesis Slides
CKannas PhD Thesis SlidesCKannas PhD Thesis Slides
CKannas PhD Thesis SlidesChristos Kannas
 
CKannas_UK_QSAR_Oct_2015_Poster_Port
CKannas_UK_QSAR_Oct_2015_Poster_PortCKannas_UK_QSAR_Oct_2015_Poster_Port
CKannas_UK_QSAR_Oct_2015_Poster_PortChristos Kannas
 
CKannas_ACS_MOST_Transfomation_Based_DnD_20150818
CKannas_ACS_MOST_Transfomation_Based_DnD_20150818CKannas_ACS_MOST_Transfomation_Based_DnD_20150818
CKannas_ACS_MOST_Transfomation_Based_DnD_20150818Christos Kannas
 
LiSIs: a Galaxy based platform for Life Sciences Research
LiSIs: a Galaxy based platform for Life Sciences ResearchLiSIs: a Galaxy based platform for Life Sciences Research
LiSIs: a Galaxy based platform for Life Sciences ResearchChristos Kannas
 
Estimate Water Solubility
Estimate Water SolubilityEstimate Water Solubility
Estimate Water SolubilityChristos Kannas
 
LiSIs Poster Presentation
LiSIs Poster PresentationLiSIs Poster Presentation
LiSIs Poster PresentationChristos Kannas
 
GCC2013 LiSIs Lightning Talk
GCC2013 LiSIs Lightning TalkGCC2013 LiSIs Lightning Talk
GCC2013 LiSIs Lightning TalkChristos Kannas
 
20120615_Granatum_COST_v2
20120615_Granatum_COST_v220120615_Granatum_COST_v2
20120615_Granatum_COST_v2Christos Kannas
 
2009 MSc Presentation for Parallel-MEGA
2009 MSc Presentation for Parallel-MEGA2009 MSc Presentation for Parallel-MEGA
2009 MSc Presentation for Parallel-MEGAChristos Kannas
 
9th ITAB 2009 Parallel-MEGA
9th ITAB 2009 Parallel-MEGA9th ITAB 2009 Parallel-MEGA
9th ITAB 2009 Parallel-MEGAChristos Kannas
 

Más de Christos Kannas (14)

CKannas PhD Thesis Slides
CKannas PhD Thesis SlidesCKannas PhD Thesis Slides
CKannas PhD Thesis Slides
 
CKannas_UK_QSAR_Oct_2015_Poster_Port
CKannas_UK_QSAR_Oct_2015_Poster_PortCKannas_UK_QSAR_Oct_2015_Poster_Port
CKannas_UK_QSAR_Oct_2015_Poster_Port
 
CKannas_ACS_MOST_Transfomation_Based_DnD_20150818
CKannas_ACS_MOST_Transfomation_Based_DnD_20150818CKannas_ACS_MOST_Transfomation_Based_DnD_20150818
CKannas_ACS_MOST_Transfomation_Based_DnD_20150818
 
CSC2013_LiSIs_poster
CSC2013_LiSIs_posterCSC2013_LiSIs_poster
CSC2013_LiSIs_poster
 
LiSIs: a Galaxy based platform for Life Sciences Research
LiSIs: a Galaxy based platform for Life Sciences ResearchLiSIs: a Galaxy based platform for Life Sciences Research
LiSIs: a Galaxy based platform for Life Sciences Research
 
Estimate Water Solubility
Estimate Water SolubilityEstimate Water Solubility
Estimate Water Solubility
 
Diversity Filtering
Diversity FilteringDiversity Filtering
Diversity Filtering
 
LiSIs platform
LiSIs platformLiSIs platform
LiSIs platform
 
LiSIs Poster Presentation
LiSIs Poster PresentationLiSIs Poster Presentation
LiSIs Poster Presentation
 
GCC2013 LiSIs poster
GCC2013 LiSIs posterGCC2013 LiSIs poster
GCC2013 LiSIs poster
 
GCC2013 LiSIs Lightning Talk
GCC2013 LiSIs Lightning TalkGCC2013 LiSIs Lightning Talk
GCC2013 LiSIs Lightning Talk
 
20120615_Granatum_COST_v2
20120615_Granatum_COST_v220120615_Granatum_COST_v2
20120615_Granatum_COST_v2
 
2009 MSc Presentation for Parallel-MEGA
2009 MSc Presentation for Parallel-MEGA2009 MSc Presentation for Parallel-MEGA
2009 MSc Presentation for Parallel-MEGA
 
9th ITAB 2009 Parallel-MEGA
9th ITAB 2009 Parallel-MEGA9th ITAB 2009 Parallel-MEGA
9th ITAB 2009 Parallel-MEGA
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Último (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Granatum_LiSIs_BIBE_2012_presentation_v4.0

  • 1. A Workflow System for Virtual Screening in Cancer Chemoprevention Kannas C. C., Achilleos K. G., Antoniou Z., Kalvari I., Kirmitzoglou I., Nicolaou C. A., Promponas V. J., Pattichis C. S. 13th November 2012 IEEE 12th International Conference on BioInformatics & BioEngineering
  • 2. Outline About LiSIs platform Objectives What is LiSIs? Virtual Screening & Scientific Workflow Management Systems LiSIs platform Virtual Screening Process Template Docking Models Predictive Models 3rd Party Tools The GRANATUM-LiSIs Platform Concluding Remarks Acknowledgement Questions? IEEE 12th International Conference on BioInformatics & 2 13th November 2012 BioEngineering
  • 3. About LiSIs platform IEEE 12th International Conference on BioInformatics & 3 13th November 2012 BioEngineering
  • 4. Objectives Provide a set of tools to create virtual screening process for the discovery of novel agents with desired properties. Provide a set of tools to create data-driven models designed to predict biochemical properties of interest. Provide an environment to create, update, store and share Scientific Workflows, that is accessible via a web interface. IEEE 12th International Conference on BioInformatics & 4 13th November 2012 BioEngineering
  • 5. What is LiSIs? Life Science Informatics Scientific Workflow Management System Computational Environment for Virtual Screening. Cancer Chemoprevention Research Computational Tools borrowed from Drug Discovery Process. GRANATUM project (http://www.granatum.org) Partially funded by the European Commission under the Seventh Framework Programme in the area of Virtual Physiological Human (ICT-2009.5.3). IEEE 12th International Conference on BioInformatics & 5 13th November 2012 BioEngineering
  • 6. Virtual Screening & Scientific Workflow Management Systems Virtual Screening Computational counterpart of biological screening. Goal: decrease the number of compounds physically screened. Scientific Workflow Management Systems (SWMS) Computational environments which facilitate the design and execution of computational experiments (workflows). Known SWMS: Taverna (http://www.taverna.org.uk/) KNIME (http://www.knime.org/ ) Galaxy (http://galaxy.psu.edu/) IEEE 12th International Conference on BioInformatics & 6 13th November 2012 BioEngineering
  • 7. Available @ http://lisis.cs.ucy.ac.cy LiSIs platform IEEE 12th International Conference on BioInformatics & 7 13th November 2012 BioEngineering
  • 8. Virtual Screening Process Template Output Postprocessing •Storage •Visualization •Cleaning •Formatting Processing •Merging •Descriptor Filters •Similarity Search Preprocessing •Substructure Search •File format •Docking Models transformations •Predictive Models •Property Normalizer Input •Descriptors •GRANATUM Calculation platform File •Fragmentation Loader •Coordinates •File Readers Calculation •Protein Cleaner IEEE 12th International Conference on BioInformatics & 8 13th November 2012 BioEngineering
  • 9. Docking Model Preparation Protein (pdb file) Add Hydrogens Protein Preparation: Remove H2O Docking Input Model Protein.pdb file Calculate Pocket Process Coordinates Remove water molecules Separate reference Add hydrogen atoms ligand Calculate binding domain coordinates Clean protein from co-crystalized molecule Output Modified protein.pdb file IEEE 12th International Conference on BioInformatics & 9 13th November 2012 BioEngineering
  • 10. Predictive Model Preparation Chemical data Algorithm Biological •Algorithm data parameters Predictive Model IEEE 12th International Conference on BioInformatics & 10 13th November 2012 BioEngineering
  • 11. Predictive Model Preparation Model Predictions •Chembl Training •Cross-Validation •Pubchem •Independent •Feature extraction / validation using •Predict biological •Literature selection separate datasets properties of •Various algorithms compounds (kNN, SVM, Random •E.g. Toxicity, ER- Forest, Decision Trees) binding activity Data Model gathering Validation IEEE 12th International Conference on BioInformatics & 11 13th November 2012 BioEngineering
  • 12. 3rd Party Tools IEEE 12th International Conference on BioInformatics & 12 13th November 2012 BioEngineering
  • 13. Galaxy Galaxy is an open, web-based platform for data intensive biomedical research. Free public server. Deploy locally. Deploy in a Cluster/Grid environment Deploy in the cloud (CloudMan Amazon EC2). http://galaxy.psu.edu/ IEEE 12th International Conference on BioInformatics & 13 13th November 2012 BioEngineering
  • 14. Galaxy Integration Methodology Galaxy configuration: Web Server(s) (Handle Users Requests) Job Manager (Job Management) Job Handler(s) (Job Execution) Galaxy Tools are wrappers around command line applications. Galaxy runs command line jobs at the background. IEEE 12th International Conference on BioInformatics & 14 13th November 2012 BioEngineering
  • 15. RDKit A collection of cheminformatics and machine-learning software written in C++ and Python. The core algorithms and data structures are written in C++. Wrappers are provided to use the toolkit from either Python or Java. Additionally, the RDKit distribution includes a PostgreSQL- based cartridge that allows molecules to be stored in relational database and retrieved via substructure and similarity searches. Features Overview RDKit Home Page IEEE 12th International Conference on BioInformatics & 15 13th November 2012 BioEngineering
  • 16. The R Project R is a language and environment for statistical computing and graphics. R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and graphical techniques, and is highly extensible. One of R's strengths is the ease with which well-designed publication-quality plots can be produced, including mathematical symbols and formulae where needed. Great care has been taken over the defaults for the minor design choices in graphics, but the user retains full control. The R Project IEEE 12th International Conference on BioInformatics & 16 13th November 2012 BioEngineering
  • 17. AutoDock Vina AutoDock Vina is a new open-source program for drug discovery, molecular docking and virtual screening, offering multi- core capability, high performance and enhanced accuracy and ease of use. Features: Accuracy Compatibility with AutoDock Tools Ease of Use Flexible Side Chains Some receptor side chains can be chosen to be treated as flexible during docking. Speed AutoDock Vina IEEE 12th International Conference on BioInformatics & 17 13th November 2012 BioEngineering
  • 18. Available @ http://lisis.cs.ucy.ac.cy LiSIs platform Demo IEEE 12th International Conference on BioInformatics & 18 13th November 2012 BioEngineering
  • 19. LiSIs platform Live Demo!!! http://lisis.cs.ucy.ac.cy IEEE 12th International Conference on BioInformatics & 19 13th November 2012 BioEngineering
  • 20. Concluding Remarks Fill a current void in chemoprevention, and in general life sciences, research. Enabling researchers to utilize state of the art computational techniques to search for the discovery of novel agents with desired properties. IEEE 12th International Conference on BioInformatics & 20 13th November 2012 BioEngineering
  • 21. Acknowledgement Computer Science, UCY Cancer Biology and Prof. C. S. Pattichis Chemoprevention C. A. Nicolaou (PhD) laboratory at Department of Biological Sciences, UCY. Z. Antoniou Cancer Chemoprevention K. G. Achilleos and Epigenomics Workgroup Biological Sciences, UCY at German Cancer Research Dr. V. J. Promponas Center. I. Kirmitzoglou Members of the GRANATUM I. Kalvari consortium. IEEE 12th International Conference on BioInformatics & 21 13th November 2012 BioEngineering
  • 22. Questions? IEEE 12th International Conference on BioInformatics & 22 13th November 2012 BioEngineering