SlideShare una empresa de Scribd logo
1 de 37
Descargar para leer sin conexión
Protein-Protein Interactions Prediction
Sergey Knyazev
November 21, 2012
Outline
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
Introduction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
There is Huge Ammount of
Interactions in a Cell
Example: Possible molecular interactions
in a spreading cell.
There is Many Ways Biomolecules
Interacts in a Cell.
Protein-Protein Interaction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
Protein-Protein Interaction
●
Physical contacts with molecular docking between
proteins that occur in a cell or in a living organism.
●
Not just a ‘‘functional contact’’: The existence of
many other types of functional links between
biomolecular entities (genes, proteins, metabolites,
etc.) in living organisms should not be confused
with protein physical interactions.
●
‘‘Specific contact’’, not just all proteins that bump
into each other by chance.
●
Should be excluded interactions that a protein
experiences when it is being made, folded, quality
checked, or degraded.
Protein-Protein Interaction (PPI)
detection
PPIs Detection Methods
Protein-Protein Interaction
Databases
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction
databases
4) Protein-Protein interaction prediction
Protein-Protein Interaction
Databases
●
BIND - Biomolecular Interaction Network Database;
●
BioGRID - Biological General Repository for
Interaction Datasets;
●
DIP - Database of Interacting Proteins;
●
IntAct - IntAct Molecular Interaction Database;
●
HPRD - Human Protein Reference Database
●
MINT - Molecular INTeraction database;
●
PIPs - Human PPI Prediction database;
●
STRING - Known and Predicted Protein-Protein
Interactions.
PPI Network Derived from
Databases
PIPs human PPIs database
●
Contains predictions of 37 000 high probability
interactions of which 34 000 are not reported in the
interaction databases HPRD, BIND, DIP or OPHID.
●
Interactions predicted by a naive Bayesian model.
The method combines information from gene co-
expression, orthology, co-occurrence of domains,
post-translational modifications, co-localization of
the proteins within the cell and analysis of the local
topology of the predicted PPI network.
●
Based on a prediction algorithm described bellow...
Protein-Protein interaction
prediction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction
prediction
Protein-Protein Interaction
Prediction
●
The prediction of human protein-protein
interactions was investigated in a Bayesian
framework by considering combinations of
individual protein features known to be indicative
of interaction.
●
The seven individual features are used.
●
The features are grouped into five distinct
modules: Expression (E), Ortology(O),
Combined(C), Disorder(D), Transitive(T).
Expression Module
●
Data Source:
–
GDS596 from the Gene Expression Omnibus
●
Description:
–
Gene co-expression profiles from 79 physiologically normal
tissues obtained from various sources
●
Scoring function:
–
Pearson correlation of coexpression over all conditions
●
Bins:
–
20 of equal size covering the correlation value
range (-1 to +1)
Orthology Module
●
Data Source:
–
InParanoid, BIND, DIP and GRID databases
●
Description:
–
Interactions of homologous protein pairs from yeast, fly, worm and human
●
Scoring function:
–
Organism-based using InParanoid score
●
13 Bins:
–
High, medium and low confidence bins were defined for human protein pairs that have
interacting orthologs in either yeast, fly or worm (for a total of 9 bins)
–
two bin for human pairs that have interacting paralogs in human (a medium and a low
confidence)
–
one bin for human pairs that have interacting homologs in more than one organism
–
one bin for human pairs that have only noninteracting orthologs
Combined Module
●
This module incorporates three distinct features in a nonnaïve
Bayesian framework: subcellular localization, domain co-
occurrence and post-translational modification co-occurrence.
●
Localization:
–
Data source:
●
PSLT predictions
–
Description:
●
PSLT is a human subcellular localization predictor that considers nine different
compartments (ER, Golgi, cytosol, nucleus, peroxisome, plasma membrane,
lysosome, mitochondria and extracellular)
–
Scoring function:
●
Qualitative score: proximity of compartments
–
4 bins:
●
same, neighboring, different compartments, or not localized
Combined module
●
Domain co-occurrence
–
Data source:
●
InterPro and Pfam
–
Description:
●
Protein domains and motifs
–
Scoring function:
●
The chi-square test was used as a measure of the likelihood of co-
occurrence of specific InterPro domains and motifs in protein pairs
●
Chi-square scores were calculated for all pairs of domains/motifs
that occurred in the training data
–
Bins:
●
5 covering range of Chi-square scores
Combined module
●
PTM co-occurrence
–
Data source:
●
HPRD and UniProt
–
Description:
●
Post-translational modifications
–
Scoring function:
–
Bins:
●
4 covering range of PTM scores
Disorder Module
●
Data source:
–
VLS2 predictions
●
Description:
–
Prediction of protein intrinsic disorder
●
Scoring function:
–
Sum of the percent disorder for each protein in a pair
●
Bins:
–
6 covering range of scoring function (0 to 200%)
Transitive Module
●
Description:
–
Module that considers local
topology of underlying network
predicted using combinations of
above features
●
Scoring function:
●
Bins:
–
5 covering range of scoring
function
Independence of the Modules
●
The final likelihood ratio output by the predictor is only
representative of the true likelihood of interaction of a protein pair if
the modules considered are independent. If the modules were not
independent, some likelihood ratios would likely be overestimated.
●
Previous studies have demonstrated that some of the features
considered here are indeed independent.
●
Independence of all modules used in our predictor was verified by
calculating Pearson correlation coefficients for all pairs of modules.
Architecture of the Predictor and
Likelihoods of the Modules
Posterior Odds Ratio Estimation
●
f1, … , fn — features
●
I — interaction
●
~I — non-interaction
Accuracy of the Predictors
●
In order to analyze the predictions, five-fold cross validation
experiments were performed and the area under partial ROC
(receiver operator characteristic) curves (partial AUCs) measured.
●
T is the total number of positives in the test set
●
Ti is the number of positives that score higher than the ith highest
scoring negative
Prediction Accuracy of Different Combinations of Modules
PPI Prediction by Single Module
PPI Prediction by Combination of
Modules
Receiver Operator Characteristic
(ROC)
Comparison with Other Interaction
Datasets
●
Estimated datasets:
–
Rhodes probabilistic dataset
–
LR400 (derived from our predictors)
–
Lehner orthology-derived dataset
●
The false positive rates:
●
Reference datasets:
–
Literature-mined Ramani dataset
–
Human Protein Reference Database
(HPRD)
Comparison with Other Interaction
Datasets
Independent Validation
Conclusion
●
Predicted over 37000 human protein
interactions
●
Explored a subspace of the human
interactome that has not been
investigated by previous large
interaction datasets.
References
●
Protein–Protein Interactions Essentials: Key Concepts to
Building and Analyzing Interactome Networks 2010
Javier De Las Rivas, Celia Fontanillo
●
PIPs: human protein–protein interaction prediction
database 2008
Mark D. McDowall, Michelle S. Scott and Geoffrey J. Barton
●
Probabilistic prediction and ranking of human protein-
protein interactions 2007
Michelle S Scott and Geoffrey J Barton
Thank you!

Más contenido relacionado

La actualidad más candente

Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0
BioinformaticsInstitute
 
Protein-protein interaction
Protein-protein interactionProtein-protein interaction
Protein-protein interaction
sigma-tau
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybrid
hina ojha
 

La actualidad más candente (20)

co immunoprecipitation
co immunoprecipitationco immunoprecipitation
co immunoprecipitation
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networks
 
Ppi
PpiPpi
Ppi
 
Proteomics and protein-protein interaction
Proteomics  and protein-protein interactionProteomics  and protein-protein interaction
Proteomics and protein-protein interaction
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)
 
Protein protein interaction basic
Protein protein interaction basicProtein protein interaction basic
Protein protein interaction basic
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Protein interaction Creative Biomart
Protein interaction Creative BiomartProtein interaction Creative Biomart
Protein interaction Creative Biomart
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 
The yeast two hybrid system and ChIP
The yeast two hybrid system and ChIPThe yeast two hybrid system and ChIP
The yeast two hybrid system and ChIP
 
yeast two hybrid system
yeast two hybrid systemyeast two hybrid system
yeast two hybrid system
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studies
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0
 
Protein-protein interaction
Protein-protein interactionProtein-protein interaction
Protein-protein interaction
 
Proteomics
ProteomicsProteomics
Proteomics
 
Gene regulatory networks
Gene regulatory networksGene regulatory networks
Gene regulatory networks
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybrid
 
Yeast two hybrid
Yeast two hybrid Yeast two hybrid
Yeast two hybrid
 

Destacado (7)

Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Genomics
GenomicsGenomics
Genomics
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
 
Genomics
GenomicsGenomics
Genomics
 

Similar a Slides 0

University of Texas at Austin
University of Texas at AustinUniversity of Texas at Austin
University of Texas at Austin
butest
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
hemantbreeder
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to Cancer
Raunak Shrestha
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...
Andrei KUCHARAVY
 

Similar a Slides 0 (20)

Protein protein interaction, functional proteomics
Protein protein interaction, functional proteomicsProtein protein interaction, functional proteomics
Protein protein interaction, functional proteomics
 
Systems biology
Systems biologySystems biology
Systems biology
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products Research
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
 
Yeast Two Hybrid System
Yeast Two Hybrid SystemYeast Two Hybrid System
Yeast Two Hybrid System
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomics
 
University of Texas at Austin
University of Texas at AustinUniversity of Texas at Austin
University of Texas at Austin
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
presentation
presentationpresentation
presentation
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASy
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to Cancer
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...
 
Proteomics
ProteomicsProteomics
Proteomics
 
Proteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data setsProteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data sets
 
STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...
 
proteomics
 proteomics proteomics
proteomics
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programme
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 

Más de BioinformaticsInstitute

Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес... Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
BioinformaticsInstitute
 
Knime & bioinformatics
Knime & bioinformaticsKnime & bioinformatics
Knime & bioinformatics
BioinformaticsInstitute
 
"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус
BioinformaticsInstitute
 
Плюрипотентность 101
Плюрипотентность 101Плюрипотентность 101
Плюрипотентность 101
BioinformaticsInstitute
 
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
BioinformaticsInstitute
 

Más de BioinformaticsInstitute (20)

Graph genome
Graph genome Graph genome
Graph genome
 
Nanopores sequencing
Nanopores sequencingNanopores sequencing
Nanopores sequencing
 
A superglue for string comparison
A superglue for string comparisonA superglue for string comparison
A superglue for string comparison
 
Comparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphsComparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphs
 
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес... Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 
Вперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днкВперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днк
 
Knime & bioinformatics
Knime & bioinformaticsKnime & bioinformatics
Knime & bioinformatics
 
"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус
 
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
 
Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)
 
Плюрипотентность 101
Плюрипотентность 101Плюрипотентность 101
Плюрипотентность 101
 
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
 
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
 
Biodb 2011-everything
Biodb 2011-everythingBiodb 2011-everything
Biodb 2011-everything
 
Biodb 2011-05
Biodb 2011-05Biodb 2011-05
Biodb 2011-05
 
Biodb 2011-04
Biodb 2011-04Biodb 2011-04
Biodb 2011-04
 
Biodb 2011-03
Biodb 2011-03Biodb 2011-03
Biodb 2011-03
 
Biodb 2011-01
Biodb 2011-01Biodb 2011-01
Biodb 2011-01
 
Biodb 2011-02
Biodb 2011-02Biodb 2011-02
Biodb 2011-02
 
Ngs 3 1
Ngs 3 1Ngs 3 1
Ngs 3 1
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

Slides 0

  • 2. Outline 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 3. Introduction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 4. There is Huge Ammount of Interactions in a Cell Example: Possible molecular interactions in a spreading cell.
  • 5. There is Many Ways Biomolecules Interacts in a Cell.
  • 6. Protein-Protein Interaction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 7. Protein-Protein Interaction ● Physical contacts with molecular docking between proteins that occur in a cell or in a living organism. ● Not just a ‘‘functional contact’’: The existence of many other types of functional links between biomolecular entities (genes, proteins, metabolites, etc.) in living organisms should not be confused with protein physical interactions. ● ‘‘Specific contact’’, not just all proteins that bump into each other by chance. ● Should be excluded interactions that a protein experiences when it is being made, folded, quality checked, or degraded.
  • 10. Protein-Protein Interaction Databases 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 11. Protein-Protein Interaction Databases ● BIND - Biomolecular Interaction Network Database; ● BioGRID - Biological General Repository for Interaction Datasets; ● DIP - Database of Interacting Proteins; ● IntAct - IntAct Molecular Interaction Database; ● HPRD - Human Protein Reference Database ● MINT - Molecular INTeraction database; ● PIPs - Human PPI Prediction database; ● STRING - Known and Predicted Protein-Protein Interactions.
  • 12.
  • 13. PPI Network Derived from Databases
  • 14. PIPs human PPIs database ● Contains predictions of 37 000 high probability interactions of which 34 000 are not reported in the interaction databases HPRD, BIND, DIP or OPHID. ● Interactions predicted by a naive Bayesian model. The method combines information from gene co- expression, orthology, co-occurrence of domains, post-translational modifications, co-localization of the proteins within the cell and analysis of the local topology of the predicted PPI network. ● Based on a prediction algorithm described bellow...
  • 15. Protein-Protein interaction prediction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 16. Protein-Protein Interaction Prediction ● The prediction of human protein-protein interactions was investigated in a Bayesian framework by considering combinations of individual protein features known to be indicative of interaction. ● The seven individual features are used. ● The features are grouped into five distinct modules: Expression (E), Ortology(O), Combined(C), Disorder(D), Transitive(T).
  • 17. Expression Module ● Data Source: – GDS596 from the Gene Expression Omnibus ● Description: – Gene co-expression profiles from 79 physiologically normal tissues obtained from various sources ● Scoring function: – Pearson correlation of coexpression over all conditions ● Bins: – 20 of equal size covering the correlation value range (-1 to +1)
  • 18. Orthology Module ● Data Source: – InParanoid, BIND, DIP and GRID databases ● Description: – Interactions of homologous protein pairs from yeast, fly, worm and human ● Scoring function: – Organism-based using InParanoid score ● 13 Bins: – High, medium and low confidence bins were defined for human protein pairs that have interacting orthologs in either yeast, fly or worm (for a total of 9 bins) – two bin for human pairs that have interacting paralogs in human (a medium and a low confidence) – one bin for human pairs that have interacting homologs in more than one organism – one bin for human pairs that have only noninteracting orthologs
  • 19. Combined Module ● This module incorporates three distinct features in a nonnaïve Bayesian framework: subcellular localization, domain co- occurrence and post-translational modification co-occurrence. ● Localization: – Data source: ● PSLT predictions – Description: ● PSLT is a human subcellular localization predictor that considers nine different compartments (ER, Golgi, cytosol, nucleus, peroxisome, plasma membrane, lysosome, mitochondria and extracellular) – Scoring function: ● Qualitative score: proximity of compartments – 4 bins: ● same, neighboring, different compartments, or not localized
  • 20. Combined module ● Domain co-occurrence – Data source: ● InterPro and Pfam – Description: ● Protein domains and motifs – Scoring function: ● The chi-square test was used as a measure of the likelihood of co- occurrence of specific InterPro domains and motifs in protein pairs ● Chi-square scores were calculated for all pairs of domains/motifs that occurred in the training data – Bins: ● 5 covering range of Chi-square scores
  • 21. Combined module ● PTM co-occurrence – Data source: ● HPRD and UniProt – Description: ● Post-translational modifications – Scoring function: – Bins: ● 4 covering range of PTM scores
  • 22. Disorder Module ● Data source: – VLS2 predictions ● Description: – Prediction of protein intrinsic disorder ● Scoring function: – Sum of the percent disorder for each protein in a pair ● Bins: – 6 covering range of scoring function (0 to 200%)
  • 23. Transitive Module ● Description: – Module that considers local topology of underlying network predicted using combinations of above features ● Scoring function: ● Bins: – 5 covering range of scoring function
  • 24. Independence of the Modules ● The final likelihood ratio output by the predictor is only representative of the true likelihood of interaction of a protein pair if the modules considered are independent. If the modules were not independent, some likelihood ratios would likely be overestimated. ● Previous studies have demonstrated that some of the features considered here are indeed independent. ● Independence of all modules used in our predictor was verified by calculating Pearson correlation coefficients for all pairs of modules.
  • 25. Architecture of the Predictor and Likelihoods of the Modules
  • 26. Posterior Odds Ratio Estimation ● f1, … , fn — features ● I — interaction ● ~I — non-interaction
  • 27. Accuracy of the Predictors ● In order to analyze the predictions, five-fold cross validation experiments were performed and the area under partial ROC (receiver operator characteristic) curves (partial AUCs) measured. ● T is the total number of positives in the test set ● Ti is the number of positives that score higher than the ith highest scoring negative
  • 28. Prediction Accuracy of Different Combinations of Modules
  • 29. PPI Prediction by Single Module
  • 30. PPI Prediction by Combination of Modules
  • 32. Comparison with Other Interaction Datasets ● Estimated datasets: – Rhodes probabilistic dataset – LR400 (derived from our predictors) – Lehner orthology-derived dataset ● The false positive rates: ● Reference datasets: – Literature-mined Ramani dataset – Human Protein Reference Database (HPRD)
  • 33. Comparison with Other Interaction Datasets
  • 35. Conclusion ● Predicted over 37000 human protein interactions ● Explored a subspace of the human interactome that has not been investigated by previous large interaction datasets.
  • 36. References ● Protein–Protein Interactions Essentials: Key Concepts to Building and Analyzing Interactome Networks 2010 Javier De Las Rivas, Celia Fontanillo ● PIPs: human protein–protein interaction prediction database 2008 Mark D. McDowall, Michelle S. Scott and Geoffrey J. Barton ● Probabilistic prediction and ranking of human protein- protein interactions 2007 Michelle S Scott and Geoffrey J Barton