Pro-Gyan
Build and share protein classifiers from FASTA files
What is Pro-Gyan
• It builds a binary classifier directly from protein sequences.
• It calculates ~5000 different properties from each protein sequence.
• It selects a “maximal relevant and minimal redundant feature subset” and ranks the features using information theory.
• The top-ranked features are used to build the final SVM classifier, with 5-fold cross-validation.
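The “maximal relevant and minimal redundant” ranking can be sketched as a greedy selection driven by mutual information. This is only an illustration under stated assumptions: the slides do not describe Pro-Gyan's actual algorithm, the features are assumed to be already discretized, and the names mutual_information and mrmr_rank are hypothetical.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Mutual information (in bits) between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), count in pxy.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (count / n) * math.log2(count * n / (px[x] * py[y]))
    return mi

def mrmr_rank(features, labels, k):
    """Greedy ranking: at each step pick the feature with the highest
    relevance to the labels minus mean redundancy with features already chosen.

    features: dict mapping a feature name to its list of discrete values
    (hypothetical input layout, not Pro-Gyan's internal format).
    """
    relevance = {name: mutual_information(col, labels)
                 for name, col in features.items()}
    selected = []
    remaining = set(features)
    while remaining and len(selected) < k:
        def score(name):
            if not selected:
                return relevance[name]
            redundancy = sum(mutual_information(features[name], features[s])
                             for s in selected) / len(selected)
            return relevance[name] - redundancy
        # sorted() makes tie-breaking deterministic (alphabetical order)
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy data: "a_copy" duplicates "a", so it is ranked below the weaker but
# non-redundant feature "b".
labels = [1, 1, 1, 1, 0, 0, 0, 0]
features = {
    "a":      [1, 1, 1, 0, 0, 0, 0, 0],
    "a_copy": [1, 1, 1, 0, 0, 0, 0, 0],
    "b":      [0, 1, 1, 1, 1, 0, 0, 0],
}
print(mrmr_rank(features, labels, k=3))
```

In this sketch the top-ranked names would then be passed to whatever classifier is trained next; the SVM and cross-validation steps are not shown.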
How to use Pro-Gyan
• Download “Pro_Gyan_1.0.zip” from
(https://code.google.com/p/pro-gyan/downloads/list)
• Extract all the files.
• Double-click Pro-Gyan.jar to open the main window of “Pro-Gyan”.
• Let us build a classifier.
How to build a protein classifier
1. To build a classifier we need two sets of proteins (for example, mitochondrial and non-mitochondrial) in FASTA format.
2. Now press “Create Classifier”.
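For readers who want to check their two input sets before loading them, a FASTA file can be read with a few lines of code. This is a minimal sketch, not part of Pro-Gyan: the function name parse_fasta and the example sequences are made up for illustration.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) pairs from FASTA-formatted text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            # Sequence lines may be wrapped; collect and join them.
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records

# Hypothetical two-record example (the sequences are invented).
example = """>protein_1 positive example
MKTAYIAKQR
QISFVKSHFS
>protein_2 positive example
mlsravcgts
"""
for name, seq in parse_fasta(example):
    print(name, len(seq))
```

Running the same check on both the positive and the negative file is a quick way to confirm that each set contains the expected number of sequences.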
How to build a protein classifier
• Give a name to your classifier.
• Add a description of the classifier and the data set.
• Label the positive and negative input data appropriately.
• Browse to the FASTA files and press the “Save” button.
How to build a protein classifier
Now Pro-Gyan is ready to build a new classifier. Press “Self Learn”;
training will take some time, depending on the data size.
Performance metrics
Confusion matrix
Accuracy, sensitivity, specificity
Matthews correlation coefficient
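The metrics listed above all follow from the four cells of the confusion matrix. The sketch below uses the standard textbook definitions; the function name binary_metrics and the example counts are illustrative and are not taken from Pro-Gyan's output.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Compute common binary-classification metrics from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    # Sensitivity: fraction of true positives recovered.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: fraction of true negatives recovered.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    # Matthews correlation coefficient; defined as 0 when the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
    }

# Hypothetical counts for illustration only.
for name, value in binary_metrics(tp=40, fp=10, tn=45, fn=5).items():
    print(f"{name}: {value:.3f}")
```

Unlike accuracy, the Matthews correlation coefficient stays informative when the positive and negative sets differ greatly in size, which is why it is often reported alongside the other three.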
Receiver operating characteristic
Selected ranked features & statistics
Evaluation on independent test data
Data not used in training is important for assessing
the risk of overfitting.
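One simple way to obtain such independent data is to set aside part of each class before training. The sketch below is an assumption about workflow, not a Pro-Gyan feature: holdout_split, its test_fraction default, and the placeholder sequence names are all hypothetical.

```python
import random

def holdout_split(positives, negatives, test_fraction=0.2, seed=0):
    """Split each class separately so the test set keeps the class balance.

    Returns (train, test), each a list of (item, label) pairs,
    with label 1 for positives and 0 for negatives.
    """
    rng = random.Random(seed)  # fixed seed makes the split reproducible
    train, test = [], []
    for label, items in ((1, positives), (0, negatives)):
        shuffled = list(items)
        rng.shuffle(shuffled)
        # Hold out at least one item per class.
        n_test = max(1, round(len(shuffled) * test_fraction))
        test += [(item, label) for item in shuffled[:n_test]]
        train += [(item, label) for item in shuffled[n_test:]]
    return train, test

# Placeholder identifiers standing in for real sequences.
pos = [f"pos_{i}" for i in range(10)]
neg = [f"neg_{i}" for i in range(20)]
train, test = holdout_split(pos, neg)
print(len(train), len(test))
```

A classifier that scores much better on its training data than on the held-out set is likely overfitted.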
Performance on independent test set
Export the classifier
The classifier can be exported/saved in “Pro-Gyan
classifier” (pgc) format and then uploaded to a web server,
e-mailed, etc. The name and description of
the classifier can be updated at the time of export.
Classify novel proteins
• Import the required Pro-Gyan-built classifier.
Training information of the classifier
Classify novel proteins
• Copy-paste multiple protein sequences in FASTA format
into the text area, or upload them with the “Fasta File”
button, and “Classify” them.
Prediction result
• The result is displayed in tabular format and can be
copy-pasted into any text editor or spreadsheet.
