SlideShare una empresa de Scribd logo
1 de 19
Topic: COGs and
Comparative Genomics
Durdam M.Sc. Bioinformatics sem-2
Some important terminologies:
Orthologs are genes in different species that evolved from a
common ancestral gene by speciation. Normally, orthologs retain the
same function in the course of evolution. Identification of orthologs
is critical for reliable prediction of gene function in newly sequenced
genomes.
Paralogs are genes related by duplication within a genome.
Orthologs retain the same function in the course of evolution,
whereas paralogs evolve new functions, even if these are related to
the original one.
Speciation is the origin of a new species capable of making a living
in a new way from the species from which it arose. As part of this
process it has also acquired some barrier to genetic exchange with
the parent species.
Speciation
COGs
• Cluster of orthologous genes.
• Clusters of Orthologous Groups, are groups of three or more ortholog
genes, meaning they are direct evolutionary counter parts and are
considered to be part of an 'ancient conserved domain'. A COG is defined
as three or more proteins from the genomes of distant species that are
more similar to each other than to any other protein within the individual
genome.
• COGs can be used to predict the function of homologous proteins in
poorly studied species and can also be used to track the evolutionary
divergence from a common ancestor, hence providing a powerful tool for
functional annotation of uncharacterized proteins.
• Important in comparative genomics studies
Application of COG
• The most straightforward application of the COGs is for the prediction of
functions of individual proteins or protein sets, including those from newly
completed genomes.
NCBI provides a COG database that consists of 4,873 COGs that code for over
136,000 proteins from the genomes of 50 bacteria, 13 archaea and 3 unicellular
eukaryotes. This database uses completely sequenced genomes to classify
proteins using the orthology concept.
The COG database
What are some questions that comparative
genomics can address?
• How has the organism evolved?
• What differentiates species?
• Which non-coding regions are important?
• Which genes are required for organisms to survive in a
certain environment?
What is Comparative Genomics?
It is the comparison of one genome to another.
Genomics DNA (Gene)
Functional
Genomics
Transcriptomics RNA
Proteomics PROTEIN
Metabolomics METABOLITE
Transcription
Translation
Enzymatic
reaction
Difference is in Scale and Direction
One or several genes
compared against all
other known genes.
Use genome to
inform us about the
entire organism.
Use information
from many
genomes to learn
more about the
individual genes.
Entire Genome
compared to
other entire
genomes.
Other “omics” Comparative
Comparative genomics
• Discover what lies hidden in genomic
sequence by comparing sequence
information.
• Main areas
– Whole genome alignment
– Gene prediction
– Regulatory element prediction
– Phylogenomics
– Pharmacogenetics
Comparative Genomics
Comparative genomics is a powerful tool for identifying the features and dissecting
the functions of genomes. The approach is based on selection for the gene or
regulatory region constraining the evolution of the sequence. Comparison with other
genomes has become an integral part of the analysis of the human genome sequence
and is one of the most effective methods for identifying genes (Batzoglou et al. ,
2000; Roest Crollius et al. , 2000)
Comparative genomics is a field of biological research in which the genomic features of
different organisms are compared. The genomic features may include the DNA sequence,
genes, gene order, regulatory sequences, and other genomic structural landmarks
Figure: Species tree of different organisms
Figure: Distribution and clustering of orthologous genes of Tulsi genome to other related plant
genomes. a. Distribution of gene families among five plant genomes. Ocimum tenuiflorum (Ote
- green), Arabidopsis thaliana (Ath – black rectangle), Oryza sativa (Osa – red), Solanum
lycopersicum (Sly – blue) and Mimulus guttatus (Mgu – black circle). The numbers in the Venn
diagram represent shared and unique gene families across these 5 species obtained by
OrthoMCL.
b. Horizontal stacked bar plot of all the genes in 23 different genomes. This figure shows
ortholog group distribution in all 23 plant species including Tulsi. Each row represents a plant
species - Physcomitrella patens (Ppa), Selaginella moellendorffii (Smo), Oryza sativa (Osa),
Setaria italic (Sit), Zea mays (Zma), Sorghum bicolor (Sbi), Aquilegia caerulea (Aca), Ocimum
tenuiflorum (Ote), Mimulus guttatus (Mgu), Solanum lycopersicum (Sly), Solanum tuberosum
(Stu), Vitis vinifera (Vvi), Eucalyptus grandis (Egr), Citrus sinensis (Csi), Theobroma cacao (Tca),
Carica papaya (Cpa), Brassica rapa (Bra), Arabidopsis thaliana (Ath), Fragaria vesca (Fve), Prunus
persica (Ppe), Glycine max (Gma), Medicago truncatula (Mtr), Populus trichocarpa (Ptr). The bar
graph represents ortholog protein groups for that species subdivided into 22 categories
depending on the degree of sharing with the other 22 plant species e.g., category 2 represents
the number of orthologous groups that have representatives from the species of interest and
from one more species out of the 23 species selected for the study
Background: Shortly after multiple
genome sequences of bacteria, archae
and unicellular eukaryotes became
available, an attempt on such a
classification was implemented in
Cluster of Orthologous Groups of
proteins (COGs). Rapid accumulation of
genome sequences creates
opportunities for refining COGs but also
represents a challenge because of error
amplification.
Conclusion: The arCOGs provide a
convenient, flexible framework for
functional annotation of archael
genomes, comparative genomics and
evolutionary reconstructions. Genomic
reconstructions suggest that the last
common ancestor of archaea might
have been (nearly) as advanced as the
modern archael hyperthermophiles.
For more info:
ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
MBGD Database
MBGD is a database for comparative analysis of completely sequenced microbial
genomes, the number of which is now growing rapidly. The aim of MBGD is to
facilitate comparative genomics from various points of view such as ortholog
identification, paralog clustering, motif analysis and gene order comparison.
Conclusion
The study of Cluster of Orthologous
Genes play a vital role in the
Comparative genomic studies.
References and links
• NCBI COGs database
• Chapter 22 of the NCBI handbook: The Clusters of Orthologous Groups (COGs)
Database: Phylogenetic Classification of Proteins from Complete Genomes. NCBI
Bookshelf ID: NBK21101.
• NCBI News Letter: Protein Families and Genome Evolution. Published Feb 1998.
• http://homepage.usask.ca/~ctl271/857/def_homolog.shtml
• http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33
• Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223.
Epub 2014 Nov 26.
• http://www.ncbi.nlm.nih.gov/pubmed/25428365
• http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33
THANK YOU

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Genome assembly
Genome assemblyGenome assembly
Genome assembly
 
Genome Assembly
Genome AssemblyGenome Assembly
Genome Assembly
 
Prosite
PrositeProsite
Prosite
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Cath
CathCath
Cath
 
Techniques in proteomics
Techniques in proteomicsTechniques in proteomics
Techniques in proteomics
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular marker
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencing
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Kegg
KeggKegg
Kegg
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 
Electrophoretic mobility shift assay
Electrophoretic mobility shift assay Electrophoretic mobility shift assay
Electrophoretic mobility shift assay
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and tools
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Peptide Mass Fingerprinting
Peptide Mass FingerprintingPeptide Mass Fingerprinting
Peptide Mass Fingerprinting
 
Peptide Mass Fingerprinting
Peptide Mass FingerprintingPeptide Mass Fingerprinting
Peptide Mass Fingerprinting
 

Destacado

What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomicsUsman Arshad
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentationEmmanuel Aguon
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAmol Kunde
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomicsNikhil Aggarwal
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatsidjena70
 
Comparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerlComparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerlJason Stajich
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicskiran singh
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsajay301
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesSobia
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Leighton Pritchard
 
Database management system
Database management systemDatabase management system
Database management systemashishkthakur94
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Leighton Pritchard
 
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...CGIAR Generation Challenge Programme
 
Trans 2 butene
Trans 2 buteneTrans 2 butene
Trans 2 buteneimanijc
 
Unit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes NotesUnit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes NotesM F Ebden
 
rules in naming organic compound
rules in naming organic compoundrules in naming organic compound
rules in naming organic compoundvxiiayah
 

Destacado (20)

What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomics
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 format
 
Comparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerlComparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerl
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciences
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2
 
GRM 2011: Theme 1 -- Comparative genomics
GRM 2011: Theme 1 -- Comparative genomicsGRM 2011: Theme 1 -- Comparative genomics
GRM 2011: Theme 1 -- Comparative genomics
 
Database management system
Database management systemDatabase management system
Database management system
 
Ozonolysis of 2-Butenes
Ozonolysis of 2-ButenesOzonolysis of 2-Butenes
Ozonolysis of 2-Butenes
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)
 
03 Object Dbms Technology
03 Object Dbms Technology03 Object Dbms Technology
03 Object Dbms Technology
 
Nomenclature
NomenclatureNomenclature
Nomenclature
 
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
 
Trans 2 butene
Trans 2 buteneTrans 2 butene
Trans 2 butene
 
Unit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes NotesUnit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes Notes
 
rules in naming organic compound
rules in naming organic compoundrules in naming organic compound
rules in naming organic compound
 

Similar a Comparative genomics

Comparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organellesComparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organellesKAUSHAL SAHU
 
Genetic fine structure
Genetic fine structureGenetic fine structure
Genetic fine structureSujan Karki
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 pptrehman2009
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genometcha163
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function predictionLars Juhl Jensen
 
Molecular Biology
Molecular BiologyMolecular Biology
Molecular BiologyBealise Sc
 
Comparative transcriptomics
Comparative transcriptomicsComparative transcriptomics
Comparative transcriptomicsSayak Ghosh
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and toolsKAUSHAL SAHU
 
Plant genomics general overview
Plant genomics general overviewPlant genomics general overview
Plant genomics general overviewKAUSHAL SAHU
 
genomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptxgenomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptxsuyashempire
 
Basic Formal Ontology (BFO) and Disease
 Basic Formal Ontology (BFO) and Disease Basic Formal Ontology (BFO) and Disease
Basic Formal Ontology (BFO) and DiseaseBarry Smith
 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...DevikaPatel12
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased OverviewPhilip Bourne
 
Molecular systematics.pdf
Molecular systematics.pdfMolecular systematics.pdf
Molecular systematics.pdfAartisoni17
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applicationsSham Sadiq
 

Similar a Comparative genomics (20)

Comparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organellesComparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organelles
 
Gene order
Gene orderGene order
Gene order
 
Gene order
Gene orderGene order
Gene order
 
Genetic fine structure
Genetic fine structureGenetic fine structure
Genetic fine structure
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 ppt
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genome
 
Tools of Genomics
Tools of GenomicsTools of Genomics
Tools of Genomics
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
 
Molecular Biology
Molecular BiologyMolecular Biology
Molecular Biology
 
Comparative transcriptomics
Comparative transcriptomicsComparative transcriptomics
Comparative transcriptomics
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomics
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 
Plant genomics general overview
Plant genomics general overviewPlant genomics general overview
Plant genomics general overview
 
genomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptxgenomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptx
 
Basic Formal Ontology (BFO) and Disease
 Basic Formal Ontology (BFO) and Disease Basic Formal Ontology (BFO) and Disease
Basic Formal Ontology (BFO) and Disease
 
THE human genome
THE human genomeTHE human genome
THE human genome
 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased Overview
 
Molecular systematics.pdf
Molecular systematics.pdfMolecular systematics.pdf
Molecular systematics.pdf
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applications
 

Más de prateek kumar

Más de prateek kumar (9)

Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
docking
docking docking
docking
 
RAPD, AFLP AND RFLP ANALYSIS
RAPD, AFLP AND RFLP ANALYSISRAPD, AFLP AND RFLP ANALYSIS
RAPD, AFLP AND RFLP ANALYSIS
 
Bhageerath h
Bhageerath  h Bhageerath  h
Bhageerath h
 
DNA sequencing
DNA sequencingDNA sequencing
DNA sequencing
 
Genomic variation
Genomic variationGenomic variation
Genomic variation
 
2 d gel analysis
2 d gel analysis2 d gel analysis
2 d gel analysis
 
2 d gel analysis
2 d gel analysis2 d gel analysis
2 d gel analysis
 

Último

Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 

Último (20)

Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 

Comparative genomics

  • 1. Topic: COGs and Comparative Genomics Durdam M.Sc. Bioinformatics sem-2
  • 2. Some important terminologies: Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Normally, orthologs retain the same function in the course of evolution. Identification of orthologs is critical for reliable prediction of gene function in newly sequenced genomes. Paralogs are genes related by duplication within a genome. Orthologs retain the same function in the course of evolution, whereas paralogs evolve new functions, even if these are related to the original one. Speciation is the origin of a new species capable of making a living in a new way from the species from which it arose. As part of this process it has also acquired some barrier to genetic exchange with the parent species.
  • 4. COGs • Cluster of orthologous genes. • Clusters of Orthologous Groups, are groups of three or more ortholog genes, meaning they are direct evolutionary counter parts and are considered to be part of an 'ancient conserved domain'. A COG is defined as three or more proteins from the genomes of distant species that are more similar to each other than to any other protein within the individual genome. • COGs can be used to predict the function of homologous proteins in poorly studied species and can also be used to track the evolutionary divergence from a common ancestor, hence providing a powerful tool for functional annotation of uncharacterized proteins. • Important in comparative genomics studies
  • 5. Application of COG • The most straightforward application of the COGs is for the prediction of functions of individual proteins or protein sets, including those from newly completed genomes. NCBI provides a COG database that consists of 4,873 COGs that code for over 136,000 proteins from the genomes of 50 bacteria, 13 archaea and 3 unicellular eukaryotes. This database uses completely sequenced genomes to classify proteins using the orthology concept. The COG database
  • 6. What are some questions that comparative genomics can address? • How has the organism evolved? • What differentiates species? • Which non-coding regions are important? • Which genes are required for organisms to survive in a certain environment?
  • 7. What is Comparative Genomics? It is the comparison of one genome to another. Genomics DNA (Gene) Functional Genomics Transcriptomics RNA Proteomics PROTEIN Metabolomics METABOLITE Transcription Translation Enzymatic reaction
  • 8. Difference is in Scale and Direction One or several genes compared against all other known genes. Use genome to inform us about the entire organism. Use information from many genomes to learn more about the individual genes. Entire Genome compared to other entire genomes. Other “omics” Comparative
  • 9. Comparative genomics • Discover what lies hidden in genomic sequence by comparing sequence information. • Main areas – Whole genome alignment – Gene prediction – Regulatory element prediction – Phylogenomics – Pharmacogenetics
  • 10. Comparative Genomics Comparative genomics is a powerful tool for identifying the features and dissecting the functions of genomes. The approach is based on selection for the gene or regulatory region constraining the evolution of the sequence. Comparison with other genomes has become an integral part of the analysis of the human genome sequence and is one of the most effective methods for identifying genes (Batzoglou et al. , 2000; Roest Crollius et al. , 2000) Comparative genomics is a field of biological research in which the genomic features of different organisms are compared. The genomic features may include the DNA sequence, genes, gene order, regulatory sequences, and other genomic structural landmarks
  • 11. Figure: Species tree of different organisms
  • 12.
  • 13. Figure: Distribution and clustering of orthologous genes of Tulsi genome to other related plant genomes. a. Distribution of gene families among five plant genomes. Ocimum tenuiflorum (Ote - green), Arabidopsis thaliana (Ath – black rectangle), Oryza sativa (Osa – red), Solanum lycopersicum (Sly – blue) and Mimulus guttatus (Mgu – black circle). The numbers in the Venn diagram represent shared and unique gene families across these 5 species obtained by OrthoMCL. b. Horizontal stacked bar plot of all the genes in 23 different genomes. This figure shows ortholog group distribution in all 23 plant species including Tulsi. Each row represents a plant species - Physcomitrella patens (Ppa), Selaginella moellendorffii (Smo), Oryza sativa (Osa), Setaria italic (Sit), Zea mays (Zma), Sorghum bicolor (Sbi), Aquilegia caerulea (Aca), Ocimum tenuiflorum (Ote), Mimulus guttatus (Mgu), Solanum lycopersicum (Sly), Solanum tuberosum (Stu), Vitis vinifera (Vvi), Eucalyptus grandis (Egr), Citrus sinensis (Csi), Theobroma cacao (Tca), Carica papaya (Cpa), Brassica rapa (Bra), Arabidopsis thaliana (Ath), Fragaria vesca (Fve), Prunus persica (Ppe), Glycine max (Gma), Medicago truncatula (Mtr), Populus trichocarpa (Ptr). The bar graph represents ortholog protein groups for that species subdivided into 22 categories depending on the degree of sharing with the other 22 plant species e.g., category 2 represents the number of orthologous groups that have representatives from the species of interest and from one more species out of the 23 species selected for the study
  • 14. Background: Shortly after multiple genome sequences of bacteria, archae and unicellular eukaryotes became available, an attempt on such a classification was implemented in Cluster of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. Conclusion: The arCOGs provide a convenient, flexible framework for functional annotation of archael genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archael hyperthermophiles. For more info: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
  • 15.
  • 16. MBGD Database MBGD is a database for comparative analysis of completely sequenced microbial genomes, the number of which is now growing rapidly. The aim of MBGD is to facilitate comparative genomics from various points of view such as ortholog identification, paralog clustering, motif analysis and gene order comparison.
  • 17. Conclusion The study of Cluster of Orthologous Genes play a vital role in the Comparative genomic studies.
  • 18. References and links • NCBI COGs database • Chapter 22 of the NCBI handbook: The Clusters of Orthologous Groups (COGs) Database: Phylogenetic Classification of Proteins from Complete Genomes. NCBI Bookshelf ID: NBK21101. • NCBI News Letter: Protein Families and Genome Evolution. Published Feb 1998. • http://homepage.usask.ca/~ctl271/857/def_homolog.shtml • http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33 • Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223. Epub 2014 Nov 26. • http://www.ncbi.nlm.nih.gov/pubmed/25428365 • http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33

Notas del editor

  1. Homology: Is the relationship between biological structures or sequences that are derived from a common ancestry. Two things are homologous if they bear same relationship to one another, such as a certain bone in various forms of the “hand”.