SlideShare una empresa de Scribd logo
1 de 30
Descargar para leer sin conexión
Luca Cozzuto @ Bioinformatics Core
                           http://rfam.sanger.ac.uk/
Non-coding RNA genes codify for a functional RNA
product rather than for a protein.
Non-coding genes codify for a functional RNA
product rather than for a protein.
Family of functional RNAs:

 Biological function   RNA family


 Involved in protein
                       tRNA, rRNA, SRP RNA, tmRNA
 synthesis
 Post-trascriptional
                       snRNA, snoRNA, SmY, scaRNA, gRNA, RNAse P, RNAse MRP,
 modification or DNA
                       Y RNA, telomerase RNA
 replication
                       aRNA, NAT, crRRNA, long ncRNA, miRNA, piRNA, siRNA,
 Regulatory RNAs
                       tasiRNA, rasiRNA, 7SK
 Parasitic RNA         Retrotransposon,Viroid, satellite RNA
The majority of functional RNAs fold in stable
structures that are essential for their biological
activity.




 Micro-RNA        tRNA              U2           Part of
                              spliceosomal     Riboswitch
  precursor
                                   RNA
Unlike protein-coding genes functional RNAs often
show no significant sequence similarity but preserve a
base-paired secondary structure.

           ncRNA_1   AAAAAAGGGGTTTTTT!
           ncRNA_2   AAATAAGGGGTTATTT!
           Struct    ((((((....)))))) !


This makes very difficult to search for those genes
looking only for sequence similarity (i.e. by using
BLAST, FASTA…)
For Rfam database a functional RNA family is
represented by a multiple sequence alignment and a
covariance model.

The model takes into account both sequence and
structure and can be used to scan a genomic
sequence to detect new members of the same family.
The Rfam Seed alignment for the U12 minor spliceosomal RNA family.
Only one sequence,
                                                                            up to 10 kb




Search methodology

The query sequence is scanned against a library of Rfam sequences using WU-BLAST, with an E-value threshold of 1.0.
Any matches to this are then scanned against the corresponding covariance model using the hand-curated threshold for
that family.
Results
Positive hits are reported together with the score, e-value and alignment to the family CM.
Bit score: how well the sequence matches your model.
The score reflects whether the sequence matches better to the profile model (positive score) or to the null model of
nonhomologous sequences (negative score).

E-value: expected number of false positives with bit scores at least high as your hit.
The value is related to the size of database used for the search.
I Predicted secondary structure
“<> [ ] { }” base pairs “_” hairpin loop “-” interior bulge and loop “,” single stranded multifurcation loop “:” external
single stranded residues “.” insertion to the consensus.

II Consensus of the query model

III Alignment to the model and scoring system
“Capital letter” = max score. “: +” score >=0 for base pairs and single stranded. “ ” negative score

IV Target sequence
Going to the family information
A summary written in wikipedia about the family is shown together with information stored into the database.
Going to the family information
Sequences part of that family can be viewed (if they are not so much)
Going to the family information
Both seed and full alignments of members can be displayed.
Going to the family information
Both seed and full alignments of members can be displayed.
Going to the family information
The secondary structure can be viewed.
Going to the family information
The secondary structure can be viewed.
Going to the family information
Also the tree of genomes containing members of that family can be browsed
Going to the family information
If a PDB entry is available it is possible to see also the three-dimensional structure.
Going to the family information
If a PDB entry is available it is possible to see also the three-dimensional structure.
Going to the family information
You can reach some publication on the family.
Problems in searching sequences


- To speed up the searching it is necessary a filtering step based on blast search. This will decrease the sensitivity in
finding true homologues of the functional RNA family.

- The genomes of higher eukaryotes contain many ncRNA-derived pseudogenes and repeats that looks like structured
functional RNAs.




       Gardner PP, et al. Bateman A. Rfam: updates to the RNA families database. Nucleic Acids Res. 2009
Batch search
You can upload a file containing several sequences in fasta format. Generally a job takes 48 hours.




                                                                Files must have fewer than 100,000 lines and fewer
                                                                than 1000 sequences with a size shorter than
                                                                200,000 nucleotides
Browsing for genome
Genomes scanned for the presence of a Rfma family are reported in Browse tab.
Browsing for genome
Species, kingdom, number of Rfam families and members found within the specie (Regions) are reported.
Browsing for genome
Browsing for genome
Running a complete search for a whole genome.




You may install locally                   the    infernal   program   available   at
http://infernal.janelia.org/.


To speed up the search you may install also the rfam_scan.pl script
available at ftp://ftp.sanger.ac.uk/pub/databases/Rfam/tools/ that relies
on Blast program.
Running a complete search for a whole genome.




Typical usage of infernal.

cmsearch -o output.aln --tabfile output.tab infile.fna Rfam.cm!



Typical usage of rfam_scan.pl

Perl   rfam_scan.pl             – blastdb        Rfam.fasta   -outfile.out   Rfam.cm
infile.fna !
Thanks!

Más contenido relacionado

La actualidad más candente

Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiomejukais
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Li Shen
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingTapish Goel
 
Introduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) TechnologyIntroduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) TechnologyQIAGEN
 
RNA Sequencing from Single Cell
RNA Sequencing from Single CellRNA Sequencing from Single Cell
RNA Sequencing from Single CellQIAGEN
 
Introduction to NGS
Introduction to NGSIntroduction to NGS
Introduction to NGScursoNGS
 
BioChain Next Generation Sequencing Products
BioChain Next Generation Sequencing ProductsBioChain Next Generation Sequencing Products
BioChain Next Generation Sequencing Productsbiochain
 
How to cluster and sequence an ngs library (james hadfield160416)
How to cluster and sequence an ngs library (james hadfield160416)How to cluster and sequence an ngs library (james hadfield160416)
How to cluster and sequence an ngs library (james hadfield160416)James Hadfield
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assemblyRamya P
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Sebastian Schmeier
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation SequencingSajad Rafatiyan
 
QIAseq Targeted DNA, RNA and Fusion Gene Panels
QIAseq Targeted DNA, RNA and Fusion Gene PanelsQIAseq Targeted DNA, RNA and Fusion Gene Panels
QIAseq Targeted DNA, RNA and Fusion Gene PanelsQIAGEN
 
Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1
Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1
Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1QIAGEN
 
2011 jeroen vanhoudt_ngs
2011 jeroen vanhoudt_ngs2011 jeroen vanhoudt_ngs
2011 jeroen vanhoudt_ngsDin Apellidos
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionJatinder Singh
 

La actualidad más candente (20)

Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiome
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Introduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) TechnologyIntroduction to Next-Generation Sequencing (NGS) Technology
Introduction to Next-Generation Sequencing (NGS) Technology
 
RNA Sequencing from Single Cell
RNA Sequencing from Single CellRNA Sequencing from Single Cell
RNA Sequencing from Single Cell
 
Introduction to NGS
Introduction to NGSIntroduction to NGS
Introduction to NGS
 
BioChain Next Generation Sequencing Products
BioChain Next Generation Sequencing ProductsBioChain Next Generation Sequencing Products
BioChain Next Generation Sequencing Products
 
Hamas 1
Hamas 1Hamas 1
Hamas 1
 
Biotech autumn2012-02-ngs2
Biotech autumn2012-02-ngs2Biotech autumn2012-02-ngs2
Biotech autumn2012-02-ngs2
 
How to cluster and sequence an ngs library (james hadfield160416)
How to cluster and sequence an ngs library (james hadfield160416)How to cluster and sequence an ngs library (james hadfield160416)
How to cluster and sequence an ngs library (james hadfield160416)
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assembly
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 
QIAseq Targeted DNA, RNA and Fusion Gene Panels
QIAseq Targeted DNA, RNA and Fusion Gene PanelsQIAseq Targeted DNA, RNA and Fusion Gene Panels
QIAseq Targeted DNA, RNA and Fusion Gene Panels
 
Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1
Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1
Digital RNAseq Technology Introduction: Digital RNAseq Webinar Part 1
 
2011 jeroen vanhoudt_ngs
2011 jeroen vanhoudt_ngs2011 jeroen vanhoudt_ngs
2011 jeroen vanhoudt_ngs
 
Ngs intro_v6_public
 Ngs intro_v6_public Ngs intro_v6_public
Ngs intro_v6_public
 
ChIP-seq
ChIP-seqChIP-seq
ChIP-seq
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential Expression
 

Similar a Annotating nc-RNAs with Rfam

Genomics lecture 3
Genomics lecture 3Genomics lecture 3
Genomics lecture 3iainj88
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment DesignYaoyu Wang
 
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...fruitbreedomics
 
Principle and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative BiogenePrinciple and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative BiogeneDonglin Bao
 
Rna seq and chip seq
Rna seq and chip seqRna seq and chip seq
Rna seq and chip seqJyoti Singh
 
Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Monica Munoz-Torres
 
Rna lecture
Rna lectureRna lecture
Rna lecturenishulpu
 
Role of Antisense and RNAi-based Gene Silencing in Crop Improvement
Role of Antisense and RNAi-based Gene Silencing in Crop ImprovementRole of Antisense and RNAi-based Gene Silencing in Crop Improvement
Role of Antisense and RNAi-based Gene Silencing in Crop ImprovementMariya Zaman
 
Lab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdf
Lab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdfLab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdf
Lab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdfPeterM9sWhitej
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema
 
RNA-Seq_Presentation
RNA-Seq_PresentationRNA-Seq_Presentation
RNA-Seq_PresentationToyin23
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data AnalysisRavi Gandham
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisSANJANA PANDEY
 

Similar a Annotating nc-RNAs with Rfam (20)

Genomics lecture 3
Genomics lecture 3Genomics lecture 3
Genomics lecture 3
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 
Crispr
CrisprCrispr
Crispr
 
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
 
Principle and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative BiogenePrinciple and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative Biogene
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Rna seq and chip seq
Rna seq and chip seqRna seq and chip seq
Rna seq and chip seq
 
Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015
 
Rna lecture
Rna lectureRna lecture
Rna lecture
 
Role of Antisense and RNAi-based Gene Silencing in Crop Improvement
Role of Antisense and RNAi-based Gene Silencing in Crop ImprovementRole of Antisense and RNAi-based Gene Silencing in Crop Improvement
Role of Antisense and RNAi-based Gene Silencing in Crop Improvement
 
Lab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdf
Lab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdfLab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdf
Lab 15- DNA Model Discovery Kit Answer sheet (For use with DNA Model K.pdf
 
RNA-seq Analysis
RNA-seq AnalysisRNA-seq Analysis
RNA-seq Analysis
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
 
RNA-Seq_Presentation
RNA-Seq_PresentationRNA-Seq_Presentation
RNA-Seq_Presentation
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data Analysis
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Crispr/cas9 101
Crispr/cas9 101Crispr/cas9 101
Crispr/cas9 101
 
Crispr cas system
Crispr cas systemCrispr cas system
Crispr cas system
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data Analysis
 
50320130403003 2
50320130403003 250320130403003 2
50320130403003 2
 

Más de Luca Cozzuto

vectorQC: 'A pipeline for assembling and annotation of vectors'
vectorQC: 'A pipeline for assembling and annotation of vectors'vectorQC: 'A pipeline for assembling and annotation of vectors'
vectorQC: 'A pipeline for assembling and annotation of vectors'Luca Cozzuto
 
Course on parsing methods for biologists with a focus on ChIP-seq data
Course on parsing methods for biologists with a focus on ChIP-seq dataCourse on parsing methods for biologists with a focus on ChIP-seq data
Course on parsing methods for biologists with a focus on ChIP-seq dataLuca Cozzuto
 
From Zero to Nextflow 2017
From Zero to Nextflow 2017From Zero to Nextflow 2017
From Zero to Nextflow 2017Luca Cozzuto
 
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...Luca Cozzuto
 

Más de Luca Cozzuto (6)

vectorQC: 'A pipeline for assembling and annotation of vectors'
vectorQC: 'A pipeline for assembling and annotation of vectors'vectorQC: 'A pipeline for assembling and annotation of vectors'
vectorQC: 'A pipeline for assembling and annotation of vectors'
 
Course on parsing methods for biologists with a focus on ChIP-seq data
Course on parsing methods for biologists with a focus on ChIP-seq dataCourse on parsing methods for biologists with a focus on ChIP-seq data
Course on parsing methods for biologists with a focus on ChIP-seq data
 
From Zero to Nextflow 2017
From Zero to Nextflow 2017From Zero to Nextflow 2017
From Zero to Nextflow 2017
 
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
 
AnnoWiki
AnnoWikiAnnoWiki
AnnoWiki
 
Macs course
Macs courseMacs course
Macs course
 

Último

ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 

Último (20)

ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 

Annotating nc-RNAs with Rfam

  • 1. Luca Cozzuto @ Bioinformatics Core http://rfam.sanger.ac.uk/
  • 2. Non-coding RNA genes codify for a functional RNA product rather than for a protein.
  • 3. Non-coding genes codify for a functional RNA product rather than for a protein. Family of functional RNAs: Biological function RNA family Involved in protein tRNA, rRNA, SRP RNA, tmRNA synthesis Post-trascriptional snRNA, snoRNA, SmY, scaRNA, gRNA, RNAse P, RNAse MRP, modification or DNA Y RNA, telomerase RNA replication aRNA, NAT, crRRNA, long ncRNA, miRNA, piRNA, siRNA, Regulatory RNAs tasiRNA, rasiRNA, 7SK Parasitic RNA Retrotransposon,Viroid, satellite RNA
  • 4. The majority of functional RNAs fold in stable structures that are essential for their biological activity. Micro-RNA tRNA U2 Part of spliceosomal Riboswitch precursor RNA
  • 5. Unlike protein-coding genes functional RNAs often show no significant sequence similarity but preserve a base-paired secondary structure. ncRNA_1 AAAAAAGGGGTTTTTT! ncRNA_2 AAATAAGGGGTTATTT! Struct ((((((....)))))) ! This makes very difficult to search for those genes looking only for sequence similarity (i.e. by using BLAST, FASTA…)
  • 6. For Rfam database a functional RNA family is represented by a multiple sequence alignment and a covariance model. The model takes into account both sequence and structure and can be used to scan a genomic sequence to detect new members of the same family.
  • 7. The Rfam Seed alignment for the U12 minor spliceosomal RNA family.
  • 8. Only one sequence, up to 10 kb Search methodology The query sequence is scanned against a library of Rfam sequences using WU-BLAST, with an E-value threshold of 1.0. Any matches to this are then scanned against the corresponding covariance model using the hand-curated threshold for that family.
  • 9. Results Positive hits are reported together with the score, e-value and alignment to the family CM.
  • 10. Bit score: how well the sequence matches your model. The score reflects whether the sequence matches better to the profile model (positive score) or to the null model of nonhomologous sequences (negative score). E-value: expected number of false positives with bit scores at least high as your hit. The value is related to the size of database used for the search.
  • 11. I Predicted secondary structure “<> [ ] { }” base pairs “_” hairpin loop “-” interior bulge and loop “,” single stranded multifurcation loop “:” external single stranded residues “.” insertion to the consensus. II Consensus of the query model III Alignment to the model and scoring system “Capital letter” = max score. “: +” score >=0 for base pairs and single stranded. “ ” negative score IV Target sequence
  • 12. Going to the family information A summary written in wikipedia about the family is shown together with information stored into the database.
  • 13. Going to the family information Sequences part of that family can be viewed (if they are not so much)
  • 14. Going to the family information Both seed and full alignments of members can be displayed.
  • 15. Going to the family information Both seed and full alignments of members can be displayed.
  • 16. Going to the family information The secondary structure can be viewed.
  • 17. Going to the family information The secondary structure can be viewed.
  • 18. Going to the family information Also the tree of genomes containing members of that family can be browsed
  • 19. Going to the family information If a PDB entry is available it is possible to see also the three-dimensional structure.
  • 20. Going to the family information If a PDB entry is available it is possible to see also the three-dimensional structure.
  • 21. Going to the family information You can reach some publication on the family.
  • 22. Problems in searching sequences - To speed up the searching it is necessary a filtering step based on blast search. This will decrease the sensitivity in finding true homologues of the functional RNA family. - The genomes of higher eukaryotes contain many ncRNA-derived pseudogenes and repeats that looks like structured functional RNAs. Gardner PP, et al. Bateman A. Rfam: updates to the RNA families database. Nucleic Acids Res. 2009
  • 23. Batch search You can upload a file containing several sequences in fasta format. Generally a job takes 48 hours. Files must have fewer than 100,000 lines and fewer than 1000 sequences with a size shorter than 200,000 nucleotides
  • 24. Browsing for genome Genomes scanned for the presence of a Rfma family are reported in Browse tab.
  • 25. Browsing for genome Species, kingdom, number of Rfam families and members found within the specie (Regions) are reported.
  • 28. Running a complete search for a whole genome. You may install locally the infernal program available at http://infernal.janelia.org/. To speed up the search you may install also the rfam_scan.pl script available at ftp://ftp.sanger.ac.uk/pub/databases/Rfam/tools/ that relies on Blast program.
  • 29. Running a complete search for a whole genome. Typical usage of infernal. cmsearch -o output.aln --tabfile output.tab infile.fna Rfam.cm! Typical usage of rfam_scan.pl Perl rfam_scan.pl – blastdb Rfam.fasta -outfile.out Rfam.cm infile.fna !