RNA Secondary Structure
Prediction
C SC 550 - Spring 2012
Muhammad J. Alam
Sumin Byeon
RNA

Ribonucleic acid

Single-stranded molecule

Consists of nucleotides

Each nucleotide contains
a base (A, C, G, U)
RNA Structures

Primary structure:
Linear sequence of
nucleotide bases

Secondary structure:
Hydrogen bonds
between bases forming
base pairs
RNA Structures

Hairpin loop

Stacked pair

Internal loop

Bulge

Multi loop
Problem Definition
  Input: primary structure of an RNA

  Goal: to predict the secondary structure



Given a primary structure of an RNA, find a secondary
structure that maximizes the number of base pairs
Practical Applications


Function classification

Evolutionary studies

Pseudogene detection
Different Approaches

Physical methods (Kim et al.)
  X-ray diffraction, nuclear magnetic resonance (NMR)


Chemical/enzymatic methods (Ehresmann et al.)

Mutational analysis (Tang and Draper)
Prediction with
Sequence Only
Structure prediction based on multiple RNA
sequences which are structurally similar
(Sankoff; Cary and Stormo)


Structure prediction based on a single RNA
sequence

   Nussinov Folding Algorithm, Zuker Algorithm
Assumptions

Only three kinds of base pairs form
(A-U, C-G, G-U)

One base forms at most one base pair

Pseudoknots do not occur
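
The first two assumptions can be checked mechanically. The following is a minimal sketch, not code from the slides: the names `ALLOWED_PAIRS`, `can_pair` and `satisfies_pairing_rules` are hypothetical, and a structure is assumed to be given as a list of 0-based index pairs.

```python
# Allowed pairs under the first assumption: A-U, C-G and G-U, in either order.
ALLOWED_PAIRS = {
    ("A", "U"), ("U", "A"),
    ("C", "G"), ("G", "C"),
    ("G", "U"), ("U", "G"),
}


def can_pair(x, y):
    """Return True if bases x and y may form a base pair."""
    return (x.upper(), y.upper()) in ALLOWED_PAIRS


def satisfies_pairing_rules(seq, pairs):
    """Check that every pair is allowed and that no base is used twice.

    `pairs` is a list of 0-based (i, j) index pairs (an assumed representation).
    """
    used = set()
    for i, j in pairs:
        if i == j or not can_pair(seq[i], seq[j]):
            return False
        if i in used or j in used:
            return False  # violates "one base forms at most one base pair"
        used.update((i, j))
    return True
```

For example, on the sequence `GACUC` the single pair `(0, 4)` (G with C) passes, while adding `(0, 3)` fails because base 0 would be paired twice.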
Pseudoknots

[Figure: a pseudoknot on an example sequence, where two base pairs cross each other]
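
Two base pairs (i, j) and (k, l) form a pseudoknot when they cross, that is, when i < k < j < l. A small sketch of that test follows; the function name `has_pseudoknot` and the 0-based pair-list representation are assumptions made for illustration.

```python
from itertools import combinations


def has_pseudoknot(pairs):
    """Return True if any two base pairs cross, i.e. i < k < j < l."""
    # Normalize each pair so that the smaller index comes first.
    normalized = sorted(tuple(sorted(p)) for p in pairs)
    for (i, j), (k, l) in combinations(normalized, 2):
        # The list is sorted, so i <= k here; the pairs cross when
        # k opens inside (i, j) and l closes outside it.
        if i < k < j < l:
            return True
    return False
```

Nested pairs such as `(0, 5), (1, 4)` and side-by-side pairs such as `(0, 3), (4, 7)` are accepted; `(0, 4), (2, 6)` is reported as a pseudoknot.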
Nussinov Folding
Algorithm
[Figure: the sequence drawn as positions 1, 2, ..., n]

Case 1: (1) and (n) form a pair

Case 2: There is a position k, where 1 ≤ k < n, such that no pair joins a base in 1..k to a base in k+1..n
Nussinov Folding
Algorithm

Case 1: (1) and (n) form a pair

  V(1, n) = V(2, n-1) + δ(S[1], S[n])

\[
\delta(x, y) =
\begin{cases}
1, & \text{if } (x, y) \in \{(a, u), (u, a), (c, g), (g, c), (g, u), (u, g)\} \\
0, & \text{otherwise}
\end{cases}
\]
Nussinov Folding
Algorithm

Case 2: There is a position k, where 1 ≤ k < n, that is not crossed by any pair (no pair joins a base in 1..k to a base in k+1..n)

  V(1, n) = V(1, k) + V(k+1, n)
Nussinov Folding
Algorithm
\[
V(i, j) = \max
\begin{cases}
V(i+1, j-1) + \delta(S[i], S[j]) \\
\max_{i \le k < j} \{ V(i, k) + V(k+1, j) \}
\end{cases}
\]

Dynamic programming
[Figure: the dynamic programming table V, with rows indexed by i and columns by j]
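
The recurrence can be filled in by increasing interval length. The sketch below is one way to do it, not the presenters' implementation: indices are 0-based, the names `delta`, `nussinov` and `traceback` are hypothetical, and the base case V(i, j) = 0 for j ≤ i is an assumption the slides leave implicit.

```python
PAIRS = {("A", "U"), ("U", "A"), ("C", "G"), ("G", "C"), ("G", "U"), ("U", "G")}


def delta(x, y):
    """Score 1 for an allowed base pair, 0 otherwise."""
    return 1 if (x, y) in PAIRS else 0


def nussinov(seq):
    """Return the table V, where V[i][j] is the maximum number of base
    pairs in seq[i..j] (0-based, inclusive)."""
    seq = seq.upper()
    n = len(seq)
    # Assumed base case: intervals of length 0 or 1 contain no pairs.
    V = [[0] * n for _ in range(n)]
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length - 1
            inner = V[i + 1][j - 1] if i + 1 <= j - 1 else 0
            # Case 1: i and j form a pair (adds delta, which may be 0).
            best = inner + delta(seq[i], seq[j])
            # Case 2: split at some k that no pair crosses.
            for k in range(i, j):
                best = max(best, V[i][k] + V[k + 1][j])
            V[i][j] = best
    return V


def traceback(seq, V):
    """Recover one optimal set of base pairs from the filled table."""
    seq = seq.upper()
    pairs = []
    stack = [(0, len(seq) - 1)]
    while stack:
        i, j = stack.pop()
        if i >= j:
            continue
        inner = V[i + 1][j - 1] if i + 1 <= j - 1 else 0
        if delta(seq[i], seq[j]) == 1 and V[i][j] == inner + 1:
            pairs.append((i, j))
            stack.append((i + 1, j - 1))
            continue
        for k in range(i, j):
            if V[i][j] == V[i][k] + V[k + 1][j]:
                stack.append((i, k))
                stack.append((k + 1, j))
                break
    return sorted(pairs)
```

For the sequence `GGGAAAUCC`, `nussinov` gives `V[0][8] == 3` and `traceback` returns `[(0, 8), (1, 7), (2, 6)]`. The table has O(n²) entries and each entry scans O(n) split points, so the running time is O(n³).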
Alternate Optimization Goal
  Find the most stable structure: Zuker Algorithm

  The hydrogen bonds at a base pair try to stabilize the
  structure

  Free bases inside a loop try to disrupt the structure

  The difference between these two is the destabilizing energy


Given a primary structure of an RNA, find the
secondary structure with least total energy
Destabilizing Energy Measure
 Stacked Pair : eS(i, j)

    Stabilizes the structure

    eS(i, j) is negative

 Hairpin : eH(i, j)

    The bigger the loop, the more unstable the structure is

    eH(i, j) depends on |j-i+1|
Destabilizing Energy Measure
 Internal Loop or Bulge : eL(i, j, i', j')

    The bigger the loop is and the more asymmetric the two
    sides are, the more unstable the structure is

    eL(i, j, i', j') depends on (|i'-i+1|+|j'-j+1|) and the asymmetry

 Multi-loop : eM(i1, j1, i2, j2, ..., ik, jk)

    The structure is more unstable when the loop size and k are large
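
Before any energy term can be applied, each base pair has to be matched with the kind of loop it closes. The following sketch shows one way to do that for a pseudoknot-free structure. The function name `classify_loops`, the 0-based pair list and the returned labels are assumptions made for illustration, and no energy values are computed.

```python
def classify_loops(pairs):
    """Map each base pair (i, j) to the kind of loop it closes.

    Assumes a pseudoknot-free structure in which each base is in at most
    one pair; `pairs` holds 0-based index pairs.
    """
    pairs = sorted(tuple(sorted(p)) for p in pairs)
    children = {p: [] for p in pairs}
    stack = []  # pairs that are still open, outermost first
    for p in pairs:
        # Discard pairs that closed before p opens; the top then encloses p.
        while stack and stack[-1][1] < p[0]:
            stack.pop()
        if stack:
            children[stack[-1]].append(p)
        stack.append(p)

    kinds = {}
    for (i, j), inner in children.items():
        if not inner:
            kinds[(i, j)] = "hairpin"
        elif len(inner) >= 2:
            kinds[(i, j)] = "multi-loop"
        else:
            i2, j2 = inner[0]
            left = i2 - i - 1   # unpaired bases between i and i'
            right = j - j2 - 1  # unpaired bases between j' and j
            if left == 0 and right == 0:
                kinds[(i, j)] = "stacked pair"
            elif left == 0 or right == 0:
                kinds[(i, j)] = "bulge"
            else:
                kinds[(i, j)] = "internal loop"
    return kinds
```

With the pairs `(0, 8), (1, 7), (2, 6)`, the two outer pairs close stacked pairs and `(2, 6)` closes a hairpin. With `(0, 20), (2, 8), (10, 18)`, the pair `(0, 20)` closes a multi-loop.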
Zuker Algorithm

Finds a secondary structure with minimum total
destabilizing energy

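Uses dynamic programming

Running time: exponential when the multi-loop energy eM is an arbitrary function of all the pairs in the loop; with the common simplification of a linear multi-loop energy it becomes polynomial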
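
The slides do not spell out the recurrences. As a sketch, one common textbook formulation is given below. The names W, V, VBI and VM, and the indexing of eS, eH, eL and eM by the closing pair (i, j), are assumptions rather than notation confirmed by the slides. Here W(j) is the minimum energy of a structure on S[1..j], and V(i, j) is the minimum energy of a structure on S[i..j] in which i and j form a pair.

```latex
\begin{align*}
W(0) &= 0, \qquad
W(j) = \min \Bigl\{ W(j-1),\ \min_{1 \le i < j} \bigl\{ V(i, j) + W(i-1) \bigr\} \Bigr\} \\
V(i, j) &= \min \bigl\{ eH(i, j),\ eS(i, j) + V(i+1, j-1),\ VBI(i, j),\ VM(i, j) \bigr\} \\
VBI(i, j) &= \min_{\substack{i < i' < j' < j \\ (i' - i) + (j - j') > 2}}
  \bigl\{ eL(i, j, i', j') + V(i', j') \bigr\} \\
VM(i, j) &= \min_{\substack{k \ge 2 \\ i < i_1 < j_1 < \dots < i_k < j_k < j}}
  \Bigl\{ eM(i, j, i_1, j_1, \dots, i_k, j_k) + \sum_{h=1}^{k} V(i_h, j_h) \Bigr\}
\end{align*}
```

The minimization in VM ranges over every choice of inner pairs, which is where the exponential running time comes from when eM is an arbitrary function.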
Demo
Conclusion
Summary
  An algorithm that finds a secondary structure
  with the maximum number of base pairs
Future work
  Develop an algorithm that does not make the
  assumption of absence of pseudoknots
  (Cary and Stormo)

  Develop an algorithm that addresses base
  triples and other types of base pairs
Thank you

A Domino Admins Adventures (Engage 2024)
 

RNA Secondary Structure Prediction

  • 1. RNA Secondary Structure Prediction. C SC 550, Spring 2012. Muhammad J. Alam, Sumin Byeon.
  • 2. RNA: Ribonucleic acid. Single-stranded molecule. Consists of nucleotides. Each nucleotide contains a base (A, C, G, U).
  • 3. RNA Structures. Primary structure: linear sequence of nucleotide bases. Secondary structure: hydrogen bonds between bases forming base pairs.
  • 4. RNA Structures: hairpin loop, stacked pair, internal loop, bulge, multi-loop.
  • 5. Problem Definition. Input: primary structure of an RNA. Goal: to predict the secondary structure. Given a primary structure of an RNA, find a secondary structure that maximizes the number of base pairs.
  • 7. Different Approaches. Physical methods (Kim et al.): X-ray diffraction, nuclear magnetic resonance (NMR). Chemical/enzymatic methods (Ehresmann et al.). Mutational analysis (Tang and Draper).
  • 8. Prediction with Sequence Only. Structure prediction based on multiple structurally similar RNA sequences (Sankoff, Gary and Stormo). Structure prediction based on a single RNA sequence: Nussinov Folding Algorithm, Zuker Algorithm.
  • 9. Assumptions. Three base pairs (A-U, C-G, G-U). One base forms at most one base pair. Pseudoknots do not occur.
  • 10-11. Pseudoknots. [Figure: an RNA strand with labeled bases (g, a, c, g, u, g, u, a, u, c); the drawing was not recovered.]
  • 12. Nussinov Folding Algorithm. Sequence positions 1, 2, ..., n. Case 1: (1) and (n) form a pair. Case 2: there is a position (k), with 1 ≤ k < n, that is not crossed by any pair.
  • 13. Nussinov Folding Algorithm. Case 1: (1) and (n) form a pair. V(1, n) = V(2, n-1) + δ(S[1], S[n])
  • 14. Nussinov Folding Algorithm. Case 1: (1) and (n) form a pair. V(1, n) = V(2, n-1) + δ(S[1], S[n]), where δ(x, y) = 1 if (x, y) ∈ {(a, u), (u, a), (c, g), (g, c), (g, u), (u, g)}, and δ(x, y) = 0 otherwise.
  • 15. Nussinov Folding Algorithm. Sequence positions 1, 2, ..., k, ..., n. Case 1: (1) and (n) form a pair. Case 2: there is a position (k), with 1 ≤ k < n, that is not crossed by any pair. V(1, n) = V(1, k) + V(k+1, n)
  • 16. Nussinov Folding Algorithm. V(i, j) = max { V(i+1, j-1) + δ(S[i], S[j]), max over i ≤ k < j of [V(i, k) + V(k+1, j)] }. Dynamic programming. [Figure: table of V indexed by i and j.]
  • 17. Nussinov Folding Algorithm. V(i, j) = max { V(i+1, j-1) + δ(S[i], S[j]), max over i ≤ k < j of [V(i, k) + V(k+1, j)] }. Dynamic programming.
  • 18. Alternate Optimization Goal. Find the most stable structure: Zuker Algorithm. The hydrogen bond at a base pair tends to stabilize the structure. Free bases inside a loop tend to disrupt the structure. The difference between these two is the destabilizing energy. Given a primary structure of an RNA, find the secondary structure with the least total energy.
  • 19. Destabilizing Energy Measure. Stacked pair, eS(i, j): stabilizes the structure; eS(i, j) is negative. Hairpin, eH(i, j): the bigger the loop, the more unstable the structure; eH(i, j) depends on |j-i+1|.
  • 20. Destabilizing Energy Measure. Internal loop or bulge, eL(i, j, i', j'): the bigger the loop and the more asymmetric its two sides, the more unstable the structure; eL(i, j, i', j') depends on (|i'-i+1|+|j'-j+1|) and the asymmetry. Multi-loop, eM(i1, j1, i2, j2, ..., ik, jk): the structure is more unstable when the loop size and k are large.
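  • 21. Zuker Algorithm. Finds a secondary structure with minimum total destabilizing energy. Uses dynamic programming. Running time: exponential if the multi-loop energy eM is an arbitrary function; polynomial, O(n^4), when eM is approximated by a linear function of the loop size and the number of branches.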
  • 22. Demo
  • 23. Conclusion. Summary: an algorithm that finds a secondary structure with the maximum number of base pairs. Future work: develop an algorithm that does not assume the absence of pseudoknots (Gary and Stormo); develop an algorithm that addresses base triples and other types of base pairs.
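
The recurrence for V(i, j) on the Nussinov slides can be sketched in Python as follows. This is a minimal illustration, not the authors' demo code: the names PAIRS, delta and nussinov are hypothetical, indices are 0-based, and, like the slides, it imposes no minimum hairpin loop length, so adjacent bases may pair. The traceback returns one optimal set of pairs; other optimal sets may exist.

```python
# Hypothetical names chosen for illustration; not from the original slides.
PAIRS = {("A", "U"), ("U", "A"), ("C", "G"), ("G", "C"), ("G", "U"), ("U", "G")}


def delta(x, y):
    """Return 1 if bases x and y can pair (A-U, C-G, G-U), else 0."""
    return 1 if (x.upper(), y.upper()) in PAIRS else 0


def nussinov(seq):
    """Return (max_pairs, pairs) for seq; pairs are 0-based (i, j) with i < j."""
    n = len(seq)
    if n == 0:
        return 0, []
    # V[i][j] = maximum number of base pairs within seq[i..j] (inclusive).
    V = [[0] * n for _ in range(n)]

    def inner(i, j):
        # V(i+1, j-1), treated as 0 when that interval is empty.
        return V[i + 1][j - 1] if i + 1 <= j - 1 else 0

    # Fill the table by increasing interval length.
    for span in range(1, n):
        for i in range(n - span):
            j = i + span
            # Case 1: positions i and j are paired with each other.
            best = inner(i, j) + delta(seq[i], seq[j])
            # Case 2: split at some k with i <= k < j.
            for k in range(i, j):
                best = max(best, V[i][k] + V[k + 1][j])
            V[i][j] = best

    # Traceback: recover one optimal set of pairs from the filled table.
    pairs = []
    stack = [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if i >= j:
            continue
        if delta(seq[i], seq[j]) and V[i][j] == inner(i, j) + 1:
            pairs.append((i, j))
            stack.append((i + 1, j - 1))
            continue
        for k in range(i, j):
            if V[i][j] == V[i][k] + V[k + 1][j]:
                stack.append((i, k))
                stack.append((k + 1, j))
                break
    return V[0][n - 1], sorted(pairs)
```

Calling nussinov("GCAU") pairs G with C and A with U. The table has O(n^2) entries and each entry scans O(n) split points, so this sketch runs in O(n^3) time.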

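
The three assumptions on the Assumptions slide (only A-U, C-G and G-U pairs; each base in at most one pair; no pseudoknots) can be checked mechanically for a candidate structure. The sketch below is an illustration under stated assumptions: the names ALLOWED and is_valid_structure are hypothetical, a structure is given as a list of 0-based (i, j) index pairs, and a pseudoknot is taken to mean two pairs (i, j) and (k, l) with i < k < j < l.

```python
# Hypothetical names chosen for illustration; not from the original slides.
ALLOWED = {frozenset("AU"), frozenset("CG"), frozenset("GU")}


def is_valid_structure(seq, pairs):
    """Return True if pairs satisfy the three assumptions for seq."""
    used = set()
    normalized = []
    for i, j in pairs:
        i, j = min(i, j), max(i, j)
        # A base cannot pair with itself, and indices must be in range.
        if i == j or i < 0 or j >= len(seq):
            return False
        # Assumption 1: only A-U, C-G and G-U pairs are allowed.
        if frozenset((seq[i].upper(), seq[j].upper())) not in ALLOWED:
            return False
        # Assumption 2: each base takes part in at most one pair.
        if i in used or j in used:
            return False
        used.update((i, j))
        normalized.append((i, j))
    # Assumption 3: no pseudoknots, i.e. no two pairs cross.
    normalized.sort()
    for a, (i, j) in enumerate(normalized):
        for k, l in normalized[a + 1:]:
            # After sorting, i < k always holds; crossing means k < j < l.
            if k < j < l:
                return False
    return True
```

For the sequence "GGCC", the nested pairs (0, 3) and (1, 2) pass the check, while (0, 2) and (1, 3) are rejected because they cross.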