SlideShare una empresa de Scribd logo
1 de 27
Translation initiation start prediction
in human cDNAs with high accuracy
A. G. Hatzigeorgiou

Paper Presentation
Introduction to Bioinformatics
Anaxagoras Fotopoulos | Marina Adamou - Tzani

21/01/2014
Introduction
•

•
•

•

Primary objective of the present research is
contribution to the definition of the coding part of
a gene.
The search is performed in cDNA sequences.
Coding regions are surrounded by UnTraslated
Regions (UTRs).
The interest is focused in finding the Translation
Initiation Start (TIS) which defines the start of the
coding region.

cDNA
complementary DNA (cDNA) is DNA
synthesized from a messenger RNA (mRNA)
in a reaction catalyzed by the enzymes
reverse transcriptase and DNA polymerase.
2
Previous Research
Salzberg, 1997

Positional Conditional Probability matrix.
Generalized Second Order Profiles.
•
Implementation of the Ribosome Scanning Model
(Kozak, 1996)

Agarwal and
Bafna, 1998a

The ribosome first attaches to a
specific region in the 5’ end of the
mRNA and then scans the sequence
for the first ATG

•
•
3

No significant deferences were observed
between the above methods and a weight matrix
The above methods are studied in common due
to the high rate of false positives.
Previous Research
Pedersen and
Nielsen, 1997

Usage of ANNs for the recognition of local context and
statistical properties around the TIS. Large region of
analysis 100 bases before and 100 after the start codon

Salamov et. al.,
1998

Zien et. al.,
2000

Six characteristics are applied for the analysis of
the region around TIS including weight matrix and
hexanucleotide difference.
Use of Support Vector Machines (SVMs) for TIS
prediction

All of the above methods give up to
85% correct predictions.

4
Methods – Suggested Model
Swissprot
475 cDNAs
(Verified + Checked)

Training Gene Pool
Parameter estimation
Training Set + Evaluation Set

Conserved
Motif

Test Gene Pool

TIS Prediction
Consensus

Test Set

NN

Score
Multiplication

Training Gene Pool
Parameter estimation
Training Set + Evaluation Set

Test Gene Pool

TIS Prediction
Test Set
5

Coding/
Non
Coding
Potential
Coding

NN
Consensus Neural Network
325 positive
+
325 negative
examples
12-nucleotides long window
Feed forward with
short cut connections
& two hidden units
trained with cascade
correlation algorithm

Selection of the
appropriate
feed-forward NN

Binirization
of the input

Cascade
Correlation
Algorithm
6
Coding Neural Network
54 nucleotides length
window

Use Smith –
Waterman algorithm
for the elimination of
homologies between
training and test data

12-nucleotides usage static
long window
Apply codon

250 positive

(Count for every window all
non-overlapping codons)

250 negative

The sequence
window is
rescaled to 64
units

7

+

Sequence regions
extracted for testing

Every unit gives
the normalized
frequency of the
codon in the
window

282 genes with less
than 70% homology
were used for training

700 positive
+
700 negative
Sequence regions
extracted for training

Resilient backpropagation
algorithm is
applied to a
feed-forward NN.
Integrated method
Analysis of full length mRNA sequences

1st stage
• Calculation
of coding
score for
every
nucleotide
of the
mRNA
sequence

2nd stage
• Calculation
of coding
evidence of
the coding
region
included in
the longest
ORF of the
sequence

3rd stage

4th stage

• For every
in-frame
ATG a
consensus
score is
calculated

• For the
same inframe ATG,
a coding
difference
score is
calculated

The final score is obtained
by combining the output of
the consensus ANN and the
coding difference
8
Integrated method
Analysis of full length mRNA sequences
• This method provides only one prediction for every ORF
• According to the results of the test group:
• 94% of the TIS were correctly predicted
• 6% of the predictions were false positive
The use of the Las Vegas algorithm gives a confident decision. The incorporation
of this algorithm leads to a highly accurate recognition of the TIS in human
cDNAs for 60% of the cases!
Las
Vegas

9

Las Vegas algorithm provides a correct prediction
in some cases and has a “no answer” option in the
remaining cases. That is, it always produces the
correct result or it informs about the failure.
Results – Score Combination 1/3

Nucleotide 255 :
cod 0.98 – local 0.2

10

A score combination of
coding ANN and consensus
ANN gives low final score.

Cod line: Score of coding ANN
Local line: Score and position of consensus ANN for all ATGs in
coding frame
Results – Score Combination 2/3

Nucleotide 270:
cod 0.44 – local 0.4

11

A score combination of
coding ANN and consensus
ANN gives low final score.

Cod line: Score of coding ANN
Local line: Score and position of consensus ANN for all ATGs in
coding frame
Results – Score Combination 3/3

Correct
TIS

Nucleotide 148:
cod 0.95 – local 0.8

12

A score combination of
coding ANN and consensus
ANN gives high final score.

Cod line: Score of coding ANN
Local line: Score and position of consensus ANN for all ATGs in
coding frame
Results – Methods Comparison
Correct TIS positions

13
Results – Methods Comparison
Prediction for the 3 TIS positions
with the highest scores

14
Results – Methods Comparison
Consensus motif scores
(only for DIANA-TIS)

15
Results – Methods Comparison
Final scores

16
Results – Methods Comparison
Correct predictions

17
Results – Methods Comparison
Prediction Analysis

High
prediction
score
difference

TIS correct
position: 471

Did not find TIS
18

Found TIS but other
higher score exists
Results – Methods Comparison

Performance of the three programs for TIS prediction along
the mRNA with signal peptide sequences
Correct TIS positions

19
Results – Methods Comparison
Length of signal peptide

20
Results – Methods Comparison
Prediction for the 2 TIS positions
with the highest scores

21
Results – Methods Comparison
Consensus motif scores
only for DIANA-TIS)

22
Results – Methods Comparison
Final scores

23
Results – Methods Comparison
Prediction example #1:
DIANA-TIS is able to distinguish between TIS and other ATGs better
than other ANN based programs like NetStart:

2 suitable ATGs are 12
nucleotides away

Coding/non-coding
information is similar

Consensus motif is
completely different

24
Results – Methods Comparison
Prediction example #2:
A favorable prediction does not work for all examples:

Consensus motif is
completely different

Combined score is
much lower

In some signal
peptides sequences
the coding potential
score is relatively
low, and can thus
affect the combined
score.

25
Results – Methods Comparison

TIS
prediction program

TIS
prediction
rate

DIANA-TIS (2001)

94%

Agarwal & Bafna (1998)

85%

ATGPred
(Salamov et al, 1998)

79%

NetStart
(Pedersen & Nielsen, 1997)

78%

These methods allow
more than one
prediction per gene

Notice The results come from different datasets and
thus these numbers should not be directly compared.
26
Thank you!

Introduction to Bioinformatics
Information Technologies in Medicine and Biology
National & Kapodistrian
University of Athens
Department of Informatics
Biomedical Research
Foundation
Academy of Athens
27

Technological Education
Institute of Athens
Department of Biomedical
Engineering
Demokritos
National Center
for Scientific Research

Más contenido relacionado

Destacado

Europa Nostra Congress Athens 2013 - Programme
Europa Nostra Congress Athens 2013 - ProgrammeEuropa Nostra Congress Athens 2013 - Programme
Europa Nostra Congress Athens 2013 - ProgrammeAnax Fotopoulos
 
___tribalistas_velhainf_ncianin
  ___tribalistas_velhainf_ncianin  ___tribalistas_velhainf_ncianin
___tribalistas_velhainf_ncianinDenise Rivera
 
Uma vida vitoriosa e bem sucedida 4 Abra seu coração para algo impensável
Uma vida vitoriosa e bem sucedida 4   Abra seu coração para algo impensávelUma vida vitoriosa e bem sucedida 4   Abra seu coração para algo impensável
Uma vida vitoriosa e bem sucedida 4 Abra seu coração para algo impensávelIBGileade 2012
 
Chapa fênix - apresentação - aperj
Chapa   fênix - apresentação - aperjChapa   fênix - apresentação - aperj
Chapa fênix - apresentação - aperjDenise Rivera
 
Los si y los no del piercing
Los si y los no del piercingLos si y los no del piercing
Los si y los no del piercinginfonacho
 
Conocimientos básicos sobre Pago Único
Conocimientos básicos sobre Pago ÚnicoConocimientos básicos sobre Pago Único
Conocimientos básicos sobre Pago ÚnicoSociedad Desarrollo
 
Corrida Premiada Melnick Even
Corrida Premiada Melnick EvenCorrida Premiada Melnick Even
Corrida Premiada Melnick EvenKelvi Dias
 
Fdfs for cnblogs 1th
Fdfs for cnblogs 1thFdfs for cnblogs 1th
Fdfs for cnblogs 1thkevin zhang
 
Me declaro vivo
Me declaro vivoMe declaro vivo
Me declaro vivoSinapsis
 

Destacado (15)

Europa Nostra Congress Athens 2013 - Programme
Europa Nostra Congress Athens 2013 - ProgrammeEuropa Nostra Congress Athens 2013 - Programme
Europa Nostra Congress Athens 2013 - Programme
 
___tribalistas_velhainf_ncianin
  ___tribalistas_velhainf_ncianin  ___tribalistas_velhainf_ncianin
___tribalistas_velhainf_ncianin
 
Uma vida vitoriosa e bem sucedida 4 Abra seu coração para algo impensável
Uma vida vitoriosa e bem sucedida 4   Abra seu coração para algo impensávelUma vida vitoriosa e bem sucedida 4   Abra seu coração para algo impensável
Uma vida vitoriosa e bem sucedida 4 Abra seu coração para algo impensável
 
Croazia ecc.
Croazia ecc.Croazia ecc.
Croazia ecc.
 
Chapa fênix - apresentação - aperj
Chapa   fênix - apresentação - aperjChapa   fênix - apresentação - aperj
Chapa fênix - apresentação - aperj
 
Los si y los no del piercing
Los si y los no del piercingLos si y los no del piercing
Los si y los no del piercing
 
Conocimientos básicos sobre Pago Único
Conocimientos básicos sobre Pago ÚnicoConocimientos básicos sobre Pago Único
Conocimientos básicos sobre Pago Único
 
Wincash
WincashWincash
Wincash
 
scan0005
scan0005scan0005
scan0005
 
Terremoto de Haiti
Terremoto de HaitiTerremoto de Haiti
Terremoto de Haiti
 
Corrida Premiada Melnick Even
Corrida Premiada Melnick EvenCorrida Premiada Melnick Even
Corrida Premiada Melnick Even
 
Driblando a crise com Gestão do Conhecimento
Driblando a crise com Gestão do ConhecimentoDriblando a crise com Gestão do Conhecimento
Driblando a crise com Gestão do Conhecimento
 
Fdfs for cnblogs 1th
Fdfs for cnblogs 1thFdfs for cnblogs 1th
Fdfs for cnblogs 1th
 
Me declaro vivo
Me declaro vivoMe declaro vivo
Me declaro vivo
 
VERDADES SOBRE COMUNIDADES DE PRÁTICA
VERDADES SOBRE COMUNIDADES DE PRÁTICAVERDADES SOBRE COMUNIDADES DE PRÁTICA
VERDADES SOBRE COMUNIDADES DE PRÁTICA
 

Similar a TIS prediction in human cDNAs with high accuracy

презентация за варшава
презентация за варшавапрезентация за варшава
презентация за варшаваValeriya Simeonova
 
IRJET-A Novel Approaches for Motif Discovery using Data Mining Algorithm
IRJET-A Novel Approaches for Motif Discovery using Data Mining AlgorithmIRJET-A Novel Approaches for Motif Discovery using Data Mining Algorithm
IRJET-A Novel Approaches for Motif Discovery using Data Mining AlgorithmIRJET Journal
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0Computer Science Club
 
2013 02-14 - ngs webinar - sellappan
2013 02-14 - ngs webinar - sellappan2013 02-14 - ngs webinar - sellappan
2013 02-14 - ngs webinar - sellappanElsa von Licy
 
CDAC 2018 Merico optimal scoring
CDAC 2018 Merico optimal scoringCDAC 2018 Merico optimal scoring
CDAC 2018 Merico optimal scoringMarco Antoniotti
 
Bioinformatics.Practical Notebook
Bioinformatics.Practical NotebookBioinformatics.Practical Notebook
Bioinformatics.Practical NotebookNaima Tahsin
 
Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...
Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...
Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...QIAGEN
 
primerdesignisscarybutthisisbetterthanthattrustme
primerdesignisscarybutthisisbetterthanthattrustmeprimerdesignisscarybutthisisbetterthanthattrustme
primerdesignisscarybutthisisbetterthanthattrustmeabrarhaider15
 
Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030GenomeInABottle
 
Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...
Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...
Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...QIAGEN
 

Similar a TIS prediction in human cDNAs with high accuracy (20)

31931 31941
31931 3194131931 31941
31931 31941
 
Ascbrn ai poster
Ascbrn ai posterAscbrn ai poster
Ascbrn ai poster
 
презентация за варшава
презентация за варшавапрезентация за варшава
презентация за варшава
 
Microarray Analysis
Microarray AnalysisMicroarray Analysis
Microarray Analysis
 
IRJET-A Novel Approaches for Motif Discovery using Data Mining Algorithm
IRJET-A Novel Approaches for Motif Discovery using Data Mining AlgorithmIRJET-A Novel Approaches for Motif Discovery using Data Mining Algorithm
IRJET-A Novel Approaches for Motif Discovery using Data Mining Algorithm
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Ngs webinar 2013
Ngs webinar 2013Ngs webinar 2013
Ngs webinar 2013
 
2013 02-14 - ngs webinar - sellappan
2013 02-14 - ngs webinar - sellappan2013 02-14 - ngs webinar - sellappan
2013 02-14 - ngs webinar - sellappan
 
CDAC 2018 Merico optimal scoring
CDAC 2018 Merico optimal scoringCDAC 2018 Merico optimal scoring
CDAC 2018 Merico optimal scoring
 
Bioinformatics.Practical Notebook
Bioinformatics.Practical NotebookBioinformatics.Practical Notebook
Bioinformatics.Practical Notebook
 
Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...
Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...
Advanced Real-Time PCR Array Technology – Coding and Noncoding RNA Expression...
 
primerdesignisscarybutthisisbetterthanthattrustme
primerdesignisscarybutthisisbetterthanthattrustmeprimerdesignisscarybutthisisbetterthanthattrustme
primerdesignisscarybutthisisbetterthanthattrustme
 
Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030Genome in a bottle for amp GeT-RM 181030
Genome in a bottle for amp GeT-RM 181030
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...
Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...
Advanced miRNA Expression Analysis: miRNA and its Role in Human Disease Webin...
 
Ijetr042111
Ijetr042111Ijetr042111
Ijetr042111
 
20140711 2 j_willey_ercc2.0_workshop
20140711 2 j_willey_ercc2.0_workshop20140711 2 j_willey_ercc2.0_workshop
20140711 2 j_willey_ercc2.0_workshop
 
PCR technology
PCR technologyPCR technology
PCR technology
 
PCR lecture.ppt
PCR lecture.pptPCR lecture.ppt
PCR lecture.ppt
 

Más de Anax Fotopoulos

Acropoils & other hellenic world monuments
Acropoils & other hellenic world monumentsAcropoils & other hellenic world monuments
Acropoils & other hellenic world monumentsAnax Fotopoulos
 
Architecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode dataArchitecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode dataAnax Fotopoulos
 
From Smart Homes to Smart Cities: An approach based on Internet-of-Things
From Smart Homes to Smart Cities: An approach based on Internet-of-ThingsFrom Smart Homes to Smart Cities: An approach based on Internet-of-Things
From Smart Homes to Smart Cities: An approach based on Internet-of-ThingsAnax Fotopoulos
 
Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models
Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models
Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models Anax Fotopoulos
 
Wef the future role of civil society report 2013
Wef the future role of civil society report 2013Wef the future role of civil society report 2013
Wef the future role of civil society report 2013Anax Fotopoulos
 
UNESCO’s Division for Freedom of Expression, Democracy and Peace Report
UNESCO’s Division for Freedom of  Expression, Democracy and Peace ReportUNESCO’s Division for Freedom of  Expression, Democracy and Peace Report
UNESCO’s Division for Freedom of Expression, Democracy and Peace ReportAnax Fotopoulos
 
Europa Nostra Athens Congress - Registration fees
Europa Nostra Athens Congress - Registration feesEuropa Nostra Athens Congress - Registration fees
Europa Nostra Athens Congress - Registration feesAnax Fotopoulos
 
A new approach in specifying the inverse quadratic matrix in modulo-2 for con...
A new approach in specifying the inverse quadratic matrix in modulo-2 for con...A new approach in specifying the inverse quadratic matrix in modulo-2 for con...
A new approach in specifying the inverse quadratic matrix in modulo-2 for con...Anax Fotopoulos
 
TEI Piraeus IEEE Student Branch Actions 2011-2012
TEI Piraeus IEEE Student Branch  Actions 2011-2012 TEI Piraeus IEEE Student Branch  Actions 2011-2012
TEI Piraeus IEEE Student Branch Actions 2011-2012 Anax Fotopoulos
 
Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ & ΤΩΝ ΜΕΤΡΩΝ Π...
Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ &  ΤΩΝ ΜΕΤΡΩΝ Π...Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ &  ΤΩΝ ΜΕΤΡΩΝ Π...
Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ & ΤΩΝ ΜΕΤΡΩΝ Π...Anax Fotopoulos
 
Measuring the EMF of various widely used electronic devices and their possibl...
Measuring the EMF of various widely used electronic devices and their possibl...Measuring the EMF of various widely used electronic devices and their possibl...
Measuring the EMF of various widely used electronic devices and their possibl...Anax Fotopoulos
 
S.A.V.E. - Social Actions Volunteering Ecosystem
S.A.V.E. - Social Actions Volunteering EcosystemS.A.V.E. - Social Actions Volunteering Ecosystem
S.A.V.E. - Social Actions Volunteering EcosystemAnax Fotopoulos
 
Monte Carlo comparison study of the radiation absorption of scintillators for...
Monte Carlo comparison study of the radiation absorption of scintillators for...Monte Carlo comparison study of the radiation absorption of scintillators for...
Monte Carlo comparison study of the radiation absorption of scintillators for...Anax Fotopoulos
 
Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...
Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...
Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...Anax Fotopoulos
 
Environmental monitoring of soil radon in a very tectonic area in south west ...
Environmental monitoring of soil radon in a very tectonic area in south west ...Environmental monitoring of soil radon in a very tectonic area in south west ...
Environmental monitoring of soil radon in a very tectonic area in south west ...Anax Fotopoulos
 
Similarities in the self organised critical characteristics between radon and...
Similarities in the self organised critical characteristics between radon and...Similarities in the self organised critical characteristics between radon and...
Similarities in the self organised critical characteristics between radon and...Anax Fotopoulos
 
State equations model based on modulo 2 arithmetic and its applciation on rec...
State equations model based on modulo 2 arithmetic and its applciation on rec...State equations model based on modulo 2 arithmetic and its applciation on rec...
State equations model based on modulo 2 arithmetic and its applciation on rec...Anax Fotopoulos
 

Más de Anax Fotopoulos (20)

AFMM Manual
AFMM ManualAFMM Manual
AFMM Manual
 
Acropoils & other hellenic world monuments
Acropoils & other hellenic world monumentsAcropoils & other hellenic world monuments
Acropoils & other hellenic world monuments
 
Architecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode dataArchitecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode data
 
Ret protooncogene
Ret protooncogeneRet protooncogene
Ret protooncogene
 
From Smart Homes to Smart Cities: An approach based on Internet-of-Things
From Smart Homes to Smart Cities: An approach based on Internet-of-ThingsFrom Smart Homes to Smart Cities: An approach based on Internet-of-Things
From Smart Homes to Smart Cities: An approach based on Internet-of-Things
 
Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models
Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models
Introduction to HMMER - A biosequence analysis tool with Hidden Markov Models
 
Wef the future role of civil society report 2013
Wef the future role of civil society report 2013Wef the future role of civil society report 2013
Wef the future role of civil society report 2013
 
UNESCO’s Division for Freedom of Expression, Democracy and Peace Report
UNESCO’s Division for Freedom of  Expression, Democracy and Peace ReportUNESCO’s Division for Freedom of  Expression, Democracy and Peace Report
UNESCO’s Division for Freedom of Expression, Democracy and Peace Report
 
Europa Nostra Athens Congress - Registration fees
Europa Nostra Athens Congress - Registration feesEuropa Nostra Athens Congress - Registration fees
Europa Nostra Athens Congress - Registration fees
 
RMCEF
RMCEFRMCEF
RMCEF
 
A new approach in specifying the inverse quadratic matrix in modulo-2 for con...
A new approach in specifying the inverse quadratic matrix in modulo-2 for con...A new approach in specifying the inverse quadratic matrix in modulo-2 for con...
A new approach in specifying the inverse quadratic matrix in modulo-2 for con...
 
TEI Piraeus IEEE Student Branch Actions 2011-2012
TEI Piraeus IEEE Student Branch  Actions 2011-2012 TEI Piraeus IEEE Student Branch  Actions 2011-2012
TEI Piraeus IEEE Student Branch Actions 2011-2012
 
Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ & ΤΩΝ ΜΕΤΡΩΝ Π...
Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ &  ΤΩΝ ΜΕΤΡΩΝ Π...Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ &  ΤΩΝ ΜΕΤΡΩΝ Π...
Η ΣΚΟΤΕΙΝΗ ΠΛΕΥΡΑ ΤΟΥ ΔΙΑΔΙΚΤΥΟΥ - ΠΑΡΟΥΣΙΑΣΗ ΤΩΝ ΠΡΟΒΛΗΜΑΤΩΝ & ΤΩΝ ΜΕΤΡΩΝ Π...
 
Measuring the EMF of various widely used electronic devices and their possibl...
Measuring the EMF of various widely used electronic devices and their possibl...Measuring the EMF of various widely used electronic devices and their possibl...
Measuring the EMF of various widely used electronic devices and their possibl...
 
S.A.V.E. - Social Actions Volunteering Ecosystem
S.A.V.E. - Social Actions Volunteering EcosystemS.A.V.E. - Social Actions Volunteering Ecosystem
S.A.V.E. - Social Actions Volunteering Ecosystem
 
Monte Carlo comparison study of the radiation absorption of scintillators for...
Monte Carlo comparison study of the radiation absorption of scintillators for...Monte Carlo comparison study of the radiation absorption of scintillators for...
Monte Carlo comparison study of the radiation absorption of scintillators for...
 
Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...
Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...
Semi-empirical Monte Carlo optical-gain modelling of Nuclear Imaging scintill...
 
Environmental monitoring of soil radon in a very tectonic area in south west ...
Environmental monitoring of soil radon in a very tectonic area in south west ...Environmental monitoring of soil radon in a very tectonic area in south west ...
Environmental monitoring of soil radon in a very tectonic area in south west ...
 
Similarities in the self organised critical characteristics between radon and...
Similarities in the self organised critical characteristics between radon and...Similarities in the self organised critical characteristics between radon and...
Similarities in the self organised critical characteristics between radon and...
 
State equations model based on modulo 2 arithmetic and its applciation on rec...
State equations model based on modulo 2 arithmetic and its applciation on rec...State equations model based on modulo 2 arithmetic and its applciation on rec...
State equations model based on modulo 2 arithmetic and its applciation on rec...
 

Último

Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...narwatsonia7
 
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...indiancallgirl4rent
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls ServiceKesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Servicemakika9823
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...astropune
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipurparulsinha
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...jageshsingh5554
 
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...narwatsonia7
 
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...narwatsonia7
 
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoybabeytanya
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Call Girls in Nagpur High Profile
 
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoybabeytanya
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.MiadAlsulami
 

Último (20)

Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
 
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Darjeeling Just Call 9907093804 Top Class Call Girl Service Available
 
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
 
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls ServiceKesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
 
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
 
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
 
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
 
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
 

TIS prediction in human cDNAs with high accuracy

  • 1. Translation initiation start prediction in human cDNAs with high accuracy A. G. Hatzigeorgiou Paper Presentation Introduction to Bioinformatics Anaxagoras Fotopoulos | Marina Adamou - Tzani 21/01/2014
  • 2. Introduction • • • • Primary objective of the present research is contribution to the definition of the coding part of a gene. The search is performed in cDNA sequences. Coding regions are surrounded by UnTraslated Regions (UTRs). The interest is focused in finding the Translation Initiation Start (TIS) which defines the start of the coding region. cDNA complementary DNA (cDNA) is DNA synthesized from a messenger RNA (mRNA) in a reaction catalyzed by the enzymes reverse transcriptase and DNA polymerase. 2
  • 3. Previous Research Salzberg, 1997 Positional Conditional Probability matrix. Generalized Second Order Profiles. • Implementation of the Ribosome Scanning Model (Kozak, 1996) Agarwal and Bafna, 1998a The ribosome first attaches to a specific region in the 5’ end of the mRNA and then scans the sequence for the first ATG • • 3 No significant deferences were observed between the above methods and a weight matrix The above methods are studied in common due to the high rate of false positives.
  • 4. Previous Research Pedersen and Nielsen, 1997 Usage of ANNs for the recognition of local context and statistical properties around the TIS. Large region of analysis 100 bases before and 100 after the start codon Salamov et. al., 1998 Zien et. al., 2000 Six characteristics are applied for the analysis of the region around TIS including weight matrix and hexanucleotide difference. Use of Support Vector Machines (SVMs) for TIS prediction All of the above methods give up to 85% correct predictions. 4
  • 5. Methods – Suggested Model Swissprot 475 cDNAs (Verified + Checked) Training Gene Pool Parameter estimation Training Set + Evaluation Set Conserved Motif Test Gene Pool TIS Prediction Consensus Test Set NN Score Multiplication Training Gene Pool Parameter estimation Training Set + Evaluation Set Test Gene Pool TIS Prediction Test Set 5 Coding/ Non Coding Potential Coding NN
  • 6. Consensus Neural Network 325 positive + 325 negative examples 12-nucleotides long window Feed forward with short cut connections & two hidden units trained with cascade correlation algorithm Selection of the appropriate feed-forward NN Binirization of the input Cascade Correlation Algorithm 6
  • 7. Coding Neural Network 54 nucleotides length window Use Smith – Waterman algorithm for the elimination of homologies between training and test data 12-nucleotides usage static long window Apply codon 250 positive (Count for every window all non-overlapping codons) 250 negative The sequence window is rescaled to 64 units 7 + Sequence regions extracted for testing Every unit gives the normalized frequency of the codon in the window 282 genes with less than 70% homology were used for training 700 positive + 700 negative Sequence regions extracted for training Resilient backpropagation algorithm is applied to a feed-forward NN.
  • 8. Integrated method Analysis of full length mRNA sequences 1st stage • Calculation of coding score for every nucleotide of the mRNA sequence 2nd stage • Calculation of coding evidence of the coding region included in the longest ORF of the sequence 3rd stage 4th stage • For every in-frame ATG a consensus score is calculated • For the same inframe ATG, a coding difference score is calculated The final score is obtained by combining the output of the consensus ANN and the coding difference 8
  • 9. Integrated method Analysis of full length mRNA sequences • This method provides only one prediction for every ORF • According to the results of the test group: • 94% of the TIS were correctly predicted • 6% of the predictions were false positive The use of the Las Vegas algorithm gives a confident decision. The incorporation of this algorithm leads to a highly accurate recognition of the TIS in human cDNAs for 60% of the cases! Las Vegas 9 Las Vegas algorithm provides a correct prediction in some cases and has a “no answer” option in the remaining cases. That is, it always produces the correct result or it informs about the failure.
  • 10. Results – Score Combination 1/3 Nucleotide 255 : cod 0.98 – local 0.2 10 A score combination of coding ANN and consensus ANN gives low final score. Cod line: Score of coding ANN Local line: Score and position of consensus ANN for all ATGs in coding frame
  • 11. Results – Score Combination 2/3 Nucleotide 270: cod 0.44 – local 0.4 11 A score combination of coding ANN and consensus ANN gives low final score. Cod line: Score of coding ANN Local line: Score and position of consensus ANN for all ATGs in coding frame
  • 12. Results – Score Combination 3/3 Correct TIS Nucleotide 148: cod 0.95 – local 0.8 12 A score combination of coding ANN and consensus ANN gives high final score. Cod line: Score of coding ANN Local line: Score and position of consensus ANN for all ATGs in coding frame
  • 13. Results – Methods Comparison Correct TIS positions 13
  • 14. Results – Methods Comparison Prediction for the 3 TIS positions with the highest scores 14
  • 15. Results – Methods Comparison Consensus motif scores (only for DIANA-TIS) 15
  • 16. Results – Methods Comparison Final scores 16
  • 17. Results – Methods Comparison Correct predictions 17
  • 18. Results – Methods Comparison Prediction Analysis High prediction score difference TIS correct position: 471 Did not find TIS 18 Found TIS but other higher score exists
  • 19. Results – Methods Comparison Performance of the three programs for TIS prediction along the mRNA with signal peptide sequences Correct TIS positions 19
  • 20. Results – Methods Comparison Length of signal peptide 20
  • 21. Results – Methods Comparison Prediction for the 2 TIS positions with the highest scores 21
  • 22. Results – Methods Comparison Consensus motif scores only for DIANA-TIS) 22
  • 23. Results – Methods Comparison Final scores 23
  • 24. Results – Methods Comparison Prediction example #1: DIANA-TIS is able to distinguish between TIS and other ATGs better than other ANN based programs like NetStart: 2 suitable ATGs are 12 nucleotides away Coding/non-coding information is similar Consensus motif is completely different 24
  • 25. Results – Methods Comparison Prediction example #2: A favorable prediction does not work for all examples: Consensus motif is completely different Combined score is much lower In some signal peptides sequences the coding potential score is relatively low, and can thus affect the combined score. 25
  • 26. Results – Methods Comparison TIS prediction program TIS prediction rate DIANA-TIS (2001) 94% Agarwal & Bafna (1998) 85% ATGPred (Salamov et al, 1998) 79% NetStart (Pedersen & Nielsen, 1997) 78% These methods allow more than one prediction per gene Notice The results come from different datasets and thus these numbers should not be directly compared. 26
  • 27. Thank you! Introduction to Bioinformatics Information Technologies in Medicine and Biology National & Kapodistrian University of Athens Department of Informatics Biomedical Research Foundation Academy of Athens 27 Technological Education Institute of Athens Department of Biomedical Engineering Demokritos National Center for Scientific Research