SlideShare a Scribd company logo
1 of 57
Download to read offline
Dr. Stickle

an EST-Database
Dr. Stickle
an EST-Database
The Goal
The Goal


• Creating a pipeline for EST-Analysis
The Goal


• Creating a pipeline for EST-Analysis
• Displaying the results via an online
 framework
wtf is a pipeline?
Different steps of analysis performed in an
             automated fashion




       wtf is a pipeline?
wtf is a pipeline?


Different steps of analysis performed in an
             automated fashion
Analysis:
✓Assembly of EST-Reads into contigs
✓SNP-Detection
               MIRA
               But:
       ★Takes ages
       ★not well documented
       ★buggy
✓Assembly of EST-Reads into contigs
✓SNP-Detection
               MIRA
               But:
       ★Takes ages
       ★not well documented
       ★buggy
✓Assembly of EST-Reads into contigs
✓SNP-Detection
               MIRA
               But:
       ★Takes ages
       ★not well documented
       ★buggy
MIRA
✓Assembly of EST-Reads into contigs
✓SNP-Detection


               But:
       ★Takes ages
       ★not well documented
       ★buggy
MIRA
✓Assembly of EST-Reads into contigs
✓SNP-Detection


               But:
       ★Takes ages
       ★not well documented
       ★buggy
MIRA
✓Assembly of EST-Reads into contigs
✓SNP-Detection

               But:
       ★Takes ages
       ★not well documented
       ★buggy
MIRA
✓Assembly of EST-Reads into contigs
✓SNP-Detection

               But:
       ★Takes ages
       ★not well documented
       ★buggy
SNPs   ORFs   Contigs BLAST
               MIRA




       PFAM        BLAST2GO
MIRA




SNPs   ORFs   Contigs BLAST




       PFAM           BLAST2GO
MIRA
              Contigs




SNPs   ORFs             BLAST




       PFAM        BLAST2GO
MIRA
              Contigs




SNPs   ORFs             BLAST




       PFAM        BLAST2GO
MIRA
              Contigs




SNPs   ORFs             BLAST




       PFAM        BLAST2GO
MIRA
              Contigs




SNPs   ORFs             BLAST




       PFAM        BLAST2GO
MIRA
              Contigs




SNPs   ORFs             BLAST




       PFAM        BLAST2GO
BLAST
Basic Local Alignment Search Tool
BLAST
Basic Local Alignment Search Tool
BLAST
              Basic Local Alignment Search Tool




• Standard for searching sequences against
 a database
BLAST
            Basic Local Alignment Search Tool




• Standard for searching sequences against
  a database
• emphasizes speed over sensitivity
BLAST
            Basic Local Alignment Search Tool




• Standard for searching sequences against
  a database
• emphasizes speed over sensitivity
BLAST
            Basic Local Alignment Search Tool




• Standard for searching sequences against
  a database
• emphasizes speed over sensitivity
BLAST
            Basic Local Alignment Search Tool




• Standard for searching sequences against
  a database
• emphasizes speed over sensitivity
Tools


                   gene ontology
Blast2GO
                     mapping


              open reading frame
  ORF
                  prediction


 PFAM          domain annotation
Blast2GO
Tools


                   gene ontology
Blast2GO
                     mapping


              open reading frame
  ORF
                  prediction


 PFAM          domain annotation
ORF
Tools


                   gene ontology
Blast2GO
                     mapping


              open reading frame
  ORF
                  prediction


 PFAM          domain annotation
PFAM
Ugly *.ace-output generated via MIRA
What we‘ve got here:
What we‘ve got here:

•Different tools
•many different output-files
What we‘ve got here:

     •Different tools
     •many different output-files

             What we want:

a structured database containing all the
              information
How to parse


Class «Parser»
 •Function BLAST-Parser
 •Function PFAM-Parser
 •Function FASTA-Parser
 •...
                                         Data



Script
 •read input
 •use parser
 •insert db
How to parse


Class «Parser»
 •Function BLAST-Parser
 •Function PFAM-Parser
 •Function FASTA-Parser
 •...
                                         Data



Script
 •read input
 •use parser
 •insert db
How to parse


Class «Parser»
 •Function BLAST-Parser
 •Function PFAM-Parser
 •Function FASTA-Parser
 •...
                                         Data



Script
 •read input
 •use parser
 •insert db
How to parse


Class «Parser»
 •Function BLAST-Parser
 •Function PFAM-Parser
 •Function FASTA-Parser
 •...
                                         Data



Script
 •read input
 •use parser
 •insert db
How to parse

                                Data
Class «Parser»
 •Function BLAST-Parser
 •Function PFAM-Parser
 •Function FASTA-Parser
 •...            Script
                  •read input
                  •use parser
                  •insert db


                           Database
>abc_123
agtagtacgtacgtggacgtatgact
>def_456
agtagtacgtacgtggacgtatgact
Summary & Results
Summary & Results

•created the pipeline
•analysed data
•started filling the database
Summary & Results

•created the pipeline
•analysed data
•started filling the database

        To be done


•wait for MIRA
•SNP-parser
thx to:


•Marvin, for «time till scooter» and sending us to Lothar
•Lothar, for providing always friendly and calm advice
•Suse, for actually having used MIRA at least once
•Andrew, for Andreas
•Andreas, for Andrew
•Bastien Chevreux, for not fixing those damn bugs in MIRA
k,thx,bai
Dr. Iglo
Dr. Iglo

More Related Content

Similar to Dr. Stickle

ICAR 2015 Workshop - Agnes Chan
ICAR 2015 Workshop - Agnes ChanICAR 2015 Workshop - Agnes Chan
ICAR 2015 Workshop - Agnes ChanAraport
 
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...Databricks
 
தமிழ்க்கணிமை கட்டமைப்பு
தமிழ்க்கணிமை கட்டமைப்புதமிழ்க்கணிமை கட்டமைப்பு
தமிழ்க்கணிமை கட்டமைப்புBalaSundaraRaman (Sundar)
 
From Zero to Nextflow 2017
From Zero to Nextflow 2017From Zero to Nextflow 2017
From Zero to Nextflow 2017Luca Cozzuto
 
Jan2015 GIAB intro, Update, and Data Analysis Planning
Jan2015 GIAB intro, Update, and Data Analysis PlanningJan2015 GIAB intro, Update, and Data Analysis Planning
Jan2015 GIAB intro, Update, and Data Analysis PlanningGenomeInABottle
 
The RNA workbench - Galaxy User Conference 2018
The RNA workbench - Galaxy User Conference 2018The RNA workbench - Galaxy User Conference 2018
The RNA workbench - Galaxy User Conference 2018Florian Eggenhofer
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of AraportAraport
 
Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Nathan Olson
 
Computational biology bls 303
Computational biology bls 303Computational biology bls 303
Computational biology bls 303Bruno Mmassy
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Prof. Wim Van Criekinge
 
Creating a SNP calling pipeline
Creating a SNP calling pipelineCreating a SNP calling pipeline
Creating a SNP calling pipelineDan Bolser
 
Introduction to NGS
Introduction to NGSIntroduction to NGS
Introduction to NGScursoNGS
 
Enabling Biobank-Scale Genomic Processing with Spark SQL
Enabling Biobank-Scale Genomic Processing with Spark SQLEnabling Biobank-Scale Genomic Processing with Spark SQL
Enabling Biobank-Scale Genomic Processing with Spark SQLDatabricks
 
Programming for biologists
Programming for biologistsProgramming for biologists
Programming for biologistsjigma
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pubsesejun
 
The Postmodern Binary Analysis
The Postmodern Binary AnalysisThe Postmodern Binary Analysis
The Postmodern Binary AnalysisOnur Alanbel
 
Presentation on FASTA
Presentation on FASTAPresentation on FASTA
Presentation on FASTANancy599470
 
Species identification.pptx
Species identification.pptxSpecies identification.pptx
Species identification.pptxMaiAnh409544
 

Similar to Dr. Stickle (20)

ChipSeq Data Analysis
ChipSeq Data AnalysisChipSeq Data Analysis
ChipSeq Data Analysis
 
ICAR 2015 Workshop - Agnes Chan
ICAR 2015 Workshop - Agnes ChanICAR 2015 Workshop - Agnes Chan
ICAR 2015 Workshop - Agnes Chan
 
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...
 
தமிழ்க்கணிமை கட்டமைப்பு
தமிழ்க்கணிமை கட்டமைப்புதமிழ்க்கணிமை கட்டமைப்பு
தமிழ்க்கணிமை கட்டமைப்பு
 
From Zero to Nextflow 2017
From Zero to Nextflow 2017From Zero to Nextflow 2017
From Zero to Nextflow 2017
 
Jan2015 GIAB intro, Update, and Data Analysis Planning
Jan2015 GIAB intro, Update, and Data Analysis PlanningJan2015 GIAB intro, Update, and Data Analysis Planning
Jan2015 GIAB intro, Update, and Data Analysis Planning
 
The RNA workbench - Galaxy User Conference 2018
The RNA workbench - Galaxy User Conference 2018The RNA workbench - Galaxy User Conference 2018
The RNA workbench - Galaxy User Conference 2018
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of Araport
 
Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.
 
Computational biology bls 303
Computational biology bls 303Computational biology bls 303
Computational biology bls 303
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
 
Creating a SNP calling pipeline
Creating a SNP calling pipelineCreating a SNP calling pipeline
Creating a SNP calling pipeline
 
Introduction to NGS
Introduction to NGSIntroduction to NGS
Introduction to NGS
 
Enabling Biobank-Scale Genomic Processing with Spark SQL
Enabling Biobank-Scale Genomic Processing with Spark SQLEnabling Biobank-Scale Genomic Processing with Spark SQL
Enabling Biobank-Scale Genomic Processing with Spark SQL
 
Programming for biologists
Programming for biologistsProgramming for biologists
Programming for biologists
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pub
 
The Postmodern Binary Analysis
The Postmodern Binary AnalysisThe Postmodern Binary Analysis
The Postmodern Binary Analysis
 
Presentation on FASTA
Presentation on FASTAPresentation on FASTA
Presentation on FASTA
 
Fasta
FastaFasta
Fasta
 
Species identification.pptx
Species identification.pptxSpecies identification.pptx
Species identification.pptx
 

More from Bastian Greshake

2020 03-11-open-life-sciences
2020 03-11-open-life-sciences2020 03-11-open-life-sciences
2020 03-11-open-life-sciencesBastian Greshake
 
openSNP @ Geekend Darmstadt
openSNP @ Geekend DarmstadtopenSNP @ Geekend Darmstadt
openSNP @ Geekend DarmstadtBastian Greshake
 
Crowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of GenomesCrowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of GenomesBastian Greshake
 
openSNP - QS Cologne Meetup
openSNP - QS Cologne MeetupopenSNP - QS Cologne Meetup
openSNP - QS Cologne MeetupBastian Greshake
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesBastian Greshake
 
Was die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutetWas die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutetBastian Greshake
 
PiratenMS - Google Street View
PiratenMS - Google Street ViewPiratenMS - Google Street View
PiratenMS - Google Street ViewBastian Greshake
 
Next Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome AnalysisNext Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome AnalysisBastian Greshake
 
Medienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen NetzwerkenMedienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen NetzwerkenBastian Greshake
 
Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?Bastian Greshake
 
Uncanny Valley - Affen vs. Menschen
Uncanny Valley - Affen vs. MenschenUncanny Valley - Affen vs. Menschen
Uncanny Valley - Affen vs. MenschenBastian Greshake
 

More from Bastian Greshake (19)

My Life in Lockdown
My Life in LockdownMy Life in Lockdown
My Life in Lockdown
 
2020 03-11-open-life-sciences
2020 03-11-open-life-sciences2020 03-11-open-life-sciences
2020 03-11-open-life-sciences
 
openSNP @ Geekend Darmstadt
openSNP @ Geekend DarmstadtopenSNP @ Geekend Darmstadt
openSNP @ Geekend Darmstadt
 
Crowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of GenomesCrowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of Genomes
 
openSNP - QS Cologne Meetup
openSNP - QS Cologne MeetupopenSNP - QS Cologne Meetup
openSNP - QS Cologne Meetup
 
The Future of Genetics
The Future of GeneticsThe Future of Genetics
The Future of Genetics
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association Studies
 
Was die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutetWas die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutet
 
Crowdsourcing GWAS
Crowdsourcing GWASCrowdsourcing GWAS
Crowdsourcing GWAS
 
Gentechnik
GentechnikGentechnik
Gentechnik
 
Lernen durch Lehren
Lernen durch LehrenLernen durch Lehren
Lernen durch Lehren
 
Haushalt 2011 Münster
Haushalt 2011 MünsterHaushalt 2011 Münster
Haushalt 2011 Münster
 
LiquidFeedback Workshop
LiquidFeedback WorkshopLiquidFeedback Workshop
LiquidFeedback Workshop
 
PiratenMS - Google Street View
PiratenMS - Google Street ViewPiratenMS - Google Street View
PiratenMS - Google Street View
 
Next Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome AnalysisNext Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome Analysis
 
Medienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen NetzwerkenMedienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen Netzwerken
 
Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?
 
Uncanny Valley - Affen vs. Menschen
Uncanny Valley - Affen vs. MenschenUncanny Valley - Affen vs. Menschen
Uncanny Valley - Affen vs. Menschen
 
Meerschweinchen
MeerschweinchenMeerschweinchen
Meerschweinchen
 

Recently uploaded

Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...Nguyen Thanh Tu Collection
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...Nguyen Thanh Tu Collection
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Celine George
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfChristalin Nelson
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxryandux83rd
 
6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroomSamsung Business USA
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
Objectives n learning outcoms - MD 20240404.pptx
Objectives n learning outcoms - MD 20240404.pptxObjectives n learning outcoms - MD 20240404.pptx
Objectives n learning outcoms - MD 20240404.pptxMadhavi Dharankar
 
4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptxmary850239
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6Vanessa Camilleri
 
Unit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional IntelligenceUnit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional IntelligenceDr Vijay Vishwakarma
 
DiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfDiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfChristalin Nelson
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxAnupam32727
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEMISSRITIMABIOLOGYEXP
 

Recently uploaded (20)

Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdf
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptx
 
6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
Objectives n learning outcoms - MD 20240404.pptx
Objectives n learning outcoms - MD 20240404.pptxObjectives n learning outcoms - MD 20240404.pptx
Objectives n learning outcoms - MD 20240404.pptx
 
4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6
 
Unit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional IntelligenceUnit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional Intelligence
 
DiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfDiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdf
 
CARNAVAL COM MAGIA E EUFORIA _
CARNAVAL COM MAGIA E EUFORIA            _CARNAVAL COM MAGIA E EUFORIA            _
CARNAVAL COM MAGIA E EUFORIA _
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
 

Dr. Stickle

  • 4. The Goal • Creating a pipeline for EST-Analysis
  • 5. The Goal • Creating a pipeline for EST-Analysis • Displaying the results via an online framework
  • 6. wtf is a pipeline?
  • 7. Different steps of analysis performed in an automated fashion wtf is a pipeline?
  • 8. wtf is a pipeline? Different steps of analysis performed in an automated fashion
  • 10. ✓Assembly of EST-Reads into contigs ✓SNP-Detection MIRA But: ★Takes ages ★not well documented ★buggy
  • 11. ✓Assembly of EST-Reads into contigs ✓SNP-Detection MIRA But: ★Takes ages ★not well documented ★buggy
  • 12. ✓Assembly of EST-Reads into contigs ✓SNP-Detection MIRA But: ★Takes ages ★not well documented ★buggy
  • 13. MIRA ✓Assembly of EST-Reads into contigs ✓SNP-Detection But: ★Takes ages ★not well documented ★buggy
  • 14. MIRA ✓Assembly of EST-Reads into contigs ✓SNP-Detection But: ★Takes ages ★not well documented ★buggy
  • 15. MIRA ✓Assembly of EST-Reads into contigs ✓SNP-Detection But: ★Takes ages ★not well documented ★buggy
  • 16. MIRA ✓Assembly of EST-Reads into contigs ✓SNP-Detection But: ★Takes ages ★not well documented ★buggy
  • 17. SNPs ORFs Contigs BLAST MIRA PFAM BLAST2GO
  • 18. MIRA SNPs ORFs Contigs BLAST PFAM BLAST2GO
  • 19. MIRA Contigs SNPs ORFs BLAST PFAM BLAST2GO
  • 20. MIRA Contigs SNPs ORFs BLAST PFAM BLAST2GO
  • 21. MIRA Contigs SNPs ORFs BLAST PFAM BLAST2GO
  • 22. MIRA Contigs SNPs ORFs BLAST PFAM BLAST2GO
  • 23. MIRA Contigs SNPs ORFs BLAST PFAM BLAST2GO
  • 26. BLAST Basic Local Alignment Search Tool • Standard for searching sequences against a database
  • 27. BLAST Basic Local Alignment Search Tool • Standard for searching sequences against a database • emphasizes speed over sensitivity
  • 28. BLAST Basic Local Alignment Search Tool • Standard for searching sequences against a database • emphasizes speed over sensitivity
  • 29. BLAST Basic Local Alignment Search Tool • Standard for searching sequences against a database • emphasizes speed over sensitivity
  • 30. BLAST Basic Local Alignment Search Tool • Standard for searching sequences against a database • emphasizes speed over sensitivity
  • 31. Tools gene ontology Blast2GO mapping open reading frame ORF prediction PFAM domain annotation
  • 33. Tools gene ontology Blast2GO mapping open reading frame ORF prediction PFAM domain annotation
  • 34. ORF
  • 35. Tools gene ontology Blast2GO mapping open reading frame ORF prediction PFAM domain annotation
  • 36. PFAM
  • 37.
  • 40. What we‘ve got here: •Different tools •many different output-files
  • 41. What we‘ve got here: •Different tools •many different output-files What we want: a structured database containing all the information
  • 42. How to parse Class «Parser» •Function BLAST-Parser •Function PFAM-Parser •Function FASTA-Parser •... Data Script •read input •use parser •insert db
  • 43. How to parse Class «Parser» •Function BLAST-Parser •Function PFAM-Parser •Function FASTA-Parser •... Data Script •read input •use parser •insert db
  • 44. How to parse Class «Parser» •Function BLAST-Parser •Function PFAM-Parser •Function FASTA-Parser •... Data Script •read input •use parser •insert db
  • 45. How to parse Class «Parser» •Function BLAST-Parser •Function PFAM-Parser •Function FASTA-Parser •... Data Script •read input •use parser •insert db
  • 46. How to parse Data Class «Parser» •Function BLAST-Parser •Function PFAM-Parser •Function FASTA-Parser •... Script •read input •use parser •insert db Database
  • 48.
  • 49.
  • 50.
  • 52. Summary & Results •created the pipeline •analysed data •started filling the database
  • 53. Summary & Results •created the pipeline •analysed data •started filling the database To be done •wait for MIRA •SNP-parser
  • 54. thx to: •Marvin, for «time till scooter» and sending us to Lothar •Lothar, for providing always friendly and calm advice •Suse, for actually having used MIRA at least once •Andrew, for Andreas •Andreas, for Andrew •Bastien Chevreux, for not fixing those damn bugs in MIRA