SlideShare una empresa de Scribd logo
1 de 48
MOM 2010  Bioinformatics: Reality Check & Challenges Dr. SouhamMeshoul Information Technology Dept. CCIS, KSU smeshoul@ksu.edu.sa Dr. NikhatSiddiqi Biochemistry Dept. CS, KSU nikkat@ksu.edu.sa & MOM 2010 ,  May 31 st
Outline ,[object Object]
  Central Dogma of Molecular Biology
  Biological Data representation
  How Computers can be useful in Biology
  Intelligent bioinformatics
  Challenges
  Conclusion,[object Object]
Introduction
Introduction What is Bioinformatics? Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline Bioinformatics is the electronic  infrastructure of Biology Science that encompasses the methods that are used to collect, store, retrieve, analyze, and correlate the mountain of complex biological information. Source: http://ccb.wustl.edu/
Introduction Bioinformatics Vs Computational biology Source: http://ccb.wustl.edu/
Introduction The problem:              basic understanding of how gene sequences code specific proteins     l   Lack of the information necessary to completely understand role of DNA in specific diseases or the functions of the thousands of proteins that are produced. The goals: provide scientists with a means to explain: Biological processes.  Malfunctions in these processes which lead to diseases.  Drug discovery and their mode of action.
Central Dogma of Molecular Biology
Central Dogma of Molecular Biology
Central Dogma of Molecular Biology DNA is responsible for all the hereditary information in an organism.
Central Dogma of Molecular Biology
Deoxyribonucleic Acid (DNA) DNA is found inside a special area of the cell called the nucleus. Because the cell is very small, and because organisms have many DNA molecules per cell, each DNA molecule must be tightly packaged. This packaged form of the DNA is called a chromosome.
Deoxyribonucleic Acid (DNA)Composition DNA is made of chemical building blocks called nucleotides.   What is DNA made of? The four types of nitrogen bases found in nucleotides are: adenine (A), , thymine (T), guanine (G) and cytosine (C). The order, or sequence, of these bases determines what biological instructions are contained in a strand of DNA. For example, the sequence ATCGTT might instruct for blue eyes, while ATCGCT might instruct for brown.
Deoxyribonucleic Acid (DNA)Genes Genes These unique coding sections of DNA that ultimately are transcribed into unique mRNA which are translated into unique proteins are called genes.
Deoxyribonucleic Acid (DNA)Function What does DNA do? DNA contains the instructions needed for an organism to develop, survive and reproduce. To carry out these functions, DNA sequences must be converted into messages that can be used to produce proteins, which are the complex molecules that do most of the work in our bodies. To form a strand of DNA, nucleotides are linked into chains, with the phosphate and sugar groups alternating.
Deoxyribonucleic Acid (DNA)Replication Replication Chromosomes are located in the nucleus of a cell.  DNA must be duplicated in a process called replication before a cell divides.  The replication of DNA allows each daughter cell to contain a full complement of chromosomes.  
Deoxyribonucleic Acid (DNA)Transcription Transcription:  The actual information in the DNA of chromosomes is decoded in a process called transcription   through the formation of another nucleic acid, ribonucleic acid or RNA.
Deoxyribonucleic Acid (DNA)Translation Translation: The information from the DNA, now in the form of a linear RNA sequence, is decoded in a process called translation, to form a protein, another biological polymer.
Biological Data Representation
Biological Data Representation
Biological Data Representation Strings: to represent DNA, RNA and sequences of amino-acids    DNA: {A,C,G,T}, RNA: {A,C,G,U}, Protein: {A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y} e.g5’GTAAAGTCCCGTTAGC 3’ Image source; www.ebi.ac.uk/microarray/ biology_intro.htm
Biological Data Representation Trees: to represent the evolution of various organisms
Biological Data Representation Sets of 3D points: to represent the protein structure
Biological Data Representation Graphs: to represent metabolic and signaling pathways.
How can computers be useful for biology?
How can computers be useful to biology? First, Computing technology for storing DNA sequences and constructing these latter from fragments. Data storage and access requirements –  Study of the organization and evolution of genomes through comparative genome analysis. Visualisation tools and techniques requirements, sequence analysis requirements .
How can computers be useful to biology? Structuring and organizing large databases using a common ontology . Access data from different databases using the same query language. (Gene Ontology Consortium ) Many areas of biology use images to communicate their results. Tools and techniques for searching, describing, manipulating and analyzing features within these images.
How can computers be useful to biology? Databases maintenance: Need to check consistency of databases for a valid and error free content.  Storing protein sequences, their structure as well as their function. Tools and techniques for manipulating protein sequences, protein secondary and tertiary structure prediction…
How can computers be useful for biology? Several databases: genome databases, protein sequence databases, metabolic databases, Microarray databases EMBL-EBI (Europe),  GenBank-NCBI (USA),  DDBJ (Japan) Basic Local Alignment Search Tool (BLAST) program for quick DB searches.
How can computers be useful for biology? Several algorithms:  Sequence Comparison Algorithms: Needleman-Wunch (global alignment  1970),Smith-Waterman algorithm: Local sequence alignment (1981). BLAST,  FASTA, CLUSTALW, MEME...etc http://en.wikipedia.org/wiki/Category:Bioinformatics_algorithms
Intelligent Bioinformatics ,[object Object]
  Data interpretation and generation of hypotheses requires intelligence.
  AI offers established methods for knowledge representation and “intelligent” data interpretation.
  Predict utilization of AI in bioinformatics to increase.,[object Object]
Intelligent Bioinformatics AI  and computational intelligence techniques and models: Basic search techniques  A*, Branch and Bound,  Genetic Algorithms  Simulated Annealing  Particle Swarm Optimization  Neural networks,  Support Vector Machines, K nearest neighbors, Hidden Markov Models,….
Intelligent Bioinformatics:An example: Sequence Alignment A - T G G G G - - T T A - T A C C C - G - A G - G T T G T G T - A - - A C C A - G C Possible alignment Possible alignment S1=AGGTC S2=GTTCG S3=TGAAC For lengthy sequences the problem is  very hard to solve: ,[object Object]
Search strategy
Scoring function,[object Object]
Intelligent BioinformaticsAn example: the robot scientist Source: BBC News ,[object Object]
 Designed for the study of functional genomics
 Tested on yeast metabolic pathways
 Uses knowledge representation schemes
Utilizes a Prolog database to store background biological information.

Más contenido relacionado

La actualidad más candente

Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
Shruthi Choudary
 

La actualidad más candente (20)

History and devolopment of bioinfomatics.ppt (1)
History and devolopment of bioinfomatics.ppt (1)History and devolopment of bioinfomatics.ppt (1)
History and devolopment of bioinfomatics.ppt (1)
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in BiotechnologyBioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in Biotechnology
 
Bioinformatics in present and its future
Bioinformatics in present and its futureBioinformatics in present and its future
Bioinformatics in present and its future
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
 
introduction of Bioinformatics
introduction of Bioinformaticsintroduction of Bioinformatics
introduction of Bioinformatics
 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
 
Bioinformatics & its scope in biotech.
Bioinformatics & its scope in biotech.Bioinformatics & its scope in biotech.
Bioinformatics & its scope in biotech.
 
Bioinformatics for beginners
Bioinformatics for beginnersBioinformatics for beginners
Bioinformatics for beginners
 

Similar a MoM2010: Bioinformatics

57 bio infomark
57 bio infomark57 bio infomark
57 bio infomark
phdcao
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
Martín Arrieta
 

Similar a MoM2010: Bioinformatics (20)

LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdf
 
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
Lecture 1 Introduction to Bioinformatics BCH 433.pptLecture 1 Introduction to Bioinformatics BCH 433.ppt
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
 
Pcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iPcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture i
 
Bio Inspired Computing Final Version
Bio Inspired Computing Final VersionBio Inspired Computing Final Version
Bio Inspired Computing Final Version
 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
 
Synthetic Biology
Synthetic BiologySynthetic Biology
Synthetic Biology
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
 
57 bio infomark
57 bio infomark57 bio infomark
57 bio infomark
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptx
 
Bioinformatics manual
Bioinformatics manualBioinformatics manual
Bioinformatics manual
 
BIOINFO unit 1.pptx
BIOINFO unit 1.pptxBIOINFO unit 1.pptx
BIOINFO unit 1.pptx
 
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
Enabling next-generation sequencing applications with IBM Storwize V7000 Unif...
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
 
A comparative study using different measure of filteration
A comparative study using different measure of filterationA comparative study using different measure of filteration
A comparative study using different measure of filteration
 

Más de Hend Al-Khalifa

مقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجي
مقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجيمقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجي
مقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجي
Hend Al-Khalifa
 
CS4HS Booklet- In Arabic
CS4HS Booklet- In ArabicCS4HS Booklet- In Arabic
CS4HS Booklet- In Arabic
Hend Al-Khalifa
 
Website Development Process
Website Development ProcessWebsite Development Process
Website Development Process
Hend Al-Khalifa
 
MoM2010: Data mining as an Saudi IT nation demand
MoM2010: Data mining as an Saudi IT nation demandMoM2010: Data mining as an Saudi IT nation demand
MoM2010: Data mining as an Saudi IT nation demand
Hend Al-Khalifa
 
MoM2010: Saudi Commission for Tourism and Antiquities
MoM2010: Saudi Commission for Tourism and AntiquitiesMoM2010: Saudi Commission for Tourism and Antiquities
MoM2010: Saudi Commission for Tourism and Antiquities
Hend Al-Khalifa
 

Más de Hend Al-Khalifa (20)

أساسيات النشر العلمي
أساسيات النشر العلميأساسيات النشر العلمي
أساسيات النشر العلمي
 
عشرة أسئلة تمشكل الباحثين
عشرة أسئلة تمشكل الباحثينعشرة أسئلة تمشكل الباحثين
عشرة أسئلة تمشكل الباحثين
 
A Survey of IT Jobs in the Kingdom of Saudi Arabia 2017
A Survey of IT Jobs in the Kingdom of Saudi Arabia 2017A Survey of IT Jobs in the Kingdom of Saudi Arabia 2017
A Survey of IT Jobs in the Kingdom of Saudi Arabia 2017
 
How to search isi for Quartiles
How to search isi for QuartilesHow to search isi for Quartiles
How to search isi for Quartiles
 
مقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجي
مقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجيمقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجي
مقدمة في بناء الأنطولوجيا باستخدام برنامج البورتجي
 
صندوق أدوات البحث العلمي
صندوق أدوات البحث العلميصندوق أدوات البحث العلمي
صندوق أدوات البحث العلمي
 
Mediterranean Arabic Language and Speech Technology Resources
Mediterranean Arabic Language and Speech Technology ResourcesMediterranean Arabic Language and Speech Technology Resources
Mediterranean Arabic Language and Speech Technology Resources
 
توجهات تقنية مبتكرة في التعلم الإلكتروني: من التقليدية للإبداعية
توجهات تقنية مبتكرة في التعلم الإلكتروني: من التقليدية للإبداعيةتوجهات تقنية مبتكرة في التعلم الإلكتروني: من التقليدية للإبداعية
توجهات تقنية مبتكرة في التعلم الإلكتروني: من التقليدية للإبداعية
 
كتيب ورشة السجل البحثي
كتيب ورشة السجل البحثيكتيب ورشة السجل البحثي
كتيب ورشة السجل البحثي
 
CIS_CfP
CIS_CfPCIS_CfP
CIS_CfP
 
Nstc
NstcNstc
Nstc
 
Dr.Nawal E-Profile
Dr.Nawal E-ProfileDr.Nawal E-Profile
Dr.Nawal E-Profile
 
It999 seminar 2012
It999 seminar 2012It999 seminar 2012
It999 seminar 2012
 
CS4HS Booklet- In Arabic
CS4HS Booklet- In ArabicCS4HS Booklet- In Arabic
CS4HS Booklet- In Arabic
 
Summer internship
Summer internshipSummer internship
Summer internship
 
Introduction to XHTML
Introduction to XHTMLIntroduction to XHTML
Introduction to XHTML
 
Website Development Process
Website Development ProcessWebsite Development Process
Website Development Process
 
MoM2010: Data mining as an Saudi IT nation demand
MoM2010: Data mining as an Saudi IT nation demandMoM2010: Data mining as an Saudi IT nation demand
MoM2010: Data mining as an Saudi IT nation demand
 
MoM2010: E-Hospitals
MoM2010: E-HospitalsMoM2010: E-Hospitals
MoM2010: E-Hospitals
 
MoM2010: Saudi Commission for Tourism and Antiquities
MoM2010: Saudi Commission for Tourism and AntiquitiesMoM2010: Saudi Commission for Tourism and Antiquities
MoM2010: Saudi Commission for Tourism and Antiquities
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

MoM2010: Bioinformatics

  • 1. MOM 2010 Bioinformatics: Reality Check & Challenges Dr. SouhamMeshoul Information Technology Dept. CCIS, KSU smeshoul@ksu.edu.sa Dr. NikhatSiddiqi Biochemistry Dept. CS, KSU nikkat@ksu.edu.sa & MOM 2010 , May 31 st
  • 2.
  • 3. Central Dogma of Molecular Biology
  • 4. Biological Data representation
  • 5. How Computers can be useful in Biology
  • 6. Intelligent bioinformatics
  • 8.
  • 10. Introduction What is Bioinformatics? Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline Bioinformatics is the electronic infrastructure of Biology Science that encompasses the methods that are used to collect, store, retrieve, analyze, and correlate the mountain of complex biological information. Source: http://ccb.wustl.edu/
  • 11. Introduction Bioinformatics Vs Computational biology Source: http://ccb.wustl.edu/
  • 12. Introduction The problem: basic understanding of how gene sequences code specific proteins l Lack of the information necessary to completely understand role of DNA in specific diseases or the functions of the thousands of proteins that are produced. The goals: provide scientists with a means to explain: Biological processes. Malfunctions in these processes which lead to diseases. Drug discovery and their mode of action.
  • 13. Central Dogma of Molecular Biology
  • 14. Central Dogma of Molecular Biology
  • 15. Central Dogma of Molecular Biology DNA is responsible for all the hereditary information in an organism.
  • 16. Central Dogma of Molecular Biology
  • 17. Deoxyribonucleic Acid (DNA) DNA is found inside a special area of the cell called the nucleus. Because the cell is very small, and because organisms have many DNA molecules per cell, each DNA molecule must be tightly packaged. This packaged form of the DNA is called a chromosome.
  • 18. Deoxyribonucleic Acid (DNA)Composition DNA is made of chemical building blocks called nucleotides. What is DNA made of? The four types of nitrogen bases found in nucleotides are: adenine (A), , thymine (T), guanine (G) and cytosine (C). The order, or sequence, of these bases determines what biological instructions are contained in a strand of DNA. For example, the sequence ATCGTT might instruct for blue eyes, while ATCGCT might instruct for brown.
  • 19. Deoxyribonucleic Acid (DNA)Genes Genes These unique coding sections of DNA that ultimately are transcribed into unique mRNA which are translated into unique proteins are called genes.
  • 20. Deoxyribonucleic Acid (DNA)Function What does DNA do? DNA contains the instructions needed for an organism to develop, survive and reproduce. To carry out these functions, DNA sequences must be converted into messages that can be used to produce proteins, which are the complex molecules that do most of the work in our bodies. To form a strand of DNA, nucleotides are linked into chains, with the phosphate and sugar groups alternating.
  • 21. Deoxyribonucleic Acid (DNA)Replication Replication Chromosomes are located in the nucleus of a cell.  DNA must be duplicated in a process called replication before a cell divides.  The replication of DNA allows each daughter cell to contain a full complement of chromosomes.  
  • 22. Deoxyribonucleic Acid (DNA)Transcription Transcription: The actual information in the DNA of chromosomes is decoded in a process called transcription   through the formation of another nucleic acid, ribonucleic acid or RNA.
  • 23. Deoxyribonucleic Acid (DNA)Translation Translation: The information from the DNA, now in the form of a linear RNA sequence, is decoded in a process called translation, to form a protein, another biological polymer.
  • 26. Biological Data Representation Strings: to represent DNA, RNA and sequences of amino-acids DNA: {A,C,G,T}, RNA: {A,C,G,U}, Protein: {A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y} e.g5’GTAAAGTCCCGTTAGC 3’ Image source; www.ebi.ac.uk/microarray/ biology_intro.htm
  • 27. Biological Data Representation Trees: to represent the evolution of various organisms
  • 28. Biological Data Representation Sets of 3D points: to represent the protein structure
  • 29. Biological Data Representation Graphs: to represent metabolic and signaling pathways.
  • 30. How can computers be useful for biology?
  • 31. How can computers be useful to biology? First, Computing technology for storing DNA sequences and constructing these latter from fragments. Data storage and access requirements – Study of the organization and evolution of genomes through comparative genome analysis. Visualisation tools and techniques requirements, sequence analysis requirements .
  • 32. How can computers be useful to biology? Structuring and organizing large databases using a common ontology . Access data from different databases using the same query language. (Gene Ontology Consortium ) Many areas of biology use images to communicate their results. Tools and techniques for searching, describing, manipulating and analyzing features within these images.
  • 33. How can computers be useful to biology? Databases maintenance: Need to check consistency of databases for a valid and error free content. Storing protein sequences, their structure as well as their function. Tools and techniques for manipulating protein sequences, protein secondary and tertiary structure prediction…
  • 34. How can computers be useful for biology? Several databases: genome databases, protein sequence databases, metabolic databases, Microarray databases EMBL-EBI (Europe), GenBank-NCBI (USA), DDBJ (Japan) Basic Local Alignment Search Tool (BLAST) program for quick DB searches.
  • 35. How can computers be useful for biology? Several algorithms: Sequence Comparison Algorithms: Needleman-Wunch (global alignment 1970),Smith-Waterman algorithm: Local sequence alignment (1981). BLAST, FASTA, CLUSTALW, MEME...etc http://en.wikipedia.org/wiki/Category:Bioinformatics_algorithms
  • 36.
  • 37. Data interpretation and generation of hypotheses requires intelligence.
  • 38. AI offers established methods for knowledge representation and “intelligent” data interpretation.
  • 39.
  • 40. Intelligent Bioinformatics AI and computational intelligence techniques and models: Basic search techniques A*, Branch and Bound, Genetic Algorithms Simulated Annealing Particle Swarm Optimization Neural networks, Support Vector Machines, K nearest neighbors, Hidden Markov Models,….
  • 41.
  • 43.
  • 44.
  • 45. Designed for the study of functional genomics
  • 46. Tested on yeast metabolic pathways
  • 47. Uses knowledge representation schemes
  • 48. Utilizes a Prolog database to store background biological information.
  • 49. Prolog can inspect biological information, infer knowledge, and make predictions
  • 50.
  • 51. Intelligent BioinformaticsAn other example: the robot scientist Performance similar to humans Performance significantly better than “naïve” or “random” selection of experiments Ross D. King, et al., Nature, January 2004
  • 52. Challenges Data Fusion or integration: Integration of a wide variety of data sources such as clinical and genomic data will allow us to use disease symptoms to predict genetic mutations and vice versa. The integration of GIS data, such as maps, weather systems, with crop health and genotype data, will allow us to predict successful outcomes of agriculture experiments.
  • 53. Challenges Large-scale comparative genomics. development of tools that can do comparisons of genomes will push forward the discovery rate in this field of bioinformatics. Modeling and visualization of full networks of complex systems predict how the system (or cell) reacts to a drug for example.
  • 54. Challenges Compare complex biological observations, such as gene expression patterns and protein networks. Converting biological observations to a model that a computer will understand. More than that we are more than the sum of the parts….COMPLEXITY
  • 55.
  • 56. Exciting research potential: Leads to important discoveries
  • 58. SmartMoney ranks Bioinformatics a #1 among next Hot Jobs
  • 59. Saves time and moneyhttp://smartmoney.com/consumer/index.cfm?story=working-june02
  • 60. OUR CHALLENGE: START WORKING ON BIOINFORMATICS J. Cohen “Computer scientists should be encouraged to learn biology as biologists computer science to prepare themselves for an intellectually stimulating and financially rewarding future.” D. Knuth “…the number of radically new results in pure computer science is likely to decrease, while scientists continue working on biological challenges for the next 500 years….” L. Adleman “..biological life can be equated with computation…”
  • 61. Conclusion Image Source: http://www.amazon.com/
  • 62. Conclusion Image Source: http://www.amazon.com/
  • 63. Ressources ISCB: http://www.iscb.org/ NBCI: http://ncbi.nlm.nih.gov/ http://www.bioinformatics.org/ Journals: IEEE/ACM Conferences (ISMB, RECOMB, PSB…) http://kbrin.a-bldg.louisville.edu/CECS694/
  • 64. Conclusion Bioinformatics is all about how computer science can enhance biology and how biology can stimulate computer science.