SlideShare a Scribd company logo
1 of 13
Download to read offline
National Center for
Biotechnology Information
NCBI
Amandeep Singh
Assistant Professor
Department of Biotechnology
GSSDGS Khalsa College Patiala
Introduction
National Library of Medicine
NCBI
National Institute of Health, Bethesda, Maryland, USA
NCBI is established in 1988 as a part of National Library of Medicine at the
National Institutes of Health, Bethesda, Maryland, USA.
• The NCBI houses a series of databases relevant to
biotechnology and biomedicine and consists of important
resources for bioinformatics tools and services.
• Major databases include GenBank for DNA sequences and
PubMed, a bibliographic database for biomedical literature.
• Major Resources includes Books, software tools (BLAST),
retrieval system (Entrez), submission tools (Bankit, Sequin),
specialized tools (ORF Finder, e-PCR, Spidey/Splign).
Aim of NCBI
1. To Develop
Public Databases
2. To Develop Software Tools 3. To Spread
Biomedical
Information
Roles of NCBI
1. Maintenance of Biological Databases. e.g. GenBank
Primary Secondary/Derivative
1. Original submission by experimentalists. 1. Built up from primary database
2. Content controlled by submitter. 2. Content controlled by third body (NCBI)
2. Provide data retrieval system (if we want to retrieve the data form
biological databases, we need retrieval system). e.g. Entrez
3. Provide computational resources for the analysis of GenBank data.
(only after analysis we can use the information available in databases,
so we need computational resources)
NCBI Database & Resources
DATABASE RETRIEVAL SYSTEM -
Entrez
Entrez: Integrated database search and retrieval system.
• Extract information from DNA sequence data, protein
sequence data, population set, whole genome,
macromolecular structures, biomedical literature like
PubMed.
• Sequence sources: PIR, SWISS-PROT, PDB,
GenBank, RefSeq.
• Covers over 20 databases.
Protein Information
Protein Sequence
Protein Structure
PIR
SWISS-PROT
PDB
GenBank Genetic Sequence
RETRIEVAL SYSTEM
Name of
DATABASES
Information
Provided
Information
Accessible to the
User
Entrez
Sequence Submission Tools
If we want to submit a new genomic sequence to NCBI, then we can use
tools like Bankit and Sequin.
1. Bankit
• Web-based sequence submission tool
• Present on NCBI Homepage
• Tool for Simple submission: when only one or small number of
records are to be submitted.
• Can also be used by submitters to update their existing GenBank
records.
• Sequence analysis tools are not required for submission through this
process.
2. Sequin
• NCBI tool for sequence submission and update.
• Can handle multiple sequence submissions; that includes long
sequences, multiple annotations, segment sets of DNA,
population studies
• Provide graphical viewing and editing options.
Specialized tools
1. ORF Finder
Graphic analysis tool to detect open reading frame (ORF) of a selectable minimum size.
An open reading frame is a portion of a DNA molecule that, when translated into amino
acids, contains no stop codons.
ORF Finder
Detection of ORF
In User’s sequence In a sequence already in
the database
2. e-PCR
Computation procedure that is used to identify sequence-tag-
sites (STSs) with in DNA sequence.
STS is a short (200 to 500 base pair) DNA sequence that has a
single occurrence in the genome and whose location and base
sequence are known.
Search for STS subsequences to match with PCR primer which
have correct order, orientation, spacing to generate known
STSs.
3. Spidey/Splign
• mRNA to Genomic alignment program, which use the local
alignment tools- BLAST & Dot View, to find it’s alignment.
• For identification of exons and introns and link between
them.
• For tracking splice junction.
• For identification of frameshift sequences.
• For identification of changes in sequences.

More Related Content

What's hot (20)

Fasta
FastaFasta
Fasta
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Biological database
Biological databaseBiological database
Biological database
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformatics
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Clustal
ClustalClustal
Clustal
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 

Similar to NCBI National Center for Biotechnology Information

databases.pptx
databases.pptxdatabases.pptx
databases.pptxifra27
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanksNithyaNandapal
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxVandana Yadav03
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databasesfarwa fayaz
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.pptSanthiyaAK
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acidsvibhakumari12
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
Primary Databases.pptx
Primary Databases.pptxPrimary Databases.pptx
Primary Databases.pptxSwarup Malakar
 

Similar to NCBI National Center for Biotechnology Information (20)

NCBI
NCBINCBI
NCBI
 
databases.pptx
databases.pptxdatabases.pptx
databases.pptx
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Ncbi
NcbiNcbi
Ncbi
 
Data base in detail
Data base in detailData base in detail
Data base in detail
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanks
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptx
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databases
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Intro to databases
Intro to databasesIntro to databases
Intro to databases
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acids
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Primary Databases.pptx
Primary Databases.pptxPrimary Databases.pptx
Primary Databases.pptx
 

More from Thapar Institute of Engineering & Technology, Patiala, Punjab, India

More from Thapar Institute of Engineering & Technology, Patiala, Punjab, India (20)

SDS PAGE
SDS PAGESDS PAGE
SDS PAGE
 
Agarose gel electrophoresis
Agarose gel electrophoresisAgarose gel electrophoresis
Agarose gel electrophoresis
 
Prokaryotic and eukaryotic cell
Prokaryotic and eukaryotic cellProkaryotic and eukaryotic cell
Prokaryotic and eukaryotic cell
 
Preparation and staining of specimens for microscopy
Preparation and staining of specimens for microscopyPreparation and staining of specimens for microscopy
Preparation and staining of specimens for microscopy
 
Microbial polysaccharides
Microbial polysaccharidesMicrobial polysaccharides
Microbial polysaccharides
 
Organic acids production copy
Organic acids production   copyOrganic acids production   copy
Organic acids production copy
 
Methods of strain improvement
Methods of strain improvementMethods of strain improvement
Methods of strain improvement
 
Refrigeration
RefrigerationRefrigeration
Refrigeration
 
Patents
PatentsPatents
Patents
 
Vaccines
VaccinesVaccines
Vaccines
 
Chemical reactions and rancidity of fats
Chemical reactions and rancidity of fatsChemical reactions and rancidity of fats
Chemical reactions and rancidity of fats
 
Characteristics of biological databases
Characteristics of biological databasesCharacteristics of biological databases
Characteristics of biological databases
 
FASTA
FASTAFASTA
FASTA
 
Organoleptic properties of proteins
Organoleptic properties of proteinsOrganoleptic properties of proteins
Organoleptic properties of proteins
 
Denaturation of proteins
Denaturation of proteinsDenaturation of proteins
Denaturation of proteins
 
OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man
 
Antigen & antigenicity
Antigen & antigenicityAntigen & antigenicity
Antigen & antigenicity
 
Protein Data Bank (PDB)
Protein Data Bank (PDB)Protein Data Bank (PDB)
Protein Data Bank (PDB)
 
SWISS-PROT
SWISS-PROTSWISS-PROT
SWISS-PROT
 
Organs of the immune system
Organs of the immune systemOrgans of the immune system
Organs of the immune system
 

Recently uploaded

Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17Celine George
 

Recently uploaded (20)

Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17
 

NCBI National Center for Biotechnology Information

  • 1. National Center for Biotechnology Information NCBI Amandeep Singh Assistant Professor Department of Biotechnology GSSDGS Khalsa College Patiala
  • 2. Introduction National Library of Medicine NCBI National Institute of Health, Bethesda, Maryland, USA NCBI is established in 1988 as a part of National Library of Medicine at the National Institutes of Health, Bethesda, Maryland, USA.
  • 3. • The NCBI houses a series of databases relevant to biotechnology and biomedicine and consists of important resources for bioinformatics tools and services. • Major databases include GenBank for DNA sequences and PubMed, a bibliographic database for biomedical literature. • Major Resources includes Books, software tools (BLAST), retrieval system (Entrez), submission tools (Bankit, Sequin), specialized tools (ORF Finder, e-PCR, Spidey/Splign).
  • 4. Aim of NCBI 1. To Develop Public Databases 2. To Develop Software Tools 3. To Spread Biomedical Information
  • 5. Roles of NCBI 1. Maintenance of Biological Databases. e.g. GenBank Primary Secondary/Derivative 1. Original submission by experimentalists. 1. Built up from primary database 2. Content controlled by submitter. 2. Content controlled by third body (NCBI) 2. Provide data retrieval system (if we want to retrieve the data form biological databases, we need retrieval system). e.g. Entrez 3. Provide computational resources for the analysis of GenBank data. (only after analysis we can use the information available in databases, so we need computational resources)
  • 6. NCBI Database & Resources
  • 7. DATABASE RETRIEVAL SYSTEM - Entrez Entrez: Integrated database search and retrieval system. • Extract information from DNA sequence data, protein sequence data, population set, whole genome, macromolecular structures, biomedical literature like PubMed. • Sequence sources: PIR, SWISS-PROT, PDB, GenBank, RefSeq. • Covers over 20 databases.
  • 8. Protein Information Protein Sequence Protein Structure PIR SWISS-PROT PDB GenBank Genetic Sequence RETRIEVAL SYSTEM Name of DATABASES Information Provided Information Accessible to the User Entrez
  • 9. Sequence Submission Tools If we want to submit a new genomic sequence to NCBI, then we can use tools like Bankit and Sequin. 1. Bankit • Web-based sequence submission tool • Present on NCBI Homepage • Tool for Simple submission: when only one or small number of records are to be submitted. • Can also be used by submitters to update their existing GenBank records. • Sequence analysis tools are not required for submission through this process.
  • 10. 2. Sequin • NCBI tool for sequence submission and update. • Can handle multiple sequence submissions; that includes long sequences, multiple annotations, segment sets of DNA, population studies • Provide graphical viewing and editing options.
  • 11. Specialized tools 1. ORF Finder Graphic analysis tool to detect open reading frame (ORF) of a selectable minimum size. An open reading frame is a portion of a DNA molecule that, when translated into amino acids, contains no stop codons. ORF Finder Detection of ORF In User’s sequence In a sequence already in the database
  • 12. 2. e-PCR Computation procedure that is used to identify sequence-tag- sites (STSs) with in DNA sequence. STS is a short (200 to 500 base pair) DNA sequence that has a single occurrence in the genome and whose location and base sequence are known. Search for STS subsequences to match with PCR primer which have correct order, orientation, spacing to generate known STSs.
  • 13. 3. Spidey/Splign • mRNA to Genomic alignment program, which use the local alignment tools- BLAST & Dot View, to find it’s alignment. • For identification of exons and introns and link between them. • For tracking splice junction. • For identification of frameshift sequences. • For identification of changes in sequences.