SlideShare una empresa de Scribd logo
1 de 31
Amino acids
Presenting By Abdul Qahar (A Q)
Buner Campus
Edited, Prepared and shared By
Abdul Qahar
Structural database and their
classification.
Basic concept about Database
1. What is a database?
A database is a collection of data which can be used:
• alone, or
• combined / related to other data
to provide answers to the user’s question.
Data types
primary data
secondary data
tertiary data
sequence
DNA
amino acid
DMPVERILEALAVE…
primary database
secondary protein
structure“motifs”: regular
expressions, blocks, profiles,
fingerprints
e. g., alpha-helices, beta-
strands
secondary db
domains, folding units
tertiary protein structure tertiary db
atomic co-ordinates
interaction data
binary protein-protein
interactions/ networks
pathways and
functional networks
interaction db
Primary biological databases
Nucleic acid databases
EMBL
GenBank
DDBJ (DNA Data Bank of
Japan)
Protein databases
PIR
MIPS
SWISS-PROT
TrEMBL
NRL-3D
Nucleotide Databases
•EMBL:Nucleotide sequence database
•Ensembl: Automatics annotation of eukaryotic genomes
•Genome Server: Overview of completed genomes at EBI
•Genome-MOT: Genome monitoring table
•EMBL-Align: Multiple sequence alignment database
Sequence data = strings of
letters
Nucleotides (bases)
Adenine (A)
Cytosine (C)
Guanine (G)
Thymine (T)
triplet codons
genetic code
20 amino acids
(A, L, V, S etc.)
Three-dimensional protein structure =
atomic coordinates in 3D space
Protein folding
EMBL/GenBank/DDJB
• These 3 db contain mainly the same information (few differences
in the format and syntax)
• Serve as archives containing all sequences (single genes, ESTs,
complete genomes, etc.) derived from:
– Genome projects and sequencing centers
– Individual scientists
– Patent offices (i.e. USPTO, EPO)
• Non-confidential data are exchanged daily.
Databases related to Genomics
• Contain information on genes, gene location (mapping),
gene nomenclature and links to sequence databases;
• Exist for most organisms important for life science research;
• Examples: MIM, GDB (human), MGD (mouse), FlyBase
(Drosophila), SGD (yeast), MaizeDB (maize), SubtiList
(B.subtilis), etc.
Swiss-Prot
• Annotated protein sequence database established in 1986 and
maintained collaboratively since 1987, by the Department of
Medical Biochemistry of the University of Geneva and EBI
• Complete, Curated, Non-redundant and cross-referenced with 34
other databases
• Highly cross-referenced
• Available from a variety of servers and through sequence analysis
software tools
• More than 8,000 different species
• First 20 species represent about 42% of all sequences in the
database
• More than 1,29,000 entries with 4.7 X 1010 amino acids
PDB: Protein Data Bank
• Holds 3D models of biological macromolecules (protein, RNA,
DNA).
• All data are available to the public.
• Obtained by X-Ray crystallography (84%) or NMR
spectroscopy (16%).
• Submitted by biologists and biochemists from around the
world.
EMBL Nucleotide Sequence
Database
• An annotated collection of all publicly available nucleotide
and protein sequences
• Created in 1980 at the European Molecular Biology
Laboratory in Heidelberg.
• Maintained since 1994 by EBI- Cambridge.
DDBJ–DNA Data Bank of
Japan
• An annotated collection of all publicly available
nucleotide and protein sequences
• Started, 1984 at the National Institute of Genetics (NIG)
in Mishima.
• Still maintained in this institute a team led by Takashi
Gojobori.
Why Proteins Structure ?
Proteins are fundamental components of all living
cells, performing a variety of biological tasks.
Each protein has a particular 3D structure that determines its
function.
Protein structure is more conserved than protein sequence, and
more closely related to function.
Supersecondary structures
Assembly of secondary structures which are
shared by many structures.
Beta hairpin
Beta-alpha-beta unit
Helix hairpin
Structural Databases
SCOP: Structural Classification of Proteins
Current Release: 686 folds; 1073 Superfamilies; 1827 Familes
representing 15,979 PDB entries
CATH: Classification, Architecture, Topology, Homology
Levels in SCOP
1. Class
2. Folds
3. Super families
4. Families
Major classes in scop
• Classes
– All alpha proteins
– Alpha and beta proteins (a/b)
– Alpha and beta proteins (a+b)
– Multi-domain proteins
– Membrane and cell surface proteins
– Small proteins
Folds*
• Each Class may be divided into one or more folds
• Proteins which have the same secondary structure elements
arranged the in the same order in the protein chain and in three
dimensions are classified as having the same fold
Superfamilies
• Superfamilies are a subdivisions of folds
• A superfamily contains proteins which are thought to be
evolutionarily related due to
– Sequence
– Function
– Special structural features
• Relationships between members of a superfamily may not be
readily recognizable from the sequence alone
Families
• Subdivision of super families
• Contains members whose relationship is readily recognizable
from the sequence
• Families are further subdivided in to Proteins
• Proteins are divided into Species
– The same protein may be found in several species
All alpha: Hemoglobin
All beta: Immunoglobulin
(8fab)
OL
OL
Alpha/beta: Triosephosphate
isomerase
CATH
• Levels
• Class
• Architecture
– This level is unique to CATH
• Topology
– ~Fold(/super family) in SCOP
• Homologous Super family
– ~Super family(/family) in SCOP
Architecture
• Same overall arrangement of secondary structures
– Example: The architecture :Two layer beta sheet proteins
contains different folds each with a distinct number and
connectivity of strands
Abdul Qahar Buneri abdulqahar045@gmail.com
www.slideshare.net/abdulqahar045

Más contenido relacionado

La actualidad más candente (20)

Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Prosite
PrositeProsite
Prosite
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Protein database
Protein databaseProtein database
Protein database
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
 
Swiss PROT
Swiss PROT Swiss PROT
Swiss PROT
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
Ddbj
DdbjDdbj
Ddbj
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
UniProt
UniProtUniProt
UniProt
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Introduction to Biological databases
Introduction to Biological databasesIntroduction to Biological databases
Introduction to Biological databases
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
 
Biological database
Biological databaseBiological database
Biological database
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Biological databases
Biological databasesBiological databases
Biological databases
 

Destacado

Destacado (13)

Protein structure classification
Protein structure classificationProtein structure classification
Protein structure classification
 
Secondary metabolites
Secondary metabolitesSecondary metabolites
Secondary metabolites
 
Bioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmmBioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmm
 
Biological Databases
Biological DatabasesBiological Databases
Biological Databases
 
Lecture 2 animal cell biotechnology
Lecture 2  animal cell biotechnologyLecture 2  animal cell biotechnology
Lecture 2 animal cell biotechnology
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Systems biology and biotechnology of Streptomyces species for the production ...
Systems biology and biotechnology of Streptomyces species for the production ...Systems biology and biotechnology of Streptomyces species for the production ...
Systems biology and biotechnology of Streptomyces species for the production ...
 
Protein 3D structure and classification database
Protein 3D structure and classification database Protein 3D structure and classification database
Protein 3D structure and classification database
 
Protein Structure Prediction
Protein Structure PredictionProtein Structure Prediction
Protein Structure Prediction
 
Protein structure
Protein structureProtein structure
Protein structure
 
Insulin
InsulinInsulin
Insulin
 
Insulin presentation
Insulin presentationInsulin presentation
Insulin presentation
 
Biological databases
Biological databasesBiological databases
Biological databases
 

Similar a Structural database and their classification by abdul qahar

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEPrashantSharma807
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.pptSanthiyaAK
 
Data Base in Bioinformatics.ppt
Data Base in Bioinformatics.pptData Base in Bioinformatics.ppt
Data Base in Bioinformatics.pptBangaluru
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxVandana Yadav03
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptxscience lover
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Sijo A
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid databaseEsakkiammal S
 

Similar a Structural database and their classification by abdul qahar (20)

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 
Biological databases
Biological databases Biological databases
Biological databases
 
Data Base in Bioinformatics.ppt
Data Base in Bioinformatics.pptData Base in Bioinformatics.ppt
Data Base in Bioinformatics.ppt
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Biological data base
Biological data baseBiological data base
Biological data base
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptx
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Protein database
Protein  databaseProtein  database
Protein database
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
 
Proteomic databases
Proteomic databasesProteomic databases
Proteomic databases
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
 

Más de Abdul Qahar {{Abdul Wali Khan University Mardan}} (Buner Campus)

Más de Abdul Qahar {{Abdul Wali Khan University Mardan}} (Buner Campus) (20)

My presentation work (Sample)
My presentation work (Sample)My presentation work (Sample)
My presentation work (Sample)
 
Teaching in tomorrow’s classrooms
Teaching in tomorrow’s classroomsTeaching in tomorrow’s classrooms
Teaching in tomorrow’s classrooms
 
Classroom accountability and self monitoring systems
Classroom accountability and self monitoring systemsClassroom accountability and self monitoring systems
Classroom accountability and self monitoring systems
 
Discuss the role of provincial education assessment system in enhancing qua...
Discuss the role of provincial education assessment   system in enhancing qua...Discuss the role of provincial education assessment   system in enhancing qua...
Discuss the role of provincial education assessment system in enhancing qua...
 
Development of test items in affective domain in the subject of education psy...
Development of test items in affective domain in the subject of education psy...Development of test items in affective domain in the subject of education psy...
Development of test items in affective domain in the subject of education psy...
 
Use of alternative assessment techniques in enhancing the meaningful learning
Use of alternative assessment techniques in enhancing the meaningful learningUse of alternative assessment techniques in enhancing the meaningful learning
Use of alternative assessment techniques in enhancing the meaningful learning
 
Journals registered with thomson reuters Shared By Abdul Qahar Buneri Abdul W...
Journals registered with thomson reuters Shared By Abdul Qahar Buneri Abdul W...Journals registered with thomson reuters Shared By Abdul Qahar Buneri Abdul W...
Journals registered with thomson reuters Shared By Abdul Qahar Buneri Abdul W...
 
Mammalogy practical Edited and Shared By Abdul Qahar Buneri Abdul Wali Khan U...
Mammalogy practical Edited and Shared By Abdul Qahar Buneri Abdul Wali Khan U...Mammalogy practical Edited and Shared By Abdul Qahar Buneri Abdul Wali Khan U...
Mammalogy practical Edited and Shared By Abdul Qahar Buneri Abdul Wali Khan U...
 
Mammalogy Practical (Msc Zoology) Shared by Abdul Qahar Buneri Abdul Wali kh...
 Mammalogy Practical (Msc Zoology) Shared by Abdul Qahar Buneri Abdul Wali kh... Mammalogy Practical (Msc Zoology) Shared by Abdul Qahar Buneri Abdul Wali kh...
Mammalogy Practical (Msc Zoology) Shared by Abdul Qahar Buneri Abdul Wali kh...
 
Order macroscelidea Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order macroscelidea  Shared By Abdul Qahar Buneri AWKUM BUner CampusOrder macroscelidea  Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order macroscelidea Shared By Abdul Qahar Buneri AWKUM BUner Campus
 
Order sirenia Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order sirenia  Shared By Abdul Qahar Buneri AWKUM BUner CampusOrder sirenia  Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order sirenia Shared By Abdul Qahar Buneri AWKUM BUner Campus
 
Order scadentia... Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order scadentia...  Shared By Abdul Qahar Buneri AWKUM BUner CampusOrder scadentia...  Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order scadentia... Shared By Abdul Qahar Buneri AWKUM BUner Campus
 
Order rodentai Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order rodentai  Shared By Abdul Qahar Buneri AWKUM BUner CampusOrder rodentai  Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order rodentai Shared By Abdul Qahar Buneri AWKUM BUner Campus
 
Order proboscidea Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order proboscidea  Shared By Abdul Qahar Buneri AWKUM BUner CampusOrder proboscidea  Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order proboscidea Shared By Abdul Qahar Buneri AWKUM BUner Campus
 
Order pilosa Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order pilosa  Shared By Abdul Qahar Buneri AWKUM BUner CampusOrder pilosa  Shared By Abdul Qahar Buneri AWKUM BUner Campus
Order pilosa Shared By Abdul Qahar Buneri AWKUM BUner Campus
 
Order pholidota
Order pholidotaOrder pholidota
Order pholidota
 
Order perissodactyla
Order perissodactylaOrder perissodactyla
Order perissodactyla
 
Order cingulata
Order cingulataOrder cingulata
Order cingulata
 
Order chiroptera
Order chiropteraOrder chiroptera
Order chiroptera
 
0rder catariodactyla
0rder catariodactyla0rder catariodactyla
0rder catariodactyla
 

Último

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Último (20)

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Structural database and their classification by abdul qahar

  • 2. Presenting By Abdul Qahar (A Q) Buner Campus Edited, Prepared and shared By Abdul Qahar
  • 3. Structural database and their classification.
  • 4. Basic concept about Database 1. What is a database? A database is a collection of data which can be used: • alone, or • combined / related to other data to provide answers to the user’s question.
  • 5. Data types primary data secondary data tertiary data sequence DNA amino acid DMPVERILEALAVE… primary database secondary protein structure“motifs”: regular expressions, blocks, profiles, fingerprints e. g., alpha-helices, beta- strands secondary db domains, folding units tertiary protein structure tertiary db atomic co-ordinates interaction data binary protein-protein interactions/ networks pathways and functional networks interaction db
  • 6. Primary biological databases Nucleic acid databases EMBL GenBank DDBJ (DNA Data Bank of Japan) Protein databases PIR MIPS SWISS-PROT TrEMBL NRL-3D
  • 7. Nucleotide Databases •EMBL:Nucleotide sequence database •Ensembl: Automatics annotation of eukaryotic genomes •Genome Server: Overview of completed genomes at EBI •Genome-MOT: Genome monitoring table •EMBL-Align: Multiple sequence alignment database
  • 8. Sequence data = strings of letters Nucleotides (bases) Adenine (A) Cytosine (C) Guanine (G) Thymine (T) triplet codons genetic code 20 amino acids (A, L, V, S etc.)
  • 9. Three-dimensional protein structure = atomic coordinates in 3D space
  • 11. EMBL/GenBank/DDJB • These 3 db contain mainly the same information (few differences in the format and syntax) • Serve as archives containing all sequences (single genes, ESTs, complete genomes, etc.) derived from: – Genome projects and sequencing centers – Individual scientists – Patent offices (i.e. USPTO, EPO) • Non-confidential data are exchanged daily.
  • 12. Databases related to Genomics • Contain information on genes, gene location (mapping), gene nomenclature and links to sequence databases; • Exist for most organisms important for life science research; • Examples: MIM, GDB (human), MGD (mouse), FlyBase (Drosophila), SGD (yeast), MaizeDB (maize), SubtiList (B.subtilis), etc.
  • 13. Swiss-Prot • Annotated protein sequence database established in 1986 and maintained collaboratively since 1987, by the Department of Medical Biochemistry of the University of Geneva and EBI • Complete, Curated, Non-redundant and cross-referenced with 34 other databases • Highly cross-referenced • Available from a variety of servers and through sequence analysis software tools • More than 8,000 different species • First 20 species represent about 42% of all sequences in the database • More than 1,29,000 entries with 4.7 X 1010 amino acids
  • 14. PDB: Protein Data Bank • Holds 3D models of biological macromolecules (protein, RNA, DNA). • All data are available to the public. • Obtained by X-Ray crystallography (84%) or NMR spectroscopy (16%). • Submitted by biologists and biochemists from around the world.
  • 15. EMBL Nucleotide Sequence Database • An annotated collection of all publicly available nucleotide and protein sequences • Created in 1980 at the European Molecular Biology Laboratory in Heidelberg. • Maintained since 1994 by EBI- Cambridge.
  • 16. DDBJ–DNA Data Bank of Japan • An annotated collection of all publicly available nucleotide and protein sequences • Started, 1984 at the National Institute of Genetics (NIG) in Mishima. • Still maintained in this institute a team led by Takashi Gojobori.
  • 17. Why Proteins Structure ? Proteins are fundamental components of all living cells, performing a variety of biological tasks. Each protein has a particular 3D structure that determines its function. Protein structure is more conserved than protein sequence, and more closely related to function.
  • 18. Supersecondary structures Assembly of secondary structures which are shared by many structures. Beta hairpin Beta-alpha-beta unit Helix hairpin
  • 19. Structural Databases SCOP: Structural Classification of Proteins Current Release: 686 folds; 1073 Superfamilies; 1827 Familes representing 15,979 PDB entries CATH: Classification, Architecture, Topology, Homology
  • 20. Levels in SCOP 1. Class 2. Folds 3. Super families 4. Families
  • 21. Major classes in scop • Classes – All alpha proteins – Alpha and beta proteins (a/b) – Alpha and beta proteins (a+b) – Multi-domain proteins – Membrane and cell surface proteins – Small proteins
  • 22. Folds* • Each Class may be divided into one or more folds • Proteins which have the same secondary structure elements arranged the in the same order in the protein chain and in three dimensions are classified as having the same fold
  • 23. Superfamilies • Superfamilies are a subdivisions of folds • A superfamily contains proteins which are thought to be evolutionarily related due to – Sequence – Function – Special structural features • Relationships between members of a superfamily may not be readily recognizable from the sequence alone
  • 24. Families • Subdivision of super families • Contains members whose relationship is readily recognizable from the sequence • Families are further subdivided in to Proteins • Proteins are divided into Species – The same protein may be found in several species
  • 28. CATH • Levels • Class • Architecture – This level is unique to CATH • Topology – ~Fold(/super family) in SCOP • Homologous Super family – ~Super family(/family) in SCOP
  • 29. Architecture • Same overall arrangement of secondary structures – Example: The architecture :Two layer beta sheet proteins contains different folds each with a distinct number and connectivity of strands
  • 30.
  • 31. Abdul Qahar Buneri abdulqahar045@gmail.com www.slideshare.net/abdulqahar045