Protein Database

bioinformatics

Uploaded by

Sneha

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views

Protein Database

bioinformatics

Uploaded by

Sneha

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Protein Databases- Types and Importance

 As biology has increasingly turned into a data-rich science, the need for
storing and communicating large datasets has grown tremendously.
 The obvious examples are the nucleotide sequences, the protein sequences,
and the 3D structural data produced by X-ray crystallography and
macromolecular NMR.
 The biological information of proteins is available as sequences and
structures. Sequences are represented in a single dimension whereas the
structure contains the three-dimensional data of sequences.
 A biological database is a collection of data that is organized so that its
contents can easily be accessed, managed, and updated.
 A protein database is one or more datasets about proteins, which could
include a protein’s amino acid sequence, conformation, structure, and
features such as active sites.
 Protein databases are compiled by the translation of DNA sequences from
different gene databases and include structural information. They are an
important resource because proteins mediate most biological functions.

Importance of Protein Databases

Huge amounts of data for protein structures, functions, and particularly sequences are
being generated. Searching databases are often the first step in the study of a new
protein. It has the following uses:
1. Comparison between proteins or between protein families provides
information about the relationship between proteins within a genome or
across different species and hence offers much more information that can be
obtained by studying only an isolated protein.
2. Secondary databases derived from experimental databases are also widely
available. These databases reorganize and annotate the data or provide
predictions.
3. The use of multiple databases often helps researchers understand the
structure and function of a protein.

Primary databases of Protein

The PRIMARY databases hold the experimentally determined protein sequences
inferred from the conceptual translation of the nucleotide sequences. This, of course, is
not experimentally derived information, but has arisen as a result of interpretation of the
nucleotide sequence information and consequently must be treated as potentially
containing misinterpreted information. There is a number of primary protein sequence
databases and each requires some specific consideration.
a. Protein Information Resource (PIR) – Protein Sequence Database (PIR-PSD):
 The PIR-PSD is a collaborative endeavor between the PIR, the MIPS
(Munich Information Centre for Protein Sequences, Germany) and the JIPID
(Japan International Protein Information Database, Japan).
 The PIR-PSD is now a comprehensive, non-redundant, expertly annotated,
object-relational DBMS.
 A unique characteristic of the PIR-PSD is its classification of protein
sequences based on the superfamily concept.
 The sequence in PIR-PSD is also classified based on homology domain and
sequence motifs.
 Homology domains may correspond to evolutionary building blocks, while
sequence motifs represent functional sites or conserved regions.
 The classification approach allows a more complete understanding of
sequence function-structure relationship.
b. SWISS-PROT
 The other well known and extensively used protein database is SWISS-
PROT. Like the PIR-PSD, this curated proteins sequence database also
provides a high level of annotation.
 The data in each entry can be considered separately as core data and
annotation.
 The core data consists of the sequences entered in common single letter
amino acid code, and the related references and bibliography. The taxonomy
of the organism from which the sequence was obtained also forms part of
this core information.
 The annotation contains information on the function or functions of the
protein, post-translational modification such as phosphorylation, acetylation,
etc., functional and structural domains and sites, such as calcium binding
regions, ATP-binding sites, zinc fingers, etc., known secondary structural
features as for examples alpha helix, beta sheet, etc., the quaternary
structure of the protein, similarities to other protein if any, and diseases that
may arise due to different authors publishing different sequences for the
same protein, or due to mutations in different strains of an described as part
of the annotation.
TrEMBL (for Translated EMBL) is a computer-annotated protein sequence database
that is released as a supplement to SWISS-PROT. It contains the translation of all
coding sequences present in the EMBL Nucleotide database, which have not been fully
annotated. Thus it may contain the sequence of proteins that are never expressed and
never actually identified in the organisms.
c. Protein Databank (PDB):
 PDB is a primary protein structure database. It is a crystallographic database
for the three-dimensional structure of large biological molecules, such as
proteins.
 In spite of the name, PDB archive the three-dimensional structures of not
only proteins but also all biologically important molecules, such as nucleic
acid fragments, RNA molecules, large peptides such as antibiotic gramicidin
and complexes of protein and nucleic acids.
 The database holds data derived from mainly three sources: Structure
determined by X-ray crystallography, NMR experiments, and molecular
modeling.

Protein Database Overview
No ratings yet
Protein Database Overview
13 pages
Bioinformatics Day2
No ratings yet
Bioinformatics Day2
3 pages
Protein Databases
No ratings yet
Protein Databases
12 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
Protein Sequence Database Ankita Sharma
No ratings yet
Protein Sequence Database Ankita Sharma
31 pages
Protein Databases
No ratings yet
Protein Databases
8 pages
Mulder 2007
No ratings yet
Mulder 2007
13 pages
note 2
No ratings yet
note 2
54 pages
Presentation 11
No ratings yet
Presentation 11
20 pages
Databases
No ratings yet
Databases
3 pages
UNIT II
No ratings yet
UNIT II
23 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
85 pages
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
No ratings yet
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
5 pages
Biological Databases
No ratings yet
Biological Databases
13 pages
Unit I
No ratings yet
Unit I
28 pages
Bioinformatics. CH 3 Databases (Summarized Notes)
50% (2)
Bioinformatics. CH 3 Databases (Summarized Notes)
5 pages
Bioinformatics (STH Sir)
No ratings yet
Bioinformatics (STH Sir)
13 pages
Resumen Unidad 1 y 2 Bioinformatica
No ratings yet
Resumen Unidad 1 y 2 Bioinformatica
14 pages
CR Micro
No ratings yet
CR Micro
2 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
22 pages
Protein Databases
No ratings yet
Protein Databases
23 pages
Databases 2 Kd
No ratings yet
Databases 2 Kd
4 pages
DATAbases1KD
No ratings yet
DATAbases1KD
5 pages
161_vansh_sharma
No ratings yet
161_vansh_sharma
4 pages
Database
No ratings yet
Database
16 pages
BIOINFORMATICS PRACTICAL FILE
No ratings yet
BIOINFORMATICS PRACTICAL FILE
12 pages
Bioinformatics Definition
No ratings yet
Bioinformatics Definition
11 pages
Rese Rach
No ratings yet
Rese Rach
37 pages
Adv Bi Unit 1
No ratings yet
Adv Bi Unit 1
39 pages
8024 Bio Info
No ratings yet
8024 Bio Info
28 pages
Bioinformatics Overview
100% (1)
Bioinformatics Overview
18 pages
Sequence and Structure Retrieval
No ratings yet
Sequence and Structure Retrieval
9 pages
Unit II Bioinformatics
No ratings yet
Unit II Bioinformatics
25 pages
Bioinfo U2 KD 2
No ratings yet
Bioinfo U2 KD 2
3 pages
Sequence Retrieval System
No ratings yet
Sequence Retrieval System
2 pages
Essential Info Notes-1
No ratings yet
Essential Info Notes-1
57 pages
Pra 1 Swiss Prot
No ratings yet
Pra 1 Swiss Prot
2 pages
Sec1 Introduction to Bioinformatics
No ratings yet
Sec1 Introduction to Bioinformatics
20 pages
BCH 516-1
No ratings yet
BCH 516-1
32 pages
Computational Immunology
No ratings yet
Computational Immunology
8 pages
Biological Databases PDF
No ratings yet
Biological Databases PDF
13 pages
Bio in For Matics
No ratings yet
Bio in For Matics
4 pages
Abasyn University Peshawar: Name: Ihsan Ullah Depart: BS Medical Lab Technology
No ratings yet
Abasyn University Peshawar: Name: Ihsan Ullah Depart: BS Medical Lab Technology
8 pages
WINSEM2021-22 BIY1012 ETH VL2021220501045 Reference Material I 11-01-2022 Ntroduction To Databases
No ratings yet
WINSEM2021-22 BIY1012 ETH VL2021220501045 Reference Material I 11-01-2022 Ntroduction To Databases
42 pages
Day 1
No ratings yet
Day 1
38 pages
Ncbi
No ratings yet
Ncbi
25 pages
Lecture_3
No ratings yet
Lecture_3
55 pages
Unit-5 Bioinformatics
No ratings yet
Unit-5 Bioinformatics
13 pages
Zhang2011 Article AnOverviewOfHumanProteinDataba
No ratings yet
Zhang2011 Article AnOverviewOfHumanProteinDataba
11 pages
Protein Structure Databases
No ratings yet
Protein Structure Databases
16 pages
BCH 505 Bioinformatics 3(2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3(2 2) Databases
17 pages
Bif401 Manual 2023
No ratings yet
Bif401 Manual 2023
27 pages
Biological Databases
No ratings yet
Biological Databases
3 pages
11-Protein Information Resource (PIR)-02-09-2024 (1)
No ratings yet
11-Protein Information Resource (PIR)-02-09-2024 (1)
11 pages
Bio in For Matics
No ratings yet
Bio in For Matics
26 pages
The Universal Protein Resource (Uniprot) : An Expanding Universe of Protein Information
No ratings yet
The Universal Protein Resource (Uniprot) : An Expanding Universe of Protein Information
6 pages
Database
No ratings yet
Database
40 pages
bau041
No ratings yet
bau041
10 pages
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
Bioinformatics Unveiled
From Everand
Bioinformatics Unveiled
Joan Melody
No ratings yet
COMP90016 2023 06 Data Sources
No ratings yet
COMP90016 2023 06 Data Sources
64 pages
Bioinformatics: Lecture 5: Calculating Identities, Similarity and Gab Scores
No ratings yet
Bioinformatics: Lecture 5: Calculating Identities, Similarity and Gab Scores
28 pages
Curriculum Vitae
No ratings yet
Curriculum Vitae
3 pages
Application of Computers in Science and Research
100% (1)
Application of Computers in Science and Research
2 pages
4 Months NGS Internship
No ratings yet
4 Months NGS Internship
15 pages
Bif 401 PPT 1to 80 by M.habib
No ratings yet
Bif 401 PPT 1to 80 by M.habib
588 pages
Biopython Tutorial
No ratings yet
Biopython Tutorial
237 pages
Desy Indriani Nur Rahmah - B1J014014
No ratings yet
Desy Indriani Nur Rahmah - B1J014014
7 pages
Comparative Genomics
No ratings yet
Comparative Genomics
23 pages
Bioinformatics Notes (1)
No ratings yet
Bioinformatics Notes (1)
6 pages
Human Genome Project
No ratings yet
Human Genome Project
25 pages
Bioinformatics Sequence Structure and Databanks
No ratings yet
Bioinformatics Sequence Structure and Databanks
4 pages
A Field Guide to Whole-genome Sequencing, Assembly and Annotation
No ratings yet
A Field Guide to Whole-genome Sequencing, Assembly and Annotation
17 pages
M2 Imalis Schedule 2021-2022-: July 21St
No ratings yet
M2 Imalis Schedule 2021-2022-: July 21St
1 page
Microbial Genomics and Metagenomics 2021
No ratings yet
Microbial Genomics and Metagenomics 2021
4 pages
Q&A Report From The Workshop - Exploring EMBL-EBI Sequence Analysis Tools and Managing Bioinformatics Workflows
No ratings yet
Q&A Report From The Workshop - Exploring EMBL-EBI Sequence Analysis Tools and Managing Bioinformatics Workflows
4 pages
What Is Bioinformatics
No ratings yet
What Is Bioinformatics
6 pages
MAFFT Ver.7 - RBCL 1
No ratings yet
MAFFT Ver.7 - RBCL 1
1 page
BioInformatics Quiz1 Week12
50% (2)
BioInformatics Quiz1 Week12
5 pages
Nucleic_Acid_Databases
No ratings yet
Nucleic_Acid_Databases
37 pages
Bioinformatics Principles
No ratings yet
Bioinformatics Principles
6 pages
Lab Report 1 Bioinformatics
No ratings yet
Lab Report 1 Bioinformatics
13 pages
Bioinformation: Phylogenetic Analysis of Chloroplast Matk Gene From Zingiberaceae For Plant Dna Barcoding
No ratings yet
Bioinformation: Phylogenetic Analysis of Chloroplast Matk Gene From Zingiberaceae For Plant Dna Barcoding
4 pages
BT403 QP
No ratings yet
BT403 QP
2 pages
Where Did The BLOSUM62 Alignment Score Matrix Come From?: Primer
No ratings yet
Where Did The BLOSUM62 Alignment Score Matrix Come From?: Primer
2 pages
NyBerMan Free Internship Metagenomics
No ratings yet
NyBerMan Free Internship Metagenomics
1 page
Genome Scott Manus v5 Full 20190212144725 Copy 2
No ratings yet
Genome Scott Manus v5 Full 20190212144725 Copy 2
10,823 pages
Blast 2 Sequences, A New Tool For Comparing Protein and Nucleotide Sequences
No ratings yet
Blast 2 Sequences, A New Tool For Comparing Protein and Nucleotide Sequences
17 pages
FASTA Result1
No ratings yet
FASTA Result1
6 pages
ORF Finder Exercise-2
No ratings yet
ORF Finder Exercise-2
2 pages

Protein Database

Uploaded by

Protein Database

Uploaded by

Protein Databases- Types and Importance

Importance of Protein Databases

Primary databases of Protein

You might also like