Protein
The Entrez Protein Database is a collection of protein sequences from
a variety of sources, including SwissProt, PIR, PRF, PDB, and translations
from annotated coding regions in GenBank and RefSeq.
RefSeq
The Reference Sequence (RefSeq) database is a biologically non-redundant
collection of DNA, RNA, and protein sequences. RefSeq provides one example of
each biological molecule per organism. Alternatively spliced transcripts
receive separate RefSeq entries even when they share identical exons.
Common RefSeq accession prefixes
| Accession prefix | Molecular type |
| --- | --- |
| NC_ | Complete genomic molecule |
| NT_ | Genomic contig |
| NM_ | mRNA |
| XM_ | mRNA (Computed) |
| NP_ | Protein |
| XP_ | Protein (Computed) |
| NR_ | RNA |
| XR_ | RNA (Computed) |
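The prefix table above can be expressed as a simple lookup. The following is a minimal sketch, not an NCBI tool: the names `REFSEQ_PREFIXES`, `refseq_molecule_type`, and `is_computed` are hypothetical, and the code assumes the prefix is always the first three characters of the accession. It covers only the prefixes listed in the table.

```python
# Prefix-to-type mapping copied from the table above (hypothetical helper names).
REFSEQ_PREFIXES = {
    "NC_": "Complete genomic molecule",
    "NT_": "Genomic contig",
    "NM_": "mRNA",
    "XM_": "mRNA (Computed)",
    "NP_": "Protein",
    "XP_": "Protein (Computed)",
    "NR_": "RNA",
    "XR_": "RNA (Computed)",
}


def refseq_molecule_type(accession):
    """Return the molecular type for an accession, or None if its prefix is not in the table."""
    # Assumption: the prefix is the first three characters, e.g. "NM_".
    prefix = accession.strip().upper()[:3]
    return REFSEQ_PREFIXES.get(prefix)


def is_computed(accession):
    """Return True when the accession's prefix is one of the table's computed types (X*_)."""
    molecule_type = refseq_molecule_type(accession)
    return molecule_type is not None and molecule_type.endswith("(Computed)")
```

For example, `refseq_molecule_type("NM_000546")` returns `"mRNA"`, and `is_computed("XP_000001")` returns `True`. The accession strings here are illustrative inputs only. An accession whose prefix is not in the table yields `None` rather than a guess.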
Domain
The Conserved Domain Database (CDD) is a collection of multiple sequence
alignments that represent functional domains conserved across evolution.
Conserved domains (CDs) are recurring units or modules within polypeptide
chains, each with a distinct evolutionary origin and function.
PROW
Protein Reviews on the Web (PROW) contains authoritative, peer-reviewed short
summaries on proteins and protein families.