- a curated protein sequence database which strives to provide a high level of annotation.UniRef90
- this database clusters sequences and sub-fragments with 11 or more residues that have at least 90% sequence identity with each other (from any organism) into a single UniRef entry, displaying the sequence of a representative.UniProt
is the universal protein resource, a central repository of protein data created by combining Swiss-Prot, TrEMBL and PIR.Clean Uniprot
is a modified version of the UniProt
database aimed to gather the more reliable sequenceses. See details here
this database is maintained by the NCBI
and comprised of non-redundant sequences from GenBank CDS translations, PDB, SwissProt, PIR, and PRF, excluding environmental samples from WGS projects.