ConSurf Logo The ConSurf Server
Server for the Identification of Functional Regions in Proteins

Is there a known protein structure?


Enter the PDB ID:  
Each structure in the PDB is represented by a 4 character alphanumeric identifier, assigned upon its deposition. For example, 1bxl and 1d66 are identification codes for PDB entries for Bcl-Xl / Bak complex and Gal4 (Residues 1 - 65) complex with 19mer DNA, respectively.

Upload your own PDB file:  
The ATOM record is essential for ConSurf run. In case you wish to upload your own PDB file, the ATOM record must be included in it.

Chain Identifier: 


Do you have a Multiple Sequence Alignment (MSA) to upload?


Upload your Multiple Sequece Alignment file:  
ConSurf accepts external MSAs in the 7 formats supported by CLUSTAL W. These are: NBRF/PIR, EMBL/SwissProt, Pearson (Fasta), GDE, Clustal, GCG/MSF and RSF format.
In case you provide an external MSA file in Fasta format, please use the "-" sign as the only gap symbol, as this is the only standard gap sign that ConSurf accepts.

Indicate the Query Sequence Name:  
This should be the exact name of the sequence in the provided MSA file, that corresponds to the query protein.

Do you have a Tree file to upload?

Upload your TREE file (in Newick format):   
In case you provide an external multiple sequence alignment (MSA) file, ConSurf can also accept a corresponding external phylogenetic tree in Newick (Phylip) format, for example: Tree File.
The names of the proteins sequences in the tree file must be identical to the names in the MSA file.



Paste here the protein sequence in FASTA format:

Choose parameteres to build the Multiple Sequence Alignment (MSA)

Alignment program

Proteins Database
SwissProt - a curated protein sequence database which strives to provide a high level of annotation
Clean UniProt - a modified version of the UniProt database aimed to screen the more reliable sequences.
UniRef90 - database cluster sequences and sub-fragments with 11 or more residues that have at least 90% sequence identity with each other (from any organism) into a single UniRef entry, displaying the sequence of a representative
UniProt is the universal protein resource, a central repository of protein data created by combining Swiss-Prot, TrEMBL and PIR.

Maximum Homolougs to collect
The maximum number of homologues, from those found by PSI-BLAST (with the given E-value), to be included in the calculation. In order to include all the homologues, replace the default value with the word "all".

Maximal %ID Between Sequences
Filter out redundant sequences. Sequences are clustered according to the given sequence identity cutoff, one representative of each cluster is reserved.

Minimal %ID For Homologs
Minimal sequence identity with the query sequence. Blast hits that shares less than the given identity cutoff are ignored

PSI-BLAST Iterations

PSI-BLAST E-value Cutoff

Homolog search algorithm
Let me select the sequences for the analysis manually out of Blast results

Calculation Method
The calculation method for the rate of evolution at each site in the MSA.
The Bayesian method was shown to significantly improve the accuracy of conservation scores estimations over the Maximum Likelihood method, in particular when a small number of sequences are used for the calculations. An additional advantage of the Bayesian method is that a confidence interval is assigned to each of the inferred evolutionary conservation score.

Evolutionary Substitution Model

Job Title


Enter a descriptive job title for your ConSurf query

User E-Mail

We will use this address in case your job has exceeded its maximal running time.

Send a link to the results by e-mail

Providing your e-mail address ensures that you don't lose the URL to your results and enables you to view your results in the future using the submission box

Show Example Results