The ConSurf Server

SERVER FOR THE IDENTIFICATION OF FUNCTIONAL REGIONS IN BIOPOLYMERS

The ConSurf Server

SERVER FOR THE IDENTIFICATION OF FUNCTIONAL REGIONS IN BIOPOLYMERS

ConSurf Analysis for: 2POL

Mol* 3D Viewer
1
2
3
4
5
6
7
8
9
Variable
Average
Conserved
Insufficient Data

ConSurf Results
1          11         21         31         41         
M K F T V E R E H L  L K P L Q Q V S G P  L G G R P T L P I L  G N L L L Q V A D G  T L S L T G T D L E
51         61         71         81         91         
M E M V A R V A L V  Q P H E P G A T T V  P A R K F F D I C R  G L P E G A E I A V  Q L E G E R M L V R
101        111        121        131        141        
S G R S R F S L S T  L P A A D F P N L D  D W Q S E V E F T L  P Q A T M K R L I E  A T Q F S M A H Q D
151        161        171        181        191        
V R Y Y L N G M L F  E T E G E E L R T V  A T D G H R L A V C  S M P I G Q S L P S  H S V I V P R K G V
201        211        221        231        241        
I E L M R M L D G G  D N P L R V Q I G S  N N I R A H V G D F  I F T S K L V D G R  F P D Y R R V L P K
251        261        271        281        291        
N P D K H L E A G C  D L L K Q A F A R A  A I L S N E K F R G  V R L Y V S E N Q L  K I T A N N P E Q E
301        311        321        331        341        
E A E E I L D V T Y  S G A E M E I G F N  V S Y V L D V L N A  L K C E N V R M M L  T D S V S S V Q I E
351        361        
D A A S Q S A A Y V  V M P M R L


Running Parameters


  • Homologues Search:
    • Homologues were collected from UNIREF90 database.
    • Homologues search algorithm is HMMER.
    • E-value cutoff is 0.0001.
    • Number of Iterations is 1.
  • Homologues Thresholds:
    • CD-HIT cutoff is 95% (This is the maximal sequence identity between homologues).
    • Maximal number of final homologues is 150.
    • Maximal overlap between homologues is 10% (If overlap between two homologues exceeds 10%, the highest scoring homologue is chosen).
    • Coverage is 60% (This is the minimal percentage of the query sequence covered by the homologue).
    • Minimal sequence identity with the query sequence is 35%.
  • Alignment, Phylogeny and Conservation Scores:
    • Multiple Sequence Alignment was built using MAFFT.
    • Phylogenetic tree was built using Neighbor Joining with ML distance.
    • Conservation Scores were calculated with the Bayesian method.
    • Amino acid substitution model was chosen by best fit.

Homologues, Alignment and Phylogeny


    • 24621 homologues were collected from the UNIREF90 database using HMMER.
    • Of these, 4744 homologues passed the thresholds (min/max similarity, coverage, etc), 4659 of them are CD-HIT unique.
    • The calculations were conducted on 150 hits (query included), sampled from the unique hits. Click here if you wish to view the list of sequences which produced significant alignments, but were not chosen as hits.
  • Alignment details
    • The average number of replacements between any two sequences in the alignment;
      A distance of 0.01 means that on average, the expected replacement for every 100 positions is 1.
    • Average pairwise distance : 1.1
    • Lower bound : 0.06
    • Upper bound : 2.0
    • Residue variety per position in the MSA (The table is best viewed with an editor that respects Comma-Separated Values)

Related Links