Identification of evolutionary important residues by means of an entropy based analysis of MSAs.
- Upload a multiple sequence alignment (MSA) file in FASTA format first. The MSA must contain at least 120 sequences for a sound computation of mutual dependencies. The first sequence in the MSA will be used for numbering in the results. Please note that files larger than 15 MB are not accepted.
- Here you find an example MSA file.
- High Scoring Residue Pairs (HSRPs) are used to determined conn(k) values. A HSPR value of 75 is a good starting point. Alternatively, a fraction of 2.5% of all residues works well for many proteins.
- The addition of pseudo counts helps to correct for small frequency values.