Abstract
Motivation Prediction of protein stability change upon mutation (ΔΔG) is crucial for facilitating protein engineering and understanding of protein folding principles. Robust prediction of protein folding free energy change requires the knowledge of protein three-dimensional (3D) structure. Unfortunately, protein 3D structure is not always available. In this case, one can still predict the protein stability change by constructing a homology model of the protein; however, the accuracy of homology model-based ΔΔG predictions is unknown. The perspectives of using 3D structures of the best templates are also unclear.
Results To investigate these questions, we used the most popular and accurate publicly available tools: FoldX for stability change prediction and I-Tasser for homology modeling. We found that both homology models and best templates worsen the ΔΔG prediction, with best templates performing 1.5 times better than homology models. For AlphaFold models, we also found that the best templates seem to outperform protein models. Our findings imply using the 3D structures of the best templates for ΔΔG prediction if the 3D protein structure is unavailable.
Contact d.ivankov{at}skoltech.ru
Competing Interest Statement
The authors have declared no competing interest.
Abbreviations
- 3D
- three-dimensional
- DSSP
- dictionary of secondary structure for proteins
- PDB
- protein data bank
- SCOP
- structural classification of proteins
- MW
- Mann-Whitney
- PCC
- Pearson correlation coefficient
- MSE
- mean standard error
- GDT
- global distance test
- CASP
- critical assessment of protein structure prediction
- BLAST
- basic local alignment search tool
- RSA
- relative solvent accessibility