Full text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Methods such as AlphaFold have revolutionized protein structure prediction, making quantitative prediction of the thermodynamic stability of individual proteins and their complexes one of the next frontiers in computational protein modeling. Here, we develop methods for using protein language models (PLMs) with protein mutational datasets related to protein tertiary and quaternary stability. First, we demonstrate that fine-tuning of a ProtT5 PLM enables accurate prediction of the largest protein mutant stability dataset available. Next, we show that mutational impacts on protein function can be captured by fine-tuning PLMs, using green fluorescent protein (GFP) brightness as a readout of folding and stability. In our final case study, we observe that PLMs can also be extended to protein complexes by identifying mutations that are stabilizing or destabilizing. Finally, we confirmed that state-of-the-art simulation methods (free energy perturbation) can refine the accuracy of predictions made by PLMs. This study highlights the versatility of PLMs and demonstrates their application towards the prediction of protein and complex stability.

Details

Title
Accurate Prediction of Protein Tertiary and Quaternary Stability Using Fine-Tuned Protein Language Models and Free Energy Perturbation
Author
Li Xinning 1   VIAFID ORCID Logo  ; Perez Ryann 1 ; Ferrie, John J 2   VIAFID ORCID Logo  ; James, Petersson E 1   VIAFID ORCID Logo  ; Giannakoulias Sam 3   VIAFID ORCID Logo 

 Department of Chemistry, University of Pennsylvania, Philadelphia, PA 19104, USA; [email protected] (X.L.); [email protected] (R.P.) 
 Division for Advanced Computation, Sentauri Inc., Glenwood, MD 21738, USA; [email protected] 
 Department of Chemistry, University of Pennsylvania, Philadelphia, PA 19104, USA; [email protected] (X.L.); [email protected] (R.P.), Division for Advanced Computation, Sentauri Inc., Glenwood, MD 21738, USA; [email protected] 
First page
7125
Publication year
2025
Publication date
2025
Publisher
MDPI AG
ISSN
16616596
e-ISSN
14220067
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3239070153
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.