2025 AIChE Annual Meeting
(24g) Eppi_Ddg: Efficiently Predicting Point Mutation Effects Using a Structure-Based Feature Space
The ensemble model, which uses an ensemble average of the change in EPPI features, achieved state-of-the-art performance, with a Pearson correlation of 0.723 during cross-validation and 0.716 on a blind validation set. Additionally, a faster model, using only a single state for the mutated structure performed comparably to the previous state of the art method, flex_ddg, in two orders of magnitude less time. Notably, this approach relies only on efficiently computed structural features which eliminates the need for costly webservers, molecular dynamics simulations, ensemble generation, and deep learning architectures. Finally, the models demonstrated an ability to generate better predictions with the addition of limited experimental data. For some unseen complexes, as little as 1 or 2 experimental datapoints improved the model’s performance in both correlation and mean absolute error. This transfer learning performance suggests an application in efficient screening of unseen complexes, making use of the quick retraining process and iterative experimental characterization to carefully and confidently select mutations that result in the desired effect.