2025 AIChE Annual Meeting

(389ck) Augmenting Large Language Models with Reasoning for Navigating Protein-Binding Landscape

Authors

Nisarg Joshi - Presenter, University of Washington
Jim Pfaendtner, University of Washington
Designing ligands that selectively bind to protein targets is a cornerstone of drug discovery. However, identifying suitable ligands that fit the binding site and preserve protein function remains experimentally time consuming and expensive. To accelerate de novo drug discovery, traditional inverse design methods have opened many promising avenues.1 Large language models (LLMs) have further transformed machine learning (ML) applications, shifting from purely predictive tasks to leveraging vast corpora for chemistry. Their generative capabilities have led to the development of text-to-molecule models, offering new approaches for molecular design and drug discovery. Yet, existing LLMs often struggle to generate chemically valid molecules and typically lack mechanisms for reasoning-based validation of their outputs. Much less work has focused on creating reasoning LLMs for scientific tasks,2 and relatively little has been done specifically for protein-binding tasks.

In this work we propose a reasoning LLM which is capable of designing protein-binding ligands. Through group relative policy optimization (GRPO)3 method with a verifiable docking score reward, we demonstrate how reasoning models can enable in sampling valid ligand candidates and justify their predicted binding capabilites. This reasoning driven approach helps towards an efficient and accelerated path towards drug discovery.

References:

  1. Harren et al. WIREs Comput Mol Sci. 2024; 14(3):e1716. doi.org/10.1002/wcms.1716

  2. S. M. Narayanan et al. arXiv:2506.17238

  3. Shao et al. arXiv:2402.03300