Breadcrumb
- Home
- Publications
- Proceedings
- 2025 AIChE Annual Meeting
- Food, Pharmaceutical & Bioengineering Division
- Fundamentals of Biocatalyst Design and Engineering
- (345c) Pathlm: Chemical Language Model for Biosynthesis Planning
Here, we introduce PathLM, a platform for de novo biosynthetic pathway design that integrates large language model finetuning with reinforcement learning to predict complete, balanced routes between precursors and target molecules. PathLM is trained on both publicly available biosynthetic reactions and an internally curated corpus of pathways, enabling it to capture evolutionary context and cofactor requirements across multistep sequences. A reinforcement learning framework enforces chemical validity and mass balance, ensuring that individual reactions and overall pathways are stoichiometrically balanced and yield chemically feasible intermediates.
We demonstrate that PathLM can propose concise, end to end biosynthetic routes that explicitly incorporate necessary cofactors and side metabolites. By bridging the gap between single step retrosynthesis and holistic pathway engineering, PathLM promises to accelerate the development of sustainable biocatalytic processes for pharmaceuticals, fine chemicals and biofuels.