Breadcrumb
- Home
- Publications
- Proceedings
- 2025 AIChE Annual Meeting
- Computing and Systems Technology Division
- 10E Data-Driven Optimization
- (531d) Foundation Model-Guided Optimization of Chemical Reaction Spaces for Autonomous Experimentation
To overcome these limitations, it is critical to reformulate the search space by integrating chemical knowledge and physical theories [4]. This enables the extraction of meaningful features from black-box objective functions, thereby improving the interpretability of the experimental designs. Such strategies provide intuitive insights into experimental systems, reduce unnecessary computations, and focus on experimentally meaningful regions. As part of this representation reformulation, we explored two approaches: (i) representing molecules using physicochemical descriptors obtained from first-principles calculations such as density functional theory (DFT), and (ii) compressing molecular features into low-dimensional vector embeddings that retain the key characteristics of the molecules using artificial intelligence (AI) models. In particular, we employed several pre-trained models to derive latent molecular representations. These included molecular language transformer (MoLFORMER) [5], geometry-enhanced molecular representation learning (GEM) [6], fractional denoising (Frad) [7], and knowledge-guided pre-training of graph transformer (KPGT) [8], among others that can be readily incorporated into the framework. The resulting embeddings were then used as input variables for optimization. Notably, these representations in this study demonstrated superior optimization performance compared with OHE-based inputs. For example, in C–N cross-coupling and Suzuki reactions, MoLFORMER achieved yields exceeding 90% within 20 iterations out of 100 virtual experiments, reaching optimal conditions significantly faster. On the other hand, representations with excessively high embedding dimensions—such as those derived from MoLFORMER and KPGT—tended to require more iterations to reach optimal conditions or showed unstable convergence in certain reactions, including the Buchwald–Hartwig and chiral phosphoric acid-catalyzed thiol addition reactions. In addition, reaction fingerprints, such as differential reaction fingerprint (DRFP) [9] and data-driven reaction fingerprints (RXNFP) [10], have gained attention as promising approaches owing to their capability to effectively capture the overall characteristics of a reaction by integrating information on both reactants and products, as well as the transformation between them. Following this trend, future work will focus on developing reaction representations that can more precisely reflect the nonlinear interactions among reactants and the geometrical structures of molecules.
However, rigorously comparing different combinations of reaction representations and optimization algorithms under varying experimental scenarios remains a time- and resource-intensive challenge [11]. To address this, we developed a unified and user-friendly platform that automatically benchmarks diverse optimization strategies and encoding schemes across a range of organic reaction scenarios. The platform is designed to collect large-scale high-throughput experimental data from organic reactions and enables a comparative analysis of the representations generated from various foundation models. In addition to classical optimization algorithms, the platform incorporates a range of state-of-the-art optimization strategies [12-14], allowing quantitative performance evaluation across a broad spectrum of approaches. A built-in reaction yield prediction model, trained on reaction data, is also provided, facilitating pre-evaluation and simulation of candidate experimental conditions. Furthermore, the platform is not limited to organic chemistry and can be extended to other domains involving black-box optimization problems, such as process systems, material design, and numerical optimizations. It supports custom objective functions defined by users, allowing the design of application-specific optimization workflow. The platform is designed for potential compatibility with automated workflows, supporting closed-loop optimization cycles in which the proposed conditions can be automatically executed, analyzed, and used to inform subsequent iterations. Ultimately, the proposed framework provides a scalable foundation for addressing the complexity of reaction optimization while advancing the feasibility and scientific insights of autonomous experimental systems.
[References]