2025 AIChE Annual Meeting
(393k) DFT and ML Based Property Prediction of Metal Complex Photosensitizers for Photodynamic Therapy
Authors
In this presentation, a hybrid mechanistic and data-driven model is proposed for the quantitative structure-property relationship (QSPR) of photosensitizers. Important excited-state quantum-chemical descriptors (QCD) are first calculated based on density functional theory (DFT), since these QCD can describe the mechanism of the type II PDT process through the differences in electron density among the different excited states. These descriptors and other three kinds of descriptors, including metal-centered descriptors (MCD) describing the impact of radius, oxidation state and outer electron configuration of different metal center; molecule structure descriptors (MSD) describing the impact of molecular size and different functional groups; external condition descriptors (ECD) describing the impact of solvents and excitation wavelengths, are used to build different machine learning (including LASSO, support vector regression, kernel ridge regression, random forest regression and XGBoost) models. These models are tested on the singlet oxygen quantum yield (which is more important as an evaluation index of type II photosensitizers for PDT) prediction of hexa-coordinate transition metal complex (such as Ru-complex, Ir-complex and Re-complex) photosensitizers respectively.
Subsequent comparison of different combinations of descriptors (MCD+QCD+ECD and MCD+MSD+ECD) as model input confirm the impact of QCD which describes the excited state properties on singlet oxygen quantum yield. Support vector regression model and kernel ridge regression model also shows good generalization ability on external test set while XGBoost model shows a little overfitting. Finally, we confirm support vector regression model and kernel ridge regression model with all four kinds of descriptors are the best model on the metal complex photosensitizers property prediction out of the studied models. These two low dimensional machine learning models could be a useful method aiding experimental research in pre-synthetic screening of hexa-coordinate photosensitizers. Other transition metal complexes, such as Metal porphyrin complexes, may also be included in the ML training set for future research.