2020 Virtual AIChE Annual Meeting
(346au) Machine Learning Using the Guest/Host Energy Histogram to Predict the Adsorption of Chain Molecules
Authors
For n-hexane, we found a large number of outliers in the parity plot of ML versus GCMC. We determined that these outliers correspond to points on the adsorption isotherm near the point of condensation in the pores. Through machine learning, we were able to find that the GCMC simulations had not converged. Thus, the error was in the simulation and not the ML model. These un-converged points are nearly impossible to identify by looking at error bars generated from GCMC simulations because the systems remain trapped in one state and appear converged. Thus, we discovered a way to improve the quality of high-throughput screening using molecular simulations at these critical conditions through machine learning.
We also tested additional new features to compensate information loss when constructing one-dimensional histograms from the three-dimensional energy grids. Additionally, the new energy histogram algorithm shows robustness when predicting selectivity for Xe/Kr separation.
References:
(1) Bucior, B. J.; Bobbitt, N. S.; Islamoglu, T.; Goswami, S.; Gopalan, A.; Yildirim, T.; Farha, O. K.; Bagheri, N.; Snurr, R. Q. Energy-Based Descriptors to Rapidly Predict Hydrogen Storage in MetalâOrganic Frameworks. Mol. Syst. Des. Eng. 2019, 4 (1), 162â174. https://doi.org/10.1039/C8ME00050F.
(2) Colón, Y. J.; Gómez-Gualdrón, D. A.; Snurr, R. Q. Topologically Guided, Automated Construction of MetalâOrganic Frameworks and Their Evaluation for Energy-Related Applications. Cryst. Growth Des. 2017, 17 (11), 5801â5810. https://doi.org/10.1021/acs.cgd.7b00848.
(3) Tibshirani, R. Regression Shrinkage and Selection via the Lasso. J. R. Stat. Soc. Ser. B Methodol. 1996, 58 (1), 267â288.
(4) Liaw, A.; Wiener, M. Classiï¬cation and Regression by RandomForest. 2002, 2, 5.
(5) Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; Kudlur, M.; Levenberg, J.; Monga, R.; Moore, S.; Murray, D. G.; Steiner, B.; Tucker, P.; Vasudevan, V.; Warden, P.; Wicke, M.; Yu, Y.; Zheng, X. TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16); 2016; pp 265â283.