2020 Virtual AIChE Annual Meeting
(346bb) Bayesian Model Selection for Non-Covalent Interactions
Authors
As part of the Open Force Field Initiative, we are developing force fields, including non-covalent interaction models, using data-driven techniques. To this end, we explore the use of Bayesian inference to make data-driven choices between dispersion-repulsion parameters and functional forms, by calculating Bayes factors, which are essentially âoddsâ between different models and sets of parameters. This strategy requires repeated evaluation of parameter sets, which has previously been a large source of computational expense.
In this study, we test this strategy on the 2-center Lennard Jones plus Quadrupole (2CLJQ) model for simple fluids, as its simple functional form is easily modified and analytical âsurrogate modelsâ exist in the literature, allowing for fast evaluation of parameter sets. In this way, we can sample over the entire distribution of parameters and calculate Bayes factors, without incurring the computational cost of equilibrium simulations. Using the reversible jump Monte Carlo (RJMC) algorithm, we sample the posterior probability distributions of both the models and the parameters.
We ask whether including the modelâs quadrupole parameter is justified while reproducing temperature-dependent density, saturation pressure, and surface tension data for simple molecules. In general, we find that the quadrupole is not justified for reproducing these properties (with several notable exceptions). Additionally, we produce parameter probability distributions for these compounds, valuable information for guiding future parameterization; through these distributions we identify targets for future dimensional reduction. This work demonstrates the utility of Bayesian inference as a tool for model selection and paves the wave for future application of this technique to more complex decisions required in fitting complete force fields.