2025 AIChE Annual Meeting

(678a) Automatic Construction of Data-Driven Collective Variables Using Local Atomic Environments

Authors

Zheng Yu, Princeton University
Baris Eser Ugur, University of Rochester
Nicolás Giovambattista, Brooklyn College of the City University of New York
Pablo Debenedetti, Princeton University
Molecular dynamics (MD) simulations are often used to analyze mixtures and transitions in materials at atomic resolution. However, it can be difficult to identify collective variables that describe the extent to which an atomic environment belongs to one state or another, hindering the ability of MD simulations to characterize mixtures and transitions. We present a data-driven approach to automatic collective variable identification, and we use this approach to analyze MD simulations of transitions between high-density amorphous (HDA) and low-density amorphous (LDA) ice.

Specifically, we show that non-linear dimensionality reduction of atom-centered symmetry functions yields distinct clusters of amorphous ice configurations in the resulting latent space. We show that applying kernel density estimation to these clusters allows us to accurately classify atomic environments as being HDA, LDA, or neither. We also use this latent space to determine physically interpretable collective variables associated with transitions between HDA and LDA. The resulting model is used to quantify thermodynamic properties (i.e., transition temperatures and pressures) associated with the transition.

We show that this method can be used to analyze any set of structurally distinct configurations beyond HDA/LDA ice by considering two additional material systems. First, we use local atomic environments to characterize differences between multiple coarse-grained potentials for small molecules that yield similar macroscopic properties. Second, we determine microscopic mechanisms for (anti-) plasticization behavior in simulations of polymers by characterizing the local environments of water molecules embedded in these polymer materials.

Overall, this work shows how applying machine learning to ensembles of local atomic environments from MD simulations can reveal interpretable physical insights into materials behavior.