2025 Spring Meeting and 21st Global Congress on Process Safety
(74c) Surrogate Quality Score: A Hybrid Metric to Balance Model Accuracy and Complexity
With this discussion, the key question in modeling industrial processes and systems is whether a surrogate’s predictive accuracy should be the sole criterion to choose the best surrogate form? Often, process experts are interested in having accurate, yet simple and/or interpretable models. A simple surrogate might be necessary for surrogate-based optimization for maintaining tractability, or an interpretable model might be needed to correlate model predictions with theoretical evidence. Thus, the above question can be reframed as follows: Does the improvement in surrogate accuracy justify an increase in model complexity?
To this end, we formulate a hybrid comparative metric, Surrogate Quality Score (SQS), that balances surrogate accuracy with its complexity to compare and rank surrogates with varying accuracies and complexities. Our metric is developed heuristically, validated on several scenarios with differing model accuracies and complexities. SQS uses normalized root mean squared error on a validation set to interpret surrogate accuracy and degrees of freedom to quantify model complexity. Unlike metrics that penalize goodness-of-fit with a fixed complexity penalty (such as AIC, BIC, etc.), SQS has a tunable parameter to vary the balance between accuracy and complexity factors, thereby allowing the users to observe the best surrogates for varying degrees of accuracy-complexity tradeoffs. This could enable modeling experts to make informed decisions in selecting the ideal surrogate form with appropriate accuracy and simplicity based on the modeling objective. While SQS has been incorporated in our surrogate selection framework, LEAPS2 (Ahmad and Karimi, 2021), our modified SQS is more generic, robust, and versatile than the existing version in LEAPS2.
References:
Ahmad, M., Karimi, I.A., 2022. Families of similar surrogate forms based on predictive accuracy and model complexity. Comput. Chem. Eng. 163, 107845. https://doi.org/10.1016/j.compchemeng.2022.107845
Ahmad, M., Karimi, I.A., 2021. Revised learning based evolutionary assistive paradigm for surrogate selection (LEAPS2v2). Comput. Chem. Eng. 152, 107385. https://doi.org/10.1016/j.compchemeng.2021.107385
Cozad, A., Sahinidis, N.V., Miller, D.C., 2014. Learning surrogate models for simulation‐based optimization. AIChE J. 60, 2211–2227. https://doi.org/10.1002/aic.14418
Garud, S.S., Karimi, I.A., Kraft, M., 2018. LEAPS2: Learning based Evolutionary Assistive Paradigm for Surrogate Selection. Comput. Chem. Eng. 119, 352–370. https://doi.org/10.1016/j.compchemeng.2018.09.008
Sun, W., Braatz, R.D., 2021. Smart process analytics for predictive modeling. Comput. Chem. Eng. 144, 107134. https://doi.org/10.1016/j.compchemeng.2020.107134
Williams, B.A., Cremaschi, S., 2019. Surrogate Model Selection for Design Space Approximation And Surrogatebased Optimization, in: Computer Aided Chemical Engineering. Elsevier, pp. 353–358. https://doi.org/10.1016/B978-0-12-818597-1.50056-4