2025 AIChE Annual Meeting
(510b) A Machine Learning and Data Driven Holistic LCA Approach for Early-Stage Sustainability Assessment
Authors
This study aims to develop machine learning (ML) and data analytics approach to assess the environmental impact of a chemical at the early design stage for its entire life cycle. The first phase of this approach focusses on a cradle-to-gate LCA framework, which assesses the environmental impact of a product from the extraction of raw materials to the point when it enters the manufacturing plant to be processed or used. We employed supervised learning models to forecast four key endpoints Life Cycle Impact Assessment (LCIA) metrics: Resource Utilization Impact (RUI), Ecosystem Quality Impact (EQI), Global Warming Potential (GWP), and Human Health Impact (HHI)[3] . These models use Artificial Neural Networks (ANN) and eXtreme Gradient Boosting (XGBoost) to estimate environmental impacts for novel chemicals using their thermodynamic and molecular descriptor properties[4] .
We extend the analysis to the gate-to-gate phase which evaluates the environmental impact during the operational phase when these chemicals are actively used in industrial processes. Various technologies contribute to the total environmental impact of industrial chemical processes, with separation technologies playing a particularly important role due to their high energy consumption and direct impact on decreasing emissions and waste generation. Given their vital function and environmental impact, this study prioritizes a gate-to-gate evaluation of separation technologies, focusing on how different separation methods affect energy consumption, emission levels, and overall sustainability. This study focusses on technology scale-up analysis and impact assessment, which involves moving from lab-scale to industrial-scale production of greener chemicals. We developed a regression model based on a scaling equation to estimate GWP using fundamental process parameters on an industrial level. The model is trained on a dataset obtained from literature, industrial reports, and computer simulations, resulting in accurate predictions across various separation technologies. Case studies in the pharmaceutical and specialty chemicals sectors have been collected from literature for detailed data on materials, emissions, global warming potential, and energy consumption. Additional case studies are being simulated in Aspen and SuperPro® Designer to evaluate their material and energy balance, while the LCA is conducted in SimaPro to obtain the GWP. These datasets will serve as inputs to identify common scalability indices for environmental emissions and support the development of more sustainable industrial processes.
Open-source AI tools such as ChatGPT, Copilot, Gemini, DeepSeek, and others were employed to accelerate data collecting for the gate-to-gate phase analysis. These tools assisted in finding relevant articles, reports, and industrial case studies related to the technology under consideration that contained the required parameters. Each AI source was carefully checked for accuracy.
To demonstrate the practical effectiveness of the ML-driven Data Analytics approach developed, two different case studies will be analyzed: one in pharmaceutical manufacture and the other in organic waste valorization. Our model will predict the GWP for each case, which will then be compared against the actual GWP values determined by the LCA tool, SimaPro.
[1] “Summary for Policymakers.” Accessed: Mar. 24, 2025. [Online]. Available: https://www.ipcc.ch/report/ar6/wg2/chapter/summary-for-policymakers/
[2] R. Hoogmartens, S. Van Passel, K. Van Acker, and M. Dubois, “Bridging the gap between LCA, LCC and CBA as sustainability assessment tools,” Environ. Impact Assess. Rev., vol. 48, pp. 27–33, Sep. 2014, doi: 10.1016/j.eiar.2014.05.001.
[3] K. M. Yenkie, E. A. Aboagye, A. L. Lehr, J. Pazik, J. Longo, and R. P. Hesketh, “Machine Learning enabled Life Cycle Assessment for Early-stage Sustainable Process Design,” in Computer Aided Chemical Engineering, vol. 53, Elsevier, 2024, pp. 2881–2886. doi: 10.1016/B978-0-443-28824-1.50481-6.
[4] E. A. Aboagye et al., “Machine Learning Methods for the Forecasting of Environmental Impacts in Early-stage Process Design,” presented at the Foundations of Computer-Aided Process Design, Breckenridge, Colorado, USA, Jul. 2024, pp. 621–628. doi: 10.69997/sct.141240.