2021 Annual Meeting
(346p) Chemometric Techniques for the Combined Calibration/Parameter Estimation of Pharmaceutical Drug Substance Manufacture
Authors
For effective parameter estimation, several aspects associated with the model formulation must be considered. First, from a structural point of view, a numerically robust model formulation is essential for reliable execution of the optimization runs required for parameter estimation. One relevant example arises in the choice of kinetic expressions, where various reformulations have been proposed [3]â[5] to facilitate optimization execution. Second, scaling of the state variables employed in the model also plays a role in the efficiency of model solution, since the solution methods invariably make use of numerical approximations. Lastly, for derivative-based optimization, reliable gradient information from the modeled system has direct impact on the convergence speed and accuracy of the of the parameter estimates.
The other important element of the parameter estimation problem is the experimental data. The experimental data obtained for operations such as reaction or crystallization typically come from offline/online spectroscopic measurements, resulting in datasets that relate variables of interest to multivariate measurements. This permits the use of rich datasets that contribute to the robustness of the measurements, in particular for multicomponent systems, but also poses some challenges related to handling the increased dimensionality of the mathematical problem which must be solved, and the feasibility of executing the parameter estimation within reasonable computation time.
Multivariate data can be incorporated in various ways into a parameter estimation framework. One of the most widespread approaches is to build chemometric models using samples prepared with known concentration of the analyzed components. The measured spectra of the samples are related to their corresponding concentration through these calibration models, either using data at a single wavelength or using multivariate, reduced-dimension linear models such as principal component regression (PCR) or partial least squares (PLS) [6]. On the other hand, multiple curve resolution coupled with alternating least squares (MCR-ALS) [7], [8] approaches the problem by directly analyzing the spectra resulting from the experiment. Here, the spectra changes as a result of the change in chemical composition of the multicomponent system under study are directly considered. Both the concentration of the species of interest and their individual absorptivities are unknown in this method, hence its two-layer structure. Depending whether a first-principles or a data-driven model representing the system is used, the combined methodology is termed as hard-modeling or soft-modeling [9]. The MCR assumes that the spectroscopic response of the system is caused by a linear combination of the absorbing components present in the system, a multicomponent version of the Beer-Lambert law with a bilinear mathematical structure [10].
At the top level of the hard-modeling variant of the MCR-ALS methodology, a first-principles model that contains the parameters to be estimated is used to calculate the concentration of interest, given an initial parameter estimate. Then, determination of the single-component absorptivity is dealt within the MCR component, which calculates a modeled spectra by using the concentrations returned by the model and the experimental spectra. Then, the ALS component performs the parameter optimization by comparing the experimental and calculated spectra and changing the values of the parameters until sufficient agreement is reached. In this manner, both the calibration and the parameter estimation are achieved in a simultaneous fashion. Finally, the mathematical structure of the MCR problem allows it to be posed as the calculation of the modeled spectra as a matrix orthogonal projection calculation, for which expressions for derivative calculations are available [11], [12]. Calculation of the derivative of the orthogonal projectors can be further coupled to the information gained by sensitivity of the original modeled states (e.g. concentration) to enhance the quality of the derivative information passed to the optimization algorithm.
In this work, we report a case study involving the synthesis and crystallization of lomustine, [13], [14]. We demonstrate using a dynamic simulation-optimization approach to determine the parameters of the first-principles models for the reaction and crystallization steps drawn from the PharmaPy library [17]. Liquid-phase reaction experiments are carried out in batch and semi-batch reaction setups with automated sampling, and several kinetic reaction mechanisms are evaluated and compared. Also, cooling batch crystallization experiments are used to determine nucleation, growth and dissolution parameters, using the method of moments [15] to model the crystallization process and determine the crystal size distribution. UV-Vis and infrared spectroscopic data are generated for both the reaction and crystallization systems, and analyzed under the MCR-ALS framework. Focused beam reflectance measurement (FBRM) and particle vision and measurement (PVM) are used for the crystallization experiments to determine nucleation and obtain in-situ particle images. Dynamic parametric sensitivities are calculated by a forward sensitivity method, using analytical jacobians for both reaction and crystallization [16], [17]. Lastly, parametric uncertainty in the models is quantified using sampling techniques (i.e. bootstrapping).
References
[1] L. X. Yu et al., âUnderstanding pharmaceutical quality by design,â AAPS J., vol. 16, no. 4, pp. 771â783, 2014.
[2] Q. Su et al., âA perspective on Quality-by-Control (QbC) in pharmaceutical continuous manufacturing,â Comput. Chem. Eng., vol. 125, pp. 216â231, 2019.
[3] D. G. Watts, âEstimating parameters in nonlinear rate equations,â Can. J. Chem. Eng., vol. 72, no. 4, pp. 701â710, Aug. 1994.
[4] P. Bilardello, X. Joulia, J. M. Le Lann, H. Delmas, and B. Koehret, âA general strategy for parameter estimation in differentialâalgebraic systems,â Comput. Chem. Eng., vol. 17, no. 5â6, pp. 517â525, May 1993.
[5] M. Quaglio et al., âAn online reparametrisation approach for robust parameter estimation in automated model identification platforms,â Comput. Chem. Eng., vol. 124, pp. 270â284, 2019.
[6] J. C. Marek, âMultivariate analysis in API development,â in Chemical Engineering in the Pharmaceutical Industry, 2nd ed., D. J. am Ende and M. T. am Ende, Eds. Hoboken, NJ, USA: John Wiley & Sons, Inc., 2019, pp. 909â918.
[7] C. Ruckebusch and L. Blanchet, âMultivariate curve resolution: A review of advanced and tailored applications and challenges,â Anal. Chim. Acta, vol. 765, pp. 28â36, 2013.
[8] G. Puxty, M. Maeder, and K. Hungerbühler, âTutorial on the fitting of kinetics models to multivariate spectroscopic measurements with non-linear least-squares regression,â Chemom. Intell. Lab. Syst., vol. 81, no. 2, pp. 149â164, 2006.
[9] M. Short, L. T. Biegler, S. GarcÃa-Muñoz, and W. Chen, âEstimating variances and kinetic parameters from spectra across multiple datasets using KIPET,â Chemom. Intell. Lab. Syst., vol. 203, no. March, 2020.
[10] R. Tauler, âMultivariate curve resolution applied to second order data,â Chemom. Intell. Lab. Syst., vol. 30, no. 1, pp. 133â146, 1995.
[11] G. Golub and V. Pereyra, âSeparable nonlinear least squares: the variable projection method and its applications,â Inverse Probl., vol. 19, no. 2, pp. R1âR26, Apr. 2003.
[12] K. M. Mullen, M. Vengris, and I. H. M. Van Stokkum, âAlgorithms for separable nonlinear least squares with application to modelling time-resolved spectra,â J. Glob. Optim., vol. 38, no. 2, pp. 201â213, 2007.
[13] L. T. Biegler, Nonlinear Programming: Concepts, Algorithms, and Applications to Chemical Processes. Philadelphia: Siam, 2010.
[14] Y. Bard, Nonlinear Parameter Estimation. London.: Academic Press, 1974.
[15] A. D. Randolph and M. A. Larson, Theory of Particulate Processes: Analysis and Techniques of Continuous Crystallization, 2nd ed. San Diego, CA: Academic Press, 1988.
[16] Z. K. Nagy and R. D. Braatz, âDistributional uncertainty analysis using power series and polynomial chaos expansions,â J. Process Control, vol. 17, no. 3, pp. 229â240, 2007.
[17] D. Casas-Orozco et al., âPharmaPy: an object-oriented tool for the development of hybrid pharmaceutical flowsheets,â Comput. Chem. Eng., 2021 (submitted).