Difference between revisions of "DOW Experimental Design"

From mintOC
Jump to: navigation, search
(Created page with "{{Dimensions |nd = 1 |nx = 11 |nw = 2 |nc = 4 |nre = 11 }} The '''DOW Experimental Design problem''' models the OED problem for the paramete...")
 
(Parameters)
 
(46 intermediate revisions by 2 users not shown)
Line 7: Line 7:
 
}}
 
}}
  
The '''DOW Experimental Design problem''' models the OED problem for the parameter estimation problem formulated by the DOW Chemical Co. in 1981. The following formulation is taken from "Nonlinear Parameter Estimation:
+
The '''DOW Experimental Design problem''' models the OED problem for the parameter estimation problem formulated by the DOW Chemical Co. in 1981. The following formulation is taken from [[#Biegler1986|[1]]].
a Case Study Comparison" by L. T. Biegler and J. J. Damiano.
+
  
The chemical species are disguised for proprietary reasons and the desired reaction is given by <math>HA + 2BM \rightarrow AB + HBMH</math>, where AB is the desired product. The reactions are described as follows:
+
== Chemical background ==
 +
 
 +
The chemical species are disguised for proprietary reasons and the desired reaction is given by <math>HA + 2BM \rightarrow AB + HBMH</math>, where <math>AB</math> is the desired product. The reactions are described as follows:
  
 
Slow Kinetic Reactions:
 
Slow Kinetic Reactions:
Line 35: Line 36:
 
In order to devise a model to account for these reactions, it is
 
In order to devise a model to account for these reactions, it is
 
first necessary to distinguish between the overall concentration of
 
first necessary to distinguish between the overall concentration of
a species and the concentration of its neutral form. Overall con-
+
a species and the concentration of its neutral form. Overall concentrations are defined for three components based on neutral
centrations are defined for three components based on neutral
+
 
and ionic species
 
and ionic species
 
<p>
 
<p>
 
<math>
 
<math>
  \begin{array}{c}  
+
  \begin{align}
   \left[HBMH\right] = \left[ (MBMH)_N \right] + \left[ MBM^- \right] \\
+
   \left[HBMH\right] &= \left[ (MBMH)_N \right] + \left[ MBM^- \right] \\
   \left[HA\right] = \left[ (HA)_N \right] + \left[ A^- \right] \\
+
   \left[HA\right] &= \left[ (HA)_N \right] + \left[ A^- \right] \\
   \left[HABM\right] = \left[ (HABM)_N \right] + \left[ ABM^- \right]
+
   \left[HABM\right] &= \left[ (HABM)_N \right] + \left[ ABM^- \right]
  \end{array}  
+
  \end{align}  
 
</math>
 
</math>
 
</p>
 
</p>
Here <math>[ \ ]</math> denotes the concentration of the species in <math>gmol/kg</math>.
+
Here <math>[ \ ]</math> denotes the concentration of the species in <math>\operatorname{gmol}/\operatorname{kg}</math>.
 
By assuming the rapid acid-base reactions are at equilibrium, the equilibrium constants <math>K_1,K_2,K_3</math> can be defined as
 
By assuming the rapid acid-base reactions are at equilibrium, the equilibrium constants <math>K_1,K_2,K_3</math> can be defined as
 
<p>
 
<p>
 
<math>
 
<math>
\begin{array}{c}
+
\begin{align}
  K_1 = \frac{[MBM^-][H^+]}{[(MBMH)_N]} \\
+
  K_1 &= \frac{[MBM^-][H^+]}{[(MBMH)_N]} \\
  K_1 = \frac{[A^-][H^+]}{[(HA)_N]} \\
+
  K_1 &= \frac{[A^-][H^+]}{[(HA)_N]} \\
  K_1 = \frac{[ABM^-][H^+]}{[(HABM)_N]}  
+
  K_1 &= \frac{[ABM^-][H^+]}{[(HABM)_N]}  
\end{array}
+
\end{align}
 
</math>
 
</math>
 
</p>
 
</p>
Line 62: Line 62:
 
<p>
 
<p>
 
<math>
 
<math>
\begin{array}{cr}
+
\begin{align}
  \left[MBM^-\right] = \frac{K_1[MBMH]}{K_1 + [H^+]} & \quad (a) \\
+
  \left[MBM^-\right] &= \frac{K_1[MBMH]}{K_1 + [H^+]} && \quad (a) \\
  \left[A^-\right] = \frac{K_2[HA]}{K_2 + [H^+]} & \quad (b)\\
+
  \left[A^-\right] &= \frac{K_2[HA]}{K_2 + [H^+]} && \quad (b)\\
  \left[ABM^-\right] = \frac{K_3[HABM]}{K_3 + [H^+]}& \quad (c)
+
  \left[ABM^-\right] &= \frac{K_3[HABM]}{K_3 + [H^+]} && \quad (c)
\end{array}
+
\end{align}
 
</math>
 
</math>
 
</p>
 
</p>
Line 73: Line 73:
 
<p>
 
<p>
 
<math>
 
<math>
\begin{array}{cr}
+
\begin{align}
  \frac{d[M^-]}{dt} = -k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right] - k_3 \left[ M^- \right]\left[ AB \right] + k_{-1} \left[ ABM^- \right] &  \quad (d)\\
+
  \frac{d[M^-]}{dt} &= -k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right] - k_3 \left[ M^- \right]\left[ AB \right] + k_{-1} \left[ ABM^- \right] &&  \quad (d)\\
  \frac{d[BM]}{dt} = -k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right] - k_2 \left[ A^- \right]\left[ BM \right] &  \quad (e)\\  
+
  \frac{d[BM]}{dt} &= -k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right] - k_2 \left[ A^- \right]\left[ BM \right] &&  \quad (e)\\  
  \frac{d[AB]}{dt} = -k_3 \left[ M^- \right] \left[ AB \right] + k_{-3} \left[ ABM^- \right]  &  \quad (f)  
+
  \frac{d[AB]}{dt} &= -k_3 \left[ M^- \right] \left[ AB \right] + k_{-3} \left[ ABM^- \right]  &&  \quad (f)  
\end{array}
+
\end{align}
 
</math>
 
</math>
 
</p>
 
</p>
Line 84: Line 84:
 
<p>
 
<p>
 
<math>
 
<math>
\begin{array}{cr}
+
\begin{align}
  \frac{d[MBMH]}{dt} = k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right]  &  \quad (g)\\
+
  \frac{d[MBMH]}{dt} &= k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right]  &&  \quad (g)\\
  \frac{d[HA]}{dt} = k_2 \left[ A^- \right] \left[ BM \right] &  \quad (h)\\
+
  \frac{d[HA]}{dt} &= k_2 \left[ A^- \right] \left[ BM \right] &&  \quad (h)\\
  \frac{d[HABM]}{dt} = k_2 \left[ A^- \right] \left[ BM \right] + k_3 \left[ M^- \right] \left[ AB \right] - k_{-3} \left[ ABM^- \right]&  \quad (i)
+
  \frac{d[HABM]}{dt} &= k_2 \left[ A^- \right] \left[ BM \right] + k_3 \left[ M^- \right] \left[ AB \right] - k_{-3} \left[ ABM^- \right] &&  \quad (i)
\end{array}
+
\end{align}
 
</math>
 
</math>
 
</p>
 
</p>
An electroneutrality constraint gives the hydrogen ion con-
+
An electroneutrality constraint gives the hydrogen ion concentration <math>\left[ H^+ \right]</math> as
centration <math>\left[ H^+ \right]</math> as
+
 
<p>
 
<p>
 
  <math>
 
  <math>
Line 111: Line 110:
 
  </math>
 
  </math>
 
</p>
 
</p>
Here <math>R</math> is the gas constant, <math>T</math> is reaction temperature in Kelvins and the parameters <math>\alpha_i,E_i</math> represent the pre-exponential factor and activation energy, respectively, for the appropriate rate constant.
+
Here <math>R</math> is the gas constant and <math>T</math> is reaction temperature in Kelvins. The parameter <math>\alpha_i</math>, given in <math>\operatorname{mol}/( \operatorname{kg} \cdot \operatorname{h})</math>, represent the pre-exponential factor and <math>E_i</math>, with unit <math>\operatorname{cal}/{\operatorname{mol}}</math>, is the activation energy.
  
 
== Mathematical formulation ==
 
== Mathematical formulation ==
Line 118: Line 117:
 
<p>
 
<p>
 
  <math>
 
  <math>
   \begin{array}{lr}
+
   \begin{align}
   \frac{d y_1}{dt} = -k_2 y_8 y_2 & \quad (1),(h) \\
+
   \dot{y}_1 &= -k_2 y_8 y_2 && \quad (1),(h) \\
   \frac{d y_2}{dt} = -k_1 y_6 y_2 + k_{-1} y_{10} - k_2 y_8 y_2 & \quad (2),(e) \\
+
   \dot{y}_2 &= -k_1 y_6 y_2 + k_{-1} y_{10} - k_2 y_8 y_2 && \quad (2),(e) \\
   \frac{d y_3}{dt} = -k_2 y_8 y_2 + k_1 y_6 y_4 - \frac{1}{2} k_{-1} y_9 & \quad (3),(i) \\
+
   \dot{y}_3 &= -k_2 y_8 y_2 + k_1 y_6 y_4 - \frac{1}{2} k_{-1} y_9 && \quad (3),(i) \\
   \frac{d y_4}{dt} = -k_1 y_6 y_4  + \frac{1}{2} k_{-1} y_9 & \quad (4),(f) \\
+
   \dot{y}_4 &= -k_1 y_6 y_4  + \frac{1}{2} k_{-1} y_9 && \quad (4),(f) \\
   \frac{d y_5}{dt} = -k_1 y_6 y_2 + k_{-1} y_{10} & \quad (5),(g) \\
+
   \dot{y}_5 &= -k_1 y_6 y_2 + k_{-1} y_{10} && \quad (5),(g) \\
   \frac{d y_6}{dt} = -k_1 (y_6 y_2 + y_6 y_4) + k_{-1} (y_{10} + \frac{1}{2} y_9) & \quad (6),(d) \\
+
   \dot{y}_6 &= -k_1 (y_6 y_2 + y_6 y_4) + k_{-1} (y_{10} + \frac{1}{2} y_9) && \quad (6),(d) \\
   y_7 = -\left[ Q^+ \right] + y_6 + y_8 + y_9 + y_{10} & \quad (7),(j)\\
+
   y_7 &= -\left[ Q^+ \right] + y_6 + y_8 + y_9 + y_{10} && \quad (7),(j)\\
   y_8 = \frac{\theta_8 y_1}{\theta_8 + y_7} & \quad (8),(b)\\
+
   y_8 &= \frac{\theta_8 y_1}{\theta_8 + y_7} && \quad (8),(b)\\
   y_9 = \frac{\theta_9 y_3}{\theta_9 + y_7} & \quad (9),(c)\\
+
   y_9 &= \frac{\theta_9 y_3}{\theta_9 + y_7} && \quad (9),(c)\\
   y_{10} = \frac{\theta_7 y_5}{\theta_7 + y_7} & \quad (10),(a)
+
   y_{10} &= \frac{\theta_7 y_5}{\theta_7 + y_7} && \quad (10),(a)
   \end{array}
+
   \end{align}
 
  </math>
 
  </math>
 
</p>
 
</p>
Here the letter stands for the corresponding chemical process.
+
Here the letters in parentheses stand for the corresponding chemical process and the quantity <math>\left[Q^+\right]</math> is a constant during the reaction.
 
The nine parameters form the vector  
 
The nine parameters form the vector  
 
<p>
 
<p>
Line 146: Line 145:
 
</p>
 
</p>
  
Let <math>f_k(\cdot)</math> denote the right hand side of equation <math>(k)</math> for <math>k=1,\ldots,6</math>. Moreover, let
+
Let <math>f_k(\cdot)</math> denote the right hand side of equation <math>(k)</math> for <math>k=1,\ldots,6</math>.  
 +
We reformulate the last four algebraic equations as differential ones:
 +
<p>
 +
<math>
 +
  \begin{align}
 +
  \dot{y}_7 &= f_7 = f_6 + f_8 + f_9 + f_{10} && \quad (7') \\
 +
  \dot{y}_8 &= f_8 = \frac{\theta_8 f_1 \cdot (\theta_8 + f_7) - \theta_8 y_1 f_7}{(\theta_8 + y_7)^2} && \quad (8') \\
 +
  \dot{y}_9 &= f_9 = \frac{\theta_9 f_3 \cdot (\theta_9 + f_7) - \theta_9 y_3 f_7}{(\theta_9 + y_7)^2}  && \quad (9')\\
 +
  \dot{y}_{10} &= f_{10} = \frac{\theta_7 f_5 \cdot (\theta_7 + f_7) - \theta_7 y_5 f_7}{(\theta_7 + y_7)^2} && \quad (10')
 +
  \end{align}
 +
</math>
 +
</p>
 +
The right hand sides of <math>(1)-(6)</math> and <math>(7')-(9')</math> are summarized as the vector-valued function <math>f</math>.
 +
Moreover, let
 +
<p>
 +
<math>
 +
  \begin{align}
 +
  f_{y,m,n}(\cdot) &= \frac{\partial f_m(\cdot)}{\partial y_n}, \quad m,n \in \{1,\ldots,10\} \\
 +
  f_{\theta,m,n}(\cdot) &= \frac{\partial f_m(\cdot)}{\partial \theta_n}, \quad m \in \{1,\ldots,10\}; \ n\in \{1,\ldots,9\}
 +
  \end{align}
 +
</math>
 +
</p>
 +
 
 +
== Parameters ==
 +
 
 +
The initial parameter estimates are:
 +
<table style="border-collapse: collapse;" border="1.">
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"> <math> \alpha_1 </math> </td>
 +
                <td style="text-align: center; padding:5pt"> <math>2.0 \times 10^{13}</math> </td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>E_1</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>2.0 \times 10^{4}</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>\alpha_2</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>2.0 \times 10^{13}</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>E_2</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>2.0 \times 10^{4}</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>\alpha_{-1}</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>4.3 \times 10^{15}</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>E_{-1}</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>2.0 \times 10^{4} </math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>K_1</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>1.0\cdot 10^{-17}</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>K_2</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>1.0\cdot 10^{-11}</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt"><math>K_3</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>1.0\cdot 10^{-17}</math></td>
 +
            </tr>
 +
    </table>
 +
 
 +
 
 +
There are three datasets for different temperatures <math>T</math>, with corresponding starting values
 +
<table style="border-collapse: collapse;" border="1.">
 +
            <tr style="border-bottom: 2pt solid black;">
 +
                <th style="border-right: 2pt solid black;"></th>
 +
                <th style="text-align: center; padding:5pt"> <math>40^\circ C </math></th>
 +
                <th style="text-align: center; padding:5pt"> <math>67^\circ C </math></th>
 +
                <th style="text-align: center; padding:5pt"> <math>100^\circ C </math></th>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt; border-right: 2pt solid black;"> <math> y_1(0) </math> </td>
 +
                <td style="text-align: center; padding:5pt"> <math> 1.7066 </math> </td>
 +
                <td style="text-align: center; padding:5pt"> <math> 1.6749 </math> </td>
 +
                <td style="text-align: center; padding:5pt"> <math> 1.5608 </math> </td>           
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt; border-right: 2pt solid black"><math>y_2(0)</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>8.32</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>8.2262</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>8.3546</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt; border-right: 2pt solid black"><math>y_3(0)</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>0.01</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>0.0104</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>0.0082</math></td>
 +
            </tr>
 +
            <tr>
 +
                <td style="text-align: center; padding:5pt; border-right: 2pt solid black"><math>y_4(0)</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>0</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>0.0017</math></td>
 +
                <td style="text-align: center; padding:5pt"><math>0.0086</math></td>
 +
            </tr>
 +
    </table>
 +
<!--
 +
<p>
 +
<math>
 +
  \begin{array}{lccc}
 +
  & 40^\circ C & 67^\circ C & 100^\circ C \\
 +
  y_1(0) = & 1.7066 & 1.6749 & 1.5608 \\
 +
  y_2(0) = & 8.32 & 8.2262 & 8.3546 \\
 +
  y_3(0) = & 0.01 & 0.0104 & 0.0082 \\
 +
  y_4 (0) = & 0 & 0.0017 & 0.0086
 +
  \end{array}
 +
</math>
 +
</p>
 +
-->
 +
The initial model conditions in addition to those given in the
 +
data sets are:
 
<p>
 
<p>
 
  <math>
 
  <math>
 
   \begin{array}{l}
 
   \begin{array}{l}
  f_{ymn}(\cdot) = \frac{\partial f_m(\cdot)}{\partial y_n}, \quad m,n \in \{1,\ldots,6\} \\
+
  y_5 = 0 \\
  f_{\theta mn}(\cdot) = \frac{\partial f_m(\cdot)}{\partial y_n}
+
  y_6 = 0.0131 \\
 +
  y_7 = \frac{1}{2} \cdot \left( -K_2 + \sqrt{K_2^2 + 4K_2 y_1(0)} \right) \\
 +
  y_8 = y_7 \\
 +
  y_9 = 0 \\
 +
  y_{10} = 0
 
   \end{array}
 
   \end{array}
 
  </math>
 
  </math>
</p>  
+
</p>
The non-zero derivatives are given by
+
 
 +
== Optimal Experimental Design Problem ==
 +
To be specified.
 +
 
 +
<!--
 +
To simplify the notation, the derivatives of the <math>k_i</math> with respect to the <math>\alpha_i</math> and <math>E_i</math> are written as
 +
<p>
 +
<math>
 +
  \begin{align}
 +
  k_{i,\alpha} &:= \frac{\partial k_i}{\partial \alpha_i} = \exp(-E_i/(RT)) \\
 +
  k_{i,E} &:= \frac{\partial k_i}{\partial E_i} = -\frac{\alpha_i}{RT} \exp(-E_i/(RT))
 +
  \end{align}
 +
</math>
 +
</p>
 +
The non-zero derivatives for the first six components are given by
 
<p>
 
<p>
 
<math>
 
<math>
  \begin{array}{l}
+
  \begin{array}{lcl}
  f_{y12} = -k_2 y_8 \\
+
  f_{y,1,2} = -k_2 y_8 & & \\
  f_{y18} = -k_2 y_2 \\
+
  f_{y,1,8} = -k_2 y_2 & & \\
  f_{y22} = -k_1 y_6 - k_2 y_8 \\
+
  f_{y,2,2} = -k_1 y_6 - k_2 y_8 & & \\
  f_{y26} = -k_1 y_2 \\
+
  f_{y,2,6} = -k_1 y_2 & & \\
  f_{y28} = -k_2 y_2 \\
+
  f_{y,2,8} = -k_2 y_2 & & \\
  f_{y32} = -k_2 y_8 \\
+
  f_{y,2,10} = k_{-1} & & \\
  f_{y34} = k_1 y_6 \\
+
f_{y,3,2} = -k_2 y_8 & & \\
  f_{y36} = k_1 y_4 \\
+
  f_{y,3,4} = k_1 y_6 & & \\
  f_{y44} = -k_1 y_6 \\
+
  f_{y,3,6} = k_1 y_4 & & \\
  f_{y46} = -k_1 y_4 \\
+
  f_{y,3,8} = -k_2 y_2 & & \\
  f_{52} = -k_1 y_6 \\
+
f_{y,3,9} = -\frac{1}{2} k_{-1} & & \\
  f_{y56} = -k_1 y_2 \\
+
f_{y,4,4} = -k_1 y_6 & & \\
  f_{y62} = -k_1 y_6 \\
+
  f_{y,4,6} = -k_1 y_4 & & \\
  f_{y64} = -k_1 y_6 \\
+
  f_{y,4,9} = \frac{1}{2} k_{-1} & & \\
  f_{y66} = -k_1 (y_2 + y_4)
+
f_{y,5,2} = -k_1 y_6 & & \\
 +
  f_{y,5,6} = -k_1 y_2 & & \\
 +
  f_{y,5,10} = k_{-1} & & \\
 +
f_{y,6,2} = -k_1 y_6 & & \\
 +
  f_{y,6,4} = -k_1 y_6 & & \\
 +
  f_{y,6,6} = -k_1 (y_2 + y_4) & & \\
 +
f_{y,6,9} = \frac{1}{2} k_{-1} & & \\
 +
f_{y,6,10} = k_{-1} & &
 
  \end{array}
 
  \end{array}
 
</math>
 
</math>
 +
</p>
 +
-->
 +
We are interested in when to measure (with an upper bound <math>M_i</math> on the measuring time for each observable).
 +
We define
 +
<p>
 +
<math>
 +
\begin{align}
 +
  f_y(\cdot) &\in \mathbb{R}^{10 \times 10} \quad &&\text{ with } (f_y)_{i,j} = f_{y,i,j}, \\
 +
  f_\theta(\cdot) &\in \mathbb{R}^{10 \times 9} \quad &&\text{ with } (f_\theta)_{i,j} = f_{\theta,i,j}
 +
\end{align}
 +
</math>
 +
</p>
 +
In this approach, we add the so-called sensitivities <math>G=dy/d\theta</math>. For the differential equations this means
 +
<p>
 +
<math>
 +
  \dot{G}(t) = f_y(y(t),\theta) G(t) + f_\theta(y(t),\theta), \quad G(0) = \frac{\partial y(0)}{\partial \theta}
 +
</math>
 
</p>
 
</p>
  
 +
Now we formulate the OED problem as described in [[#OEDUDE | [2]]].
 +
<p>
 +
<math>
 +
\begin{array}{lll}
 +
\displaystyle \min_{y,G,F,z,w} && \text{trace} \; \left( F^{-1}(t_f) \right) \\
 +
\text{subject to} \\
 +
\quad \dot{y}(t) & = & f(y(t),\theta) \\
 +
\quad \dot{G}(t) & = & f_y(y(t),\theta) G(t) + f_\theta(y(t),\theta) \\
 +
\quad \dot{F}(t) & = & \sum_{i=1}^{n_o} w_i(t)(h^i_y(y(t))G(t))^T(h^i_y(y(t))G(t)) \\
 +
\quad \dot{z}(t) & = & w(t), \\
 +
\quad y(0) & = & y_0 \\
 +
\quad G(0) & = & \frac{\partial y(0)}{\partial \theta} \\
 +
\quad F(0) & = & 0, \\
 +
\quad z(0) & = & 0 \\
 +
\quad w(t) & \in & \mathcal{W} \\
 +
\quad z_i(t_f) & \leq & M_i
 +
  \end{array}
 +
</math>
 +
</p>
  
We are interested in when to measure, with an upper bound <math>M</math> on the measuring time. We can measure the states directly, <math>h^1(x(t)) = x_1(t)</math> and <math>h^2(x(t)) = x_2(t)</math>. We use two different sampling functions, <math>w^1(\cdot)</math> and <math>w^2(\cdot)</math> in the same experimental setting. This can be seen either as a two-dimensional measurement function <math>h(x(t))</math>, or as a special case of a multiple experiment, in which <math>u(\cdot), x(\cdot)</math>, and <math>G(\cdot)</math> are identical. The experimental design problem then reads
+
Here <math>h:\mathbb{R}^{10} \rightarrow \mathbb{R}^{n_o}</math> is the observed function. The
 
+
evolution of the symmetric matrix <math>F: \left[0,t_f \right] \rightarrow \mathbb{R}^{9 \times 9}</math> is given by the weighted sum of observability Gramians
== Parameters ==
+
<math>h^i_y (y(t)) G(t), \ i = 1,\ldots,n_o</math> for each observed function of states. The weights <math>w_i (t) \in \{0, 1\}, \ i = 1,\ldots,n_o</math>
 
+
are the (binary) sampling decisions, where <math> w_i (t) = 1 </math> denotes the decision to perform a measurement at
We use <math>t_f=12</math>, <math>p_1 = p_2 = p_3 = p_4 = 1</math>, and <math>p_5 = 0.4</math>, <math>p_6 = 0.2</math>. The upper bound on the measurement time intervals is chosen as <math>M=4</math>.
+
time <math>t</math>.
  
 
== Miscellaneous and Further Reading ==
 
== Miscellaneous and Further Reading ==
The Lotka Volterra fishing problem was introduced by Sebastian Sager in a proceedings paper <bib id="Sager2006" /> and revisited in his PhD thesis <bib id="Sager2005" />. These are also the references to look for more details. The experimental design problem has been described in the habilitation thesis of Sager, <bib id="Sager2011d" />.
+
To be specified.
  
 
== References ==
 
== References ==
<biblist />
+
<span id="Biegler1986">[1]</span> "Nonlinear Parameter Estimation: a Case Study Comparison" by L. T. Biegler and J. J. Damiano  <br>
 +
<span id="OEDUDE">[2]</span> "Optimal Experimental Design for Universal Differential Equations" by C. Plate, C.J. Martensen and S. Sager <br>
 +
<span id="Stortelder1998">[3]</span> "Parameter estimation in nonlinear systems" by W.J.H. Stortelder <br>
  
 
<!--List of all categories this page is part of. List characterization of solution behavior, model properties, ore presence of implementation details (e.g., AMPL for AMPL model) here -->
 
<!--List of all categories this page is part of. List characterization of solution behavior, model properties, ore presence of implementation details (e.g., AMPL for AMPL model) here -->

Latest revision as of 09:56, 13 November 2024

DOW Experimental Design
State dimension: 1
Differential states: 11
Discrete control functions: 2
Path constraints: 4
Interior point equalities: 11


The DOW Experimental Design problem models the OED problem for the parameter estimation problem formulated by the DOW Chemical Co. in 1981. The following formulation is taken from [1].

Chemical background

The chemical species are disguised for proprietary reasons and the desired reaction is given by HA + 2BM \rightarrow AB + HBMH, where AB is the desired product. The reactions are described as follows:

Slow Kinetic Reactions:


\begin{array}{c}
 M^- + BM \underset{k_{-1}}{\leftarrow} \overset{k_1}{\rightarrow} MBM^- \\
 A^- + BM \overset{k_2}{\rightarrow} ABM^- \\
 M^- + AB \underset{k_{-3}}{\leftarrow} \overset{k_3}{\rightarrow} ABM^- 
\end{array}

Acid-Base Reactions:


\begin{array}{c}
 MBMH \leftarrow K_1 \rightarrow MBM^- + H^+ \\
 HA \leftarrow K_2 \rightarrow A^- + H^+ \\
 HABM \leftarrow K_2 \rightarrow ABM^- + H^+ 
 \end{array}

In order to devise a model to account for these reactions, it is first necessary to distinguish between the overall concentration of a species and the concentration of its neutral form. Overall concentrations are defined for three components based on neutral and ionic species


 \begin{align}
  \left[HBMH\right] &= \left[ (MBMH)_N \right] + \left[ MBM^- \right] \\
  \left[HA\right] &= \left[ (HA)_N \right] + \left[ A^- \right] \\
  \left[HABM\right] &= \left[ (HABM)_N \right] + \left[ ABM^- \right]
 \end{align}

Here [ \ ] denotes the concentration of the species in \operatorname{gmol}/\operatorname{kg}. By assuming the rapid acid-base reactions are at equilibrium, the equilibrium constants K_1,K_2,K_3 can be defined as


\begin{align}
 K_1 &= \frac{[MBM^-][H^+]}{[(MBMH)_N]} \\
 K_1 &= \frac{[A^-][H^+]}{[(HA)_N]} \\
 K_1 &= \frac{[ABM^-][H^+]}{[(HABM)_N]} 
\end{align}

The anionic species may then be represented by


\begin{align}
 \left[MBM^-\right] &= \frac{K_1[MBMH]}{K_1 + [H^+]} && \quad (a) \\
 \left[A^-\right] &= \frac{K_2[HA]}{K_2 + [H^+]} && \quad (b)\\
 \left[ABM^-\right] &= \frac{K_3[HABM]}{K_3 + [H^+]} && \quad (c)
\end{align}

Material balance equations for the three reactants in the slow kinetic reactions yield:


\begin{align}
 \frac{d[M^-]}{dt} &= -k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right] - k_3 \left[ M^- \right]\left[ AB \right] + k_{-1} \left[ ABM^- \right] &&  \quad (d)\\
 \frac{d[BM]}{dt} &= -k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right] - k_2 \left[ A^- \right]\left[ BM \right] &&  \quad (e)\\ 
 \frac{d[AB]}{dt} &= -k_3 \left[ M^- \right] \left[ AB \right] + k_{-3} \left[ ABM^- \right]  &&  \quad (f) 
\end{align}

From stoichiometry, rate expressions can also be written for the total species:


\begin{align}
 \frac{d[MBMH]}{dt} &= k_1 \left[ M^- \right] \left[ BM \right] + k_{-1} \left[ MBM^- \right]  &&  \quad (g)\\
 \frac{d[HA]}{dt} &= k_2 \left[ A^- \right] \left[ BM \right] &&  \quad (h)\\
 \frac{d[HABM]}{dt} &= k_2 \left[ A^- \right] \left[ BM \right] + k_3 \left[ M^- \right] \left[ AB \right] - k_{-3} \left[ ABM^- \right] &&  \quad (i)
\end{align}

An electroneutrality constraint gives the hydrogen ion concentration \left[ H^+ \right] as


  \left[ H^+ \right] + \left[ Q^+ \right] = \left[ M^- \right] + \left[ MBM^- \right] + \left[ A^- \right] + \left[ ABM^- \right] \quad \quad (j)

Based on similarities of reacting species, we assume

 k_3 = k_1, \quad k_{-3} = \frac{1}{2} k_{-1}

With these assumptions, the three rate constants k_1,k_2 and k_3 must be estimated. Each of these can be fitted with two adjustable model parameters, assuming an Arrhenius temperature dependence. That is

 
 \begin{array}{c}
  k_i = \alpha_i \exp(-E_i /(RT)), \quad i \in \{-1,1,2\}
 \end{array}

Here R is the gas constant and T is reaction temperature in Kelvins. The parameter \alpha_i, given in \operatorname{mol}/( \operatorname{kg} \cdot \operatorname{h}), represent the pre-exponential factor and E_i, with unit \operatorname{cal}/{\operatorname{mol}}, is the activation energy.

Mathematical formulation

The chemical processes (a)-(j) can be expressed mathematically as six differential equations and four algebraic equations:


  \begin{align}
  \dot{y}_1 &= -k_2 y_8 y_2 && \quad (1),(h) \\
  \dot{y}_2 &= -k_1 y_6 y_2 + k_{-1} y_{10} - k_2 y_8 y_2 && \quad (2),(e) \\
  \dot{y}_3 &= -k_2 y_8 y_2 + k_1 y_6 y_4 - \frac{1}{2} k_{-1} y_9 && \quad (3),(i) \\
  \dot{y}_4 &= -k_1 y_6 y_4  + \frac{1}{2} k_{-1} y_9 && \quad (4),(f) \\
  \dot{y}_5 &= -k_1 y_6 y_2 + k_{-1} y_{10} && \quad (5),(g) \\
  \dot{y}_6 &= -k_1 (y_6 y_2 + y_6 y_4) + k_{-1} (y_{10} + \frac{1}{2} y_9) && \quad (6),(d) \\
  y_7 &= -\left[ Q^+ \right] + y_6 + y_8 + y_9 + y_{10} && \quad (7),(j)\\
  y_8 &= \frac{\theta_8 y_1}{\theta_8 + y_7} && \quad (8),(b)\\
  y_9 &= \frac{\theta_9 y_3}{\theta_9 + y_7} && \quad (9),(c)\\
  y_{10} &= \frac{\theta_7 y_5}{\theta_7 + y_7} && \quad (10),(a)
  \end{align}

Here the letters in parentheses stand for the corresponding chemical process and the quantity \left[Q^+\right] is a constant during the reaction. The nine parameters form the vector

 
 \theta = (\alpha_1,E_1,\alpha_2,E_2,\alpha_{-1},E_{-1},K_1,K_2,K_3)

The predicted concentrations form the vector

 
 y = (HA,BM,HABM,AB,MBMH,M^-,H^+,A^-,ABM^-,MBM^-)

Let f_k(\cdot) denote the right hand side of equation (k) for k=1,\ldots,6. We reformulate the last four algebraic equations as differential ones:


  \begin{align}
   \dot{y}_7 &= f_7 = f_6 + f_8 + f_9 + f_{10} && \quad (7') \\
   \dot{y}_8 &= f_8 = \frac{\theta_8 f_1 \cdot (\theta_8 + f_7) - \theta_8 y_1 f_7}{(\theta_8 + y_7)^2} && \quad (8') \\
   \dot{y}_9 &= f_9 = \frac{\theta_9 f_3 \cdot (\theta_9 + f_7) - \theta_9 y_3 f_7}{(\theta_9 + y_7)^2}  && \quad (9')\\
   \dot{y}_{10} &= f_{10} = \frac{\theta_7 f_5 \cdot (\theta_7 + f_7) - \theta_7 y_5 f_7}{(\theta_7 + y_7)^2} && \quad (10') 
  \end{align}

The right hand sides of (1)-(6) and (7')-(9') are summarized as the vector-valued function f. Moreover, let


  \begin{align}
  f_{y,m,n}(\cdot) &= \frac{\partial f_m(\cdot)}{\partial y_n}, \quad m,n \in \{1,\ldots,10\} \\
  f_{\theta,m,n}(\cdot) &= \frac{\partial f_m(\cdot)}{\partial \theta_n}, \quad m \in \{1,\ldots,10\}; \ n\in \{1,\ldots,9\}
  \end{align}

Parameters

The initial parameter estimates are:

 \alpha_1 2.0 \times 10^{13}
E_1 2.0 \times 10^{4}
\alpha_2 2.0 \times 10^{13}
E_2 2.0 \times 10^{4}
\alpha_{-1} 4.3 \times 10^{15}
E_{-1} 2.0 \times 10^{4}
K_1 1.0\cdot 10^{-17}
K_2 1.0\cdot 10^{-11}
K_3 1.0\cdot 10^{-17}


There are three datasets for different temperatures T, with corresponding starting values

40^\circ C 67^\circ C 100^\circ C
 y_1(0)  1.7066  1.6749  1.5608
y_2(0) 8.32 8.2262 8.3546
y_3(0) 0.01 0.0104 0.0082
y_4(0) 0 0.0017 0.0086

The initial model conditions in addition to those given in the data sets are:


  \begin{array}{l}
   y_5 = 0 \\
   y_6 = 0.0131 \\
   y_7 = \frac{1}{2} \cdot \left( -K_2 + \sqrt{K_2^2 + 4K_2 y_1(0)} \right) \\
   y_8 = y_7 \\
   y_9 = 0 \\
   y_{10} = 0
  \end{array}

Optimal Experimental Design Problem

To be specified.

We are interested in when to measure (with an upper bound M_i on the measuring time for each observable). We define


\begin{align}
  f_y(\cdot) &\in \mathbb{R}^{10 \times 10} \quad &&\text{ with } (f_y)_{i,j} = f_{y,i,j}, \\
  f_\theta(\cdot) &\in \mathbb{R}^{10 \times 9} \quad &&\text{ with } (f_\theta)_{i,j} = f_{\theta,i,j}
 \end{align}

In this approach, we add the so-called sensitivities G=dy/d\theta. For the differential equations this means


  \dot{G}(t) = f_y(y(t),\theta) G(t) + f_\theta(y(t),\theta), \quad G(0) = \frac{\partial y(0)}{\partial \theta}

Now we formulate the OED problem as described in [2].


 \begin{array}{lll}
 \displaystyle \min_{y,G,F,z,w} && \text{trace} \; \left( F^{-1}(t_f) \right) \\
 \text{subject to} \\
\quad \dot{y}(t) & = & f(y(t),\theta) \\
\quad \dot{G}(t) & = & f_y(y(t),\theta) G(t) + f_\theta(y(t),\theta) \\
\quad \dot{F}(t) & = & \sum_{i=1}^{n_o} w_i(t)(h^i_y(y(t))G(t))^T(h^i_y(y(t))G(t)) \\
\quad \dot{z}(t) & = & w(t), \\
\quad y(0) & = & y_0 \\
\quad G(0) & = & \frac{\partial y(0)}{\partial \theta} \\
\quad F(0) & = & 0, \\ 
\quad z(0) & = & 0 \\
\quad w(t) & \in & \mathcal{W} \\
\quad z_i(t_f) & \leq & M_i
  \end{array}

Here h:\mathbb{R}^{10} \rightarrow \mathbb{R}^{n_o} is the observed function. The evolution of the symmetric matrix F: \left[0,t_f \right] \rightarrow \mathbb{R}^{9 \times 9} is given by the weighted sum of observability Gramians h^i_y (y(t)) G(t), \ i = 1,\ldots,n_o for each observed function of states. The weights w_i (t) \in \{0, 1\}, \ i = 1,\ldots,n_o are the (binary) sampling decisions, where  w_i (t) = 1 denotes the decision to perform a measurement at time t.

Miscellaneous and Further Reading

To be specified.

References

[1] "Nonlinear Parameter Estimation: a Case Study Comparison" by L. T. Biegler and J. J. Damiano
[2] "Optimal Experimental Design for Universal Differential Equations" by C. Plate, C.J. Martensen and S. Sager
[3] "Parameter estimation in nonlinear systems" by W.J.H. Stortelder