Jump to content

Category:GIOC

From mintOC

This category includes all generalized inverse optimal control problems. To formalize this problem class, we define the following bilevel problem with differential states x𝒳, controls u𝒰, model parameters pnp, and convex multipliers w𝒲

  min(p,w,x*,u*)Ω1h(x*,u*)η+R(p,w)
     subject to
     (x*,u*)argminx,uiM1wiϕi[x,u,p]
                subject to
                x˙(t)=iM2wifi(x(t),u(t),p)0wigi(x(t),u(t),p)iM3(x,u,p,w)Ω2

as a generalized inverse optimal control problem. Here 𝒳 and 𝒰 are properly defined function spaces. The variables w indicate which objective functions, right hand side functions, and constraints are relevant in the inner problem. The variables are normalized for given index sets M1,M2,M3 that partition the indices from 1 to nw. For normalization, we define the feasible set 𝒲:={w[0,1]nw:iMjwi=1 for j{1,2,3}}. On the outer level, the feasible set is Ω1:=np×𝒲×𝒳×𝒰, while on the inner level Ω2 contains bounds, boundary conditions, mixed path and control constraints, and more involved constraints such as dwell time constraints. We have observational data ηnη, a measurement function h:𝒳×𝒰nη, a regularization function with a priori knowledge on parameters and weights R:np×𝒲 and candidate functionals ϕi:𝒳×𝒰×np and functions fi:𝒳×𝒰×npnx and gi:𝒳×𝒰×npng for the unknown objective function, dynamics, and constraints, respectively. Two cases are of practical interest: first, the manual, often cumbersome and trial-and-error based a priori definition of all candidates ϕi,fi,gi by experts and second, a systematic, but often challenging automatic symbolic regression of these unknown functions.

On the outer level, a norm and the regularization term R define a data fit (regression) problem and relate to prior knowledge and statistical assumptions. On the inner level, the above bilevel problem is constrained by a possibly nonconvex optimal control problem. The unknown parts of this inner level optimal control problem are modeled as convex combinations of a finite set of candidates (and a multiplication of constraints gi with wi that can be either zero or strictly positive). On the one hand the problem formulation is restrictive in the interest of a clearer presentation and might be further generalized, e.g., to multi-stage formulations involving differential-algebraic or partial differential equations. On the other hand, the problem class is quite generic and allows, e.g., the consideration of switched systems, periodic processes, different underlying function spaces 𝒳 and 𝒰, and the usage of universal approximators such as neural networks as candidate functions.

Extensions

Note that a Lagrange term t0tfL(x(t),u(t),v(t),q,ρ)dt can be transformed into a Mayer-type objective functional.

Pages in category "GIOC"

This category contains only the following page.