Boundary Control and Parameter Estimation
Overview ¶ The previous lectures focused mostly on distributed controls, where the
control acts as a volume source term. This lecture moves to two situations
where the same optimal-control structure remains visible, but the functional
analytic details become more delicate:
The main new point is that the control-to-state map no longer has the same
regularity as in the distributed case. Boundary controls interact with trace
operators, while parameter controls enter the differential operator itself.
Both effects change the correct space in which gradients and optimality
conditions should be interpreted.
The logical path is:
formulate Neumann boundary control in weak form;
prove well-posedness by the trace theorem and Lax-Milgram;
derive the adjoint equation and the reduced gradient using a Lagrangian;
discuss Dirichlet boundary control through liftings;
formulate coefficient identification as a PDE-constrained inverse problem;
prove the sensitivity equation for the coefficient-to-state map;
derive the adjoint gradient for parameter estimation from a Lagrangian;
interpret box constraints as variational inequalities and projection
formulas when enough regularity is available.
Throughout the lecture, let Ω ⊂ R d \Omega\subset\mathbb R^d Ω ⊂ R d be a bounded Lipschitz
domain, and let Γ : = ∂ Ω \Gamma:=\partial\Omega Γ := ∂ Ω .
Boundary Controls and Trace Spaces ¶ For a distributed control, the state equation typically has the form
A y = B u + f in Ω , \mathcal A y = Bu+f
\qquad\text{in }\Omega, A y = B u + f in Ω , where B B B maps the control space into a volume dual space such as
H − 1 ( Ω ) H^{-1}(\Omega) H − 1 ( Ω ) .
For a boundary control, the action of u u u is instead mediated by the trace
operator. The fundamental mapping is
γ : H 1 ( Ω ) → H 1 / 2 ( Γ ) , \gamma:H^1(\Omega)\to H^{1/2}(\Gamma), γ : H 1 ( Ω ) → H 1/2 ( Γ ) , which is continuous. Since H 1 / 2 ( Γ ) H^{1/2}(\Gamma) H 1/2 ( Γ ) embeds continuously into
L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) on bounded Lipschitz boundaries, there is a constant
C t r > 0 C_{\mathrm{tr}}>0 C tr > 0 such that
∥ γ v ∥ L 2 ( Γ ) ≤ C t r ∥ v ∥ H 1 ( Ω ) ∀ v ∈ H 1 ( Ω ) . \|\gamma v\|_{L^2(\Gamma)}
\le C_{\mathrm{tr}}\|v\|_{H^1(\Omega)}
\qquad \forall v\in H^1(\Omega). ∥ γ v ∥ L 2 ( Γ ) ≤ C tr ∥ v ∥ H 1 ( Ω ) ∀ v ∈ H 1 ( Ω ) . This estimate is the key reason why Neumann data in L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) define a
bounded linear functional on H 1 ( Ω ) H^1(\Omega) H 1 ( Ω ) :
∣ ∫ Γ u γ v d s ∣ ≤ ∥ u ∥ L 2 ( Γ ) ∥ γ v ∥ L 2 ( Γ ) ≤ C t r ∥ u ∥ L 2 ( Γ ) ∥ v ∥ H 1 ( Ω ) . \left|\int_\Gamma u\,\gamma v\,ds\right|
\le
\|u\|_{L^2(\Gamma)}\|\gamma v\|_{L^2(\Gamma)}
\le
C_{\mathrm{tr}}\|u\|_{L^2(\Gamma)}\|v\|_{H^1(\Omega)}. ∣ ∣ ∫ Γ u γ v d s ∣ ∣ ≤ ∥ u ∥ L 2 ( Γ ) ∥ γ v ∥ L 2 ( Γ ) ≤ C tr ∥ u ∥ L 2 ( Γ ) ∥ v ∥ H 1 ( Ω ) . Thus Neumann boundary control is naturally compatible with weak formulations.
Dirichlet boundary control is subtler, because prescribing y = u y=u y = u on Γ \Gamma Γ
requires u u u to be a trace of an H 1 ( Ω ) H^1(\Omega) H 1 ( Ω ) function, hence
u ∈ H 1 / 2 ( Γ ) u\in H^{1/2}(\Gamma) u ∈ H 1/2 ( Γ ) is the natural energy space.
A Coercive Neumann Boundary Control Model ¶ To avoid the compatibility condition of the pure Neumann Laplacian, we first
consider the coercive elliptic equation
{ − Δ y + σ y = f in Ω , ∂ n y = u on Γ , \begin{cases}
-\Delta y+\sigma y=f & \text{in }\Omega,\\
\partial_n y=u & \text{on }\Gamma,
\end{cases} { − Δ y + σ y = f ∂ n y = u in Ω , on Γ , where σ > 0 \sigma>0 σ > 0 . The pure Neumann case σ = 0 \sigma=0 σ = 0 is discussed below.
Let
V : = H 1 ( Ω ) , U : = L 2 ( Γ ) , V:=H^1(\Omega),
\qquad
U:=L^2(\Gamma), V := H 1 ( Ω ) , U := L 2 ( Γ ) , and define
a ( y , v ) : = ∫ Ω ∇ y ⋅ ∇ v d x + σ ∫ Ω y v d x , a(y,v):=
\int_\Omega \nabla y\cdot\nabla v\,dx
+
\sigma\int_\Omega yv\,dx, a ( y , v ) := ∫ Ω ∇ y ⋅ ∇ v d x + σ ∫ Ω y v d x , F ( v ) : = ∫ Ω f v d x , B ( u , v ) : = ∫ Γ u γ v d s . F(v):=\int_\Omega f v\,dx,
\qquad
B(u,v):=\int_\Gamma u\,\gamma v\,ds. F ( v ) := ∫ Ω f v d x , B ( u , v ) := ∫ Γ u γ v d s . The weak state equation is:
find y ∈ V y\in V y ∈ V such that
a ( y , v ) = F ( v ) + B ( u , v ) ∀ v ∈ V . a(y,v)=F(v)+B(u,v)
\qquad \forall v\in V. a ( y , v ) = F ( v ) + B ( u , v ) ∀ v ∈ V . For a desired state y d ∈ L 2 ( Ω ) y_d\in L^2(\Omega) y d ∈ L 2 ( Ω ) and a regularization parameter
α > 0 \alpha>0 α > 0 , the boundary control problem is
min u ∈ U a d J ( y , u ) : = 1 2 ∥ y − y d ∥ L 2 ( Ω ) 2 + α 2 ∥ u ∥ L 2 ( Γ ) 2 \min_{u\in U_{\mathrm{ad}}}
J(y,u)
:=
\frac12\|y-y_d\|_{L^2(\Omega)}^2
+
\frac\alpha2\|u\|_{L^2(\Gamma)}^2 u ∈ U ad min J ( y , u ) := 2 1 ∥ y − y d ∥ L 2 ( Ω ) 2 + 2 α ∥ u ∥ L 2 ( Γ ) 2 subject to the weak state equation.
The admissible set is assumed to be nonempty, closed, and convex in
L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) . A standard example is the box-constrained set
U a d : = { u ∈ L 2 ( Γ ) : u a ≤ u ≤ u b a.e. on Γ } , U_{\mathrm{ad}}
:=
\{u\in L^2(\Gamma): u_a\le u\le u_b \text{ a.e. on }\Gamma\}, U ad := { u ∈ L 2 ( Γ ) : u a ≤ u ≤ u b a.e. on Γ } , where u a , u b ∈ L ∞ ( Γ ) u_a,u_b\in L^\infty(\Gamma) u a , u b ∈ L ∞ ( Γ ) and u a ≤ u b u_a\le u_b u a ≤ u b a.e.
Well-Posedness of the Neumann State Equation ¶ The bilinear form a a a is continuous on V × V V\times V V × V because
∣ a ( y , v ) ∣ ≤ ∥ ∇ y ∥ L 2 ( Ω ) ∥ ∇ v ∥ L 2 ( Ω ) + σ ∥ y ∥ L 2 ( Ω ) ∥ v ∥ L 2 ( Ω ) ≤ C ∥ y ∥ H 1 ( Ω ) ∥ v ∥ H 1 ( Ω ) . |a(y,v)|
\le
\|\nabla y\|_{L^2(\Omega)}\|\nabla v\|_{L^2(\Omega)}
+
\sigma\|y\|_{L^2(\Omega)}\|v\|_{L^2(\Omega)}
\le
C\|y\|_{H^1(\Omega)}\|v\|_{H^1(\Omega)}. ∣ a ( y , v ) ∣ ≤ ∥∇ y ∥ L 2 ( Ω ) ∥∇ v ∥ L 2 ( Ω ) + σ ∥ y ∥ L 2 ( Ω ) ∥ v ∥ L 2 ( Ω ) ≤ C ∥ y ∥ H 1 ( Ω ) ∥ v ∥ H 1 ( Ω ) . It is also coercive on H 1 ( Ω ) H^1(\Omega) H 1 ( Ω ) :
a ( v , v ) = ∥ ∇ v ∥ L 2 ( Ω ) 2 + σ ∥ v ∥ L 2 ( Ω ) 2 ≥ min { 1 , σ } ∥ v ∥ H 1 ( Ω ) 2 . a(v,v)
=
\|\nabla v\|_{L^2(\Omega)}^2
+
\sigma\|v\|_{L^2(\Omega)}^2
\ge
\min\{1,\sigma\}\|v\|_{H^1(\Omega)}^2. a ( v , v ) = ∥∇ v ∥ L 2 ( Ω ) 2 + σ ∥ v ∥ L 2 ( Ω ) 2 ≥ min { 1 , σ } ∥ v ∥ H 1 ( Ω ) 2 . The right-hand side is bounded on V V V . Indeed,
∣ F ( v ) ∣ ≤ ∥ f ∥ L 2 ( Ω ) ∥ v ∥ L 2 ( Ω ) ≤ ∥ f ∥ L 2 ( Ω ) ∥ v ∥ H 1 ( Ω ) , |F(v)|
\le
\|f\|_{L^2(\Omega)}\|v\|_{L^2(\Omega)}
\le
\|f\|_{L^2(\Omega)}\|v\|_{H^1(\Omega)}, ∣ F ( v ) ∣ ≤ ∥ f ∥ L 2 ( Ω ) ∥ v ∥ L 2 ( Ω ) ≤ ∥ f ∥ L 2 ( Ω ) ∥ v ∥ H 1 ( Ω ) , and the trace estimate gives
∣ B ( u , v ) ∣ ≤ C t r ∥ u ∥ L 2 ( Γ ) ∥ v ∥ H 1 ( Ω ) . |B(u,v)|
\le
C_{\mathrm{tr}}\|u\|_{L^2(\Gamma)}\|v\|_{H^1(\Omega)}. ∣ B ( u , v ) ∣ ≤ C tr ∥ u ∥ L 2 ( Γ ) ∥ v ∥ H 1 ( Ω ) . By Lax-Milgram, for every u ∈ L 2 ( Γ ) u\in L^2(\Gamma) u ∈ L 2 ( Γ ) there exists a unique state
y ( u ) ∈ H 1 ( Ω ) y(u)\in H^1(\Omega) y ( u ) ∈ H 1 ( Ω ) and
∥ y ( u ) ∥ H 1 ( Ω ) ≤ C ( ∥ f ∥ L 2 ( Ω ) + ∥ u ∥ L 2 ( Γ ) ) . \|y(u)\|_{H^1(\Omega)}
\le
C\left(
\|f\|_{L^2(\Omega)}
+
\|u\|_{L^2(\Gamma)}
\right). ∥ y ( u ) ∥ H 1 ( Ω ) ≤ C ( ∥ f ∥ L 2 ( Ω ) + ∥ u ∥ L 2 ( Γ ) ) . Moreover, the control-to-state map
S : L 2 ( Γ ) → H 1 ( Ω ) , S ( u ) = y ( u ) , S:L^2(\Gamma)\to H^1(\Omega),
\qquad
S(u)=y(u), S : L 2 ( Γ ) → H 1 ( Ω ) , S ( u ) = y ( u ) , is affine and continuous. If f f f is fixed, its derivative is the linear map
S ′ ( u ) h = z S'(u)h=z S ′ ( u ) h = z , where z ∈ H 1 ( Ω ) z\in H^1(\Omega) z ∈ H 1 ( Ω ) solves
a ( z , v ) = B ( h , v ) ∀ v ∈ H 1 ( Ω ) . a(z,v)=B(h,v)
\qquad \forall v\in H^1(\Omega). a ( z , v ) = B ( h , v ) ∀ v ∈ H 1 ( Ω ) . For the pure Neumann equation σ = 0 \sigma=0 σ = 0 , constants belong to the kernel of
the operator. One either works in the quotient space H 1 ( Ω ) / R H^1(\Omega)/\mathbb R H 1 ( Ω ) / R ,
or fixes the representative by imposing zero mean:
V 0 : = { v ∈ H 1 ( Ω ) : ∫ Ω v d x = 0 } . V_0:=
\left\{
v\in H^1(\Omega):
\int_\Omega v\,dx=0
\right\}. V 0 := { v ∈ H 1 ( Ω ) : ∫ Ω v d x = 0 } . On V 0 V_0 V 0 , Poincare’s inequality gives coercivity of
a ( v , v ) = ∥ ∇ v ∥ L 2 ( Ω ) 2 a(v,v)=\|\nabla v\|_{L^2(\Omega)}^2 a ( v , v ) = ∥∇ v ∥ L 2 ( Ω ) 2 . If the equation is required to hold
for all test functions in H 1 ( Ω ) H^1(\Omega) H 1 ( Ω ) , the data must also satisfy the
compatibility condition
∫ Ω f d x + ∫ Γ u d s = 0. \int_\Omega f\,dx+\int_\Gamma u\,ds=0. ∫ Ω f d x + ∫ Γ u d s = 0. This is why adding a positive reaction term is often the cleaner model for
unconstrained Neumann boundary control.
Reduced Functional and Existence of an Optimal Control ¶ Since the state is uniquely determined by u u u , define the reduced functional
j ( u ) : = J ( S ( u ) , u ) . j(u):=J(S(u),u). j ( u ) := J ( S ( u ) , u ) . The estimate above implies that j j j is continuous on L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) . The term
α 2 ∥ u ∥ L 2 ( Γ ) 2 \frac\alpha2\|u\|_{L^2(\Gamma)}^2 2 α ∥ u ∥ L 2 ( Γ ) 2 makes j j j coercive:
j ( u ) → + ∞ as ∥ u ∥ L 2 ( Γ ) → + ∞ . j(u)\to+\infty
\qquad\text{as}\qquad
\|u\|_{L^2(\Gamma)}\to+\infty. j ( u ) → + ∞ as ∥ u ∥ L 2 ( Γ ) → + ∞. Because S S S is affine and continuous, the tracking term
1 2 ∥ S ( u ) − y d ∥ L 2 ( Ω ) 2 \frac12\|S(u)-y_d\|_{L^2(\Omega)}^2 2 1 ∥ S ( u ) − y d ∥ L 2 ( Ω ) 2 is weakly lower semicontinuous. The
regularization term is weakly lower semicontinuous as well. Therefore, by
the direct method of the calculus of variations, j j j attains a minimizer on
every nonempty closed convex set U a d ⊂ L 2 ( Γ ) U_{\mathrm{ad}}\subset L^2(\Gamma) U ad ⊂ L 2 ( Γ ) .
Since α > 0 \alpha>0 α > 0 , the reduced functional is strictly convex. Hence the
optimal control is unique.
Adjoint Equation for Neumann Boundary Control ¶ We derive the adjoint through the Lagrangian formalism. The weak state
equation is the constraint
a ( y , v ) − F ( v ) − B ( u , v ) = 0 ∀ v ∈ V . a(y,v)-F(v)-B(u,v)=0
\qquad \forall v\in V. a ( y , v ) − F ( v ) − B ( u , v ) = 0 ∀ v ∈ V . Introduce the Lagrangian
L ( y , u , p ) = 1 2 ∥ y − y d ∥ L 2 ( Ω ) 2 + α 2 ∥ u ∥ L 2 ( Γ ) 2 − [ a ( y , p ) − F ( p ) − B ( u , p ) ] , \mathcal L(y,u,p)
=
\frac12\|y-y_d\|_{L^2(\Omega)}^2
+
\frac\alpha2\|u\|_{L^2(\Gamma)}^2
-
\bigl[
a(y,p)-F(p)-B(u,p)
\bigr], L ( y , u , p ) = 2 1 ∥ y − y d ∥ L 2 ( Ω ) 2 + 2 α ∥ u ∥ L 2 ( Γ ) 2 − [ a ( y , p ) − F ( p ) − B ( u , p ) ] , where p ∈ V p\in V p ∈ V is the Lagrange multiplier. The sign in front of the
constraint is a convention; this choice gives the gradient
α u + γ p \alpha u+\gamma p αu + γ p .
Stationarity with respect to p p p gives the state equation:
D p L ( y , u , p ) ψ = − [ a ( y , ψ ) − F ( ψ ) − B ( u , ψ ) ] = 0 ∀ ψ ∈ V . D_p\mathcal L(y,u,p)\,\psi
=
-\bigl[
a(y,\psi)-F(\psi)-B(u,\psi)
\bigr]
=0
\qquad \forall \psi\in V. D p L ( y , u , p ) ψ = − [ a ( y , ψ ) − F ( ψ ) − B ( u , ψ ) ] = 0 ∀ ψ ∈ V . Stationarity with respect to y y y gives the adjoint equation. For every
state variation η ∈ V \eta\in V η ∈ V ,
D y L ( y , u , p ) η = ( y − y d , η ) L 2 ( Ω ) − a ( η , p ) . D_y\mathcal L(y,u,p)\,\eta
=
(y-y_d,\eta)_{L^2(\Omega)}
-
a(\eta,p). D y L ( y , u , p ) η = ( y − y d , η ) L 2 ( Ω ) − a ( η , p ) . Thus
a ( η , p ) = ( y − y d , η ) L 2 ( Ω ) ∀ η ∈ V . a(\eta,p)=(y-y_d,\eta)_{L^2(\Omega)}
\qquad \forall \eta\in V. a ( η , p ) = ( y − y d , η ) L 2 ( Ω ) ∀ η ∈ V . The adjoint problem is well posed by the same Lax-Milgram argument. If the
solution is smooth, this weak equation corresponds to
{ − Δ p + σ p = y − y d in Ω , ∂ n p = 0 on Γ . \begin{cases}
-\Delta p+\sigma p=y-y_d & \text{in }\Omega,\\
\partial_n p=0 & \text{on }\Gamma.
\end{cases} { − Δ p + σ p = y − y d ∂ n p = 0 in Ω , on Γ. Finally, stationarity with respect to the control gives the derivative in
the control direction. For h ∈ L 2 ( Γ ) h\in L^2(\Gamma) h ∈ L 2 ( Γ ) ,
D u L ( y , u , p ) h = α ( u , h ) L 2 ( Γ ) + B ( h , p ) = ∫ Γ ( α u + γ p ) h d s . D_u\mathcal L(y,u,p)\,h
=
\alpha(u,h)_{L^2(\Gamma)}
+
B(h,p)
=
\int_\Gamma
\left(
\alpha u+\gamma p
\right)h\,ds. D u L ( y , u , p ) h = α ( u , h ) L 2 ( Γ ) + B ( h , p ) = ∫ Γ ( αu + γ p ) h d s . Because the state and adjoint stationarity equations cancel all implicit
state variations, this is the derivative of the reduced functional:
j ′ ( u ) h = ∫ Γ ( α u + γ p ) h d s . j'(u)h
=
\int_\Gamma
\left(
\alpha u+\gamma p
\right)h\,ds. j ′ ( u ) h = ∫ Γ ( αu + γ p ) h d s . Thus the L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) -gradient of the reduced functional is
∇ j ( u ) = α u + γ p . \nabla j(u)=\alpha u+\gamma p. ∇ j ( u ) = αu + γ p . If one puts the opposite sign in front of the constraint in the Lagrangian,
the adjoint variable changes sign and the same optimality system can be
written with α u − γ p \alpha u-\gamma p αu − γ p instead.
Let u ˉ ∈ U a d \bar u\in U_{\mathrm{ad}} u ˉ ∈ U ad be the optimal control, with state y ˉ \bar y y ˉ
and adjoint p ˉ \bar p p ˉ . Since U a d U_{\mathrm{ad}} U ad is closed and convex, the
first-order necessary and sufficient condition is
j ′ ( u ˉ ) ( v − u ˉ ) ≥ 0 ∀ v ∈ U a d . j'(\bar u)(v-\bar u)\ge0
\qquad \forall v\in U_{\mathrm{ad}}. j ′ ( u ˉ ) ( v − u ˉ ) ≥ 0 ∀ v ∈ U ad . Using the gradient formula, this becomes
( α u ˉ + γ p ˉ , v − u ˉ ) L 2 ( Γ ) ≥ 0 ∀ v ∈ U a d . (\alpha\bar u+\gamma\bar p,v-\bar u)_{L^2(\Gamma)}
\ge0
\qquad \forall v\in U_{\mathrm{ad}}. ( α u ˉ + γ p ˉ , v − u ˉ ) L 2 ( Γ ) ≥ 0 ∀ v ∈ U ad . For box constraints, the variational inequality is equivalent to the
pointwise projection formula
u ˉ = P [ u a , u b ] ( − 1 α γ p ˉ ) a.e. on Γ . \bar u
=
P_{[u_a,u_b]}
\left(
-\frac1\alpha\gamma\bar p
\right)
\qquad\text{a.e. on }\Gamma. u ˉ = P [ u a , u b ] ( − α 1 γ p ˉ ) a.e. on Γ. Indeed, for almost every boundary point x ∈ Γ x\in\Gamma x ∈ Γ , the scalar condition is
( α u ˉ ( x ) + γ p ˉ ( x ) ) ( v − u ˉ ( x ) ) ≥ 0 ∀ v ∈ [ u a ( x ) , u b ( x ) ] . (\alpha\bar u(x)+\gamma\bar p(x))(v-\bar u(x))\ge0
\qquad
\forall v\in [u_a(x),u_b(x)]. ( α u ˉ ( x ) + γ p ˉ ( x )) ( v − u ˉ ( x )) ≥ 0 ∀ v ∈ [ u a ( x ) , u b ( x )] . This says exactly that
− ( α u ˉ ( x ) + γ p ˉ ( x ) ) ∈ N [ u a ( x ) , u b ( x ) ] ( u ˉ ( x ) ) , -\bigl(\alpha\bar u(x)+\gamma\bar p(x)\bigr)
\in
N_{[u_a(x),u_b(x)]}(\bar u(x)), − ( α u ˉ ( x ) + γ p ˉ ( x ) ) ∈ N [ u a ( x ) , u b ( x )] ( u ˉ ( x )) , where N C N_C N C denotes the normal cone to the convex set C C C . The normal-cone
identity for projections gives the formula above.
Equivalently:
if u a < u ˉ < u b u_a<\bar u<u_b u a < u ˉ < u b , then α u ˉ + γ p ˉ = 0 \alpha\bar u+\gamma\bar p=0 α u ˉ + γ p ˉ = 0 ;
if u ˉ = u a \bar u=u_a u ˉ = u a , then α u ˉ + γ p ˉ ≥ 0 \alpha\bar u+\gamma\bar p\ge0 α u ˉ + γ p ˉ ≥ 0 ;
if u ˉ = u b \bar u=u_b u ˉ = u b , then α u ˉ + γ p ˉ ≤ 0 \alpha\bar u+\gamma\bar p\le0 α u ˉ + γ p ˉ ≤ 0 .
Dirichlet Boundary Control ¶ For Dirichlet boundary control, the state equation is
{ − Δ y + σ y = f in Ω , y = u on Γ . \begin{cases}
-\Delta y+\sigma y=f & \text{in }\Omega,\\
y=u & \text{on }\Gamma.
\end{cases} { − Δ y + σ y = f y = u in Ω , on Γ. Now the control cannot be an arbitrary element of L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) if the state
is required to lie in H 1 ( Ω ) H^1(\Omega) H 1 ( Ω ) . The natural space is
U = H 1 / 2 ( Γ ) . U=H^{1/2}(\Gamma). U = H 1/2 ( Γ ) . Let V 0 : = H 0 1 ( Ω ) V_0:=H_0^1(\Omega) V 0 := H 0 1 ( Ω ) . By the trace theorem, there exists a continuous
lifting operator
R : H 1 / 2 ( Γ ) → H 1 ( Ω ) R:H^{1/2}(\Gamma)\to H^1(\Omega) R : H 1/2 ( Γ ) → H 1 ( Ω ) such that
γ ( R u ) = u . \gamma(Ru)=u. γ ( R u ) = u . Write
y = w + R u , w ∈ H 0 1 ( Ω ) . y=w+Ru,
\qquad
w\in H_0^1(\Omega). y = w + R u , w ∈ H 0 1 ( Ω ) . The weak equation for w w w is
a ( w , v ) = F ( v ) − a ( R u , v ) ∀ v ∈ H 0 1 ( Ω ) . a(w,v)=F(v)-a(Ru,v)
\qquad \forall v\in H_0^1(\Omega). a ( w , v ) = F ( v ) − a ( R u , v ) ∀ v ∈ H 0 1 ( Ω ) . Since a a a is coercive on H 0 1 ( Ω ) H_0^1(\Omega) H 0 1 ( Ω ) , Lax-Milgram gives a unique
w ∈ H 0 1 ( Ω ) w\in H_0^1(\Omega) w ∈ H 0 1 ( Ω ) , hence a unique state y = w + R u y=w+Ru y = w + R u .
The estimate is
∥ y ∥ H 1 ( Ω ) ≤ C ( ∥ f ∥ H − 1 ( Ω ) + ∥ u ∥ H 1 / 2 ( Γ ) ) . \|y\|_{H^1(\Omega)}
\le
C\left(
\|f\|_{H^{-1}(\Omega)}
+
\|u\|_{H^{1/2}(\Gamma)}
\right). ∥ y ∥ H 1 ( Ω ) ≤ C ( ∥ f ∥ H − 1 ( Ω ) + ∥ u ∥ H 1/2 ( Γ ) ) . Thus the Dirichlet control-to-state map is continuous from
H 1 / 2 ( Γ ) H^{1/2}(\Gamma) H 1/2 ( Γ ) into H 1 ( Ω ) H^1(\Omega) H 1 ( Ω ) .
Dirichlet Gradient and Regularity Warning ¶ For Dirichlet control, it is useful to use the lifted variable
y = w + R u y=w+Ru y = w + R u , with w ∈ H 0 1 ( Ω ) w\in H_0^1(\Omega) w ∈ H 0 1 ( Ω ) . The weak constraint is
a ( w , v ) − F ( v ) + a ( R u , v ) = 0 ∀ v ∈ H 0 1 ( Ω ) . a(w,v)-F(v)+a(Ru,v)=0
\qquad \forall v\in H_0^1(\Omega). a ( w , v ) − F ( v ) + a ( R u , v ) = 0 ∀ v ∈ H 0 1 ( Ω ) . The Lagrangian is
L ( w , u , p ) = 1 2 ∥ w + R u − y d ∥ L 2 ( Ω ) 2 + α 2 ∥ u ∥ U 2 − [ a ( w , p ) − F ( p ) + a ( R u , p ) ] , \mathcal L(w,u,p)
=
\frac12\|w+Ru-y_d\|_{L^2(\Omega)}^2
+
\frac\alpha2\|u\|_U^2
-
\bigl[
a(w,p)-F(p)+a(Ru,p)
\bigr], L ( w , u , p ) = 2 1 ∥ w + R u − y d ∥ L 2 ( Ω ) 2 + 2 α ∥ u ∥ U 2 − [ a ( w , p ) − F ( p ) + a ( R u , p ) ] , where p ∈ H 0 1 ( Ω ) p\in H_0^1(\Omega) p ∈ H 0 1 ( Ω ) is the Lagrange multiplier. Stationarity with
respect to p p p gives the lifted state equation. Stationarity with respect to
w w w gives, for every η ∈ H 0 1 ( Ω ) \eta\in H_0^1(\Omega) η ∈ H 0 1 ( Ω ) ,
D w L ( w , u , p ) η = ( y − y d , η ) L 2 ( Ω ) − a ( η , p ) = 0. D_w\mathcal L(w,u,p)\,\eta
=
(y-y_d,\eta)_{L^2(\Omega)}
-
a(\eta,p)
=0. D w L ( w , u , p ) η = ( y − y d , η ) L 2 ( Ω ) − a ( η , p ) = 0. Therefore the adjoint equation is
a ( η , p ) = ( y − y d , η ) L 2 ( Ω ) ∀ η ∈ H 0 1 ( Ω ) . a(\eta,p)=(y-y_d,\eta)_{L^2(\Omega)}
\qquad \forall \eta\in H_0^1(\Omega). a ( η , p ) = ( y − y d , η ) L 2 ( Ω ) ∀ η ∈ H 0 1 ( Ω ) . The control derivative is obtained by differentiating the same Lagrangian
with respect to u u u . For h ∈ U = H 1 / 2 ( Γ ) h\in U=H^{1/2}(\Gamma) h ∈ U = H 1/2 ( Γ ) ,
D u L ( w , u , p ) h = α ⟨ R U u , h ⟩ U ′ , U + ( y − y d , R h ) L 2 ( Ω ) − a ( R h , p ) . D_u\mathcal L(w,u,p)\,h
=
\alpha\langle R_U u,h\rangle_{U',U}
+
(y-y_d,Rh)_{L^2(\Omega)}
-
a(Rh,p). D u L ( w , u , p ) h = α ⟨ R U u , h ⟩ U ′ , U + ( y − y d , R h ) L 2 ( Ω ) − a ( R h , p ) . The subtle point is that R h Rh R h is generally not an element of
H 0 1 ( Ω ) H_0^1(\Omega) H 0 1 ( Ω ) , because its trace is h h h . Therefore one cannot simply insert
R h Rh R h as a test function in the adjoint equation, which is valid only for
zero-trace variations.
Assume for a moment that p p p is smooth enough for Green’s identity to be used
classically. Since the adjoint satisfies
− Δ p + σ p = y − y d in Ω , p = 0 on Γ , -\Delta p+\sigma p=y-y_d
\qquad\text{in }\Omega,
\qquad
p=0
\qquad\text{on }\Gamma, − Δ p + σ p = y − y d in Ω , p = 0 on Γ , we compute
a ( R h , p ) = ∫ Ω ∇ ( R h ) ⋅ ∇ p d x + σ ∫ Ω R h p d x . a(Rh,p)
=
\int_\Omega \nabla(Rh)\cdot\nabla p\,dx
+
\sigma\int_\Omega Rh\,p\,dx. a ( R h , p ) = ∫ Ω ∇ ( R h ) ⋅ ∇ p d x + σ ∫ Ω R h p d x . Integrating the first term by parts gives
a ( R h , p ) = ∫ Ω R h ( − Δ p + σ p ) d x + ∫ Γ γ ( R h ) ∂ n p d s . a(Rh,p)
=
\int_\Omega Rh(-\Delta p+\sigma p)\,dx
+
\int_\Gamma \gamma(Rh)\,\partial_n p\,ds. a ( R h , p ) = ∫ Ω R h ( − Δ p + σ p ) d x + ∫ Γ γ ( R h ) ∂ n p d s . Using γ ( R h ) = h \gamma(Rh)=h γ ( R h ) = h and the strong adjoint equation, this becomes
a ( R h , p ) = ( y − y d , R h ) L 2 ( Ω ) + ∫ Γ h ∂ n p d s . a(Rh,p)
=
(y-y_d,Rh)_{L^2(\Omega)}
+
\int_\Gamma h\,\partial_n p\,ds. a ( R h , p ) = ( y − y d , R h ) L 2 ( Ω ) + ∫ Γ h ∂ n p d s . Hence the two non-regularization terms in the Lagrangian control derivative
reduce to the boundary term
( y − y d , R h ) L 2 ( Ω ) − a ( R h , p ) = − ⟨ ∂ n p , h ⟩ H − 1 / 2 , H 1 / 2 . (y-y_d,Rh)_{L^2(\Omega)}-a(Rh,p)
=
-\langle \partial_n p,h\rangle_{H^{-1/2},H^{1/2}}. ( y − y d , R h ) L 2 ( Ω ) − a ( R h , p ) = − ⟨ ∂ n p , h ⟩ H − 1/2 , H 1/2 . Hence the control stationarity condition is naturally written as
α R U u ˉ − ∂ n p ˉ = 0 in H − 1 / 2 ( Γ ) , \alpha R_U \bar u-\partial_n \bar p=0
\qquad\text{in }H^{-1/2}(\Gamma), α R U u ˉ − ∂ n p ˉ = 0 in H − 1/2 ( Γ ) , where R U : U → U ′ R_U:U\to U' R U : U → U ′ is the Riesz map of H 1 / 2 ( Γ ) H^{1/2}(\Gamma) H 1/2 ( Γ ) .
This is the key distinction from Neumann control:
Neumann control with u ∈ L 2 ( Γ ) u\in L^2(\Gamma) u ∈ L 2 ( Γ ) gives an L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) gradient
involving the trace γ p \gamma p γ p ;
Dirichlet control gives a boundary gradient involving the normal derivative
∂ n p \partial_n p ∂ n p , which is generally only an element of H − 1 / 2 ( Γ ) H^{-1/2}(\Gamma) H − 1/2 ( Γ ) .
An L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) projection formula for Dirichlet control is therefore a
formal or additional-regularity statement. It is valid, for instance, if
∂ n p ∈ L 2 ( Γ ) \partial_n p\in L^2(\Gamma) ∂ n p ∈ L 2 ( Γ ) and the control is regularized in
L 2 ( Γ ) L^2(\Gamma) L 2 ( Γ ) :
u ˉ = P [ u a , u b ] ( 1 α ∂ n p ˉ ) . \bar u
=
P_{[u_a,u_b]}
\left(
\frac1\alpha\partial_n\bar p
\right). u ˉ = P [ u a , u b ] ( α 1 ∂ n p ˉ ) . Without such regularity, the correct condition is the variational inequality
in the duality pairing between H − 1 / 2 ( Γ ) H^{-1/2}(\Gamma) H − 1/2 ( Γ ) and H 1 / 2 ( Γ ) H^{1/2}(\Gamma) H 1/2 ( Γ ) .
Parameter Estimation Problem ¶ We now consider an inverse problem in which the unknown is a coefficient in
the PDE. Let
0 < q min ≤ q ( x ) ≤ q max < ∞ a.e. in Ω . 0<q_{\min}\le q(x)\le q_{\max}<\infty
\qquad\text{a.e. in }\Omega. 0 < q m i n ≤ q ( x ) ≤ q m a x < ∞ a.e. in Ω. For each admissible coefficient q q q , the state y = y ( q ) ∈ H 0 1 ( Ω ) y=y(q)\in H_0^1(\Omega) y = y ( q ) ∈ H 0 1 ( Ω ) is
defined by
∫ Ω q ∇ y ⋅ ∇ v d x = ⟨ f , v ⟩ H − 1 , H 0 1 ∀ v ∈ H 0 1 ( Ω ) . \int_\Omega q\nabla y\cdot\nabla v\,dx
=
\langle f,v\rangle_{H^{-1},H_0^1}
\qquad \forall v\in H_0^1(\Omega). ∫ Ω q ∇ y ⋅ ∇ v d x = ⟨ f , v ⟩ H − 1 , H 0 1 ∀ v ∈ H 0 1 ( Ω ) . In strong form this corresponds to
{ − ∇ ⋅ ( q ∇ y ) = f in Ω , y = 0 on Γ . \begin{cases}
-\nabla\cdot(q\nabla y)=f & \text{in }\Omega,\\
y=0 & \text{on }\Gamma.
\end{cases} { − ∇ ⋅ ( q ∇ y ) = f y = 0 in Ω , on Γ. The coefficient identification problem is to choose q q q so that y ( q ) y(q) y ( q ) fits
observed data y d y_d y d . A standard Tikhonov functional is
j ( q ) = 1 2 ∥ y ( q ) − y d ∥ L 2 ( Ω ) 2 + α 2 ∥ q − q r e f ∥ L 2 ( Ω ) 2 . j(q)
=
\frac12\|y(q)-y_d\|_{L^2(\Omega)}^2
+
\frac\alpha2\|q-q_{\mathrm{ref}}\|_{L^2(\Omega)}^2. j ( q ) = 2 1 ∥ y ( q ) − y d ∥ L 2 ( Ω ) 2 + 2 α ∥ q − q ref ∥ L 2 ( Ω ) 2 . The admissible set is often taken as
Q a d = { q ∈ L ∞ ( Ω ) : q a ≤ q ≤ q b a.e. in Ω } , Q_{\mathrm{ad}}
=
\{q\in L^\infty(\Omega):
q_a\le q\le q_b \text{ a.e. in }\Omega\}, Q ad = { q ∈ L ∞ ( Ω ) : q a ≤ q ≤ q b a.e. in Ω } , with 0 < q a ≤ q b 0<q_a\le q_b 0 < q a ≤ q b a.e.
For every q ∈ Q a d q\in Q_{\mathrm{ad}} q ∈ Q ad , the bilinear form
a q ( y , v ) : = ∫ Ω q ∇ y ⋅ ∇ v d x a_q(y,v):=\int_\Omega q\nabla y\cdot\nabla v\,dx a q ( y , v ) := ∫ Ω q ∇ y ⋅ ∇ v d x is continuous and coercive on H 0 1 ( Ω ) H_0^1(\Omega) H 0 1 ( Ω ) :
∣ a q ( y , v ) ∣ ≤ ∥ q ∥ L ∞ ( Ω ) ∥ ∇ y ∥ L 2 ( Ω ) ∥ ∇ v ∥ L 2 ( Ω ) , |a_q(y,v)|
\le
\|q\|_{L^\infty(\Omega)}
\|\nabla y\|_{L^2(\Omega)}
\|\nabla v\|_{L^2(\Omega)}, ∣ a q ( y , v ) ∣ ≤ ∥ q ∥ L ∞ ( Ω ) ∥∇ y ∥ L 2 ( Ω ) ∥∇ v ∥ L 2 ( Ω ) , a q ( v , v ) ≥ q min ∥ ∇ v ∥ L 2 ( Ω ) 2 . a_q(v,v)
\ge
q_{\min}\|\nabla v\|_{L^2(\Omega)}^2. a q ( v , v ) ≥ q m i n ∥∇ v ∥ L 2 ( Ω ) 2 . Lax-Milgram therefore gives a unique state y ( q ) y(q) y ( q ) and the stability estimate
∥ y ( q ) ∥ H 0 1 ( Ω ) ≤ C P q min ∥ f ∥ H − 1 ( Ω ) . \|y(q)\|_{H_0^1(\Omega)}
\le
\frac{C_P}{q_{\min}}\|f\|_{H^{-1}(\Omega)}. ∥ y ( q ) ∥ H 0 1 ( Ω ) ≤ q m i n C P ∥ f ∥ H − 1 ( Ω ) . Sensitivity Equation for the Coefficient ¶ The coefficient-to-state map is nonlinear. We derive its derivative.
Let δ q ∈ L ∞ ( Ω ) \delta q\in L^\infty(\Omega) δ q ∈ L ∞ ( Ω ) be a perturbation such that
q + t δ q q+t\delta q q + t δ q remains uniformly positive for small t t t . Let
y t : = y ( q + t δ q ) , y : = y ( q ) . y_t:=y(q+t\delta q),
\qquad
y:=y(q). y t := y ( q + t δ q ) , y := y ( q ) . Subtracting the two weak state equations gives
∫ Ω q ∇ ( y t − y ) ⋅ ∇ v d x = − t ∫ Ω δ q ∇ y t ⋅ ∇ v d x ∀ v ∈ H 0 1 ( Ω ) . \int_\Omega q\nabla(y_t-y)\cdot\nabla v\,dx
=
-t\int_\Omega \delta q\nabla y_t\cdot\nabla v\,dx
\qquad \forall v\in H_0^1(\Omega). ∫ Ω q ∇ ( y t − y ) ⋅ ∇ v d x = − t ∫ Ω δ q ∇ y t ⋅ ∇ v d x ∀ v ∈ H 0 1 ( Ω ) . Divide by t t t and pass formally to the limit t → 0 t\to0 t → 0 . The sensitivity
z : = y ′ ( q ) δ q z:=y'(q)\delta q z := y ′ ( q ) δ q solves
∫ Ω q ∇ z ⋅ ∇ v d x = − ∫ Ω δ q ∇ y ⋅ ∇ v d x ∀ v ∈ H 0 1 ( Ω ) . \int_\Omega q\nabla z\cdot\nabla v\,dx
=
-\int_\Omega \delta q\nabla y\cdot\nabla v\,dx
\qquad \forall v\in H_0^1(\Omega). ∫ Ω q ∇ z ⋅ ∇ v d x = − ∫ Ω δ q ∇ y ⋅ ∇ v d x ∀ v ∈ H 0 1 ( Ω ) . In strong form,
{ − ∇ ⋅ ( q ∇ z ) = ∇ ⋅ ( δ q ∇ y ) in Ω , z = 0 on Γ . \begin{cases}
-\nabla\cdot(q\nabla z)=\nabla\cdot(\delta q\nabla y) & \text{in }\Omega,\\
z=0 & \text{on }\Gamma.
\end{cases} { − ∇ ⋅ ( q ∇ z ) = ∇ ⋅ ( δ q ∇ y ) z = 0 in Ω , on Γ. The equation is well posed because the right-hand side is bounded on
H 0 1 ( Ω ) H_0^1(\Omega) H 0 1 ( Ω ) :
∣ ∫ Ω δ q ∇ y ⋅ ∇ v d x ∣ ≤ ∥ δ q ∥ L ∞ ( Ω ) ∥ ∇ y ∥ L 2 ( Ω ) ∥ ∇ v ∥ L 2 ( Ω ) . \left|
\int_\Omega \delta q\nabla y\cdot\nabla v\,dx
\right|
\le
\|\delta q\|_{L^\infty(\Omega)}
\|\nabla y\|_{L^2(\Omega)}
\|\nabla v\|_{L^2(\Omega)}. ∣ ∣ ∫ Ω δ q ∇ y ⋅ ∇ v d x ∣ ∣ ≤ ∥ δ q ∥ L ∞ ( Ω ) ∥∇ y ∥ L 2 ( Ω ) ∥∇ v ∥ L 2 ( Ω ) . Thus
∥ z ∥ H 0 1 ( Ω ) ≤ C ∥ δ q ∥ L ∞ ( Ω ) ∥ y ∥ H 0 1 ( Ω ) . \|z\|_{H_0^1(\Omega)}
\le
C
\|\delta q\|_{L^\infty(\Omega)}
\|y\|_{H_0^1(\Omega)}. ∥ z ∥ H 0 1 ( Ω ) ≤ C ∥ δ q ∥ L ∞ ( Ω ) ∥ y ∥ H 0 1 ( Ω ) . This proves that the derivative maps bounded coefficient perturbations to
state perturbations continuously.
Adjoint Gradient for Parameter Estimation ¶ We again derive the adjoint and the gradient through the Lagrangian. The weak
state constraint is
∫ Ω q ∇ y ⋅ ∇ v d x − ⟨ f , v ⟩ H − 1 , H 0 1 = 0 ∀ v ∈ H 0 1 ( Ω ) . \int_\Omega q\nabla y\cdot\nabla v\,dx
-
\langle f,v\rangle_{H^{-1},H_0^1}
=0
\qquad \forall v\in H_0^1(\Omega). ∫ Ω q ∇ y ⋅ ∇ v d x − ⟨ f , v ⟩ H − 1 , H 0 1 = 0 ∀ v ∈ H 0 1 ( Ω ) . Introduce
L ( y , q , p ) = 1 2 ∥ y − y d ∥ L 2 ( Ω ) 2 + α 2 ∥ q − q r e f ∥ L 2 ( Ω ) 2 − [ ∫ Ω q ∇ y ⋅ ∇ p d x − ⟨ f , p ⟩ H − 1 , H 0 1 ] . \mathcal L(y,q,p)
=
\frac12\|y-y_d\|_{L^2(\Omega)}^2
+
\frac\alpha2\|q-q_{\mathrm{ref}}\|_{L^2(\Omega)}^2
-
\left[
\int_\Omega q\nabla y\cdot\nabla p\,dx
-
\langle f,p\rangle_{H^{-1},H_0^1}
\right]. L ( y , q , p ) = 2 1 ∥ y − y d ∥ L 2 ( Ω ) 2 + 2 α ∥ q − q ref ∥ L 2 ( Ω ) 2 − [ ∫ Ω q ∇ y ⋅ ∇ p d x − ⟨ f , p ⟩ H − 1 , H 0 1 ] . Stationarity with respect to p p p gives the state equation. Stationarity with
respect to y y y gives, for every η ∈ H 0 1 ( Ω ) \eta\in H_0^1(\Omega) η ∈ H 0 1 ( Ω ) ,
D y L ( y , q , p ) η = ( y − y d , η ) L 2 ( Ω ) − ∫ Ω q ∇ η ⋅ ∇ p d x = 0. D_y\mathcal L(y,q,p)\,\eta
=
(y-y_d,\eta)_{L^2(\Omega)}
-
\int_\Omega q\nabla \eta\cdot\nabla p\,dx
=0. D y L ( y , q , p ) η = ( y − y d , η ) L 2 ( Ω ) − ∫ Ω q ∇ η ⋅ ∇ p d x = 0. Therefore
∫ Ω q ∇ η ⋅ ∇ p d x = ( y − y d , η ) L 2 ( Ω ) ∀ η ∈ H 0 1 ( Ω ) . \int_\Omega q\nabla \eta\cdot\nabla p\,dx
=
(y-y_d,\eta)_{L^2(\Omega)}
\qquad \forall \eta\in H_0^1(\Omega). ∫ Ω q ∇ η ⋅ ∇ p d x = ( y − y d , η ) L 2 ( Ω ) ∀ η ∈ H 0 1 ( Ω ) . Equivalently, in strong form,
{ − ∇ ⋅ ( q ∇ p ) = y − y d in Ω , p = 0 on Γ . \begin{cases}
-\nabla\cdot(q\nabla p)=y-y_d & \text{in }\Omega,\\
p=0 & \text{on }\Gamma.
\end{cases} { − ∇ ⋅ ( q ∇ p ) = y − y d p = 0 in Ω , on Γ. The coefficient derivative is
D q L ( y , q , p ) δ q = ∫ Ω [ α ( q − q r e f ) − ∇ y ⋅ ∇ p ] δ q d x . D_q\mathcal L(y,q,p)\,\delta q
=
\int_\Omega
\left[
\alpha(q-q_{\mathrm{ref}})
-
\nabla y\cdot\nabla p
\right]
\delta q\,dx. D q L ( y , q , p ) δ q = ∫ Ω [ α ( q − q ref ) − ∇ y ⋅ ∇ p ] δ q d x . Since the state and adjoint equations are precisely the stationarity
conditions in p p p and y y y , this is the derivative of the reduced functional:
j ′ ( q ) δ q = D q L ( y , q , p ) δ q . j'(q)\delta q
=
D_q\mathcal L(y,q,p)\,\delta q. j ′ ( q ) δ q = D q L ( y , q , p ) δ q . If the product ∇ y ⋅ ∇ p \nabla y\cdot\nabla p ∇ y ⋅ ∇ p belongs to L 2 ( Ω ) L^2(\Omega) L 2 ( Ω ) , then the
L 2 ( Ω ) L^2(\Omega) L 2 ( Ω ) -gradient is
∇ j ( q ) = α ( q − q r e f ) − ∇ y ⋅ ∇ p . \nabla j(q)
=
\alpha(q-q_{\mathrm{ref}})
-
\nabla y\cdot\nabla p. ∇ j ( q ) = α ( q − q ref ) − ∇ y ⋅ ∇ p . In the minimal energy setting, ∇ y , ∇ p ∈ L 2 ( Ω ) d \nabla y,\nabla p\in L^2(\Omega)^d ∇ y , ∇ p ∈ L 2 ( Ω ) d , so the
product is a priori only in L 1 ( Ω ) L^1(\Omega) L 1 ( Ω ) . Therefore, the formula is always
valid as a duality expression, while interpreting it as an L 2 L^2 L 2 gradient
requires additional regularity or a discrete finite element setting.
Box Constraints for Parameter Estimation ¶ Let q ˉ ∈ Q a d \bar q\in Q_{\mathrm{ad}} q ˉ ∈ Q ad be a local minimizer, with state y ˉ \bar y y ˉ and
adjoint p ˉ \bar p p ˉ . The first-order condition is the variational inequality
j ′ ( q ˉ ) ( r − q ˉ ) ≥ 0 ∀ r ∈ Q a d . j'(\bar q)(r-\bar q)\ge0
\qquad \forall r\in Q_{\mathrm{ad}}. j ′ ( q ˉ ) ( r − q ˉ ) ≥ 0 ∀ r ∈ Q ad . Using the adjoint formula, this reads
∫ Ω [ α ( q ˉ − q r e f ) − ∇ y ˉ ⋅ ∇ p ˉ ] ( r − q ˉ ) d x ≥ 0 ∀ r ∈ Q a d . \int_\Omega
\left[
\alpha(\bar q-q_{\mathrm{ref}})
-
\nabla\bar y\cdot\nabla\bar p
\right]
(r-\bar q)\,dx
\ge0
\qquad \forall r\in Q_{\mathrm{ad}}. ∫ Ω [ α ( q ˉ − q ref ) − ∇ y ˉ ⋅ ∇ p ˉ ] ( r − q ˉ ) d x ≥ 0 ∀ r ∈ Q ad . When the gradient is an L 2 ( Ω ) L^2(\Omega) L 2 ( Ω ) function, this is equivalent to the
pointwise projection formula
q ˉ = P [ q a , q b ] ( q r e f + 1 α ∇ y ˉ ⋅ ∇ p ˉ ) a.e. in Ω . \bar q
=
P_{[q_a,q_b]}
\left(
q_{\mathrm{ref}}
+
\frac1\alpha\nabla\bar y\cdot\nabla\bar p
\right)
\qquad\text{a.e. in }\Omega. q ˉ = P [ q a , q b ] ( q ref + α 1 ∇ y ˉ ⋅ ∇ p ˉ ) a.e. in Ω. The corresponding active-set interpretation is:
on the inactive set q a < q ˉ < q b q_a<\bar q<q_b q a < q ˉ < q b ,
α ( q ˉ − q r e f ) − ∇ y ˉ ⋅ ∇ p ˉ = 0 ; \alpha(\bar q-q_{\mathrm{ref}})
-
\nabla\bar y\cdot\nabla\bar p=0; α ( q ˉ − q ref ) − ∇ y ˉ ⋅ ∇ p ˉ = 0 ; on the lower active set q ˉ = q a \bar q=q_a q ˉ = q a ,
α ( q ˉ − q r e f ) − ∇ y ˉ ⋅ ∇ p ˉ ≥ 0 ; \alpha(\bar q-q_{\mathrm{ref}})
-
\nabla\bar y\cdot\nabla\bar p\ge0; α ( q ˉ − q ref ) − ∇ y ˉ ⋅ ∇ p ˉ ≥ 0 ; on the upper active set q ˉ = q b \bar q=q_b q ˉ = q b ,
α ( q ˉ − q r e f ) − ∇ y ˉ ⋅ ∇ p ˉ ≤ 0. \alpha(\bar q-q_{\mathrm{ref}})
-
\nabla\bar y\cdot\nabla\bar p\le0. α ( q ˉ − q ref ) − ∇ y ˉ ⋅ ∇ p ˉ ≤ 0. Existence and Regularization in Inverse Problems ¶ For distributed and Neumann controls, the regularization term
α 2 ∥ u ∥ L 2 2 \frac\alpha2\|u\|_{L^2}^2 2 α ∥ u ∥ L 2 2 gives coercivity in the same Hilbert space in
which the control is optimized.
For coefficient estimation, the situation is more subtle:
the PDE requires q ∈ L ∞ ( Ω ) q\in L^\infty(\Omega) q ∈ L ∞ ( Ω ) with a positive lower bound;
the Tikhonov term above controls only the L 2 ( Ω ) L^2(\Omega) L 2 ( Ω ) distance from
q r e f q_{\mathrm{ref}} q ref ;
weak convergence of coefficients does not automatically imply strong
convergence of the corresponding fluxes q ∇ y q\nabla y q ∇ y .
In a finite element discretization, existence is automatic because the
admissible set is finite-dimensional and closed. In an infinite-dimensional
analysis, one usually strengthens the compactness by choosing, for example,
α 2 ∥ q − q r e f ∥ H 1 ( Ω ) 2 \frac\alpha2\|q-q_{\mathrm{ref}}\|_{H^1(\Omega)}^2 2 α ∥ q − q ref ∥ H 1 ( Ω ) 2 or a bounded-variation regularization. These choices improve compactness and
make the inverse problem stable with respect to noisy data.
This is the essential role of regularization: it is not merely a numerical
penalty, but part of the mathematical formulation that turns an ill-posed
inverse problem into a well-posed optimization problem.
Numerical Structure ¶ Both classes of problems lead to the same computational loop:
solve the state equation;
solve the adjoint equation;
compute the reduced gradient;
apply a descent, Newton, or active-set update;
enforce the admissible constraints.
For Neumann boundary control, the discrete gradient contains a boundary mass
matrix because the control inner product is on Γ \Gamma Γ :
M Γ G = α M Γ U + M Γ P Γ . M_\Gamma G
=
\alpha M_\Gamma U
+
M_\Gamma P_\Gamma. M Γ G = α M Γ U + M Γ P Γ . For Dirichlet boundary control, the discrete gradient involves the discrete
normal derivative of the adjoint. This is more sensitive to mesh quality and
boundary approximation.
For parameter estimation, the gradient at a quadrature point has the form
g h = α ( q h − q r e f , h ) − ∇ y h ⋅ ∇ p h . g_h
=
\alpha(q_h-q_{\mathrm{ref},h})
-
\nabla y_h\cdot\nabla p_h. g h = α ( q h − q ref , h ) − ∇ y h ⋅ ∇ p h . Thus the quality of the gradient depends on the accuracy of both state and
adjoint gradients. This is why coefficient identification problems often
require stabilization, mesh refinement, and careful treatment of noisy data.
Summary ¶ Boundary control and parameter estimation fit into the same reduced
optimization framework as distributed control, but the spaces change:
Neumann control uses trace terms and gives gradients involving γ p \gamma p γ p ;
Dirichlet control uses liftings and gives dual boundary gradients involving
∂ n p \partial_n p ∂ n p ;
parameter estimation differentiates the PDE operator itself and gives
gradients involving ∇ y ⋅ ∇ p \nabla y\cdot\nabla p ∇ y ⋅ ∇ p .
The main mathematical lessons are:
the weak formulation determines the correct control space;
Lax-Milgram gives the state and adjoint equations once coercivity and
boundedness are checked;
the reduced gradient is obtained by differentiating the Lagrangian with
respect to the control or parameter after imposing state and adjoint
stationarity;
projection formulas are pointwise versions of variational inequalities;
inverse problems require regularization not only for algorithms, but also
for stability and well-posedness.
Suggested Reading ¶ Tröltzsch ¶ Manzoni-Quarteroni-Salsa ¶ De Los Reyes ¶ Reduced formulation
Adjoint equations
Regularization