Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Boundary Control and Parameter Estimation

University of Pisa

Overview

The previous lectures focused mostly on distributed controls, where the control acts as a volume source term. This lecture moves to two situations where the same optimal-control structure remains visible, but the functional analytic details become more delicate:

The main new point is that the control-to-state map no longer has the same regularity as in the distributed case. Boundary controls interact with trace operators, while parameter controls enter the differential operator itself. Both effects change the correct space in which gradients and optimality conditions should be interpreted.

The logical path is:

Throughout the lecture, let ΩRd\Omega\subset\mathbb R^d be a bounded Lipschitz domain, and let Γ:=Ω\Gamma:=\partial\Omega.


Boundary Controls and Trace Spaces

For a distributed control, the state equation typically has the form

Ay=Bu+fin Ω,\mathcal A y = Bu+f \qquad\text{in }\Omega,

where BB maps the control space into a volume dual space such as H1(Ω)H^{-1}(\Omega).

For a boundary control, the action of uu is instead mediated by the trace operator. The fundamental mapping is

γ:H1(Ω)H1/2(Γ),\gamma:H^1(\Omega)\to H^{1/2}(\Gamma),

which is continuous. Since H1/2(Γ)H^{1/2}(\Gamma) embeds continuously into L2(Γ)L^2(\Gamma) on bounded Lipschitz boundaries, there is a constant Ctr>0C_{\mathrm{tr}}>0 such that

γvL2(Γ)CtrvH1(Ω)vH1(Ω).\|\gamma v\|_{L^2(\Gamma)} \le C_{\mathrm{tr}}\|v\|_{H^1(\Omega)} \qquad \forall v\in H^1(\Omega).

This estimate is the key reason why Neumann data in L2(Γ)L^2(\Gamma) define a bounded linear functional on H1(Ω)H^1(\Omega):

ΓuγvdsuL2(Γ)γvL2(Γ)CtruL2(Γ)vH1(Ω).\left|\int_\Gamma u\,\gamma v\,ds\right| \le \|u\|_{L^2(\Gamma)}\|\gamma v\|_{L^2(\Gamma)} \le C_{\mathrm{tr}}\|u\|_{L^2(\Gamma)}\|v\|_{H^1(\Omega)}.

Thus Neumann boundary control is naturally compatible with weak formulations. Dirichlet boundary control is subtler, because prescribing y=uy=u on Γ\Gamma requires uu to be a trace of an H1(Ω)H^1(\Omega) function, hence

uH1/2(Γ)u\in H^{1/2}(\Gamma)

is the natural energy space.


A Coercive Neumann Boundary Control Model

To avoid the compatibility condition of the pure Neumann Laplacian, we first consider the coercive elliptic equation

{Δy+σy=fin Ω,ny=uon Γ,\begin{cases} -\Delta y+\sigma y=f & \text{in }\Omega,\\ \partial_n y=u & \text{on }\Gamma, \end{cases}

where σ>0\sigma>0. The pure Neumann case σ=0\sigma=0 is discussed below.

Let

V:=H1(Ω),U:=L2(Γ),V:=H^1(\Omega), \qquad U:=L^2(\Gamma),

and define

a(y,v):=Ωyvdx+σΩyvdx,a(y,v):= \int_\Omega \nabla y\cdot\nabla v\,dx + \sigma\int_\Omega yv\,dx,
F(v):=Ωfvdx,B(u,v):=Γuγvds.F(v):=\int_\Omega f v\,dx, \qquad B(u,v):=\int_\Gamma u\,\gamma v\,ds.

The weak state equation is:

find yVy\in V such that

a(y,v)=F(v)+B(u,v)vV.a(y,v)=F(v)+B(u,v) \qquad \forall v\in V.

For a desired state ydL2(Ω)y_d\in L^2(\Omega) and a regularization parameter α>0\alpha>0, the boundary control problem is

minuUadJ(y,u):=12yydL2(Ω)2+α2uL2(Γ)2\min_{u\in U_{\mathrm{ad}}} J(y,u) := \frac12\|y-y_d\|_{L^2(\Omega)}^2 + \frac\alpha2\|u\|_{L^2(\Gamma)}^2

subject to the weak state equation.

The admissible set is assumed to be nonempty, closed, and convex in L2(Γ)L^2(\Gamma). A standard example is the box-constrained set

Uad:={uL2(Γ):uauub a.e. on Γ},U_{\mathrm{ad}} := \{u\in L^2(\Gamma): u_a\le u\le u_b \text{ a.e. on }\Gamma\},

where ua,ubL(Γ)u_a,u_b\in L^\infty(\Gamma) and uaubu_a\le u_b a.e.


Well-Posedness of the Neumann State Equation

The bilinear form aa is continuous on V×VV\times V because

a(y,v)yL2(Ω)vL2(Ω)+σyL2(Ω)vL2(Ω)CyH1(Ω)vH1(Ω).|a(y,v)| \le \|\nabla y\|_{L^2(\Omega)}\|\nabla v\|_{L^2(\Omega)} + \sigma\|y\|_{L^2(\Omega)}\|v\|_{L^2(\Omega)} \le C\|y\|_{H^1(\Omega)}\|v\|_{H^1(\Omega)}.

It is also coercive on H1(Ω)H^1(\Omega):

a(v,v)=vL2(Ω)2+σvL2(Ω)2min{1,σ}vH1(Ω)2.a(v,v) = \|\nabla v\|_{L^2(\Omega)}^2 + \sigma\|v\|_{L^2(\Omega)}^2 \ge \min\{1,\sigma\}\|v\|_{H^1(\Omega)}^2.

The right-hand side is bounded on VV. Indeed,

F(v)fL2(Ω)vL2(Ω)fL2(Ω)vH1(Ω),|F(v)| \le \|f\|_{L^2(\Omega)}\|v\|_{L^2(\Omega)} \le \|f\|_{L^2(\Omega)}\|v\|_{H^1(\Omega)},

and the trace estimate gives

B(u,v)CtruL2(Γ)vH1(Ω).|B(u,v)| \le C_{\mathrm{tr}}\|u\|_{L^2(\Gamma)}\|v\|_{H^1(\Omega)}.

By Lax-Milgram, for every uL2(Γ)u\in L^2(\Gamma) there exists a unique state y(u)H1(Ω)y(u)\in H^1(\Omega) and

y(u)H1(Ω)C(fL2(Ω)+uL2(Γ)).\|y(u)\|_{H^1(\Omega)} \le C\left( \|f\|_{L^2(\Omega)} + \|u\|_{L^2(\Gamma)} \right).

Moreover, the control-to-state map

S:L2(Γ)H1(Ω),S(u)=y(u),S:L^2(\Gamma)\to H^1(\Omega), \qquad S(u)=y(u),

is affine and continuous. If ff is fixed, its derivative is the linear map S(u)h=zS'(u)h=z, where zH1(Ω)z\in H^1(\Omega) solves

a(z,v)=B(h,v)vH1(Ω).a(z,v)=B(h,v) \qquad \forall v\in H^1(\Omega).

For the pure Neumann equation σ=0\sigma=0, constants belong to the kernel of the operator. One either works in the quotient space H1(Ω)/RH^1(\Omega)/\mathbb R, or fixes the representative by imposing zero mean:

V0:={vH1(Ω):Ωvdx=0}.V_0:= \left\{ v\in H^1(\Omega): \int_\Omega v\,dx=0 \right\}.

On V0V_0, Poincare’s inequality gives coercivity of a(v,v)=vL2(Ω)2a(v,v)=\|\nabla v\|_{L^2(\Omega)}^2. If the equation is required to hold for all test functions in H1(Ω)H^1(\Omega), the data must also satisfy the compatibility condition

Ωfdx+Γuds=0.\int_\Omega f\,dx+\int_\Gamma u\,ds=0.

This is why adding a positive reaction term is often the cleaner model for unconstrained Neumann boundary control.


Reduced Functional and Existence of an Optimal Control

Since the state is uniquely determined by uu, define the reduced functional

j(u):=J(S(u),u).j(u):=J(S(u),u).

The estimate above implies that jj is continuous on L2(Γ)L^2(\Gamma). The term α2uL2(Γ)2\frac\alpha2\|u\|_{L^2(\Gamma)}^2 makes jj coercive:

j(u)+asuL2(Γ)+.j(u)\to+\infty \qquad\text{as}\qquad \|u\|_{L^2(\Gamma)}\to+\infty.

Because SS is affine and continuous, the tracking term 12S(u)ydL2(Ω)2\frac12\|S(u)-y_d\|_{L^2(\Omega)}^2 is weakly lower semicontinuous. The regularization term is weakly lower semicontinuous as well. Therefore, by the direct method of the calculus of variations, jj attains a minimizer on every nonempty closed convex set UadL2(Γ)U_{\mathrm{ad}}\subset L^2(\Gamma).

Since α>0\alpha>0, the reduced functional is strictly convex. Hence the optimal control is unique.


Adjoint Equation for Neumann Boundary Control

We derive the adjoint through the Lagrangian formalism. The weak state equation is the constraint

a(y,v)F(v)B(u,v)=0vV.a(y,v)-F(v)-B(u,v)=0 \qquad \forall v\in V.

Introduce the Lagrangian

L(y,u,p)=12yydL2(Ω)2+α2uL2(Γ)2[a(y,p)F(p)B(u,p)],\mathcal L(y,u,p) = \frac12\|y-y_d\|_{L^2(\Omega)}^2 + \frac\alpha2\|u\|_{L^2(\Gamma)}^2 - \bigl[ a(y,p)-F(p)-B(u,p) \bigr],

where pVp\in V is the Lagrange multiplier. The sign in front of the constraint is a convention; this choice gives the gradient αu+γp\alpha u+\gamma p.

Stationarity with respect to pp gives the state equation:

DpL(y,u,p)ψ=[a(y,ψ)F(ψ)B(u,ψ)]=0ψV.D_p\mathcal L(y,u,p)\,\psi = -\bigl[ a(y,\psi)-F(\psi)-B(u,\psi) \bigr] =0 \qquad \forall \psi\in V.

Stationarity with respect to yy gives the adjoint equation. For every state variation ηV\eta\in V,

DyL(y,u,p)η=(yyd,η)L2(Ω)a(η,p).D_y\mathcal L(y,u,p)\,\eta = (y-y_d,\eta)_{L^2(\Omega)} - a(\eta,p).

Thus

a(η,p)=(yyd,η)L2(Ω)ηV.a(\eta,p)=(y-y_d,\eta)_{L^2(\Omega)} \qquad \forall \eta\in V.

The adjoint problem is well posed by the same Lax-Milgram argument. If the solution is smooth, this weak equation corresponds to

{Δp+σp=yydin Ω,np=0on Γ.\begin{cases} -\Delta p+\sigma p=y-y_d & \text{in }\Omega,\\ \partial_n p=0 & \text{on }\Gamma. \end{cases}

Finally, stationarity with respect to the control gives the derivative in the control direction. For hL2(Γ)h\in L^2(\Gamma),

DuL(y,u,p)h=α(u,h)L2(Γ)+B(h,p)=Γ(αu+γp)hds.D_u\mathcal L(y,u,p)\,h = \alpha(u,h)_{L^2(\Gamma)} + B(h,p) = \int_\Gamma \left( \alpha u+\gamma p \right)h\,ds.

Because the state and adjoint stationarity equations cancel all implicit state variations, this is the derivative of the reduced functional:

j(u)h=Γ(αu+γp)hds.j'(u)h = \int_\Gamma \left( \alpha u+\gamma p \right)h\,ds.

Thus the L2(Γ)L^2(\Gamma)-gradient of the reduced functional is

j(u)=αu+γp.\nabla j(u)=\alpha u+\gamma p.

If one puts the opposite sign in front of the constraint in the Lagrangian, the adjoint variable changes sign and the same optimality system can be written with αuγp\alpha u-\gamma p instead.


Variational Inequality and Projection Formula

Let uˉUad\bar u\in U_{\mathrm{ad}} be the optimal control, with state yˉ\bar y and adjoint pˉ\bar p. Since UadU_{\mathrm{ad}} is closed and convex, the first-order necessary and sufficient condition is

j(uˉ)(vuˉ)0vUad.j'(\bar u)(v-\bar u)\ge0 \qquad \forall v\in U_{\mathrm{ad}}.

Using the gradient formula, this becomes

(αuˉ+γpˉ,vuˉ)L2(Γ)0vUad.(\alpha\bar u+\gamma\bar p,v-\bar u)_{L^2(\Gamma)} \ge0 \qquad \forall v\in U_{\mathrm{ad}}.

For box constraints, the variational inequality is equivalent to the pointwise projection formula

uˉ=P[ua,ub](1αγpˉ)a.e. on Γ.\bar u = P_{[u_a,u_b]} \left( -\frac1\alpha\gamma\bar p \right) \qquad\text{a.e. on }\Gamma.

Indeed, for almost every boundary point xΓx\in\Gamma, the scalar condition is

(αuˉ(x)+γpˉ(x))(vuˉ(x))0v[ua(x),ub(x)].(\alpha\bar u(x)+\gamma\bar p(x))(v-\bar u(x))\ge0 \qquad \forall v\in [u_a(x),u_b(x)].

This says exactly that

(αuˉ(x)+γpˉ(x))N[ua(x),ub(x)](uˉ(x)),-\bigl(\alpha\bar u(x)+\gamma\bar p(x)\bigr) \in N_{[u_a(x),u_b(x)]}(\bar u(x)),

where NCN_C denotes the normal cone to the convex set CC. The normal-cone identity for projections gives the formula above.

Equivalently:


Dirichlet Boundary Control

For Dirichlet boundary control, the state equation is

{Δy+σy=fin Ω,y=uon Γ.\begin{cases} -\Delta y+\sigma y=f & \text{in }\Omega,\\ y=u & \text{on }\Gamma. \end{cases}

Now the control cannot be an arbitrary element of L2(Γ)L^2(\Gamma) if the state is required to lie in H1(Ω)H^1(\Omega). The natural space is

U=H1/2(Γ).U=H^{1/2}(\Gamma).

Let V0:=H01(Ω)V_0:=H_0^1(\Omega). By the trace theorem, there exists a continuous lifting operator

R:H1/2(Γ)H1(Ω)R:H^{1/2}(\Gamma)\to H^1(\Omega)

such that

γ(Ru)=u.\gamma(Ru)=u.

Write

y=w+Ru,wH01(Ω).y=w+Ru, \qquad w\in H_0^1(\Omega).

The weak equation for ww is

a(w,v)=F(v)a(Ru,v)vH01(Ω).a(w,v)=F(v)-a(Ru,v) \qquad \forall v\in H_0^1(\Omega).

Since aa is coercive on H01(Ω)H_0^1(\Omega), Lax-Milgram gives a unique wH01(Ω)w\in H_0^1(\Omega), hence a unique state y=w+Ruy=w+Ru.

The estimate is

yH1(Ω)C(fH1(Ω)+uH1/2(Γ)).\|y\|_{H^1(\Omega)} \le C\left( \|f\|_{H^{-1}(\Omega)} + \|u\|_{H^{1/2}(\Gamma)} \right).

Thus the Dirichlet control-to-state map is continuous from H1/2(Γ)H^{1/2}(\Gamma) into H1(Ω)H^1(\Omega).


Dirichlet Gradient and Regularity Warning

For Dirichlet control, it is useful to use the lifted variable y=w+Ruy=w+Ru, with wH01(Ω)w\in H_0^1(\Omega). The weak constraint is

a(w,v)F(v)+a(Ru,v)=0vH01(Ω).a(w,v)-F(v)+a(Ru,v)=0 \qquad \forall v\in H_0^1(\Omega).

The Lagrangian is

L(w,u,p)=12w+RuydL2(Ω)2+α2uU2[a(w,p)F(p)+a(Ru,p)],\mathcal L(w,u,p) = \frac12\|w+Ru-y_d\|_{L^2(\Omega)}^2 + \frac\alpha2\|u\|_U^2 - \bigl[ a(w,p)-F(p)+a(Ru,p) \bigr],

where pH01(Ω)p\in H_0^1(\Omega) is the Lagrange multiplier. Stationarity with respect to pp gives the lifted state equation. Stationarity with respect to ww gives, for every ηH01(Ω)\eta\in H_0^1(\Omega),

DwL(w,u,p)η=(yyd,η)L2(Ω)a(η,p)=0.D_w\mathcal L(w,u,p)\,\eta = (y-y_d,\eta)_{L^2(\Omega)} - a(\eta,p) =0.

Therefore the adjoint equation is

a(η,p)=(yyd,η)L2(Ω)ηH01(Ω).a(\eta,p)=(y-y_d,\eta)_{L^2(\Omega)} \qquad \forall \eta\in H_0^1(\Omega).

The control derivative is obtained by differentiating the same Lagrangian with respect to uu. For hU=H1/2(Γ)h\in U=H^{1/2}(\Gamma),

DuL(w,u,p)h=αRUu,hU,U+(yyd,Rh)L2(Ω)a(Rh,p).D_u\mathcal L(w,u,p)\,h = \alpha\langle R_U u,h\rangle_{U',U} + (y-y_d,Rh)_{L^2(\Omega)} - a(Rh,p).

The subtle point is that RhRh is generally not an element of H01(Ω)H_0^1(\Omega), because its trace is hh. Therefore one cannot simply insert RhRh as a test function in the adjoint equation, which is valid only for zero-trace variations.

Assume for a moment that pp is smooth enough for Green’s identity to be used classically. Since the adjoint satisfies

Δp+σp=yydin Ω,p=0on Γ,-\Delta p+\sigma p=y-y_d \qquad\text{in }\Omega, \qquad p=0 \qquad\text{on }\Gamma,

we compute

a(Rh,p)=Ω(Rh)pdx+σΩRhpdx.a(Rh,p) = \int_\Omega \nabla(Rh)\cdot\nabla p\,dx + \sigma\int_\Omega Rh\,p\,dx.

Integrating the first term by parts gives

a(Rh,p)=ΩRh(Δp+σp)dx+Γγ(Rh)npds.a(Rh,p) = \int_\Omega Rh(-\Delta p+\sigma p)\,dx + \int_\Gamma \gamma(Rh)\,\partial_n p\,ds.

Using γ(Rh)=h\gamma(Rh)=h and the strong adjoint equation, this becomes

a(Rh,p)=(yyd,Rh)L2(Ω)+Γhnpds.a(Rh,p) = (y-y_d,Rh)_{L^2(\Omega)} + \int_\Gamma h\,\partial_n p\,ds.

Hence the two non-regularization terms in the Lagrangian control derivative reduce to the boundary term

(yyd,Rh)L2(Ω)a(Rh,p)=np,hH1/2,H1/2.(y-y_d,Rh)_{L^2(\Omega)}-a(Rh,p) = -\langle \partial_n p,h\rangle_{H^{-1/2},H^{1/2}}.

Hence the control stationarity condition is naturally written as

αRUuˉnpˉ=0in H1/2(Γ),\alpha R_U \bar u-\partial_n \bar p=0 \qquad\text{in }H^{-1/2}(\Gamma),

where RU:UUR_U:U\to U' is the Riesz map of H1/2(Γ)H^{1/2}(\Gamma).

This is the key distinction from Neumann control:

An L2(Γ)L^2(\Gamma) projection formula for Dirichlet control is therefore a formal or additional-regularity statement. It is valid, for instance, if npL2(Γ)\partial_n p\in L^2(\Gamma) and the control is regularized in L2(Γ)L^2(\Gamma):

uˉ=P[ua,ub](1αnpˉ).\bar u = P_{[u_a,u_b]} \left( \frac1\alpha\partial_n\bar p \right).

Without such regularity, the correct condition is the variational inequality in the duality pairing between H1/2(Γ)H^{-1/2}(\Gamma) and H1/2(Γ)H^{1/2}(\Gamma).


Parameter Estimation Problem

We now consider an inverse problem in which the unknown is a coefficient in the PDE. Let

0<qminq(x)qmax<a.e. in Ω.0<q_{\min}\le q(x)\le q_{\max}<\infty \qquad\text{a.e. in }\Omega.

For each admissible coefficient qq, the state y=y(q)H01(Ω)y=y(q)\in H_0^1(\Omega) is defined by

Ωqyvdx=f,vH1,H01vH01(Ω).\int_\Omega q\nabla y\cdot\nabla v\,dx = \langle f,v\rangle_{H^{-1},H_0^1} \qquad \forall v\in H_0^1(\Omega).

In strong form this corresponds to

{(qy)=fin Ω,y=0on Γ.\begin{cases} -\nabla\cdot(q\nabla y)=f & \text{in }\Omega,\\ y=0 & \text{on }\Gamma. \end{cases}

The coefficient identification problem is to choose qq so that y(q)y(q) fits observed data ydy_d. A standard Tikhonov functional is

j(q)=12y(q)ydL2(Ω)2+α2qqrefL2(Ω)2.j(q) = \frac12\|y(q)-y_d\|_{L^2(\Omega)}^2 + \frac\alpha2\|q-q_{\mathrm{ref}}\|_{L^2(\Omega)}^2.

The admissible set is often taken as

Qad={qL(Ω):qaqqb a.e. in Ω},Q_{\mathrm{ad}} = \{q\in L^\infty(\Omega): q_a\le q\le q_b \text{ a.e. in }\Omega\},

with 0<qaqb0<q_a\le q_b a.e.

For every qQadq\in Q_{\mathrm{ad}}, the bilinear form

aq(y,v):=Ωqyvdxa_q(y,v):=\int_\Omega q\nabla y\cdot\nabla v\,dx

is continuous and coercive on H01(Ω)H_0^1(\Omega):

aq(y,v)qL(Ω)yL2(Ω)vL2(Ω),|a_q(y,v)| \le \|q\|_{L^\infty(\Omega)} \|\nabla y\|_{L^2(\Omega)} \|\nabla v\|_{L^2(\Omega)},
aq(v,v)qminvL2(Ω)2.a_q(v,v) \ge q_{\min}\|\nabla v\|_{L^2(\Omega)}^2.

Lax-Milgram therefore gives a unique state y(q)y(q) and the stability estimate

y(q)H01(Ω)CPqminfH1(Ω).\|y(q)\|_{H_0^1(\Omega)} \le \frac{C_P}{q_{\min}}\|f\|_{H^{-1}(\Omega)}.

Sensitivity Equation for the Coefficient

The coefficient-to-state map is nonlinear. We derive its derivative.

Let δqL(Ω)\delta q\in L^\infty(\Omega) be a perturbation such that q+tδqq+t\delta q remains uniformly positive for small tt. Let

yt:=y(q+tδq),y:=y(q).y_t:=y(q+t\delta q), \qquad y:=y(q).

Subtracting the two weak state equations gives

Ωq(yty)vdx=tΩδqytvdxvH01(Ω).\int_\Omega q\nabla(y_t-y)\cdot\nabla v\,dx = -t\int_\Omega \delta q\nabla y_t\cdot\nabla v\,dx \qquad \forall v\in H_0^1(\Omega).

Divide by tt and pass formally to the limit t0t\to0. The sensitivity

z:=y(q)δqz:=y'(q)\delta q

solves

Ωqzvdx=ΩδqyvdxvH01(Ω).\int_\Omega q\nabla z\cdot\nabla v\,dx = -\int_\Omega \delta q\nabla y\cdot\nabla v\,dx \qquad \forall v\in H_0^1(\Omega).

In strong form,

{(qz)=(δqy)in Ω,z=0on Γ.\begin{cases} -\nabla\cdot(q\nabla z)=\nabla\cdot(\delta q\nabla y) & \text{in }\Omega,\\ z=0 & \text{on }\Gamma. \end{cases}

The equation is well posed because the right-hand side is bounded on H01(Ω)H_0^1(\Omega):

ΩδqyvdxδqL(Ω)yL2(Ω)vL2(Ω).\left| \int_\Omega \delta q\nabla y\cdot\nabla v\,dx \right| \le \|\delta q\|_{L^\infty(\Omega)} \|\nabla y\|_{L^2(\Omega)} \|\nabla v\|_{L^2(\Omega)}.

Thus

zH01(Ω)CδqL(Ω)yH01(Ω).\|z\|_{H_0^1(\Omega)} \le C \|\delta q\|_{L^\infty(\Omega)} \|y\|_{H_0^1(\Omega)}.

This proves that the derivative maps bounded coefficient perturbations to state perturbations continuously.


Adjoint Gradient for Parameter Estimation

We again derive the adjoint and the gradient through the Lagrangian. The weak state constraint is

Ωqyvdxf,vH1,H01=0vH01(Ω).\int_\Omega q\nabla y\cdot\nabla v\,dx - \langle f,v\rangle_{H^{-1},H_0^1} =0 \qquad \forall v\in H_0^1(\Omega).

Introduce

L(y,q,p)=12yydL2(Ω)2+α2qqrefL2(Ω)2[Ωqypdxf,pH1,H01].\mathcal L(y,q,p) = \frac12\|y-y_d\|_{L^2(\Omega)}^2 + \frac\alpha2\|q-q_{\mathrm{ref}}\|_{L^2(\Omega)}^2 - \left[ \int_\Omega q\nabla y\cdot\nabla p\,dx - \langle f,p\rangle_{H^{-1},H_0^1} \right].

Stationarity with respect to pp gives the state equation. Stationarity with respect to yy gives, for every ηH01(Ω)\eta\in H_0^1(\Omega),

DyL(y,q,p)η=(yyd,η)L2(Ω)Ωqηpdx=0.D_y\mathcal L(y,q,p)\,\eta = (y-y_d,\eta)_{L^2(\Omega)} - \int_\Omega q\nabla \eta\cdot\nabla p\,dx =0.

Therefore

Ωqηpdx=(yyd,η)L2(Ω)ηH01(Ω).\int_\Omega q\nabla \eta\cdot\nabla p\,dx = (y-y_d,\eta)_{L^2(\Omega)} \qquad \forall \eta\in H_0^1(\Omega).

Equivalently, in strong form,

{(qp)=yydin Ω,p=0on Γ.\begin{cases} -\nabla\cdot(q\nabla p)=y-y_d & \text{in }\Omega,\\ p=0 & \text{on }\Gamma. \end{cases}

The coefficient derivative is

DqL(y,q,p)δq=Ω[α(qqref)yp]δqdx.D_q\mathcal L(y,q,p)\,\delta q = \int_\Omega \left[ \alpha(q-q_{\mathrm{ref}}) - \nabla y\cdot\nabla p \right] \delta q\,dx.

Since the state and adjoint equations are precisely the stationarity conditions in pp and yy, this is the derivative of the reduced functional:

j(q)δq=DqL(y,q,p)δq.j'(q)\delta q = D_q\mathcal L(y,q,p)\,\delta q.

If the product yp\nabla y\cdot\nabla p belongs to L2(Ω)L^2(\Omega), then the L2(Ω)L^2(\Omega)-gradient is

j(q)=α(qqref)yp.\nabla j(q) = \alpha(q-q_{\mathrm{ref}}) - \nabla y\cdot\nabla p.

In the minimal energy setting, y,pL2(Ω)d\nabla y,\nabla p\in L^2(\Omega)^d, so the product is a priori only in L1(Ω)L^1(\Omega). Therefore, the formula is always valid as a duality expression, while interpreting it as an L2L^2 gradient requires additional regularity or a discrete finite element setting.


Box Constraints for Parameter Estimation

Let qˉQad\bar q\in Q_{\mathrm{ad}} be a local minimizer, with state yˉ\bar y and adjoint pˉ\bar p. The first-order condition is the variational inequality

j(qˉ)(rqˉ)0rQad.j'(\bar q)(r-\bar q)\ge0 \qquad \forall r\in Q_{\mathrm{ad}}.

Using the adjoint formula, this reads

Ω[α(qˉqref)yˉpˉ](rqˉ)dx0rQad.\int_\Omega \left[ \alpha(\bar q-q_{\mathrm{ref}}) - \nabla\bar y\cdot\nabla\bar p \right] (r-\bar q)\,dx \ge0 \qquad \forall r\in Q_{\mathrm{ad}}.

When the gradient is an L2(Ω)L^2(\Omega) function, this is equivalent to the pointwise projection formula

qˉ=P[qa,qb](qref+1αyˉpˉ)a.e. in Ω.\bar q = P_{[q_a,q_b]} \left( q_{\mathrm{ref}} + \frac1\alpha\nabla\bar y\cdot\nabla\bar p \right) \qquad\text{a.e. in }\Omega.

The corresponding active-set interpretation is:


Existence and Regularization in Inverse Problems

For distributed and Neumann controls, the regularization term α2uL22\frac\alpha2\|u\|_{L^2}^2 gives coercivity in the same Hilbert space in which the control is optimized.

For coefficient estimation, the situation is more subtle:

In a finite element discretization, existence is automatic because the admissible set is finite-dimensional and closed. In an infinite-dimensional analysis, one usually strengthens the compactness by choosing, for example,

α2qqrefH1(Ω)2\frac\alpha2\|q-q_{\mathrm{ref}}\|_{H^1(\Omega)}^2

or a bounded-variation regularization. These choices improve compactness and make the inverse problem stable with respect to noisy data.

This is the essential role of regularization: it is not merely a numerical penalty, but part of the mathematical formulation that turns an ill-posed inverse problem into a well-posed optimization problem.


Numerical Structure

Both classes of problems lead to the same computational loop:

For Neumann boundary control, the discrete gradient contains a boundary mass matrix because the control inner product is on Γ\Gamma:

MΓG=αMΓU+MΓPΓ.M_\Gamma G = \alpha M_\Gamma U + M_\Gamma P_\Gamma.

For Dirichlet boundary control, the discrete gradient involves the discrete normal derivative of the adjoint. This is more sensitive to mesh quality and boundary approximation.

For parameter estimation, the gradient at a quadrature point has the form

gh=α(qhqref,h)yhph.g_h = \alpha(q_h-q_{\mathrm{ref},h}) - \nabla y_h\cdot\nabla p_h.

Thus the quality of the gradient depends on the accuracy of both state and adjoint gradients. This is why coefficient identification problems often require stabilization, mesh refinement, and careful treatment of noisy data.


Summary

Boundary control and parameter estimation fit into the same reduced optimization framework as distributed control, but the spaces change:

The main mathematical lessons are:


Suggested Reading

Tröltzsch

Manzoni-Quarteroni-Salsa

De Los Reyes