Bowen Li and Jun Zou

32
ESAIM: M2AN 55 (2021) 2013–2044 ESAIM: Mathematical Modelling and Numerical Analysis https://doi.org/10.1051/m2an/2021046 www.esaim-m2an.org AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE FOR AN ELECTROMAGNETIC CONSTRAINED OPTIMAL CONTROL PROBLEM Bowen Li 1 and Jun Zou 2,* Abstract. In this work, an adaptive edge element method is developed for an H(curl)-elliptic con- strained optimal control problem. We use the lowest-order NΒ΄ edΒ΄ elec’s edge elements of first family and the piecewise (element-wise) constant functions to approximate the state and the control, respectively, and propose a new adaptive algorithm with error estimators involving both residual-type error esti- mators and lower-order data oscillations. By using a local regular decomposition for H(curl)-functions and the standard bubble function techniques, we derive the a posteriori error estimates for the proposed error estimators. Then we exploit the convergence properties of the orthogonal 2 -projections and the mesh-size functions to demonstrate that the sequences of the discrete states and controls generated by the adaptive algorithm converge strongly to the exact solutions of the state and control in the energy- norm and 2 -norm, respectively, by first achieving the strong convergence towards the solution to a limiting control problem. Three-dimensional numerical experiments are also presented to confirm our theoretical results and the quasi-optimality of the adaptive edge element method. Mathematics Subject Classification. 65K10, 65N12, 65N15, 65N30, 49J20. Received August 8, 2020. Accepted August 17, 2021. 1. Introduction Many electromagnetic simulation problems involve the following H(curl)-elliptic equation [13, 44]: curl βˆ’1 curl y + y = f in Ξ©, (1.1) where and are the electric permittivity and the magnetic permeability, respectively. This problem is also encountered when the implicit time-stepping scheme is used for solving the full time-dependent Maxwell system [31]. In this work, we are interested in a relevant optimal control problem, namely to find a specially designed external current source profile so that the resulting electromagnetic field achieves an optimal target design. This optimal control problem has direct applications in many areas, such as the induction heating, the magnetic levitation and the electromagnetic material designs. We refer the readers to [12, 25, 63, 64] for some recent results on the theory and applications of electromagnetic optimal control problems. Keywords and phrases. Constrained optimal control, Maxwell’s equations, a posteriori error estimates; adaptive edge element method, convergence analysis of adaptive algorithm. 1 Department of Mathematics, Duke University, Durham, NC 27708, USA. 2 Department of Mathematics, The Chinese University of Hong Kong, Shatin, NT, Hong Kong. * Corresponding author: [email protected] c β—‹ The authors. Published by EDP Sciences, SMAI 2021 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Transcript of Bowen Li and Jun Zou

Page 1: Bowen Li and Jun Zou

ESAIM: M2AN 55 (2021) 2013–2044 ESAIM: Mathematical Modelling and Numerical Analysishttps://doi.org/10.1051/m2an/2021046 www.esaim-m2an.org

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE FORAN ELECTROMAGNETIC CONSTRAINED OPTIMAL CONTROL PROBLEM

Bowen Li1 and Jun Zou2,*

Abstract. In this work, an adaptive edge element method is developed for an H(curl)-elliptic con-strained optimal control problem. We use the lowest-order Nedelec’s edge elements of first family andthe piecewise (element-wise) constant functions to approximate the state and the control, respectively,and propose a new adaptive algorithm with error estimators involving both residual-type error esti-mators and lower-order data oscillations. By using a local regular decomposition for H(curl)-functionsand the standard bubble function techniques, we derive the a posteriori error estimates for the proposederror estimators. Then we exploit the convergence properties of the orthogonal 𝐿2-projections and themesh-size functions to demonstrate that the sequences of the discrete states and controls generated bythe adaptive algorithm converge strongly to the exact solutions of the state and control in the energy-norm and 𝐿2-norm, respectively, by first achieving the strong convergence towards the solution to alimiting control problem. Three-dimensional numerical experiments are also presented to confirm ourtheoretical results and the quasi-optimality of the adaptive edge element method.

Mathematics Subject Classification. 65K10, 65N12, 65N15, 65N30, 49J20.

Received August 8, 2020. Accepted August 17, 2021.

1. Introduction

Many electromagnetic simulation problems involve the following H(curl)-elliptic equation [13,44]:

curl(πœ‡βˆ’1curly

)+ 𝜎y = f in Ω, (1.1)

where 𝜎 and πœ‡ are the electric permittivity and the magnetic permeability, respectively. This problem is alsoencountered when the implicit time-stepping scheme is used for solving the full time-dependent Maxwell system[31]. In this work, we are interested in a relevant optimal control problem, namely to find a specially designedexternal current source profile so that the resulting electromagnetic field achieves an optimal target design. Thisoptimal control problem has direct applications in many areas, such as the induction heating, the magneticlevitation and the electromagnetic material designs. We refer the readers to [12, 25, 63, 64] for some recentresults on the theory and applications of electromagnetic optimal control problems.

Keywords and phrases. Constrained optimal control, Maxwell’s equations, a posteriori error estimates; adaptive edge elementmethod, convergence analysis of adaptive algorithm.

1 Department of Mathematics, Duke University, Durham, NC 27708, USA.2 Department of Mathematics, The Chinese University of Hong Kong, Shatin, NT, Hong Kong.*Corresponding author: [email protected]

c The authors. Published by EDP Sciences, SMAI 2021

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0),which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Page 2: Bowen Li and Jun Zou

2014 B. LI AND J. ZOU

The main purpose of this work is to design and analyse an adaptive edge element method for a class of optimalcontrol problems with the applied current density u being the control variable and the magnetic potential ybeing the state variable. Let y𝑑 and u𝑑 be our target magnetic field and control, and Uπ‘Žπ‘‘ denote the unboundedclosed convex admissible set:

u ∈ L2(Ξ©); u β‰₯ πœ“ a.e. in Ξ©, (1.2)

where πœ“ is some obstacle function with a certain regularity. Then we can write the control problem as aconstrained minimization problem: find a pair (y*,u*) ∈ H0(curl, Ξ©)Γ—Uπ‘Žπ‘‘ to minimize the quadratic objectivefunctional:

𝐽(y,u) :=12β€–curl yβˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–uβˆ’ u𝑑‖20,Ξ©, (1.3)

subject to the H(curl)-elliptic equation:

curl(πœ‡βˆ’1curl y

)+ 𝜎y = f + u in Ω. (1.4)

By considering a simple transformation u = u βˆ’ πœ“, we may set πœ“ in (1.2) to zero in our subsequent analysisfor the sake of simplicity, that is, the admissible set shall take the form:

Uπ‘Žπ‘‘ :=u ∈ L2(Ξ©); u β‰₯ 0 a.e. in Ξ©

.

In this work, we are interested in the numerical study of the practically important situation where the localsingularities may be expected for the solution to the optimal control problem (1.3) and (1.4), due to the possibleirregular geometry of the domain Ξ©, the (possibly large) jumps across the interface between two different physicalmedia and the non-smooth source terms (cf. [19, 20]). In these cases, one typically needs to solve the problemon a very fine mesh to fully resolve the singularities, while it may be computationally expensive on account ofthe exponential growth of the number of degrees of freedom (DoFs). To balance the computational cost andthe numerical accuracy, it has proved to be more promising to refine the mesh adaptively when the numericalaccuracy is insufficient.

The theory of adaptive finite element methods (AFEMs), due to the pioneer work of Babuska and Rheinboldt[7] in 1978 for elliptic boundary value problems, has become very popular and well developed in the past fourdecades; see [2, 48] and the references therein for an overview. The relevant research in the case of Maxwell’sequations, which dated back to Beck et al. [10], has also reached a mature level nowadays (cf. [34, 53, 67, 68]).However, the theory of AFEMs for PDE-constrained optimization problems is still a work in progress. Recently,Gong and Yan [27] proved the convergence and quasi-optimality of AFEMs for an elliptic optimal controlproblem with the help of the variational discretization and the duality argument, and in this setting there is noneed to mark the data oscillations as in [26, 29]. In [42, 43], the authors considered the elliptic optimal controlproblems with integral control constraints and gave a rigorous proof of the convergence and the quasi-optimality.People may find more recent advances on AFEMs for the control problems in [5, 11,28,38].

The first contribution towards the adaptive methods for optimal control problems of Maxwell’s equationswent back to Hoppe and Yousept [35]. An AFEM was designed for the same model problem (1.3), (1.4), andthe a posteriori error estimation was also presented, including the reliability and efficiency, with Nedelec’s edgeelement approximations for both the control and the state. But the convergence of the adaptive method wasnot established in [35]. Later, Pauly and Yousept [49] considered an optimal control problem of first-ordermagnetostatic equations and presented the a posteriori error analysis. Meanwhile, Xu and Zou [61] consideredthe adaptive approximation for an optimal control problem associated with an electromagnetic saddle-pointmodel and proved the convergence of discrete solutions.

The current work is a continuation and further improvement of the work [35], where a piecewise linearH(curl)-conforming edge element was used to approximate the optimal control variable u. One crucial mo-tivation of our work is based on the observation that the optimal control u* is generally of low regularity,at most H1 (still under some reasonable conditions) (cf. Prop. 2.1). Therefore we can not expect a better𝐿2-approximation accuracy of the control using any higher-order approximation than the piecewise constant

Page 3: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2015

one (i.e., the simplest, and also cheapest, approximation). Because of this, we shall use the piecewise constantfunctions to approximate the control variable u, and propose a new adaptive algorithm with error indicatorsincorporating the residual-type error estimators and the lower-order data oscillations. As pointed out in [39,40],the precise structure of the admissible set and the way we discretize the distributed control problem are thecritical ingredients when designing an adaptive algorithm, which often influence the analysis of the resultingAFEM essentially. It turns out that in our case here, due to the pointwise unilateral constraints in the admissibleset Uπ‘Žπ‘‘, the inconsistency between the discrete spaces of the state and control as well as the low regularityof the given data, the data oscillation plays an important role in the performance of the algorithm and hasto be considered in the error estimates (as well as the marking strategy) to guarantee the convergence of thealgorithm. We can readily notice that such a change in the approximation of the control can significantly reducethe number of DoFs and make our algorithm much easier to implement than the algorithm in [35], while stillpreserving the same numerical accuracy; see Section 5 for the detailed discussions and the implementation issueon the algorithm. To theoretically justify the effectiveness of our adaptive algorithm, we use a Clement-typequasi-interpolation operator established by Schoberl [53] and the bubble functions [2] to derive the a posteriorierror estimates (reliability and efficiency estimates) of the new error estimators. It is worth mentioning thatthere is a technical issue in [35] when deriving the optimality system for the discrete optimization problem, thatis, the first-order optimality condition (2.8d) there cannot imply its complementarity condition (2.10) as theedge element discretization was adopted for the control (hence part of the estimates provided in [35] needs somenecessary modifications). This is another motivation of the current work. Instead, in our a posteriori analysis, weshall directly cope with the original forms of the first-order optimality conditions by using the contraction prop-erties of 𝐿2-projections without the help of the complementarity problems (although they hold in our setting),so that our estimates can be established in a rigorous manner. Apart from the aforementioned contributionstowards the algorithmic aspects, another main contribution of this work is the strong convergence of the finiteelement solutions generated by the proposed adaptive method. This work appears to be the first one on theconvergence analysis of AFEMs applied to the electromagnetic optimal control problem involving a variationalinequality structure. In the finite element analysis provided in [63] for a quasilinear H(curl)-elliptic optimalcontrol problem, the discrete compactness of the Nedelec edge elements [36] is the main tool in establishingstrong convergence. However, the discrete compactness can only be applied to the discrete divergence-free se-quence, and it is still unknown whether it is true for adaptively generated meshes. During the course of ourconvergence analysis, we shall borrow some techniques from the nonlinear optimization to avoid the need ofdiscrete compactness and exploit some strategies with limiting spaces, which was initially adopted in [9] andthen developed systemically in [45]. To overcome the difficulties arising from the structure of the constraint setand its discretization, we establish a convergence property of 𝐿2-projections associated with the meshes (cf.Prop. 4.2), which connects the convergence behaviors of the adaptive meshes and the discrete spaces. It furtherallows us to define a limiting optimization problem and characterize the limit point of the discrete controlsand states (cf. Thms. 4.4 and 4.5). Hereafter, we show that the maximal error estimators and the residualscorresponding to the sequence of adaptive states and adjoint states vanish (cf. Lems. 4.6 and 4.7), where we usea generalized convergence result of the mesh-size functions (cf. (4.4)), instead of introducing the buffer layer(cf. [61]) between the meshes at different levels, so that our proof can be significantly simplified. By meansof these auxiliary results and again the properties of 𝐿2-projections, we are able to prove that the limit pointof the discrete triplets (y*π‘˜,p*π‘˜,u*π‘˜) solves the variational inequality associated with the control problem (cf.Thm. 4.8). Hence, the strong convergence of Algorithm 1, i.e., Theorem 4.1, follows immediately.

This work is organized as follows. In Section 2, we briefly review the electromagnetic optimal control problemand consider its finite element approximation. Then we propose the a posteriori error estimator and design anadaptive algorithm. We show that the error estimator is both reliable and efficient in Section 3. After that,we address the issue of the convergence of the adaptive solutions in Section 4. Finally, some numerical resultsare presented in Section 5 to illustrate the theoretical results and indicate the effectiveness and robustness ofthe adaptive approach versus the uniform mesh refinement. We end this section with a shorthand notation:π‘₯ . 𝑦 for π‘₯ ≀ 𝐢𝑦 for some generic constant 𝐢 that is independent of the mesh size, but may depend on other

Page 4: Bowen Li and Jun Zou

2016 B. LI AND J. ZOU

quantities, e.g., 𝜎, πœ‡βˆ’1, 𝑓 and the shape regularity of the initial triangulation T0. If π‘₯ ≀ 𝐢𝑦 and π‘₯ β‰₯ 𝐢𝑦 holdsimultaneously, we denote it simply by π‘₯ β‰ˆ 𝑦.

2. The optimal control problem and its discretization

We start by introducing the standard notation for Sobolev spaces and the involved physical parameters (cf.[1,44]). Let Ξ© ∈ R3 be a bounded polyhedral domain with polyhedral Lipschitz subdomains Ω𝑖, 1 ≀ 𝑖 ≀ π‘š, suchthat

Ω𝑖 ∩ Ω𝑗 = βˆ… for 𝑖 = 𝑗 and Ξ© =π‘šβ‹ƒ

𝑖=1

Ω𝑗 .

Let 𝐺 be an open bounded subset of Ξ© with the Lipschitz continuous boundary πœ•πΊ and n being its unitoutward normal vector. For any real 𝑠 > 0, we define the Hilbert space 𝐻𝑠(𝐺) (resp.H𝑠(𝐺) := 𝐻𝑠(𝐺, R3)) forSobolev scalar functions (resp. vector fields) of order 𝑠 with an inner product (Β·, Β·)𝑠,𝐺 and a norm ‖·‖𝑠,𝐺. Wealso introduce the spaces:

H(curl, 𝐺) = v ∈ L2(𝐺); curl v ∈ L2(𝐺) and H(div, 𝐺) = v ∈ L2(𝐺); divv ∈ 𝐿2(𝐺),

equipped with the norm β€–vβ€–curl,𝐺 = (β€–vβ€–20.𝐺 + β€–curl vβ€–20,𝐺)1/2 and β€–vβ€–div,𝐺 = (β€–vβ€–20.𝐺 + β€–divvβ€–20,𝐺)1/2, respec-tively. Here and in what follows, a bold typeface is used to indicate a vector-valued function. The tangentialtrace mapping 𝛾𝑑(u) := nΓ— u and the normal trace mapping 𝛾𝑛(u) := u Β·n are well-defined on H(curl, 𝐺) andH(div, 𝐺), respectively. Then the zero tangential trace space can be introduced by

H0(curl, 𝐺) = v ∈ H(curl, 𝐺); 𝛾𝑑(v) = 0 on πœ•πΊ.

We will also use the fractional order curl-space

H𝑠(curl, Ξ©) = u ∈ H𝑠(Ξ©); curlu ∈ H𝑠(Ξ©), 𝑠 > 0, (2.1)

equipped with the norm ‖·‖𝐻𝑠(curl,Ξ©) := ‖·‖𝑠,Ξ© +β€–curl·‖𝑠,Ξ©. In what follows, we write V for the most frequentlyused Sobolev space H0(curl, Ξ©).

We are now well-prepared for the mathematical formulation of the control problem. In this work, we focuson the following constrained minimization problem:

minu∈Uπ‘Žπ‘‘

𝐽(u) :=12β€–curly(u)βˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–uβˆ’ u𝑑‖20,Ξ©, (2.2)

where the state y(u) ∈ V is the solution to the variational system:

(πœ‡βˆ’1curly, curlπœ‘)0,Ξ© + (𝜎y,πœ‘)0,Ξ© = (f + u,πœ‘)0,Ξ© βˆ€ πœ‘ ∈ V. (2.3)

We assume that the given source term f ∈ L2(Ω) satisfies

f |Ω𝑖 ∈ H(div, Ω𝑖) and 𝛾𝑛(f) ∈ L2(πœ•Ξ©π‘–), 1 ≀ 𝑖 ≀ π‘š, (2.4)

while the target magnetic field y𝑑 and the target control u𝑑 satisfy

y𝑑 ∈ H0(curl, Ξ©) and u𝑑 ∈ L2(Ξ©). (2.5)

We remark that although f is not in H(div) globally in Ξ©, namely f /∈ H(div, Ξ©), div f is well-defined on each Ω𝑖

(hence almost everywhere on Ξ©) by the assumption (2.4). In what follows, we write div f forβˆ‘π‘š

𝑖=1 div fπœ’Ξ©π‘–, by

abuse of notation, where πœ’Ξ©π‘– is the characteristic function of Ω𝑖. It is also clear that the weak divergence of f isglobally defined on Ξ© if and only if the jump of the normal trace [𝛾𝑛(f)]Ξ“ across the interfaces Ξ“ := βˆͺπ‘š

𝑖=1πœ•Ξ©π‘–βˆ–πœ•Ξ©

Page 5: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2017

vanishes. The physical parameters πœ‡ and 𝜎 are supposed to be polynomials on each subdomain Ω𝑖 (hencepiecewise polynomials on Ξ©) and satisfy πœ‡(π‘₯) β‰₯ πœ‡0 > 0 and 𝜎(π‘₯) β‰₯ 𝜎0 > 0. The real 𝛼 > 0 is a stabilizationparameter. We define the symmetric bilinear form 𝐡(Β·, Β·) associated with the state equation (2.3):

𝐡(u, v) :=(πœ‡βˆ’1curlu, curlv

)0,Ξ©

+ (𝜎u, v)0,Ω.

It readily follows from the assumptions of the physical parameters that 𝐡 is continuous and coercive, thusdefining an inner product on H(curl, Ξ©) whose induced energy norm ‖·‖𝐡 :=

√𝐡(Β·, Β·) is equivalent to β€–Β·β€–curl,Ξ©.

By the direct method in the calculus of variations (see, for instance, [58], Thms. 2.14–2.16) and noting thatthe cost functional 𝐽(u) is weakly lower semicontinuous and strictly convex, we have that there always exists aunique minimizer u* ∈ Uπ‘Žπ‘‘ to the optimization problem. Then the first-order optimality condition yields(

curly(u*)βˆ’ y𝑑, curly(u)βˆ’ curly(u*))0,Ξ©

+ 𝛼(u* βˆ’ u𝑑,uβˆ’ u*

)0,Ξ©

β‰₯ 0 βˆ€ u ∈ Uπ‘Žπ‘‘, (2.6)

since 𝐽(u) is Frechet differentiable. In what follows, we shall write y* for y(u*) for short. If we introduce theadjoint state p ∈ V associated with y ∈ V by

𝐡(p,πœ‘) =(curlyβˆ’ y𝑑, curlπœ‘

)0,Ξ©

βˆ€ πœ‘ ∈ V,

equation (2.6) can be equivalently written as the following optimality system in terms of the state and theadjoint state, as well as the control variable:⎧βŽͺβŽͺ⎨βŽͺβŽͺ⎩

𝐡(y*,πœ‘) = (f + u*,πœ‘)0,Ξ© βˆ€ πœ‘ ∈ V,

𝐡(p*,πœ‘) =(curly* βˆ’ y𝑑, curlπœ‘

)0,Ξ©

βˆ€ πœ‘ ∈ V,(p* + 𝛼

(u* βˆ’ u𝑑

),uβˆ’ u*

)0,Ξ©

β‰₯ 0 βˆ€ u ∈ Uπ‘Žπ‘‘.

(2.7a)

(2.7b)

(2.7c)

If we further define the Lagrange multiplier (or the adjoint control) for the variational inequality (2.7c):

βˆ’πœ†* = βˆ’p* βˆ’ 𝛼(u* βˆ’ u𝑑), (2.8)

then (2.7c) can be reformulated in a compact form:

βˆ’πœ†* ∈ πœ•πΌUπ‘Žπ‘‘(u*), (2.9)

where πœ•πΌUπ‘Žπ‘‘ is the subdifferential of the indicator function of the admissible set Uπ‘Žπ‘‘ [33]. We remark that πœ†* isactually the Frechet derivative of the functional 𝐽(u). We then conclude from (2.7c), or (2.9), that u* is nothingelse than the 𝐿2-projection of βˆ’p*

𝛼 + u𝑑 on Uπ‘Žπ‘‘, i.e.,

u* = PUπ‘Žπ‘‘

(βˆ’p*

𝛼+ u𝑑

). (2.10)

Here and throughout this work, we denote by P𝐸 the 𝐿2-projection on the convex subset 𝐸 of L2(Ξ©). It is worthmentioning that PUπ‘Žπ‘‘ in (2.10) can also be understood in the pointwise sense due to the unilateral form of Uπ‘Žπ‘‘:

u* = max0,βˆ’p*

𝛼+ u𝑑

a.e. in Ξ©,

where the maximum is taken componentwise. We now end our discussion on the continuous optimal controlproblem with some regularity results.

Proposition 2.1. Assume that 𝜎 is constant and Ξ© is convex or of class 𝐢1,1. If u𝑑 ∈ H1(Ξ©) and 𝑓 ∈ H(div, Ξ©),then the triplet (y*,p*,u*) satisfying the optimality system (2.7a)–(2.7c) has the regularity: u*,y*,p* ∈ H1(Ξ©).If we further assume that πœ‡ is constant, we have y*,p* ∈ H1(curl, Ξ©).

Page 6: Bowen Li and Jun Zou

2018 B. LI AND J. ZOU

Proof. It is clear that (2.7b) is equivalent to the following equation:

curlπœ‡βˆ’1curlp* + 𝜎p* = curl(curly* βˆ’ y𝑑

)in Ξ©, nΓ— p* = 0 on πœ•Ξ©, (2.11)

in the distributional sense. By taking the divergence on both sides of (2.11), we have div(𝜎p*) = 0. Recall thatthe space H0(curl, Ξ©) ∩H(div, Ξ©) is continuously imbedded in H1(Ξ©), if Ξ© is convex or of class 𝐢1,1 (cf. [6]).Since 𝜎 is assumed to be constant, we readily see p* ∈ H0(curl, Ξ©)∩H(div, Ξ©) β†’Λ“ H1(Ξ©). Noting that PUπ‘Žπ‘‘ is acontinuous mapping from H1(Ξ©) to H1(Ξ©) (cf. [24], Sect. 5.10, [37]), we obtain from (2.10) and u𝑑 ∈ H1(Ξ©) that𝑒* ∈ H1(Ξ©). Then using the fact that f+u* ∈ H(div, Ξ©), a similar argument applying to y* yields y* ∈ H1(Ξ©).

If we further assume that πœ‡ is constant, it follows from the equation (2.7a) and de Rham diagram ([44], (3.60))that curly* belongs to the space H(curl, Ξ©)∩H0(div, Ξ©), which is also continuously imbedded in H1(Ξ©). Hence,there holds curly* ∈ H1(Ξ©), which gives y* ∈ H1(curl, Ξ©) (see (2.1)). Similarly, we can conclude by (2.7b)that p* ∈ H1(curl, Ξ©), under the assumption (2.5) that y𝑑 ∈ H0(curl, Ξ©). The proof is complete.

Remark 2.2. The above result essentially relies on the regularity theory for Maxwell’s equations (cf. [3, 4, 60,62]). Here, we consider the case where the coefficients are isotropic and constant, so that the proof is direct.By using the recent result ([3], Thm. 1), one can similarly obtain the following generalization. Assume thatπœ‡ is constant, 𝜎 ∈ π‘Š 1,3+𝛿(Ξ©) for some 𝛿 > 0 and Ξ© is of class 𝐢1,1. If u𝑑 ∈ H1(Ξ©) and 𝑓 ∈ H(div, Ξ©) holdas above, then the solution (y*,p*,u*) to the optimality system (2.7a)–(2.7c) still satisfies u* ∈ H1(Ξ©) andy*,p* ∈ H1(curl, Ξ©).

From the above discussion about the regularity, we can see that if there are no additional assumptions onthe physical coefficients, the given data and the domain Ξ©, the optimal control u* should generally be of lowregularity (less regular than H1). Hence we may expect from the standard (β„Ž-version) FEM theory that onuniform meshes, the simplest piecewise constant approximation can already achieve the optimal approximationaccuracy, and any higher-order approximations can not improve the numerical accuracy of the optimal control.In fact, this is still true even if the optimal control u* has the H1-regularity, since we aim only at the 𝐿2-errorof the control variable. This is one of the main motivations of the current work. We would like to further remarkthat if the solution is piecewise analytic (singularity is allowed on a lower dimensional manifold), one may usegeneral versions of FEM (e.g. β„Žπ‘-FEM) to get better convergence rate, even the exponential rate [8, 54, 56, 57](but the concrete design and implementation of such algorithms would be typically difficult and beyond thescope of this work).

The rest of this section is devoted to the finite element discretization of the control problem. We consider afamily of conforming and shape regular triangulations Tβ„Ž of Ξ©, which coincide with the partition Ξ© =

βˆπ‘šπ‘–=1 Ω𝑖

in the sense that Ω𝑖 =⋃

π‘‡βŠ‚Ξ©π‘–π‘‡ . We set β„Ž := maxπ‘‡βˆˆTβ„Ž

β„Žπ‘‡ , where β„Žπ‘‡ denotes the diameter of 𝑇 . We also referto Fβ„Ž :=

β‹ƒπœ•π‘‡ ∩ Ξ©; 𝑇 ∈ Tβ„Ž, the set of all the interior faces, as the skeleton of the triangulation Tβ„Ž. The

diameter of a face 𝐹 is denoted by β„ŽπΉ . In our algorithm, we use the lowest-order H(curl)-conforming edgeelement space of Nedelec’s first family with zero tangential trace:

Vβ„Ž =vβ„Ž ∈ H0(curl, Ξ©); vβ„Ž|𝑇 = a𝑇 Γ— x + b𝑇 with a𝑇 , b𝑇 ∈ R3, 𝑇 ∈ Tβ„Ž

,

to approximate both the state and the adjoint state. For approximating the control variable, we employ thesimplest space of element-wise constant functions with respect to the triangulation Tβ„Ž:

Uβ„Ž =uβ„Ž ∈ L2(Ξ©); uβ„Ž|𝑇 ∈ 𝑃0(𝑇 ), 𝑇 ∈ Tβ„Ž

.

Here and in what follows, π‘ƒπ‘˜(𝑇 ) is the space of polynomials of degree ≀ π‘˜ over 𝑇 . The discrete admissible setUπ‘Žπ‘‘

β„Ž is then given byUπ‘Žπ‘‘

β„Ž := Uβ„Ž ∩Uπ‘Žπ‘‘ = uβ„Ž ∈ Uβ„Ž; uβ„Ž β‰₯ 0.With the help of these notions, we formulate the discrete optimal control problem:

minimize 𝐽(yβ„Ž,uβ„Ž) =12β€–curlyβ„Ž βˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–uβ„Ž βˆ’ u𝑑‖20,Ξ© (2.12)

Page 7: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2019

over (yβ„Ž,uβ„Ž) ∈ Vβ„Ž Γ—Uπ‘Žπ‘‘β„Ž

subject to 𝐡(yβ„Ž,πœ‘β„Ž) = (f + uβ„Ž,πœ‘β„Ž)0,Ξ© βˆ€ πœ‘β„Ž ∈ Vβ„Ž. (2.13)

As in the continuous case, the existence and uniqueness of the solution to the discrete system (2.12), (2.13) canbe guaranteed, and the corresponding optimality system reads as follows: find (y*β„Ž,p*β„Ž,u*β„Ž) ∈ Vβ„Ž Γ—Vβ„Ž Γ—Uπ‘Žπ‘‘

β„Ž

by solving ⎧βŽͺβŽͺ⎨βŽͺβŽͺ⎩𝐡(y*β„Ž,πœ‘β„Ž) = (f + u*β„Ž,πœ‘β„Ž)0,Ξ© βˆ€ πœ‘β„Ž ∈ Vβ„Ž,

𝐡(p*β„Ž,πœ‘β„Ž) =(curly*β„Ž βˆ’ y𝑑, curlπœ‘β„Ž

)0,Ξ©

βˆ€ πœ‘β„Ž ∈ Vβ„Ž,(Pβ„Žp

*β„Ž + 𝛼

(u*β„Ž βˆ’ u𝑑

β„Ž

),uβ„Ž βˆ’ u*β„Ž

)0,Ξ©

β‰₯ 0 βˆ€ uβ„Ž ∈ Uπ‘Žπ‘‘β„Ž ,

(2.14a)

(2.14b)

(2.14c)

where Pβ„Ž denotes the 𝐿2-projection PUβ„Ž:

(Pβ„Žv)𝑇 :=1|𝑇 |

βˆ«π‘‡

v(x)dx : L2(Ξ©) β†’ Uβ„Ž, (2.15)

and the approximate target control uπ‘‘β„Ž in (2.14c) is given by Pβ„Žu

𝑑. Similarly, we denote by yβ„Ž(uβ„Ž) the solutionto (2.13) and introduce the discrete reduced cost functional:

π½β„Ž(u) :=12β€–curlyβ„Ž(uβ„Ž)βˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–uβ„Ž βˆ’ u𝑑‖20,Ξ©,

then its Frechet derivative is given by (one can actually show πœ†*β„Ž ∈ Uπ‘Žπ‘‘β„Ž , i.e., πœ†*β„Ž β‰₯ 0; see (2.18))

πœ†*β„Ž = Pβ„Žp*β„Ž + 𝛼(u*β„Ž βˆ’ u𝑑

β„Ž)∈ Uβ„Ž. (2.16)

We also see from (2.14c) and the relation PUπ‘Žπ‘‘β„Ž

= PUπ‘Žπ‘‘Pβ„Ž that

u*β„Ž = PUπ‘Žπ‘‘β„Ž

(βˆ’p*β„Ž

𝛼+ u𝑑

)= PUπ‘Žπ‘‘

(βˆ’Pβ„Žp

*β„Ž

𝛼+ u𝑑

β„Ž

), (2.17)

which directly implies a pointwise representation for the discrete optimal control:

u*β„Ž = max0,βˆ’Pβ„Žp

*β„Ž

𝛼+ u𝑑

β„Ž

. (2.18)

3. Adaptive algorithm and the a posteriori error analysis

In this section, we give a complete discussion of the adaptive edge element method for solving the H(curl)-elliptic control problem and derive the a posteriori error estimates. An adaptive finite element method typicallytakes the successive loops:

SOLVE β†’ ESTIMATE β†’ MARK β†’ REFINE,

where the a posteriori error estimation (module ESTIMATE) is the core step in the design of an adaptivealgorithm, through which we can extract the information on the error distribution. For our algorithm and anal-ysis, we shall use the residual-type a posteriori error estimators in terms of numerically computable quantities,which include the element residuals πœ‚π‘‡ , 𝑇 ∈ Tβ„Ž, and the face residuals πœ‚πΉ , 𝐹 ∈ Fβ„Ž:

πœ‚2𝑇 :=

βˆ‘π‘–=1,2

(πœ‚(𝑖)𝑦,𝑇

)2

+βˆ‘

𝑖=1,2,3

(πœ‚(𝑖)𝑝,𝑇

)2

, πœ‚2𝐹 :=

βˆ‘π‘–=1,2

(πœ‚(𝑖)𝑦,𝐹

)2

+(πœ‚(𝑖)𝑝,𝐹

)2

, (3.1)

Page 8: Bowen Li and Jun Zou

2020 B. LI AND J. ZOU

where πœ‚(𝑖)𝑦,𝑇 and πœ‚

(𝑖)𝑝.𝑇 are the element residuals defined by

πœ‚(1)𝑦,𝑇 := β„Žπ‘‡ β€–f + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Žβ€–0,𝑇 , πœ‚

(2)𝑦,𝑇 := β„Žπ‘‡ β€–div(fβˆ’ 𝜎y*β„Ž)β€–0,𝑇 ,

and

πœ‚(1)𝑝,𝑇 := β„Žπ‘‡ β€–curly𝑑 + curlπœ‡βˆ’1curlp*β„Ž + 𝜎p*β„Žβ€–0,𝑇 , πœ‚

(2)𝑝,𝑇 := β„Žπ‘‡ β€–div(𝜎p*β„Ž)β€–0,𝑇 , πœ‚

(3)𝑝,𝑇 := β€–p*β„Ž βˆ’ Pβ„Žp

*β„Žβ€–0,𝑇 ,

while πœ‚(𝑖)𝑦,𝐹 and πœ‚

(𝑖)𝑝,𝐹 are the face residuals defined by

πœ‚(1)𝑦,𝐹 := β„Ž

1/2𝐹 β€–[𝛾𝑑(πœ‡βˆ’1curly*β„Ž)]𝐹 β€–0,𝐹 , πœ‚

(2)𝑦,𝐹 := β„Ž

1/2𝐹 β€–[𝛾𝑛(f + u*β„Ž βˆ’ 𝜎y*β„Ž)]𝐹 β€–0,𝐹 ,

πœ‚(1)𝑝,𝐹 := β„Ž

1/2𝐹 β€–[𝛾𝑑(βˆ’πœ‡βˆ’1curlp*β„Ž + curly*β„Ž)]𝐹 β€–0,𝐹 , πœ‚

(2)𝑝,𝐹 := β„Ž

1/2𝐹 β€–[𝛾𝑛(𝜎p*β„Ž)]𝐹 β€–0,𝐹 .

Here [Β·]𝐹 := Β·|βˆ’ βˆ’ Β·|+ stands for the jump across the face 𝐹 , where the subscripts Β± denote the limits takenfrom outside and inside 𝐹 βŠ‚ πœ•π‘‡ , respectively. We remark that it may not be easy to see how to define theseterms at the first glance, but we shall see that they are generated naturally in the reliability analysis. For easeof exposition, we denote by

πœ‚2β„Ž(𝑇 ) = πœ‚2

𝑇 +12

βˆ‘πΉβˆˆπœ•π‘‡βˆ©Ξ©

πœ‚2𝐹 ,

the residual-type error indicator associated with an element 𝑇 . We should note that there are no face residualson the boundary of domain since the state equation satisfies the homogeneous boundary condition. For the aposteriori error estimates and the convergence analysis, a lower-order data oscillation related to u𝑑 is needed:

osc2β„Ž

(u𝑑)

:=βˆ‘

π‘‡βˆˆTβ„Ž

osc2𝑇

(u𝑑), osc𝑇

(u𝑑)

:= β€–u𝑑 βˆ’ uπ‘‘β„Žβ€–0,𝑇 , 𝑇 ∈ Tβ„Ž.

Some higher-order data oscillations associated with y𝑑 and f shall also be involved:

osc2β„Ž

(y𝑑)

:=βˆ‘

π‘‡βˆˆTβ„Ž

osc2𝑇

(y𝑑)

with osc𝑇 (y𝑑) := β„Žπ‘‡ β€–curl(y𝑑 βˆ’ y𝑑

β„Ž

)β€–0,𝑇 , 𝑇 ∈ Tβ„Ž,

and

osc2β„Ž(f) :=

βˆ‘π‘‡βˆˆTβ„Ž

osc2𝑇 (f)

with

osc𝑇 (f) := β„Žπ‘‡ β€–fβˆ’ fβ„Žβ€–div,𝑇 +βˆ‘

πΉβˆˆπœ•π‘‡βˆ©Ξ©

β„Ž1/2𝐹 β€–[𝛾𝑛(fβˆ’ fβ„Ž)]𝐹 β€–0,𝐹 , 𝑇 ∈ Tβ„Ž,

where we assume that yπ‘‘β„Ž ∈ Vβ„Ž and fβ„Ž ∈ Uβ„Ž are some approximations of y𝑑 and f, respectively. In contrast to

the element residuals and face residuals associated with the discrete solutions, the data oscillation oscβ„Ž

(u𝑑)

istypically of lower order for a non-smooth target control u𝑑, and at most of 𝑂(β„Ž), by the Poincare inequality,even if u𝑑 has a certain regularity. Meanwhile, the data oscillations oscβ„Ž

(y𝑑)

and oscβ„Ž(f) are of the same orderas the residuals, and shall be of higher order if the data y𝑑 and f have additional regularities. Therefore, wecould replace y𝑑 and f in the element residuals πœ‚

(𝑖)𝑦,𝑇 , 𝑖 = 1, 2 and πœ‚

(1)𝑝,𝑇 , as well as some face residuals, with y𝑑

β„Ž

and fβ„Ž for ease of implementation without any influence on the performance of the algorithm and any essentialchange of the analysis.

As we shall see, since the inconsistent discrete spaces are used and no additional regularity assumptions areadded on the data, the lower-order data oscillations may have a significant contribution to the total error so that

Page 9: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2021

a single residual-type error estimator πœ‚β„Ž is not enough to capture the error distribution accurately. Therefore, todesign an efficient adaptive algorithm for our problem, it is necessary to take into account the data oscillations[26, 29]. In view of this, we introduce a new mixed error indicator, by incorporating the lower-order dataoscillation osc𝑇

(u𝑑)

into the residual-type error estimator πœ‚β„Ž,

πœ‚2β„Ž =

βˆ‘π‘‡βˆˆTβ„Ž

πœ‚2β„Ž(𝑇 ) with πœ‚2

β„Ž(𝑇 ) = πœ‚2β„Ž(𝑇 ) + osc2

𝑇

(u𝑑). (3.2)

We are now in a position to present the algorithm (see Algorithm 1 below), based on the error estimator πœ‚β„Ž.In what follows, we shall use the iteration index π‘˜ to indicate the dependence of a variable or a quantity on aparticular mesh generated in the adaptive process, for instance, we write πœ‚π‘˜(𝑇 ) for πœ‚β„Ž(𝑇 ) to emphasize that weare considering the error estimator πœ‚β„Ž(𝑇 ) on the mesh Tπ‘˜.

Algorithm 1. Adaptive edge element method.1: Specify a shape regular initial mesh T0 on Ξ© and set the iteration index π‘˜ := 0.2: (SOLVE) Compute the numerical solution (y*π‘˜,p*π‘˜ u*π‘˜) to the discrete optimality system (2.14a)–(2.14c) on the mesh

Tπ‘˜.3: (ESTIMATE) Compute the error estimator πœ‚π‘˜(𝑇 ) defined in (3.2) for each element 𝑇 ∈ Tπ‘˜.

4: (MARK) Mark a subset Mπ‘˜ βŠ‚ Tπ‘˜ containing at least one element π‘‡π‘˜ that satisfies

πœ‚π‘˜

(π‘‡π‘˜

)= max

π‘‡βˆˆTπ‘˜

πœ‚π‘˜(𝑇 ). (3.3)

5: (REFINE) Refine elements in Mπ‘˜ and other necessary elements by bisection to generate the smallest conformingmesh Tπ‘˜+1 with Tπ‘˜+1 ∩Mπ‘˜ = βˆ….

6: Set π‘˜ = π‘˜ + 1 and go to Step 2 until the preset stopping criterion is met.

Several further remarks concerning each module in Algorithm 1 are in order. First, in the module SOLVE, weare required to solve a large-scale quadratic optimization problem with discretized PDE-constraints efficiently.It is nowadays a very active research area, and many high-performance algorithms and preconditioners havebeen developed for this purpose (cf. [16, 50–52,55]).

Second, to guarantee the convergence of the algorithm, we only need the condition (3.3) in the moduleMARK that the marked elements contain an element with the largest error indicator, which can be met bymany popular marking strategies, such as the maximum strategy [7], the equidistribution strategy [23] and theDorfler’s strategy [21]. In our numerical experiments (see Sect. 5), we shall use the Dorfler’s strategy to selectthe marked elements Mπ‘˜ with minimal cardinality such that for a given πœƒ ∈ (0, 1), there holdsβˆ‘

π‘‡βˆˆMπ‘˜

πœ‚2π‘˜(𝑇 ) β‰₯ πœƒ

βˆ‘π‘‡βˆˆTπ‘˜

πœ‚2π‘˜(𝑇 ). (3.4)

Third, we only include the data oscillation oscβ„Ž

(u𝑑)

in the definition of πœ‚β„Ž since it is a lower-order term thatmay provide a dominant error contribution among all the data oscillations. On the other hand, the behavior ofthe optimal control u* relies largely on the properties of the obstacle function πœ“, whose information is furthertransferred to u𝑑. We hence expect that the approximation ability of Uβ„Ž associated with the current mesh Tβ„Ž

for u𝑑 can directly influence the performance of our AFEM; see also [26,29] for related discussions. In the casewhere πœ“ is a constant function and u𝑑 = 0, oscβ„Ž

(u𝑑)

vanishes and πœ‚β„Ž becomes the standard residual-type errorestimator πœ‚β„Ž.

Finally, for the module REFINE, all elements of Mπ‘˜ are bisected at least once, and some additional elementsin Tπ‘˜βˆ–Mπ‘˜ may also need to be subdivided in order to generate a sequence of uniformly shape regular andconforming meshes Tπ‘˜π‘˜β‰₯0; see [17] and Section 4 of [48] for the detailed mesh refinement algorithm and the

Page 10: Bowen Li and Jun Zou

2022 B. LI AND J. ZOU

necessary assumptions on the initial triangulation T0. Such a refinement process ensures that all the genericconstants involved in the inequalities below depend only on the shape regularity of the initial mesh and thegiven data.

We shall next present the a posteriori error analysis based on the error estimator πœ‚β„Ž (3.2), including both thereliability and efficiency estimates, which is similar in spirit to the ones given in [35,53], but with several maindifficulties and differences as stated in the introduction, especially those caused by the inconsistency betweenthe discrete spaces of the state and control.

3.1. Reliability

In this section, we show that the error estimator πœ‚β„Ž is reliable in the sense that it can provide an upper boundfor the total error between the true solution and the numerical solution:

β€–y*β„Ž βˆ’ y*β€–curl,Ξ© + β€–p*β„Ž βˆ’ p*β€–curl,Ξ© + β€–u*β„Ž βˆ’ u*β€–0,Ξ©.

For this purpose, following [53], we introduce a helpful quasi-interpolation operator Ξ β„Ž. We start with thedefinition of the extended neighborhood Ω𝑇 for an element 𝑇 ∈ Tβ„Ž and fix some notations. Recall that theneighborhood Ξ©v of a vortex v is defined as the union of all the elements that contain the vertex v, and theextended neighborhood of a vertex v is given by Ξ©v :=

⋃vβ€²βˆˆΞ©v

Ξ©vβ€² . We define the extended neighborhood of anelement 𝑇 by Ω𝑇 =

⋃vβˆˆπ‘‡

Ξ©v. An important consequence of the uniformly shape regularity of Tβ„Ž is that thecardinality of Ω𝑇 is uniformly bounded (cf. [48], Sect. 4.3):

maxπ‘‡βˆˆTβ„Ž

#Ω𝑇 ≀ 𝐢(T0). (3.5)

The corresponding converse fact is that the collection of extended neighborhoods Ω𝑇 , 𝑇 ∈ Tβ„Ž, covers eachelement in Tβ„Ž finite times uniformly:

maxπ‘‡βˆˆTβ„Ž

#

𝑇 β€² ∈ Tβ„Ž; 𝑇 ∈ Ω𝑇 β€²

≀ 𝐢(T0). (3.6)

Here the constants 𝐢(T0) only depend on the initial mesh T0.

Lemma 3.1 ([53], Thm. 1). There exists an interpolation operator Ξ β„Ž : H0(curl, Ξ©) β†’ Vβ„Ž such that for anyu ∈ H0(curl, Ξ©), uβˆ’Ξ β„Žu has the decomposition:

uβˆ’Ξ β„Žu = βˆ‡πœ™ + z,

with πœ™ ∈ 𝐻10 (Ξ©), z ∈ H1

0(Ξ©). Moreover, the following estimates hold

β„Žβˆ’1𝑇 β€–πœ™β€–0,𝑇 + β€–βˆ‡πœ™β€–0,𝑇 . β€–uβ€–0,Ω𝑇

, (3.7)

β„Žβˆ’1𝑇 β€–zβ€–0,𝑇 + β€–βˆ‡zβ€–0,𝑇 . β€–curluβ€–0,Ω𝑇

. (3.8)

Since the optimal state y* and adjoint state p* satisfy the coupled system (2.7a)–(2.7c), the Galerkin orthog-onality, which is important for the a posteriori error analysis for the linear boundary value problems, does nothold here. To compensate it, we introduce the intermediate state and adjoint state, y(u*β„Ž) and p(u*β„Ž), by theequations:

𝐡(y(u*β„Ž),πœ‘) = (f + u*β„Ž,πœ‘) βˆ€ πœ‘ ∈ V, (3.9)

𝐡(p(u*β„Ž),πœ“) =(curly(u*β„Ž)βˆ’ y𝑑, curlπœ“

)βˆ€ πœ“ ∈ V. (3.10)

If we use the variational discretization for the control variable, we can derive the following error equivalence(cf. [27, 30]):

β€–y(u*β„Ž)βˆ’ y*β„Žβ€–curl,Ξ© + β€–p(u*β„Ž)βˆ’ p*β„Žβ€–curl,Ξ© β‰ˆ β€–y*β„Ž βˆ’ y*β€–curl,Ξ© + β€–p*β„Ž βˆ’ p*β€–curl,Ξ© + β€–u*β„Ž βˆ’ u*β€–0,Ξ©,

Page 11: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2023

which allows us to directly conclude the reliability of the error estimator from the known result concerningMaxwell’s equations ([53], Cor. 2). If we use the edge element discretization for the control as in [35], the aboveerror equivalence still holds, up to the data oscillation oscβ„Ž

(u𝑑). However, this is not the case in our algorithm

since the discrete spaces of the control and state variables are different. Instead, we have the following result.

Lemma 3.2. Let the triplets (y*,p*,u*) and (y*β„Ž,p*β„Ž,u*β„Ž) be the solutions to (2.7a)–(2.7c) and (2.14a)–(2.14c),respectively. Then it holds that

β€–y*β„Ž βˆ’ y*β€–curl,Ξ© + β€–p*β„Ž βˆ’ p*β€–curl,Ξ© + β€–u*β„Ž βˆ’ u*β€–0,Ξ©

. β€–y(u*β„Ž)βˆ’ y*β„Žβ€–curl,Ξ© + β€–p(u*β„Ž)βˆ’ p*β„Žβ€–curl,Ξ© + β€–Pβ„Žp*β„Ž βˆ’ p*β„Žβ€–0,Ξ© + oscβ„Ž

(u𝑑).

Proof. By the well-posedness of the H(curl)-elliptic variational problem, we have

β€–y* βˆ’ y(u*β„Ž)β€–curl,Ξ© . β€–uβˆ’ u*β„Žβ€–0,Ξ©, (3.11)β€–p* βˆ’ p(u*β„Ž)β€–curl,Ξ© . β€–curly* βˆ’ curly(u*β„Ž)β€–0,Ξ© . β€–uβˆ’ u*β„Žβ€–0,Ξ©, (3.12)

which, combined with the triangle inequality, reduce the proof of the lemma to the estimate of β€–u*β„Ž βˆ’ u*β€–0,Ξ©.In view of (2.10) and (2.17), we get

β€–u* βˆ’ u*β„Žβ€–20,Ξ© ≀(u* βˆ’ u*β„Ž,βˆ’p* βˆ’ Pβ„Žp

*β„Ž

𝛼+ u𝑑 βˆ’ u𝑑

β„Ž

)0,Ξ©

, (3.13)

by the contraction property of 𝐿2-projections ([15], Prop. 5.3). Moreover, we can deduce, by taking πœ‘ =p* βˆ’ p(u*β„Ž) in (3.9) and πœ“ = y* βˆ’ y(u*β„Ž) in (3.10), that

(u* βˆ’ u*β„Ž,p* βˆ’ p(u*β„Ž))0,Ξ© = β€–curly* βˆ’ curly(u*β„Ž)β€–20,Ξ© β‰₯ 0. (3.14)

Combining (3.14) with (3.13) helps us obtain

β€–u*β„Ž βˆ’ u*β€–0,Ξ© . β€–Pβ„Žp*β„Ž βˆ’ p*β„Žβ€–0,Ξ© + β€–p*β„Ž βˆ’ p(u*β„Ž)β€–0,Ξ© + β€–u𝑑

β„Ž βˆ’ u𝑑‖0,Ξ©,

which completes the proof of the lemma.

With the above preparations, we are now ready to prove the reliability of the error estimator.

Theorem 3.3. Let the triplets (y*,p*,u*) and (y*β„Ž,p*β„Ž,u*β„Ž) be the solutions to the continuous and discrete op-timality systems (2.7a)–(2.7c) and (2.14a)–(2.14c), respectively. Then we have the following reliability estimate:

β€–y*β„Ž βˆ’ y*β€–curl,Ξ© + β€–p*β„Ž βˆ’ p*β€–curl,Ξ© + β€–u*β„Ž βˆ’ u*β€–0,Ξ© . πœ‚β„Ž. (3.15)

Proof. By Lemma 3.2, it suffices to estimate β€–y(u*β„Ž) βˆ’ y*β„Žβ€–curl,Ξ© + β€–p(u*β„Ž) βˆ’ p*β„Žβ€–curl,Ξ© to obtain the reliabilityestimate (3.15). We first consider the estimate for the state variable y. Let ey be y(u*β„Ž) βˆ’ y*β„Ž, and recall thenorm equivalence: β€–v‖𝐡 β‰ˆ β€–vβ€–curl,Ξ©. We can derive, by the Galerkin orthogonality,

β€–eyβ€–2curl,Ξ© β‰ˆ 𝐡(ey, ey βˆ’Ξ β„Žey) = (f + u*β„Ž βˆ’ 𝜎y*β„Ž, ey βˆ’Ξ β„Žey)0,Ξ© βˆ’(πœ‡βˆ’1curly*β„Ž, curl(ey βˆ’Ξ β„Žey)

)0,Ξ©

. (3.16)

A direct application of Lemma 3.1 gives us the decomposition: eyβˆ’Ξ β„Žey = βˆ‡πœ™+z with πœ™ ∈ 𝐻10 (Ξ©), z ∈ H1

0(Ξ©).Substituting it into (3.16) and using integration by parts for each 𝑇 , we have

(f + u*β„Ž βˆ’ 𝜎y*β„Ž,βˆ‡πœ™ + z)0,Ξ© βˆ’(πœ‡βˆ’1curly*β„Ž, curl(βˆ‡πœ™ + z)

)0,Ξ©

=βˆ‘

π‘‡βˆˆTβ„Ž

βˆ’(div f, πœ™)0,𝑇 +(f + u*β„Ž βˆ’ 𝜎y*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž, z

)0,𝑇

+ (div(𝜎y*β„Ž), πœ™)0,𝑇

Page 12: Bowen Li and Jun Zou

2024 B. LI AND J. ZOU

+βˆ‘

𝐹∈Fβ„Ž

([𝛾𝑛(f + u*β„Ž βˆ’ 𝜎y*β„Ž)]𝐹 , πœ™)0,𝐹 +([

𝛾𝑑(πœ‡βˆ’1curly*β„Ž)]𝐹, z)0,𝐹

. (3.17)

To proceed, by the scaled trace inequality ([48], Cor. 6.1):

‖𝑀‖0,𝐹 . β„Žβˆ’1/2𝐹 ‖𝑀‖0,𝑇 + β„Ž

1/2𝐹 β€–βˆ‡π‘€β€–0,𝑇 for 𝐹 ∈ πœ•π‘‡, 𝑀 ∈ 𝐻1(𝑇 ), (3.18)

and estimates (3.7) and (3.8), we obtain from (3.16) and (3.17) and the definitions of the error estimators,

β€–eyβ€–2curl,Ξ© .βˆ‘

π‘‡βˆˆTβ„Ž

(πœ‚(1)𝑦,𝑇 + πœ‚

(2)𝑦,𝑇

)β€–eyβ€–curl,Ω𝑇

+βˆ‘

π‘‡βˆˆTβ„Ž

βˆ‘πΉβˆˆπœ•π‘‡βˆ©Ξ©

(πœ‚(1)𝑦,𝐹 + πœ‚

(2)𝑦,𝐹

)β€–eyβ€–curl,Ω𝑇

. (3.19)

By the property (3.6), the desired estimate follows from (3.19) and the Cauchy’s inequality:

β€–y(u*β„Ž)βˆ’ y*β„Žβ€–curl,Ξ© . πœ‚β„Ž. (3.20)

The error ep := p(u*β„Ž)βˆ’ p*β„Ž for the adjoint state can be analysed similarly. We note

β€–epβ€–2curl,Ξ© β‰ˆπ΅(ep, ep βˆ’Ξ β„Žep) + 𝐡(ep, Ξ β„Žep),

and write ep βˆ’Ξ β„Žep = βˆ‡πœ™ + z by Lemma 3.1. Then some elementary calculations give us that

𝐡(ep, ep βˆ’Ξ β„Žep) = 𝐡(p(u*β„Ž)βˆ’ p*β„Ž,βˆ‡πœ™ + z)

.(curly*β„Ž βˆ’ y𝑑 βˆ’ πœ‡βˆ’1curlp*β„Ž, curl z

)0,Ξ©

βˆ’ (𝜎p*β„Ž,βˆ‡πœ™ + z)0,Ξ© + β€–y*β„Ž βˆ’ y(u*β„Ž)β€–curl,Ξ©β€–epβ€–curl,Ξ©.

Moreover, by equations (2.14b) and (3.10), we have

|𝐡(ep, Ξ β„Žep)| . β€–y*β„Ž βˆ’ y(u*β„Ž)β€–curl,Ξ©β€–epβ€–curl,Ξ©.

Then we can derive by using integration by parts and the above estimates that

β€–epβ€–2curl,Ξ© .(curly*β„Ž βˆ’ y𝑑 βˆ’ πœ‡βˆ’1curlp*β„Ž, curl z

)0,Ξ©

βˆ’ (𝜎p*β„Ž,βˆ‡πœ™ + z)0,Ξ© + β€–y*β„Ž βˆ’ y(u*β„Ž)β€–curl,Ξ©β€–epβ€–curl,Ξ©

=βˆ‘

π‘‡βˆˆTβ„Ž

(βˆ’curly𝑑 βˆ’ curlπœ‡βˆ’1curlp*β„Ž βˆ’ 𝜎p*β„Ž, z)0,𝑇 + (div(𝜎p*β„Ž), πœ™)0,𝑇

βˆ’βˆ‘

𝐹∈Fβ„Ž

([𝛾𝑑(curly*β„Ž βˆ’ πœ‡βˆ’1curlp*β„Ž)]𝐹 , z)0,𝐹 βˆ’ ([𝛾𝑛(𝜎p*β„Ž)]𝐹 , πœ™)0,𝐹

+ β€–y*β„Ž βˆ’ y(u*β„Ž)β€–curl,Ξ©β€–epβ€–curl,Ξ©,

which, by (3.20), the trace inequality (3.18) and Lemma 3.1, gives

β€–p(u*β„Ž)βˆ’ p*β„Žβ€–curl,Ξ© . πœ‚β„Ž. (3.21)

Combining estimates (3.20) and (3.21) with Lemma 3.2 and the definition of πœ‚β„Ž completes the proofof (3.15).

3.2. Efficiency

In this section, we consider the efficiency estimate, which is another aim of the a posteriori error analysis. Forthis, we need the so-called bubble functions, which plays a similar role to the cut-off functions and can help usestimate the local errors. As we shall see soon, when we deal with the divergence parts of the residual-type errorestimator πœ‚β„Ž (i.e., πœ‚(2)

𝑦,𝑇 , πœ‚(2)

𝑝,𝑇 , πœ‚(2)

𝑦,𝐹 and πœ‚(2)

𝑝,𝐹 ), the higher-order bubble functions have to be used to ensure thevanishing boundary traces of some terms. Moreover, the curl structures in the right-hand sides of the adjoint

Page 13: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2025

equations (2.7b) and (2.14b) also need our special and careful treatment. These important points were notaddressed in [35].

For the reader’s convenience, we next briefly review the definition of the bubble functions and some basicresults, see [2, 59] for a comprehensive introduction of this topic. We define the bubble function for an element𝑇 ∈ Tβ„Ž by 𝑏𝑇 (x) = 256Ξ 4

𝑖=1πœ†π‘‡π‘– (x), x ∈ 𝑇 , where πœ†π‘‡

𝑖 , 1 ≀ 𝑖 ≀ 4, are the barycentric coordinate functionsassociated with four vertices of 𝑇 ∈ Tβ„Ž. Similarly, the bubble function for a face 𝐹 ∈ Fβ„Ž is given by 𝑏𝐹 |𝑇 (x) =27Ξ 3

𝑖=1πœ†πΉπ‘– (x), x ∈ 𝑇 ∈ πœ”πΉ . Here, πœ”πΉ := 𝑇 ∈ Tβ„Ž; 𝐹 βŠ‚ πœ•π‘‡ is the element pair for a face 𝐹 ∈ Fβ„Ž, and πœ†πΉ

𝑖 arethe barycentric coordinate functions associated with the vertices of the face 𝐹 ∈ Fβ„Ž, which can be naturallyextended to πœ”πΉ . To extend the face residuals defined on 𝐹 to πœ”πΉ , we introduce the extension operator as follows.We first define the operator : 𝐢(𝐹 ) β†’ 𝐢(𝑇 ) on the reference element 𝑇 in R3 by

[𝑝](, 𝑦, 𝑧) := 𝑝(, 𝑦),

where 𝐹 is the face of 𝑇 lying on the (, 𝑦)-plane. By using the affine mapping 𝐹𝑇 (x) = 𝐴𝑇 x+a𝑇 : 𝑇 β†’ 𝑇 ∈ πœ”πΉ ,the general extension operator 𝐸 : 𝐢(𝐹 ) β†’ 𝐢(πœ”πΉ ) can be introduced by

𝐸[𝑝]|𝑇 = [𝑝 ∘ 𝐹𝑇 ] ∘ πΉβˆ’1𝑇 , 𝑇 ∈ πœ”πΉ , (3.22)

where the mappings 𝐹𝑇 , 𝑇 ∈ πœ”πΉ , are chosen such that 𝐹 is mapped to 𝐹 and 𝐸[𝑝] is well-defined on πœ”πΉ andcontinuous. The next lemma summarizes the important properties of the bubble functions, which can be easilyverified by the equivalence of norms in a finite-dimensional linear space and the standard scaling argument.

Lemma 3.4. Let π‘˜ be a positive integer and 𝑠 be a positive real number. For any 𝑇 ∈ Tβ„Ž and 𝐹 ∈ Fβ„Ž, thereholds

β€–πœ‘β€–0,𝑇 . ‖𝑏𝑠𝑇 πœ‘β€–0,𝑇 ≀ β€–πœ‘β€–0,𝑇 , β€–πœ™β€–0.𝐹 . ‖𝑏𝑠

𝐹 πœ™β€–0.𝐹 ≀ β€–πœ™β€–0,𝐹 , (3.23)

for all πœ‘ ∈ π‘ƒπ‘˜(𝑇 ) and πœ™ ∈ π‘ƒπ‘˜(𝐹 ), and

β„Ž1/2𝐹 β€–πœ™β€–0,𝐹 . ‖𝑏𝑠

𝐹 𝐸(πœ™)β€–0,𝑇 . β„Ž1/2𝐹 β€–πœ™β€–0,𝐹 , (3.24)

for 𝑇 ∈ 𝑀𝐹 and πœ™ ∈ π‘ƒπ‘˜(𝐹 ).

We are now in a position to state and prove the main result of this section.

Theorem 3.5. Let the triplets (y*,p*,u*) and (y*β„Ž,p*β„Ž,u*β„Ž) be the solutions to the continuous and discreteoptimality systems (2.7a)–(2.7c) and (2.14a)–(2.14c), respectively, and the multipliers πœ†* and πœ†*β„Ž be given by(2.8) and (2.16). Then we have the efficiency estimate:

πœ‚β„Ž . β€–y*β„Ž βˆ’ y*β€–curl,Ξ© + β€–p*β„Ž βˆ’ p*β€–curl,Ξ© + β€–u*β„Ž βˆ’ u*β€–0,Ξ© + β€–πœ†*β„Ž βˆ’ πœ†*β€–0,Ξ©

+ oscβ„Ž

(y𝑑)

+ oscβ„Ž(f) + oscβ„Ž

(u𝑑). (3.25)

Before we start our proof, we remark that it is necessary to consider the error of the multiplier β€–πœ†*β„Žβˆ’πœ†*β€–0,Ξ©

here in order to estimate πœ‚β„Ž, in comparison with [35], since the additional error estimator πœ‚(3)

𝑝,𝑇 is included inπœ‚β„Ž. We start with the following local efficiency estimate:

πœ‚(3)𝑝,𝑇 = β€–p*β„Ž βˆ’ p* + p* βˆ’ Pβ„Žp

*β„Žβ€–0,𝑇

≀ β€–πœ†* βˆ’ πœ†*β„Ž βˆ’ 𝛼(u* βˆ’ u*β„Ž βˆ’ u𝑑 + u𝑑

β„Ž

)β€–0,𝑇 + β€–p*β„Ž βˆ’ p*β€–0,𝑇

≀ β€–p* βˆ’ p*β„Žβ€–0,𝑇 + β€–πœ†* βˆ’ πœ†*β„Žβ€–0,𝑇 + 𝛼(β€–u* βˆ’ u*β„Žβ€–0,𝑇 + osc𝑇

(u𝑑))

,

Page 14: Bowen Li and Jun Zou

2026 B. LI AND J. ZOU

by the definitions of πœ†*, πœ†*β„Ž and πœ‚(3)𝑝,𝑇 and the triangle inequality. Our proof proceeds by establishing more

local efficiency estimates for πœ‚π‘‡ , which are divided into the following four groups of estimates, for 𝑇 ∈ Tβ„Ž and𝐹 ∈ Fβ„Ž, ⎧⎨⎩ πœ‚

(1)𝑦,𝑇 . β„Žπ‘‡ β€–u* βˆ’ u*β„Žβ€–0,𝑇 + β€–y* βˆ’ y*β„Žβ€–curl,𝑇 + osc𝑇 (f),

πœ‚(1)𝑝,𝑇 . β€–p

* βˆ’ p*β„Žβ€–curl,𝑇 + β€–y* βˆ’ y*β„Žβ€–curl,𝑇 + osc𝑇

(y𝑑),⎧⎨⎩ πœ‚

(2)𝑦,𝑇 . β€–u

* βˆ’ u*β„Žβ€–0,𝑇 + β€–y* βˆ’ y*β„Žβ€–0,𝑇 + osc𝑇 (f),

πœ‚(2)𝑝,𝑇 . β€–p

* βˆ’ p*β„Žβ€–0,𝑇 ,⎧⎨⎩ πœ‚(1)𝑦,𝐹 . β„Žπ‘‡ β€–u* βˆ’ u*β„Žβ€–0,𝑀𝐹

+ β€–y* βˆ’ y*β„Žβ€–curl,𝑀𝐹+ πœ‚

(1)𝑦,𝑇+ + πœ‚

(1)𝑦,π‘‡βˆ’ ,

πœ‚(1)𝑝,𝐹 . β€–p

* βˆ’ p*β„Žβ€–curl,𝑀𝐹+ β€–y* βˆ’ y*β„Žβ€–curl,𝑀𝐹

+ πœ‚(1)𝑝,𝑇+ + πœ‚

(1)𝑝,π‘‡βˆ’ ,⎧⎨⎩ πœ‚

(2)𝑦,𝐹 . β€–u

* βˆ’ u*β„Žβ€–0,𝑀𝐹+ β€–y* βˆ’ y*β„Žβ€–0,𝑀𝐹

+ πœ‚(2)𝑦,𝑇+ + πœ‚

(2)𝑦,π‘‡βˆ’ + osc𝑇+(f),

πœ‚(2)𝑝,𝐹 . β€–p

* βˆ’ p*β„Žβ€–0,𝑀𝐹+ πœ‚

(2)𝑝,𝑇+ + πœ‚

(2)𝑝,π‘‡βˆ’ ,

where 𝑇+ and π‘‡βˆ’ are two elements in πœ”πΉ with 𝐹 = 𝑇+ ∩ π‘‡βˆ’.

Proof. We give the proof of the above four groups of inequalities by the following four steps.

(1) We start with πœ‚(1)

𝑦,𝑇 and readily see by the triangle inequality that

πœ‚(1)𝑦,𝑇 ≀ β„Žπ‘‡ β€–fβ„Ž + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Žβ€–0,𝑇 + osc𝑇 (f). (3.26)

It is clear that 𝑏𝑇 vanishes on πœ•π‘‡ , and hence we can define zβ„Ž := 𝑏𝑇 (fβ„Ž + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Ž) ∈H0(curl, Ξ©). Using the estimate (3.23) with πœ‘ = fβ„Ž + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Ž and 𝑠 = 1/2, we obtain

β€–fβ„Ž + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Žβ€–20,𝑇 β‰ˆ (fβ„Ž + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Ž, zβ„Ž)0,𝑇

= (fβ„Ž βˆ’ f, zβ„Ž)0,𝑇 + (u*β„Ž βˆ’ u*, zβ„Ž)0,𝑇 + 𝐡(y* βˆ’ y*β„Ž, zβ„Ž). (3.27)

By estimates (3.26) and (3.27), and the inverse inequality:

β€–zβ„Žβ€–curl,𝑇 . β„Žβˆ’1𝑇 β€–zβ„Žβ€–0,𝑇 ,

as well as the norm equivalence: β€–zβ„Žβ€–0,𝑇 β‰ˆ β€–fβ„Ž + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Žβ€–0,𝑇 , we can derive

πœ‚(1)𝑦,𝑇 . β„Žπ‘‡ β€–u* βˆ’ u*β„Žβ€–0,𝑇 + β€–y* βˆ’ y*β„Žβ€–curl,𝑇 + osc𝑇 (f).

The estimate of πœ‚(1)

𝑝,𝑇 is similar. We note

πœ‚(1)𝑝.𝑇 . β„Žπ‘‡ β€–curly𝑑

β„Ž + curlπœ‡βˆ’1curlp*β„Ž + 𝜎p*β„Žβ€–0,𝑇 + osc𝑇

(y𝑑), (3.28)

and define zβ„Ž := 𝑏𝑇 (curlyπ‘‘β„Ž + curlπœ‡βˆ’1curlp*β„Ž + 𝜎p*β„Ž) ∈ H0(curl, Ξ©). Then a similar estimate as above

gives

β€–curlyπ‘‘β„Ž + curlπœ‡βˆ’1curlp*β„Ž + 𝜎p*β„Žβ€–20,𝑇 . (curly𝑑

β„Ž + curlπœ‡βˆ’1curlp*β„Ž + 𝜎p*β„Ž, zβ„Ž)0,𝑇

.(curly𝑑 + curlπœ‡βˆ’1curlp*β„Ž + 𝜎p*β„Ž, zβ„Ž

)0,𝑇

βˆ’ (curly*, curl zβ„Ž)0,𝑇

+(curly𝑑

β„Ž βˆ’ curly𝑑, zβ„Ž

)0,𝑇

+ (curl(y* βˆ’ y*β„Ž), curlπ‘§β„Ž)0,𝑇

(3.29)

Page 15: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2027

. β€–curlyπ‘‘β„Ž βˆ’ curly𝑑‖0,𝑇 β€–zβ„Žβ€–0,𝑇 + β€–y*β„Ž βˆ’ y*β€–curl,𝑇 β€–zβ„Žβ€–curl,𝑇

+ β€–p*β„Ž βˆ’ p*β€–curl,𝑇 β€–zβ„Žβ€–curl,𝑇 , (3.30)

where in (3.29) we have used (curly*β„Ž, curl zβ„Ž)0,𝑇 = 0 from the fact that y*β„Ž is a first-order polynomial on𝑇 , and in (3.30) we have used(

curly𝑑 + curlπœ‡βˆ’1curlp*β„Ž + 𝜎p*β„Ž, zβ„Ž

)0,𝑇

βˆ’ (curly*, curl zβ„Ž)0,𝑇

= 𝐡(p*β„Ž βˆ’ p*, zβ„Ž) . β€–p*β„Ž βˆ’ p*β€–curl,𝑇 β€–zβ„Žβ€–curl,𝑇 .

Further applying the inverse estimate for β€–zβ„Žβ€–curl,𝑇 in (3.30) and recalling (3.28), we come to

πœ‚(1)𝑝,𝑇 . β€–p

* βˆ’ p*β„Žβ€–curl,𝑇 + β€–y* βˆ’ y*β„Žβ€–curl,𝑇 + osc𝑇

(y𝑑).

(2) Define zβ„Ž := div(fβ„Ž βˆ’ 𝜎y*β„Ž)𝑏2𝑇 with β€–zβ„Žβ€–0,𝑇 β‰ˆ β€–div(fβ„Ž βˆ’ 𝜎y*β„Ž)β€–0,𝑇 . It is clear that βˆ‡zβ„Ž is a polynomial on 𝑇

and vanishes on the boundary πœ•π‘‡ , which gives βˆ‡zβ„Ž ∈ H0(curl, Ξ©). By a direct calculation, we have

(div(fβ„Ž βˆ’ 𝜎y*β„Ž), zβ„Ž)0,𝑇 = (div(fβ„Ž βˆ’ 𝜎y*β„Ž + u*β„Ž), zβ„Ž)0,𝑇

= (div(fβˆ’ 𝜎y*β„Ž + u*β„Ž), zβ„Ž)0,𝑇 + (div(fβ„Ž βˆ’ f), zβ„Ž)0,𝑇

= (βˆ’πœŽy* + 𝜎y*β„Ž + u* βˆ’ u*β„Ž,βˆ‡zβ„Ž)0,𝑇 + (div(fβ„Ž βˆ’ f), zβ„Ž)0,𝑇 , (3.31)

where we have used the following observation in the last equality:

𝐡(y*,βˆ‡zβ„Ž) = (𝜎y*,βˆ‡zβ„Ž)0,Ξ© = (f + u*,βˆ‡zβ„Ž)0,Ξ©,

which is from (2.7a) with the test function πœ‘ = βˆ‡zβ„Ž. Again, by Lemma 3.4 and the inverse estimate, wecan derive from the definition of πœ‚(2)

𝑦,𝑇 and (3.31) that

πœ‚(2)𝑦,𝑇 . β€–u

*β„Ž βˆ’ u*β€–0,𝑇 + β€–y*β„Ž βˆ’ y*β€–0,𝑇 + osc𝑇 (f).

Likewise, for πœ‚(2)

𝑝,𝑇 , taking zβ„Ž = div(𝜎p*β„Ž)𝑏2𝑇 and observing from (2.7b):

(𝜎p*,βˆ‡zβ„Ž)0,Ξ© =(curly* βˆ’ y𝑑, curlβˆ‡zβ„Ž

)0,Ξ©

= 0,

we can derive, by almost the same arguments as above, that

πœ‚(2)𝑝,𝑇 . β€–p

* βˆ’ p*β„Žβ€–0,𝑇 .

(3) Since the lowest-order edge element is used for the discretization, curly*β„Ž is a piecewise constant vector.Then 𝛾𝑑(πœ‡βˆ’1curly*β„Ž) is a polynomial defined on 𝐹 and can be extended to 𝑀𝐹 by the extension operator𝐸 introduced in (3.22). Define

zβ„Ž := 𝑏𝐹 𝐸([

𝛾𝑑

(πœ‡βˆ’1curly*β„Ž

)]𝐹

)∈ H0(curl, Ω).

By the estimate (3.23) and integration by parts over πœ”πΉ , we have(πœ‚(1)𝑦,𝐹

)2

= β„ŽπΉ β€–[𝛾𝑑

(πœ‡βˆ’1curly*β„Ž

)]𝐹‖20,𝐹 β‰ˆ β„ŽπΉ

([𝛾𝑑

(πœ‡βˆ’1curly*β„Ž

)]𝐹, zβ„Ž

)0,𝐹

= β„ŽπΉ

(curlπœ‡βˆ’1curly*β„Ž + 𝜎y*β„Ž βˆ’ fβˆ’ u*β„Ž, zβ„Ž

)0.𝑀𝐹

βˆ’ β„ŽπΉ

(πœ‡βˆ’1curly*β„Ž βˆ’ πœ‡βˆ’1curly*, curl zβ„Ž

)0,𝑀𝐹

+ β„ŽπΉ (u*β„Ž βˆ’ u*, zβ„Ž)0,𝑀𝐹+ β„ŽπΉ (𝜎y* βˆ’ 𝜎y*β„Ž, zβ„Ž)0,𝑀𝐹

. (3.32)

Page 16: Bowen Li and Jun Zou

2028 B. LI AND J. ZOU

The estimate (3.24) and the inverse estimate give us

β„Ž1/2𝐹 β€–

[𝛾𝑑

(πœ‡βˆ’1curly*β„Ž

)]𝐹‖0,𝐹 β‰ˆ β€–zβ„Žβ€–0,𝑀𝐹

, β€–curl zβ„Žβ€–0,𝑀𝐹. β„Žβˆ’1

𝐹 β€–zβ„Žβ€–0,𝑀𝐹. (3.33)

Combining (3.33) with the formula (3.32), we get

πœ‚(1)𝑦,𝐹 . πœ‚

(1)𝑦,𝑇+ + πœ‚

(1)𝑦,π‘‡βˆ’ + β€–y*β„Ž βˆ’ y*β€–curl,𝑀𝐹

+ β„ŽπΉ β€–u*β„Ž βˆ’ u*β€–0,𝑀𝐹.

For πœ‚(1)

𝑝,𝐹 , let zβ„Ž := 𝑏𝐹 𝐸([

𝛾𝑑

(βˆ’πœ‡βˆ’1curlp*β„Ž + curly*β„Ž

)]𝐹

). By similar calculations, it follows that(

πœ‚(1)𝑝,𝐹

)2

β‰ˆ β„ŽπΉ

([𝛾𝑑

(βˆ’πœ‡βˆ’1curlp*β„Ž + curly*β„Ž

)]𝐹, zβ„Ž

)0,𝐹

= β„ŽπΉ

(βˆ’curlπœ‡βˆ’1curlp*β„Ž βˆ’ curly𝑑 βˆ’ 𝜎p*β„Ž, zβ„Ž

)0,𝑀𝐹

βˆ’ β„ŽπΉ

(βˆ’πœ‡βˆ’1curlp*β„Ž + curly*β„Ž βˆ’ y𝑑, curl zβ„Ž

)0,𝑀𝐹

+ β„ŽπΉ (𝜎p*β„Ž, zβ„Ž)0,𝑀𝐹(3.34)

= β„ŽπΉ

(βˆ’curlπœ‡βˆ’1curlp*β„Ž βˆ’ curly𝑑 βˆ’ 𝜎p*β„Ž, zβ„Ž

)0,𝑀𝐹

+ β„ŽπΉ 𝐡(p*β„Ž βˆ’ p*, zβ„Ž)

+ β„ŽπΉ (curl(y* βˆ’ y*β„Ž), curl zβ„Ž)0,πœ”πΉ, (3.35)

where we have used (curl curly*β„Ž, zβ„Ž)0,𝑇 = 0 for 𝑇 ∈ πœ”πΉ in (3.34). Then, by the inverse estimate andLemma 3.4, a direct estimate leads to

πœ‚(1)𝑝,𝐹 . πœ‚

(1)𝑝,𝑇+ + πœ‚

(1)𝑝,π‘‡βˆ’ + β€–p* βˆ’ p*β„Žβ€–0,𝑀𝐹

+ β€–y*β„Ž βˆ’ y*β€–curl,𝑀𝐹.

(4) For πœ‚(2)𝑦,𝐹 , we define zβ„Ž := 𝑏2

𝐹 𝐸([𝛾𝑛(fβ„Ž + u*β„Ž βˆ’ 𝜎y*β„Ž)]𝐹 ). It is easy to see that βˆ‡zβ„Ž ∈ H0(curl, Ξ©). Takingπœ‘ = βˆ‡zβ„Ž in (2.7a) gives

(f + u* βˆ’ 𝜎y*,βˆ‡zβ„Ž)0,𝑀𝐹= 0. (3.36)

We note by the triangle inequality that

πœ‚(2)𝑦,𝐹 ≀ β„Ž

1/2𝐹 β€–[𝛾𝑛(fβ„Ž + u*β„Ž βˆ’ 𝜎y*β„Ž)]𝐹 β€–0,𝐹 + β„Ž

1/2𝐹 β€–[𝛾𝑛(fβˆ’ fβ„Ž)]𝐹 β€–0,𝐹 . (3.37)

Then we have, again by the property of the bubble functions (3.23) and integration by parts over πœ”πΉ ,

β€–[𝛾𝑛(fβ„Ž + u*β„Ž βˆ’ 𝜎y*β„Ž)]𝐹 β€–20,𝐹 β‰ˆ ([𝛾𝑛(fβ„Ž + u*β„Ž βˆ’ 𝜎y*β„Ž)]𝐹 , zβ„Ž)

0,𝐹

. (div(fβˆ’ 𝜎y*β„Ž), zβ„Ž)0,𝑀𝐹+ ([𝛾𝑛(fβ„Ž βˆ’ f)]𝐹 , zβ„Ž)

0,𝐹

+ (f + u*β„Ž βˆ’ 𝜎y*β„Ž,βˆ‡zβ„Ž)0,πœ”πΉ, (3.38)

where we can use (3.36) to rewrite the last term as

(f + u*β„Ž βˆ’ 𝜎y*β„Ž,βˆ‡zβ„Ž)0,πœ”πΉ= (u*β„Ž βˆ’ u*,βˆ‡zβ„Ž)0,𝑀𝐹

+ (𝜎y* βˆ’ 𝜎y*β„Ž,βˆ‡zβ„Ž)0,𝑀𝐹. (3.39)

Therefore, we obtain from (3.37) to (3.39), the Cauchy’s inequality, applying the inverse estimate toβ€–βˆ‡zβ„Žβ€–0,πœ”πΉ

and the trace inequality to β€–zβ„Žβ€–0,𝐹 (cf. (3.18)) that

πœ‚(2)𝑦,𝐹 . β€–u

* βˆ’ u*β„Žβ€–0,𝑀𝐹+ β€–y* βˆ’ y*β„Žβ€–0,𝑀𝐹

+ πœ‚(2)𝑦,𝑇+ + πœ‚

(2)𝑦,π‘‡βˆ’ + osc𝑇+(f),

where we have used the trivial bound that β„Ž1/2𝐹 β€–[𝛾𝑛(fβˆ’ fβ„Ž)]𝐹 β€–0,𝐹 ≀ osc𝑇+(f). The estimate for πœ‚(2)

𝑝,𝐹 followsfrom the same (even simpler) argument. In fact, we can define zβ„Ž = 𝑏2

𝐹 𝐸([𝛾𝑛(𝜎p*β„Ž)]𝐹 ), which impliesβˆ‡zβ„Ž ∈ H0(curl, Ξ©) and, by (2.7b), (𝜎p*,βˆ‡zβ„Ž)0,Ξ© = 0. Then, a typical calculation gives

β€–[𝛾𝑛(𝜎p*β„Ž)]𝐹 β€–20,𝐹 β‰ˆ ([𝛾𝑛(𝜎p*β„Ž)]𝐹 , zβ„Ž)

0,𝐹= (div(𝜎p*β„Ž), zβ„Ž)0,𝑀𝐹

+ (𝜎p*β„Ž βˆ’ 𝜎p*,βˆ‡zβ„Ž)0,𝑀𝐹,

which yields, by Lemma 3.4 and the inverse estimate,

πœ‚(2)𝑝,𝐹 . β€–p

* βˆ’ p*β„Žβ€–0,𝑀𝐹+ πœ‚

(2)𝑝,𝑇+ + πœ‚

(2)𝑝,π‘‡βˆ’ .

Theorem 3.5 follows now by adding up the above local efficiency estimates over all 𝑇 ∈ Tβ„Ž.

Page 17: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2029

4. Convergence

We devote this whole section to establish our main result that the sequence of adaptively generated finiteelement solutions (y*π‘˜,p*π‘˜,u*π‘˜)π‘˜β‰₯0 converges strongly to the true solution (y*,p*,u*).

Theorem 4.1. Let (y*π‘˜,p*π‘˜,u*π‘˜)π‘˜β‰₯0 be the sequence of discrete triplets generated by the adaptive Algorithm 1and (y*,p*,u*) be the solution to the system (2.7a)–(2.7c). Then we have the strong convergences:

limπ‘˜β†’βˆž

β€–y*π‘˜ βˆ’ y*β€–curl,Ξ© = 0, limπ‘˜β†’βˆž

β€–p*π‘˜ βˆ’ p*β€–curl,Ξ© = 0 and limπ‘˜β†’βˆž

β€–u*π‘˜ βˆ’ u*β€–0,Ξ© = 0. (4.1)

As already pointed out in the introduction, the first step of the proof of convergence is the introduction ofa limiting minimization problem that characterizes the limit of the discrete solutions to the system (2.14a)–(2.14c) while the second step is to show that the solution to the limiting problem actually coincides with theone to (2.7a)–(2.7c). To do so, let us first state some helpful auxiliary results on the convergence behavior of theadaptive meshes Tπ‘˜π‘˜β‰₯0. We associate each triangulation Tπ‘˜ with a mesh-size function β„Žπ‘˜ ∈ 𝐿∞(Ξ©), definedby β„Žπ‘˜|𝑇 = β„Žπ‘‡ for 𝑇 ∈ Tπ‘˜. Thanks to the monotonicity of β„Žπ‘˜π‘˜β‰₯0, we are allowed to define a limiting mesh-sizefunction by the pointwise limit: β„Žπ‘˜(x) β†’ β„Žβˆž(x), as π‘˜ β†’βˆž. It is also known that the pointwise convergence ofβ„Žπ‘˜ can be further improved to the uniform convergence ([48], Lem. 4.2):

limπ‘˜β†’βˆž

β€–β„Žπ‘˜ βˆ’ β„Žβˆžβ€–βˆž,Ξ© = 0. (4.2)

We should note that β„Žβˆž(x) ≑ 0 in general. If β„Žβˆž(x) > 0 at some point x, there is an element 𝑇 containing xand an index π‘˜(x) depending on x such that 𝑇 ∈ T𝑙 for all 𝑙 β‰₯ π‘˜(x). This observation motivates us to split Tπ‘˜

into two classes of elements:T +

π‘˜ :=⋂𝑙β‰₯π‘˜

T𝑙 and T 0π‘˜ := Tπ‘˜βˆ–Tπ‘˜

+. (4.3)

T +π‘˜ consists of the elements in Tπ‘˜ that are not refined after the π‘˜th iteration, while T 0

π‘˜ βŠ‚ Tπ‘˜ contains theelements that are refined at least once in the subsequent iterations. We are thus allowed to decompose thedomain Ξ© into two parts: Ξ©+

π‘˜ :=⋃

π‘‡βˆˆT +π‘˜

𝑇 and Ξ©0π‘˜ :=

β‹ƒπ‘‡βˆˆT 0

π‘˜π‘‡ . A direct application of (4.2) and the uniform

shape regularity of Tπ‘˜ yields the following result (cf. [48], Cor. 7.1):

limπ‘˜β†’βˆž

β€–β„Žπ‘˜β€–βˆž,Ξ©0π‘˜

= 0, (4.4)

where Ξ©0π‘˜ :=

β‹ƒπ‘‡βˆˆT 0

π‘˜

Ω𝑇 is the extended neighborhood of Ξ©0π‘˜. We remark that this uniform convergence result

for the mesh-size functions is crucial for our subsequent analysis. We next give an interesting characterizationof the limiting behavior of 𝐿2-projections Pπ‘˜π‘˜β‰₯0 := PUπ‘˜

π‘˜β‰₯0 with the help of the convergence propertyof the adaptive meshes Tπ‘˜π‘˜β‰₯0, which establishes a connection between the limit of mesh-size functions, the𝐿2-projections and the limiting problem that we shall propose and deal with in the next section.

Proposition 4.2. Let Pπ‘˜π‘˜β‰₯0 be the orthogonal 𝐿2-projections (defined by (2.15)) associated with the adaptivemeshes Tπ‘˜π‘˜β‰₯0 generated by Algorithm 1. Then for each f ∈ L2(Ξ©), the limit of the sequence Pπ‘˜fπ‘˜β‰₯0,denoted by P∞f, exists as π‘˜ β†’ ∞. Furthermore, the corresponding limiting operator P∞ is also an orthogonal𝐿2-projection with the range and kernel given by

ran(P∞) = U∞ :=β‹ƒπ‘˜β‰₯0

Uπ‘˜

𝐿2

and ker(P∞) = UβŠ₯∞. (4.5)

Proof. It suffices to show that the limit of the sequence Pπ‘˜f exists for all f ∈ Cβˆžπ‘ (Ξ©) (infinitely differentiable

vector-valued functions with compact support in Ξ©), by the facts that the operators Pπ‘˜ are uniformly boundedand the space C∞

𝑐 (Ξ©) is dense in L2(Ξ©). Given two iteration indices π‘˜1, π‘˜2 with π‘˜2 > π‘˜1, define T 0π‘˜1,π‘˜2

:=

Page 18: Bowen Li and Jun Zou

2030 B. LI AND J. ZOU

Tπ‘˜1βˆ–(Tπ‘˜1 ∩Tπ‘˜2), which is a subset of T 0π‘˜1 consisting of the elements that are refined between the π‘˜1th iteration

and π‘˜2th iteration. We then have, by the equivalent definition of Pπ‘˜ in (2.15),

β€–Pπ‘˜1fβˆ’ Pπ‘˜2fβ€–20,Ξ© = βˆ‘

π‘‡βˆˆT 0π‘˜1,π‘˜2

βˆ‘π‘‡π‘–βŠ‚π‘‡,π‘‡π‘–βˆˆTπ‘˜2

(1|𝑇 |

βˆ«π‘‡

f(x)dxβˆ’ 1|𝑇𝑖|

βˆ«π‘‡π‘–

f(x)dx)

πœ’π‘‡π‘–

2

0,Ξ©

=βˆ‘

π‘‡βˆˆT 0π‘˜1π‘˜2

βˆ‘π‘‡π‘–βŠ‚π‘‡,π‘‡π‘–βˆˆTπ‘˜2

(1|𝑇 |

βˆ«π‘‡

f(x)dxβˆ’ 1|𝑇𝑖|

βˆ«π‘‡π‘–

f(x)dx)2

|𝑇𝑖|,

where πœ’π‘‡π‘– is the characteristic function of 𝑇𝑖. Recalling the limiting behavior of mesh-size functions β„Žπ‘˜ in(4.4), we have that for any 𝛿 > 0, there exists an index π‘˜(𝛿) depending on 𝛿 such that for all π‘˜ > π‘˜(𝛿), thereholds β€–β„Žπ‘˜β€–βˆž,Ξ©0

π‘˜β‰€ 𝛿. Combining it with the uniform continuity of f, we have that for any πœ€ > 0, there exists an

index π‘˜(πœ€, 𝛿) depending on πœ€ and 𝛿 such that for any integers π‘˜1, π‘˜2 satisfying π‘˜2 > π‘˜1 > π‘˜(πœ€, 𝛿), and for anyelements 𝑇 ∈ T 0

π‘˜1,π‘˜2, 𝑇𝑖 ∈ Tπ‘˜2 with 𝑇𝑖 βŠ‚ 𝑇 , it holds that

1|𝑇 |

βˆ«π‘‡

f(x)dxβˆ’ 1|𝑇𝑖|

βˆ«π‘‡π‘–

f(x)dx≀ πœ€.

We hence have that Pπ‘˜f is a Cauchy sequence in L2(Ξ©). Then it follows from the completeness of L2(Ξ©) thatthe limit of Pπ‘˜f exists. We have proved that for any f ∈ L2(Ξ©), Pβˆžπ‘“ is well-defined, which further allows usto define the bounded linear operator P∞ on L2(Ξ©), by the uniform boundedness principle. We now show thatP∞ is an orthogonal 𝐿2-projection with the property (4.5). To do so, we first observe that for any g ∈ L2(Ξ©),it holds that

(P∞f, g)0,Ξ© = limπ‘›β†’βˆž

(Pπ‘˜f, g)0,Ξ© = limπ‘›β†’βˆž

(Pπ‘˜g, f)0,Ξ© = (f, P∞g)0,Ξ©,

which implies that P∞ is self-adjoint. On the other hand, we have

β€–(P2∞ βˆ’ P∞

)fβ€–0,Ξ© = β€–

(P2∞ βˆ’ Pπ‘˜P∞ + Pπ‘˜P∞ βˆ’ P2

π‘˜ + Pπ‘˜ βˆ’ P∞)fβ€–0,Ξ©

≀ β€–(P∞ βˆ’ Pπ‘˜)(P∞f)β€–0,Ξ© + β€–Pπ‘˜β€–β€–(P∞ βˆ’ Pπ‘˜)fβ€–0,Ξ©

+ β€–(Pπ‘˜ βˆ’ P∞)fβ€–0,Ξ© β†’ 0 as π‘˜ β†’ 0,

that is, P2∞ = P∞. Thus, we can conclude that P∞ is an orthogonal 𝐿2-projection. To characterize its range

and kernel, we denote by P the orthogonal projection associated with the closed space U∞ defined in (4.5).By definition, there holds ran(P) = U∞ and ker(P) = UβŠ₯

∞. We readily see from the definition of P∞ thatran(P∞) βŠ‚ U∞, since every element in ran(P∞) can be approximated by a sequence from

β‹ƒπ‘˜β‰₯0 Uπ‘˜. Conversely,

to prove U∞ βŠ‚ ran(P∞), we note that for any f ∈ U𝑛, Pπ‘˜f = f holds for each π‘˜ β‰₯ 𝑛. Then, letting π‘˜ β†’βˆž givesus

P∞f = limπ‘›β†’βˆž

Pπ‘˜f = f,

which indicates U𝑛 βŠ‚ ran(P∞). Since 𝑛 is arbitrary, we readily see⋃

π‘˜β‰₯0 Uπ‘˜πΏ2

βŠ‚ ran(P∞). The proof iscomplete.

4.1. The limiting problem

In order to find the limit point of the discrete triplets (y*π‘˜,p*π‘˜,u*π‘˜)π‘˜β‰₯0, we first define several limiting spacesas the closure of the union of discrete spaces at each level:

V∞ :=β‹ƒπ‘˜β‰₯0

Vπ‘˜

H(curl)

, Uπ‘Žπ‘‘βˆž :=

β‹ƒπ‘˜β‰₯0

Uπ‘Žπ‘‘π‘˜

𝐿2

. (4.6)

Page 19: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2031

which immediately implies Uπ‘Žπ‘‘βˆž = U∞ ∩Uπ‘Žπ‘‘, where the space U∞ is defined in (4.5). Since Uπ‘Žπ‘‘

∞ is a convexsubset of L2(Ω), it is closed in the weak topology if and only if it is closed in the strong topology induced bythe norm. We hence have the following key lemma.

Lemma 4.3. Let uπ‘˜ ∈ Uπ‘Žπ‘‘π‘˜ , π‘˜ β‰₯ 0, be a sequence weakly converging to a u in L2(Ξ©). Then u ∈ Uπ‘Žπ‘‘

∞ holds.

We now consider the following limiting problem defined on the limiting spaces:

minimize 𝐽(y∞,u∞) =12β€–curly∞ βˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–u∞ βˆ’ u𝑑‖20,Ξ© (4.7)

over (y∞,u∞) ∈ V∞ Γ—Uπ‘Žπ‘‘βˆž

subject to 𝐡(y∞,πœ‘βˆž) = (f + u∞,πœ‘βˆž)0,Ξ© βˆ€ πœ‘βˆž ∈ V∞. (4.8)

The existence and uniqueness of the minimizer to the above problem can be obtained by the standard argumentsas the continuous and discrete cases, and the reduced cost functional for the limiting problem is defined by

𝐽∞(u∞) :=12β€–curly∞(u∞)βˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–u∞ βˆ’ u𝑑‖20,Ξ©,

where y∞(u∞) denotes the solution to the equation (4.8) with the source u∞. The rest of this subsection isdevoted to proving that the limit point of (y*π‘˜,p*π‘˜,u*π‘˜)π‘˜β‰₯0 exists and happens to be the solution to the limitingoptimization problem. As pointed out in the introduction, to compensate the lack of the discrete compactness,we shall investigate the weak limit of the sequence u*π‘˜π‘˜β‰₯0 first, and then improve it to the strong convergence;see the next two theorems.

Theorem 4.4. Suppose that (u*π‘˜,y*π‘˜)π‘˜β‰₯0 is the sequence of discrete optimal controls and states generated byAlgorithm 1, and (u*∞,y*∞) is the optimal control and state to the limiting optimization problem (4.7) and (4.8).Then, we have the following weak convergences: as π‘˜ β†’βˆž,

u*π‘˜π‘€ u*∞ in L2(Ξ©) and y*π‘˜

𝑀 y*∞ in H0(curl, Ξ©).

Proof. Our proof starts with a simple but important observation that the sequence u*π‘˜ is bounded, which isnot a trivial fact due to the unboundedness of Uπ‘Žπ‘‘. Indeed, for a fixed u ∈ Uπ‘Žπ‘‘, we have

𝛼

2β€–u*π‘˜ βˆ’ u𝑑‖20,Ξ© ≀ π½π‘˜(u*π‘˜) ≀ π½π‘˜(Pπ‘˜u) =

12β€–curlyπ‘˜(Pπ‘˜u)βˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–Pπ‘˜uβˆ’ u𝑑‖20,Ξ©

. β€–uβ€–20,Ξ© + β€–y𝑑‖20,Ξ© + β€–u𝑑‖20,Ξ©,

which gives the boundedness of u*π‘˜ in L2(Ξ©). Then the boundedness of y*π‘˜ follows immediately. Hence, bythe Banach-Alaoglu theorem, we can extract a subsequence (y*π‘˜π‘›

,u*π‘˜π‘›)𝑛β‰₯1 of (y*π‘˜,u*π‘˜)π‘˜β‰₯0 such that u*π‘˜π‘›

𝑀 w

in L2(Ξ©) and y*π‘˜π‘›

𝑀 y in H(curl, Ξ©), as 𝑛 tends to infinity, which further implies that y*π‘˜π‘›

and curly*π‘˜π‘›

weakly converge to y and curly, respectively, in L2(Ξ©) (since H(curl, Ξ©) is continuously imbedded in L2(Ξ©)and curl is a bounded linear operator from H(curl, Ξ©) to L2(Ξ©)). Moreover, Lemma 4.3 yields w ∈ Uπ‘Žπ‘‘

∞ . NotingV𝑙 βŠ‚ Vπ‘˜π‘›

for 𝑙 ≀ π‘˜π‘› and that there holds

𝐡(y*π‘˜π‘›, v𝑙) = (f + u*π‘˜π‘›

, v𝑙)0,Ξ© for 𝑙 ≀ π‘˜π‘› and v𝑙 ∈ V𝑙, (4.9)

we let 𝑛 tend to infinity in (4.9) and obtain, by the weak convergences of u*π‘˜π‘› and y*π‘˜π‘›

,

𝐡(y, v𝑙) = (f + w, v𝑙)0,Ξ© for all 𝑙 β‰₯ 0 and v𝑙 ∈ V𝑙.

Since⋃

𝑙β‰₯0 V𝑙 is dense in V∞, we readily have

𝐡(y, v∞) = (f + w, v∞)0,Ξ© βˆ€ v∞ ∈ V∞ (4.10)

Page 20: Bowen Li and Jun Zou

2032 B. LI AND J. ZOU

which means that (y,w) satisfies the constraint (4.8) of the limiting problem, that is, y = y∞(w).We claim that w is the minimizer u*∞ of the cost functional 𝐽∞ over the set Uπ‘Žπ‘‘

∞ , namely,

𝐽∞(w) ≀ 𝐽∞(u∞) βˆ€ u∞ ∈ Uπ‘Žπ‘‘βˆž , (4.11)

which, by (4.10), also implies that the weak limit y of the sequence y*π‘˜π‘› is y*∞. To prove the claim, we first

note

𝐽∞(w) =12β€–curly∞(w)βˆ’ y𝑑‖20,Ξ© +

𝛼

2β€–wβˆ’ u𝑑‖20,Ξ©

≀ lim infπ‘›β†’βˆž

12β€–curly*π‘˜π‘›

βˆ’ y𝑑‖20,Ξ© + lim infπ‘›β†’βˆž

𝛼

2β€–u*π‘˜π‘›

βˆ’ u𝑑‖20,Ξ© = lim infπ‘›β†’βˆž

π½π‘˜π‘›(u*π‘˜π‘›

), (4.12)

by the uniform boundedness principle and the weak converges of curly*π‘˜π‘› and u*π‘˜π‘›

to curly∞(w) and w inL2(Ξ©). Clearly, by (4.12) and the fact that u*π‘˜π‘›

is the minimizer of π½π‘˜π‘› over Uπ‘Žπ‘‘π‘˜π‘›

, there holds, for any sequence,uπ‘˜ ∈ Uπ‘Žπ‘‘

π‘˜ , π‘˜ β‰₯ 0,𝐽∞(w) ≀ lim inf

π‘›β†’βˆžπ½π‘˜π‘›

(u*π‘˜π‘›) ≀ lim sup

π‘›β†’βˆžπ½π‘˜π‘›

(u*π‘˜π‘›) ≀ lim sup

π‘˜β†’βˆžπ½π‘˜(uπ‘˜). (4.13)

We next prove an auxiliary fact that for any u∞ ∈ Uπ‘Žπ‘‘βˆž and a sequence uπ‘˜ ∈ Uπ‘Žπ‘‘

π‘˜ for π‘˜ β‰₯ 0, if β€–uπ‘˜βˆ’uβˆžβ€–0,Ξ© β†’ 0as π‘˜ β†’βˆž, then

limπ‘˜β†’βˆž

β€–y∞(u∞)βˆ’ yπ‘˜(uπ‘˜)β€–curl,Ξ© = 0, (4.14)

which readily giveslim

π‘˜β†’βˆžπ½π‘˜(uπ‘˜) = 𝐽∞(u∞). (4.15)

This fact (4.15), along with (4.13), completes our proof of the claim (4.11). To prove (4.14), we note, by theassumption,

lim supπ‘˜β†’βˆž

β€–y∞(u∞)βˆ’ yπ‘˜(uπ‘˜)β€–curl,Ξ© ≀ lim supπ‘˜β†’βˆž

(β€–y∞(u∞)βˆ’ yπ‘˜(u∞)β€–curl,Ξ© + β€–yπ‘˜(u∞)βˆ’ yπ‘˜(uπ‘˜)β€–curl,Ξ©)

. lim supπ‘˜β†’βˆž

( infvπ‘˜βˆˆπ‘‰π‘˜

β€–y∞(u∞)βˆ’ vπ‘˜β€–curl,Ξ© + β€–u∞ βˆ’ uπ‘˜β€–0,Ξ©) = 0,

where we have used the Galerkin orthogonality and the density of⋃

π‘˜β‰₯0 Vπ‘˜ in V∞.By the above arguments, we can conclude that for any subsequence (u*π‘˜π‘›

,y*π‘˜π‘›)𝑛β‰₯1 of (u*π‘˜,y*π‘˜)π‘˜β‰₯0, we can

extract a subsequence weakly converging to (u*∞,y*∞) in L2(Ξ©) Γ—H0(curl, Ξ©), which immediately yields theweak convergence of the whole sequence (u*π‘˜,y*π‘˜)π‘˜β‰₯0:

u*π‘˜π‘€ u*∞ in L2(Ξ©) and y*π‘˜

𝑀 y*∞ in H0(curl, Ξ©), as π‘˜ β†’βˆž.

The proof is complete.

Thanks to the above results, we are ready to show the main theorem of this subsection.

Theorem 4.5. Under the same assumptions as in Theorem 4.4, there holds

limπ‘˜β†’βˆž

β€–u*π‘˜ βˆ’ u*βˆžβ€–0,Ξ© = 0 and limπ‘˜β†’βˆž

β€–y*π‘˜ βˆ’ y*βˆžβ€–curl,Ξ© = 0. (4.16)

Proof. We note from the claim (4.14) in the proof of Theorem 4.4 that the second convergence in (4.16) isa consequence of the first one. Hence, it suffices to prove the convergence of u*π‘˜π‘˜β‰₯0. To this end, a directcalculation gives

β€–curly*π‘˜ βˆ’ curly*βˆžβ€–20,Ξ© + 𝛼‖u*π‘˜ βˆ’ u*βˆžβ€–20,Ξ©

Page 21: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2033

= β€–curly*π‘˜ βˆ’ y𝑑 + y𝑑 βˆ’ curly*βˆžβ€–20,Ξ© + 𝛼‖u*π‘˜ βˆ’ u𝑑 + u𝑑 βˆ’ u*βˆžβ€–20,Ξ©

= β€–curly*π‘˜ βˆ’ y𝑑‖20,Ξ© + β€–curly*∞ βˆ’ y𝑑‖20,Ξ© βˆ’ 2(curly*π‘˜ βˆ’ y𝑑, curly*∞ βˆ’ y𝑑

)0,Ξ©

+ 𝛼‖u*π‘˜ βˆ’ u𝑑‖20,Ξ© + 𝛼‖u*∞ βˆ’ u𝑑‖20,Ξ© βˆ’ 2𝛼(u*π‘˜ βˆ’ u𝑑,u*∞ βˆ’ u𝑑

)0,Ξ©

,

which, by taking the upper limit on both sides and using Theorem 4.4, implies

2𝐽∞(u*∞) + lim supπ‘˜β†’βˆž

(β€–curly*π‘˜ βˆ’ curly*βˆžβ€–20,Ξ© + 𝛼‖u*π‘˜ βˆ’ u*βˆžβ€–20,Ξ©

)≀ lim sup

π‘˜β†’βˆž2π½π‘˜(u*π‘˜). (4.17)

Then it follows that𝐽∞(u*∞) ≀ lim sup

π‘˜β†’βˆžπ½π‘˜(u*π‘˜). (4.18)

We choose a sequence uπ‘˜ ∈ Uπ‘Žπ‘‘π‘˜ , π‘˜ β‰₯ 0, such that β€–uπ‘˜ βˆ’ u*βˆžβ€–0,Ξ© β†’ 0, as π‘˜ β†’ ∞, and then we can derive, by

(4.18) and (4.15), that𝐽∞(u*∞) ≀ lim sup

π‘˜β†’βˆžπ½π‘˜(u*π‘˜) ≀ lim

π‘˜β†’βˆžπ½π‘˜(uπ‘˜) = 𝐽∞(u*∞). (4.19)

Combining the above estimate (4.19) with (4.17), we obtain the strong convergence of u*π‘˜π‘˜β‰₯0.

It is easy to write the optimality system for the limiting problem by a standard argument:⎧βŽͺβŽͺ⎨βŽͺβŽͺ⎩𝐡(y*∞,πœ‘βˆž) = (f + u*∞,πœ‘βˆž)0,Ξ© βˆ€ πœ‘βˆž ∈ V∞,

𝐡(p*∞,πœ™βˆž) =(curly*∞ βˆ’ y𝑑, curlπœ™βˆž

)0,Ξ©

βˆ€ πœ™βˆž ∈ V∞,(p*∞ + 𝛼

(u*∞ βˆ’ u𝑑

),u∞ βˆ’ u*∞

)0,Ξ©

β‰₯ 0 βˆ€ u∞ ∈ Uπ‘Žπ‘‘βˆž ,

(4.20a)

(4.20b)

(4.20c)

and see that the sequence of discrete triplets (y*π‘˜,p*π‘˜,u*π‘˜)π‘˜β‰₯0 converges strongly to the solution (y*∞,p*∞,u*∞)to the limiting optimality system (4.20a)–(4.20c). We remark that the variational inequality (4.20c) will be usedin the next subsection.

4.2. Proof of convergence

We have proved the strong convergence of the discrete solutions (y*π‘˜,p*π‘˜,u*π‘˜) in Section 4.1, where we haveonly used the structure of the control problem, while the adaptive process does not have an essential involvement.We shall see that in the subsequent analysis, the error estimator πœ‚β„Ž defined in (3.2) and the marking requirement(3.3) in Algorithm 1 play a crucial role. To complete the proof of Theorem 4.1, we start with the following keylemma.

Lemma 4.6. Let π‘‡π‘˜ ∈ Tπ‘˜ be one of the elements that achieve the maximum value of πœ‚π‘˜(𝑇 ) over 𝑇 ∈ Tπ‘˜, i.e.,πœ‚π‘˜

(π‘‡π‘˜

):= max

π‘‡βˆˆTπ‘˜

πœ‚π‘˜(𝑇 ). Then we have

limπ‘˜β†’βˆž

πœ‚π‘˜

(π‘‡π‘˜

)= 0. (4.21)

Proof. We start with the estimates for πœ‚(1)

𝑦,𝑇 and πœ‚(1)

𝑦,𝐹 on a given mesh Tπ‘˜. A direct application of the inverseestimate and the triangle inequality gives

πœ‚(1)𝑦,𝑇 . β„Žπ‘‡ β€–fβ€–0,𝑇 + β„Žπ‘‡ β€–u*β„Žβ€–0,𝑇 + β€–curly*β„Žβ€–0,𝑇 + β„Žπ‘‡ β€–y*β„Žβ€–0,𝑇 ,

and, by the assumption of πœ‡ and the trace inequality (3.18), we have

πœ‚(1)𝑦,𝐹 . β„Ž

1/2𝐹 β€–(curly*β„Ž)|βˆ’β€–0,𝐹 + β„Ž

1/2𝐹 β€–(curly*β„Ž)|+β€–0,𝐹 . β€–curly*β„Žβ€–0,πœ”πΉ

.

Page 22: Bowen Li and Jun Zou

2034 B. LI AND J. ZOU

We now consider πœ‚(2)

𝑦,𝑇 and πœ‚(2)

𝑦,𝐹 . Similarly, we have

πœ‚(2)𝑦,𝑇 . β„Žπ‘‡ β€–div fβ€–0,𝑇 + β€–y*β„Žβ€–0,𝑇 ,

and

πœ‚(2)𝑦,𝐹 . β„Ž

1/2𝐹 β€–[𝛾𝑛(f)]𝐹 β€–0,𝐹 + β„Ž

1/2𝐹 β€–[u*β„Ž]𝐹 β€–0,𝐹 + β„Ž

1/2𝐹 β€–[y*β„Ž]𝐹 β€–0,𝐹 (4.22)

by the triangle inequality. Then the trace inequality (3.18) gives

β„Ž1/2𝐹 β€–[u*β„Ž]𝐹 β€–0,𝐹 + β„Ž

1/2𝐹 β€–[y*β„Ž]𝐹 β€–0,𝐹 . β€–u*β„Žβ€–0,πœ”πΉ

+ β€–y*β„Žβ€–curl,πœ”πΉ, (4.23)

since for 𝑇 ∈ πœ”πΉ , u*β„Ž|𝑇 is a constant vector and y*β„Ž|𝑇 ∈ H1(𝑇 ) satisfies√

2|βˆ‡y*β„Ž| = |curly*β„Ž| which can bedirectly checked by using y*β„Ž|𝑇 = a𝑇 Γ—x+b𝑇 for some a𝑇 , b𝑇 ∈ R3. It is clear from (2.4) that β€–[𝛾𝑛(f)]𝐹 β€–0,𝐹 > 0,only if 𝐹 βŠ‚ Ξ“ = βˆͺπ‘š

𝑖=1πœ•Ξ©π‘–βˆ–πœ•Ξ©. Hence, it follows from (4.22) and (4.23) that

πœ‚(2)𝑦,𝐹 . β„Ž

1/2𝐹 β€–[𝛾𝑛(f)]𝐹 β€–0,πΉβˆ©Ξ“ + β€–u*β„Žβ€–0,πœ”πΉ

+ β€–y*β„Žβ€–curl,πœ”πΉ.

The same analysis applies to the other terms in πœ‚π‘˜. Then a careful but straightforward computation shows, for𝑇 ∈ Tπ‘˜,

πœ‚π‘˜(𝑇 ) . β€–y*π‘˜β€–curl,πœ”π‘‡+ β€–p*π‘˜β€–curl,πœ”π‘‡

+ β€–u*π‘˜β€–0,πœ”π‘‡+ β„Žπ‘‡ β€–curly𝑑‖0,𝑇

+ β„Žπ‘‡ β€–fβ€–div,𝑇 +βˆ‘

πΉβˆˆπœ•π‘‡βˆ©Ξ“

β„Ž1/2𝐹 β€–[𝛾𝑛(f)]𝐹 β€–0,𝐹 . (4.24)

Here πœ”π‘‡ denotes the union of elements that share a common face with 𝑇 . Then, it follows from (4.24) and theestimate

β„Žπ‘‡π‘˜β€–fβ€–div,π‘‡π‘˜

+βˆ‘

πΉβˆˆπœ• π‘‡π‘˜βˆ©Ξ“

β„Ž1/2𝐹 β€–[𝛾𝑛(f)]𝐹 β€–0,𝐹 . β„Ž

1/2π‘‡π‘˜

(β€–fβ€–0,Ξ© +

π‘šβˆ‘π‘–=1

β€–div fβ€–0,Ω𝑖+ β€–[𝛾𝑛(f)]Ξ“β€–0,Ξ“

)

that

πœ‚π‘˜

(π‘‡π‘˜

). β„Ž

1/2π‘‡π‘˜

C(f) + β€–y*π‘˜ βˆ’ y*βˆžβ€–curl,Ξ© + β€–p*π‘˜ βˆ’ p*βˆžβ€–curl,Ξ© + β€–u*π‘˜ βˆ’ u*βˆžβ€–0,Ξ©

+ β„Žπ‘‡π‘˜β€–curly𝑑‖0,π‘‡π‘˜

+ β€–yβˆžβ€–curl,πœ” π‘‡π‘˜+ β€–pβˆžβ€–curl,πœ” π‘‡π‘˜

+ β€–uβˆžβ€–0,πœ” π‘‡π‘˜, (4.25)

where C(f) := β€–fβ€–0,Ξ© +βˆ‘π‘š

𝑖=1β€–div fβ€–0,Ω𝑖 + β€–[𝛾𝑛(f)]Ξ“β€–0,Ξ“ is well-defined by (2.4). Hence, by the definition of πœ‚β„Ž,

we have the estimate for πœ‚π‘˜

(π‘‡π‘˜

):

πœ‚π‘˜

(π‘‡π‘˜

). πœ‚π‘˜

(π‘‡π‘˜

)+ β€–u𝑑‖0,π‘‡π‘˜

, (4.26)

where πœ‚π‘˜

(π‘‡π‘˜

)has been bounded by (4.25). Since π‘‡π‘˜ will be marked in the (π‘˜ + 1)th iteration, it holds that

π‘‡π‘˜ ∈ T 0π‘˜ and |πœ”π‘‡π‘˜

| . β€–β„Žπ‘˜β€–3∞,Ξ©0π‘˜

β†’ 0 as π‘˜ β†’βˆž,

by (3.5) and the uniform convergence (4.4) of adaptive meshes. Taking advantage of the absolute continuity ofthe Lebesgue integral and Theorem 4.5, we can get the desired vanishing limit of

πœ‚π‘˜

(π‘‡π‘˜

)from (4.25) and

(4.26) when π‘˜ tends to infinity. The proof is complete.

Page 23: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2035

We are now well-prepared to show that the limiting state y*∞ and adjoint state p*∞ actually satisfy thevariational problems (2.7a) and (2.7b), respectively. It is worth emphasizing that in our proof, there is noneed to introduce a buffer layer of elements between the meshes at different levels as in [61], by virtue of thegeneralized convergence result of mesh-size functions (4.4).

Lemma 4.7. Suppose that (y*∞,p*∞,u*∞) is the solution to the limiting optimality system (4.20a)–(4.20c). Thenit satisfies the variational problems (2.7a) and (2.7b), namely,

𝐡(y*∞,πœ™) = (f + u*∞,πœ™)0,Ξ© βˆ€ πœ™ ∈ H0(curl, Ξ©), (4.27)

𝐡(p*∞,πœ™) =(curly*∞ βˆ’ y𝑑, curlπœ™

)0,Ξ©

βˆ€ πœ™ ∈ H0(curl, Ξ©). (4.28)

Proof. We only prove (4.27) since the proof of (4.28) is similar. For this, we first introduce the following residualfunctionals on H0(curl, Ξ©):

β„›(Β·) := 𝐡(y*∞, Β·)βˆ’ (f + u*∞, Β·)0,Ξ©,

andβ„›π‘˜(Β·) := 𝐡(y*π‘˜, Β·)βˆ’ (f + u*π‘˜, Β·)0,Ξ©, π‘˜ β‰₯ 0.

Note that Theorem 4.5 gives us the operator-norm convergence:

limπ‘˜β†’βˆž

supπœ™βˆˆH0(curl,Ξ©)

|β„›(πœ™)βˆ’β„›π‘˜(πœ™)|β€–πœ™β€–H0(curl,Ξ©)

= 0. (4.29)

Therefore, to prove β„› is actually a zero functional, it is sufficient to show

limπ‘˜β†’βˆž

β„›π‘˜(πœ™) = 0 βˆ€ πœ™ ∈ Cβˆžπ‘ (Ξ©),

by the operator-norm convergence (4.29) and the density of Cβˆžπ‘ (Ξ©) in H0(curl, Ξ©). Recall the standard interpo-

lation operator Ξ β„Ž : H𝑠(curl, 𝐺) β†’ Vβ„Ž(𝐺) for 1/2 + 𝛿 ≀ 𝑠 ≀ 1, 𝛿 > 0, associated with Nedelec’s edge elements,and the corresponding error estimate (cf. [44], Thm. 5.41):

β€–vβˆ’ Ξ β„Žvβ€–curl,𝐺 . β„Žπ‘ (β€–v‖𝑠,𝐺 + β€–curl v‖𝑠,𝐺), (4.30)

where 𝐺 is a polyhedral Lipschitz subdomain of Ξ© and Vβ„Ž(𝐺) is the lowest-order conforming edge elementspace without the specified boundary condition. Defining w := πœ™βˆ’ Ξ π‘˜πœ™ ∈ H0(curl, Ξ©) and applying the quasi-interpolation operator Ξ π‘˜ introduced in Lemma 3.1 to w, we have, by definition of β„›π‘˜ and Ξ π‘˜πœ™, Ξ π‘˜w ∈ Vπ‘˜,

β„›π‘˜(πœ™) = 𝐡(y*π‘˜,πœ™)βˆ’ (f + u*π‘˜,πœ™)0,Ξ© = 𝐡(y*π‘˜,wβˆ’Ξ β„Žw)βˆ’ (f + u*π‘˜,wβˆ’Ξ β„Žw)0,Ξ©.

By the splitting of the mesh Tπ‘˜ introduced in (4.3), a derivation similar to the one for (3.17) and (3.19) gives

|β„›π‘˜(πœ™)| = |β„›π‘˜(wβˆ’Ξ β„Žw)| .βˆ‘

π‘‡βˆˆTπ‘˜

(πœ‚(1)𝑦,𝑇 + πœ‚

(2)𝑦,𝑇

)β€–wβ€–curl,Ω𝑇

+βˆ‘

π‘‡βˆˆTπ‘˜

βˆ‘πΉβˆˆπœ•π‘‡βˆ©Ξ©

(πœ‚(1)𝑦,𝐹 + πœ‚

(2)𝑦,𝐹

)β€–wβ€–curl,Ω𝑇

.βˆ‘

π‘‡βˆˆT +𝑙

πœ‚π‘˜(𝑇 )β€–πœ™βˆ’ Ξ π‘˜πœ™β€–0,Ω𝑇+

βˆ‘π‘‡βˆˆTπ‘˜βˆ–T +

𝑙

πœ‚π‘˜(𝑇 )β€–πœ™βˆ’ Ξ π‘˜πœ™β€–0,Ω𝑇, (4.31)

where 𝑙 is an iteration index less than π‘˜ (clearly, T +𝑙 βŠ‚ Tπ‘˜). Note that

πœ‚π‘˜(𝑇 ) ≀ πœ‚π‘˜

(π‘‡π‘˜

)= max

π‘‡βˆˆTπ‘˜

πœ‚π‘˜(𝑇 )

Page 24: Bowen Li and Jun Zou

2036 B. LI AND J. ZOU

holds for all 𝑇 ∈ T +𝑙 , where π‘‡π‘˜ is as defined in Lemma 4.6. We also observe from (4.30) that

β€–πœ™βˆ’ Ξ π‘˜πœ™β€–0,Ω𝑇. β€–πœ™β€–π»π‘ (curl,Ξ©).

Hence, by these two observations and Cauchy’s inequality, (4.31) implies

|β„›π‘˜(πœ™)| . #(T +𝑙 )πœ‚π‘˜

(π‘‡π‘˜

)β€–πœ™β€–π»π‘ (curl,Ξ©) +

βŽ›βŽ βˆ‘π‘‡βˆˆTπ‘˜βˆ–T +

𝑙

πœ‚2π‘˜(𝑇 )

⎞⎠1/2

β€–πœ™βˆ’ Ξ π‘˜πœ™β€–0,Ξ©0𝑙. (4.32)

To derive the second term in the right-hand side of (4.32), we have also used another observation, by definitionand the shape regularity of the meshes, that for any 𝑇 ∈ Tπ‘˜βˆ–T +

𝑙 , there exists an element 𝑇 β€² ∈ T 0𝑙 such that

𝑇 βŠ‚ 𝑇 β€² and the extended neighborhood Ω𝑇 of 𝑇 in Tπ‘˜ is contained in the extended neighborhood Ω𝑇 β€² of 𝑇 β€² inT𝑙, i.e., Ω𝑇 βŠ‚ Ω𝑇 β€² , which, along with the properties (3.6), allows us to writeβˆ‘

π‘‡βˆˆTπ‘˜βˆ–T +𝑙

β€–πœ™βˆ’ Ξ π‘˜πœ™β€–20,Ω𝑇=

βˆ‘π‘‡ β€²βˆˆT 0

𝑙

βˆ‘π‘‡βˆˆTπ‘˜,π‘‡βŠ‚π‘‡ β€²

β€–πœ™βˆ’ Ξ π‘˜πœ™β€–20,Ω𝑇≀ 𝐢

βˆ‘π‘‡ β€²βˆˆT 0

𝑙

β€–πœ™βˆ’ Ξ π‘˜πœ™β€–20,Ω𝑇 β€²,

with 𝐢 independent of the meshes. For a fixed 𝑙, by Lemma 4.6, the first term in (4.32) vanishes when π‘˜ tendsto infinity. For the second term in (4.32), we note from the inequality (4.24) and the boundedness of numericalsolutions (y*π‘˜,p*π‘˜,u*π‘˜) that βˆ‘

π‘‡βˆˆTπ‘˜βˆ–T +𝑙

πœ‚2π‘˜(𝑇 ) ≀

βˆ‘π‘‡βˆˆTπ‘˜

πœ‚2π‘˜(𝑇 ) ≀ 𝐢, (4.33)

where the constant 𝐢 is independent of π‘˜. We hence have from (4.32) that

lim supπ‘˜β†’βˆž

|β„›π‘˜(πœ™)| . lim supπ‘˜β†’βˆž

β€–πœ™βˆ’ Ξ π‘˜πœ™β€–0,Ξ©0𝑙. (4.34)

We note that Ξ©0𝑙 inherits a triangulation from the mesh Tπ‘˜ in the sense that Ξ©0

𝑙 =⋃

π‘‡βˆˆTπ‘˜,π‘‡βŠ‚Ξ©0𝑙𝑇 . Then the

interpolation error estimate (4.30) gives

β€–πœ™βˆ’ Ξ π‘˜πœ™β€–0,Ξ©0𝑙. β€–β„Žπ‘™β€–π‘ 

∞,Ξ©0𝑙

β€–πœ™β€–π»π‘ (curl,Ξ©),

which, by (4.34), implies

lim supπ‘˜β†’βˆž

|β„›π‘˜(πœ™)| . β€–β„Žπ‘™β€–π‘ βˆž,Ξ©0

𝑙

β€–πœ™β€–π»π‘ (curl,Ξ©). (4.35)

Letting 𝑙 β†’βˆž in (4.35) and recalling the uniform convergence of mesh funcitons (4.4), we can readily concludethat for all πœ™ ∈ C∞

𝑐 (Ξ©), there holdsβ„›(πœ™) = lim

π‘˜β†’βˆžβ„›π‘˜(πœ™) = 0,

which completes the proof.

Lemma 4.7 suggests that it suffices to show that u*∞ is actually the minimizer to the cost functional 𝐽(u)over the admissible set Uπ‘Žπ‘‘. For this, we prove the following result by exploiting the similar splitting of Tπ‘˜ asin Lemma 4.7 and some fundamental properties of 𝐿2-projections.

Theorem 4.8. Under the same assumptions as in Lemma 4.7, u*∞ satisfies the variational inequality:(p*∞ + 𝛼

(u*∞ βˆ’ u𝑑

),uβˆ’ u*∞

)0,Ξ©

β‰₯ 0 βˆ€ u ∈ Uπ‘Žπ‘‘. (4.36)

Page 25: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2037

Proof. Note that the convex set D+(Ξ©) := v ∈ Cβˆžπ‘ (Ξ©) | v β‰₯ 0 is dense in Uπ‘Žπ‘‘ with respect to the 𝐿2-norm.

It suffices to show that the variational inequality (4.36) holds for all u in D+(Ξ©). However, we have(p*∞ + 𝛼

(u*∞ βˆ’ u𝑑

),uβˆ’ u*∞

)0,Ξ©

β‰₯ 0 βˆ€ u ∈ Uπ‘Žπ‘‘βˆž ,

since (y*∞,p*∞,u*∞) is the solution to the limiting optimality system (4.20a)–(4.20c). The above fact, along withthe relation Uπ‘Žπ‘‘

π‘˜ βŠ‚ Uπ‘Žπ‘‘βˆž , implies that for all u ∈ D+(Ξ©), there holds(

p*∞ + 𝛼(u*∞ βˆ’ u𝑑

),uβˆ’ Pπ‘˜u + Pπ‘˜uβˆ’ u*∞

)0,Ξ©

β‰₯(p*∞ + 𝛼

(u*∞ βˆ’ u𝑑

),uβˆ’ Pπ‘˜u

)0,Ξ©

. (4.37)

It follows from the strong convergence of (y*π‘˜,p*π‘˜,u*π‘˜) and β€–uβˆ’ Pπ‘˜uβ€–0,Ξ© ≀ 2β€–uβ€–0,Ξ© that

lim infπ‘˜β†’βˆž

(p*∞ + 𝛼

(u*∞ βˆ’ u𝑑

),uβˆ’ Pπ‘˜u

)0,Ξ©

= lim infπ‘˜β†’βˆž

(p*π‘˜ + 𝛼

(u*π‘˜ βˆ’ u𝑑

),uβˆ’ Pπ‘˜u

)0,Ξ©

. (4.38)

Noting that Iβˆ’ Pπ‘˜ is an orthogonal 𝐿2-projection and using the Poincare inequality for (Iβˆ’ Pπ‘˜)u:

β€–(Iβˆ’ Pπ‘˜)uβ€–0,𝑇 . β„Žπ‘‡ β€–βˆ‡uβ€–0,𝑇 ,

we derive(p*π‘˜ + 𝛼

(u*π‘˜ βˆ’ u𝑑

),uβˆ’ Pπ‘˜u

)0,Ξ©

=(

(Iβˆ’ Pπ‘˜)(p*π‘˜ βˆ’ 𝛼u𝑑

), (Iβˆ’ Pπ‘˜)u

)0,Ξ©

.βˆ‘

π‘‡βˆˆTπ‘˜

(β„Žπ‘‡ β€–p*π‘˜ βˆ’ Pπ‘˜p

*π‘˜β€–0,𝑇 + β„Žπ‘‡ β€–u𝑑 βˆ’ Pπ‘˜u

𝑑‖0,𝑇

)β€–βˆ‡uβ€–0,𝑇

.βˆ‘

π‘‡βˆˆT +𝑙

β„Žπ‘‡ Β· πœ‚π‘˜(𝑇 )β€–βˆ‡uβ€–0,𝑇 +βˆ‘

π‘‡βˆˆTπ‘˜βˆ–T +𝑙

β„Žπ‘‡ Β· πœ‚π‘˜(𝑇 )β€–βˆ‡uβ€–0,𝑇 , (4.39)

by the definition of πœ‚π‘˜(𝑇 ). Here we have used the same splitting of Tπ‘˜ as the one in Lemma 4.7 with 𝑙 beinga fixed iteration index less than π‘˜. Similarly to the proof of Lemma 4.7, we estimate the two terms in (4.39),respectively, as follows. For the first term in (4.39), we haveβˆ‘

π‘‡βˆˆT +𝑙

β„Žπ‘‡ Β· πœ‚π‘˜(𝑇 )β€–βˆ‡uβ€–0,𝑇 . #(T +𝑙 )πœ‚π‘˜

(π‘‡π‘˜

)β€–βˆ‡uβ€–0,Ξ©,

which, by Lemma 4.6, vanishes as π‘˜ β†’βˆž. For the second term in (4.39), note from (4.33) thatβˆ‘π‘‡βˆˆTπ‘˜βˆ–T +

𝑙

πœ‚2π‘˜(𝑇 ) =

βˆ‘π‘‡βˆˆTπ‘˜βˆ–T +

𝑙

πœ‚2π‘˜(𝑇 ) + osc2

𝑇

(u𝑑)≀ 𝐢

holds with the constant 𝐢 independent of π‘˜. Then, by Cauchy’s inequality, we haveβˆ‘π‘‡βˆˆTπ‘˜βˆ–T +

𝑙

β„Žπ‘‡ Β· πœ‚π‘˜(𝑇 )β€–βˆ‡uβ€–0,𝑇 . β€–β„Žπ‘™β€–βˆž,Ξ©0π‘™β€–βˆ‡uβ€–0,Ξ©,

which vanishes when 𝑙 tends to infinity. By above arguments, we have proven

limπ‘˜β†’βˆž

|(p*π‘˜ + 𝛼(u*π‘˜ βˆ’ u𝑑),uβˆ’ Pπ‘˜u)0,Ξ©| = 0. (4.40)

In view of (4.37), (4.38) and (4.40), we complete the proof, since the left-hand side of (4.37) is independent ofπ‘˜.

We end our theoretical analysis for the adaptive approximation of the control problem with an additionalremark.Remark 4.9. If the function u𝑑 has the regularity: u𝑑 ∈ H1(Ξ©), then using Theorem 4.1, the uniform conver-gence (4.4) and the Poincare inequality, we can argue in a manner similar to Theorem 6.2 of [61] to concludethe convergence of the error estimators πœ‚π‘˜. However, for a general 𝐿2-data u𝑑, the optimal control u* canonly be guaranteed to have a 𝐿2-regularity, and the data oscillation oscπ‘˜

(u𝑑) may not converge to zero in the

adaptive process.

Page 26: Bowen Li and Jun Zou

2038 B. LI AND J. ZOU

5. Numerical experiments

In this section, we provide a detailed documentation of the numerical results to illustrate the performanceof our adaptive algorithm for the optimal control problem. All the examples presented below are implementedbased on the MATLAB package iFEM [18] using MATLAB 2017a on a personal laptop with 8.00 GB RAM anddual-core 2.4 GHz CPU. When solving the discrete optimization problems (2.12) and (2.13), we use the projectedgradient algorithm in which the HX-preconditioner [32] is also adopted for solving the H(curl)-elliptic problem.More precisely, we start with an initial guess u(0)

β„Ž ∈ Uπ‘Žπ‘‘β„Ž , and in 𝑛th step (𝑛 β‰₯ 1) we solve two H(curl)-elliptic

problems to obtain the state y(𝑛)

β„Ž and the adjoint state p(𝑛)

β„Ž with the help of HX-preconditioner. Then the controlvariable uβ„Ž can be explicitly updated as follows:

u(𝑛)β„Ž = max

0,u

(π‘›βˆ’1)β„Ž + 𝑠

(βˆ’Pβ„Žp

(𝑛)β„Ž βˆ’ 𝛼

(u

(π‘›βˆ’1)β„Ž βˆ’ u𝑑

β„Ž

)), (5.1)

where the parameter 𝑠 can be computed explicitly by solving a one dimensional quadratic optimization problem.Compared to the algorithm in [35] where the edge element was used to approximate the control and an additionalleast squares problems with inequality constraints need to be solved to realize the box constraint (1.2), whichis very expensive and time-consuming, our algorithm uses only about half the number of DoFs for computingthe control variable but gives equally accurate numerical resolution, and can be trivially implemented.

In the following, we carry out some numerical experiments for two benchmark problems. For both examples,we shall first compute the numerical solutions and errors on the uniform meshes with five refinement iterationsfor the purpose of comparison, and then conduct the new adaptive algorithm, which is terminated when thecorresponding DoFs are almost the same as that of the finest uniform mesh. The first example is modifiedfrom [22, 67], where there is a corner singularity in the solution. We consider an optimal control problem onthe L-shape domain: Ξ© = [βˆ’1, 1]3βˆ–[0, 1]Γ— [0, 1]Γ— [βˆ’1, 1] with both the electric permittivity 𝜎 and the magneticpermeability πœ‡ taken to be 1. We set the target applied current density (control) u𝑑 and the target magnetic field(state) y𝑑 to zero, and set the parameter 𝛼 to 0.1. We choose the inhomogeneous Dirichlet boundary conditionand the source term f such that the exact solution are given as follows:

u* = p* = 0 and y* = grad(

π‘Ÿ23 sin

(23πœƒ

))in cylindrical coordinates.

We set πœƒ = 0.5 in the marking strategy (3.4) and remark that there is no need to consider the data oscillationosc𝑇

(u𝑑)

in this example because of the vanishing desired control u𝑑.In our simulations, the computation starts on a very coarse mesh with 480 DoFs. The adaptively refined

meshes in the 10th, 14th and 18th iterations are presented in Figure 1, in which we clearly observe that themeshes are locally refined around the 𝑧-axis where the singularity of y* occurs. Note the non-smooth functiony* /∈ H1(Ξ©) and that the lowest-order edge elements of first family are used to discretize the state variable.We typically cannot expect the optimal convergence order 𝑂(β„Ž) on a uniformly refined mesh according to thestandard interpolation result (4.30). However, this theoretical order of convergence can be recovered by theadaptive mesh refinement in the sense that

β€–y*π‘˜ βˆ’ y*β€–curl,Ξ© . (#DoFs)βˆ’1/3,

which has been confirmed by the convergence history plotted in Figure 2 (left) on a double logarithmic scale. Weshould also observe that the errors of the control β€–u*βˆ’u*π‘˜β€–0,Ξ© and the adjoint state β€–p*βˆ’p*π‘˜β€–curl,Ξ© reduce witha slope βˆ’2/3, which is twice the one of the error associated with the state y. The difference in the convergencerates may be explained by the fact that u* and p* are zero functions and hence very smooth while the optimalstate y* is non-smooth and has a strong singularity near the 𝑧-axis. Moreover, Figure 2 (right) shows thereduction of the total error (almost dominated by the error β€–y* βˆ’ y*π‘˜β€–curl,Ξ©):

β€–y* βˆ’ y*π‘˜β€–curl,Ξ© + β€–p* βˆ’ p*π‘˜β€–curl,Ξ© + β€–u* βˆ’ u*π‘˜β€–0,Ξ©,

Page 27: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2039

Figure 1. Evolution of the adaptive mesh for the first example in steps π‘˜ = 10, 14 and 18(final mesh).

Figure 2. Convergence histories of the control, state and adjoint state (left), and the totalerrors for the uniform mesh refinement and the adaptive mesh refinement, as well as the errorestimator πœ‚β„Ž (right) for the first example.

and the convergence behavior of the a posteriori error estimator πœ‚β„Ž = πœ‚β„Ž, which confirms the efficiency of πœ‚β„Ž

and the superiority of the adaptive mesh refinement over the uniform mesh refinement: the total error on theadaptively refined mesh reduces with an order βˆ’1/3, double the one (βˆ’0.15) on the uniformly refined mesh.This is also confirmed by the computing times and computational costs: for achieving the error over the meshgenerated by the 5th uniform refinement, the adaptive algorithm takes only about 80 s and 105 DoFs, whereasthe uniform one takes about 580 s and 1 410 240 DoFs.

The second example is chosen to be similar to the one in [35], where there are non-smooth source terms andlarge jumps of physical coefficients across the interface between two different media. To be precise, we consideran optimal control problem on Ξ© = [βˆ’1, 1]3 with a high-contrast inclusion: Ω𝑐 := x ∈ R3; π‘₯2 +𝑦2 +𝑧2 < 0.62,

Page 28: Bowen Li and Jun Zou

2040 B. LI AND J. ZOU

Figure 3. Evolution of the adaptively refined mesh (2D slice) for the second example in stepsπ‘˜ = 8, 12, 16 and 19 (final mesh).

on which the coefficients πœ‡ and 𝜎 are given by

𝜎 =

10 in Ω𝑐,

1 in Ξ©βˆ–Ξ©π‘,πœ‡βˆ’1 =

0.1 in Ω𝑐,

1 in Ξ©βˆ–Ξ©π‘.

We set the target state y𝑑 = 0 and the target control u𝑑 = 10(πœ’Ξ©π‘ , 0, 0). Here πœ’Ξ©π‘ is the characteristic functionof Ω𝑐. We define a scalar function:

πœ‘(x) =1

2πœ‹sin(2πœ‹π‘₯) sin(2πœ‹π‘¦) sin(2πœ‹π‘§),

and then introduce the non-smooth source term f:

f(x) = πœŽβˆ‡πœ‘(x)βˆ’ 10(πœ’Ξ©π‘, 0, 0).

We readily see that the unique solution to the optimality system (2.7a)–(2.7c) is given by

y* = βˆ‡πœ‘, p* = 0, and u* = 10(πœ’Ξ©π‘, 0, 0).

Other parameters are chosen to be same as the first example.We still set πœƒ = 0.5 in the marking strategy (3.4) and start our computation on a very coarse initial mesh

with 604 DoFs. We show the evolution of adaptive meshes on the (𝑦, 𝑧)-cross section at π‘₯ = 0 in Figure 3,from which we immediately see that the meshes are strongly concentrated in the high-contrast inclusion Ω𝑐 andclearly capture the shape of the interface. We can also observe, from Figure 4 (left), an optimal convergenceorder βˆ’1/3 for the control and state variable and a little bit faster decrease of the error for the adjoint stateβ€–p*βˆ’p*π‘˜β€–curl,Ξ©, which is possibly because p* is quite smooth (a zero function), compared to u* and y*. Figure 4(right) again verifies the effectiveness of the error estimator πœ‚π‘˜ and the gain of computational efficiency from theadaptive mesh refinement: the total error on the adaptively refined mesh reduces with an order βˆ’1/3, whereasthe error on the uniform one reduces only with an order βˆ’0.2.

6. Concluding remarks

In this work, we have studied an electromagnetic optimal control problem and used the lowest-order edgeelements to approximate the state and adjoint state, while used the piecewise constant functions to approxi-mate the control. We have designed an adaptive finite element method with an error indicator involving boththe residual-type error estimator and the lower-order data oscillation. We have established the reliability andefficiency of the a posteriori error estimator and the strong convergence of the adaptive finite element solutions

Page 29: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2041

Figure 4. Convergence histories of the control, state and adjoint state (left), and the totalerrors for the uniform mesh refinement and the adaptive mesh refinement, as well as the errorestimator πœ‚β„Ž (right) for the second example.

for both the state and control variables. With a very minor modification, our analysis and arguments can bedirectly applied to the more realistic case [61, 63] where the control u is added on a subdomain Ω𝑐 of Ξ© andthe case [27] where the control satisfies a bilateral constraint. It is worth pointing out that our arguments canalso be modified to cope with the inhomogeneous Dirichlet boundary condition. To be exact, suppose that theboundary condition is given by 𝛾𝑑y = 𝛾𝑑g with g ∈ H(curl, Ξ©) being a known function satisfying the givenDirichlet trace data [14]. Then we can check that the optimality systems (2.7a)–(2.7c) and (2.14a)–(2.14c) stillhold, except that (2.7a) and (2.14a) are solved on the affine spaces g + V and Ξ β„Žg + Vβ„Ž, respectively. Hence,up to a possible data oscillation: β€–gβˆ’ Ξ β„Žgβ€–curl,Ξ©, our a posteriori error estimates and the convergence analysiscan be readily applied.

As mentioned in the Introduction, the model of our interest can be connected with the discretization of thecontrol problem of time-dependent eddy current equations [41,46,47], where the implicit time-stepping schemeis adopted for the sake of stability [10,31]. In this case, the coefficient 𝜎 will be scaled by the current time-stepsize. Therefore, it is important to design an error estimator which is robust with respect to the scaling of thecoefficients, namely, the generic constants involved in the a posteriori error analysis should be independentof the scaling factors of the coefficients. For this, we may measure the errors of the states (y and p) by theenergy norm on H0(curl, Ξ©): β€–Β·β€–2𝐡 = β€–

βˆšπœ‡βˆ’1curlΒ·β€–20,Ξ© + β€–

βˆšπœŽΒ·β€–20,Ξ©, and the error estimators may also need to

be scaled correspondingly. If we assume that πœ‡ and 𝜎 are element-wise constant with respect to the initialmesh T0 [10], or that there exists a scaled norm β€– Β· β€–2V := πœ‡βˆ’1

* β€–curl Β· β€–20,Ξ© + 𝜎*β€– Β· β€–20,Ξ© with πœ‡* and 𝜎* beingpositive constants, such that 𝐡(Β·, Β·) is continuous and inf-sup stable with respect to β€– Β· β€–V with the involvedgeneric constants independent of πœ‡ and 𝜎 [53], one can naturally follow the a posteriori error analysis providedin this work to obtain a robust error analysis by using the energy norm and the scaled error estimators (forinstance, if the norm β€–Β·β€–V defined above exists, by adding a scaling factor

βˆšπœ‡*, we can modify πœ‚(1)

𝑦,𝑇 and πœ‚(1)

𝑦,𝐹

as follows: πœ‚(1)

𝑦,𝑇 := β„Žπ‘‡βˆš

πœ‡*β€–f + u*β„Ž βˆ’ curlπœ‡βˆ’1curly*β„Ž βˆ’ 𝜎y*β„Žβ€–0,𝑇 and πœ‚(1)

𝑦,𝐹 :=√

β„ŽπΉ πœ‡*β€–[𝛾𝑑(πœ‡βˆ’1curly*β„Ž)]𝐹 β€–0,𝐹 ).We refer the readers to [17, 48] for related discussions. However, the detailed and rigorous treatments for thegeneral coefficients and the time-dependent model are not trivial tasks and need further investigations. Wefinally remark that it is also of great interest to design the adaptive algorithm for more complicated models,such as the nonlinear electromagnetic control problem [64] and the Maxwell variational inequalities [65,66].

Page 30: Bowen Li and Jun Zou

2042 B. LI AND J. ZOU

Acknowledgements. The work of Jun Zou is substantially supported by Hong Kong Research Grants Council (projects14306718 and 14306719). The authors would like to thank the anonymous referees for their very careful reading of themanuscript and their numerous constructive comments and suggestions, which have helped us improve the presentationand results of this work essentially.

References

[1] R.A. Adams and J.J.F. Fournier, Sobolev Spaces. Academic Press 140 (2003).

[2] M. Ainsworth and J.T. Oden, A Posteriori Error Estimation in Finite Element Analysis. John Wiley & Sons 37 (2011).

[3] G.S. Alberti and Y. Capdeboscq, Elliptic regularity theory applied to time harmonic anisotropic Maxwell’s equations with lessthan Lipschitz complex coefficients. SIAM J. Math. Anal. 46 (2014) 998–1016.

[4] G.S. Alberti and Y. Capdeboscq, Lectures on Elliptic Methods for Hybrid Inverse Problems. Societe Mathematique de FranceParis 25 (2018).

[5] A. Allendes, E. Otarola, R. Rankin and A.J. Salgado, Adaptive finite element methods for an optimal control problem involvingDirac measures. Numer. Math. 137 (2017) 159–197.

[6] C. Amrouche, C. Bernardi, M. Dauge and V. Girault, Vector potentials in three-dimensional non-smooth domains. Math.Methods Appl. Sci. 21 (1998) 823–864.

[7] I. Babuska and W.C. Rheinboldt, Error estimates for adaptive finite element computations. SIAM J. Numer. Anal. 15 (1978)736–754.

[8] I. Babuska and M. Suri, The 𝑝 and β„Žβ€“π‘ versions of the finite element method, basic principles and properties. SIAM Rev. 36(1994) 578–632.

[9] I. Babuska and M. Vogelius, Feedback and adaptive finite element solution of one-dimensional boundary value problems.Numer. Math. 44 (1984) 75–102.

[10] R. Beck, R. Hiptmair, R.H.W. Hoppe and B. Wohlmuth, Residual based a posteriori error estimators for eddy current com-putation. ESAIM: M2AN 34 (2000) 159–182.

[11] R. Becker and S. Mao, Quasi-optimality of an adaptive finite element method for an optimal control problem. Comput. MethodsAppl. Math. Comput. Methods Appl. Math. 11 (2011) 107–128.

[12] V. Bommer and I. Yousept, Optimal control of the full time-dependent maxwell equations. ESAIM: M2AN 50 (2016) 237–261.

[13] A. Bossavit, Computational Electromagnetism: Variational Formulations, Complementarity, Edge Elements. Academic Press(1998).

[14] S. Brenner and R. Scott, The Mathematical Theory of Finite Element Methods. Springer Science & Business Media 15 (2007).

[15] H. Brezis, Functional Analysis, Sobolev Spaces and Partial Differential Equations. Springer Science & Business Media (2010).

[16] A. Bunger, V. Simoncini and M. Stoll, A low-rank matrix equation method for solving PDE-constrained optimization problems.Preprint arXiv:2005.14499 (2020).

[17] J.M. Cascon, C. Kreuzer, R.H. Nochetto and K.G. Siebert, Quasi-optimal convergence rate for an adaptive finite elementmethod. SIAM J. Numer. Anal. 46 (2008) 2524–2550.

[18] L. Chen, 𝑖FEM: An Innovative Finite Element Methods Package in MATLAB. University of Maryland (2008).

[19] M. Costabel and M. Dauge, Singularities of electromagnetic fields in polyhedral domains. Arch. Ration. Mech. Anal. 151(2000) 221–276.

[20] M. Costabel, M. Dauge and S. Nicaise, Singularities of Maxwell interface problems. ESAIM: M2AN 33 (1999) 627–649.

[21] W. Dorfler, A convergent adaptive algorithm for Poisson’s equation. SIAM J. Numer. Anal. 33 (1996) 1106–1124.

[22] H. Duan, F. Qiu, R.C. Tan and W. Zheng, An adaptive FEM for a Maxwell interface problem. J. Sci. Comput. 67 (2016)669–704.

[23] K. Eriksson and C. Johnson, Adaptive finite element methods for parabolic problems I: a linear model problem. SIAM J.Numer. Anal. 28 (1991) 43–77.

[24] L.C. Evans, Partial differential equations. In: Vol. 19 of Graduate Studies in Mathematics. American Mathematical Society,Providence, RI (1998).

[25] Z. Fang, J. Li and X. Wang, Optimal control for electromagnetic cloaking metamaterial parameters design. Comput. Math.App. 79 (2020) 1165–1176.

[26] A. Gaevskaya, Y. Iliash, M. Kieweg and R.H.W. Hoppe, Convergence analysis of an adaptive finite element method fordistributed control problems with control constraints. In: Control of Coupled Partial Differential Equations. Springer (2007)47–68.

[27] W. Gong and N. Yan, Adaptive finite element method for elliptic optimal control problems: convergence and optimality. Numer.Math. 135 (2017) 1121–1170.

[28] W. Gong, H. Liu and N. Yan, Adaptive finite element method for parabolic equations with Dirac measure. Comput. MethodsAppl. Mech. Eng. 328 (2018) 217–241.

[29] M. Hintermuller, R.H.W. Hoppe, Y. Iliash and M. Kieweg, An a posteriori error analysis of adaptive finite element methods fordistributed elliptic control problems with control constraints. ESAIM: Control Optim. Calculus Variations 14 (2008) 540–560.

[30] M. Hinze, A variational discretization concept in control constrained optimization: the linear-quadratic case. Comput. Optim.App. 30 (2005) 45–61.

[31] R. Hiptmair, Multigrid method for Maxwell’s equations. SIAM J. Numer. Anal. 36 (1998) 204–225.

Page 31: Bowen Li and Jun Zou

AN ADAPTIVE EDGE ELEMENT METHOD AND ITS CONVERGENCE 2043

[32] R. Hiptmair and J. Xu, Nodal auxiliary space preconditioning in H(curl) and H(div) spaces. SIAM J. Numer. Anal. 45 (2007)2483–2509.

[33] J.B. Hiriart-Urruty and C. Lemarechal, Convex Analysis and Minimization Algorithms I: Fundamentals. Springer Science &Business Media 305 (2013).

[34] R.H.W. Hoppe and J. Schoberl, Convergence of adaptive edge element methods for the 3D eddy currents equations. J. Comput.Math. 27 (2009) 657–676.

[35] R.H.W. Hoppe and I. Yousept, Adaptive edge element approximation of H(curl)-elliptic optimal control problems with controlconstraints. BIT Numer. Math. 55 (2015) 255–277.

[36] F. Kikuchi, On a discrete compactness property for the Nedelec finite elements. J. Faculty Sci. Univ. Tokyo. Sect. 1 A Math.36 (1989) 479–490.

[37] D. Kinderlehrer and G. Stampacchia, An Introduction to Variational Inequalities and their Applications. Siam 31 (1980).

[38] K. Kohls, A. Rosch and K.G. Siebert, A posteriori error analysis of optimal control problems with control constraints. SIAMJ. Control Optim. 52 (2014) 1832–1861.

[39] K. Kohls, K.G. Siebert and A. Rosch, Convergence of adaptive finite elements for optimal control problems with controlconstraints. In: Trends in PDE Constrained Optimization. Springer (2014) 403–419.

[40] K. Kohls, C. Kreuzer, A. Rosch and K.G. Siebert, Convergence of adaptive finite element methods for optimal control problemswith control constraints. North-West. Eur. J. Math 4 (2018) 157–184.

[41] M. Kolmbauer and U. Langer, A robust preconditioned minres solver for distributed time-periodic eddy current optimal controlproblems. SIAM J. Sci. Comput. 34 (2012) B785–B809.

[42] H. Leng and Y. Chen, Convergence and quasi-optimality of an adaptive finite element method for optimal control problemson 𝐿2 errors. J. Sci. Comput. 73 (2017) 438–458.

[43] H. Leng and Y. Chen, Convergence and quasi-optimality of an adaptive finite element method for optimal control problemswith integral control constraint. Adv. Comput. Math. 44 (2018) 367–394.

[44] P. Monk, Finite Element Methods for Maxwell’s Equations. Oxford University Press (2003).

[45] P. Morin, K.G. Siebert and A. Veeser, A basic convergence result for conforming adaptive finite elements. Math. ModelsMethods Appl. Sci. 18 (2008) 707–737.

[46] S. Nicaise, S. Stingelin and F. Troltzsch, On two optimal control problems for magnetic fields. Comput. Methods Appl. Math.14 (2014) 555–573.

[47] S. Nicaise, S. Stingelin and F. Troltzsch, Optimal control of magnetic fields in flow measurement. Discrete Continuous Dyn.Syst.-S 8 (2015) 579.

[48] R.H. Nochetto, A. Siebert and K.G. Veeser, Theory of adaptive finite element methods: an introduction. In: Multiscale,Nonlinear and Adaptive Approximation. Springer (2009) 409–542.

[49] D. Pauly and I. Yousept, A posteriori error analysis for the optimal control of magneto-static fields. ESAIM: M2AN 51 (2017)2159–2191.

[50] J.W. Pearson and J. Gondzio, Fast interior point solution of quadratic programming problems arising from PDE-constrainedoptimization. Numer. Math. 137 (2017) 959–999.

[51] J.W. Pearson and A.J. Wathen, A new approximation of the Schur complement in preconditioners for PDE-constrainedoptimization. Numer. Linear Algebra App. 19 (2012) 816–829.

[52] J.W. Pearson, M. Stoll and A.J. Wathen, Regularization-robust preconditioners for time-dependent PDE-constrained opti-mization problems. SIAM J. Matrix Anal. App. 33 (2012) 1126–1152.

[53] J. Schoberl, A posteriori error estimates for Maxwell equations. Math. Comput. 77 (2008) 633–649.

[54] C. Schwab, 𝑝- and β„Žπ‘-Finite Element Methods: Theory and Applications to Solid and Fluid Mechanics. Oxford UniversityPress, Oxford (1998).

[55] M. Stoll and T. Breiten, A low-rank in time approach to PDE-constrained optimization. SIAM J. Sci. Comput. 37 (2015)B1–B29.

[56] M. Suri, The 𝑝 and β„Žπ‘ finite element method for problems on thin domains. J. Comput. Appl. Math. 128 (2001) 235–260.

[57] B. Szabo and I. Babuska. Finite Element Analysis. John Wiley & Sons (1991).

[58] F. Troltzsch, Optimal Control of Partial Differential Equations: Theory, Methods, and Applications. American MathematicalSociety 112 (2010).

[59] R. Verfurth, A posteriori error estimation and adaptive mesh-refinement techniques. J. Comput. Appl. Math. 50 (1994) 67–83.

[60] C. Weber and P. Werner, Regularity theorems for Maxwell’s equations. Math. Methods Appl. Sci. 3 (1981) 523–536.

[61] Y. Xu and J. Zou, A convergent adaptive edge element method for an optimal control problem in magnetostatics. ESAIM:M2AN 51 (2017) 615–640.

[62] H.M. Yin, Regularity of weak solution to Maxwell’s equations and applications to microwave heating. J. Differ. Equ. 200(2004) 137–161.

[63] I. Yousept, Optimal control of quasilinear H(curl)-elliptic partial differential equations in magnetostatic field problems. SIAMJ. Control Optim. 51 (2013) 3624–3651.

[64] I. Yousept, Optimal control of non-smooth hyperbolic evolution Maxwell equations in type-II superconductivity. SIAM J.Control Optim. 55 (2017) 2305–2332.

[65] I. Yousept, Hyperbolic Maxwell variational inequalities of the second kind. ESAIM: Control Optim. Calculus Variations 26(2020) 34.

Page 32: Bowen Li and Jun Zou

2044 B. LI AND J. ZOU

[66] I. Yousept, Well-posedness theory for electromagnetic obstacle problems. J. Differ. Equ. 269 (2020) 8855–8881.

[67] L. Zhong, S. Shu, L. Chen and J. Xu, Convergence of adaptive edge finite element methods for H(curl)-elliptic problems.Numer. Linear Algebra App. 17 (2010) 415–432.

[68] L. Zhong, L. Chen, S. Shu, G. Wittum and J. Xu, Convergence and optimality of adaptive edge finite element methods fortime-harmonic Maxwell equations. Math. Comput. 81 (2012) 623–642.

This journal is currently published in open access under a Subscribe-to-Open model (S2O). S2O is a transformativemodel that aims to move subscription journals to open access. Open access is the free, immediate, online availability ofresearch articles combined with the rights to use these articles fully in the digital environment. We are thankful to oursubscribers and sponsors for making it possible to publish this journal in open access, free of charge for authors.

Please help to maintain this journal in open access!

Check that your library subscribes to the journal, or make a personal donation to the S2O programme, by [email protected]

More information, including a list of sponsors and a financial transparency report, available at: https://www.edpsciences.org/en/maths-s2o-programme