Final Value Problems for Parabolic Differential

Full text

Turn on search term navigation

1. Introduction

In this article, we establish well-posedness of final value problems for a large class of parabolic differential equations. Seemingly, this clarifies a longstanding gap in the comprehension of such problems.

Taking the heat equation as a first example, we address the problem of characterising the functions u(t,x) that, in a ^C∞ -smooth bounded open set Ω⊂^Rn with boundary ∂Ω , fulfil the equations, where Δ=_{∂_x12}+⋯+_{∂_xn2}denotes the Laplacian,

_∂tu(t,x)−Δu(t,x)=f(t,x)fort∈]0,T[,x∈Ω,u(t,x)=g(t,x)fort∈]0,T[,x∈∂Ω,u(T,x)=_uT(x)forx∈Ω.

Motivation for doing so could be given by imagining a nuclear power plant, which is hit by a power failure at time t=0 . Once power is regained at time t=T , and a measurement of the reactor temperature _uT(x) is obtained, it is of course desirable to calculate backwards in time to provide an answer to the question: were temperatures u(t,x) around some earlier time _t0<Thigh enough to cause a meltdown of the fuel rods ?

We provide here a theoretical analysis of such problems and prove that they are well-posed, that is, they have existence, uniqueness and stability of solutions u∈X for given data (f,g,_uT)∈Y , in certain normed spaces X, Y to be specified below. The results were announced without proofs in the short note [1].

Although well-posedness is of decisive importance for the interpretation and accuracy of numerical schemes, which one would use in practice, such a theory has seemingly not been worked out before. Explained roughly, our method is to provide a useful structure on the reachable set for a general class of parabolic differential equations.

1.1. Background

Let us first describe the case f=0 , g=0 . Then the mere heat equation (_∂t−Δ)u=0 is clearly solved for all t∈R by the function u(t,x)=^e(T−t)λv(x) , if v(x) is an eigenfunction of the Dirichlet realization −_ΔD of the Laplace operator with eigenvalue λ.

In view of this, the homogeneous final value problem (1) would obviously have the above u as a basic solution if, coincidentally, the final data _uT(x) were given as the eigenfunction v(x) . The theory below includes the set B of such basic solutions u together with its linear hull E=spanB and a certain completion E¯.

It is easy to describe E in terms of the eigenvalues 0<_λ1≤_λ2≤… and the associated _L2(Ω) -orthonormal basis _e1,_e2,… of eigenfunctions of −_ΔD : corresponding to final data _uT in span(_ej) , which are the _uT having finite expansions _uT(x)=_∑j(_uT|_ej)_ej(x) in _L2(Ω) , the space E consists of solutions u(t,x)being finite sums

u(t,x)=_∑j^e(T−t)_λj(_uT|_ej)_ej(x).

Moreover, at time t=0 there is, because of the finiteness, a vector u(0,x) in _L2(Ω)that trivially fulfills

^{∥u(0,·)∥2}=_∑j^e2T_λj ^|(_uT|_ej)|2<∞.

However, when summation is extended to all j∈N , condition (3) becomes very strong, as it is only satisfied for special _uT : Weyl’s law for the counting function, cf. ([2], Chapter 6.4), entails the well-known _λj=O(^j2n) , so a single term in (3) yields |(_uT|_ej)|≤cexp(−T^j2n) ; i.e., the _L2 -coordinates of such _uT decay rapidly for j→∞.

Condition (3) has been known at least since the 1950s; the work of John [3] and Miranker [4] are the earliest we know. While many authors have avoided an analysis of it, Payne found it scientifically intolerable because _uT is likely to be imprecisely measured; cf. his treatise [5] on the variety of methods applied to (1) until the mid 1970s.

More recently, e.g., Isakov [6] emphasized the classical observation, found already in [4], that (2) implies a phenomenon of instability. Indeed, the sequence of final data _uT,k=_ek has constant length 1, yet via (2) it gives the initial states _uk(0,x)=^eT_λk _ek(x) having _L2 -norms ^eT_λk , which clearly blow up rapidly for k→∞.

This _L2 -instability cannot be explained away, of course, but it does not rule out that (1) is well-posed. It rather indicates that the _L2 -norm is an insensitive choice for (1).

In fact, here there is an analogy with the classical stationary Dirichlet problem

−Δu=finΩ,u=gon∂Ω.

This is unsolvable for u∈^C2(Ω¯) given certain f∈^C0(Ω¯) , g∈^C0(∂Ω) : Günther proved prior to 1934, cf. ([7], p. 85), that when f(x)=χ(x)(3_x32|x^|−2−1)/log|x| for some radial cut-off function χ∈_C0∞(Ω) equal to 1 around the origin, Ω being the unit ball of ^R3 , then f∈^C0(Ω¯) while the convolution w=14π|x|∗f is in ^C1(Ω¯) but not in ^C2 at x=0 ; so w∈^C1(Ω¯)\^C2(Ω¯) . Yet w is the unique ^C1(Ω¯) -solution of (4) in the distribution space ^D′(Ω) in the case g is given as g=_w|∂Ω . Thus the ^Ck -scales constitute an insensitive choice for (4). Nonetheless, replacing ^C2(Ω¯) by its completion ^H1(Ω) in the Sobolev norm (_∑|α|≤1 _∫Ω|^Dαu^{^|2dx)1/2} , it is classical that (4) is well-posed with u in ^H1(Ω).

To obtain similarly well-adapted spaces for (1) with f=0 , g=0 , one could base the analysis on (3). Indeed, along with the above space E of basic solutions, a norm |||_uT||| on the space of final data _uT∈span(_ej) can be defined by (3), leading to the norm |||_uT|||=(_∑j=1∞ ^e2T_λj|(_uT|_ej)^{^|2)1/2} on the _uT that correspond to solutions u in the completion E¯ . This would give well-posedness of (1) with u∈E¯; cf. Remark 16.

But the present paper goes much beyond this. For one thing, we have freed the discussion from −_ΔDand its specific eigenvalue distribution by using sesqui-linear forms, cf. Lax–Milgram’s lemma, which allowed us to extend the proofs to a general class of elliptic operators A.

Secondly we analyse the fully inhomogeneous problem (1) for general f, g in Section 5. In this situation well-posedness is not just a matter of choosing the norm on the data (f,g,_uT) suitably (as one might think from the above |||_uT||| ). In fact, prior to this choice, one has to restrict the (f,g,_uT)to a subspace characterised by certain compatibility conditions. While such conditions are well known in the theory of parabolic boundary problems, they are shown here to have a new and special form for final value problems.

Indeed, the compatibility conditions stem from the unbounded operator _uT↦u(0), which maps the final data to the corresponding initial state in the presence of the source term f. The fact that this operator is well defined, and that its domain endowed with the graph norm yields the data space, is the leitmotif of this article.

1.2. The Abstract Final Value Problem

Let us outline our analysis given for a Lax–Milgram operator A defined in H from a V-elliptic sesquilinear form a(·,·) in a Gelfand triple, i.e., in a set-up of three Hilbert spaces V↪H↪^V∗ having norms denoted ∥·∥ , |·| and _∥·∥∗, and where V is the form domain of a.

In this framework, we consider the following general final value problem: given data

f∈_L2(0,T;^V∗),_uT∈H,

determine the V-valued vector distributions u(t) on ]0,T[ , that is the u∈^D′(0,T;V), fulfilling

_∂tu+Au=fin^D′(0,T;^V∗),u(T)=_uTinH.

Classically a wealth of parabolic Cauchy problems with homogeneous boundary conditions have been efficiently treated with the triples (H,V,a) and the ^D′(0,T;^V∗) set-up in (6). For this the reader may consult the work of Lions and Magenes [8], Tanabe [9], Temam [10], Amann [11]. Also recently, e.g., Almog, Grebenkov, Helffer, Henry studied variants of the complex Airy operator via such triples [12,13,14], and our results should at least extend to final value problems for those of their realisations that have non-empty spectrum.

To compare (6) with the analogous Cauchy problem, we recall that whenever ^u′+Au=f is solved under the initial condition u(0)=_u0∈H , for some f∈_L2(0,T;^V∗), there is a unique solution u in the Banach space

X=_L2(0,T;V)⋂C([0,T];H)⋂^H1(0,T;^V∗),_∥u∥X=^{_∫0T ^∥u(t)∥2dt+sup0≤t≤T^|u(t)|2+_∫0T _{(∥u(t)∥∗2}+∥^u′(t)_∥∗2)dt1/2}.

For (6) it would thus be natural to envisage solutions u in the same space X. This turns out to be true, but only under substantial further conditions on the data (f,_uT).

To formulate these, we exploit that −A generates an analytic semigroup ^e−tA in B(H) . This is crucial for the entire article, because analytic semigroups always are invertible in the class of closed operators, as we show in Proposition 1. We denote its inverse by ^etA , consistent with the case −Agenerates a group,

^{(^e−tA)−1}=^etA.

Its domain is the Hilbert space D(^etA)=R(^e−tA) that is normed by ∥u∥=^(|u|2+|^etAu^{^|2)1/2} . In Proposition 10 we show that a non-empty spectrum, σ(A)≠∅, yields strict inclusions

D(^{e^t′A})⊊D(^etA)⊊Hfor0<t<^t′.

For t=T these domains play a crucial role in the well-posedness result below, cf. (11), where also the full yield _yfof the source term f on the system appears, namely

_yf=_∫0T ^e−(T−s)Af(s)ds.

The map f↦_yf takes values in H, and it is a continuous surjection _yf:_L2(0,T;^V∗)→H.

Theorem 1.

For the final value problem (6) to have a solution u in the space X in (7), it is necessary and sufficient that the data (f,_uT) belong to the subspace Y of _L2(0,T;^V∗)⊕H defined by the condition

_uT−_∫0T ^e−(T−t)Af(t)dt∈D(^eTA).

Moreover, in X the solution u is unique and depends continuously on the data (f,_uT) in Y, that is, we have _∥u∥X≤c_{∥(f,_uT)∥Y} , when Y is given the graph norm

∥(f,_uT)_∥Y=^{|_uT ^|2+_∫0T _{∥f(t)∥∗2}dt+|^eTA_uT−_∫0T ^e−(T−t)Af(t)dt^|21/2}.

(The full statements are found in Theorems 7 and 8 below.)

Condition (11) is a fundamental novelty for the above class of final value problems, but more generally it also gives an important clarification for parabolic differential equations.

As for its nature, we note that the data (f,_uT) fulfilling (11) form a Hilbert(-able) space Y embedded into _L2(0,T;^V∗)⊕H , in view of its norm in (12).

Using the above _yf , (12) is seen to be the graph norm of (f,_uT)↦^eTA(_uT−_yf) , which in terms of Φ(f,_uT)=_uT−_yf is the unbounded operator ^eTAΦ from _L2(0,T;^V∗)⊕H to H. As (11) means that the operator ^eTAΦ must be defined at (f,_uT) , the space Y is its domain. Thus ^eTAΦ is a key ingredient in the rigorous treatment of (6).

The role of ^eTAΦ is easy to elucidate in control theoretic terms: its value ^eTAΦ(f,_uT) simply equals the particular initial state u(0) which is steered by f to the final state u(T)=_uT at time T; cf. (13) below.

Because of ^e−(T−t)A and the integral over [0,T] , (11) involves non-local operators in both space and time as an inconvenient aspect—exacerbated by use of the abstract domain D(^eTA) , which for longer lengths T of the time interval gives increasingly stricter conditions; cf. (9).

Anyhow, we propose to regard (11) as a compatibility condition on the data (f,_uT), and thus we generalise the notion of compatibility.

For comparison we recall that Grubb and Solonnikov [15] made a systematic investigation of a large class of initial-boundary problems of parabolic (pseudo-)differential equations and worked out compatibility conditions, which are necessary and sufficient for well-posedness in full scales of anisotropic _L2 -Sobolev spaces. Their conditions are explicit and local at the curved corner ∂Ω×{0} , except for half-integer values of the smoothness s that were shown to require so-called coincidence, which is expressed in integrals over the product of the two boundaries {0}×Ω and ]0,T[×∂Ω; hence it also is a non-local condition.

However, while the conditions of Grubb and Solonnikov [15] are decisive for the solution’s regularity, condition (11) is crucial for the existence question; cf. the theorem.

Previously, uniqueness was shown by Amann ([11], Section V.2.5.2) in a t-dependent set-up, but injectivity of u(0)↦u(T) was proved much earlier for problems with t-dependent sesquilinear forms by Lions and Malgrange [16].

Showalter [17] attempted to characterise the possible _uT in terms of Yosida approximations for f=0 and A having half-angle π4 . As an ingredient, invertibility of analytic semigroups was claimed in [17] for such A, but the proof was flawed as A can have semi-angle π/4 even if ^A2is not accretive; cf. our example in Remark 9.

Theorem 1 is proved largely by comparing with the corresponding problem ^u′+Au=f , u(0)=_u0 . It is well known in functional analysis, cf. (7), that this is well-posed for f∈_L2(0,T;^V∗) , _u0∈H , with solutions u∈X . However, as shown below by adaptation of a classical argument, u is also in this set-up necessarily given by Duhamel’s principle, or the variation of constants formula, for the analytic semigroup ^e−tA in ^V∗,

u(t)=^e−tAu(0)+_∫0t ^e−(t−s)Af(s)ds.

For t=T this yields a bijective correspondence u(0)↔u(T) between the initial and terminal states (in particular backwards uniqueness of the solutions in the large class X)—but this relies crucially on the previously mentioned invertibility of ^e−tA ; cf. (8).

As a consequence of (13) one finds the necessity of (11), as the difference Φ(f,_uT)=_uT−_yf in (11) must equal the vector ^e−TAu(0) , which obviously belongs to D(^eTA).

Moreover, (13) yields that u(T)in a natural way consists of two parts, that differ radically even when A has nice properties:

First, ^e−tAu(0) solves the semi-homogeneous problem with f=0 , and for u(0)≠0 there is the precise property in non-selfadjoint dynamics that the “height” function h(t)is strictly convex,

h(t)=|^e−tAu(0)|.

This is shown in Proposition 4 when A belongs to the broad class of hyponormal operators, studied by Janas [18], or in case ^A2 is accretive; then h(t) is also strictly decreasing with ^h′(0)≤−m(A) , where m(A)is the lower bound of A.

The stiffness inherent in strict convexity is supplemented by the fact that u(T)=^e−TAu(0)is confined to a dense, but very small space, as by a well-known property of analytic semigroups,

u(T)∈_⋂n∈ND(^An).

Secondly, for _u0=0 the integral in (13) solves the initial value problem, and it has a rather different nature since its final value _yf in (10) is surjective _yf:_L2(0,T;^V∗)→H, hence can be anywhere in H, regardless of the Lax–Milgram operator A in our set-up. This we show in Proposition 6 using a kind of control-theoretic argument in case A is self-adjoint with compact inverse; and for general A by means of the Closed Range Theorem, cf. Proposition 5.

For the reachable set of the equation ^u′+Au=f , or rather the possible final data _uT , they will be a sum of an arbitrary vector _yf in H and a term ^e−TAu(0) of great stiffness (cf. (15)). Thus _uT can be prescribed in the affine space _yf+D(^eTA) . As any _yf≠0 will push the dense set D(^eTA)⊂H in some arbitrary direction, u(T) can be expected anywhere in H (unless _yf∈D(^eTA) is known a priori). Consequently neither u(T)∈D(^eTA) nor (15) can be expected to hold for _yf≠0 , not even if its norm |_yf| is much smaller than |^e−TAu(0)|.

As for final state measurements in real life applications, we would like to prevent a misunderstanding by noting that it is only under the peculiar circumstance that _yf=0 is known a priori to be an exact identity that (15) would be a valid expectation on u(T).

Indeed, even if f is so small that it is (quantitatively) insignificant for the time development of the system governed by ^u′+Au=f , so that f=0 is a valid dynamical approximation, the (qualitative) mathematical expectation that u(T) should fulfill (15) cannot be justified from such an approximation; cf. the above.

In view of this fundamental difference between the problems that are truly and merely approximately homogeneous, it seems that proper understanding of final value problems is facilitated by treating inhomogeneous problems from the very beginning.

1.3. The Inhomogeneous Heat Problem

For (1) with general data (f,g,_uT) the above is applied with A=−_ΔD, that is the Dirichlet realisation of the Laplacian. The results are analogous, but less simple to state and more demanding to obtain.

First of all, even though it is a linear problem, the compatibility condition (11) destroys the old trick of reducing to boundary data g=0 , for when w∈^H1 fulfils w=g≠0 on the curved boundary S=]0,T[×∂Ω , then w lacks the regularity needed to test (11) on the data (f˜,0,_u˜T) of the reduced problem; cf. (127) ff.

Secondly, it is, therefore, non-trivial to clarify that every g≠0 does give rise to an extra term _zg , in the sense that (11) is replaced by the compatibility condition

_uT−_yf+_zg∈D(^e−T_ΔD).

Thirdly, due to the low reqularity, it requires technical diligence to show that _zg , despite the singularity of Δ^e(T−s)_ΔD at s=T, has the structure of a single convergent improper Bochner integral, namely

_zg=_⨍0TΔ^e(T−s)_ΔD _K0g(s)ds.

The reader is referred to Section 5 for the choice of the Poisson operator _K0 and for an account of the results on the fully inhomogeneous problem in (1), especially Theorem 10 and Corollary 3, which we sum up here:

Theorem 2.

For given data f∈_L2(0,T;^H−1(Ω)) , g∈^H1/2(S) , _uT∈_L2(Ω) the final value problem (1) is solved by a function u in _X1=_L2(0,T;^H1(Ω))⋂C([0,T];_L2(Ω))⋂^H1(0,T;^H−1(Ω)) , if and only if the data in terms of (10) and (17) satisfy the compatibility condition (16). In the affirmative case, u is uniquely determined in _X1 and has the representation, with all terms in _X1 ,

u(t)=^et_ΔD ^e−T_ΔD(_uT−_yf+_zg)+_∫0t ^e(t−s)Δf(s)ds−_⨍0tΔ^e(t−s)_ΔD _K0g(s)ds,

The unique solution u in _X1 depends continuously on the data (f,g,_uT) in the Hilbert space _Y1 , when these are given the norms in (130) and (158) below, respectively.

1.4. Contents

Our presentation is aimed at describing methods and consequences in a concise way, readable for a broad audience within evolution problems. Therefore we have preferred a simple set-up, leaving many examples and extensions to future work, cf. Section 6.

Notation is given in Section 2 together with the set-up for Lax–Milgram operators and semigroup theory. Some facts on forward evolution problems are recalled in Section 3, followed by our analysis of abstract final value problems in Section 4. The heat equation and its final and boundary value problems are treated in Section 5. Section 6 concludes with remarks on the method’s applicability and notes on the literature of the problem.

2. Preliminaries

In the sequel specific constants will appear as _Cj , j∈N , whereas constants denoted by c may vary from place to place. _1Sdenotes the characteristic function of the set S.

Throughout V and H denote two separable Hilbert spaces, such that V is algebraically, topologically and densely contained in H. Then there is a similar inclusion into the anti-dual ^V∗, i.e., the space of conjugated linear functionals on V,

V⊆H≡^H∗⊆^V∗.

(V,H,^V∗) is also known as a Gelfand triple. Denoting the norms by ∥·∥ , |·| and _∥·∥∗ , respectively, there are constants such that for all v∈V,

_∥v∥∗≤_C1|v|≤_C2∥v∥.

The inner product on H is denoted by (·|·) ; and the sesquilinear scalar product on ^V∗×V by _{⟨·,·⟩^V∗,V} or ⟨·,·⟩ , it fulfils |⟨u,v⟩|≤_∥u∥∗∥v∥ . The second inclusion in (19) means that for u∈H,

⟨u,v⟩=(u|v)forallv∈V.

For a linear transformation A in H, the domain is written D(A) , while R(A) denotes its range and Z(A) its null-space. ρ(A) , σ(A) and ν(A)={(Au|u)∣u∈D(A),|u|=1} denote the resolvent set, spectrum and numerical range, while m(A)=infReν(A) is the lower bound of A. Throughout B(H)stands for the Banach space of bounded linear operators on H.

For a given Banach space B and T>0 , we denote by _L1(0,T;B) the space of equivalence classes of functions f:[0,T]→B that are strongly measurable with _∫0T∥f(t)∥dt finite. For such f the Bochner integral is denoted by _∫0Tf(t)dt , cf. [19]; it fulfils ⟨_∫0Tf(t)dt,λ⟩=_∫0T⟨f(t),λ⟩dt for every functional λ in the dual space ^B′ . Likewise _L2(0,T,B) consists of the strongly measurable f with finite norm (_∫0T ^∥f(t)∥2 ^dt)1/2.

On an open set Ω⊂^Rn , n≥1 , the space _C0∞(Ω) consists of the infinitely differentiable functions having compact support in Ω; it is given the usual LF -topology, cf. [20,21]. The dual space of continuous linear functionals ^D′(Ω) is the distribution space on Ω. We use the standard distribution theory as exposed by Grubb [20] and Hörmander [22].

More generally, the space of B-valued vector distributions is denoted by ^D′(Ω;B) ; it consists of the continuous linear maps Λ:_C0∞(Ω)→B , cf. [21], the value of which at φ∈_C0∞(Ω) is indicated by ⟨Λ,φ⟩ . If Ω is the interval ]0,T[ we also write ^D′(Ω;B)=^D′(0,T;B).

The Sobolev space ^H1(0,T;B) consists of the u∈^D′(0,T;B) for which both u, ^u′ belong to _L2(0,T;B) ; it is normed by _∫0T ^(∥u∥2+∥^u′ ^∥2 ^)dt)1/2 . More generally ^W1,1(0,T;B) is defined by replacing _L2 by _L1.

2.1. Lax–Milgram Operators

Our main tool will be the Lax–Milgram operator associated to an elliptic sesquilinear form, cf. the set-up in ([20], Section 12.4). For the reader’s sake we review this, also to establish a few additional points from the proofs in [20].

We let a(·,·) be a bounded, V-elliptic sesquilinear form on V, i.e., certain _C3,_C4>0 fulfil for all u,v∈V

|a(u,v)|≤_C3∥u∥∥v∥,Rea(v,v)≥_C4 ^∥v∥2.

Obviously, the adjoint sesquilinear form ^a∗(u,v)=a(v,u)¯ inherits these properties (with the same _C3 , _C4 ), and so does the “real part”, _aRe(u,v)=12(a(u,v)+^a∗(u,v)) . Since _aRe(u,u)≥0 , the form _aReis an inner product on V, inducing the equivalent norm

|||u|||=_aRe ^(u,u)1/2,foru∈V.

We recall that s(u,v)=_(Su|v)V gives a bijective correspondence between bounded sesquilinear forms s(·,·) on V and bounded operators S∈B(V) , which is isometric since ∥S∥ equals the operator norm of the sesquilinear form |s|=sup|s(u,v)||∥u∥=1=∥v∥ . So the given form a induces an _A0∈B(V)given by

a(u,v)=_{(_A0u|v)V}∀u,v∈V;

and the adjoint form ^a∗ similarly induces an operator _A0∗∈B(V) , which is seen at once to be the adjoint of _A0 in the sense that _{(_A0∗v|u)V}=_{(v|_A0u)V}.

The V-ellipticity in (22) shows that _A0 , _A0∗ are both injective with positive lower bounds m(_A0),m(_A0∗)≥_C4 , so _A0 , _A0∗ are in fact bijections on V (cf. ([20], Theorem 12.9)).

By Riesz’s representation theorem, there exists a bijective isometry J∈B(V,^V∗) such that for every ^v∗=Jv˜ one has ⟨Jv˜,v⟩=_(v˜|v)V for all v∈V . Therefore A:=J∘_A0 is an operator in B(V,^V∗) , for which (24) gives

⟨Au,v⟩=a(u,v),∀u,v∈V.

Similarly ^A′:=J∘_A0∗ fulfils ⟨^A′u,v⟩=_{(_A0∗u|v)V}=^a∗(u,v) for all u,v∈V.

Clearly A and ^A′ are bijections, as composites of such. Hence they give rise to a Hilbert space structure on ^V∗with the inner product

_{(_w1|_w2)^V∗}=_aRe(^A−1 _w1,^A−1 _w2),

inducing the norm |||w_|||∗=_aRe ^{(^A−1w,^A−1w)1/2}=|||^A−1w||| on ^V∗ , equivalent to _∥w∥∗.

The Lax–Milgram operator A is defined by restriction of Ato an operator in H, i.e.,

Av=Avforv∈D(A):=^A−1(H).

So D(A) consists of the u∈V for which some f∈H fulfils (f|v)=a(u,v) for all v∈V.

The reader may consult ([20], Section 12.4) for elementary proofs of the following: A is closed in H, with D(A) dense in H as well as in V; in H also ^A′ has these properties, and it equals the adjoint of A in H; i.e., ^A′ _{|^A′−1(H)}=^A∗ . As A is closed, D(A) is a Hilbert space with the graph norm _∥v∥D(A)2=^|v|2+^|Av|2 , and D(A)↪V is bounded due to (22). Geometrically, σ(A) and ν(A) are contained in the sector of z∈Cgiven by

|Imz|≤_C3 _C4−1Rez.

Actually 0∈ρ(A) since a is V-elliptic, so ^A−1∈B(H) ; moreover m(A)≥_C1 _C4/_C2>0.

Both the closed operator A in H and its extension A∈B(V,^V∗)are used throughout. (For simplicity, they were both denoted by A in the introduction, though.)

2.2. The Self-Adjoint Case

As is well known, if A is selfadjoint, i.e., ^A∗=A (or ^a∗=a ), and has compact inverse, then H has an orthonormal basis of eigenvectors of A, which can be scaled to orthonormal bases of V and ^V∗ . This is recalled, because our results can be given a more explicit form in this case, e.g., for −Δ in (1).

The properties that A is selfadjoint, closed, and densely defined with dense range in H carry over to ^A−1 (e.g., ([20], Theorem 12.7)), so when ^A−1 in addition is compact in H (e.g., if V↪H is compact), then the spectral theorem for compact selfadjoint operators states that H has an orthonormal basis (_ej) consisting of eigenvectors of ^A−1 , where the eigenvalues _μj of ^A−1by the positivity can be ordered such that

_μ1≥_μ2≥…≥_μj≥⋯>0,with_μj→0ifj→∞.

The orthonormal basis (_ej) also consists of eigenvectors of A with eigenvalues _λj=1/_μj . Hence σ(A)=_σpoint(A)={_λj∣j∈N} . Indeed, _σres(A)=∅ as ^A∗=A ; and ^A−1∈B(H) while A−νI=(^ν−1I−^A−1)νA has a bounded inverse for ν≠_λj , as ^ν−1∉σ(^A−1).

As _aRe=a here, V is now renormed by |||v^|||2=a(v,v) . However, if moreover V is considered with a(u,v) as inner product, then A:V→^V∗is the Riesz isometry; and one has

Fact 1.

For every v∈V the H-expansion v=_∑j=1∞(v|_ej)_ej converges in V. Moreover, the sequence _{(_ej/_λj)j∈N} is an orthonormal basis for V, and |||v^|||2=_∑j=1∞ _λj ^|(v|_ej)|2 .

Proof.

The _ej/_λj are orthonormal in V since a(_ej,_ek)=⟨A_ej,_ek⟩=_λj(_ej|_ek) , cf. (25). They also yield a basis for V since similarly w∈V⊖span(_ej/_λj) implies w=0 . As _λj>0, expansion of any v in V gives

v=∑j=1∞a(v,_λj−1/2 _ej)_λj−1/2 _ej=∑j=1∞a(_ej,v)¯_λj−1 _ej=∑j=1∞(v|_ej)_ej,

whence the rightmost side converges in V. This means that v=_∑j=1∞_λj(v|_ej)_ej/_λj is an orthogonal expansion in V, whence |||v^|||2has the stated expression. ☐

For ^V∗ the set-up (26), (25) here gives _{(_w1|_w2)^V∗}=a(^A−1 _w1,^A−1 _w2)=⟨_w1,^A−1 _w2⟩.

Fact 2.

For every w∈^V∗ the expansion w=_∑j=1∞⟨w,_ej⟩_ej converges in ^V∗ . Moreover, the sequence _{(_λj_ej)j∈N} is an orthonormal basis of ^V∗ and |||w_|||∗2=_∑j=1∞ _λj−1 ^{|⟨w,_ej⟩|2} .

Proof.

(_λj_ej) is orthonormal as _{(_ej|_ek)^V∗}=⟨_ej,^A−1 _ek⟩=_λk−1(_ej|_ek) ; and if w∈^V∗ for all j fulfils 0=⟨_ej,^A−1w⟩=(_ej|^A−1w) , then w=0 as ^A−1is injective. Therefore

w=∑j=1∞_{(w|_λj1/2 _ej)^V∗} _λj1/2 _ej=∑j=1∞⟨w,^A−1 _ej⟩_λj _ej=∑j=1∞⟨w,_ej⟩_ej,

so the rightmost side converges in ^V∗ , and the expression for |||w_|||∗2results. ☐

2.3. Semigroups

Assuming that the reader is familiar with the theory of semigroups ^etA , we review a few needed facts in a setting with a general complex Banach space B. The books of Pazy [23], Tanabe [9] and Yosida [19] may serve as general references.

The generator is Ax=_{limt→⁰⁺}1t(^etAx−x) , with domain D(A) consisting of the x∈B for which the limit exists. A is a densely defined, closed linear operator in B that for certain ω≥0 and M≥1 satisfies ^{∥(A−λ)−n} _∥B(B)≤M/^(λ−ω)n for λ>ω , n∈N.

The corresponding semigroup of operators is written ^etA , it belongs to B(B)with

∥^etA _∥B(B)≤M^eωtfor0≤t<∞.

Its basic properties are that ^etA ^esA=^e(s+t)A for s,t≥0 , ^e0A=I , _{limt→⁰⁺} ^etAx=x for x∈B, and the first of these gives at once the range inclusions

R(^e(s+t)A)⊂R(^etA)⊂B.

The following well-known theorem gives a criterion for A to generate an analytic semigroup that is uniformly bounded, i.e., has ω=0 . It summarises the most relevant parts of Theorems 1.7.7 and 2.5.2 in [23], and it involves sectors of the form

Σ:=λ∈||argλ|<π2+θ∪0.

Theorem 3.

If θ∈]0,π2[ and M>0 are such that the resolvent set ρ(A)⊇Σ and

^{∥(λI−A)−1} _∥B(B)≤M|λ|,forλ∈Σ,λ≠0,

then A generates an analytic semigroup ^ezA for |argz|<θ , for which ∥^ezA∥ is bounded for |argz|≤^θ′<θ , and ^etA is differentiable in B(B) for t>0 with ^{(^etA)′}=A^etA . Here

∥A^etA _∥B(B)≤ctfort>0.

Furthermore, if ^etA is analytic, ^u′=Au , u(0)=_u0 is uniquely solved by u(t)=^etA _u0 for every _u0∈B.

2.3.1. Injectivity

Often it is crucial to know whether the semigroup ^etA consists of injective operators. Injectivity is, e.g., equivalent to the geometric property that the trajectories of two solutions ^etA _v0 and ^etA _w0 of ^u′=Au have no point of confluence in B for _v0≠_w0.

However, the literature seems to have focused on examples with non-invertibility of ^etA , e.g., ([23], Example 2.2.1). However, injectivity always holds in the analytic case, as we now show:

Proposition 1.

When a semigroup ^ezA on a complex Banach space B is analytic S→B(B) in the sector S=z∈∣|argz|<θ for some θ>0 , then ^ezA is injective for all z∈S .

Proof.

Let ^e_z0A _u0=0 hold for some _u0∈B , _z0∈S . The analyticity of ^ezA in S carries over to the map f:z↦^ezA _u0 , and to _gv:z↦⟨v,f(z)⟩ for arbitrary v in the dual space ^B′ . So _gv has in a ball B(_z0,r)⊂Sthe Taylor expansion

_gv(z)=∑n=0∞1n!⟨v,^f(n)(_z0)⟩^(z−_z0)n.

By the properties of analytic semigroups (cf. ([23], Lemma 2.4.2)) and of _u0,

^f(n)(_z0)=^An ^e_z0A _u0=0foralln≥0,

so that _gv≡0 holds on B(_z0,r)and consequently on S by unique analytic extension.

Now f(_z1)≠0 would yield _gv(_z1)≠0 for a suitable v in ^B′ , hence f≡0on S and

_u0=limt→0^etA _u0=limt→0f(t)=0,

since ^etA is a strongly continuous semigroup. Altogether Z(^e_z0A)={0}is proved. ☐

Remark 1.

We have only been able to track a claim of the injectivity in Proposition 1 in case z>0 , θ≤π/4 and B is a Hilbert space; cf. Showalter’s paper [17]. However, his proof is flawed, as ^A2 is non-accretive for some A with θ≤π/4 , cf. the counter-example in Remark 9 below.

Remark 2.

Injectivity also follows directly when A is defined on a Hilbert space H having an orthonormal basis _{(_en)n∈N} such that A_ej=_λj _ej : Clearly ^etA _ej=^et_λj _ej as both sides satisfy ^x′−Ax=0 , x(0)=_ej . So if ^etAv=0 , boundedness of ^etA gives

0=^etAv=∑(v|_ej)^etA _ej=∑(v|_ej)^et_λj _ej,

so that v⊥_ej for all j, and thus v∈span^(_en)⊥=^H⊥={0} . Hence ^etA is invertible for such A .

We have chosen to use the symbol ^e−tA to denote the inverse of the analytic semigroup ^etA generated by A , consistent with the case in which ^etA does form a group in B(B), i.e.,

^e−tA:=^{(^etA)−1}forallt∈R.

This notation is convenient for our purposes (with some diligence).

For simplicity we observe the following when B=H is a Hilbert space and t>0 : clearly ^e−tA maps D(^e−tA)=R(^etA) bijectively onto H, and it is an unbounded closed operator in H. As ^{(^etA)∗}=^{et^A∗} also is analytic, so that Z(^{et^A∗})={0} by Proposition 1, we have D(^e−tA)¯=H, i.e., the domain is dense in H.

A partial group phenomenon and other algebraic properties are collected here:

Proposition 2.

The inverses ^e−tA in (41) form a semigroup of unbounded operators,

^e−tA ^e−sA=^e−(t+s)Afort,s≥0.

This extends to (s,t)∈]−∞,0]×R , but the right-hand side may be unbounded for t+s>0 .

Moreover, as unbounded operators the ^e−tA commute with ^esA∈B(H) , i.e.,

^esA ^e−tA⊂^e−tA ^esAfort,s≥0,

and there is a descending chain of domain inclusions

D(^{e−^t′A})⊂D(^e−tA)⊂Hfor0<t<^t′.

Proof.

When s,t≥0 , clearly ^e−tA ^e−sA ^e(s+t)A=_IH holds, so that ^e−(s+t)A⊂^e−tA ^e−sA ; but equality necessarily holds, as the injection ^e−tA ^e−sA cannot be a proper extension of the surjection ^e−(s+t)A . Whence (42). For t+s≥0≥s this yields ^e−tA ^e−sA=^e−(t+s)A ^esA ^e−sA=^e−(t+s)A . The case −s>t≥0is similar.

Also the commutation follows at once, for the semigroup property gives

^esA ^e−tA=^e−tA ^etA ^esA ^e−tA=^e−tA ^e(s+t)A ^e−tA=^e−tA ^esA _IR(^etA),

where the right-hand side is a restriction of ^e−tA ^esA . Finally (33) yields (44). ☐

Remark 3.

D(^e−tA ^esA)=D(^e−(t−s)A) holds in (43), because (42) extends to negative s as stated. Hence (43) is a strict inclusion if the the first one in (44) is so for all t,^t′ .

2.3.2. Some Regularity Properties

As a preparation we treat a few regularity questions for s↦^e(t−s)Af(s) , where the analytic operator function E(s)=^e(t−s)A has a singularity at s=t ; cf. (36). This will be controlled when f∈_L1(0,t;B).

That Ef=^e(t−·)Af also is in _L1(0,t;B) is undoubtedly known. So let us recall briefly how to prove it strongly measurable, i.e., to find a sequence of simple functions converging pointwise to E(s)f(s) for a.e. s∈[0,t] ; cf. [19]. Now f can be so approximated by a sequence (_fn) , and E can by its continuity [0,t[→B(B) also be approximated pointwise for s<t by _En defined on each subinterval [t(j−1)²⁻ⁿ,tj²⁻ⁿ[ , j=1,⋯,²ⁿ , as the value of E at the left end point. Then Ef=_limn _En _fn on [0,t] a.e. Therefore ^e(t−·)Af∈_L1(0,t;B) follows directly from (32),

∥^e(t−·)A _{f∥_L1(0,t;B)}≤_∫0t∥^e(t−s)A∥∥f(s)∥ds≤M^eωt _{∥f∥_L1(0,t;B)}.

Moreover, ⟨η,^e(t−·)Af⟩ is seen to be in _L1(0,t) by majorizing with ∥^e(t−s)A _f(s)∥B _{∥η∥^B∗} , for strong measurability implies weak measurability; cf. ([24], Section IV.5 appendix).

The main concern is to obtain a Leibniz rule for the derivative:

_∂s(^e(T−s)Aw(s))=(−A)^e(T−s)Aw(s)+^e(T−s)A _∂sw(s).

For w∈^C1(0,T;B) this is unproblematic for s<T : w(s+h)=w(s)+h_∂sw(s)+o(h) , where o(h)/h→0 for h→0 ; and the operator is differentiable in B(B) for s<T , cf. Theorem 3, so that ^{e(T−(s+h))A}=^e(T−s)A+h(−A)^e(T−s)A+o(h) . Hence a multiplication of the two expansions gives the right-hand side of (47) to the first order in h. The Leibniz rule is more generally valid in the vector distribution sense:

Proposition 3.

When A generates an analytic semigroup on a Banach space B and w∈^H1(0,T;B) , then the Leibniz rule (47) holds in ^D′(0,T;B) .

Proof.

It suffices to cover the case ω=0 , for the other cases then follow by applying the formula to the semigroup ^e−ωt ^etA generated by A−ωI . For w∈^H1(0,T;B) the standard convolution procedure gives a sequence (_wk) in ^C1([0,T];B)such that

_wk→win_L2(0,T;B),_wk′→^w′in_L2,loc(0,T;B).

For arbitrary ϕ∈_C0∞(]0,T[), we find using the Bochner inequality that

∥_∫0T ^e(T−s)A(w(s)−_wk(s))_ϕ(s)ds∥B≤C_{∥w(s)−_wk(s)∥_L2(0,T;B)},

with C=M(_∫suppϕ ^|ϕ(s)|2 ^ds)1/2 , where M is the constant in (32).

Hence ^e(T−s)A _wk→^e(T−s)Aw in ^D′(0,T;B) , so via the ^C1 -case above, as _∂s is continuous in ^D′, we get

_∂s(^e(T−s)Aw)=limk→∞(_∂s(^e(T−s)A _wk))=limk→∞((−A)^e(T−s)A _wk)+limk→∞(^e(T−s)A _∂s _wk)=(−A)^e(T−s)Aw+^e(T−s)A _∂sw.

Indeed, the last limits exist in ^D′(0,T;B) by the choice of _wk , for if ϵ>0is small enough,

∥_∫suppϕ ^e(T−s)A(^w′(s)−_wk′(s))_ϕ(s)ds∥B≤c_∫εT−ε _{∥^w′(s)−_wk′(s)∥B}ds,

∥_∫0T(−A)^e(T−s)A(w(s)−_wk(s))_ϕ(s)ds∥B≤C˜_{∥w−_wk∥_L2(0,T;B)}

with C˜=^{(_∫suppϕ|cϕ(s)T−s^|2ds)1/2} , using the bound on (−A)^e(T−s)Ain Theorem 3. ☐ 3. Functional Analysis of Initial Value Problems

Having set the scene by recalling elliptic Lax–Milgram operators A in Gelfand triples (V,H,^V∗) in Section 2.1, we now discuss solutions of the classical initial value problem

_∂tu+Au=fin^D′(0,T;^V∗)u(0)=_u0inH.

By definition of vector distributions, the above equation means that for every scalar test function φ∈_C0∞(]0,T[) one has ⟨u,−^φ′⟩+⟨Au,φ⟩=⟨f,φ⟩ as an identity in ^V∗.

First we recall the fundamental theorem for vector functions from ([10], Lemma III.1.1). Further below, it will be crucial for obtaining a solution formula for (53).

Lemma 1.

For a Banach space B and u,g∈_L1(a,b;B) the following are equivalent:

(i) u is a.e. equal to a primitive function of g, i.e., for some vector ξ∈B

u(t)=ξ+_∫atg(s)dsfora.e.t∈[a,b].

(ii) For each test function ϕ∈_C0∞(]a,b[) one has _∫abu(t)^ϕ′(t)dt=−_∫abg(t)ϕ(t)dt .

(iii) For each η in the dual space ^B′ , ddt⟨η,u⟩=⟨η,g⟩ holds in ^D′(a,b) .

In the affirmative case, ^u′=g as vector distributions in ^D′(a,b;B) by (ii), the right-hand side in (i) is a continuous representative of u such that ξ=u(a) and

supa≤t≤b_∥u(t)∥B≤^(b−a)−1 _{∥u∥_L1(a,b;B)}+_{∥g∥_L1(a,b;B)}.

Remark 4.

Lemma 1 is proved in [10], except for the estimate (55): the continuous function _∥u(t)∥B attains its minimum at some _t0∈[a,b] , so applying the Bochner inequality in (i) and the Mean Value Theorem,

_∥u(t)∥B≤∥u(_t0)_∥B+|_{∫_t0t}∥g(t)_∥Bdt|≤1b−a_∫ab∥u(t)_∥Bdt+_∫ab _∥g(t)∥Bdt.

This yields (55), hence the Sobolev embedding ^W1,1(a,b;B)↪C([a,b];B) . If furthermore u,g∈_L2(a,b;B) , we get the Sobolev embedding ^H1(a,b;B)↪C([a,b];B) similarly,

supa≤t≤b_∥u(t)∥B≤^{(b−a)−1/2} _{∥u∥_L2(a,b;B)}+^(b−a)1/2 _{∥g∥_L2(a,b;B)}≤c_{∥u∥^H1(a,b;B)}.

Secondly we recall the Leibniz rule ddt(f(t)|g(t))=(^f′(t)|g(t))+(f(t)|^g′(t)) valid for f,g∈^C1([0,T];H) . The well-known generalization below was proved in real vector spaces in ([10], Lemma III.1.2) for u=v . We briefly extend this to the general complex case, which we mainly use to obtain that _∂t ^|u|2=2Re⟨^u′,u⟩ , though also u≠vwill be needed.

Lemma 2.

If u,v∈_L2(0,T;V)∩^H1(0,T;^V∗) , then t↦(u(t)|v(t)) is in _L1(0,T) and

ddt(u|v)=⟨^u′,v⟩+⟨^v′,u⟩¯in^D′(0,T).

Furthermore, u and v have continuous representatives on [0,T] , i.e., u,v∈C([0,T];H) .

Proof.

Let u,v∈_L2(0,T;V) with distributional derivatives ^u′,^v′∈_L2(0,T;^V∗) . As in the proof of Proposition 3 we obtain _um∈^C∞([0,T];V) such that _um→u in _L2(0,T;V) while _um′→^u′ in _L2,loc(0,T;^V∗) . Similarily v gives rise to _vm.

By continuity of inner products, the function (u|v) is measurable on [0,T] for u,v∈_L2(0,T;V) , and _∫0T|(u|v)|dt<∞ . Sesquilinearity yields (_um|_vm)→(u|v) in _L1(0,T) for m→∞ , while both ⟨_um′,_vm⟩→⟨^u′,v⟩ and ⟨_vm′,_um⟩→⟨^v′,u⟩ hold in _L2,loc(0,T) , hence in ^D′(0,T).

As differentiation is continuous in ^D′(0,T) , one finds from the ^C1 -case and (21) that

ddt(u|v)=limmddt(_um|_vm)=limm(_um′|_vm)+limm(_vm′|_um)¯=⟨^u′,v⟩+⟨^v′,u⟩¯.

Taking v=u the function t↦^|u(t)|2 is seen to be in ^W1,1(0,T)⊂C([0,T]) , and since any u∈^H1(0,T;^V∗) is continuous in ^V∗ by Remark 4, one can also here obtain from Lemma III.1.4 in [10] that u:[0,T]→His continuous. Similarly for v. ☐

3.1. Existence and Uniqueness

In our presentation the following result is a cornerstone, relying on the full framework in Section 2.1; in particular A need not be selfadjoint:

Theorem 4.

Let V be a separable Hilbert space with V⊆H algebraically, topologically and densely, cf. (19) and (20), and let A:V→^V∗ be the bounded Lax–Milgram operator induced by a V-elliptic sesquilinear form, cf. (25). When _u0∈H and f∈_L2(0,T;^V∗) are given, then (53) has a uniquely determined solution u(t) belonging to the space

X=_L2(0,T;V)⋂C([0,T];H)⋂^H1(0,T;^V∗).

We omit a proof of this theorem, as it is a special case of a more general result of Lions and Magenes ([8], Section 3.4.4) on t-dependent forms a(t;u,v) a(t;u,v) . Clearly the conjunction of u∈_L2(0,T;V) u∈_L2(0,T;V) and ^u′∈_L2(0,T;^V∗) ^u′∈_L2(0,T;^V∗) , which appears in [8], is equivalent to the claim in (60) that u belongs to the intersection of _L2(0,T,V) _L2(0,T,V) and ^H1(0,T;^V∗) ^H1(0,T;^V∗).

Alternatively one can use Theorem III.1.1 in Temam’s book [10], where proof is given using Lemma 1 to reduce to the scalar differential equation _∂t⟨u,η⟩+a(u,η)=⟨f,η⟩ _∂t⟨u,η⟩+a(u,η)=⟨f,η⟩ in ^D′(0,T) ^D′(0,T) , for η∈V η∈V , which is treated by Faedo–Galerkin approximation and basic functional analysis. His proof extends straightforwardly, from a specific triple (H,V,a) (H,V,a) for the Navier-Stokes equations, to the general set-up in Section 2.1, also when ^A∗≠A ^A∗≠A.

However, either way, we need the finer theory described in the next two subsections.

3.2. Well-Posedness

We now substantiate that the unique solution from Theorem 4 depends continuously on the data, so that (53) is well-posed in the sense of Hadamard. First we note that the solution in Theorem 4 is an element of the space X in (60), which is a Banach space when normed, as done throughout, by

_∥u∥X=^{_{∥u∥_L2(0,T;V)2}+sup0≤t≤T^|u(t)|2+_{∥u∥^H1(0,T;^V∗)2}1/2}.

To clarify a redundancy in this choice, we need a Sobolev inequality for vector functions.

Lemma 3.

There is an inclusion _L2(0,T;V)∩^H1(0,T;^V∗)⊂C([0,T];H) _L2(0,T;V)∩^H1(0,T;^V∗)⊂C([0,T];H) and

sup0≤t≤T^|u(t)|2≤(1+_C22_C12T)_∫0T ^∥u∥2dt+_∫0T _{∥^u′∥∗2}dt.

Proof.

If u belongs to the intersection, the continuity follows from Lemma 2, where the formula gives _∂t ^|u|2=2Re⟨^u′,u⟩ _∂t ^|u|2=2Re⟨^u′,u⟩. By Lemma 1, integration of both sides entails

^|u(t)|2≤|u(_t0)^|2+_∫0T ^(∥u∥2+∥^u′_∥∗2)dt,

which by use of the Mean Value Theorem as in Remark 4 leads to the estimate. ☐

Remark 5.

In our solution set X in (60) one can safely omit the space C([0,T];H) C([0,T];H) , according to Lemma 3. Likewise sup|u| sup|u| can be removed from _∥·∥X _∥·∥X , as one just obtains an equivalent norm (similarly for the term _∫0T _{∥u(t)∥∗2}dt _∫0T _{∥u(t)∥∗2}dt in (7)). Thus X is more precisely a Hilbertable space; we omit this detail in the sequel for the sake of simplicity. However, we shall keep X as stated in order to emphasize the properties of the solutions.

The next result on stability is well known among experts, and while it may be derived from the abstract proofs in [8], we shall give a direct proof based on explicit estimates:

Corollary 1.

The unique solution u of (53), given by Theorem 4, depends continuously as an element of X on the data (f,_u0)∈_L2(0,T;^V∗)⊕H (f,_u0)∈_L2(0,T;^V∗)⊕H , i.e.,

_∥u∥X2≤c(|_u0 ^|2+_{∥f∥_L2(0,T;^V∗)2}).

That is, the solution operator (f,_u0)↦u (f,_u0)↦u is a bounded linear map _L2(0,T;^V∗)⊕H→X _L2(0,T;^V∗)⊕H→X .

Proof.

Clearly u∈_L2(0,T;V) u∈_L2(0,T;V) while the functions ^u′,f ^u′,f and Au Au belong to _L2(0,T;^V∗) _L2(0,T;^V∗), so as an identity of integrable functions,

Re⟨_∂tu,u⟩+Re⟨Au,u⟩=Re⟨f,u⟩.

Hence Lemma 2 and the V-ellipticity gives

_∂t ^|u|2+2_C4 ^∥u∥2≤2|⟨f,u⟩|≤_C4−1 _∥f∥∗2+_C4 ^∥u∥2.

Using again that ^|u(t)|2 ^|u(t)|2 and _∂t ^|u(t)|2 _∂t ^|u(t)|2 are in _L1(0,T) _L1(0,T) , taking B=C B=Cin Lemma 1 yields

^|u(t)|2+_C4 _∫0t ^∥u(s)∥2ds≤|_u0 ^|2+_C4−1 _{∥f∥_L2(0,T;^V∗)2}.

For the first two contributions to the X-norm this gives

sup0≤t≤T^|u(t)|2≤|_u0 ^|2+_C4−1 _{∥f∥_L2(0,T;^V∗)2},

_{∥u∥_L2(0,T;V)2}≤_C4−1|_u0 ^|2+_C4−2 _{∥f∥_L2(0,T;^V∗)2}.

Since u solves (53) it is clear that ∥_∂t _u(t)∥∗2≤_{(∥f(t)∥∗}+_∥Au∥∗ ⁾² ∥_∂t _u(t)∥∗2≤_{(∥f(t)∥∗}+_∥Au∥∗ ⁾², so we get

_∫0T∥_∂t _u(t)∥∗2dt≤2_∫0T _{∥f(t)∥∗2}dt+_{2∥A∥B(V,^V∗)2} _∫0T ^∥u∥2dt,

which upon substitution of (69) altogether shows (64). ☐

3.3. The First Order Solution Formula

We now supplement the well-posedness by a direct proof of the variation of constants formula, which requires that the extended Lax–Milgram operator A A generates an analytic semigroup in ^V∗ ^V∗ . This is known, cf. [9], but lacking a concise proof in the literature, we begin by analysing A in H:

Lemma 4.

For a V-elliptic Lax–Milgram operator A, both −A −A and −^A∗ −^A∗ have the sector Σ in (34) in their resolvent sets for θ=arccot(_C3/_C4) θ=arccot(_C3/_C4) and they generate analytic semigroups on H. This holds verbatim for the extensions −A −A and −^A′ −^A′ in ^V∗ ^V∗ .

Proof.

To apply Theorem 3, we let λ≠0 λ≠0 be given in the sector Σ for some angle θ satisfying 0<θ<arccot(_C3/_C4) 0<θ<arccot(_C3/_C4) . Then it is clear that δ=−sgn(Imλ)θ δ=−sgn(Imλ)θ or δ=0 δ=0gives

Re(^eiδλ)≥0.

In case δ∈±θ δ∈±θ a multiplication of the inequalities (28) by −sinδ −sinδyields

−sinδIma(u,u)≥−_C3 _C4−1sinθRea(u,u).

In addition _Cθ:=_C4cosθ−_C3sinθ>0 _Cθ:=_C4cosθ−_C3sinθ>0 , because cotθ>_C3 _C4−1 cotθ>_C3 _C4−1 . So for u∈D(A) u∈D(A),

Re(^eiδ(a(u,u)+λ(u|u)))≥Re(^eiδa(u,u))=cosδRea(u,u)−sinδIma(u,u)≥(cosθ−_C3 _C4−1sinθ)Rea(u,u)≥_Cθ ^∥u∥2.

This V-ellipticity holds also if δ=0 δ=0 , cf. (71), so ^eiδ(A+λI) ^eiδ(A+λI) is in any case bijective; and so is −A−λI −A−λI.

To bound −^(A+λI)−1 −^(A+λI)−1 , we see from (73) that for u∈D(A) u∈D(A),

|λ|(u|u)≤|((A+λ)u|u)|+|a(u,u)|≤|((A+λ)u|u)|+_C3 ^∥u∥2≤(1+_C3 _Cθ−1)|((A+λ)u|u)|.

This implies (35) for −A −A . Since ^A∗ ^A∗ is the Lax–Milgram operator associated to the elliptic form ^a∗ ^a∗ , the above also entails the statement for −^A∗ −^A∗.

For A A it follows at once from (73) that Re⟨^eiδ(A+λ)u,u⟩≥_Cθ ^∥u∥2 Re⟨^eiδ(A+λ)u,u⟩≥_Cθ ^∥u∥2 for u∈V u∈V . Hence R(A+λI) R(A+λI) is closed in ^V∗ ^V∗ , and it is also dense since R(A+λI)⊃R(A+λI)=H R(A+λI)⊃R(A+λI)=H by the above; i.e., A+λI A+λI is surjective. Mimicking (74), we get for u≠0 u≠0 , ∥w∥=1 ∥w∥=1, both in V,

|λ|·_∥u∥∗≤supw|⟨(A+λ)u,w⟩|+_C3 _Cθ−1|⟨(A+λ)u,1∥u∥u⟩|≤c_{∥(A+λ)u∥∗}.

This yields injectivity of A+λI A+λI and the resolvent estimate. ^A′ ^A′ is covered through ^a∗ ^a∗. ☐

We denote by ^e−tA ^e−tA the semigroup generated by −A −A on ^V∗ ^V∗ , to distinguish it from ^e−tA ^e−tA on H. Analogously for ^{e−t^A′}∈B(^V∗) ^{e−t^A′}∈B(^V∗) . As A⊂A A⊂A implies that ^(A+λI)−1 _|H=^(A+λI)−1 ^(A+λI)−1 _|H=^(A+λI)−1 , and since A and A A have the same sector Σ by Lemma 4, the well-known Laplace transformation formula, cf. ([23], Theorem 1.7.7), yields the corresponding fact, say ^e−tA _|H=^e−tA ^e−tA _|H=^e−tAfor the semigroups:

Lemma 5.

For all x∈H x∈H one has ^e−tAx=^e−tAx ^e−tAx=^e−tAx as well as ^{e−t^A′}x=^{e−t^A∗}x ^{e−t^A′}x=^{e−t^A∗}x .

We could add that A and ^A∗ ^A∗ are dissipative, as m(A)>0 m(A)>0 , m(^A∗)>0 m(^A∗)>0 in H, so ^e−tA ^e−tA , ^{e−t^A∗} ^{e−t^A∗} are contractions for t≥0 t≥0 by the Lumer–Philips theorem; cf. ([20], Corollary 14.12).

Using Lemmas 4 and 5, the announced formula results as an addendum to Theorem 4:

Theorem 5.

The unique solution u in X provided by Theorem 4 satisfies that

u(t)=^e−tA _u0+_∫0t ^e−(t−s)Af(s)dsfor0≤t≤T,

where each of the three terms belongs to X.

Proof.

Once (76) has been shown, Theorem 4 applies in particular to cases with f=0 f=0 , yielding that u(t) u(t) and hence ^e−tA _u0 ^e−tA _u0 belongs to X. For general data (f,_u0) (f,_u0)this means that the last term containing f necessarily is a member of X too.

To derive Formula (76) in the present general context, one should note that all terms in the equation _∂tu+Au=f _∂tu+Au=f belong to the space _L2(0,T;^V∗) _L2(0,T;^V∗) . Therefore the operator ^e−(T−t)A ^e−(T−t)Aapplies to both sides as an integration factor, yielding

^e−(T−t)A _∂tu(t)+^e−(T−t)AAu(t)=^e−(T−t)Af(t).

Now ^e−(T−t)Au(t) ^e−(T−t)Au(t) is in _L1(0,T;^V∗) _L1(0,T;^V∗) , cf. the argument prior to (46). For its derivative in ^D′(0,T;^V∗) ^D′(0,T;^V∗) the Leibniz rule in Proposition 3 gives, as u(t)∈V=D(A) u(t)∈V=D(A)for t a.e.,

_∂t(^e−(T−t)Au(t))=^e−(T−t)A _∂tu(t)+^e−(T−t)AAu(t).

As both terms on the right-hand side are in _L2(0,T;^V∗) _L2(0,T;^V∗), the implication (ii)⇒ (i) in Lemma 1 gives

^e−(T−t)Au(t)=^e−TA _u0+_∫0t ^e−(T−s)Af(s)ds.

From this identity in C([0,T];^V∗) C([0,T];^V∗) formula (76) results in case t=T t=T by evaluation, when also Lemma 5 is used for the term containing _u0 _u0 . However, obviously the above argument applies to any subinterval [0,_T1]⊂[0,T] [0,_T1]⊂[0,T] , whence (76) is valid for all t in [0,T] [0,T]. ☐

Alternatively one could conclude by applying ^e−(T−s)A=^e−(T−t)A ^e−(t−s)A ^e−(T−s)A=^e−(T−t)A ^e−(t−s)A in (79) and use the Bochner identity to commute ^e−(T−t)A ^e−(T−t)A with the integral: as analytic semigroups like ^e−(T−t)A ^e−(T−t)A are always injective, cf. Proposition 1, formula (76) then results at once.

For later reference we show similarly the next inequality:

Corollary 2.

The solution ^e−tA _u0 ^e−tA _u0 to the problem with f=0 f=0 in Theorem 4 belongs to _L2(0,T;V) _L2(0,T;V) and fulfils, for every _u0∈H _u0∈H ,

sup0≤t≤T(T−t)|^e−tA _u0 ^|2≤_C5 _∫0T ^{∥^e−tA _u0∥2}dt.

Proof.

It is seen from Theorem 5 that u(t)=^e−tA _u0 u(t)=^e−tA _u0 always is in _L2(0,T;V) _L2(0,T;V) , as a member of X. By taking scalar products with (T−·)u (T−·)u on both sides of the differential equation, one obtains in _L1(0,T) _L1(0,T)the identity

(T−t)⟨^u′(t),u(t)⟩+(T−t)a(u(t),u(t))=0.

Taking real parts here, applying Lemma 2 to u and integrating partially on [t,T] [t,T], one obtains

_∫tT ^|u(s)|2ds−(T−t)^|u(t)|2=−2_∫tT(T−s)Rea(u(s),u(s))ds.

By reorganising this, a crude estimate yields the result at once for _C5=_C2 _C1+2T_C3 _C5=_C2 _C1+2T_C3. ☐

3.4. Non-Selfadjoint Dynamics

It is classical that ^e−tA _u0 ^e−tA _u0 in (76) is a term that decays exponentially for t→∞ t→∞ if A is self-adjoint and has compact inverse on H. This follows from the eigenfunction expansions, cf. the formulas in the introduction and Section 2.2, which imply for the ’height’ function h(t)=|^e−tA _u0| h(t)=|^e−tA _u0| that h(t)=O(^e−tRe_λ1) h(t)=O(^e−tRe_λ1).

However, it is a much more precise dynamical property that h(t) h(t) is a strictly convex function for _u0≠0 _u0≠0 (we refer to [25] for a lucid account of convex functions). Strict convexity is established below for wide classes of non-self-adjoint A, namely if A is hyponormal or such that ^A2 ^A2is accretive.

Moreover, it seems to be a novelty that the injectivity of ^e−tA ^e−tA provided by Proposition 1 implies the strict convexity. For simplicity we first explain this for the square h^(t)2 h^(t)2.

Indeed, differentiating twice for t>0 t>0 one finds for u=^e−tA _u0 u=^e−tA _u0,

^{(^h2)′′}=^{(−2Re(A^e−tA _u0|^e−tA _u0))′}=2Re(^A2u|u)+2(Au|Au).

In case ^A2 ^A2 is accretive, that is when m(^A2)≥0 m(^A2)≥0 , we may keep only the last term in (83) to get that ^{(^h2)′′}(t)≥2^{|A^e−tA _u0|2} ^{(^h2)′′}(t)≥2^{|A^e−tA _u0|2} , which for _u0≠0 _u0≠0 implies ^{(^h2)′′}>0 ^{(^h2)′′}>0 as both A and ^e−tA ^e−tA are injective; cf. (34) and Proposition 1. Hence ^h2 ^h2 is strictly convex for t>0 t>0 if m(^A2)≥0 m(^A2)≥0.

Another case is when A is hyponormal. For an unbounded operator A this means, cf. the work of Janas [18], that

D(A)⊂D(^A∗)with|^A∗u|≤|Au|forallu∈D(A).

Note that if both A, ^A∗ ^A∗are hyponormal, then A is normal. This is a quite general class, but it fits most naturally into the present discussion:

For hyponormal A we have R(^e−tA)⊂D(A)⊂D(^A∗) R(^e−tA)⊂D(A)⊂D(^A∗) , which shows that ^A∗ ^e−tA _u0 ^A∗ ^e−tA _u0 is defined. Using this and hyponormality once more in (83), we get

^{(^h2)′′}(t)≥(Au|^A∗u)+(^A∗u|Au)+^|Au|2+|^A∗ ^u|2=^{|(A+^A∗)^e−tA _u0|2}.

Now ^{(^h2)′′}>0 ^{(^h2)′′}>0 follows for _u0≠0 _u0≠0 from injectivity of ^e−tA ^e−tA and of A+^A∗ A+^A∗ ; the latter holds since 2_aRe 2_aRe is V-elliptic. So ^h2 ^h2is also strictly convex for hyponormal A.

Also on the closed half-line with t≥0 t≥0 there is a result on non-selfadjoint dynamics. Here we return to h(t) h(t) itself and normalise, at no cost, to |_u0|=1 |_u0|=1to get cleaner statements:

Proposition 4.

Let A denote a V-elliptic Lax–Milgram operator, defined from a triple (H,V,a) (H,V,a) , such that A is hyponormal, as above, or such that ^A2 ^A2 is accretive, and let u be the solution from Theorem 4 for f=0 f=0 and |_u0|=1 |_u0|=1 . Then h(t)=|u(t)| h(t)=|u(t)| is strictly decreasing and strictly convex for t≥0 t≥0 and differentiable from the right at t=0 t=0 with

^h′(0)=−Re(A_u0|_u0)for_u0∈D(A),

and generally

^h′(0)≤−m(A).

Remark 6.

The derivative ^h′(0) ^h′(0) might be −∞ −∞ if _u0∈H∖D(A) _u0∈H∖D(A) .

Proof.

By the convexity shown above, ^{(^h2)′} ^{(^h2)′} is increasing. Since m(A)>0 m(A)>0 holds by the V-ellipticity, ^h2 ^h2 is strictly decreasing (and so is h) for t>0 t>0as

^{(^h2)′}(t)=−2Re(A^e−tA _u0|^e−tA _u0)≤−2m(A)^{|^e−tA _u0|2}<0.

These properties give that ^h′=^{(^h2)′}/(2^h2) ^h′=^{(^h2)′}/(2^h2) is strictly increasing for t>0 t>0 , so the Mean Value Theorem yields that (h(t)−h(s))/(t−s)<(h(u)−h(t))/(u−t) (h(t)−h(s))/(t−s)<(h(u)−h(t))/(u−t) for 0<s<t<u 0<s<t<u ; i.e., h is strictly convex on ]0,∞[ ]0,∞[.

The inequality h((1−θ)t+θs)≤(1−θ)h(t)+θh(s) h((1−θ)t+θs)≤(1−θ)h(t)+θh(s) , θ∈]0,1[ θ∈]0,1[ now extends by continuity to t=0 t=0. So does strict convexity of h, using twice that the slope function is increasing.

By convexity ^h′ ^h′ is increasing for t>0 t>0 , so _{limt→⁰⁺} ^h′(t)=inf^h′≥−∞ _{limt→⁰⁺} ^h′(t)=inf^h′≥−∞ . For each 0<s<1 0<s<1 the continuity of h yields |^e−tA _u0|≥s|_u0|=s |^e−tA _u0|≥s|_u0|=s for all sufficiently small t≥0 t≥0 . By the above formulas for ^h′ ^h′ and ^{(^h2)′} ^{(^h2)′} we have ^h′(t)=−Re(A^e−tA _u0|^e−tA _u0)/|^e−tA _u0| ^h′(t)=−Re(A^e−tA _u0|^e−tA _u0)/|^e−tA _u0| , so the Mean Value Theorem gives for some τ∈]0,t[ τ∈]0,t[,

^t−1(h(t)−h(0))=^h′(τ)≤−m(A)s<0.

Hence h(0)>h(t) h(0)>h(t) for all t>0 t>0 . Moreover, the limit of ^h′(τ) ^h′(τ) was shown above to exist for τ→⁰⁺ τ→⁰⁺ , so ^h′(0) ^h′(0) exists in [−∞,−m(A)] [−∞,−m(A)] . If _u0∈D(A) _u0∈D(A) we may commute A with the semigroup in the formula for ^h′(τ) ^h′(τ) , which by continuity gives ^h′(0)=−Re(A_u0|_u0) ^h′(0)=−Re(A_u0|_u0). ☐

Proposition 4 is a stiffness result for u=^e−tA _u0 u=^e−tA _u0 , due to strict convexity of |^e−tA _u0| |^e−tA _u0| . It is noteworthy that when A≠^A∗ A≠^A∗ , then Proposition 4 gives conditions under which the eigenvalues in C\R C\R(if any) never lead to oscillations in the size of the solution.

Remark 7.

Since ^h′(0) ^h′(0) is estimated in terms of the lower bound m(A) m(A) , it is the numerical range ν(A) ν(A) , rather than σ(A) σ(A) , that controls short-time decay of the solutions ^e−tA _u0 ^e−tA _u0 .

Remark 8.

In Proposition 4 we note that when ^A2 ^A2 is accretive, i.e., m(^A2)≥0 m(^A2)≥0 , then A is necessarily sectorial with half-angle π/4 π/4 ; that is ν(A)⊂z∈||arg(z)|≤π/4 ν(A)⊂z∈||arg(z)|≤π/4 . This may be seen as in ([17], Lemma 3), where reduction to bounded operators was made in order to invoke the operator monotonicity of the square root.

Remark 9.

We take the opportunity to point out an error in ([17], Lemma 3), where it incorrectly was claimed that having half-angle π/4 π/4 also is sufficient for m(^A2)≥0 m(^A2)≥0 . A counter-example is available already for A in B(H) B(H) (if dimH≥2 dimH≥2 ), as A=X+iY A=X+iY for self-adjoint X, Y∈B(H) Y∈B(H) : here m(A)≥0 m(A)≥0 if and only if X≥0 X≥0 , and we can even arrange that A has half-angle π/4 π/4 , that is |Im(Av|v)|≤Re(Av|v) |Im(Av|v)|≤Re(Av|v) or |(Yv|v)|≤(Xv|v) |(Yv|v)|≤(Xv|v) , by designing Y so that −X≤Y≤X −X≤Y≤X . Here we may take Y=δX+_λ1U Y=δX+_λ1U , where δ>0 δ>0 is small enough and U is a partial isometry that interchanges two eigenvectors _v1 _v1 , _v2 _v2 of X with eigenvalues _λ2>_λ1>0 _λ2>_λ1>0 , U=0 U=0 on H⊖span(_v1,_v2) H⊖span(_v1,_v2) . In fact, writing v=_c1 _v1+_c2 _v2+_v⊥ v=_c1 _v1+_c2 _v2+_v⊥ for _v⊥∈H⊖span(_v1,_v2) _v⊥∈H⊖span(_v1,_v2) , since _v1⊥_v2 _v1⊥_v2 , the above inequalities for Y are equivalent to 2_λ1|Re(_c1 _c¯2)|≤_λ1(1−δ)|_c1 ^|2+(1−δ)_λ2 ^|_c2|2+(1−δ)(X_v⊥|_v⊥) 2_λ1|Re(_c1 _c¯2)|≤_λ1(1−δ)|_c1 ^|2+(1−δ)_λ2 ^|_c2|2+(1−δ)(X_v⊥|_v⊥) , which by the positivity of X and Young’s inequality is implied by 1/(1−δ)≤(1−δ)_λ2 _λ1 1/(1−δ)≤(1−δ)_λ2 _λ1 , that is if 0<δ≤1−_λ1/_λ2 0<δ≤1−_λ1/_λ2 . Now, m(^A2)≥0 m(^A2)≥0 if and only if ^|Xv|2≥^|Yv|2 ^|Xv|2≥^|Yv|2 for all v in H, but this will always be violated, as one can see from ^|Yv|2=^δ2 ^|Xv|2+_λ12 ^|Uv|2+2δ_λ1Re(Xv|Uv) ^|Yv|2=^δ2 ^|Xv|2+_λ12 ^|Uv|2+2δ_λ1Re(Xv|Uv) by inserting v=_v1 v=_v1 , for the last term drops out as _v1⊥_v2=U_v1 _v1⊥_v2=U_v1 , so that actually |Y_v1 ^|2=(^δ2+1)_λ12>^|X_v1|2 |Y_v1 ^|2=(^δ2+1)_λ12>^|X_v1|2 . Thus A=λ004λ+iλδ114δ A=λ004λ+iλδ114δ is a counter-example in ^C2 ^C2 for any λ>0 λ>0 , 0<δ≤1/2 0<δ≤1/2 .

Remark 10.

It is perhaps useful to emphasize the benefit from joining the two methods. Within semigroup theory the “mild solution” given in (76) is the only possible solution to (53); but as our class of solutions is larger, the extension of the old uniqueness argument in Theorem 5 was needed. Existence of a solution is for analytic semigroups classical if f:[0,T]→H f:[0,T]→H is Hölder continuous, cf. ([23], Corollary 4.3.3). Using functional analysis, this gap to the weaker condition f∈_L2(0,T;^V∗) f∈_L2(0,T;^V∗) is bridged by Theorem 5, which states that the mild solution is indeed the solution in the space of vector distributions in Theorem 4; albeit at the expense that the generator A is a V-elliptic Lax–Milgram operator.

4. Abstract Final Value Problems

In this section, we show for a Lax–Milgram operator A Athat the final value problem

_∂tu+Au=fin^D′(0,T;^V∗),u(T)=_uTinH,

is well-posed when the final data belong to an appropriate space, to be identified below. This is obtained via comparison with the initial value problem treated in Section 3.

4.1. A Bijection From Initial to Terminal States

According to Theorem 4, the solutions to the differential equation ^u′+Au=f ^u′+Au=f are for fixed f parametrised by the initial states u(0)∈H u(0)∈H . To study the terminal states u(T) u(T) we note that (76) yields

u(T)=^e−TAu(0)+_∫0T ^e−(T−s)Af(s)ds.

This representation of u(T) u(T) is essential in what follows, as it gives a bijective correspondence u(0)↔u(T) u(0)↔u(T)between the initial and terminal states, as accounted for below.

First we analyse the integral term above by introducing the yield map f↦_yf f↦_yfgiven by

_yf=_∫0T ^e−(T−s)Af(s)ds,f∈_L2(0,T;^V∗).

Clearly _yf _yf is a vector in ^V∗ ^V∗ by definition of the integral (and since C([0,T];^V∗)⊂_L1(0,T;^V∗) C([0,T];^V∗)⊂_L1(0,T;^V∗) ). But actually it is in the smaller space H, for _yf=u(T) _yf=u(T) holds in H when u is the solution for _u0=0 _u0=0 of (53), and then Corollary 1 yields an estimate of _supt∈[0,T]|u(t)| _supt∈[0,T]|u(t)| by the _L2 _L2 -norm of f; cf. (61). In particular, we have

|_yf _{|≤c∥f∥_L2(0,T;^V∗)}.

Moreover, f↦_yf f↦_yf is by (93) bounded _L2(0,T;^V∗)→H _L2(0,T;^V∗)→H , and it has dense range in H containing all x∈D(^eεA) x∈D(^eεA) for every ε>0 ε>0 , for if in (92) we insert the piecewise continuous function

_fε(s)=_1[T−ε,T](s)^{e(T−ε−s)A}(1ε^eεAx),

then the semigroup property gives _{y_fε}=_∫T−εT ^e−εA(1ε^eεAx)ds=1ε_∫T−εTxds=x _{y_fε}=_∫T−εT ^e−εA(1ε^eεAx)ds=1ε_∫T−εTxds=x. However, standard operator theory gives the optimal result, that is, surjectivity:

Proposition 5.

The yield map f↦_yf f↦_yf is in B(_L2(0,T;^V∗),H) B(_L2(0,T;^V∗),H) and it is surjective, R(_yf)=H R(_yf)=H . Its adjoint in B(H,_L2(0,T;V)) B(H,_L2(0,T;V)) is the orbit map given by v↦^{e−(T−·)^A∗}v v↦^{e−(T−·)^A∗}v .

Proof.

To determine the adjoint of f↦_yf f↦_yf , we first calculate for f∈_L2(0,T;H) f∈_L2(0,T;H) so that the integrand in (92) belongs to C([0,T];H) C([0,T];H) . For v∈H v∈Hwe get, using the Bochner identity twice,

(_yf|v)=_∫0T(^e−(T−s)Af(s)|v)ds=_∫0T(f(s)|^{e−(T−s)^A∗}v)ds=⟨f,^{e−(T−s)^A∗}v⟩.

The last scalar product makes sense because s↦^{e−(T−s)^A∗}v s↦^{e−(T−s)^A∗}v is in _L2(0,T;V) _L2(0,T;V) , as seen by applying Corollary 2 to the Lax–Milgram operator ^A∗ ^A∗ , and _L2(0,T;V) _L2(0,T;V) is the dual space to _L2(0,T;^V∗) _L2(0,T;^V∗) ; cf. Remark 11 below. Since _L2(0,T;H) _L2(0,T;H) is dense in _L2(0,T;^V∗) _L2(0,T;^V∗) , it follows by closure that the left- and right-hand sides are equal for every f∈_L2(0,T;^V∗) f∈_L2(0,T;^V∗) and v∈H v∈H . Hence v↦^{e−(T−·)^A∗}v v↦^{e−(T−·)^A∗}v is the adjoint of _yf _yf.

Applying Corollary 2 to ^A∗ ^A∗ for t=0 t=0 , a change of variables yields for every v∈H v∈H,

^|v|2≤_C5T_∫0T ^{∥^{e−(T−s)^A∗}v∥2}ds.

This estimate from below of the adjoint is equivalent to closedness of the range of _yf _yf , as the range is dense by (94). This follows from the Closed Range Theorem; cf. ([26], Theorem 3.1) for a general result on this. ☐

Remark 11.

The Banach spaces _L2(0,T;V) _L2(0,T;V) , _L2(0,T;^V∗) _L2(0,T;^V∗) are in duality, and _L2 ^(0,T;V)∗ _L2 ^(0,T;V)∗ identifies with _L2(0,T;^V∗) _L2(0,T;^V∗) : for each Λ∈_L2 ^(0,T;V)∗ Λ∈_L2 ^(0,T;V)∗ the inner product _aRe _aRe and Riesz’ theorem yield h∈_L2(0,T;V) h∈_L2(0,T;V) that for g∈_L2(0,T;V) g∈_L2(0,T;V) fulfils ⟨Λ,g⟩=_∫0T _aRe(h,g)dt ⟨Λ,g⟩=_∫0T _aRe(h,g)dt ; so ⟨Λ,g⟩=_∫0T⟨f,g⟩dt ⟨Λ,g⟩=_∫0T⟨f,g⟩dt for f=12(A+^A′)h f=12(A+^A′)h in _L2(0,T;^V∗) _L2(0,T;^V∗) ; cf. (23) and (25).

The surjectivity of _yf _yfcan be shown in important cases using an explicit construction, which is of interest in control theory (cf. Remark 12), and given here for completeness:

Proposition 6.

If ^A∗=A ^A∗=A and ^A−1 ^A−1 is compact, every v∈H v∈H equals _yf _yf for some computable f∈_L2(0,T;^V∗) f∈_L2(0,T;^V∗) .

Proof.

Fact 1 yields an ortonormal basis _{(_en)n∈N} _{(_en)n∈N} so that A_en=_λn _en A_en=_λn _en , hence any v in H fulfils v=_∑j _αj _ej v=_∑j _αj _ej with _∑j ^|_αj|2<∞ _∑j ^|_αj|2<∞ . By Fact 2 every f∈_L2(0,T;^V∗) f∈_L2(0,T;^V∗)has an expansion

f(t)=∑j=1∞_βj(t)_ej=∑j=1∞⟨f(t),_ej⟩_ej

converging in ^V∗ ^V∗ for t a.e. Since ^e−(T−s)A _ej=^{e−(T−s)_λj} _ej ^e−(T−s)A _ej=^{e−(T−s)_λj} _ej, cf. Remark 2, such f fulfill

_yf=_∫0T ^e−(T−s)Af(s)ds=∑j=1∞^e−T_λj(_∫0T _βj(s)^es_λjds)_ej.

Hence _yf=v _yf=v is equivalent to the validity of _∫0T _βj(s)^es_λjds=_αj ^eT_λj _∫0T _βj(s)^es_λjds=_αj ^eT_λj for j∈N j∈N . So if, in terms of some _θj∈]0,1[ _θj∈]0,1[ to be determined, we take the coefficients of f(t) f(t)as

_βj(t)=_kj _{1[_θjT,T]}(t)exp(t(_λj−_λj)),

then the condition will be satisfied if and only if _kj=_αj ^eT_λj_λj^{(^eT_λj−^e_θjT_λj)−1} _kj=_αj ^eT_λj_λj^{(^eT_λj−^e_θjT_λj)−1}.

Moreover, using the equivalent norm |||·_|||∗ |||·_|||∗ on ^V∗ ^V∗in Fact 2,

_{∥f∥_L2(0,T;^V∗)2}=_∫0T|||f(t)_|||∗2dt=∑j=1∞_λj−1 _∫0T ^|_βj(t)|2dt.

Therefore f is in _L2(0,T;^V∗) _L2(0,T;^V∗) whenever _∫0T|_βj ^|2dt≤C_λj ^|_αj|2 _∫0T|_βj ^|2dt≤C_λj ^|_αj|2 holds eventually for some C>0 C>0, and here a direct calculation gives

_∫0T|_βj ^|2|_kj ^|2dt=^{e2T(_λj−_λj)}−^{e2_θjT(_λj−_λj)}2_λj−2_λj=^e2T_λj(^{e2T(1−_θj)(_λj−_λj)}−1)2^e2T_λj(_λj−_λj).

So in view of the expression for _kj _kj , the quadratic integrability of f follows if the _θj _θj can be chosen so that the above numerator is estimated by C(_λj−_λj)^{(^eT_λj−^e_θjT_λj)2} C(_λj−_λj)^{(^eT_λj−^e_θjT_λj)2} with C independent of j≥J j≥Jfor a suitable J, or more simply if

^{e2T(1−_θj)(_λj−_λj)}−1≤C(_λj−_λj)^{(1−^{e−(1−_θj)T_λj})2}.

We may take J so that _λj>3 _λj>3 for all j≥J j≥J , since at most finitely many eigenvalues do not fulfill this. Then _θj:=1−^{(_λj−_λj)−1} _θj:=1−^{(_λj−_λj)−1} belongs to ]0,1[ ]0,1[, and the above is reduced to

exp(2T)−1≤C(_λj−_λj)^{(1−exp(−T_λj−1))2}.

Applying the Mean Value Theorem to exp on [−T_λj−1,0] [−T_λj−1,0], we obtain the inequality

(_λj−_λj)^{(1−exp(−T_λj−1))2}≥exp(−2T3−1)^T2 _λj_λj−_λj>exp(−4T)^T2>0.

Hence (103) is fulfilled for C=exp(6T)/^T2 C=exp(6T)/^T2. ☐

Remark 12.

In the above proof supp _βj⊂[_θjT,T] supp _βj⊂[_θjT,T] , so the given v can be attained by _yf _yf by arranging the coefficients _βj _βj in each dimension successively as time approaches T, as _θj↗1 _θj↗1 follows in (99) by counting the eigenvalues so that _λj↗∞ _λj↗∞ . This can even be postponed to any given _T0<T _T0<T , for supp _βj⊂[_T0,T] supp _βj⊂[_T0,T] holds whenever _θjT≥_T0 _θjT≥_T0 , and we may reset to _θj=_T0/T _θj=_T0/T and adjust the _kj _kj accordingly, for the finitely many remaining j. Both themes may be of interest in infinite dimensional control theory.

In order to isolate u(0) u(0) in (91), it will of course be decisive that the operator ^e−TA ^e−TAhas an inverse, as was shown for general analytic semigroups in Proposition 1.

For our Lax–Milgram operator A with analytic semigroup ^e−tA ^e−tA generated by A=−A A=−A , it is the symbol ^etA ^etA that denotes the inverse, consistent with the sign convention in (41). Hence the properties of ^etA ^etA can be read off from Proposition 2, where (43) gives

^e−tA ^eTA⊂^e(T−t)Afor0≤t≤T.

Moreover, it is decisive for the interpretation of the compatibility conditions in Section 4.2 below to know that the domain inclusions in Proposition 2 are strict. We include a mild sufficient condition along with a characterisation of the domain D(^etA) D(^etA).

Proposition 7.

If H has an orthonormal basis of eigenvectors _{(_ej)j∈N} _{(_ej)j∈N} of A so that the corresponding eigenvalues fulfil Re_λj→∞ Re_λj→∞ for j→∞ j→∞ , then the inclusions in (44) are both strict , and D(^etA) D(^etA) is the completion of span_{(_ej)j∈N} span_{(_ej)j∈N} with respect to the graph norm,

_{∥x∥D(^etA)2}=∑j=1∞(1+^e2Re_λjt)^|(x|_ej)|2.

The domain D(^etA) D(^etA) equals the subspace S⊂H S⊂H in which the right-hand side is finite.

Proof.

If x∈S x∈S the vector v=_∑j=1∞ ^e_λjt(x|_ej)_ej v=_∑j=1∞ ^e_λjt(x|_ej)_ej is well defined in H, and with methods from Remark 2 it follows that ^e−tAv=x ^e−tAv=x ; i.e., x∈D(^etA) x∈D(^etA).

Conversely, for x∈D(^etA) x∈D(^etA) there is a vector y∈H y∈H such that x=^e−tAy=_∑j=1∞(y|_ej)^e−t_λj _ej x=^e−tAy=_∑j=1∞(y|_ej)^e−t_λj _ej . That is, ^e_λjt(x|_ej)=(y|_ej)∈_ℓ2 ^e_λjt(x|_ej)=(y|_ej)∈_ℓ2 , so x∈S x∈S . Then |^etA ^x|2=∑^e2Re_λjt ^|(x|_ej)|2 |^etA ^x|2=∑^e2Re_λjt ^|(x|_ej)|2 yields (106).

Now any x∈D(^{e^t′A}) x∈D(^{e^t′A}) is also in D(^etA) D(^etA) for t<^t′ t<^t′ , since Re_λj>0 Re_λj>0 holds in (106) for all j by V-ellipticity. As Re_λj→∞ Re_λj→∞ , we may choose a subsequence so that Re_{λ_jn}>n Re_{λ_jn}>nand set

x=∑n=1∞1n^{e−_{λ_jn}t} _{e_jn}.

Here x∈D(^etA) x∈D(^etA) as it is in S by construction for t≥0 t≥0 ; but not in D(^{e^t′A}) D(^{e^t′A}) for ^t′>t ^t′>tas

∑j=1∞^{e2Re_λj ^t′} ^|(x|_ej)|2=∑n=1∞^{e2Re_{λ_jn}(^t′−t)}1ⁿ²>∑n=1∞^{e2n(^t′−t)} ⁿ²=∞.

Furthermore, using orthogonality, it follows for any x∈D(^etA) x∈D(^etA) that, for N→∞ N→∞,

∥x−∑j≤N(x|_ej)_ej _∥D(^etA)2=∑J>N(1+^e2Re_λjt)^|(x|_ej)|2→0.

Hence the space D(^etA) D(^etA) has span_{(_ej)j∈N} span_{(_ej)j∈N}as a dense subspace. That is, the completion of the latter with respect to the graph norm identifies with the former. ☐

After this study of the map _yf _yf , the injectivity of the operator ^e−tA ^e−tA and the domain D(^etA) D(^etA) , cf. Propositions 1, 2, 5 and 7, we address the final value problem (90) by solving (91) for the vector u(0) u(0). This is done by considering the map

u(0)↦^e−TAu(0)+_yf.

This is composed of the bijection ^e−TA ^e−TA and a translation by the vector _yf _yf , hence is bijective from H to the affine space R(^e−TA)+_yf R(^e−TA)+_yf . In fact, using (41), inversion gives

u(0)=^eTAu(T)−_∫0T ^e−(T−s)Af(s)ds=^eTA(u(T)−_yf).

This may be summed up thus:

Theorem 6.

For the set of solutions u in X of the differential equation (_∂t+A)u=f (_∂t+A)u=f with fixed data f∈_L2(0,T;^V∗) f∈_L2(0,T;^V∗) , the Formulas (91) and (111) give a bijective correspondence between the initial states u(0) u(0) in H and the terminal states u(T) u(T) in _yf+D(^eTA) _yf+D(^eTA) .

In view of the linearity, the affine space _yf+D(^eTA) _yf+D(^eTA)might seem surprising. However, a suitable reinterpretation gives the compatibility condition introduced in the next section.

4.2. Well-Posedness of the Final Value Problem

Since R(^eTA)⊂H R(^eTA)⊂H , the initial state in (111) can be inserted into Formula (76), so any solution u of (90) must satisfy

u(t)=^e−tA ^eTA(_uT−_yf)+_∫0t ^e−(t−s)Af(s)ds.

Here one could contract the first term a bit, as ^e−tA ^eTA⊂^e(T−t)A ^e−tA ^eTA⊂^e(T−t)A by (105). But we refrain from this because ^e−tA ^eTA ^e−tA ^eTA rather obviously applies to _uT−_yf _uT−_yf if and only if this vector belongs to D(^eTA) D(^eTA) —and the following theorem corroborates that this is equivalent to the unique solvability in X of the final value problem (90):

Theorem 7.

Let V be a separable Hilbert space contained algebraically, topologically and densely in H, and let A be the Lax–Milgram operator defined in H from a bounded V-elliptic sesquilinear form a, and having bounded extension A:V→^V∗ A:V→^V∗ . For given f∈_L2(0,T;^V∗) f∈_L2(0,T;^V∗) and _uT∈H _uT∈H , the condition

_uT−_yf∈D(^eTA)

is necessary and sufficient for the existence of some u∈X u∈X , cf. (60), that solves the final value problem (90). Such a function u is uniquely determined and given by (112), where all terms belong to X as functions of t.

Proof.

When (90) has a solution u∈X u∈X , then _uT _uT is reachable from the initial state u(0) u(0) determined from the bijection in Theorem 6, which gives that _uT−_yf=^e−TAu(0)∈D(^eTA) _uT−_yf=^e−TAu(0)∈D(^eTA) . Hence (113) is necessary and (112) follows by insertion, as explained prior to (112). Uniqueness is obvious from the right-hand side of (112).

When _uT _uT , f fulfill (113), then _u0=^eTA(_uT−_yf) _u0=^eTA(_uT−_yf) defines a vector in H, so Theorem 4 yields a function u∈X u∈X solving (_∂t+A)u=f (_∂t+A)u=f and u(0)=_u0 u(0)=_u0 . According to Theorem 6 this u has final state u(T)=^e−TA ^eTA(_uT−_yf)+_yf=_uT u(T)=^e−TA ^eTA(_uT−_yf)+_yf=_uT , hence solves (90).

Finally, the fact that the integral in (112) defines a function in X follows at once from Theorem 5, for it states that it equals the solution in X of ^u˜′+Au˜=f ^u˜′+Au˜=f , u˜(0)=0 u˜(0)=0 . Since u∈X u∈X in (112), also ^e−tA ^eTA(_uT−_yf) ^e−tA ^eTA(_uT−_yf)is a function in X. ☐

Remark 13.

When (f,_uT) (f,_uT) fulfils (113), then (111) yields that _uT−_yf=^e−TAu(0) _uT−_yf=^e−TAu(0) .

Remark 14.

Writing condition (113) as _uT=^e−TAu(0)+_yf _uT=^e−TAu(0)+_yf , cf. Remark 13, this part of Theorem 7 is natural inasmuch as each set of admissible terminal data _uT _uT are in effect a sum of the terminal state, ^e−TAu(0) ^e−TAu(0) , of the semi-homogeneous initial value problem (53) with f=0 f=0 and of the terminal state _yf _yf of the semi-homogeneous problem (53) with u(0)=0 u(0)=0 . Moreover, the _uT _uT fill at least a dense set in H, as for fixed u(0) u(0) this follows from Proposition 5; for fixed f from the density of D(^eTA) D(^eTA) seen prior to Proposition 2.

Remark 15.

To elucidate the criterion _uT−_yf∈D(^eTA) _uT−_yf∈D(^eTA) in formula (113) of Theorem 7, we consider the matrix operator _PA=_∂t+A_rT _PA=_∂t+A_rT , with _rT _rT denoting restriction at t=T t=T , and the “forward” map Φ(f,_uT)=_uT−_yf Φ(f,_uT)=_uT−_yf , which by (61) and Proposition 5 give bounded operators

X→_PA_L2(0,T;^V∗)⊕H→ΦH.

Then, in terms of the range R(_PA) R(_PA) , clearly (90) has a solution if and only if f_uT∈R(_PA) f_uT∈R(_PA) , so the compatibility condition (113) means that R(_PA)=^Φ−1(D(^eTA))=D(^eTAΦ) R(_PA)=^Φ−1(D(^eTA))=D(^eTAΦ) .

The paraphrase at the end of Remark 15 is convenient for the choice of a useful norm on the data. Indeed, we now introduce the space of admissible data Y=D(^eTAΦ) Y=D(^eTAΦ), i.e.,

Y=(f,_uT)∈_L2(0,T;^V∗)⊕H|_uT−_yf∈D(^eTA),

endowed with the graph norm on D(^eTAΦ) D(^eTAΦ)given by

∥(f,_uT)_∥Y2=|_uT ^|2+_{∥f∥_L2(0,T;^V∗)2}+^{|^eTA(_uT−_yf)|2}.

Using the equivalent norm |||·_|||∗ |||·_|||∗ from (26) for ^V∗ ^V∗, the above is induced by the inner product

(_uT|_vT)+_∫0T _{(f(s)|g(s))^V∗}ds+(^eTA(_uT−_yf)|^eTA(_vT−_yg)).

This space Y is complete: as Φ in Remark 15 is bounded, the composite map ^eTAΦ ^eTAΦ is a closed operator from _L2(0,T;^V∗)⊕H _L2(0,T;^V∗)⊕H to H, so its domain D(^eTAΦ)=Y D(^eTAΦ)=Y is complete with respect to the graph norm given in (116). Hence Y is a Hilbert(-able) space—but we shall often just work with the equivalent norm on the Banach space Y obtained by using simply _∥·∥∗ _∥·∥∗ on ^V∗ ^V∗.

Moreover, the norm in (116) also leads to continuity of the solution operator for (90):

Theorem 8.

The solution u∈X u∈X in Theorem 7 depends continuously on the data (f,_uT) (f,_uT) in the Hilbert space Y in (115), or equivalently, for some constant c we have

_∫0T ^∥u(t)∥2dt+supt∈[0,T]^|u(t)|2+_∫0T _{∥_∂tu(t)∥∗2}dt≤|_uT ^|2+c_∫0T _{∥f(t)∥∗2}dt+|^eTA(_uT−_∫0T ^e−(T−t)Af(t)dt)^|2.

Another equivalent norm on the Hilbert space Y is obtained by omitting the term |_uT ^|2 |_uT ^|2 .

Proof.

This follows from Corollary 1 by inserting _u0=^eTA(_uT−_yf) _u0=^eTA(_uT−_yf) from (111) into (64), for this gives _∥u∥X2≤c|^eTA(_uT−_yf)^|2+c_{∥f∥_L2(0,T;^V∗)2} _∥u∥X2≤c|^eTA(_uT−_yf)^|2+c_{∥f∥_L2(0,T;^V∗)2} , where one can add |_uT ^|2 |_uT ^|2 . Conversely the boundedness of _yf _yf and ^e−TA ^e−TA yield that |_uT ^|2≤^c∥f∥2+c^{|^eTA(_uT−_yf)|2} |_uT ^|2≤^c∥f∥2+c^{|^eTA(_uT−_yf)|2}. ☐

Of course, Theorems 7 and 8 together mean that the final value problem in (90) is well posed in the spaces X and Y.

5. The Heat Equation With Final Data

To apply the theory in Section 4, we treat the heat equation and its final value problem. In the sequel Ω stands for a smooth, open bounded set in ^Rn ^Rn , n≥2 n≥2 as described in ([20], Appendix C). In particular Ω is locally on one side of its boundary Γ:=∂Ω Γ:=∂Ω.

For such sets we consider the problem of finding the u satisfying

_∂tu(t,x)−Δu(t,x)=f(t,x)inQ:=]0,T[×Ω,_γ0u(t,x)=g(t,x)onS:=]0,T[×∂Ω,_rTu(x)=_uT(x)atT×Ω.

Hereby the trace of functions on Γ is written in the operator notation _γ0 _u=u|Γ _γ0 _u=u|Γ ; similarly we also use _γ0 _γ0 for traces on S. _rT _rT denotes the trace operator at t=T t=T.

We shall also use _H01(Ω) _H01(Ω) , which is the subspace obtained by closing _C0∞(Ω) _C0∞(Ω) in the Sobolev space ^H1(Ω) ^H1(Ω) . Dual to this one has ^H−1(Ω) ^H−1(Ω) , which identifies with the set of restrictions to Ω from ^H−1(^Rn) ^H−1(^Rn) , endowed with the infimum norm. The reader is referred to Chapter 6 and Remark 9.4 in [20] for the spaces ^Hs(^Rn) ^Hs(^Rn)and the infimum norm.

5.1. The Boundary Homogeneous Case

In case g≡0 g≡0 in (119), the consequences of the abstract results in Section 4.2 are straightforward to account for. Indeed, with

V=_H01(Ω),H=_L2(Ω),^V∗=^H−1(Ω),

the boundary condition _γ0u=0 _γ0u=0 is imposed via the condition that u(t)∈V u(t)∈V for all t, or rather through use of the Dirichlet realization of the Laplacian−_{Δ_γ0} −_{Δ_γ0} (denoted by −_ΔD −_ΔD in the introduction), which is the Lax–Milgram operator A induced by the triple (_L2(Ω),_H01(Ω),s) (_L2(Ω),_H01(Ω),s)for

s(u,v)=∑j=1n_{(_∂ju|_∂jv)_L2(Ω)}.

In fact, the Poincaré inequality yields that the form s(u,v) s(u,v) is _H01(Ω) _H01(Ω) -elliptic, and as it is symmetric too, A=−_{Δ_γ0} A=−_{Δ_γ0} is a selfadjoint unbounded operator in _L2(Ω) _L2(Ω) , with D(−_{Δ_γ0})⊂_H01(Ω) D(−_{Δ_γ0})⊂_H01(Ω).

Hence the operator −A=_{Δ_γ0} −A=_{Δ_γ0} generates an analytic semigroup ^et_{Δ_γ0} ^et_{Δ_γ0} in B(_L2(Ω)) B(_L2(Ω)) ; the bounded extension −A=Δ:_H01(Ω)→^H−1(Ω) −A=Δ:_H01(Ω)→^H−1(Ω) generates the analytic semigroup ^e−tA=^etΔ ^e−tA=^etΔ on ^H−1(Ω) ^H−1(Ω) ; cf. Lemma 4. Consistently with Section 4.1 we also set ^{(^et_{Δ_γ0})−1}=^{e−t_{Δ_γ0}} ^{(^et_{Δ_γ0})−1}=^{e−t_{Δ_γ0}}.

For the homogeneous problem with g=0 g=0 in (119) we have the solution and data spaces

_X0=_L2(0,T;_H01(Ω))⋂C([0,T];_L2(Ω))⋂^H1(0,T;^H−1(Ω)),

_Y0=(f,_uT)∈_L2(0,T;^H−1(Ω))⊕_L2(Ω)|_uT−_yf∈D(^{e−T_{Δ_γ0}}).

Here, with _yf _yf as the usual integral (cf. (125) below), the data norm in (116) amounts to

∥(f,_uT)_{∥_Y02}=_∫0T∥f(t)_{∥^H−1(Ω)2}dt+_∫Ω(|_uT ^|2+|^{e−T_{Δ_γ0}}(_uT−_yf)^|2)dx.

From Theorems 7 and 8 we may now read off the following result, which is a novelty even though the problem is classical:

Theorem 9.

Let A=−_{Δ_γ0} A=−_{Δ_γ0} be the Dirichlet realization of the Laplacian in Ω and A=−Δ A=−Δ its extension, as introduced above. When g=0 g=0 in the final value problem (119) and f∈_L2(0,T;^H−1(Ω)) f∈_L2(0,T;^H−1(Ω)) , _uT∈_L2(Ω) _uT∈_L2(Ω) , then there exists a solution u in _X0 _X0 of (119) if and only if the data (f,_uT) (f,_uT) are given in _Y0 _Y0 , i.e., if and only if

_uT−_∫0T ^e−(T−s)Af(s)dsbelongstoD(^{e−T_{Δ_γ0}}).

In the affirmative case, such u are uniquely determined in _X0 _X0 and fulfil the estimate _{∥u∥_X0}≤c_{∥(f,_uT)∥_Y0} _{∥u∥_X0}≤c_{∥(f,_uT)∥_Y0} . Furthermore the difference in (125) equals ^eT_{Δ_γ0}u(0) ^eT_{Δ_γ0}u(0) in _L2(Ω) _L2(Ω) .

Remark 16.

For A=−_{Δ_γ0} A=−_{Δ_γ0} one has the equivalent norms in Facts 1, 2 and the characterisation of D(^{e−T_{Δ_γ0}}) D(^{e−T_{Δ_γ0}}) in Proposition 7. This is a classical consequence of the compact embedding of _H01(Ω) _H01(Ω) into _L2(Ω) _L2(Ω) for bounded sets Ω (e.g., ([20], Theorem 8.2)). Thus one obtains for f=0 f=0 , g=0 g=0 the situation described in the introduction, where the space of final data, normed by |||_uT||| |||_uT||| , via Proposition 7 is seen to be D(^{e−T_{Δ_γ0}}) D(^{e−T_{Δ_γ0}}) with equivalent norms. As the completed solution space E¯ E¯ in the introduction one may take the Banach space E¯=_X0 E¯=_X0 , cf. Theorem 9.

5.2. The Inhomogeneous Case

For non-zero data, i.e., when g≠0 g≠0 on S, cf. (119), one may of course try to reduce to an equivalent homogeneous problem by choosing a function w so that _γ0w=g _γ0w=gon the surface S. Here we recall the classical

Lemma 6.

_γ0:^H1(Q)→^H1/2(S) _γ0:^H1(Q)→^H1/2(S) is a continuous surjection having a bounded right inverse _K˜0 _K˜0 , so w=_K˜0g w=_K˜0g maps every g∈^H1/2(S) g∈^H1/2(S) to w∈^H1(Q) w∈^H1(Q) fulfilling _γ0w=g _γ0w=g and

_{∥w∥^H1(Q)}≤c_{∥g∥^H1/2(S)}.

Lacking a reference with details, we note that the lemma is well known for sets like Ω, hence for smooth open bounded sets _Ω1⊂^Rn+1 _Ω1⊂^Rn+1 with operators _{γ0,_Ω1} _{γ0,_Ω1} and _{K˜0,_Ω1} _{K˜0,_Ω1} ; cf. Theorem B.1.9 in [22] or Theorem 9.5 in [20] for the flat case. In particular, one can stretch Q to ]−2T,2T[×Ω ]−2T,2T[×Ω and attach rounded ends in a smooth way to obtain a set _Ω1⊂]−3T,3T[×Ω _Ω1⊂]−3T,3T[×Ω equal to Q for 0<t<T 0<t<T . Here ^H1(Q)=_rQ ^H1(_Ω1) ^H1(Q)=_rQ ^H1(_Ω1) is a classical result, when the latter space of restrictions to Q has the infimum norm. While ^Hs(∂_Ω1) ^Hs(∂_Ω1) is defined using local coordinates in a standard way, cf. ([20], Formula (8.10)), the Sobolev space ^Hs(S) ^Hs(S) on the surface S can be defined as the set of restrictions _rS ^Hs(∂_Ω1) _rS ^Hs(∂_Ω1) . When _rSg˜=g _rSg˜=g , then _K˜0g=_rQ _{K˜0,_Ω1}g˜ _K˜0g=_rQ _{K˜0,_Ω1}g˜ defines the desired operator _K˜0 _K˜0 , as _{γ0,_Ω1} _{γ0,_Ω1} acts as _γ0 _γ0in Q.

Remark 17.

The norm in ^Hs(S) ^Hs(S) can be chosen so that this is a Hilbert space; cf. ([20], Formula (8.10)). However, Sobolev spaces on smooth surfaces is a vast subject, requiring so-called distribution densities as explained in ([22], Section 6.3). We refer the reader to ([20], Section 8.2) for a short introduction to this subject; as there, we prefer a more intuitive approach (exploiting the surface measure on _Ω1 _Ω1 ) but skip details. A systematic exposition of this framework can be found in ([27], Section 4), albeit in a general _Lp _Lp -setting with mixed-norms leading to anisotropic Triebel–Lizorkin spaces _{Fp→,qs,a→}(S) _{Fp→,qs,a→}(S) on the curved boundary, which in general are the correct boundary data spaces for parabolic problems with different integrability properties in space and time, as noted in [28]; cf. the discussion of the heat equation in ([27], Section 6.5) and the more detailed account in ([29], Chapter 7).

However, when splitting the solution of (119) as u=v+w u=v+w for w as in Lemma 6, then v should satisfy (119) with data (f˜,0,_u˜T) (f˜,0,_u˜T),

f˜=f−(_∂tw−Δw),_u˜T=_uT−_rTw.

At first glance one might therefore think that w is inconsequential for the compatibility condition (125), for _u˜T−_yf˜ _u˜T−_yf˜ there equals the usual term _uT−_yf _uT−_yf minus _rTw−_{y_∂tw−Δw} _rTw−_{y_∂tw−Δw} , where the latter seemingly belongs to D(^{e−T_{Δ_γ0}}) D(^{e−T_{Δ_γ0}}) as the pair (_∂tw−Δw,_rTw) (_∂tw−Δw,_rTw) could seem to be a vector in the range of the operator _P−Δ _P−Δin Remark 15.

But obviously this is not the case, because the function w is outside the domain _X0 _X0 of _P−Δ _P−Δ . Indeed, w∈_L2(0,T;^H1(Ω)) w∈_L2(0,T;^H1(Ω)) and has _γ0w=g¬≡0 _γ0w=g¬≡0 in the non-homogeneous case, whence w∉_L2(0,T;_H01(Ω)) w∉_L2(0,T;_H01(Ω)) . So one might think it would be necessary to discuss homogeneous problems with larger solution spaces _X˜0 _X˜0 than _X0 _X0.

We propose to circumvent these difficulties by applying Lemma 6 to the corresponding linear initial value problem instead, since in the present spaces of low regularity there is no compatibility condition needed for this:

_∂tu−Δu=finQ,_γ0u=gonS,_r0u=_u0at{0}×Ω.

More precisely, we shall analogously to Section 4 obtain a bijection u(0)↔u(T) u(0)↔u(T) between initial and final states by establishing a solution formula as in Theorem 5. (For general background material on (128) the reader could consult Section III.6 in [30], and for the fine theory including compatibility conditions we refer to [15].)

Analogously to Theorem 4 and Corollary 1, we depart from well-posedness of (128). This is well known per se, but we need to briefly review the explanation in order to account later for the decisive existence of an improper integral showing up when g≠0 g≠0 in (119).

Since the solutions now take values in the full space ^H1(Ω) ^H1(Ω) , we shall in this section denote the solution space by _X1 _X1. It is given by

_X1=_L2(0,T;^H1(Ω))⋂C([0,T];_L2(Ω))⋂^H1(0,T;^H−1(Ω)),

and _X1 _X1 is a Banach space when normed analogously to (61),

_{∥u∥_X1}=_{(∥u∥_L2(0,T;^H1(Ω))2}+sup0≤t≤T_{∥u(t)∥_L2(Ω)2}+_{∥u∥^H1(0,T;^H−1(Ω))2} ^)1/2.

As ^H1 ^H1 , ^H−1 ^H−1 are not dual on Ω, the redundancy in Remark 5 does not extend to the term _sup[0,T] _{∥u∥_L2} _sup[0,T] _{∥u∥_L2}above.

Proposition 8.

The heat initial value problem (128) has a unique solution u∈_X1 u∈_X1 for given data f∈_L2(0,T;^H−1(Ω)) f∈_L2(0,T;^H−1(Ω)) , g∈^H1/2(S) g∈^H1/2(S) , _u0∈_L2(Ω) _u0∈_L2(Ω) , and there is an estimate

_{∥u∥_X12}≤c(∥_u0 _{∥_L2(Ω)2}+_{∥f∥_L2(0,T;^H−1(Ω))2}+_{∥g∥^H1/2(S)2}).

Proof.

With w=_K˜0g w=_K˜0g as in Lemma 6, we write u=v+w u=v+w for some v∈_X1 v∈_X1 solving (128) for data

f˜=f−(_∂t−Δ)w,g˜=0,_u˜0=_u0−w(0).

Here w(0) w(0) is well defined, as w∈^H1(Q) w∈^H1(Q) implies w∈C([0,T];_L2(Ω)) w∈C([0,T];_L2(Ω)) , by an application of Lemma 1. That w even is in _X1 _X1 results from the easy estimates, where I=]0,T[ I=]0,T[,

∥^w′ _{∥_L2(I;^H−1)2}+_{∥Δw∥_L2(I;^H−1)2}≤_{∥w∥^H1(I;_L2)2}+_{c∥w∥_L2(I;^H1)2}≤c_{∥w∥^H1(Q)2}.

This moreover yields that f˜∈_L2(0,T;^H−1(Ω)) f˜∈_L2(0,T;^H−1(Ω)) , and _u˜0∈_L2(Ω) _u˜0∈_L2(Ω) , so by Theorem 4, the boundary homogeneous problem for v has a solution in _X0 _X0 ; cf. (122). Hence (128) has the solution u=v+w u=v+w in _X1 _X1; and by linearity this is unique in view of Theorem 4.

Inspecting the above arguments, we first note that by (57),

sup0≤t≤T_{∥w(t)∥_L2(Ω)}≤_{c(∥w∥_L2(0,T;_L2(Ω))}+∥_∂t _{w∥_L2(0,T;_L2(Ω))} _{)≤c∥w∥^H1(Q)2},

so the estimate (133) can be sharpened to _{∥w∥_X12}≤c_{∥w∥^H1(Q)2} _{∥w∥_X12}≤c_{∥w∥^H1(Q)2}. Now Corollary 1 gives

_{∥u∥_X12}≤_{2(∥v∥_X02}+_{∥w∥_X12})≤c(∥_u˜0 _{∥_L2(Ω)2}+∥f˜_{∥_L2(0,T;^H−1(Ω))2}+_{∥w∥_X12})≤c(∥_u0 _{∥_L2(Ω)2}+∥f_{∥_L2(0,T;^H−1(Ω))2}+∥(_∂t−Δ)w_{∥_L2(0,T;^H−1)2}+∥w_∥^H1(Q)2)

which via (133) and (126) entails the stated estimate (131). ☐

As a crucial addendum, we may apply Theorem 5 directly to the function v constructed during the above proof and then substitute v=u−w v=u−wto derive that

u(t)=w(t)+^et_{Δ_γ0}(_u0−w(0))+_∫0t ^e−(t−s)A(f−(_∂s−Δ)w)ds.

This formula for the u solving the inhomogeneous final value problem applies especially for t=T t=T , but we shall keep t in [0,T] [0,T]to deduce a formula for its solution.

Our strategy in the following will be to simplify the contributions from w, and ultimately to reintroduce the boundary data g instead of w. To do so, we apply the Leibniz rule in Proposition 3 to our function w in ^H1(0,t;_L2(Ω)) ^H1(0,t;_L2(Ω))and get

_∂s(^{e(t−s)_{Δ_γ0}}w(s))=^{e(t−s)_{Δ_γ0}} _∂sw(s)−_{Δ_γ0} ^{e(t−s)_{Δ_γ0}}w(s).

As the first inconvenience, _{Δ_γ0} _{Δ_γ0} does not commute with the semigroup, since w as an element of ^H1\_H01 ^H1\_H01 belongs to neither the domain of the realization −_{Δ_γ0} −_{Δ_γ0} , nor to that of A A.

Secondly, the right-hand side is only integrable on [0,t−ε] [0,t−ε] for ε>0 ε>0 , as the last term has a singularity at s=t s=t; cf. Theorem 3. As a remedy, we may use the improper Bochner integral

_⨍0t _{Δ_γ0} ^{e(t−s)_{Δ_γ0}}w(s)ds=limε→0_∫0t−ε _{Δ_γ0} ^{e(t−s)_{Δ_γ0}}w(s)ds.

Lemma 7.

For every w∈^H1(Q) w∈^H1(Q) the limit (138) exists in _L2(Ω) _L2(Ω) and

w(t)−^et_{Δ_γ0}w(0)=_∫0t ^{e(t−s)_{Δ_γ0}} _∂sw(s)ds−_⨍0t _{Δ_γ0} ^{e(t−s)_{Δ_γ0}}w(s))ds.

Proof.

As ^et_{Δ_γ0} ^et_{Δ_γ0} is uniformly bounded according to Theorem 3 and w∈C([0,T],_L2(Ω)) w∈C([0,T],_L2(Ω)) was seen in the above proof, bilinearity gives that in _L2(Ω) _L2(Ω),

^{e(t−(t−ε))_{Δ_γ0}}w(t−ε)→w(t)forε→0.

Moreover, integration of both sides in (137) gives, cf. Lemma 1,

_{[^{e(t−s)_{Δ_γ0}}w(s)]s=0s=t−ε}=_∫0t−ε(−_{Δ_γ0})^{e(t−s)_{Δ_γ0}}w(s)ds+_∫0t−ε ^{e(t−s)_{Δ_γ0}} _∂sw(s)ds.

The left-hand side converges by (140), and by dominated convergence the rightmost term does so for ε→⁰⁺ ε→⁰⁺ (through an arbitrary sequence), so also _∫0t−ε _{Δ_γ0} ^{e(t−s)_{Δ_γ0}}w(s)ds _∫0t−ε _{Δ_γ0} ^{e(t−s)_{Δ_γ0}}w(s)ds converges in _L2(Ω) _L2(Ω) as claimed. Then (139) is the resulting identity among the limits. ☐

Identity (139) from the lemma applies directly in the solution formula (136), and because terms with _∂sw _∂swcancel, one obtains

u(t)=^et_{Δ_γ0} _u0+_∫0t ^e−(t−s)Afds+_∫0t ^e−(t−s)AΔwds−_⨍0t _{Δ_γ0} ^{e(t−s)_{Δ_γ0}}wds.

We shall reduce the difference of the last two integrals in order to reintroduce the boundary data g instead of w.

First we use that Δ=A^A−1Δ Δ=A^A−1Δ on ^H1(Ω) ^H1(Ω)and write both terms as improper integrals,

−_⨍0tA^e−(t−s)A(I−^A−1Δ)w(s)ds.

Here Q=I−^A−1Δ Q=I−^A−1Δis a well-known projection from the fine elliptic theory of the problem

−Δu=f,_γ0u=g.

Indeed, if this is treated via the matrix operator −Δ_γ0 −Δ_γ0 , which has an inverse in row form −^A−1_K0 −^A−1_K0 that applies to the data fg fg , the basic composites appear in the two operator identities on ^H1(Ω) ^H1(Ω) and ^H−1(Ω)⊕^H1/2(Γ) ^H−1(Ω)⊕^H1/2(Γ)respectively,

I=−^A−1_K0−Δ_γ0=^A−1Δ+_K0 _γ0,

I00I=−Δ_γ0−^A−1_K0=Δ^A−1Δ_K0−_γ0 ^A−1_γ0 _K0.

Thus we get from the first formula that Q=I−^A−1Δ=_K0 _γ0 Q=I−^A−1Δ=_K0 _γ0 on ^H1(Ω) ^H1(Ω).

However, before we implement this, we emphasize that the simplicity of the Formulas (145) and (146) relies on a specific choice of _K0 _K0explained in the following:

As A=Δ_{|_H01} A=Δ_{|_H01} holds in the distribution sense, P:=^A−1Δ P:=^A−1Δ clearly fulfils ^P2=P ^P2=P , is bounded ^H1→_H01 ^H1→_H01 and equals I on _H01 _H01 , so P is the projection onto _H01(Ω) _H01(Ω) along its null space, which evidently is the closed subspace of harmonic ^H1 ^H1-functions, namely

Z(−Δ)={u∈^H1(Ω)∣−Δu=0}.

Therefore ^H1 ^H1is a direct sum,

^H1(Ω)=_H01(Ω)∔Z(−Δ).

We also let Q=I−P Q=I−P denote the projection on Z(−Δ) Z(−Δ) along _H01(Ω) _H01(Ω), as from the context it can be distinguished from the time cylinder (also denoted by Q).

Since _γ0:^H1(Ω)→^H1/2(Γ) _γ0:^H1(Ω)→^H1/2(Γ) is surjective with _H01 _H01 as the null-space, it has an inverse _K0 _K0 on the complement Z(−Δ) Z(−Δ), which by the open mapping principle is bounded

_K0:^H1/2(Γ)→Z(−Δ).

Hence _K0:^H1/2(Γ)→^H1(Ω) _K0:^H1/2(Γ)→^H1(Ω) is a bounded right-inverse, i.e., _γ0 _K0=_I^H1/2(Γ) _γ0 _K0=_I^H1/2(Γ) . The rest of (146) follows at once. Moreover, since _γ0P=0 _γ0P=0,

_K0 _γ0=_K0 _γ0(P+Q)=_K0 _γ0Q=_IZ(−Δ)Q=Q,

which by definition of Q and P gives (145). (_K0 _K0 is known as a Poisson operator; these are amply discussed within the pseudo-differential boundary operator calculus in [31].)

Using this set-up we obtain:

Proposition 9.

If u denotes the unique solution to the initial boundary value problem (128) provided by Proposition 8, then u fulfils the identity

u(t)=^et_{Δ_γ0} _u0+_∫0t ^e−(t−s)Af(s)ds−_⨍0tA^{e(t−s)_{Δ_γ0}} _K0g(s)ds,

where the improper integral converges in _L2(Ω) _L2(Ω) for every t∈[0,T] t∈[0,T] .

Proof.

Because of (150) we may write (I−^A−1Δ)w=Qw=_K0 _γ0w=_K0g (I−^A−1Δ)w=Qw=_K0 _γ0w=_K0g when _γ0w=g _γ0w=g , and when this is applied in (143), the solution formula (142) simplifies to (151). ☐

For t=T t=T the second term in (151) gives back _yf=_∫0T ^e−(T−s)Af(s)ds _yf=_∫0T ^e−(T−s)Af(s)ds from Section 4. However, the full influence on u(T) u(T)from the boundary data g is collected in the third term as

_zg=_⨍0TA^{e(T−s)_{Δ_γ0}} _K0g(s)ds.

That the map g↦_zg g↦_zg is well defined is clear by taking t=T t=T in Proposition 9; this is a non-trivial result. The map is linear by the calculus of limits. In case f=0 f=0 , _u0=0 _u0=0 it is seen from (151) that _zg=u(T) _zg=u(T) , so obviously ∥_zg _{∥_L2(Ω)}≤_supt _{∥u(t)∥_L2(Ω)} ∥_zg _{∥_L2(Ω)}≤_supt _{∥u(t)∥_L2(Ω)} , which in turn is estimated by _{c∥g∥^H1/2(S)} _{c∥g∥^H1/2(S)}using Proposition 8. This proves

Lemma 8.

The linear operator g↦_zg g↦_zg is bounded ^H1/2(S)→_L2(Ω) ^H1/2(S)→_L2(Ω) .

Finally, from Proposition 9, we conclude for an arbitrary solution in _X1 _X1 of the heat equation ^u′−Δu=f ^u′−Δu=f with _γ0u=g _γ0u=gon S that

u(T)=^eT_{Δ_γ0}u(0)+_yf−_zg.

Therefore we also here have a bijection u(0)↔u(T) u(0)↔u(T) , for the above breaks down to application of the bijection ^eT_{Δ_γ0} ^eT_{Δ_γ0} , cf. Proposition 1, and a translation in _L2(Ω) _L2(Ω) by the fixed vector _yf−_zg _yf−_zg.

We are now ready to obtain the unique solvability of the inhomogeneous final value problem (119). Our result for this is similar to the abstract Theorem 7 (as is its proof), except for the important clarification that the boundary data g do appear in the compatibility condition, but only via the term _zg _zg:

Theorem 10.

For given data f∈_L2(0,T;^H−1(Ω)) f∈_L2(0,T;^H−1(Ω)) , g∈^H1/2(S) g∈^H1/2(S) , _uT∈_L2(Ω) _uT∈_L2(Ω) the final value problem (119) is solved by a function u∈_X1 u∈_X1 , whereby

_X1=_L2(0,T;^H1(Ω))⋂C([0,T];_L2(Ω))⋂^H1(0,T;^H−1(Ω)),

if and only if the data in terms of (92) and (152) satisfy the compatibility condition

_uT−_yf+_zg∈D(^{e−T_{Δ_γ0}}).

In the affirmative case, u is uniquely determined in _X1 _X1 and has the representation

u(t)=^et_{Δ_γ0} ^{e−T_{Δ_γ0}}(_uT−_yf+_zg)+_∫0t ^e(t−s)Δf(s)ds−_⨍0tΔ^{e(t−s)_{Δ_γ0}} _K0g(s)ds,

where the three terms all belong to _X1 _X1 as functions of t.

Proof.

Given a solution u∈_X1 u∈_X1 , the bijective correspondence yields _uT=^eT_{Δ_γ0}u(0)+_yf−_zg _uT=^eT_{Δ_γ0}u(0)+_yf−_zg , so that (155) necessarily holds. Inserting its inversion u(0)=^{e−T_{Δ_γ0}}(_uT−_yf+_zg) u(0)=^{e−T_{Δ_γ0}}(_uT−_yf+_zg) into the solution formula from Proposition 9 yields (156); thence uniqueness of u.

If (155) does hold, _u0=^{e−T_{Δ_γ0}}(_uT−_yf+_zg) _u0=^{e−T_{Δ_γ0}}(_uT−_yf+_zg) is a vector in _L2(Ω) _L2(Ω) , so the initial value problem with data (f,g,_u0) (f,g,_u0) can be solved by means of Proposition 8. Then one obtains a function u∈_X1 u∈_X1 that also solves the final value problem (119), since in particular u(T)=_uT u(T)=_uT is satisfied, cf. the bijection (153) and the definition of _u0 _u0.

The final regularity statement follows from the fact that _X1 _X1 also is the solution space for the initial value problem in Proposition 8. Indeed, even the improper integral is a solution in _X1 _X1 to (128) with data (f,g,_u0)=(0,g,0) (f,g,_u0)=(0,g,0) , according to Proposition 9; cf. the proof of Lemma 8. Similarly the integral containing f solves an initial value problem with data (f,0,0) (f,0,0) , hence is in _X1 _X1 . In addition, the first term in (156) is the solution of (128) for data (0,0,^{e−T_{Δ_γ0}}(_uT−_yf+_zg)) (0,0,^{e−T_{Δ_γ0}}(_uT−_yf+_zg)).

We let _Y1 _Y1 stand for the set of admissible data. Within _L2(0,T;^H−1(Ω))⊕^H1/2(Γ)⊕_L2(Ω) _L2(0,T;^H−1(Ω))⊕^H1/2(Γ)⊕_L2(Ω) it is the subspace given, via the map _Φ1(f,g,_uT)=_uT−_yf+_zg _Φ1(f,g,_uT)=_uT−_yf+_zg, as

_Y1=(f,g,_uT)|_uT−_yf+_zg∈D(^{e−T_{Δ_γ0}})=D(^{e−T_{Δ_γ0}} _Φ1).

Correspondingly we endow _Y1 _Y1 with the graph norm of the operator ^{e−T_{Δ_γ0}} _Φ1 ^{e−T_{Δ_γ0}} _Φ1 , that is, of the composite map (f,g,_uT)↦^{e−T_{Δ_γ0}}(_uT−_yf+_zg) (f,g,_uT)↦^{e−T_{Δ_γ0}}(_uT−_yf+_zg) . Again, ^e−T_ΔD _Φ1(f,g,_uT) ^e−T_ΔD _Φ1(f,g,_uT) equals the initial state u(0) u(0) steered by f, g to the final state u(T)=_uT u(T)=_uT , as is evident for t=0 t=0 in (156).

Recalling that A=−Δ:_H01(Ω)→^H−1(Ω) A=−Δ:_H01(Ω)→^H−1(Ω), the above-mentioned graph norm is given by

∥(f,g,_uT)_{∥_Y12}=∥_uT _{∥_L2(Ω)2}+_{∥g∥^H1/2(Q)2}+_{∥f∥_L2(0,T;^H−1(Ω))2}+_∫Ω|^{e−T_{Δ_γ0}}_uT−_∫0T^e−(T−s)Af(s)ds+_⨍0TA^{e(T−s)_{Δ_γ0}} _K0g(s)ds^|2dx.

Here the last term is written with explicit integrals to emphasize the complexity of the fully inhomogeneous boundary and final value problem (119).

Completeness of _Y1 _Y1 follows from continuity of _Φ1 _Φ1 , cf. Lemma 8 concerning _zg _zg . Indeed, its composition to the left with the closed operator ^{e−T_{Δ_γ0}} ^{e−T_{Δ_γ0}} in _L2(Ω) _L2(Ω) (cf. Proposition 2) is also closed. Hence its domain D(^{e−T_{Δ_γ0}} _Φ1)=_Y1 D(^{e−T_{Δ_γ0}} _Φ1)=_Y1 is complete with respect to the graph norm in (158). As this norm is induced by an inner product when the norm of ^H−1(Ω) ^H−1(Ω) is taken as |||·_|||∗ |||·_|||∗ from (26), and when ^H1/2(Q) ^H1/2(Q) is normed as in Remark 17, _Y1 _Y1is a Hilbert(-able) space.

Analogously to the proof of Theorem 8, continuity of (f,g,_uT)↦u (f,g,_uT)↦u is now seen at once by inserting the expression _u0=^{e−T_{Δ_γ0}}(_uT−_yf+_zg) _u0=^{e−T_{Δ_γ0}}(_uT−_yf+_zg) from (153) into the estimate in Proposition 8. Thus we obtain:

Corollary 3.

The unique solution u of problem (119) lying in the Banach space _X1 _X1 depends continuously on the data (f,g,_uT) (f,g,_uT) in the Hilbert space _Y1 _Y1 , when these are given the norms in (130) and (158), respectively.

Taken together, Theorem 10 and Corollary 3 yield that the fully inhomogeneous final value problem (119) for the heat equation is well posed in the spaces _X1 _X1 and _Y1 _Y1.

6. Final Remarks 6.1. Applicability

For the special features of final value problems for Lax–Milgram operators A, it is of course decisive to have a proper subspace D(^eTA)⊊H D(^eTA)⊊H , for if D(^eTA) D(^eTA) fills H the compatibility condition (113) will be redundant—and (113) moreover only becomes stronger as the terminal time T increases, if D(^eTA) D(^eTA)decreases with larger T.

Within semigroup theory on a Banach space B, the above means that the ranges R(^etA) R(^etA) should form a strictly descending chain of inclusions in the sense that, for ^t′>t>0 ^t′>t>0,

R(^{e^t′A})⊊R(^etA)⊊B.

Non-strictness is here characterised by the rather special spectral properties of A Ain (iv):

Theorem 11.

For a _C0 _C0 -semigroup ^etA ^etA with ∥^etA∥≤M^eωt ∥^etA∥≤M^eωt the following are equivalent:

(i) ^etA ^etA is injective and R(^{e^t′A})=R(^etA) R(^{e^t′A})=R(^etA) holds for some t,^t′ t,^t′ with ^t′>t≥0 ^t′>t≥0 .

(ii) ^etA ^etA is injective with range R(^etA)=B R(^etA)=B for every t≥0 t≥0 .

(iii) The semigroup is embedded into a _C0 _C0 -group G(t) G(t) satisfying ∥G(t)∥≤M^eω|t| ∥G(t)∥≤M^eω|t| ;

(iv) The spectrum σ(A) σ(A) is contained in the strip in where −ω≤Reλ≤ω −ω≤Reλ≤ω and

^{∥(A−λ)−n} ^{∥≤M(|Reλ|−ω)−n}for|Reλ|>ω,n∈N.

Proof.

Given (i) for t>0 t>0 , then R(^e(t+δ)A)=R(^etA) R(^e(t+δ)A)=R(^etA) holds for all δ∈[0,^t′−t] δ∈[0,^t′−t] in view of the inclusions (33); and to every x∈B x∈B some y satisfies ^etA ^eδAy=^etAx ^etA ^eδAy=^etAx , which by injectivity gives x=^eδAy x=^eδAy , so that ^eδA ^eδA is surjective for such δ. Hence ^etA=^{(^e(t/N)A)N} ^etA=^{(^e(t/N)A)N} is a bijection on B with bounded inverse, i.e., 0∈ρ(^etA) 0∈ρ(^etA) . If (i) holds for t=0 t=0 , clearly 0∈ρ(^{e^t′A}) 0∈ρ(^{e^t′A}) . In both cases (ii) holds because 0∈ρ(^esA) 0∈ρ(^esA) must necessarily hold for s>0 s>0 according to ([23], Theorem 1.6.5), which also states that (iii) holds. (The proof there uses ([23], Lem. 1.6.4) that can be invoked directly from (ii) since the inverse of ^etA ^etA is bounded by the Closed Graph Theorem.) Conversely (iii) yields R(^etA)=R(G(t))=B R(^etA)=R(G(t))=B and injectivity for all t≥0 t≥0 , so (ii) and hence (i) holds. That (iv)⇒(iii) is part of the content of ([23], Theorem 1.6.3), which also states that (iii) implies (iv) for real λ, but the full statement in (iv) is then obtained from ([23], Remark 1.5.4). ☐

This result is essentially known, but nonetheless given as a theorem, as it clarifies how widely the present paper applies. Indeed, for V-elliptic Lax–Milgram operators A, the semigroups are uniformly bounded, so ω=0 ω=0 ; thus the strip in (iv) is the imaginary axis iR iR , but this is contained in ρ(A) ρ(A) by Lemma 4. So except in the pathological case σ(A)=∅ σ(A)=∅ , (iv) will always be violated, as will (i) and (ii). However, since in (i) and (ii) the operator ^e−tA ^e−tA is injective by Proposition 1, the strict inclusions in (159) hold for A=−A A=−A. This proves:

Proposition 10.

For a V-elliptic Lax–Milgram operator A with σ(A)≠∅ σ(A)≠∅ there is a strictly descending chain of dense domains D(^etA) D(^etA) of the inverses ^etA=^{(^e−tA)−1} ^etA=^{(^e−tA)−1} , i.e.,

D(^{e^t′A})⊊D(^etA)⊊Hfor^t′>t>0.

Therefore, for elliptic Lax–Milgram operators A with non-empty spectrum, the compatibility condition (113) is without redundancy, and it gets effectively stronger on longer time intervals. Previously, these properties were verified only in a special case in Proposition 7.

Example 1.

It is illuminating to consider the final value problem on ^Rn ^Rn , for α∈C\R α∈C\R ,

_∂tu−Δu+α_x1u=f,u(T)=_uT.

At first glance this might seem to be a minor variation on the heat problem in Section 5, in fact just a zero-order perturbation; and notably a change to Ω=^Rn Ω=^Rn . However, interestingly it cannot be treated within the present framework: in a paper fundamental to analysis of the Stark effect, Herbst [32] proved for the operator h(α)=−Δ+α_x1I h(α)=−Δ+α_x1I with Imα≠0 Imα≠0 that the minimal realisation h¯(α) h¯(α) also is maximal in _L2(^Rn) _L2(^Rn) with empty spectrum,

σ(h¯(α))=∅.

Moreover, the numerical range of h(α) h(α) itself is an open, slanted halfplane

ν(h(α))={z∈∣Rez>ReαImαImz}.

Therefore h¯(α) h¯(α) is not sectorial, as ν(h¯(α))⊂ν(h(α))¯ ν(h¯(α))⊂ν(h(α))¯ shows that (28) does not hold, so existence and uniqueness for the forward problem cannot be derived from Theorem 4. The fact proved in [32] that ^{e−ith¯(α)/α} ^{e−ith¯(α)/α} is a contraction semigroup, which for α=i α=i applies to ^e−th¯(i) ^e−th¯(i) that pertains to (162), entails via the Hille–Yosida theorem the estimate in Theorem 11 (iv) for −h¯(i) −h¯(i) , but only for Reλ>0 Reλ>0 . Since Reλ<0 Reλ<0 is not covered, it is despite the empty spectrum of A=−h¯(i)=Δ−i_x1 A=−h¯(i)=Δ−i_x1 not clear whether (159) holds with strict inclusions. Thus it seems open which properties the final value problem (162) for the Herbst operator h(α) h(α) can be shown to have.

Remark 18.

Recently Grebenkov, Helffer and Henry [14] studied the complex Airy operator A=−Δ+i_x1 A=−Δ+i_x1 in dimension n=1 n=1 . They considered realizations defined on _R+ _R+ by Dirichlet, Neumann and Robin conditions using the Lax–Milgram lemma, so results on boundary homogenous final value problems for −^d2d^x2+ix −^d2d^x2+ix should be straightforward to write down, as in Section 5.1. The study was extended to dimension n=2 n=2 , under the name of the Bloch–Torrey operator, by Grebenkov and Helffer in [13], where bounded and unbounded domains with ^C∞ ^C∞ boundary was treated; in cases with non-empty spectrum there should be easy consequences for the associated final value problems. The realisations induced by a transmission condition at an interface, which was the main theme in [13,14], are defined from a recent extension of the Lax–Milgram lemma due to Almog and Helffer [12], so in this case the properties of the corresponding final value problems are as yet unclear.

Remark 19.

We expect that extension of the theory to certain systems of parabolic equations with prescribed boundary and final value data should be possible. A useful framework for the discussion of this type of problems could be the pseudo-differential boundary operator calculus, with matrix-formed operators acting in Sobolev spaces of sections of vector bundles, as described in Section 4.1 of [31]. At least the present discussion should carry over to this kind of problems when the realisations called _(P+G)T _(P+G)T there are variational, i.e., when they are Lax–Milgram operators for certain triples (H,V,a) (H,V,a) ; this property is analysed in great depth in Section 1.7 of [31], to which we refer the interested reader. It is conceivable that the variational property is unnecessary, and might be avoided using the pseudo-differential boundary operator calculus, but this seems to require an addition to the theory of parabolic systems covered by the calculus in the form of a result on backward uniqueness.

6.2. Notes

Classical considerations were collected by Liebermann [33] for second order parabolic differential operators (cf. also Evans [34]), with references back to the fundamental _L2 _L2 -theory including boundary points of Ladyshenskaya, Solonnikov and Uraltseva [35]. A fundamental framework of functional analysis for parabolic Cauchy problems was developed by Lions and Magenes [8]. Later a full regularity theory in scales of anisotropic _L2 _L2 -Sobolev spaces was worked out for general pseudo-differential parabolic problems by Grubb and Solonnikov [15], who obtained the necessary and sufficient compatibility conditions on the data, including coincidence for half-integer values of the smoothness; cf. also ([31], Theorem 4.1.2). This study was carried over to the corresponding anisotropic _Lp _Lp -Sobolev spaces by Grubb [36]. A further extension to different integrability properties in time and space was taken up in a systematic study of anisotropic mixed-norm Triebel–Lizorkin spaces on a time cylinder and its flat and curved boundaries by Munch Hansen, the second author and Sickel [27]. Compatibility conditions were addressed for the heat equation in mixed-norm Triebel–Lizorkin spaces in ([27], Section 6.5) and ([29], Chapter 7). In particular, the latter showed that, except for coincidence at half integer smoothness, the recursive formulation of the compatibility conditions in [15] is equivalent to the requirement that the data belong to the null space of a certain matrix-formed operator at the curved corner {0}×∂Ω {0}×∂Ω . Recent semigroup and Laplace transformation methods were exposed in [30]. Denk and Kaip [37] treated parabolic multi-order systems via the Newton polygon and obtained _Lp _Lp –_Lq _Lq regularity results using R R-boundedness.

To our knowledge, the literature contains no previous account for pairs of spaces X and Y in which final value problems for parabolic differential equations are well posed.

An early contribution on final value problems for the heat equation was given in 1955 by John [3], who dealt with numerical aspects. In 1961, the idea of reducing the data space to obtain well-posedness was adopted by Miranker [4] for the homogeneous heat equation on R R , and he showed that in the space of _L2 _L2-functions having compactly supported Fourier transform there is a bijection between the initial and terminal states.

In addition to the injectivity of analytic semigroups in Proposition 1, it is known that u(0) u(0) is uniquely determined from u(T) u(T) even for t-dependent sesquilinear forms a(t;v,w) a(t;v,w) . This was shown by Lions and Malgrange [16] with an involved argument. It would take us too far to quote the large amount of work on the backward uniqueness in more loosely connected situations, often adopting the log-convexity method (if |u(t)|≤^|u(T)|t/T ^{|u(0)|1−t/T} |u(t)|≤^|u(T)|t/T ^{|u(0)|1−t/T} then u(T)=0 u(T)=0 implies u(t)=0 u(t)=0 for all t>0 t>0 , hence u(0)=0 u(0)=0 by continuity) attributed to Krein, Agmon and Nirenberg. Instead we refer the reader to [38,39,40] and the references therein.

The method of quasi-reversibility for final value problems was introduced systematically in 1967 by Lattès and Lions [41]. The idea is to perturb the equation ^u′+Au=0 ^u′+Au=0 by adding, e.g., −^ε2 ^A2 −^ε2 ^A2 to obtain a well-posed problem and to derive for its solution _uε _uε that _uε(x,T) _uε(x,T) approaches _uT _uT for ε→0 ε→0 , circumventing analysis of well-posedness of the original final value problem. They assumed f=0 f=0for a V-elliptic self-adjoint A.

Showalter [17] addressed questions that were partly similar to ours. He proposed to perturb instead by εA_∂t εA_∂t under the condition that A is m-accretive with semiangle θ≤π/4 θ≤π/4 on a Hilbert space for f=0 f=0 . He claimed uniqueness of solutions, and existence if and only if the final data via the Yosida approximations of −A −A allow approximation of the initial state. Showalter also identified injectivity of operators in analytic semigroups as an important tool. However, his reduction had certain shortcomings; cf. Remark 1. In comparison we obtain the full well-posedness for general f≠0 f≠0 and V-elliptic operators of semiangle θ=arccot(_C3 _C4−1) θ=arccot(_C3 _C4−1) belonging to the larger interval ]0,π/2[ ]0,π/2[.

An extensive account of the area around 1975, and of the many previous contributions using a variety of techniques, was provided by Payne [5]. A more recent exposition can be found in Chapters 2 and 3 in Isakov’s book [6], and for methods for inverse problems in general the reader may consult Kirsch [42].

In the closely related area of exact and null controllability of parabolic problems, the inequality in Corollary 2 is a little weaker than the observability inequality for the full subdomain O=Ω O=Ω . In this context, the role of observability was reviewed by Fernandez-Cara and Guerrero [43], emphasising Carleman estimates as a powerful tool in the area. A treatise on Carleman estimates in the parabolic context was given by Koch and Tataru [44].

Author Contributions

Conceptualization, A.-E.C. and J.J.; Method, J.J.; Investigation, A.-E.C. and J.J.; Formal Analysis, A.-E.C.; References, A.-E.C. and J.J.; Original Preparation in Ph.D-Thesis, A.-E.C.; Supervision, J.J.; Editing, J.J. (who also added Section 3.4 and Section 6).

Acknowledgments

Jon Johnsen was supported by the Danish Research Council, Natural Sciences grant no. 4181-00042. The authors thank H. Amann for his interest and comments on the literature. Also our thanks are due to an anonymous reviewer for indicating the concise proof of Corollary 2.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Christensen, A.-E.; Johnsen, J. On parabolic final value problems and well-posedness. C. R. Acad. Sci. Paris Ser. I 2018, 356, 301–305.

2. Courant, R.; Hilbert, D. Methods of Mathematical Physics; Interscience Publishers, Inc.: New York, NY, USA, 1953.

3. John, F. Numerical solution of the equation of heat conduction for preceding times. Ann. Mat. Pura Appl. 1955, 40, 129–142.

4. Miranker, W.L. A well posed problem for the backward heat equation. Proc. Am. Math. Soc. 1961, 12, 243–247.

5. Payne, L.E. Improperly Posed Problems in Partial Differential Equations; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1975.

6. Isakov, V. Inverse Problems for Partial Differential Equations; Applied Mathematical Sciences; Springer: New York, NY, USA, 1998; Volume 127.

7. Günter, N.M. Potential Theory and Its Applications to Basic Problems of Mathematical Physics; Frederick Ungar Publishing Co.: New York, NY, USA, 1967.

8. Lions, J.-L.; Magenes, E. Non-Homogeneous Boundary Value Problems And Applications; Springer: New York, NY, USA; Heidelberg, Germany, 1972.

9. Tanabe, H. Equations of Evolution; Monographs and Studies in Mathematics; Pitman: Boston, MA, USA, 1979.

10. Temam, R. Navier—Stokes Equations, Theory and Numerical Analysis, 3rd ed.; Elsevier Science Publishers B.V.: Amsterdam, The Netherlands, 1984.

11. Amann, H. Linear and Quasilinear Parabolic Problems; Monographs in Mathematics; Birkhäuser, Inc.: Boston, MA, USA, 1995; Volume 89.

12. Almog, Y.; Helffer, B. On the Spectrum of Non-Selfadjoint Schrödinger Operators with Compact Resolvent. Commun. Partial Differ. Equ. 2015, 40, 1441–1466.

13. Grebenkov, D.S.; Helffer, B. On Spectral Properties of the Bloch—Torrey Operator in Two Dimensions. SIAM J. Math. Anal. 2018, 50, 622–676.

14. Grebenkov, D.S.; Helffer, B.; Henry, R. The complex Airy operator on the line with a semipermeable barrier. SIAM J. Math. Anal. 2017, 49, 1844–1894.

15. Grubb, G.; Solonnikov, V.A. Solution of parabolic pseudo-differential initial-boundary value problems. J. Differ. Equ. 1990, 87, 256–304.

16. Lions, J.-L.; Malgrange, B. Sur l’unicité rétrograde dans les problèmes mixtes parabolic. Math. Scand. 1960, 8, 227–286.

17. Showalter, R.E. The final value problem for evolution equations. J. Math. Anal. Appl. 1974, 47, 563–572.

18. Janas, J. On unbounded hyponormal operators III. Stud. Math. 1994, 112, 75–82.

19. Yosida, K. Functional Analysis, 6th ed.; Springer: Berlin, Germany; New York, NY, USA, 1980.

20. Grubb, G. Distributions and Operators; Graduate Texts in Mathematics; Springer: New York, NY, USA, 2009; Volume 252.

21. Schwartz, L. Théorie des Distributions; Hermann: Paris, France, 1966.

22. Hörmander, L. The Analysis of Linear Partial Differential Operators; Grundlehren der mathematischen Wissenschaften; Springer: Berlin, Germany, 1983; p. 1985.

23. Pazy, A. Semigroups of Linear Operators and Applications to Partial Differential Equations; Applied Mathematical Sciences; Springer: New York, NY, USA, 1983.

24. Reed, M.; Simon, B. Methods of Modern Mathemtical Physics. I: Functional Analysis; Academic Press: Cambridge, MA, USA, 1980.

25. Niculescu, C.P.; Persson, L.-E. Convex Functions and Their Applications. a Contemporary Approach; CMS Books in Mathematics/Ouvrages de Mathématiques de la SMC; Springer: New York, NY, USA, 2006.

26. Johnsen, J. On spectral properties of Witten-Laplacians, their range projections and Brascamp–Lieb’s inequality. Integr. Equ. Oper. Theory 2000, 36, 288–324.

27. Johnsen, J.; Hansen, S.M.; Sickel, W. Anisotropic Lizorkin–Triebel spaces with mixed norms—Traces on smooth boundaries. Math. Nachr. 2015, 288, 1327–1359.

28. Johnsen, J.; Sickel, W. On the trace problem for Lizorkin–Triebel spaces with mixed norms. Math. Nachr. 2008, 281, 1–28.

29. Hansen, S.M. On Parabolic Boundary Problems Treated in Mixed-Norm Lizorkin–Triebel Spaces. Ph.D. Thesis, Aalborg University, Aalborg, Denmark, 2013.

30. Arendt, W.; Batty, C.J.K.; Hieber, M.; Neubrander, F. Vector-Valued Laplace Transforms and Cauchy Problems, 2nd ed.; Monographs in Mathematics; Springer: Basel, Switzerland, 2011; Volume 96.

31. Grubb, G. Functional Calculus of Pseudo-Differential Boundary Problems, 2nd ed.; Progress in Mathematics; Birkhäuser: Boston, MA, USA, 1996; Volume 65.

32. Herbst, I.W. Dilation analyticity in constant electric field. I. The two body problem. Commun. Math. Phys. 1979, 64, 279–298.

33. Lieberman, G.M. Second Order Parabolic Differential Equations, 2nd ed.; World Scientific Publishing: River Edge, NJ, USA, 2005.

34. Evans, L.C. Partial Differential Equations, 2nd ed.; Graduate Studies in Mathematics; American Mathematical Society: Providence, RI, USA, 2010; Volume 19.

35. Ladyzenskaya, O.A.; Solonnikov, V.A.; Ural’ceva, N.N. Linear and Quasilinear Equations of Parabolic Type; Translations of Mathematical Monographs; American Mathematical Society: Providence, RI, USA, 1968.

36. Grubb, G. Parameter-elliptic and parabolic pseudodifferential boundary problems in global Lp Sobolev spaces. Math. Z. 1995, 218, 43–90.

37. Denk, R.; Kaip, M. General Parabolic Mixed Order Systems In Lp and Applications; Operator Theory: Advances and Applications; Birkhäuser: Boston, MA, USA, 2013; Volume 239.

38. Dardé, J.; Ervedoza, S. Backward Uniqueness Results for Some Parabolic Equations in an Infinite Rod. Available online: https://hal.archives-ouvertes.fr/hal-01677033 (accessed on 29 March 2018).

39. Hào, D.N.; van Duc, N. Stability results for backward parabolic equations with time-dependent coefficients. Inverse Probl. 2011, 25, 20.

40. Kukavica, I. Log-log convexity and backward uniqueness. Proc. Am. Math. Soc. 2007, 135, 2415–2421.

41. Lattès, R.; Lions, J.-L. Méthode de quasi-réversibilité et applications; Travaux et Recherches Mathématiques: Dunod, Paris, 1967.

42. Kirsch, A. An Introduction to the Mathematical Theory of Inverse Problems; Applied Mathematical Sciences; Springer-Verlag: New York, NY, USA, 1996; Volume 120.

43. Fernández-Cara, E.; Guerrero, S. Global Carleman inequalities for parabolic systems and applications to controllability. SIAM J. Control Optim. 2006, 45, 1399–1446.

44. Koch, H.; Tataru, D. Carleman estimates and unique continuation for second order parabolic equations with nonsmooth coefficients. Commun. Part. Differ. Equ. 2009, 34, 305–366.

AuthorAffiliation

¹ Unit of Epidemiology and Biostatistics, Aalborg University Hospital, Hobrovej 18-22, DK-9000 Aalborg, Denmark

² Department of Mathematics, Aalborg University, Skjernvej 4A, DK-9220 Aalborg Øst, Denmark

* Correspondence: [email protected]; Tel.: +45-9940-8847

^† These authors contributed equally to this work.

Word count: 16999

Show less

© 2018. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

This article concerns the basic understanding of parabolic final value problems, and a large class of such problems is proved to be well posed. The clarification is obtained via explicit Hilbert spaces that characterise the possible data, giving existence, uniqueness and stability of the corresponding solutions. The data space is given as the graph normed domain of an unbounded operator occurring naturally in the theory. It induces a new compatibility condition, which relies on the fact, shown here, that analytic semigroups always are invertible in the class of closed operators. The general set-up is evolution equations for Lax–Milgram operators in spaces of vector distributions. As a main example, the final value problem of the heat equation on a smooth open set is treated, and non-zero Dirichlet data are shown to require a non-trivial extension of the compatibility condition by addition of an improper Bochner integral.

Details

Title

Final Value Problems for Parabolic Differential Equations and Their Well-Posedness

Author

Christensen, Ann-Eva; Johnsen, Jon

Publication year

2018

Publication date

Jun 2018

Publisher

MDPI AG

e-ISSN

20751680

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.3390/axioms7020031

ProQuest document ID

2125177873

Final Value Problems for Parabolic Differential Equations and Their Well-Posedness

Jump to:

Full text

Abstract

Details

Suggested sources