Lower Semicontinuity of the Universal Functional in Paramagnetic Current–Density Functional Theory

A cornerstone of current–density functional theory (CDFT) in its paramagnetic formulation is proven. After a brief outline of the mathematical structure of CDFT, the lower semicontinuity and expectation-valuedness of the CDFT constrained-search functional is proven, meaning that there is always a minimizing density matrix in the CDFT constrained-search universal density functional. These results place the mathematical framework of CDFT on the same footing as that of standard DFT.

Density functional theory (DFT) is at present the most widely used tool for first-principles electronic structure calculations in solid-state physics and quantum chemistry. DFT was put on a solid mathematical ground by Lieb in a landmark paper 1 from 1983, where he introduced the universal density functional F(ρ) as the convex conjugate to the concave ground-state energy E(v) for an electronic system in the external scalar potential v.
For electronic systems under the influence of a classical external magnetic potential A, current−density functional theory (CDFT) was introduced by Vignale and Rasolt in 1987. 2 In addition to the density ρ, the paramagnetic current density j p becomes a basic variable. The mathematical foundation of CDFT was put in place by Tellgren et al. 3 and Laestadius 4,5 in the 2010s on the basis of Lieb's treatment of the field-free standard case. However, a central piece of the puzzle has been missingnamely, whether the CDFT constrained-search functional F(ρ, j p ) is lower-semicontinuous and expectation-valued, 6 i.e., whether the infimum in its definition (see eq 3 below) is in fact attained. These foundational issues are important because CDFT is the natural extension of DFT to treat general magnetic systems and several numerical implementations have been reported, although the development of practical functionals lags behind standard DFT. 7−13 In this Letter, we provide proofs of the above assertions. The CDFT constrained-search functional is indeed convex lowersemicontinuous and can therefore be identified with the CDFT Lieb functionalthat is, the Legendre−Fenchel transform of the energy. Without this fact, the ground-state energy functional E(v, A) and the constrained-search functional F(ρ, j p ) contain dif ferent information. If F(ρ, j p ) were not expectation-valued, one would lose the interpretation of the universal functional as intrinsic energy, which is very useful in standard DFT. For the interested reader, suggested further reading for convex analysis are van Tiel's excellent introductory text 14 and the monograph by Ekeland and Teḿam. 15 Also, the monograph by Barbu and Precupanu, 16 which treats convex analysis in Banach spaces, and the one by Bauschke and Combettes, 17 which focuses on the Hilbert space formulation, are highly recommended. For more details on trace-class operators, the monograph by Weidmann is an accessible starting point, 18 as well as the now classic volume by Reed and Simon. 19 For an N-electron system in sufficiently regular external potentials v and A, the ground-state energy is given by the Rayleigh−Ritz variation principle as 2 and two-electron repulsion operator W. The minimization is over all N-electron density matrices Γ of finite kinetic energy, for which the one-electron density is 3 . 20 (The boldface notation indicates a space of vector fields.) The external potential energy (v|ρ) = , the paramagnetic and diamagnetic terms and thus the Hamil- 3 , where X L ′ and X p ′ are the dual spaces of X L and X p , respectively. Examples of such potentials are the nuclear Coulomb potentials and uniform magnetic fields inside bounded domains. The symbol X L for the space of densities is so chosen to indicate that it is the density space of Lieb's analysis, while X p indicates "paramagnetic" current densities. By a well-known reformulation of eq 1, we obtain the CDFT Hohenberg−Kohn variation principle: Here the Vignale−Rasolt constrained-search density functional F: where H 0 = T(0) + W is the intrinsic electronic Hamiltonian, and Γ → (ρ, j p ) means that the infimum is taken over all Nelectron density matrices Γ with density−current pair Thus, if (ρ, j p ) is not Nrepresentable, we have F(ρ, j p ) = +∞. The universal density functional F is the central quantity in any flavor of DFT, and its mathematical properties and approximation are of utmost importance to the field.
Although E in eq 2 is not concave, it is readily seen that the reparametrized energy is concave. This reparametrization relies on a technical notion of compatibility of function spaces for the scalar and vector potentials, 20 which is satisfied for the potentials we consider here. From the concavity and upper semicontinuity of the modified ground-state energy E, one can deduce the existence of an alternative universal density functional F: X L × X p → [0, +∞] that is related to the ground-state energy by Legendre−Fenchel transformations in the manner where the optimizations are over the space X L × X p and its dual X L ′ × X p ′, respectively. As a Legendre−Fenchel transform, the functional F̂is convex and lower-semicontinuous. In this formulation of CDFT, the ground-state energy Ẽand the universal density functional F̂contain precisely the same information: each functional can be obtained from the other and therefore contains the same of the information about ground-state electronic systems in external scalar and vector fields. From a comparison of the Hohenberg−Kohn variation principles in eqs 2 and 5, it is tempting to conclude that F̂and F are the same functional (i.e., F̂= F), producing the same ground-state energy for each (v, A). However, there exist infinitely many functionals F: X L × X p → [0, +∞] that give the correct ground-state energy E(v, A) (but not necessarily the same minimizing densities, if any) for each (v, A) in the Hohenberg−Kohn variation principle. Each such F̃is said to be an admissible density functional. 6 Among these, the functional F̂stands out as being the only lower-semicontinuous and convex universal density functional and a lower bound to all other admissible density functionals (i.e., F̂≤ F). The functional F, the closed convex hull of all admissible density functionals, is thus the most well-behaved admissible density functional. Indeed, we may view it as a regularization of all admissible density functionals, known as the Γ regularization in convex analysis. (This name is unrelated to our notation of density matrices.) A fundamental result of Lieb's analysis of DFT is the identification of the transparent constrained-search density functional with the mathematically well-behaved closed convex hull F. The identification follows since F is convex and lowersemicontinuous. Whereas convexity follows easily for the CDFT Vignale−Rasolt functional F, the proof of lower semicontinuity is nontrivial. For standard DFT it is given in ref 1, and for CDFT it is provided in the present Letter.
We simplify our analysis by merely assuming that the density−current pairs are (ρ,j p ) 1 3 4 [ ] which we denote as X. With this topology, the potentials must be taken to be bounded functions, 3 4 ] This simplification is irrelevant in this context: if F can be shown to be lowersemicontinuous in the  L ( ) 1 3 4 [ ] topology, it will be lowersemicontinuous in any stronger topology, as required if we enlarge the potential space to include more singular functions such as those in X L ′ × X p ′. Indeed, the original proof of lower semicontinuity of the standard DFT Levy−Lieb functional (eq 3) was with respect to the  L ( ) 1 3 topology, from which the same property with respect to the X L topology immediately follows.
Theorem and Proof. The intrinsic Hamiltonian H 0 is self- where TC(L N 2 ) is the set of trace-class operators over L N 2 , the largest set of operators to which a basis-independent trace can be assigned. An operator A is trace-class if and only if the if and only if λ k ≥ 0, ∑ k λ k = 1, and {ϕ k } ⊂ H N 1 and if the total kinetic energy is finite (i.e., ∑ k λ k ⟨ϕ k |T|ϕ k ⟩ < +∞).
The theorem involves the weak topology on Weak convergence of a sequence {x n }⊂ X, written as x n ⇀ x ∈ X, means that for any bounded linear functional ω ∈ X′, we have ω(x n ) → ω(x) as a sequence of numbersthat is, weak convergence is the pointwise convergence of all bounded linear functionals. Recall that the dual space of Theorem 1. Suppose that (ρ, j p ) ∈ X and {(ρ n , j pn )} ⊂ X are such that F(ρ, j p ) < +∞ and F(ρ n , j pn ) < +∞ for each  n ∈ and f urther suppose that (ρ n , j pn ) ⇀ (ρ, j p ). Then there exists N Γ ∈ such that Γ → (ρ, j p ) and Tr(H 0 Γ) ≤ lim inf n F(ρ n , j pn ).
Proof of Theorem 1. The initial setup follows ref 1, which we here restate. Without loss of generality, we may replace H 0 = T + W by h 2 = T + W + 1, which is self-adjoint and positivedefinite. The operator h is taken to be the unique positive selfadjoint square root of T + W + 1.
Consider the sequence {g n } with elements g n ≔ F(ρ n , j pn ). If g n → +∞, then the statement of the theorem is trivially true. We therefore assume that {g n } is bounded. There then exists a subsequence such that g ≔ lim n g n exists. Furthermore, for each n there exists n N Γ ∈ such that Γ n → (ρ n , j pn ) and Tr(hΓ n h) = Tr(h 2 Γ n ) ≤ g + 1/n. To demonstrate this, we select for each n a density matrix Γ n → (ρ n , j pn ) that satisfies Tr(h 2 Γ n ) < g n + 1/2n and choose m such that |g − g n | < 1/2n for each n > m Using the sequence {hΓ n h}, we next establish a candidate limit density operator N Γ ∈ . The dual-space sequence of (positive-semidefinite) operators y n ≔ hΓ n h ∈ TC(L N 2 ) is uniformly bounded in the trace norm: ∥y n ∥ TC ≤ g + 1. By the Banach−Alaoglu theorem, a norm-closed ball of finite radius in the dual space is compact in the weak-* topology. Thus, there exists y ∈ TC(L N 2 ) such that for a subsequence, Tr(By n ) → Tr(By) for each B K L ( ) N 2 ∈ , meaning that y is the (possibly nonunique) weak-* limit of a subsequence of {y n }. The limit is positive-definite, since the orthogonal projector P Φ onto L N 2 Φ ∈ is a compact operator, which gives Let (ρ′, j p ′) ← Γ be the density associated with Γ. To demonstrate that (ρ′, j p ′) = (ρ, j p ), we recall that (ρ n , j pn ) ⇀ (ρ, j p ) by assumption. Since weak limits are unique, our proof is complete if we can show that (ρ n , j pn ) ⇀ (ρ′, . The proof that ρ n ⇀ ρ′ is given in ref 1. Here we demonstrate that j pn ⇀ j p ′ by showing for each  a L ( ) be a bounded domain with characteristic function χ equal to 1 on Ω and 0 elsewhere.
We now let S = ∏ i=1 N χ(r i ) be the characteristic function of  . By the definition of U α , we then have and using the symmetry of |ψ μ | 2 , we obtain for the two factors We conclude that I(U α ) 2 ≤ 2Ngε. Introducing U n,α = N Im diag ∂ 1α Γ n and proceeding in the same manner, we arrive at the bound I(U n,α ) 2 ≤ 4Ngε, assuming that n has been chosen to be sufficiently large that ∫ (1 − χ)ρ n dr < 2ε holds.
We are now ready to consider the weak convergence j pn ⇀ j p ′ in  L( ) 1 3 . For each  a L ( ) 3

∈ ∞
and for sufficiently large n, using the Cauchy−Schwarz inequality and the Holder inequality in combination with the bounds I(U α ) 2 ≤ 2Ngε and I(U n,α ) 2 ≤ 4Ngε, we obtain the inequality Since ε > 0 is arbitrary, it only remains to show that we have ∫ (U n,α − U α )a α (r 1 )S dτ → 0 as n → ∞.
Let M be the compact multiplication operator associated with a α (r 1 )S(τ), a bounded function with compact support over  N 3 . Let Ω σ = {↑, ↓} be the set consisting of the two spin states of the electrons. We note that  (14) viewing Γ n as an operator over L 2 ((Ω×Ω σ ) N ) by domain restriction of the spectral decomposition elementsthat is, ψ μ ∈ H 1 ((Ω×Ω σ ) N ), meaning that the 2 N spin components of ψ μ are in H 1 (Ω N ). For simplicity, the spaces used here are not antisymmetrized. Our next task is to demonstrate that B = h −1 M∂ 1α h −1 is compact over L 2 ((Ω×Ω σ ) N ). We first show that h −1 is compact with range H 1 ((Ω×Ω σ ) N ). We have h T W 1 = + + with domain H 1 ((Ω×Ω σ ) N ). Now h −1 exists and is bounded since −1 is not in the spectrum of T + Wthat is, h −1 : By the Rellich− Kondrachov theorem, H 1 (Ω N ) (the standard Sobolev space without spin) is a compact subset of L 2 (Ω N ). It follows that H 1 ((Ω×Ω σ ) N ) is a compact subset of L 2 ((Ω×Ω σ ) N ), since the tensor product of compact sets is compact. Hence, h −1 is compact.
From the compactness of B, it follows that  (15) by the weak-* convergence of y n to y. We conclude that j pn ⇀ j p ′ and hence that (ρ, j p ) = (ρ′, j p ′), completing the proof.
In conclusion, we have extended Theorem 4.4 of ref 1 to CDFT. As immediate corollaries, the constrained-search functional F(ρ, j p ) is lower-semicontinuous and expectationvalued, that is, if F(ρ, j p ) < +∞, then there exists Γ → (ρ, j p ) such that F(ρ, j p ) = Tr(H 0 Γ). These mathematical results are the final pieces in the puzzle of placing CDFT on a solid mathematical ground in a similar manner as done by Lieb for standard DFT.
The Journal of Physical Chemistry Letters pubs.acs.org/JPCL Letter