For any causal nonlinear electrodynamics theory that is “self-dual” (electromagnetic $U(1)$ -duality invariant), the Legendre-dual pair of Lagrangian and Hamiltonian densities $\{\mathcal{L},\mathcal{H}\}$ are constructed from functions $\{\ell,\mathfrak{h}\}$ on $\hbox{\mybb R}^{+}$ related to a particle-mechanics Lagrangian and Hamiltonian. We show how a ‘duality’ relating $\ell$ to $\mathfrak{h}$ implies that $\mathcal{L}$ and $\mathcal{H}$ are related by a simple map between appropriate pairs of variables. We also discuss Born’s “Legendre self-duality” and implications of a new “ $\Phi$ -parity” duality. Our results are illustrated with many examples.

1 Introduction

Nonlinear theories of electrodynamics (NLED) are generally defined [1, 2, 3, 4] by means of a Lagrangian density function $\mathcal{L}(S,P)$ of the two Lorentz (pseudo)scalars

S=\frac{1}{2}\left(E^{2}-B^{2}\right)\,,\qquad P={\bf E}\cdot{\bf B}\,,

(1.1)

where $({\bf E},{\bf B})$ are the (electric, magnetic) 3-vector-field components of the abelian 2-form field strength $F=dA$ on Minkowski spacetime, and $(E,B)$ are their respective magnitudes. A feature of all NLED theories in this “Plebanski” class is that the degrees of freedom remain those of the free-field Maxwell theory, although superluminal signal propagation (and hence causality violation) is potentially possible, and the physical theories are those for which it is not possible. These features are shared by some NLED theories outside the Plebanski class, some of which are physical [5, 6, 7, 8], but they have no weak-field limit. In this paper we assume the existence of a (conformal) weak-field limit.

We also focus on a class of NLED theories that share with the free-field Maxwell electrodynamics the property of electromagnetic duality invariance. In the Maxwell case this can be viewed as an invariance of the (source-free) Maxwell equations under any constant shift of the phase of the complex 3-vector field ${\bf E}+i{\bf B}$ . However, this definition applies only in Cartesian coordinate systems since ${\bf E}$ and ${\bf B}$ are 3-vectors in dual vector spaces. A better definition, which not only applies for any coordinate system but also generalises to nonlinear electrodynamics, is as an invariance of the Hamiltonian density under any constant phase shift of the complex 3-vector field

{\bf D}+i{\bf B}\,,\qquad{\bf D}:=\frac{\partial\mathcal{L}}{\partial{\bf E}}\,.

(1.2)

Invariance of the field equations is then a consequence of the invariance of the Hamiltonian. For the Maxwell case, ${\bf D}={\bf E}$ in Cartesian coordinates, and we therefore recover the earlier definition. Following what has become standard terminology, we shall say that NLED theory with this $U(1)$ symmetry is “self-dual”.

Within the Lagrangian formulation, the restriction to a self-dual theory is achieved by requiring the Lagrangian density to satisfy the following partial differential equation (PDE) [5]:

P\left(\mathcal{L}_{S}^{2}-\mathcal{L}_{P}^{2}-1\right)=2S\mathcal{L}_{S}% \mathcal{L}_{P}\,.

(1.3)

The first example, excepting the free-field Maxwell case, was the Born-Infeld theory [2], although its self-duality was noticed by Schrödinger a few years later [9]. Since the re-appearance of Born-Infeld electrodynamics in the 1990s as (or as part of) an effective theory for open strings of string theories [10, 11, 12, 13] there has been a resurgence of interest in nonlinear electrodynamics. In particular, the possibility of a Born scale in electrodynamics is now taken seriously for its potential relevance to the physics of magnetars, e.g. [14, 15], and to particle physics experiments at future colliders [16].

One motivation for our focus on self-dual NLED theories is that many of the special properties of Born-Infeld are consequences of its self-duality. Another motivation is the recent result that strong-field causality is implied by weak-field causality for all self-dual NLED with a weak-field limit [17]. We elaborate below on the significance of this fact, and one purpose of this paper is to provide more details of the results of [17].

Another purpose is to expand the results of [17] to include the Hamiltonian formulation. As this is equivalent to the Lagrangian formulation for all causal NLED theories, we did not expect surprises. However, various additional remarkable properties of self-dual theories emerge from the conjunction of the Lagrangian and Hamiltonian formulations. For the remainder of this Introduction we provide the necessary background and a sketch of our main new results.

The self-duality PDE (1.3) can be simplified by expressing $\mathcal{L}$ as a function of $S$ and

\Phi:=\sqrt{S^{2}+P^{2}}\,.

(1.4)

This is possible only if $\mathcal{L}$ preserves parity since both $S$ and $\Phi$ are parity even whereas $P$ is parity odd, but this restriction is not a limitation for self-dual NLED because self-duality implies parity, for a reason to be explained below. The self-duality PDE for $\mathcal{L}(S,\Phi)$ is [18]

\mathcal{L}_{S}^{2}-\mathcal{L}_{\Phi}^{2}=1\,.

(1.5)

For some purposes it is convenient to use the alternative independent variables

U=\frac{1}{2}(\Phi-S)\ ,\qquad V=\frac{1}{2}(\Phi+S)\,.

(1.6)

Notice that $(U,V)$ as defined are both non-negative because $\Phi\geq|S|$ . This implies that the ‘physical’ values of $(U,V)$ are restricted to the positive quadrant in the $(U,V)$ -plane. The self-duality PDE for $\mathcal{L}(U,V)$ is [19]

\mathcal{L}_{U}\mathcal{L}_{V}=-1\,.

(1.7)

The general solution to this equation in the positive $(U,V)$ -quadrant, in terms of the boundary function $\ell(V):=\mathcal{L}(0,V)$ , is [20]

\mathcal{L}=\ell(\tau)-\frac{2U}{\dot{\ell}(\tau)}\ ,\qquad\tau=V+\frac{U}{% \dot{\ell}^{2}(\tau)}\,,

(1.8)

where

\dot{\ell}(\tau)=\frac{d\ell(\tau)}{d\tau}>0\,.

(1.9)

We shall call this the Courant-Hilbert (CH) solution, and $\ell(\tau)$ a “CH-function”. Notice that $\tau\geq 0$ by definition, with equality only for $U=V=0$ . The choice $\ell(\tau)=\tau$ yields the free-field Maxwell case.

To verify that (1.8) solves (1.7), we may take the differential of both sides of both equations of (1.8). The resulting pair of equations for the differentials may then be solved for $d\mathcal{L}$ and $d\tau$ in terms of $dU$ and $dV$ . The result for $d\mathcal{L}$ implies

\mathcal{L}_{V}=\dot{\ell}\,,\qquad\mathcal{L}_{U}=-{\dot{\ell}}^{-1}\,,

(1.10)

which confirms that $\mathcal{L}_{U}\mathcal{L}_{V}=-1$ . The result for $d\tau$ is

Gd\tau=\dot{\ell}(dU+\dot{\ell}^{2}dV)\,,

(1.11)

where

G:=\dot{\ell}^{3}+2\ddot{\ell}U\,.

(1.12)

The main implications of (1.11) were briefly discussed in [17] and we review this, with more detail, in the following section.

Within the Plebanski NLED class, the necessary and sufficient conditions for causality were found in [21], subject to an assumption about the domain of the function $\mathcal{L}(S,P)$ that can be interpreted physically as the existence of a weak-field limit. These conditions can be separated into two sets according to whether a violation is possible (generically) for weak fields, or only for strong fields. The former set are equivalent to convexity of the function $\mathcal{L}(S,P)$ , which are also the conditions for convexity of $\mathcal{L}$ as a function of ${\bf E}$ [22]. The remaining (strong-field) causality condition was provided with some intuition and an alternative derivation in [23]. For self-dual NLED theories with a weak-field limit, we showed in [17] that all these causality conditions reduce to the following simple inequalities to be satisfied by derivatives of the CH-function $\ell(\tau)$ :

\dot{\ell}\geq 1\,,\qquad\ddot{\ell}\geq 0\,.

(1.13)

Apart from its simplicity, this result is remarkable because there was no a priori reason to suppose that the causality conditions on $\ell$ would be independent of $(U,V)$ . Notice that the condition $\ddot{\ell}\geq 0$ tells us that $\ell(\tau)$ is a convex function.

The assumption of a weak-field limit can also be expressed in terms of the function $\ell(\tau)$ ; it is the statement that $\ell(\tau)$ should have a Taylor-series expansion in $\tau$ . Omitting the constant term in this expansion on the grounds that it is irrelevant to the NLED dynamics; we have

\ell(\tau)=e^{\gamma}\tau+\mathcal{O}(\tau^{2})\,,

(1.14)

for some dimensionless constant $\gamma$ , which must be non-negative in order to satisfy the causality condition $\dot{\ell}\geq 1$ in the weak-field limit. In this limit $\ell(\tau)=e^{\gamma}\tau$ , which yields the free-field Maxwell theory for $\gamma=0$ and ModMax (the “modified Maxwell” theory [24]) for $\gamma>0$ . Both are conformal because the conformality condition for self-dual NLED is equivalent to degree-1 homogeneity of $\ell(\tau)$ , as we show in section 3. Another feature of the existence of a weak-field expansion is that $\ell(\tau)$ defines a function not only for $\tau\geq 0$ (which is all that is relevant to the CH solution of the self-duality PDE) but also for $\tau<0$ , at least in some neighbourhood of $\tau=0$ . We shall see later the significance of this fact.

A feature of the CH equations (1.8) is that many simple functions $\ell(\tau)$ satisfying (1.13) allow $\mathcal{L}(U,V)$ to be found analytically, leading to explicit Lagrangian densities for a variety of causal self-dual NLED theories. These include Born-Infeld [2] and its Mod-Max-type generalisation [24, 25] that we call, for brevity, “ModMaxBorn”. Other examples were given in [17] and more will be given here.

We shall expand on the results of [17] in the following section but, as stated above, our main purpose is to explore the Hamiltonian formulation for self-dual NLED. An advantage of this formulation is that self-duality can be implemented simply by restricting the Hamiltonian density $\mathcal{H}({\bf D},{\bf B})$ to be a function of the two duality-invariant rotation scalars

s=\frac{1}{2}\left(D^{2}+B^{2}\right)\,,\qquad p=|{\bf D}\times{\bf B}|\,% \qquad(D=|{\bf D}|).

(1.15)

As both $s$ and $p$ are parity-even (parity flips the sign of ${\bf D}$ ) it follows that any function $\mathcal{H}(s,p)$ is duality invariant, and hence that all self-dual NLED theories preserve parity, as claimed above.

A disadvantage of the Hamiltonian formulation is that Lorentz invariance is not manifest. The condition for a generic, and not necessarily duality-invariant, Hamiltonian density to define a Lorentz invariant NLED was found in [5]. Here we need this condition for functions of $(s,p)$ only, and if we trade these variables for $s$ and

\varphi:=\sqrt{s^{2}-p^{2}}\,,

(1.16)

then the Lorentz invariance condition for $\mathcal{H}(s,\varphi)$ is the PDE

\mathcal{H}_{s}^{2}-\mathcal{H}_{\varphi}^{2}=1\,,

(1.17)

which is formally identical to the Lagrangian self-duality PDE of (1.5).

For some purposes it is convenient to use the new independent variables¹¹1These differ from the definitions of $(u,v)$ in [24] by the exchange $u\leftrightarrow v$ , which facilitates comparison between the Lagrangian and Hamiltonian formulations of self-dual NLED.

u=\frac{1}{2}\left(s-\varphi\right)\ ,\qquad v=\frac{1}{2}\left(s+\varphi% \right)\,,

(1.18)

Notice that $(u,v)$ are both non-negative, and that $v\geq u$ , so the ‘physical’ region in the $(u,v)$ -plane is the region of the positive quadrant bounded by $u=0$ and $v=u$ . The condition for $\mathcal{H}(u,v)$ to define a Lorentz invariant NLED is [25]

\mathcal{H}_{u}\mathcal{H}_{v}=1\,.

(1.19)

This is mathematically equivalent to (1.7) since the sign on the right-hand side can be changed by using $(-u,v)$ instead of $(u,v)$ as the independent variables, and the general solution for $\mathcal{H}$ is then formally the same as the solution of (1.8) for $\mathcal{L}$ . However, we shall use the variables $(u,v)$ as defined above because they are both non-negative. Notice that (1.19) has the solution

\mathcal{H}(u,v)=\sqrt{4uv}=p\,,

(1.20)

which defines the conformal Bilaynicki-Birula electrodynamics [5]. There is no analogous solution of (1.7) because of the different sign on the right-hand side.

All other solutions of (1.19), expressed in terms of the boundary function $\mathcal{H}(0,v)=\mathfrak{h}(v)$ , are given by

\mathcal{H}=\mathfrak{h}(\sigma)+\frac{2u}{\mathfrak{h}^{\prime}(\sigma)}\ ,% \qquad\sigma=v-\frac{u}{\left(\mathfrak{h}^{\prime}(\sigma)\right)^{2}}\qquad(% \mathfrak{h}^{\prime}>0),

(1.21)

where $\mathfrak{h}(\sigma)$ is a new CH-function analogous to $\ell(\tau)$ .

Corresponding to every causal NLED defined by a function $\mathcal{L}(U,V)$ there is a Hamiltonian density function $\mathcal{H}(u,v)$ and the two are related by a Legendre transform. This is because convexity of $\mathcal{L}$ (as a function of ${\bf E}$ ) implies convexity of $\mathcal{H}$ (as a function of ${\bf D}$ ) and this implies that the Legendre transform is an involution, although “strict” convexity (non-zero Hessian determinant) is needed to apply this theorem to the Plebanski class of NLED theories. For self-dual theories a corollary of this correspondence is that the functions $\ell(\tau)$ and $\mathfrak{h}(\sigma)$ must be related in some way that allows one to be found from the other. What we find is that the following functions are Legendre transforms of each other:

L(\sqrt{2\tau})=\ell(\tau)\,,\qquad H(\sqrt{2\sigma})=\mathfrak{h}(\sigma)\,.

(1.22)

In other words, the functions $\ell$ and $\mathfrak{h}$ are related by a Legendre transform²²2As we explain later, this requires $\sigma\geq 0$ . but in terms of the new variables $\sqrt{2\tau}$ and $\sqrt{2\sigma}$ . Our choice of notation is motivated by the fact that the functions $L$ and $H$ can be interpreted as the Lagrangian and Hamiltonian of a particle mechanics model associated to the NLED defined by the Lagrangian and Hamiltonian densities $\mathcal{L}$ and $\mathcal{H}$ . This was a motivating analogy for Born’s original NLED theory, and a correspondence between Born-Infeld and the massive relativistic particle is a consequence of T-duality for the effective worldvolume field theories of D-branes (see e.g. [26, 27]). However, the correspondence applies more generally, as we discuss in section 7.

Our Hamiltonian results for self-dual NLED theories allow us to ‘translate’ the causality conditions on $\ell(\tau)$ to corresponding causality conditions on $\mathfrak{h}(\sigma)$ . As we shall see, these are

0<\mathfrak{h}^{\prime}(\sigma)\leq 1\,,\qquad\mathfrak{h}^{\prime}{}^{\prime}% (\sigma)\leq 0\,,

(1.23)

and

\mathfrak{h}^{\prime}(\sigma)+2\sigma\mathfrak{h}^{\prime}{}^{\prime}(\sigma)>% 0\,.

(1.24)

This last condition is equivalent to strict convexity of the function $H(\sqrt{2\sigma})$ , which is required for its interpretation as the Legendre dual of $L(\sqrt{2\tau})$ , which is in turn required for the interpretation of $\mathcal{H}$ as the Legendre dual of $\mathcal{L}$ . Notice that $\mathfrak{h}(\sigma)$ is required to be a concave function ( $\mathfrak{h}^{\prime}{}^{\prime}\leq 0$ ) in contrast to the convexity condition ( $\ddot{\ell}\geq 0$ ) on $\ell(\tau)$ .

Surprisingly, the function $\mathfrak{h}$ can be used to directly construct not only the Hamiltonian density but also the Lagrangian density, again via the Courant-Hilbert solution but now for boundary conditions at $V=0$ rather than $U=0$ . Since $U$ and $V$ are exchanged by an exchange of ${\bf E}$ and ${\bf B}$ , this is a type of electromagnetic duality, which is indirectly equivalent to a Legendre duality. As we shall see, this fact implies a remarkably simple relation between the Lagrangian and Hamiltonian densities of any self-dual NLED. For example, given the Lagrangian density in the form $\mathcal{L}(S,\Phi)$ the Hamiltonian density in the form $\mathcal{H}(s,\varphi)$ can be found from the following procedure:

\boxed{\mathcal{L}(S,\Phi)\ \longrightarrow\ -\mathcal{L}(-s,\varphi)=\mathcal% {H}(s,\varphi)}\ .

(1.25)

This allows us to find $\mathcal{H}$ from $\mathcal{L}$ without the need for a Legendre transform! For the free-field Maxwell case, we have

\mathcal{L}=S\ \longrightarrow\ -(-s)=\mathcal{H}\quad\Rightarrow\ \mathcal{H}% =s\,.

(1.26)

This result suggests that $\ell$ and $\mathfrak{h}$ must be similarly related since they determine $\mathcal{L}$ and $\mathcal{H}$ . We find, in some cases, that there is indeed a very simple relation between these two CH-functions, but the general case requires consideration of what we call the “ $\Phi$ -parity” (equivalently, “ $\varphi$ -parity”) dual NLED defined by

\hat{\mathcal{L}}(S,\Phi):=\mathcal{L}(S,-\Phi)\,,\qquad\hat{\mathcal{H}}(s,% \varphi):=\mathcal{H}(s,-\varphi)\,.

(1.27)

In some cases, such as Born-Infeld, $\hat{\mathcal{L}}=\mathcal{L}$ . For this $\Phi$ -parity invariant subset of self-dual NLED we find that

\ell(x)+\mathfrak{h}(-x)=0\,,\qquad x\in\hbox{\mybb R}\,,\qquad\ \ (\hat{% \mathcal{L}}=\mathcal{L}).

(1.28)

As mentioned above, only the values of $\ell(x)$ for $x\geq 0$ are relevant to the CH solution for $\mathcal{L}(U,V)$ , but $\ell(x)$ is defined for $x<0$ if there is a weak field limit (which we are assuming here). We now see that for any $\Phi$ -parity invariant NLED (with a weak-field limit) the CH function $\ell(x)$ for $x\leq 0$ determines the other CH function $\mathfrak{h}$ . It remains true, of course, that $\ell$ and $\mathfrak{h}$ are related by their relation to the Legendre dual pair of functions $\{L,H\}$ , but no Legendre transform is needed to find one from the other!

More generally, $\hat{\mathcal{L}}\neq\mathcal{L}$ and $\hat{\mathcal{L}}$ is associated with a pair of CH-functions $\{\hat{\ell},\hat{\mathfrak{h}}\}$ that differ from $\{\ell,\mathfrak{h}\}$ . For these cases we find that $\{\ell,\hat{\ell}\}$ are related related to $\{\mathfrak{h},\hat{\mathfrak{h}}\}$ by a pair of relations similar to (1.28) but intertwined by $\Phi$ -parity.

An obvious question is whether there is a simple characterisation, in terms of restrictions on the CH functions $\{\ell,\mathfrak{h}\}$ , of the subclass of self-dual NLED theories that are also $\Phi$ -parity invariant. There is, but it involves an alternative solution of the self-duality PDE, also given by Courant and Hilbert [20], in terms of a function $\omega(x)$ of a positive variable $x$ . The $\Phi$ -parity invariant self-dual theories are those for which $\omega(x)$ is invariant under $x\to 1/x$ , and for these theories we show that $\omega(x)$ is the Legendre-dual of $\ell(\tau)$ with respect to $\tau$ (rather than $\sqrt{2\tau})$ . Born-Infeld corresponds to the choice of a linear function of $(x+1/x)$ .

Another topic that we discuss is “Legendre self-duality”, which has no direct connection to the topics described above, but could potentially be confused with them. The Hamiltonian density $\mathcal{H}$ is the Legendre transform of the Lagrangian density $\mathcal{L}$ with respect to ${\bf E}$ . If we now take the Legendre transform of $\mathcal{H}$ with respect to ${\bf B}$ we arrive at a ‘dual’ Lagrangian density $\tilde{\mathcal{L}}$ , which is a function of Lorentz scalars constructed from the Legendre-duals of $({\bf E},{\bf B})$ . It was noticed by Born, for Born-Infeld, that $\mathcal{L}$ and $\tilde{\mathcal{L}}$ are the same function³³3Born refers, confusingly, to the dual Lagrangian as the “Hamiltonian”, with the recognition that this is an abuse of terminology. of their respective Lorentz scalars [28]. Much later, it was shown by Gaillard and Zumino [18] that any (electromagnetically) self-dual theory shares this property of “Legendre self-duality”, and we prove this here by using the CH formula (1.8) for the general self-dual NLED theory. A subsequent clarification of Kuzenko and Theisen was the observation that “Legendre self-duality” relies only on invariance under a discrete $Z_{2}$ subgroup of the $U(1)$ electromagnetic duality group [29]. Here we give another proof based on the observation that if $\mathcal{H}({\bf D},{\bf B})$ is invariant under ${\bf D}\leftrightarrow{\bf B}$ then a Legendre transform with respect to ${\bf D}$ must yield the same function as a Legendre transform with respect to ${\bf B}$ . As an illustration, we explain how Born’s original NLED theory of 1933 [1] is Legendre self-dual without also being self-dual in the sense used here (and in [18, 29]).

We shall conclude with a summary of our main results and a brief discussion of further implications and future directions.

2 Strong-field causality redux

As mentioned in the introduction, weak-field causality implies strong-field causality for self-dual NLED theories if a weak-field limit is assumed. Without this assumption, causality requires the additional condition $G>0$ , where $G$ is given in (1.12). Either way, $G>0$ for causal theories and we can investigate its implications, as we did briefly in [17]. We now elaborate on some aspects of this topic here because it will be useful when we later extend the results to the Hamiltonian formulation.

From the equation for $\tau$ in (1.8) we learn that fixing $\tau$ restricts $(U,V)$ to a line in the positive-quadrant in the $(U,V)$ plane; i.e. the curves of constant $\tau$ in this quadrant are straight lines, with slopes

(dV/dU)(\tau)=-1/[\dot{\ell}(\tau)|^{2}\,.

(2.1)

Recalling the equation (1.11) for $d\tau$ we see that if $G=0$ at some point in the positive $(U,V)$ quadrant then we can take $\tau\to\tau+d\tau$ for $(U,V)$ . In other words, $G=0$ at the intersection point of the lines of constant $\tau$ and constant $\tau+d\tau$ . It follows that if $G>0$ everywhere in the domain of $\mathcal{L}(U,V)$ (which is either the entire positive $(U,V)$ quadrant or a connected subregion of it that includes the origin) then no two lines of $\tau$ can intersect in this domain. This is because the line of constant $\tau$ can intersect the line of constant $\tau+c$ , for any positive constant $c$ , only if it also intersects the line of constant $\tau+d\tau$ for positive infinitesimal $d\tau$ . Thus, if $G>0$ in the domain of $\mathcal{L}$ then this domain is foliated by lines of constant $\tau$ , as illustrated for Maxwell and Born-Infeld in Fig. 1.

Refer to caption — Figure 1: The lines of constant $\tau$ for the two cases: (a) Maxwell, $\ell(\tau)=\tau$ . (b) Born-Infeld, $\ell(\tau)=T-\sqrt{T(T-2\tau)}$ (for $T=1$ ).

We can interpret this conclusion in another way. If the solution of (1.8) for $\mathcal{L}(U,V)$ is unique then $\tau$ is uniquely determined by $(U,V)$ at each point in the domain of $\mathcal{L}(U,V)$ . However, there are at least two distinct values for $\tau$ at an intersection point, so $\tau$ cannot be uniquely determined by $(U,V)$ in any region that includes an intersection point. A necessary and sufficient condition for the uniqueness of the solution (1.8) is therefore that $G$ is nowhere zero in the domain of $\mathcal{L}(U,V)$ . In those cases for which $\ell(\tau)$ has a power-series expansion of the form (1.14), we know that $G>0$ is implied by the causality/convexity inequalities of (1.13), and hence that the solution for $\tau$ will be unique if these inequalities on $\dot{\ell}$ and $\ddot{\ell}$ are satisfied. Given the importance of this point, we shall show how it can be deduced in a more direct way.

We first rewrite the equation for $\tau$ in (1.8) as

f(\tau)=F_{(U,V)}(\tau)\,;\qquad f:=\dot{\ell}^{2}\,,\quad F_{(U,V)}:=\frac{U}% {\tau-V}\,.

(2.2)

The solution for $\tau$ is unique if the graph of the function $f$ has precisely one intersection with the graph of the function $F_{(U,V)}$ , for any choice of $(U,V)$ in the domain of $\mathcal{L}$ . The function $f$ has the properties

f(0)\geq 1\,,\qquad\dot{f}(\tau)\geq 0\,,

(2.3)

which follow from (1.14) and (1.13); i.e. $f(\tau)$ is a function of non-negative slope for all $\tau>0$ , with the minimum value being $f(0)\geq 1$ . The graph of the function $F_{(U,V)}(\tau)$ for $U>0$ is the branch of a hyperbola that has the line $\tau=V$ as one asymptote, and the $\tau$ -axis as the other asymptote. From this description it is obvious that the graphs of the two functions intersect at precisely one point for each choice of $(U,V)$ , which confirms that (2.2) has a unique solution for $\tau$ , as illustrated in Fig. 2 for ModMax and Born-Infeld.

2.1 Auxiliary fields and the stress-energy tensor

It was observed in [17] that the two equations of (1.8) may be combined (for causal theories) into the single equation

\mathcal{L}(U,V;\lambda,\tau)=\ell(\tau)-\frac{2U}{\dot{\ell}(\tau)}-\lambda% \left(\tau-V-\frac{U}{[\dot{\ell}(\tau)]^{2}}\right)\,,

(2.4)

where $\lambda$ is a Lagrange multiplier. This is because $\lambda$ and $\tau$ are, jointly, a pair of auxiliary fields that can be consistently eliminated by their algebraic field equations. Varying $\tau$ yields the equation $G(\lambda-\dot{\ell})=0$ , which implies $\lambda=\dot{\ell}$ if $G>0$ . Varying $\lambda$ yields a constraint that uniquely determines $\tau$ when $G>0$ (as illustrated in the previous subsection). Elimination of $(\lambda,\tau)$ thus yields the Lagrangian density defined by (1.8). Since $(U,V)$ are parity-even, we might expect to be able to make parity assignments for the auxiliary fields $(\lambda,\tau)$ such that the Lagrangian density of (2.4) has even parity. This is true: if we assign even parity to both $\lambda$ and $\tau$ then both $\ell(\tau)$ and $\dot{\ell}(\tau)$ are parity-even, and hence so is $\mathcal{L}(U,V;\lambda,\tau)$ .

An implicit assumption in the definitions of $(S,P)$ , and hence of $(U,V)$ , is that the Minkowski spacetime metric is the standard Minkowski metric (with a “mostly plus” signature). To generalize to curvilinear coordinates $\{x^{\mu};\mu=0,1,2,3\}$ , we have only to define $(S,P)$ as the scalar fields

S=-\frac{1}{4}\,{\rm g}^{\mu\rho}{\rm g}^{\nu\sigma}\,F_{\mu\nu}F_{\rho\sigma}% \,,\qquad P=-\frac{1}{8\sqrt{|{\rm g}|}}\varepsilon^{\mu\nu\rho\sigma}F_{\mu% \nu}F_{\rho\sigma}\,,

(2.5)

where ${\rm g}$ is the Minkowski metric in the chosen coordinates (with $|{\rm g}|=-\det{\rm g}$ ) and $F=dA$ is the 2-form abelian field strength for 1-form potential $A$ on the Minkowski spacetime. It then follows that $(U,V)$ are scalars and hence so is $\tau$ and $\ell(\tau)$ , and the equations of (1.8) still apply but with $\mathcal{L}$ a scalar rather than a scalar density. With this understood, (2.4) is unchanged but the Lagrangian scalar density is now

\mathfrak{L}:=\sqrt{|{\rm g}|}\,\mathcal{L}=\sqrt{|\rm g|}\,\left[\ell(\tau)-% \lambda\tau\right]+\left[\frac{\lambda-2\dot{\ell}}{\dot{\ell}^{2}}\right]% \mathcal{U}+\lambda\mathcal{V}\,,

(2.6)

where $(\mathcal{U},\mathcal{V})$ are the scalar densities $\sqrt{|{\rm g}|}\,(U,V)$ , which are related to the scalar densities $(\mathcal{S},\mathcal{P})=\sqrt{|{\rm g}|}\,(S,P)$ in the same way that $(U,V)$ are related to $(S,P)$ .

We are not restricted to Minkowski spacetime; by re-interpreting the metric ${\rm g}$ as an arbitrary spacetime metric that can be freely varied, we can find the stress-energy tensor $\mathcal{T}$ by the Hilbert formula

\mathcal{T}_{\mu\nu}=-\frac{2}{\sqrt{|{\rm g}|}}\frac{\partial\mathfrak{L}}{% \partial{\rm g}^{\mu\nu}}\,.

(2.7)

Since $\mathcal{P}$ is metric independent, this formula yields

\mathcal{T}_{\mu\nu}=(\ell-\lambda\tau){\rm g}_{\mu\nu}+\left\{\lambda\frac{% \partial\mathcal{V}}{\partial\mathcal{S}}+\left[\frac{\lambda-2\dot{\ell}}{% \dot{\ell}^{2}}\right]\frac{\partial\mathcal{U}}{\partial\mathcal{S}}\right\}% \mathcal{T}^{\rm Max}_{\mu\nu}\,,

(2.8)

where

\mathcal{T}^{\rm Max}_{\mu\nu}:=-\frac{2}{\sqrt{|{\rm g}|}}\frac{\partial% \mathcal{S}}{\partial{\rm g}^{\mu\nu}}\,,

(2.9)

which is the Maxwell stress-energy tensor. Using

\frac{\partial\mathcal{V}}{\partial\mathcal{S}}=\frac{\partial V}{\partial S}=% \frac{V}{U+V}\,,\qquad\frac{\partial\mathcal{U}}{\partial\mathcal{S}}=\frac{% \partial U}{\partial S}=-\frac{U}{U+V}\,,

(2.10)

and the auxiliary-field equations, we can simplify this result to⁴⁴4We thank Dmitri Sorokin for pointing out an error in the stress-energy tensor formula appearing in the original arXiv version of [17].

\mathcal{T}_{\mu\nu}=\left[\frac{\tau\dot{\ell}}{U+V}\right]\mathcal{T}^{\rm Max% }_{\mu\nu}+(\ell-\tau\dot{\ell}){\rm g}_{\mu\nu}\,,\qquad\left(\tau=V+\dot{% \ell}^{-2}U\right).

(2.11)

This agrees with the result of [30]; the novelty here is that we have taken as our starting point the auxiliary-field formulation (2.6) for the Lagrangian density of a generic self-dual NLED in a general spacetime.

3 The self-dual NLED Hamiltonian

We have seen in the Introduction that the Hamiltonian density for the general self-dual and Lorentz invariant NLED may be expressed in terms of a one-variable CH function $\mathfrak{h}(\sigma)$ via the equations of (1.21). This was by analogy to the equations of (1.8) for the Lagrangian density, and the same steps may be used here to verify it. By taking the exterior derivative of both sides of both equations of (1.21) we find two equations that are jointly equivalent to

d\mathcal{H}=\mathfrak{h}^{\prime}dv+\frac{du}{\mathfrak{h}^{\prime}}\,,\qquad% \tilde{G}d\sigma=\mathfrak{h}^{\prime}\left[(\mathfrak{h}^{\prime})^{2}dv-du% \right]\,,

(3.1)

where $\tilde{G}$ is the Hamiltonian analog of the function $G$ of (1.12):

\tilde{G}=(\mathfrak{h}^{\prime})^{3}-2u\mathfrak{h}^{\prime}{}^{\prime}\,.

(3.2)

We shall see later that causality requires $\tilde{G}>0$ , with consequences analogous to those that follow from $G>0$ .

The first equation of (3.1) tells us that

\mathcal{H}_{v}=\mathfrak{h}^{\prime}\,,\qquad\mathcal{H}_{u}=1/\mathfrak{h}^{% \prime}\,,

(3.3)

and hence that $\mathcal{H}$ solves (1.19). Notice that any constant term in $\mathfrak{h}(\sigma)$ , which makes no contribution to $\mathfrak{h}^{\prime}(\sigma)$ , appears only as a constant term in $\mathcal{H}$ ; it represents a constant uniform background energy density that has no effect on the NLED field equations.

Our first task will be to determine the relation between the functions $\ell$ and $\mathfrak{h}$ implied by Legendre duality of the Lagrangian and Hamiltonian densities. The existence of this duality is guaranteed by the convexity of $\mathcal{L}$ as a function of ${\bf E}$ , the fact that the Legendre transform of any function is convex, and the theorem that the Legendre transform is an involution when acting on convex functions. For ${\bf B}={\bf 0}$ , this transform is

	$\displaystyle\mathcal{L}({\bf E},{\bf 0})$	$\displaystyle=\sup_{\bf D}\left\{{\bf D}\cdot{\bf E}-\mathcal{H}({\bf D},{\bf 0% })\right\}\,,$		(3.4)
	$\displaystyle\mathcal{H}({\bf D},{\bf 0})$	$\displaystyle=\sup_{\bf E}\left\{{\bf E}\cdot{\bf D}-\mathcal{L}({\bf E},{\bf 0% })\right\}\,.$		(3.4)

When ${\bf B}={\bf 0}$ we also have

	$\displaystyle(U,V)$	$\displaystyle=(0,\tau)\,,\qquad\tau=\frac{1}{2}E^{2}\,,$		(3.5)
	$\displaystyle(u,v)$	$\displaystyle=(0,\sigma)\,,\qquad\sigma=\frac{1}{2}D^{2}\,,$		(3.5)

and hence, from (1.8) and (1.21),

	$\displaystyle\mathcal{L}({\bf E},{\bf 0})$	$\displaystyle=\ell(\tau)=L(E)\,,$		(3.6)
	$\displaystyle\mathcal{H}({\bf D},{\bf 0})$	$\displaystyle=\mathfrak{h}(\sigma)=H(D)\,,$		(3.6)

where $L$ and $H$ are the functions introduced in (1.22). Combining this with (3.4), we have

	$\displaystyle L(E)$	$\displaystyle=\sup_{\bf D}\left\{{\bf D}\cdot{\bf E}-H(D)\right\}\,,$		(3.7)
	$\displaystyle H(D)$	$\displaystyle=\sup_{\bf E}\left\{{\bf E}\cdot{\bf D}-L(E)\right\}\,.$		(3.7)

Notice that although $L(E)$ and $H(D)$ were defined in (1.22) as functions of a single variable (respectively, $E=\sqrt{2\tau}$ and $D=\sqrt{2\sigma}$ ), we are required by (3.7) to consider them as functions of ${\bf E}$ and ${\bf D}$ , respectively. In contrast, the claim in the Introduction that $L$ and $H$ are each other’s Legendre transform is the claim that

	$\displaystyle L(E)$	$\displaystyle=\sup_{D}\left\{DE-H(D)\right\}\,,$		(3.8)
	$\displaystyle H(D)$	$\displaystyle=\sup_{E}\left\{ED-L(E)\right\}\,.$		(3.8)

However, it is not difficult to see that (3.7) implies (3.8). Variation of ${\bf D}$ and ${\bf E}$ in the respective expressions of (3.7) for $L(E)$ and $H(D)$ yields

	$\displaystyle{\bf E}$	$\displaystyle=\left(\frac{1}{D}\frac{\partial H}{\partial D}\right){\bf D}=% \mathfrak{h}^{\prime}(\sigma){\bf D}\quad\Rightarrow\quad{\bf D}\cdot{\bf E}=% DE\,,$		(3.9)
	$\displaystyle{\bf D}$	$\displaystyle=\left(\frac{1}{E}\frac{\partial L}{\partial E}\right){\bf E}=% \dot{\ell}(\tau){\bf E}\qquad\Rightarrow\quad{\bf E}\cdot{\bf D}=ED\,,$		(3.9)

and a further implication is

E=\mathfrak{h}^{\prime}(\sigma)D\,,\qquad D=\dot{\ell}(\tau)E\,,

(3.10)

which is exactly what one finds from variation of $D$ and $E$ in the expressions of (3.8) for $L(E)$ and $H(D)$ , respectively. The variation of 3-vector fields needed to find the functions $L$ and $H$ from (3.7) therefore yields the same result as the variation of scalar fields in (3.8).

Further implications of (3.10) are the relations⁵⁵5A relation similar to (3.11) appears in [31] in relation to an involution defined in the context of 6D chiral electrodynamics.

\dot{\ell}(\tau)\mathfrak{h}^{\prime}(\sigma)=1\,,

(3.11)

and

\sigma=\tau\dot{\ell}^{2}\,,\qquad\tau=\sigma(\mathfrak{h}^{\prime})^{2}\,.

(3.12)

These relations allow us to find $\mathfrak{h}(\sigma)$ (up to the addition of a constant) given $\dot{\ell}(\tau)$ , and vice versa.

The fact that functions $\mathfrak{h}$ and $\ell$ are related by Legendre transformations, but with respect to variable $\sqrt{2\tau}$ and $\sqrt{2\sigma}$ , can be summarized by the equations

\ell(\tau)+\mathfrak{h}(\sigma)=2\tau\dot{\ell}=2\sigma\mathfrak{h}^{\prime}\,.

(3.13)

The second equality is equivalent to (3.12) given (3.11). The first equality tells us that any constant term in the power-series expansion of $\ell(\tau)$ also appears in the power-series expansion of $\mathfrak{h}(\sigma)$ with the opposite sign, while the remaining information of this equality may be verified by taking the exterior derivative of both sides to get

\mathfrak{h}^{\prime}d\sigma=(\dot{\ell}+2\tau\ddot{\ell})d\tau\,.

(3.14)

Using (3.11), we may rewrite this as $d\sigma=d\left(\tau\dot{\ell}^{2}\right)$ , which is true as a consequence of the relation $\sigma=\tau\dot{\ell}^{2}$ of (3.12).

There are other differences between the Lagrangian and Hamiltonian formulations of self-dual NLED theories that go beyond sign changes. One is the difference in the potential range of the independent variables: although $\tau$ is non-negative by its definition in (1.8), the definition of $\sigma$ in (1.21), which we may rewrite as

\sigma=(v-u)+\frac{[(\mathfrak{h}^{\prime})^{2}-1]u}{(\mathfrak{h}^{\prime})^{% 2}}\,,

(3.15)

allows $\sigma$ to be negative. For $\mathfrak{h}^{\prime}=1$ , which is the free-field (Maxwell) case, $\sigma=v-u\geq 0$ , and this remains true for all causal NLED theories that have Maxwell as their weak-field limit. This is easily seen by writing the second equation in (1.21) as

f(\sigma):=(\mathfrak{h}^{\prime})^{2}(\sigma)=\frac{u}{v-\sigma}:=g(\sigma)\,.

(3.16)

The function $f(\sigma)$ is positive, with $f(0)=1$ (because the weak-field limit is Maxwell). It is also a monotonically decreasing function of $\sigma$ (because $\mathfrak{h}^{\prime}>0$ but $\mathfrak{h}^{\prime\prime}<0$ for any causal interacting NLED). The function $g(\sigma)$ takes the value $u/v\leq 1$ at $\sigma=0$ but then increases monotonically, becoming infinite at $\sigma=v$ , and then negative. There is therefore a unique non-negative value of $\sigma$ at which $f=g$ (as illustrated in fig. 4a below).

In contrast, if the weak-field limit is ModMax (with $\gamma>0$ ) then $f(0)<1$ . This means that there will be a choice of $(u,v)$ such that $f(0)<g(0)$ , which implies that $f=g$ for $\sigma<0$ , as illustrated in fig. 4b. In these cases $\sigma<0$ is not excluded by its definition in (1.21); instead the inequality $\sigma\geq 0$ is a restriction on the domain of $\mathcal{H}$ required by equivalence to the Lagrangian formulation. Specifically, it restricts the Hamiltonian fields to the region in field-space for which $\mathcal{H}(u,v)$ is a convex function of ${\bf D}$ ; i.e. to its “convex domain”. For ModMax, the boundary of this convex domain corresponds to Lagrangian fields with $U=V=0$ , which includes all exact plane-wave solutions of the ModMax field equations[24].

3.1 Convexity/Concavity and Causality

In the Lagrangian formalism, and assuming the existence of a weak-field limit, the necessary and sufficient conditions for causality are the conditions for convexity of $\mathcal{L}$ , which are equivalent to [17]

\dot{\ell}(\tau)\geq 1\ ,\qquad\ddot{\ell}\geq 0\,.

(3.17)

By combining the first of these inequalities with the relation (3.11) we deduce that

0<\mathfrak{h}^{\prime}\leq 1\,.

(3.18)

Next, we take the exterior derivative of the first of the relations in (3.11) to find, again using (3.11), that

\dot{\ell}^{2}\mathfrak{h}^{\prime}{}^{\prime}=-\left(\frac{d\tau}{d\sigma}% \right)\ddot{\ell}\,.

(3.19)

Taking the exterior derivative of the first of the relations of (3.12), we also find that

\frac{d\tau}{d\sigma}=\frac{1}{\dot{\ell}\left(\dot{\ell}+2\tau\ddot{\ell}% \right)}\,,

(3.20)

and hence that

-\mathfrak{h}^{\prime}{}^{\prime}=\frac{\ddot{\ell}}{\dot{\ell}^{3}\left(\dot{% \ell}+2\tau\ddot{\ell}\right)}\,.

(3.21)

Using both inequalities of (3.17), and the fact that $\tau$ is non-negative, we see that the right-hand side of this equation is non-negative, and hence

\mathfrak{h}^{\prime}{}^{\prime}\leq 0\,.

(3.22)

We have now found the ‘translation’ of the causality/convexity conditions (3.17) on $\ell(\tau)$ to the corresponding conditions to be satisfied by $\mathfrak{h}(\sigma)$ . They are

0<\mathfrak{h}^{\prime}\leq 1\,,\qquad\mathfrak{h}^{\prime}{}^{\prime}\leq 0\,.

(3.23)

The second of these equations is equivalent to the statement that $\mathfrak{h}(\sigma)$ is a concave function, and a corollary of this is that

\tilde{G}>0\,,

(3.24)

where $\tilde{G}$ was defined in (3.2). We postpone a discussion of the consequences of this corollary as we still need to explain the origin of the condition (1.24) of the Introduction.

By taking the exterior derivative on both sides of the second of the relations of (3.12), we get another formula for $d\tau/d\sigma$ :

\frac{d\tau}{d\sigma}=\mathfrak{h}^{\prime}(\mathfrak{h}^{\prime}+2\sigma% \mathfrak{h}^{\prime}{}^{\prime})\,.

(3.25)

Comparing this with (3.20), and again using (3.11), we find that

\big{(}\dot{\ell}+2\tau\ddot{\ell}\big{)}\big{(}\mathfrak{h}^{\prime}+2\sigma% \mathfrak{h}^{\prime}{}^{\prime}\big{)}=1\,.

(3.26)

The first factor on the left-hand side is positive, for reasons explained above. The second factor is not obviously positive, but is required to be so; this is the condition (1.24). To understand its significance, we return to the functions $L(E)$ and $H(D)$ . Because they are each other’s Legendre transform, we know that they are both convex; in fact strictly convex because $\mathcal{L}$ is a strictly convex function of ${\bf E}$ . Thus

0<\frac{\partial^{2}L(E)}{\partial E\partial E}=\dot{\ell}+2\tau\ddot{\ell}\,,% \qquad 0<\frac{\partial^{2}H(D)}{\partial D\partial D}=\mathfrak{h}^{\prime}+2% \sigma\mathfrak{h}^{\prime}{}^{\prime}\,.

(3.27)

This allows us to interpret (3.26) as the statement that the Hessian of $L(E)$ is the inverse of the Hessian of $H(D)$ . This requires, of course, that both Hessians are non-zero and finite, which is equivalent to the statement that both $L(E)$ and $H(D)$ are both strictly convex functions.

We now return to the significance of (3.24). We see from (1.21) that the curves of constant $\sigma$ in the $(u,v)$ -plane are straight lines. Only the half-lines in the ‘physical’ region of this plane are relevant; this is the wedge-shaped region bounded by the lines $u=0$ (the $v$ -axis) and the line $v=u$ (since $v\geq u\geq 0$ by definition). Because $\tilde{G}>0$ , no two lines of constant $\sigma$ can intersect in this region (for reasons identical to those explained in our discussion of section 2 for $G>0$ in the context of lines of constant $\tau$ in the positive $(U,V)$ quadrant). The lines of constant $\sigma$ therefore foliate either the entire physical region or some connected subregion of it.

From (1.21) we see that all lines of constant $\sigma$ intersect the $v$ -axis at $v=\sigma$ , which implies (since $v\geq 0$ ) that the lowest line is the one with $\sigma=0$ ; this confirms that $\sigma\geq 0$ is required for an equivalence of the Lagrangian and Hamiltonian formulations. The slope of the lines is

(dv/du)(\sigma)=1/[\mathfrak{h}^{\prime}(\sigma)]^{2}\geq 1\,,

(3.28)

where the inequality follows from the first causality condition of (3.23). The slope of the lowest line is therefore $1/[\mathfrak{h}^{\prime}(0)]^{2}$ . Assuming that $\mathfrak{h}(\sigma)$ has a power-series expansion about $\sigma=0$ (which is equivalent to the assumption of a weak-field limit) we conclude (omitting the irrelevant constant term in the expansion) that

\mathfrak{h}(\sigma)=e^{-\gamma}\sigma+\mathcal{O}(\sigma^{2})\,,

(3.29)

for some constant $\gamma\geq 0$ . The special case for which $\mathfrak{h}(\sigma)=e^{-\gamma}\sigma$ yields ModMax, with Maxwell as the free-field $\gamma=0$ subcase. For Maxwell, the lines of constant $\sigma$ foliate the entire wedge-shaped physical region in the $(u,v)$ -plane. For ModMax ( $\gamma>0$ ) they foliate the wedge-shaped subregion that is bounded from below by the $\sigma=0$ line, which is

v=e^{2\gamma}u\qquad(\sigma=0).

(3.30)

For both Maxwell and ModMax the lines of constant $\sigma$ are parallel because $h^{\prime}$ is constant. For BI, the slope increases as $\sigma$ increases because $\mathfrak{h}^{\prime}{}^{\prime}<0$ . These three cases are illustrated in Fig. 3.

For the examples that we consider in the following section, there is no maximum value of $\sigma$ , so the lines of constant $\sigma$ foliate the wedge-shaped region bounded by the positive $v$ -axis and the $\sigma=0$ line. We have found an example with an upper bound on $\sigma$ but we do not discuss it here.

A further implication of $\tilde{G}>0$ is that there is a Hamiltonian counterpart to (2.4). The two equations of (1.21) may be combined into the one equation

\mathcal{H}=\mathfrak{h}(\sigma)+\frac{2u}{\mathfrak{h}^{\prime}(\sigma)}-% \tilde{\lambda}\left(\sigma-v+\frac{u}{\left[\mathfrak{h}^{\prime}(\sigma)% \right]^{2}}\right)\,,

(3.31)

where $\tilde{\lambda}$ is a Lagrange multiplier imposing the constraint on $\sigma$ , but the fields $(\tilde{\lambda},\sigma)$ are an auxiliary pair. Varying $\sigma$ we get the equation $\tilde{G}(\tilde{\lambda}-\mathfrak{h}^{\prime})=0$ , which is equivalent to $\tilde{\lambda}=\mathfrak{h}^{\prime}$ when $\tilde{G}>0$ . Varying $\tilde{\lambda}$ we get the equation for $\sigma$ , which has a unique solution when $\tilde{G}>0$ for reasons identical to those explained in section 2 for $G>0$ in the context of the equation for $\tau$ . This is illustrated in Fig. 4. Elimination of the auxiliary fields in (3.31) therefore yields precisely the Hamiltonian density defined by (1.21).

As for the Lagrangian auxiliary-field formulation of (2.4), we can make parity assignments for the Hamiltonian auxiliary fields $(\tilde{\lambda},\sigma)$ such that the Hamiltonian density of (3.31) has even parity. Since both $u$ and $v$ are parity-even, this is achieved by assigning even parity to both $\tilde{\lambda}$ and $\sigma$ . A consequence of parity conservation is that the $U(1)\cong SO(2)$ electromagnetic duality group is enhanced to $O(2)$ because parity acts by the transformation ${\bf D}\to-{\bf D}$ .

3.2 Simple examples

We now illustrate the construction of the Hamiltonian from $\mathfrak{h}$ and the causality conditions on $\mathfrak{h}$ with a few examples.

ModMax

The ModMax Lagrangian and Hamiltonian densities are⁶⁶6Recall that $(u,v)$ as defined in this paper differ from the definitions in [24] by $u\leftrightarrow v$ . [24]

	$\displaystyle\mathcal{L}_{MM}$	$\displaystyle=e^{\gamma}V-e^{-\gamma}U=(\cosh\gamma)S+(\sinh\gamma)\sqrt{S^{2}% +P^{2}}\,,$		(3.32)
	$\displaystyle\mathcal{H}_{MM}$	$\displaystyle=e^{-\gamma}v+e^{\gamma}u=(\cosh\gamma)s-(\sinh\gamma)\sqrt{s^{2}% -p^{2}}\,.$		(3.32)

Maxwell is included as the special case with $\gamma=0$ . The Lagrangian function of one-variable for ModMax is $\ell=e^{\gamma}\tau+{\rm const.}$ [17] (but we may ignore the constant term as it has no effect on the field equations). The convexity/causality conditions of (1.13) require $\gamma\geq 0$ , as expected since $\mathcal{L}_{MM}$ is a convex function of ${\bf E}$ for $\gamma\geq 0$ but not for $\gamma<0$ . The function $\ell(\tau)=e^{\gamma}\tau$ corresponds to $L(E)=\frac{1}{2}e^{\gamma}E^{2}$ . Its Legendre transform is $H(D)=\frac{1}{2}e^{-\gamma}D^{2}$ , which yields

\mathfrak{h}(\sigma)=e^{-\gamma}\sigma\,.

(3.33)

Using this in (1.21) we have

\mathcal{H}=e^{-\gamma}\sigma+2ue^{\gamma}\ ,\qquad\sigma=v-e^{2\gamma}u\ ,

(3.34)

which gives us the ModMax Hamiltonian density.

As shown in [24], $\mathcal{H}_{MM}$ is a convex function of ${\bf D}$ for $\gamma>0$ only for those values of $({\bf D},{\bf B})$ for which

s\geq(\cosh\gamma)p\,.

(3.35)

Values of $({\bf D},{\bf B})$ violating this bound do not correspond to any values of $({\bf E},{\bf B})$ . In other words, the bound is needed for a correspondence between the Lagrangian and Hamiltonian formulations of ModMax. However, we have seen in section 3, for any self-dual NLED, that this correspondence exists iff $\sigma\geq 0$ . It follows that the ModMax convexity bound (3.35) must be equivalent to $\sigma\geq 0$ , and this conclusion is easily verified: from (1.21) we see that

\sigma\geq 0\quad\Leftrightarrow\quad v\geq e^{2\gamma}u\,,

(3.36)

but this constraint on the values of $(u,v)$ is equivalent to (3.35).

ModMaxBorn

The Born-Infeld-type generalization of ModMax, introduced in [24] in its Hamiltonian formulation, was called ModMaxBorn in [23, 17]. The ModMaxBorn Lagrangian density was found by Legendre transform in [25]. The Lagrangian and Hamiltonian densities are, respectively,

	$\displaystyle\mathcal{L}_{\rm MMB}$	$\displaystyle=-\sqrt{T^{2}-2T\mathcal{L}_{MM}-P^{2}}\,,$		(3.37)
	$\displaystyle\mathcal{H}_{\rm MMB}$	$\displaystyle=\sqrt{T^{2}+2T\mathcal{H}_{MM}+p^{2}}\,.$		(3.37)

The Born-Infeld theory is included as the $\gamma=0$ case. Here and in other examples to follow, there is a non-zero vacuum energy, which can be simply removed by the addition of a constant.

It was shown in [17] that $\mathcal{L}_{\rm MMB}$ is associated with the function

\ell_{\rm MMB}(\tau)=-\sqrt{T(T-2e^{\gamma}\tau)}=-T\left(1-\frac{2e^{\gamma}% \tau}{T}\right)^{\frac{1}{2}}\,.

(3.38)

From the results of section 3 we find that the corresponding function $\mathfrak{h}(\sigma)$ is

\mathfrak{h}_{\rm MMB}(\sigma)=\sqrt{T(T+2e^{-\gamma}\sigma)}=T\left(1+\frac{2% e^{-\gamma}\sigma}{T}\right)^{\frac{1}{2}}\,,

(3.39)

Using this result in (1.21) we recover the ModMaxBorn Hamiltonian density, and we find that

\sigma=\frac{T(v-e^{2\gamma}u)}{T+2e^{\gamma}u}\,.

(3.40)

For $\gamma>0$ this allows $\sigma<0$ , but for reasons already explained we must impose $\sigma\geq 0$ , which is again equivalent to the bound (3.35).

$q$ -deformed $\mathfrak{h}_{\rm MMB}$

Consider the following choice:

\mathfrak{h}=T\left(1+\frac{e^{-\gamma}\sigma}{qT}\right)^{q}\,,

(3.41)

for which

\mathfrak{h}^{\prime}=e^{-\gamma}\left(1+\frac{e^{-\gamma}\sigma}{qT}\right)^{% q-1}\,,\qquad\mathfrak{h}^{\prime\prime}=-\frac{e^{-2\gamma}(1-q)}{qT}\left(1+% \frac{e^{-\gamma}\sigma}{qT}\right)^{q-2}\,.

(3.42)

From this we see that the conditions $\mathfrak{h}^{\prime}\leq 1$ and $\mathfrak{h}^{\prime}{}^{\prime}\leq 0$ require $0<q\leq 1$ . We also have

\mathfrak{h}^{\prime}+2\sigma\mathfrak{h}^{\prime\prime}=e^{-\gamma}\left(1+% \frac{e^{-\gamma}\sigma}{qT}\right)^{q-2}\left(1+(2q-1)\frac{\sigma e^{-\gamma% }}{qT}\right)\,,

(3.43)

which is positive, as required, if $q\geq\frac{1}{2}$ . Therefore, this class of self-dual NLED theories is causal for

\frac{1}{2}\leq q\leq 1\,.

(3.44)

However, the Hamiltonian density can be found explicitly only for special values of $q$ in this range; for example $q=\frac{1}{2}$ , which yields Born-Infeld. Another special choice is $q=\tfrac{3}{4}$ , which will be discussed later.

3.3 Conformal Invariance Redux

The condition for conformal invariance of any Hamiltonian density $\mathcal{H}({\bf D},{\bf B})$ is degree-2 homogeneity in the electric and magnetic fields fields $({\bf D},{\bf B})$ . For a self-dual NLED with Hamiltonian density function $\mathcal{H}(u,v)$ this condition becomes degree-1 homogeneity in $(u,v)$ :

u\mathcal{H}_{u}+v\mathcal{H}_{v}=\mathcal{H}\,.

(3.45)

We recall here that our $(u,v)$ variables, defined in (1.18), differ (by the exchange $u\leftrightarrow v$ ) from those used in [24]. The general solution of this equation can be expressed in the form

\mathcal{H}=vf(x)\,,\qquad x:=u/v\,.

(3.46)

Notice that $u/v$ remains finite as $v\to 0$ since $u\leq v$ . The Lorentz invariance condition (1.19) then implies that $f^{\prime}(f-xf^{\prime})=1$ . This equation is solved by (i) any linear function of $x$ and (ii) $f=\pm\sqrt{4x}$ , which yield the following solutions for $\mathcal{H}$ [24]:

(i):\quad\mathcal{H}=\tilde{a}v+\tilde{a}^{-1}u\,,\qquad(ii):\quad\mathcal{H}=% \pm\sqrt{4uv}\,.

(3.47)

The first of these is ModMax if $\tilde{a}=e^{-\gamma}$ with $\gamma\geq 0$ . The second solution defines (for positive sign) the Bialynicki-Birula (BB) electrodynamics theory [5, 6], which has no weak-field limit. ModMax is therefore the unique interacting causal ‘extension’ of Maxwell electrodynamics with the same symmetries [24].

The condition for conformal invariance of any Lagrangian density $\mathcal{L}(S,P)$ is the homogeneity condition

S\mathcal{L}_{S}+P\mathcal{L}_{P}=\mathcal{L}\,.

(3.48)

Any function linear in $(S,P)$ will satisfy this relation, but this does not include ModMax. If parity is assumed then, as observed in the Introduction, one may replace the variables $(S,P)$ by $(S,\Phi)$ (we recall that $\Phi=\sqrt{S^{2}+P^{2}}$ ). The homogeneity condition is then solved by any function linear in $S$ and $\Phi$ , and self-duality selects the particular linear function that is the ModMax Lagrangian density, found originally by Legendre transform of the ModMax Hamiltonian density [24]. This observation was made in [32], but it does not exclude the possibility of other conformal self-dual NLED theories for which $\mathcal{L}$ is a nonlinear homogeneous function of $(S,\Phi)$ ; for this we need a general solution to the homogeneity condition (3.48).

One might expect to be able to express the general solution to (3.48) in terms of an arbitrary function $f$ of one dimensionless ratio of functions of $(S,P)$ , by analogy to the general solution of (3.46) to the Hamiltonian homogeneity condition (3.45). However, the fact that both $S$ and $P$ may have either sign, and may be zero for non-vacuum field configurations, prevents it. For example, the formula $\mathcal{L}=\sqrt{SP}f(\sqrt{S/P})$ was suggested in [33] but even $\mathcal{L}=S$ cannot be written in this form when $S<0$ . The alternative formula $\mathcal{L}=Sf(P/S)$ , suggested in [24], has a similar problem with $\mathcal{L}=\sqrt{S^{2}+P^{2}}$ . If parity invariance is assumed then we may use the variables $(U,V)$ , in which case (3.48) is replaced by

V\mathcal{L}_{V}+U\mathcal{L}_{U}=\mathcal{L}\,

(3.49)

In this case we could attempt to solve the homogeneity condition by setting $\mathcal{L}(U,V)=Vf(U/V)$ . This is the natural Lagrangian analog of (3.46), and imposing the self-duality condition leads formally to $f^{\prime}(f-xf^{\prime})=-1$ ; the different sign on the right-hand side now allows only a linear function of $x$ , which again leads uniquely to ModMax. However this is still unsatisfactory because $U/V$ is generically infinite at $V=0$ , so the initial expression for $\mathcal{L}$ is not well-defined for all $(U,V)$ .

It appears that the only way to establish directly that the ModMax Lagrangian density is the unique possibility compatible with conformal invariance and self-duality is to first solve the self-duality condition, e.g. as in (1.8). We then impose the homogeneity condition (3.49). Using (1.10), this leads to

V\dot{\ell}-\frac{U}{\dot{\ell}}=\ell-\frac{2U}{\dot{\ell}}

(3.50)

and hence

\ell=\left(V+\frac{U}{\dot{\ell}^{2}}\right)\dot{\ell}=\tau\dot{\ell}

(3.51)

where the last equality uses the definition of $\tau$ in (1.8). We thus arrive at the conclusion, for self-dual NLED, that $\mathcal{L}$ will satisfy the homogeneity condition (3.49) iff $\ell$ satisfies the homogeneity condition

\tau\dot{\ell}(\tau)=\ell(\tau)\,.

(3.52)

The general solution is $\ell(\tau)=a\tau$ for constant $a$ . Causality restricts to $a\geq 1$ , which yields ModMax.

4 Hamiltonian without Legendre transform

The solution (1.8) to the self-duality PDE (1.7) results from a choice of boundary conditions on the $U=0$ boundary of the positive $(U,V)$ quadrant: $\mathcal{L}(0,V)=\ell(V)$ . However, we could equally well choose initial conditions on the $V=0$ boundary of the positive $(U,V)$ quadrant; i.e. $\mathcal{L}(U,0)=-m(U)$ for some new one-variable function $m$ (the minus sign is included for later convenience). The solution analogous to (1.8) is then

\mathcal{L}(U,V)=-m(\kappa)+\frac{2V}{\dot{m}(\kappa)}\,,\qquad\kappa=U+\frac{% V}{\dot{m}^{2}(\kappa)}\,,

(4.1)

where

\dot{m}(\kappa):=\frac{dm(\kappa)}{d\kappa}\,>\,0\,.

(4.2)

For the identity function $m(\kappa)=\kappa$ these equations yield $\mathcal{L}=V-U=S$ . To verify that they yield a solution for arbitrary $m(\kappa)$ , we proceed as before by taking the differential of both sides of both equations to find that

d\mathcal{L}=\frac{1}{\dot{m}}dV-\dot{m}dU\,,\qquad(\dot{m}^{3}+2\ddot{m}V)d% \kappa=\dot{m}\left(dV+\dot{m}^{2}dU\right).

(4.3)

From the first of these equations we have

\mathcal{L}_{U}=-\dot{m}\,,\qquad\mathcal{L}_{V}=1/\dot{m}\,,

(4.4)

and hence $\mathcal{L}_{U}\mathcal{L}_{V}=-1$ , as required.

We now have two different ways in which the Lagrangian density function $\mathcal{L}(U,V)$ of any given self-dual NLED theory can be constructed from an associated one-variable function; in one case we call the function $\ell(\tau)$ and in the other case we call it $m(\kappa)$ . By comparing (4.4) with (1.10) we see that these two functions are such that⁷⁷7Recall that $\dot{\ell}=d\ell/d\tau$ and $\dot{m}=dm/d\kappa$ .

\dot{\ell}(\tau)\dot{m}(\kappa)=1\,.

(4.5)

Using this relation, a comparison of the equation (1.8) for $\tau$ with equation (4.1) for $\kappa$ provides an equation for $\tau$ as a function of $\kappa$ , and vice versa:

\tau=\kappa\dot{m}^{2}(\kappa)\,,\qquad\kappa=\tau\dot{\ell}^{2}(\tau)\,.

(4.6)

If we use the relations (4.5) and (4.6) in the equation of (4.1) for $\kappa$ we deduce that

\tau=V+\frac{U}{\dot{\ell}^{2}}\,,

(4.7)

which is the equation for $\tau$ of (1.8). Since the equations for the auxiliary variable ( $\tau$ or $\kappa$ ) are equivalent in both solutions of the self-duality PDE, which yield the same Lagrangian density function, it follows that

\ell(\tau)-\frac{2U}{\dot{\ell}(\tau)}=-m(\kappa)+\frac{2V}{\dot{m}(\kappa)}

(4.8)

or, equivalently,

\ell(\tau)+m(\kappa)=2\left[\dot{m}U+\dot{\ell}V\right]\,.

(4.9)

A surprising feature of this ‘dual’ description of the Lagrangian density $\mathcal{L}(U,V)$ of a self-dual NLED is that the new one-variable function $m$ is same as the one-variable Hamiltonian function $\mathfrak{h}(\sigma)$ ! This can be seen as follows: replacing $m(\kappa)$ by $\mathfrak{h}(\sigma)$ in (4.5) and (4.6) we get precisely the relations that determine $\mathfrak{h}(\sigma)$ in terms of $\ell(\tau)$ , and vice versa. Furthermore, we know how the functions $\ell(\tau)$ and $\mathfrak{h}(\sigma)$ are related (by a Legendre transform in terms of the variables $\sqrt{2\tau}$ and $\sqrt{2\sigma}$ ), so $\ell$ and $m$ are related in the same way, which is

\ell(\tau)+m(\kappa)=2\kappa\dot{m}=2\tau\dot{\ell}\,.

(4.10)

Comparing this with (4.9) we recover the equations for $\kappa$ and for $\tau$ in terms of $(U,V)$ .

Returning to (4.1), let us replace $m(\kappa)$ by $\mathfrak{h}(\sigma)$ , since they are the same function, and then relabel the independent variables of the function $\mathcal{L}$ as follows:

(U,V)\to(v,-u)\,.

(4.11)

We then get

-\mathcal{L}(v,-u)=\mathfrak{h}(\sigma)+\frac{2u}{\mathfrak{h}^{\prime}}\,,% \qquad\sigma=v-\frac{u}{(\mathfrak{h}^{\prime})^{2}}\,.

(4.12)

Comparison with (1.21) shows that $\mathcal{H}(u,v)$ is the same function as $-\mathcal{L}(v,-u)$ . Explicitly, given any Lagrangian density $\mathcal{L}(U,V)$ , we may find its Legendre dual Hamiltonian density $\mathcal{H}(u,v)$ by the following procedure:

\mathcal{L}(U,V)\ \ \longrightarrow\ \ -\mathcal{L}(v,-u)=\mathcal{H}(u,v)\ ,

(4.13)

Given any Hamiltonian density $\mathcal{H}(u,v)$ we can similarly find its Legendre-dual Lagrangian density $\mathcal{L}(U,V)$ :

\mathcal{H}(u,v)\ \ \longrightarrow\ \ -\mathcal{H}(-V,U)=\mathcal{L}(U,V)\ .

(4.14)

Notice that the change of variables (4.11) implies that

V-U\to-(v+u)\,,\qquad V+U\to v-u\,.

(4.15)

Using the expressions given in the Introduction for $(U,V)$ in terms of $(S,P)$ , and $(u,v)$ in terms of $(s,p)$ , we deduce that

S\to-s\,,\qquad\Phi\to\varphi\,.

(4.16)

For Maxwell, for example, we get $\mathcal{H}=-\mathcal{L}(s)=s$ , as expected. More generally, once $\mathcal{L}$ is expressed in the form $\mathcal{L}(S,\Phi)$ we get $\mathcal{H}$ in the form $\mathcal{H}(s,\varphi)$ by the following procedure:

\boxed{\mathcal{L}(S,\Phi)\ \ \longrightarrow\ \ -\mathcal{L}(-s,\varphi)=% \mathcal{H}(s,\varphi)}\ ,

(4.17)

which is the boxed equation (1.25) of the Introduction. The converse formula is

\boxed{\mathcal{H}(s,\varphi)\ \ \longrightarrow\ \ -\mathcal{H}(-S,\Phi)=% \mathcal{L}(S,\Phi)}\ .

(4.18)

As we shall see in the following section, these results enormously simplify the task of finding the Hamiltonian density associated to any known Lagrangian density of a self-dual NLED, and vice versa.

4.1 Further examples

No maximum- $\tau$ case

This possibility was illustrated in [17] by the choice $\ell=T(1+2e^{\gamma}\tau/(3T))^{\frac{3}{2}}$ which is defined for all $\tau\geq 0$ and satisfies the causality conditions of (1.13). The corresponding Lagrangian density is

\mathcal{L}=\sqrt{2}\,T\left(1+\frac{2e^{\gamma}V}{3T}-\frac{\Delta}{2}\right)% \sqrt{1+\frac{2e^{\gamma}V}{3T}+\Delta}\,,

(4.19)

with $\Delta=\sqrt{\left(1+\frac{2e^{\gamma}V}{3T}\right)^{2}+8e^{-\gamma}\frac{U}{3% T}}$ . As expected, it is defined in the entire positive $(U,V)$ quadrant, and it reduces to ModMax with coupling constant $\gamma$ in the weak-field limit.

The function $\mathfrak{h}$ for this case is

\mathfrak{h}=\frac{2}{\sqrt{3}}\,\sqrt{\frac{e^{-\gamma}T}{\sigma}(\Lambda-1)}% \left(\sigma-\frac{3}{8}e^{\gamma}T(1+\Lambda)\right)\ ,\qquad\Lambda=\sqrt{1+% 8e^{-\gamma}\sigma/(3T)}\,.

(4.20)

The weak-field expansion is $\mathfrak{h}={\rm const.}+e^{-\gamma}\sigma+O(\sigma^{2})$ , as expected.

The standard method of computing the Hamiltonian as a Legendre transform of $\mathcal{L}$ leads to complicated equations. The Courant-Hilbert construction of the Hamiltonian based on (1.21) using the above $\mathfrak{h}(\sigma)$ also leads to complicated equations. However, the Hamiltonian can be immediately written down by using (4.13). This gives

\mathcal{H}=\sqrt{2}\,T\left(-1+\frac{2e^{\gamma}u}{3T}+\frac{\Delta^{\prime}}% {2}\right)\sqrt{1-\frac{2e^{\gamma}u}{3T}+\Delta^{\prime}}\,,

(4.21)

with $\Delta^{\prime}=\sqrt{\left(1-\frac{2e^{\gamma}u}{3T}\right)^{2}+8e^{-\gamma}% \frac{v}{3T}}$ .

Logarithmic self-dual electrodynamics

The choice $\ell=-e^{\gamma}T\log(1-\tau/T)$ yields the Lagrangian density [17]

\mathcal{L}=-e^{\gamma}(\Sigma_{0}-T)-e^{\gamma}T\log\left(\frac{e^{2\gamma}}{% 2U}(\Sigma_{0}-T)\right)\ ,

(4.22)

where

\Sigma_{0}=\sqrt{T^{2}+4e^{-2\gamma}U(T-V)}\ .

(4.23)

The corresponding $\mathfrak{h}$ -function is

\mathfrak{h}=Te^{\gamma}\left(M-1-\log\frac{1+M}{2}\right)\,,\qquad M=\sqrt{1+% \frac{4e^{-2\gamma}\sigma}{T}}\,.

(4.24)

As in the previous case, in order to obtain the Hamiltonian, one can circumvent the long calculation through a Legendre transform and directly make use of (4.13). This gives

\mathcal{H}=e^{\gamma}(\Sigma_{0}^{\prime}-T)+e^{\gamma}T\log\left(\frac{e^{2% \gamma}}{2v}(\Sigma_{0}^{\prime}-T)\right)\ ,

(4.25)

where

\Sigma_{0}^{\prime}=\sqrt{T^{2}+4e^{-2\gamma}v(T+u)}\ .

(4.26)

5 $\Phi$ -parity duality

The self-duality PDE (1.5) is invariant under the “ $\Phi$ -parity” transformation $\Phi\to-\Phi$ , which therefore takes a given solution $\mathcal{L}(S,\Phi)$ into its $\Phi$ -parity dual solution

\hat{\mathcal{L}}(S,\Phi)=\mathcal{L}(S,-\Phi)\ .

(5.1)

The Hamiltonian density $\hat{\mathcal{H}}(s,\varphi)$ corresponding to $\hat{\mathcal{L}}(S,\Phi)$ can therefore be found by using the formula (4.17):

\mathcal{L}(S,-\Phi)\ \longrightarrow\ -\mathcal{L}(-s,-\varphi)=\mathcal{H}(s% ,-\varphi)\,,

(5.2)

and hence

\hat{\mathcal{H}}(s,\varphi)=\mathcal{H}(s,-\varphi)\,.

(5.3)

Obviously, the new solution $\hat{\mathcal{L}}$ is the same as the old solution whenever $\mathcal{L}(S,\Phi)$ is $\Phi$ -parity invariant; in this class of NLED theories the weak-field expansion of $\mathcal{L}(S,\Phi)$ is a power-series expansion in $S$ and $P^{2}$ . Otherwise, $\hat{\mathcal{L}}\neq\mathcal{L}$ and both have a weak-field expansion in powers of $S$ and $\Phi$ , with odd powers of $\Phi$ that cannot be rewritten as a sum of positive powers of $S$ and $P^{2}$ .

The transformation $\Phi\to-\Phi$ for fixed $S$ is equivalent to

(U,V)\to-(V,U)\,.

(5.4)

Similarly, the transformation $\varphi\to-\varphi$ for fixed $s$ is equivalent to

(u,v)\to(v,u)\,.

(5.5)

The formula (5.2) connects $\hat{\mathcal{H}}(u,v)$ to the original Lagrangian density $\mathcal{L}$ . Using the version of this formula for $\mathcal{L}(U,V)$ , i.e. (4.13), we have

\hat{\mathcal{L}}(U,V)=\mathcal{L}(-V,-U)\longrightarrow-\mathcal{L}(u,-v)=% \hat{\mathcal{H}}(u,v)\,,

(5.6)

Similarly, applying (4.14) to the $\varphi$ -parity dual of $\mathcal{H}(u,v)$ yields

\hat{\mathcal{H}}(u,v)=\mathcal{H}(v,u)\ \longrightarrow\ -\mathcal{H}(U,-V)=% \hat{\mathcal{L}}(U,V)\,.

(5.7)

These results show how the Lagrangian and Hamiltonian densities of generic NLED theories are related to those of their $\Phi$ -parity duals. We now turn to the special class of (electromagnetic) self-dual NLED theories.

5.1 The $\Phi$ -parity dual of self-dual NLED

Using the CH constructions of $\mathcal{L}$ and $\hat{\mathcal{H}}$ in (5.6) we see that

-\ell(\tau)+\frac{2u}{\dot{\ell}(\tau)}=\hat{\mathfrak{h}}(\sigma)+\frac{2u}{[% \hat{\mathfrak{h}}^{\prime}](\sigma)}\,,

(5.8)

with

\tau=-v+\frac{u}{\dot{\ell}^{2}(\tau)}\,,\qquad\sigma=v-\frac{u}{[\hat{% \mathfrak{h}}^{\prime}]^{2}(\sigma)}\,.

(5.9)

These equations imply that

\hat{\mathfrak{h}}(\sigma)=-\ell(\tau)\,,\qquad\tau=-\sigma\qquad\left[% \Rightarrow\ \hat{\mathfrak{h}}^{\prime}(\sigma)=\dot{\ell}(\tau)\right]\,,

(5.10)

and hence that

\boxed{\ell(-\sigma)=-\hat{\mathfrak{h}}(\sigma)}\,.

(5.11)

Similarly, using the CH constructions for $\mathcal{H}$ and $\hat{\mathcal{L}}$ in (5.7), we conclude that

\boxed{\mathfrak{h}(-\tau)=-\hat{\ell}(\tau)}\,.

(5.12)

We mentioned in the Introduction that the existence of a weak-field limit implies that $\ell(\tau)$ is defined for $\tau<0$ , even though only its values for $\tau\geq 0$ are relevant to the CH formula for $\mathcal{L}(U,V)$ . Now we see from (5.11) that the function $\ell(\tau)$ for $\tau\leq 0$ is (minus) the one-parameter function $\hat{\mathfrak{h}}$ for the “ $\varphi$ -parity” dual of the Hamiltonian density $\mathcal{H}$ that is Legendre-dual to $\mathcal{L}$ . Similarly, we see from (5.12) that the function $\mathfrak{h}(\sigma)$ for $\sigma\leq 0$ is (minus) the one-parameter function $\hat{\ell}$ for the “ $\Phi$ -parity” dual of the Lagrangian density $\mathcal{L}$ . The general picture is illustrated in fig. 5, and we present some illustrative examples below.

For the special case of $\Phi$ -parity invariant theories, $\hat{\ell}=\ell$ and $\hat{\mathfrak{h}}=\mathfrak{h}$ and the two relations (5.11) and (5.12) reduce to the one relation

\boxed{\ell(\kappa)+\mathfrak{h}(-\kappa)=0\,,\quad\kappa\in\hbox{\mybb R}}\,.

(5.13)

Born-Infeld provides a simple example, and we return to a study of $\Phi$ -parity invariant self-dual NLED theories at the end of this section.

We now present examples that illustrate the general case.

Illustrative examples

Let us consider self-dual NLED theories defined by

\ell(\tau)=-T\left(1-\frac{e^{\gamma}\tau}{qT}\right)^{q}\ .

(5.14)

According to the formula (5.11), the $\mathfrak{h}$ -function of the $\Phi$ -dual theory is

\hat{\mathfrak{h}}(\sigma)=-\ell(-\sigma)=T\left(1+\frac{e^{\gamma}\sigma}{qT}% \right)^{q}\ .

(5.15)

For $\gamma=0$ this is the same as the “ $q$ -deformed” function of (3.41), which tells us that the case under consideration now is, for $\gamma=0$ , the $\Phi$ -parity dual of the “ $q$ -deformed” case of subsection (3.2)

For $q=1/2$ we have the ModMaxBorn theory. In this case

	$\displaystyle\ell_{\rm MMB}(\tau)$	$\displaystyle=\ -\sqrt{T^{2}-2e^{\gamma}T\tau}$		(5.16)
	$\displaystyle\mathfrak{h}_{\rm MMB}(\sigma)$	$\displaystyle=\sqrt{T^{2}+2e^{-\gamma}T\sigma}\,,$		(5.16)

but $\Phi$ -duality flips the sign of $\gamma$ , so that

	$\displaystyle\hat{\ell}_{\rm MMB}(\tau)$	$\displaystyle=\ -\sqrt{T^{2}-2e^{-\gamma}T\tau}$		(5.17)
	$\displaystyle\hat{\mathfrak{h}}_{\rm MMB}(\sigma)$	$\displaystyle=\sqrt{T^{2}+2e^{\gamma}T\sigma}\,,$		(5.17)

and both (5.11) and (5.12) are therefore satisfied. Obviously, in the BI ( $\gamma=0$ ) case there is no distinction between the dual (hatted) functions and the original functions, since BI theory is $\Phi$ -parity self-dual.

The $q=3/4$ case

The Lagrangian and Hamiltonian densities can also be found explicitly in the $q=3/4$ case, where

\ell(\tau)=-T\left(1-\frac{4e^{\gamma}\tau}{3T}\right)^{\frac{3}{4}}\ .

(5.18)

Using (1.8), we find

\mathcal{L}=-T\left(\Lambda-\frac{2e^{-\gamma}U}{3T}\right)^{\frac{1}{2}}\left% (\Lambda+\frac{4e^{-\gamma}U}{3T}\right)\,,

(5.19)

where

\Lambda=\sqrt{1-\frac{4Ve^{\gamma}}{3T}+\left(\frac{2e^{-\gamma}U}{3T}\right)^% {2}}\,.

(5.20)

Using the relations (3.11), (3.12), (3.13) we find the corresponding $\mathfrak{h}$ -function:

\mathfrak{h}(\sigma)=T\left(\sqrt{1+\frac{4e^{-2\gamma}\sigma^{2}}{9T^{2}}}+% \frac{4e^{-\gamma}\sigma}{3T}\right)\sqrt{\sqrt{1+\frac{4e^{-2\gamma}\sigma^{2% }}{9T^{2}}}-\frac{2e^{-\gamma}\sigma}{3T}}\,.

(5.21)

The Hamiltonian can now be found via the CH construction of (1.21), but it is much simpler to use (4.13) to obtain

\mathcal{H}=T\left(\Lambda^{\prime}-\frac{2e^{-\gamma}v}{3T}\right)^{\frac{1}{% 2}}\left(\Lambda^{\prime}+\frac{4e^{-\gamma}v}{3T}\right)\,,

(5.22)

where

\Lambda^{\prime}=\sqrt{1+\frac{4ue^{\gamma}}{3T}+\left(\frac{2e^{-\gamma}v}{3T% }\right)^{2}}\,.

(5.23)

Notice that $\hat{\mathfrak{h}}(\sigma)$ of (5.21) is different from $-\ell(-\sigma)$ of (5.18), even at $\gamma=0$ . This tells us that this theory is not $\Phi$ -parity self-dual, even at $\gamma=0$ . The one-parameter CH functions of the $\Phi$ -parity dual theory are easily found from (5.11) and (5.12):

\hat{\mathfrak{h}}=T\left(1+\frac{4e^{\gamma}\sigma}{3T}\right)^{\frac{3}{4}}\,,

(5.24)

\hat{\ell}(\tau)=-T\left(\sqrt{1+\frac{4e^{-2\gamma}\tau^{2}}{9T^{2}}}-\frac{4% e^{-\gamma}\tau}{3T}\right)\sqrt{\sqrt{1+\frac{4e^{-2\gamma}\tau^{2}}{9T^{2}}}% +\frac{2e^{-\gamma}\tau}{3T}}\,.

(5.25)

Comparing with the example (3.41) for $q=3/4$ , we note that $\Phi$ -duality has again flipped the sign of $\gamma$ ; for weak fields the $\Phi$ -parity dual theory becomes ModMax but with $\gamma\to-\gamma$ .

The $\Phi$ -dual Lagrangian $\hat{\mathcal{L}}$ and Hamiltonian $\hat{\mathcal{H}}$ can now be found by the maps (5.4), (5.5). Alternatively, they can be obtained from the CH construction using the above expressions for $\hat{\ell}$ and $\hat{\mathfrak{h}}$ . For example, from (1.21) and (5.24), we have the equations

\sigma+e^{2\gamma}u\sqrt{1+\frac{4e^{\gamma}\sigma}{3T}}-v=0\ .

(5.26)

This gives

\sqrt{1+\frac{4e^{\gamma}\sigma}{3T}}=-\frac{2e^{-\gamma}u}{3T}+\Sigma\ ,% \qquad\Sigma=\sqrt{1+\frac{4ve^{\gamma}}{3T}+\left(\frac{2e^{-\gamma}u}{3T}% \right)^{2}}\,,

(5.27)

which leads to

\hat{\mathcal{H}}=T\left(\Sigma+\frac{4e^{-\gamma}u}{3T}\right)\left(\Sigma-% \frac{2e^{-\gamma}u}{3T}\right)^{\frac{1}{2}}\,,

(5.28)

in accordance with the formula $\hat{\mathcal{H}}(u,v)=\mathcal{H}(v,u)$ .

5.2 The alternative CH construction

In addition to the construction of (1.8) that gives the Lagrangian density $\mathcal{L}$ of the general self-dual NLED in terms of a boundary function $\ell$ , Courant and Hilbert show that the solution to the partial differential equation (1.7) may also be expressed as [20]

\mathcal{L}(U,V)=\frac{V}{x}-xU+\omega(x)\ ,

(5.29)

where $\omega(x)$ is defined for positive dimensionless variable $x$ , which is determined implicitly by the equation

x\omega^{\prime}(x)=xU+\frac{V}{x}\ .

(5.30)

To verify this we take the differentials of both sides of (5.29). Simplifying the result by using (5.30) we find that

d\mathcal{L}=\frac{dV}{x}-xdU\,,

(5.31)

and hence that $\mathcal{L}_{U}\mathcal{L}_{V}=-1$ . We also see, by comparison with (1.10), that the relation of $\omega(x)$ to $\ell(\tau)$ must be such that $\dot{\ell}=1/x$ . In fact, the relation is given implicitly by

\ell(\tau)=\omega(x)+x\omega^{\prime}(x)\,,\qquad\tau=x^{2}\omega^{\prime}(x)\,,

(5.32)

from which we find that

\dot{\ell}=\left(2\omega^{\prime}+x^{2}\omega^{\prime}{}^{\prime}\right)\frac{% dx}{d\tau}=\frac{1}{x}\frac{d(x^{2}\omega^{\prime})}{dx}\frac{dx}{d\tau}=\frac% {1}{x}\,,

(5.33)

as expected. This alternative to the CH constructions described in the Introduction is useful when considering the implications of $\Phi$ -parity; conversely, consideration of $\Phi$ -parity yields insights into the relation between $\ell$ and $\omega$ that are in some respects similar to what we have already found for $\ell$ and $\mathfrak{h}$ .

Recall that $\Phi\to-\Phi$ is equivalent to $(U,V)\to-(V,U)$ . Applying this to (5.29) we find that the $\Phi$ -parity transform of $\mathcal{L}(U,V)$ is

\hat{\mathcal{L}}(U,V)=\ xV-\frac{U}{x}+\omega(x)

(5.34)

where $x$ is now determined by the equation

-x\omega^{\prime}(x)=xV+\frac{U}{x}\ .

(5.35)

If we now define a new variable $y$ and a new function $\hat{\omega}$ by

y:=\frac{1}{x}\,,\qquad\hat{\omega}(y):=\omega(x)\,,

(5.36)

then the equations (5.34) and (5.35) defining $\hat{\mathcal{L}}(U,V)$ become, respectively

\hat{\mathcal{L}}(U,V)=\frac{V}{y}-yU+\hat{\omega}(y)\ ,

(5.37)

and

y\hat{\omega}^{\prime}(y)=yU+\frac{V}{y}\ .

(5.38)

These are formally the same as the original equations (5.29) and (5.30) that define $\mathcal{L}(U,V)$ , but the function $\hat{\omega}$ determining $\hat{\mathcal{L}}$ is generally different from the function $\omega$ determining $\mathcal{L}$ , since $\hat{\omega}(x)=\omega(1/x)$ . In the following subsection we focus on the special class of $\Phi$ -parity invariant theories for which $\omega(x)=\omega(1/x)$ , and hence $\hat{\mathcal{L}}=\mathcal{L}$ .

There is also a CH construction of $\hat{\mathcal{L}}$ in terms of a function $\hat{\ell}$ , with a relation of $\hat{\ell}$ to $\hat{\omega}$ that is formally the same as the relation of $\ell$ to $\omega$ expressed by the equations of (5.32). The relation of $\ell$ to $\hat{\omega}$ is different, however. In terms of the new variable $y$ and the new function $\hat{\omega}$ , the equations of (5.32) become

\ell(\tau)=\hat{\omega}(y)+\tau y\,,\qquad\tau=-\hat{\omega}^{\prime}(y)\,.

(5.39)

This has a remarkably simple interpretation: it tells us that $\ell(\tau)$ is the Legendre transform of $-\hat{\omega}(y)$ with respect to $y$ :

-\hat{\omega}(y)=\sup_{\tau}\left\{y\tau-\ell(\tau)\right\}\,.

(5.40)

From the equation for $\tau$ in (5.39) we have

-\hat{\omega}^{\prime}{}^{\prime}(y)=\frac{d\tau}{dy}=-x^{2}\frac{d\tau}{dx}\,,

(5.41)

which is the inverse of $\ddot{\ell}$ (as $\dot{\ell}=1/x$ ); i.e.

\ddot{\ell}(\tau)\hat{\omega}^{\prime}{}^{\prime}(y)=-1\,.

(5.42)

This result tells us that $\hat{\omega}(y)$ is a strictly concave function iff $\ell(\tau)$ is a strictly convex function, as is required for causality, except that causality also allows $\ddot{\ell}=0$ , which is realized by ModMax and its Maxwell limit. These conformal NLED theories are therefore not obviously included in the alternative CH construction of self-dual NLED theories.

To better understand why ModMax and Maxwell are special cases, we observe that the converse of (5.42) is

-\hat{\omega}(y)=\sup_{\tau}\left\{y\tau-\ell(\tau)\right\}\,.

(5.43)

That is, $-\hat{\omega}(y)$ is the Legendre transform of $\ell(\tau)$ , with respect to $\tau$ (recall that $\mathfrak{h}(\sigma)$ , expressed as the function $H(\sqrt{2\sigma})$ , is its Legendre transform with respect to $\sqrt{2\tau}$ ).

For the choice $\ell(\tau)=e^{\gamma}\tau$ , we have

\hat{\omega}(y)=\sup_{\tau}\left\{\left(y-e^{\gamma}\right)\tau\right\}\,,

(5.44)

which is defined only for $y=e^{\gamma}$ , and is zero at this one point in its domain. Using this function in (5.34) yields the ModMax Lagrangian density, and Maxwell for $\gamma=0$ . Thus, (5.37) does include Modmax and Maxwell if the function $\hat{\omega}$ is defined in terms of the CH function $\ell$ , as in (5.44), and a similar (dual) statement applies if the function $\omega$ in (5.29) is defined as (minus) the Legendre transform of $\hat{\ell}$ .

One utility of the alternative CH construction described above is that new explicit examples of self-dual NLED theories can be found that would otherwise be difficult to find. This is illustrated by the following example.

Generalized Logarithmic NLED

We start from the function

\omega(x)=cT-\frac{T}{2}\left(e^{\gamma}x+\frac{1}{e^{\gamma}x}\right)+\eta T% \log(x)\ ,

(5.45)

where $c$ is a parameter that can be chosen to arrange for zero vacuum energy, and $\eta$ is a further real parameter. Using (5.29) we find the Lagrangian density:

\mathcal{L}=cT-\Sigma-\eta T\log\left(\frac{\Sigma-\eta T}{e^{\gamma}T+U}% \right)\ ,

(5.46)

with

\Sigma\equiv\sqrt{(T+2e^{-\gamma}U)(T-2e^{\gamma}V)+\eta^{2}T^{2}}\ .

(5.47)

For $\eta=0$ this reduces to ModMaxBorn.

Recalling again that $\Phi$ -parity takes $(U,V)$ to $-(V,U)$ , one sees from inspection that this generalized logarithmic NLED theory is $\Phi$ -parity invariant iff $\eta=0$ and $\gamma=0$ , in which case it reduces to Born-Infeld. For all other choices of these parameters the $\Phi$ -parity dual is found by changing the signs of both $\eta$ and $\gamma$ .

5.3 The general “ $\Phi$ -parity” invariant self-dual theory

Comparing (5.34) to (5.37) we see that $\hat{\mathcal{L}}=\mathcal{L}$ whenever $\hat{\omega}=\omega$ , i.e. whenever $\omega(1/x)=\omega(x)$ . In this case

\omega^{\prime}(x)=-\frac{1}{x^{2}}\omega^{\prime}(1/x)\,,

(5.48)

which implies that $\omega^{\prime}(1)=0$ . From the equation for $\tau$ in (5.32) we see that $x=1$ is equivalent to $\tau=0$ , so the weak-field expansion

\ell(\tau)=\tau+\frac{1}{2T}\tau^{2}+\mathcal{O}(\tau^{3})

(5.49)

must be equivalent to an expansion of $\omega(x)$ about $x=1$ . From the expressions for $\ell(\tau)$ and $\tau$ in (5.32), one finds that this expansion is

\omega(x)=-\frac{T}{2}(1-x)^{2}+\mathcal{O}[(1-x)^{3}]\,.

(5.50)

This result is a direct consequence of the fact that $\omega^{\prime}(1)=0$ and the identity (5.42) (for $\tau=(1-x)=0$ ). The corresponding weak-field expansion of $\mathcal{L}$ is

\mathcal{L}(S,\Phi)=S+\frac{\Phi^{2}}{2T}+\mathcal{O}(1/T^{2})\,.

(5.51)

A very simple choice for $\omega(x)$ that is manifestly invariant under $x\to 1/x$ is

\omega(x)=T-\frac{T}{2}\ \big{(}x+x^{-1}\big{)}\,.

(5.52)

In this case the solution of (5.30) for $x$ is

x=\sqrt{\frac{T-2U}{T+2V}}\,.

(5.53)

This yields the Born-Infeld theory. The weak-field expansions of $\omega$ and $\ell$ are exactly as above in this case; more generally, a rescaling of the parameter $T$ in (5.49) and (5.52) will be necessary.

It is a simple matter to write down other functions $\omega(x)$ that are invariant under $x\to 1/x$ , but any such function must also have the property that the equation (5.30) has a unique solution for $x$ , and we must also impose causality conditions. For example, the condition $\dot{\ell}\geq 1$ requires $x\leq 1$ . Another aid to separating the causal from the acausal NLED theories is the relation (5.42), which implies that $\hat{\omega}(y)$ is a concave function of $y$ whenever $\ell(\tau)$ is a convex function of $\tau$ , as required. The implications for $\omega(x)$ are generically not obvious but $\hat{\omega}=\omega$ for $\Phi$ -parity invariant theories, and therefore $\omega(x)$ must also be concave (for $x\leq 1$ ).

Consider, for example the following one-parameter generalization of Born-Infeld, defined by

\omega(x)=-\frac{T}{2}\left\{\left(x+\frac{1}{x}\right)+a\left(x+\frac{1}{x}% \right)^{2}\right\}\,,

(5.54)

where $a$ is a constant. One finds that

\omega^{\prime}(x)=T\left\{\frac{(1-x^{2})}{2x^{2}}+a\frac{(1-x^{4})}{x^{3}}\right\}

(5.55)

and

\omega^{\prime}{}^{\prime}(x)=-\frac{T}{x^{4}}\left(x+ax^{4}+3a\right)\,.

(5.56)

Using (5.55) in (5.30) we find the following equation for $x$ :

(T-2V)+\frac{2aT(1-x^{4})}{x}=(T+2U)x^{2}\,.

(5.57)

Inspection of the graphs of the functions of $x$ on both sides of this equation shows that a unique solution exists for all $(U,V)$ in the positive quadrant iff $a\geq 0$ . From (5.56) we see that this is also required for $\omega(x)$ to be a concave function for $0\leq x\leq 1$ . For $a>0$ we have a one parameter self-dual deformation of Born-Infeld that preserves $\Phi$ -parity invariance. As (5.57) is a quartic equation for $x$ , which has an explicit and unique solution for $0\leq x\leq 1$ , the Lagrangian density can still be found explicitly.

6 Legendre self-duality

So far we have considered the Lagrangian and Hamiltonian densities. In both cases, the NLED field equations may be written in first-order form as the “macroscopic Maxwell equations”

	$\displaystyle\dot{\bf D}=\bm{\nabla}\times{\bf H}$	$\displaystyle\,,\qquad\bm{\nabla}\cdot{\bf D}=0\,,$		(6.1)
	$\displaystyle\dot{\bf B}=-\bm{\nabla}\times{\bf E}$	$\displaystyle\,,\qquad\bm{\nabla}\cdot{\bf B}=0\,,$		(6.1)

together with constitutive relations which are either

{\bf D}=\partial{\mathcal{L}}/\partial{\bf E}\,,\qquad{\bf H}=-\partial{% \mathcal{L}}/\partial{\bf B}\,,

(6.2)

{\bf E}=\partial{\mathcal{H}}/\partial{\bf D}\,,\qquad{\bf H}=\partial{% \mathcal{H}}/\partial{\bf B}\,.

(6.3)

However, we may also specify the constitutive relations in terms of a ‘dual’ Hamiltonian density

\tilde{\mathcal{H}}({\bf E},{\bf H})=\sup_{\bf B}\left\{-{\bf B}\cdot{\bf H}-% \mathcal{L}\right\}\,,

(6.4)

in which case

{\bf D}=-\frac{\partial\tilde{\mathcal{H}}}{\partial{\bf E}}\,,\qquad{\bf B}=-% \frac{\partial\tilde{\mathcal{H}}}{\partial{\bf H}}\,,

(6.5)

or by a ‘dual’ Lagrangian density

\tilde{\mathcal{L}}({\bf D},{\bf H})=\sup_{({\bf E},{\bf B})}\left\{\mathcal{L% }-{\bf E}\cdot{\bf D}+{\bf B}\cdot{\bf H}\right\}\,,

(6.6)

in which case

{\bf E}=-\frac{\partial\tilde{\mathcal{L}}}{\partial{\bf D}}\,,\qquad{\bf B}=% \frac{\partial\tilde{\mathcal{L}}}{\partial{\bf H}}\,.

(6.7)

The possibility of a description in terms of one of four “fundamental functions” was observed by Born in [28] but here we use the more standard terminology of Bialynicki-Birula [5], except for sign changes to ensure that the addition of a constant to $\mathcal{L}$ implies the addition of the same constant to $\tilde{\mathcal{L}}$ , and its subtraction from both $\mathcal{H}$ and $\tilde{\mathcal{H}}$ .

Another of Born’s observations was that, for Born-Infeld, $\mathcal{L}$ and $\tilde{\mathcal{L}}$ are identical functions of their respective scalar variables, appropriately defined; this has been called “Legendre self-duality”. As mentioned in the Introduction, this was shown by Gaillard and Zumino to be a property of any (electromagnetically) self-dual NLED theory [18], and a later proof of Theisen and Kuzenko [29] showed that only a $Z_{2}$ electromagnetic duality was needed. The starting point of this proof was (in our notation) the Lagrangian density

\mathcal{L}(F,\tilde{A})=\mathcal{L}(F)-{\bf B}\cdot\tilde{\bf E}-{\bf E}\cdot% \tilde{\bf B}\,,

(6.8)

where $({\bf E},{\bf B})$ are the electric/magnetic components of $F$ , now an arbitrary 2-form field, and $(\tilde{\bf E},\tilde{\bf B})$ are the electric/magnetic components of $\tilde{F}=d\tilde{A}$ . The combined field equations found from varying both $F$ and $\tilde{A}$ are equivalent to those of $\mathcal{L}(F)$ for $F=dA$ (since variation of $\tilde{A}$ yields the equation $dF=0$ ). However, the equations found from varying $F$ , which are

\tilde{\bf B}=\frac{\partial\mathcal{L}(F)}{\partial{\bf E}}={\bf D}\,,\qquad% \tilde{\bf E}=\frac{\partial\mathcal{L}(F)}{\partial{\bf B}}=-{\bf H}\,,

(6.9)

may be used to eliminate $F$ ; this yields the dual Lagrangian density $\tilde{\mathcal{L}}(\tilde{F})$ . Theisen and Kuzenko show that the functions $\mathcal{L}(F)$ and $\tilde{\mathcal{L}}(\tilde{F})$ are the same for all NLED invariant under a discrete $Z_{2}$ electromagnetic duality transformation. To state this result in our notation, we observe that a further implication of the equations (6.9) is

	$\displaystyle\tilde{S}=$	$\displaystyle\ \frac{1}{2}\left(\|\tilde{\bf E}\|^{2}-\|\tilde{\bf B}\|^{2}\right)% \ \equiv-\frac{1}{2}\left(\|{\bf D}\|^{2}-\|{\bf H}\|^{2}\right)\,,$		(6.10)
	$\displaystyle\tilde{P}=$	$\displaystyle\ \tilde{\bf E}\cdot\tilde{\bf B}\ \equiv-{\bf D}\cdot{\bf H}\,.$		(6.10)

In other words, Legendre self-duality can be restated as the equivalence, for self-dual NLED, of the functions $\mathcal{L}(S,P)$ and $\tilde{\mathcal{L}}(\tilde{S},\tilde{P})$ , with $(\tilde{S},\tilde{P})$ defined in terms of $({\bf H},{\bf D})$ according to (6.10).

Here we show that this result follows directly from the definition of the dual Lagrangian density $\tilde{\mathcal{L}}$ whenever the Hamiltonian density is invariant under the $-\pi/2$ duality-rotation taking $({\bf D},{\bf B})$ to $({\bf B},-{\bf D})$ , which implies that

\mathcal{H}({\bf D},{\bf B})=\mathcal{H}({\bf B},-{\bf D})\,.

(6.11)

This is obviously a property of any self-dual NLED, but it also a property of some other NLED theories that are not self-dual.

We begin with the observation that

\mathcal{H}({\bf D},{\bf B})=\sup_{\bf E}\left\{{\bf D}\cdot{\bf E}-\mathcal{L% }({\bf E},{\bf B})\right\}\,.

(6.12)

This relation implies the following two relations

	$\displaystyle\mathcal{L}({\bf E},{\bf B})=$	$\displaystyle\ \sup_{\bf D}\left\{{\bf E}\cdot{\bf D}-\mathcal{H}({\bf D},{\bf B% })\right\}\,,$		(6.13)
	$\displaystyle\tilde{\mathcal{L}}({\bf D},{\bf H})=$	$\displaystyle\ \sup_{\bf B}\left\{{\bf H}\cdot{\bf B}-\mathcal{H}({\bf D},{\bf B% })\right\}\,.$		(6.13)

The first of these is just the inverse of (6.12). The second follows by using (6.12) to replace $\mathcal{H}$ on the right-hand side; this yields the definition of (6.6) for $\tilde{\mathcal{L}}$ . We see from these relations that both $\mathcal{L}$ and $\tilde{\mathcal{L}}$ are Legendre transforms of $\mathcal{H}({\bf D},{\bf B})$ , but one is with respect to the first 3-vector variable and the other is with respect to the second 3-vector variable. Let us now rewrite (6.13) more abstractly as

	$\displaystyle\mathcal{L}({\bf X},{\bf Y})=$	$\displaystyle\ \sup_{\bf Z}\left\{{\bf X}\cdot{\bf Z}-\mathcal{H}({\bf Z},{\bf Y% })\right\}\,,$		(6.14)
	$\displaystyle\tilde{\mathcal{L}}({\bf Y},-{\bf X})=$	$\displaystyle\ \sup_{\bf Z}\left\{{\bf X}\cdot{\bf Z}-\mathcal{H}({\bf Y},-{% \bf Z})\right\}\,,$		(6.14)

for 3-vectors $({\bf X},{\bf Y},{\bf Z})$ . From this, we see that the property (6.11) of any self-dual theory implies that

\mathcal{L}({\bf X},{\bf Y})=\tilde{\mathcal{L}}({\bf Y},-{\bf X})\,.

(6.15)

Given Lorentz invariance of the function $\mathcal{L}({\bf X},{\bf Y})$ on the left-hand side, it may be expressed as a function of the Lorentz scalars $|{\bf X}|^{2}-|{\bf Y}|^{2}$ and ${\bf X}\cdot{\bf Y}$ . The same is true of the right-hand side but with $(X,Y)$ replaced by $(Y,-X)$ , as we should expect from the minus signs in the definitions of $(\tilde{S},\tilde{P})$ in (6.10). We thus conclude that $\mathcal{L}(S,P)$ and $\tilde{\mathcal{L}}(\tilde{S},\tilde{P})$ are the same, as functions, for any self-dual NLED theory.

As a simple illustration of Legendre self-duality, we consider a class of NLED theories, introduced in [34], that may be defined by the following one-parameter family of Lagrangian densities in which $(a,b)$ are a pair of auxiliary fields:

\mathcal{L}_{\rm RT}=T-\frac{T}{2}\left[a+\frac{(1+b^{2})}{a}\right]+aS+\xi bP\,.

(6.16)

The family parameter is $\xi$ , which we may assume to be non-negative. For $\xi=1$ we have the Roček-Tseytlin (RT) formulation of Born-Infeld [35]; elimination of the auxiliary fields yields $\mathcal{L}_{\rm BI}(S,P)$ . For $\xi=0$ , we get the original Born theory [28]; the general case was discussed in detail in [23]. An advantage of this auxiliary-field formulation is that the 3-vector fields $({\bf D},{\bf H})$ are now linear functions of $({\bf E},{\bf B})$ :

\left(\begin{array}[]{c}{\bf D}\\ {\bf H}\end{array}\right)=\left(\begin{array}[]{cc}a&\xi b\\ -\xi b&a\end{array}\right)\left(\begin{array}[]{c}{\bf E}\\ {\bf B}\end{array}\right)\,.

(6.17)

This implies that

{\bf E}\cdot{\bf D}-{\bf B}\cdot{\bf H}=2(aS+\xi bP)\,,

(6.18)

and hence that the dual RT Lagrangian density is

\tilde{\mathcal{L}}_{\rm RT}=T-\frac{T}{2}\left[a+\frac{(1+b^{2})}{a}\right]-(% aS+\xi bP)\,,

(6.19)

but with $(S,P)$ expressed as functions of $(\tilde{S},\tilde{P})$ .

Using (6.10) and (6.17), it is straightforward to show that

\left(\begin{array}[]{c}S\\ P\end{array}\right)=\frac{1}{(a^{2}+\xi^{2}b^{2})^{2}}\left(\begin{array}[]{cc% }\xi^{2}b^{2}-a^{2}&2\xi ab\\ -2\xi ab&\xi^{2}b^{2}-a^{2}\end{array}\right)\left(\begin{array}[]{c}\tilde{S}% \\ \tilde{P}\end{array}\right)\,,

(6.20)

and hence that

aS+\xi bP=-\left(\tilde{a}\tilde{S}+\xi\tilde{b}\tilde{P}\right)\,,

(6.21)

where

\tilde{a}=\frac{a}{a^{2}+\xi^{2}b^{2}}\,,\qquad\tilde{b}=-\frac{b}{a^{2}+\xi^{% 2}b^{2}}\,.

(6.22)

This auxiliary-field redefinition is such that

a+\frac{(1+b^{2})}{a}=\tilde{a}+\frac{(f_{\xi}(\tilde{a},\tilde{b})+\tilde{b}^% {2})}{\tilde{a}}\,,

(6.23)

where

f_{\xi}(\tilde{a},\tilde{b})=1+(\xi^{2}-1)\tilde{b}^{2}\left[1-\frac{1}{\tilde% {a}^{2}+\xi^{2}\tilde{b}^{2}}\right]

(6.24)

We thus deduce for $\xi=1$ , that

\tilde{\mathcal{L}}^{(\xi=1)}_{\rm RT}=T+\frac{T}{2}\left[\tilde{a}+\frac{(1+% \tilde{b}^{2})}{\tilde{a}}\right]+\tilde{a}\tilde{S}+\tilde{b}\tilde{P}\,.

(6.25)

As this is formally the same as the $\xi=1$ case of (6.16), elimination of the auxiliary fields $(\tilde{a},\tilde{b})$ now yields the BI Lagrangian density but with $(S,P)$ replaced by $(\tilde{S},\tilde{P})$ ; i.e.

\tilde{\mathcal{L}}_{\rm BI}(\tilde{S},\tilde{P})=T-\sqrt{T^{2}-2T\tilde{S}-% \tilde{P}^{2}}\,,

(6.26)

which is formally identical to $\mathcal{L}_{\rm BI}(S,P)$ .

For all other values of $\xi$ , we have $\tilde{\mathcal{L}}_{\rm RT}\neq\mathcal{L}_{\rm RT}$ , but $\xi=0$ is a special case because then the only $\tilde{b}$ -dependence is via the $\tilde{b}^{2}$ term of $f_{\xi}$ , which implies that $f_{\xi}\to 1$ upon elimination of $\tilde{b}$ . Elimination of $\tilde{a}$ then yields $\mathcal{L}_{\rm Born}$ , so Born’s original theory is also Legendre self-dual. The reason for this is that $\mathcal{H}_{\rm Born}$ satisfies the condition (6.11). The results of [23] for the Hamiltonian density for arbitrary $\xi$ show that (6.11) is satisfied only for $\xi=0$ and $\xi=1$ .

6.1 A proof from the CH formula

We shall now present a proof that (electromagnetic) self-duality implies Legendre self-duality, by taking the CH formula (1.8) as our starting point. We know the first derivatives of $\mathcal{L}(U,V)$ from (1.10), and we know how $(U,V)$ depend on $(S,P)$ and hence on ${\bf E}$ and ${\bf B}$ . This allows us to compute the derivatives of $\mathcal{L}$ with respect to both ${\bf E}$ and ${\bf B}$ . Recalling the definitions of $({\bf D},{\bf H})$ as derivatives of $\mathcal{L}$ , we find that

\left(\begin{array}[]{c}{\bf D}\\ {\bf H}\end{array}\right)=\left(\begin{array}[]{cc}a&b\\ -b&a\end{array}\right)\left(\begin{array}[]{c}{\bf E}\\ {\bf B}\end{array}\right)\,,

(6.27)

where, now,

a=\frac{\dot{\ell}^{2}V+U}{\dot{\ell}(V+U)}\,,\qquad b=\frac{(\dot{\ell}^{2}-1% )P}{2\dot{\ell}(V+U)}\,.

(6.28)

From this result we may compute $(\tilde{S},\tilde{P})$ . One finds that

\tilde{S}=\frac{U}{\dot{\ell}^{2}}-\dot{\ell}^{2}V\,\qquad\tilde{P}=-P\,,

(6.29)

which allows us to determine $(\tilde{U},\tilde{V})$ in terms of $(U,V)$ and $\dot{\ell}$ . The result is

\tilde{U}=\dot{\ell}^{2}V\,,\qquad\tilde{V}=\frac{U}{\dot{\ell}^{2}}\,.

(6.30)

We also find from (6.27) that

-{\bf E}\cdot{\bf D}+{\bf B}\cdot{\bf H}=\frac{2U}{\dot{\ell}}-2\dot{\ell}V\,,

(6.31)

and hence, using the CH formula for $\mathcal{L}$ , that

	$\displaystyle\tilde{\mathcal{L}}$	$\displaystyle=\ \ell(\tau)-\frac{2U}{\dot{\ell}}+\left[\frac{2U}{\dot{\ell}}-2% \dot{\ell}V\right]\equiv\ell(\tau)-2\dot{\ell}V$		(6.32)
		$\displaystyle=\ \ell(\tau)-\frac{2\tilde{U}}{\dot{\ell}}\,,$		(6.32)

where

\tau=V+\frac{U}{\dot{\ell}^{2}}=\tilde{V}+\frac{\tilde{U}}{\dot{\ell}^{2}}\,.

(6.33)

We thus deduce that $\tilde{\mathcal{L}}(\tilde{U},\tilde{V})$ is given by the CH formula for the same function $\ell$ that we used for $\mathcal{L}$ . This implies that $\mathcal{L}$ and $\tilde{\mathcal{L}}$ are identical functions.

7 The NLED/Particle-mechanics correspondence

The starting point for section 3 was the obvious fact that setting the magnetic field to zero in any Legendre pair of functions $\mathcal{L}({\bf E},{\bf B})$ and $\mathcal{H}({\bf D},{\bf B})$ yields functions $L(E)$ and $H(D)$ , respectively, that are a Legendre pair when viewed as functions of ${\bf E}$ and ${\bf D}$ , respectively. We then showed that this remains true if $L(E)$ and $H(D)$ are viewed as one-variable functions, and we explained how they are related to the one-variable functions $\ell$ and $\mathfrak{h}$ that determine $\mathcal{L}$ and $\mathcal{H}$ for a self-dual NLED.

We now provide a different interpretation of the functions $L(E)$ and $H(D)$ , viewed as functions of ${\bf E}$ and ${\bf D}$ , respectively. Rather than set to zero the magnetic field, as we did in section 3, we replace the Euclidean 3-space with a flat 3-torus of 3-volume $v_{3}$ , and we truncate the Fourier expansion of fields on this 3-torus by setting all space derivatives of the 1-form potential to zero. We then have ${\bf B}={\bf 0}$ but also ${\bf E}=-\dot{\bf A}$ , where ${\bf A}(t)$ is the 3-vector potential, now a function only of the time coordinate $t$ . The Lagrangian obtained by integrating over $T^{3}$ is therefore

\hbox{\mybb L}(\bm{\nu})=v_{3}L(E)\,,\qquad\bm{\nu}:=-\sqrt{v_{3}/m}\,{\bf E}\,,

(7.1)

where $m$ is an arbitrary mass parameter needed to make $\nu$ dimensionless (for unit speed of light). The Hamiltonian (obtained by Legendre transform of 𝕃) is

\hbox{\mybb H}(\bm{\pi})=v_{3}H(D)\,,\qquad\bm{\pi}=-\sqrt{mv_{3}}\,{\bf D}\,,

(7.2)

where $\bm{\pi}$ is the Legendre dual of $\bm{\nu}$ . We may interpret 𝕃 and ℍ as the Lagrangian and Hamiltonian for a point particle with velocity $\bm{\nu}$ and momentum $\bm{\pi}$ in a locally-Euclidean 3-space.

For Maxwell we have ( $\nu=|\bm{\nu}|$ and $\pi=|\bm{\pi}|$ )

\hbox{\mybb L}=\frac{1}{2}v_{3}E^{2}=\frac{1}{2}m\nu^{2}\,,\qquad\hbox{\mybb H% }=\frac{1}{2}v_{3}D^{2}=\frac{\pi^{2}}{2m}\,,

(7.3)

which are the Lagrangian and Hamiltonian for a non-relativistic particle of mass $m$ .

For the Born-Infeld theory we may (since $m$ was an arbitrary mass parameter) set

T=m/v_{3}\,,

(7.4)

in which case

\hbox{\mybb L}=-m\sqrt{1-\nu^{2}}\,,\qquad\hbox{\mybb H}=\sqrt{m^{2}+\pi^{2}}\,,

(7.5)

which are the Lagrangian and Hamiltonian for a relativistic particle of mass $m$ . As a consistency check, we observe that

\tau=\frac{1}{2}T\nu^{2}\,,\qquad\sigma=\frac{1}{2}T(\pi/m)^{2}\,,

(7.6)

and the relations of (3.12) then imply

\pi=\frac{m\nu}{\sqrt{1-\nu^{2}}}

(7.7)

as expected.

There is a string-theory interpretation of this correspondence between the Born-Infeld field theory and the massive relativistic particle, because (as mentioned in the Introduction) it is implied by the T-duality relation between the D3-brane and D0-brane of Type II superstring theory. However the correspondence obtained above is more general because it applies to any self-dual NLED. For ModMaxBorn, for example, we find that

\hbox{\mybb L}=-m\sqrt{1-e^{\gamma}\nu^{2}}\,,\qquad\hbox{\mybb H}=\sqrt{m^{2}% +e^{-\gamma}\pi^{2}}\,,

(7.8)

which are again the Lagrangian and Hamiltonian for a relativistic particle but with a “modified light-speed” of $e^{-2\gamma}$ , which is subluminal for $\gamma>0$ and superluminal for $\gamma>1$ . For this “relativistic” particle mechanics model, considered in isolation, we could redefine the “speed of light” to be $e^{-\gamma/2}$ . However, as derived above, the particle mechanics model describes the ‘corner’ of full self-dual NLED model for which $|{\bf B}|=0$ and $-{\bf E}$ is a space-independent but time-dependent 3-vector that we interpret as a particle velocity vector, and the speed of light is what it is in the full theory, i.e. unity. From this perspective, we should expect that the particle velocity can be superluminal only for an acausal NLED, and our ModMaxBorn example confirms this.

8 Summary and outlook

In any generalisation of an established physical theory, such as Maxwell electrodynamics, the question arises of which features should be preserved and which may be discarded. The principal feature preserved by the Plebanski class of nonlinear electrodynamics is the canonical structure, and hence the number of degrees of freedom per space point. This means that small amplitude waves still have two distinct polarisations, but these waves will typically interact with each other. In addition, they need not travel at light-speed, which leads to the possibility of superluminal propagation in some backgrounds. This was initially investigated by considering shock waves in generic smooth electromagnetic backgrounds, but equivalent results are found by considering plane-wave perturbations of a generic constant uniform background, which can be viewed as a homogeneous optical medium.

For weak-field backgrounds (typically defined in relation to a Born scale introduced by interactions) the absence of superluminal propagation can be ensured by imposing simple convexity conditions on the Lagrangian density. However, generic theories satisfying these conditions will still allow superluminal propagation for some strong-field backgrounds. The systematic study of this possibility dates back to a 2016 paper by Schellstede et al. [21], whose results we have confirmed, and explored in the context of models proposed for a variety of phenomenological reasons over the past few decades [23]. One lesson from this work is that the simplest way to find a causal model is to choose one that is self-dual because weak-field causality implies strong-field causality (given the existence of a weak-field limit) [17].

Thus, one major reason for the study of self-dual NLED theories is that it is easy to separate the causal from the acausal cases. In fact, this becomes even easier once it is appreciated that the Lagrangian density $\mathcal{L}$ of any self-dual NLED theories (with a weak-field limit) can be constructed from a corresponding one-variable function $\ell$ . This function, defined on a half-line, provides the boundary condition needed to integrate the PDE that $\mathcal{L}$ must satisfy for any self-dual theory; we have called this the Courant-Hilbert (CH) construction since the PDE and its solution can be found in [20]. The causality conditions then reduce to simple constraints on the first and second derivatives of the function $\ell$ [17].

The initial aim of this paper was to to extend these results to the Hamiltonian formulation. Self-duality in this context is trivially ensured by restricting the Hamiltonian density $\mathcal{H}$ to depend on duality-invariant variables, but now Lorentz invariance requires $\mathcal{H}$ to satisfy a PDE, which (in appropriate variables) is formally the same as the one that $\mathcal{L}$ must satisfy to ensure self-duality. This means that there is a CH construction for $\mathcal{H}$ in terms of some other one-variable function $\mathfrak{h}$ , also defined on a half-line, and $\{\mathcal{L},\mathcal{H}\}$ is a Legendre-dual pair for any causal self-dual NLED. We have shown that the one-variable “CH-functions” $\{L,H\}$ defined by $L(\sqrt{2\tau})=\ell(\tau)$ and $H(\sqrt{2\sigma})=\mathfrak{h}(\sigma)$ are also a Legendre-dual pair. This defines a correspondence between any causal self-dual NLED and a particle-mechanics model, with Born-Infeld corresponding to the massive relativistic point particle.

The results just summarized also simplify the construction of $\mathcal{H}$ from $\mathcal{L}$ , and vice versa, by reducing this problen to the Legendre transform of a one-variable function. However, a much greater simplification is possible by taking advantage of a ‘dual’ CH construction of $\mathcal{L}$ from $\mathfrak{h}$ . The fact that both $\mathcal{L}$ and $\mathcal{H}$ can be constructed from $\mathfrak{h}$ implies a very simple relation between them. A procedure for finding $\mathcal{H}$ given $\mathcal{L}$ , for example, is given in the one-line boxed equation (1.25) of the Introduction. This is one of our main results, derived from an unexpected ‘duality’.

Since $\mathcal{L}$ and $\mathcal{H}$ are so simply related, it is natural to suppose that the CH-functions $\ell$ and $\mathfrak{h}$ must also be related in a way that is simpler than via Legendre transform of the associated one-variable functions $(L,H)$ . This is indeed true for some “simple” cases, such as Born-Infeld, but the general case requires consideration of what we have called “ $\Phi$ -parity”. The variables $(U,V)$ are linear combinations of variables $(S,\Phi)$ , with $\Phi=\sqrt{S^{2}+P^{2}}$ , and the $\Phi$ -parity dual of $\mathcal{L}(S,\Phi)$ is $\hat{\mathcal{L}}(S,\Phi)=\mathcal{L}(S,-\Phi)$ . The “simple” cases referred to above are those for which $\hat{\mathcal{L}}=\mathcal{L}$ ; i.e. the $\Phi$ -parity invariant NLED theories. For these cases the CH functions $\{\ell,\mathfrak{h}\}$ , both defined on a half-line, collectively define a single variable on a whole line; more precisely, they are related by the boxed equation (5.13) of section 5.

Generically, $\hat{\mathcal{L}}\neq\mathcal{L}$ and we have a $\Phi$ -parity pair of NLED theories with CH-functions $(\ell,\mathfrak{h})$ and $(\hat{\ell},\hat{\mathfrak{h}})$ , which are related in a similar way to (5.13), but with a $\Phi$ -parity twist; more precisely, the relations are those of the boxed equations (5.11) and (5.12) of section 5. In other words, the CH-function $\ell$ ( $\mathfrak{h}$ ) of $\mathcal{L}$ is simply related to the CH-function $\hat{\mathfrak{h}}$ ( $\hat{\ell}$ ) of $\hat{\mathcal{L}}$ , and this reduces to the simple relation of (5.13) for the $\Phi$ -parity invariant cases.

A major theme of this paper has been that many interesting features of self-dual NLED theories are a corollary of simple features of their associated CH functions $\{\ell,\mathfrak{h}\}$ . Examples are causality and conformal invariance, and the simple relation between Lagrangian and Hamiltonian densities summarised in the boxed equation (1.25) of the Introduction.

We have shown an alternative CH construction, again described by Courant and Hilbert, introduces a new CH function, that in “simple” cases is (minus) the Legendre transform of the CH $\ell$ -function. More generally, it is the Legendre transform of the $\ell$ -function of a “ $\Phi$ -parity” dual theory. The “simple” cases are therefore those that are “ $\Phi$ -parity” invariant, and the simplest example is Born-Infeld. We have thus uncovered a new special property of Born-Infeld that may be of relevance in its applications, e.g. in string theory.

Various other aspects of generic self-dual NLED theories deserve further investigation. It appears that only Born-Infeld is compatible with maximal (Minkowski spacetime) supersymmetry, but the constraints of non-maximal supersymmetry are usually weaker. We have omitted coupling to electric and magnetic charges; results of [36], for example, may be generalizable. We also expect the CH construction of self-dual NLED theories to be useful in the exploration of NLED theories coupled to gravity. For example, the spacetime metric describing the analog of the Reissner-Nordstrom black hole might be expected to be invariant under an electromagnetic duality rotation of its parameters. We certainly expect causality to be a significant issue, and our previous result that the stress-energy tensors of causal NLED theories obey the same energy conditions as the Maxwell stress-energy tensor [30] is an indication that results for Maxwell-Einstein will generalize simply to self-dual NLED theories.

To conclude, we remark that our Hamiltonian results can be applied directly to chiral 2-form electrodynamics in 6D Minkowski spacetime, for reasons spelled out in detail in [25]; essentially, one only has to re-interpret the variables $(u,v)$ . As the 4D NLED theory is then a dimensional reduction from 6D we expect that the 4D NLED causality conditions on $\mathfrak{h}$ will still apply, and may still be sufficient as well as necessary conditions for causality in 6D. This may be relevant to the recently investigated $T\bar{T}$ -flows of 6D chiral 2-form theories [37].

Acknowledgements

PKT has been partially supported by STFC consolidated grant ST/T000694/1. JGR acknowledges financial support from grants 2021-SGR-249 (Generalitat de Catalunya) and a MINECO grant PID2019-105614GB-C21.

References

[1] M. Born and L. Infeld, “Electromagnetic mass,” Nature 132 (1933) no.3347, 970.1
[2] M. Born and L. Infeld, “Foundations of the new field theory,” Proc. Roy. Soc. Lond. A 144 (1934) no.852, 425-451
[3] J. Plebański, “Lectures on non-linear electrodynamics”, (The Niels Bohr Institute and NORDITA, Copenhagen, 1970).
[4] G. Boillat, “Nonlinear electrodynamics - Lagrangians and equations of motion,” J. Math. Phys. 11 (1970) no.3, 941-951
[5] I. Bialynicki-Birula, “Nonlinear Electrodynamics: Variations on a theme by Born and Infeld”, in Quantum Theory of Particles and Fields, eds. B. Jancewicz and J. Lukierski, (World Scientific, 1983) pp. 31-48.
[6] I. Bialynicki-Birula, “Field theory of photon dust,” Acta Phys. Polon. B 23 (1992), 553-559
[7] J. G. Russo and P. K. Townsend, “Nonlinear electrodynamics without birefringence,” JHEP 01 (2023), 039 [arXiv:2211.10689 [hep-th]].
[8] L. Mezincescu, J. G. Russo and P. K. Townsend, “Hamiltonian birefringence and Born-Infeld limits,” [arXiv:2311.04278 [hep-th]].
[9] E. Schrödinger, “Contributions to Born’s new theory of the electromagnetic field,” Proc. Roy. Soc. Lond. A 150 (1935) no.870, 465-477
[10] E. S. Fradkin and A. A. Tseytlin, “Nonlinear Electrodynamics from Quantized Strings,” Phys. Lett. B 163 (1985), 123-130
[11] E. Bergshoeff, E. Sezgin, C. N. Pope and P. K. Townsend, “The Born-Infeld Action From Conformal Invariance of the Open Superstring,” Phys. Lett. B 188, 70 (1987)
[12] R. G. Leigh, “Dirac-Born-Infeld Action from Dirichlet Sigma Model,” Mod. Phys. Lett. A 4 (1989), 2767
[13] A. A. Tseytlin, “Born-Infeld action, supersymmetry and string theory,” [arXiv:hep-th/9908105 [hep-th]].
[14] J. P. Pereira, J. G. Coelho and R. C. R. de Lima, “Born–Infeld magnetars: larger than classical toroidal magnetic fields and implications for gravitational-wave astronomy,” Eur. Phys. J. C 78 (2018) no.5, 361 [arXiv:1804.10182 [astro-ph.SR]].
[15] V. I. Denisov and S. I. Svertilov, “Vacuum nonlinear electrodynamic effects in hard emission of pulsars and magnetars,” Astron. Astrophys. 399 (2003), L39-L42 [arXiv:astro-ph/0305557 [astro-ph]].
[16] J. Ellis, N. E. Mavromatos, P. Roloff and T. You, “Light-by-light scattering at future $e^{+}e^{-}$ colliders,” Eur. Phys. J. C 82 (2022) no.7, 634 [arXiv:2203.17111 [hep-ph]].
[17] J. G. Russo and P. K. Townsend, “Causal Self-Dual Electrodynamics,” [arXiv:2401.06707 [hep-th]].
[18] M. K. Gaillard and B. Zumino, “Nonlinear electromagnetic selfduality and Legendre transformations,” [arXiv:hep-th/9712103 [hep-th]].
[19] G. W. Gibbons and D. A. Rasheed, “Electric - magnetic duality rotations in nonlinear electrodynamics,” Nucl. Phys. B 454 (1995), 185-206 [arXiv:hep-th/9506035 [hep-th]].
[20] R. Courant and D. Hilbert, “Methods of Mathematical Physics”, Vol.II (Wiley Interscience, 1962) pp.91-94.
[21] G. O. Schellstede, V. Perlick and C. Lämmerzahl, “On causality in nonlinear vacuum electrodynamics of the Plebański class,” Annalen Phys. 528 (2016) no.9-10, 738-749 [arXiv:1604.02545 [gr-qc]].
[22] I. Bandos, K. Lechner, D. Sorokin and P. K. Townsend, “ModMax meets Susy,” JHEP 10 (2021), 031 [arXiv:2106.07547 [hep-th]].
[23] J. G. Russo and P. K. Townsend, “Born Again,” SciPost Phys. 16 (2024), 124 [arXiv:2401.04167 [hep-th]].
[24] I. Bandos, K. Lechner, D. Sorokin and P. K. Townsend, “A non-linear duality-invariant conformal extension of Maxwell’s equations,” Phys. Rev. D 102 (2020), 121703 [arXiv:2007.09092 [hep-th]].
[25] I. Bandos, K. Lechner, D. Sorokin and P. K. Townsend, “On p-form gauge theories and their conformal limits,” JHEP 03 (2021), 022 [arXiv:2012.09286 [hep-th]].
[26] E. Bergshoeff and M. De Roo, “D-branes and T duality,” Phys. Lett. B 380 (1996), 265-272 [arXiv:hep-th/9603123 [hep-th]].
[27] M. B. Green, C. M. Hull and P. K. Townsend, “D-brane Wess-Zumino actions, t duality and the cosmological constant,” Phys. Lett. B 382 (1996), 65-72 [arXiv:hep-th/9604119 [hep-th]].
[28] M. Born, “Nonlinear theory of the electromagnetic field,” Ann. Inst. Henri Poincare 7 (1937) no.4, 155-265
[29] S. M. Kuzenko and S. Theisen, “Nonlinear selfduality and supersymmetry,” PoS tmr2000 (2000), 022
[30] J. G. Russo and P. K. Townsend, “Causality and Energy Conditions in Nonlinear Electrodynamics,” [arXiv:2404.09994 [hep-th]].
[31] M. Perry and J. H. Schwarz, “Interacting chiral gauge fields in six-dimensions and Born-Infeld theory,” Nucl. Phys. B 489 (1997), 47-64 [arXiv:hep-th/9611065 [hep-th]].
[32] B. P. Kosyakov, “Nonlinear electrodynamics with the maximum allowable symmetries,” Phys. Lett. B 810 (2020), 135840 [arXiv:2007.13878 [hep-th]].
[33] B. P. Kosyakov, “Introduction to the classical theory of particles and fields,” (Springer 2007).
[34] S. I. Kruglov, “On generalized Born-Infeld electrodynamics,” J. Phys. A 43 (2010), 375402 [arXiv:0909.1032 [hep-th]].
[35] M. Rocek and A. A. Tseytlin, “Partial breaking of global D = 4 supersymmetry, constrained superfields, and three-brane actions,” Phys. Rev. D 59 (1999), 106001 doi:10.1103/PhysRevD.59.106001 [arXiv:hep-th/9811232 [hep-th]].
[36] K. Lechner, P. Marchetti, A. Sainaghi and D. P. Sorokin, “Maximally symmetric nonlinear extension of electrodynamics and charged particles,” Phys. Rev. D 106 (2022) no.1, 016009 [arXiv:2206.04657 [hep-th]].
[37] C. Ferko, S. M. Kuzenko, K. Lechner, D. P. Sorokin and G. Tartaglino-Mazzucchelli, “Interacting Chiral Form Field Theories and $T\overline{T}$ -like Flows in Six and Higher Dimensions,” [arXiv:2402.06947 [hep-th]].


(a)	(b)


(a)	(b)

1 Introduction

2 Strong-field causality redux

2.1 Auxiliary fields and the stress-energy tensor

3 The self-dual NLED Hamiltonian

3.1 Convexity/Concavity and Causality

3.2 Simple examples

ModMax

ModMaxBorn

q𝑞qitalic_q-deformed 𝔥MMBsubscript𝔥MMB\mathfrak{h}_{\rm MMB}fraktur_h start_POSTSUBSCRIPT roman_MMB end_POSTSUBSCRIPT

3.3 Conformal Invariance Redux

4 Hamiltonian without Legendre transform

4.1 Further examples

No maximum-τ𝜏\tauitalic_τ case

Logarithmic self-dual electrodynamics

5 ΦΦ\Phiroman_Φ-parity duality

5.1 The ΦΦ\Phiroman_Φ-parity dual of self-dual NLED

Illustrative examples

The q=3/4𝑞34q=3/4italic_q = 3 / 4 case

5.2 The alternative CH construction

Generalized Logarithmic NLED

5.3 The general “ΦΦ\Phiroman_Φ-parity” invariant self-dual theory

6 Legendre self-duality

6.1 A proof from the CH formula

7 The NLED/Particle-mechanics correspondence

8 Summary and outlook

Acknowledgements

References

$q$ -deformed $\mathfrak{h}_{\rm MMB}$

No maximum- $\tau$ case

5 $\Phi$ -parity duality

5.1 The $\Phi$ -parity dual of self-dual NLED

The $q=3/4$ case

5.3 The general “ $\Phi$ -parity” invariant self-dual theory