aInstitució Catalana de Recerca i Estudis Avançats (ICREA),
Pg. Lluis Companys, 23, 08010 Barcelona, Spain.
bDepartament de Física Cuántica i Astrofísica and Institut de Ciències del Cosmos,
Universitat de Barcelona, Martí Franquès, 1, 08028
Barcelona, Spain.
cDepartment of Applied Mathematics and Theoretical Physics,
Centre for Mathematical Sciences, University of Cambridge,
Wilberforce Road, Cambridge, CB3 0WA, U.K.
For any causal nonlinear electrodynamics theory that is “self-dual” (electromagnetic -duality invariant),
the Legendre-dual pair of Lagrangian and Hamiltonian densities are constructed from functions
on related to a particle-mechanics Lagrangian and Hamiltonian.
We show how a ‘duality’ relating to implies that and are related by a simple map
between appropriate pairs of variables. We also discuss Born’s “Legendre self-duality” and implications of a new
“-parity” duality. Our results are illustrated with many examples.
Nonlinear theories of electrodynamics (NLED) are generally defined [1, 2, 3, 4]
by means of a Lagrangian density function of the two Lorentz (pseudo)scalars
(1.1)
where are the (electric, magnetic) 3-vector-field components of the abelian 2-form field strength on Minkowski spacetime, and are their respective magnitudes. A feature of all NLED theories in this “Plebanski” class is that the degrees of freedom remain those of the free-field Maxwell theory, although superluminal signal propagation (and hence causality violation) is potentially possible, and the physical theories are those for which it is not possible. These features are shared by some NLED theories outside the Plebanski class, some of which are physical [5, 6, 7, 8], but they have no weak-field limit. In this paper we assume the existence of a (conformal) weak-field limit.
We also focus on a class of NLED theories that share with the free-field Maxwell electrodynamics the property of electromagnetic duality invariance. In the Maxwell case this can be viewed as an invariance of the (source-free) Maxwell equations under any constant shift of the phase of the complex 3-vector field . However, this definition applies only in Cartesian coordinate systems since and are 3-vectors in dual vector spaces. A better definition, which not only applies for any coordinate system but also generalises to nonlinear electrodynamics, is as an invariance of the Hamiltonian density under any constant phase shift of the complex 3-vector field
(1.2)
Invariance of the field equations is then a consequence of the invariance of the
Hamiltonian. For the Maxwell case, in Cartesian coordinates, and
we therefore recover the earlier definition. Following what has become standard terminology, we shall say that NLED theory with this symmetry is “self-dual”.
Within the Lagrangian formulation, the restriction to a self-dual theory is achieved by requiring the Lagrangian density to satisfy the following partial differential equation (PDE) [5]:
(1.3)
The first example, excepting the free-field Maxwell case, was the Born-Infeld
theory [2], although its self-duality was noticed by Schrödinger
a few years later [9]. Since the re-appearance of Born-Infeld electrodynamics in the 1990s as (or as part of) an effective theory for open strings of string theories [10, 11, 12, 13]
there has been a resurgence of interest in nonlinear electrodynamics. In particular, the possibility of a Born scale in electrodynamics is now taken seriously for its potential relevance to the physics of magnetars, e.g. [14, 15], and to particle physics experiments at future colliders [16].
One motivation for our focus on self-dual NLED theories is that many of the special properties of Born-Infeld are consequences of its self-duality. Another motivation is the recent result that strong-field causality is implied by weak-field causality for all self-dual NLED with a weak-field limit [17]. We elaborate below on the significance of this fact, and one purpose of this paper is to provide more details of the results of [17].
Another purpose is to expand the results of [17] to include the Hamiltonian formulation. As this is equivalent to the Lagrangian formulation for all causal NLED theories, we did not expect surprises. However, various additional remarkable properties of self-dual theories emerge from the conjunction of the Lagrangian and Hamiltonian formulations. For the remainder of this Introduction we provide the necessary background and a sketch of our main new results.
The self-duality PDE (1.3) can be simplified by expressing as a function of
and
(1.4)
This is possible only if preserves parity since both and are parity even whereas is parity odd, but this restriction is not a limitation for self-dual NLED because self-duality implies parity, for a reason to be explained below. The self-duality
PDE for is [18]
(1.5)
For some purposes it is convenient to use the alternative independent variables
(1.6)
Notice that as defined are both non-negative because . This implies that the ‘physical’ values of are restricted to the positive quadrant in the -plane. The self-duality PDE for is [19]
(1.7)
The general solution to this equation in the positive -quadrant, in terms of the boundary function , is [20]
(1.8)
where
(1.9)
We shall call this the Courant-Hilbert (CH) solution, and a “CH-function”. Notice that by definition, with equality only for . The choice yields the free-field Maxwell case.
To verify that (1.8) solves (1.7), we may take the differential of both sides of both equations of (1.8). The resulting pair of equations for the differentials may then be solved for and in terms of and . The result for implies
(1.10)
which confirms that . The result for is
(1.11)
where
(1.12)
The main implications of (1.11) were briefly discussed in [17] and we review this, with more detail, in the following section.
Within the Plebanski NLED class, the necessary and sufficient conditions for causality were found in [21], subject to an assumption about the domain of the function that can be interpreted physically as the existence of a weak-field limit. These conditions can be separated into two sets according to whether a violation is possible (generically) for weak fields, or only for strong fields. The former set are
equivalent to convexity of the function , which are also the conditions for convexity of as a function of [22]. The remaining (strong-field) causality condition was provided with some intuition and an alternative derivation in [23]. For self-dual NLED theories with a weak-field limit, we showed in [17] that all these causality conditions reduce to the following simple inequalities to be satisfied by derivatives of the CH-function :
(1.13)
Apart from its simplicity, this result is remarkable because there was no a priori reason to suppose that the
causality conditions on would be independent of . Notice that the condition tells us that is a convex function.
The assumption of a weak-field limit can also be expressed in terms of the function ; it is the statement that should have a Taylor-series expansion in . Omitting the constant term in this expansion on the grounds that it is irrelevant to the NLED dynamics; we have
(1.14)
for some dimensionless constant , which must be non-negative in order to satisfy the causality condition in the weak-field limit. In this limit , which yields the free-field Maxwell theory for and ModMax (the “modified Maxwell” theory [24]) for . Both are conformal because the conformality condition for self-dual NLED is equivalent to degree-1 homogeneity of , as we show in section 3. Another feature of the existence of a weak-field expansion is that defines a function not only for (which is all that is relevant to the CH solution of the self-duality PDE) but also for , at least in some neighbourhood of . We shall see later the significance of this fact.
A feature of the CH equations (1.8) is that many simple functions satisfying (1.13) allow to be found analytically, leading to explicit Lagrangian densities for a variety of causal self-dual NLED theories. These include Born-Infeld [2] and its Mod-Max-type generalisation [24, 25] that we call, for brevity, “ModMaxBorn”. Other examples were given in [17] and more will be given here.
We shall expand on the results of [17] in the following section but, as stated above, our main purpose is to explore the Hamiltonian formulation for self-dual NLED. An advantage of this formulation is that self-duality can be implemented simply by restricting the Hamiltonian density to be a function of the
two duality-invariant rotation scalars
(1.15)
As both and are parity-even (parity flips the sign of ) it follows that any function is duality invariant, and hence that all self-dual NLED theories preserve parity, as claimed above.
A disadvantage of the Hamiltonian formulation is that Lorentz invariance is not manifest.
The condition for a generic, and not necessarily duality-invariant, Hamiltonian density to define a Lorentz invariant NLED was found in [5]. Here we need this condition for functions of only, and if we trade these variables for and
(1.16)
then the Lorentz invariance condition for is the PDE
(1.17)
which is formally identical to the Lagrangian self-duality PDE of (1.5).
For some purposes it is convenient to use the new independent variables111These differ from the definitions of in [24] by the exchange , which facilitates comparison between the Lagrangian and Hamiltonian formulations of self-dual NLED.
(1.18)
Notice that are both non-negative, and that , so the ‘physical’ region in the
-plane is the region of the positive quadrant bounded by and .
The condition for to define a Lorentz invariant NLED is [25]
(1.19)
This is mathematically equivalent to (1.7) since the sign on the right-hand side can be changed by using instead of as the independent variables, and the general solution for is then formally the same as the solution of (1.8) for . However, we shall use the variables as defined above because they are both non-negative. Notice that (1.19) has the solution
(1.20)
which defines the conformal Bilaynicki-Birula electrodynamics [5]. There is no analogous solution of (1.7) because of the different sign on the right-hand side.
All other solutions of (1.19), expressed in terms of the boundary function , are given by
(1.21)
where is a new CH-function analogous to .
Corresponding to every causal NLED defined by a function there is a Hamiltonian density function and the two are related by a Legendre transform. This is because convexity of (as a function of ) implies convexity of (as a function of ) and this implies that the Legendre transform is an
involution, although “strict” convexity (non-zero Hessian determinant) is needed to apply this theorem to the Plebanski class of NLED theories. For self-dual theories a corollary of this correspondence is that the functions and must be related in some way that allows one to be found from the other. What we find is that the following functions are Legendre transforms of each other:
(1.22)
In other words, the functions and are related by a Legendre transform222As we explain later, this requires .
but in terms of the new variables and .
Our choice of notation is motivated by the fact that the functions and can be interpreted as the Lagrangian and Hamiltonian of a particle mechanics model associated to the NLED defined by the Lagrangian and Hamiltonian densities and . This was a motivating analogy for Born’s original NLED theory, and a correspondence between Born-Infeld and the massive relativistic particle is a consequence of T-duality for the effective worldvolume field theories of D-branes (see e.g. [26, 27]). However, the
correspondence applies more generally, as we discuss in section 7.
Our Hamiltonian results for self-dual NLED theories allow us to ‘translate’ the causality conditions on to corresponding causality conditions on . As we shall see, these are
(1.23)
and
(1.24)
This last condition is equivalent to strict convexity of the function , which is required for its interpretation as the Legendre dual of , which is in turn required for the interpretation of as the Legendre dual of . Notice that is required to be a concave function () in contrast to the convexity condition () on .
Surprisingly, the function can be used to directly construct
not only the Hamiltonian density but also the Lagrangian density, again via the
Courant-Hilbert solution but now for boundary conditions at rather than .
Since and are exchanged by an exchange of and , this is a
type of electromagnetic duality, which is indirectly equivalent to a Legendre duality.
As we shall see, this fact implies a remarkably simple relation between the Lagrangian and
Hamiltonian densities of any self-dual NLED. For example, given the Lagrangian density
in the form the Hamiltonian density in the form can be found from the following procedure:
(1.25)
This allows us to find from without the need for a Legendre transform!
For the free-field Maxwell case, we have
(1.26)
This result suggests that and must be similarly related since they determine and . We find, in some cases, that there is indeed a very simple relation between these two CH-functions, but the
general case requires consideration of what we call the “-parity” (equivalently, “-parity”) dual NLED defined by
(1.27)
In some cases, such as Born-Infeld, . For this -parity invariant subset of self-dual NLED we find that
(1.28)
As mentioned above, only the values of for are relevant to the CH solution for , but is defined for if there is a weak field limit (which we are assuming here). We now see that for any -parity invariant NLED (with a weak-field limit) the CH function for determines the other CH function . It remains true, of course, that and are related by their relation to the Legendre dual pair of functions , but no Legendre transform is needed to find one from the other!
More generally, and is associated with a pair of CH-functions that differ from . For these cases we find that are related related to by a pair of relations similar to (1.28) but intertwined by -parity.
An obvious question is whether there is a simple characterisation, in terms of restrictions on the CH functions , of the subclass of self-dual NLED theories that are also -parity invariant. There is, but it involves an alternative solution of the self-duality PDE, also given by Courant and Hilbert [20], in terms of a function of a positive variable . The -parity invariant self-dual theories are those for which is invariant under , and for these theories
we show that is the Legendre-dual of with respect to (rather than . Born-Infeld corresponds to the choice of a linear function of .
Another topic that we discuss is “Legendre self-duality”, which has no direct
connection to the topics described above, but could potentially be confused
with them. The Hamiltonian density is the Legendre transform of the Lagrangian density with respect to . If we now take the Legendre transform of with respect to we arrive at a ‘dual’ Lagrangian density , which is a function of Lorentz scalars constructed from the
Legendre-duals of . It was noticed by Born, for Born-Infeld, that and are the same function333Born refers, confusingly, to the dual Lagrangian as the “Hamiltonian”, with the recognition that this is an abuse of terminology. of their respective Lorentz scalars [28]. Much later, it was shown by Gaillard and Zumino [18] that any (electromagnetically) self-dual theory shares this property of “Legendre self-duality”, and we prove this here by using the CH formula (1.8) for the general self-dual NLED theory. A subsequent clarification of Kuzenko and Theisen was the observation that “Legendre self-duality” relies only on invariance under a discrete subgroup of the electromagnetic duality group [29]. Here we give another proof based on the observation that if is invariant under then a Legendre transform with respect to must yield the same function as a Legendre transform with respect to . As an illustration, we explain how Born’s original NLED theory of 1933 [1] is Legendre self-dual without also being self-dual in the sense used here (and in [18, 29]).
We shall conclude with a summary of our main results and a brief discussion of further implications and future directions.
2 Strong-field causality redux
As mentioned in the introduction, weak-field causality implies strong-field causality for self-dual NLED theories
if a weak-field limit is assumed. Without this assumption, causality requires the additional condition , where
is given in (1.12). Either way, for causal theories and we can investigate its implications, as we did briefly in [17]. We now elaborate on some aspects
of this topic here because it will be useful when we later extend the results to the Hamiltonian formulation.
From the equation for in (1.8) we learn that fixing restricts to a line in the positive-quadrant in the plane; i.e. the curves of constant in this quadrant are straight lines, with slopes
(2.1)
Recalling the equation (1.11) for we see that if at some point in the positive quadrant then we can take for . In other words,
at the intersection point of the lines of constant and constant . It follows that if everywhere in the domain of (which is either the entire positive quadrant or a connected subregion of it that includes the origin) then no two lines of can intersect in this domain. This is because the line of constant can intersect the line of constant , for any positive constant , only if it also intersects the line of constant for positive infinitesimal . Thus, if in the domain of then this domain is foliated by lines of constant , as illustrated for Maxwell and Born-Infeld in Fig. 1.
(a)
(b)
Figure 1: The lines of constant for the two cases: (a) Maxwell, .
(b) Born-Infeld,
(for ).
We can interpret this conclusion in another way. If the solution of (1.8) for
is unique then is uniquely determined by at each point in the domain
of . However, there are at least two distinct values for at an intersection
point, so cannot be uniquely determined by in any region that includes an intersection point. A necessary and sufficient condition for the uniqueness of the solution (1.8) is therefore that is nowhere zero in the domain of .
In those cases for which has a power-series expansion of the form (1.14), we know that is implied by the causality/convexity inequalities of (1.13), and hence that the solution for will be unique if these inequalities on and are satisfied. Given the importance of this point, we shall show how it can be deduced in a more direct way.
The solution for is unique if the graph of the function has precisely one intersection with the graph of the function , for any choice of in the domain of . The function has the properties
(2.3)
which follow from (1.14) and (1.13); i.e. is a function of non-negative slope for all , with the minimum value being . The graph of the function for is the branch of a hyperbola that has the line as one asymptote, and the -axis as the other asymptote. From this description it is obvious that the graphs of the two functions intersect at precisely one point for each choice of ,
which confirms that (2.2) has a unique solution for , as illustrated in
Fig. 2 for ModMax and Born-Infeld.
(a)
(b)
Figure 2: Unique intersection (determining )
of the graph of (blue curve) with the graph
of (red curve) for a) ModMax ( with ), and b) Born-Infeld ().
2.1 Auxiliary fields and the stress-energy tensor
It was observed in [17] that the two equations of (1.8) may be combined (for causal theories) into the single equation
(2.4)
where is a Lagrange multiplier. This is because and are, jointly, a pair of auxiliary fields that can be consistently eliminated by their algebraic field equations. Varying
yields the equation , which implies if . Varying yields a constraint that uniquely determines when (as illustrated in the previous subsection). Elimination of thus yields the Lagrangian density defined by (1.8). Since are parity-even, we might expect to be able to make parity assignments for the auxiliary fields such that the Lagrangian density of (2.4) has even parity.
This is true: if we assign even parity to both and then both and are parity-even, and hence so is .
An implicit assumption in the definitions of , and hence of , is that the Minkowski spacetime metric is the standard Minkowski metric (with a “mostly plus” signature). To generalize to curvilinear coordinates , we have only to define as the scalar fields
(2.5)
where is the Minkowski metric in the chosen coordinates
(with ) and is the 2-form abelian field strength for 1-form potential on the Minkowski spacetime. It then follows that are scalars and hence so is and , and the equations of (1.8) still apply but with a scalar rather than a scalar density. With this understood, (2.4) is unchanged but the Lagrangian scalar density is now
(2.6)
where
are the scalar densities , which are related to the scalar densities
in the same way that are related to .
We are not restricted to Minkowski spacetime; by re-interpreting the metric as an arbitrary spacetime metric that can be freely varied, we can find the stress-energy tensor by the Hilbert formula
(2.7)
Since is metric independent, this formula yields
(2.8)
where
(2.9)
which is the Maxwell stress-energy tensor. Using
(2.10)
and the auxiliary-field equations, we
can simplify this result to444We thank Dmitri Sorokin for pointing out an error in the stress-energy tensor formula appearing in the original arXiv version of [17].
(2.11)
This agrees with the result of [30]; the novelty here is that we have taken as our starting point the auxiliary-field formulation (2.6) for the Lagrangian density of a generic self-dual NLED in a general spacetime.
3 The self-dual NLED Hamiltonian
We have seen in the Introduction that the Hamiltonian density for the general self-dual and Lorentz invariant NLED
may be expressed in terms of a one-variable CH function via the equations of (1.21). This
was by analogy to the equations of (1.8) for the Lagrangian density, and the same steps may be used here to verify it. By taking the exterior derivative of both
sides of both equations of (1.21) we find two equations that are
jointly equivalent to
(3.1)
where is the Hamiltonian analog of the function of (1.12):
(3.2)
We shall see later that causality requires , with consequences analogous to those that follow from .
and hence that solves (1.19). Notice that any constant term in , which makes no contribution to
, appears only as a constant term in ; it represents a constant uniform background energy
density that has no effect on the NLED field equations.
Our first task will be to determine the relation between the functions and implied by Legendre duality
of the Lagrangian and Hamiltonian densities. The existence of this duality is guaranteed by the convexity
of as a function of , the fact that the Legendre transform of any function is convex, and the theorem that
the Legendre transform is an involution when acting on convex functions. For , this transform is
where and are the functions introduced in (1.22). Combining this with (3.4), we have
(3.7)
Notice that although and were defined in (1.22) as functions of a single variable
(respectively, and ), we are required by (3.7) to consider them
as functions of and , respectively. In contrast, the claim in the Introduction that
and are each other’s Legendre transform is the claim that
(3.8)
However, it is not difficult to see that (3.7) implies (3.8).
Variation of and in the respective expressions
of (3.7) for and yields
(3.9)
and a further implication is
(3.10)
which is exactly what one finds from variation of and in the expressions of (3.8) for and , respectively.
The variation of 3-vector fields needed to find the functions and from (3.7) therefore yields the same result
as the variation of scalar fields in (3.8).
Further implications of (3.10) are the relations555A relation similar to (3.11) appears in [31] in relation to an involution defined in the context of 6D chiral electrodynamics.
(3.11)
and
(3.12)
These relations allow us to find (up to the addition of a constant) given , and vice versa.
The fact that functions and
are related by Legendre transformations, but with respect to variable
and , can be summarized by the equations
(3.13)
The second equality is equivalent to (3.12) given (3.11). The first equality tells us that any constant term
in the power-series expansion of also appears in the power-series expansion of with the
opposite sign, while the remaining information of this equality may be verified by taking the exterior derivative of both sides
to get
(3.14)
Using (3.11), we may rewrite this as , which is
true as a consequence of the relation of (3.12).
There are other differences between the Lagrangian and Hamiltonian formulations
of self-dual NLED theories that go beyond sign changes. One is the difference in the potential range of the independent variables: although is non-negative by its definition in (1.8), the definition of in (1.21), which we may rewrite as
(3.15)
allows to be negative. For , which is the free-field (Maxwell) case,
, and this remains true for all causal NLED theories that have Maxwell as their weak-field limit. This is easily seen by writing the second equation in (1.21) as
(3.16)
The function is positive, with (because the weak-field limit is Maxwell). It is also a monotonically decreasing function of (because but for any causal interacting NLED). The function takes the value at but then increases monotonically, becoming infinite at , and then negative. There is therefore a unique non-negative value of at which (as illustrated in fig. 4a below).
In contrast, if the weak-field limit is ModMax (with ) then . This means that there will be a choice of such that , which implies that for , as illustrated in fig. 4b. In these cases is not excluded by its definition in (1.21); instead the inequality is a restriction on the domain of required by equivalence to the Lagrangian formulation. Specifically, it restricts the Hamiltonian fields to the region in field-space for which is a convex function of ; i.e. to its “convex domain”. For ModMax, the boundary of this convex domain corresponds to Lagrangian fields with , which includes all exact plane-wave solutions of the ModMax field equations[24].
3.1 Convexity/Concavity and Causality
In the Lagrangian formalism, and assuming the existence of a weak-field limit, the necessary and sufficient conditions for
causality are the conditions for convexity of , which are equivalent to [17]
(3.17)
By combining the first of these inequalities with the relation (3.11) we deduce that
(3.18)
Next, we take the exterior derivative of the first of the relations in (3.11) to find, again using (3.11), that
(3.19)
Taking the exterior derivative of the first of the relations of (3.12), we also find that
(3.20)
and hence that
(3.21)
Using both inequalities of (3.17), and the fact that is non-negative, we see that
the right-hand side of this equation is non-negative, and hence
(3.22)
We have now found the ‘translation’ of the causality/convexity conditions (3.17) on to the corresponding
conditions to be satisfied by . They are
(3.23)
The second of these equations is equivalent to the statement that is a concave function,
and a corollary of this is that
(3.24)
where was defined in (3.2). We postpone a discussion of the consequences of this corollary
as we still need to explain the origin of the condition (1.24) of the Introduction.
By taking the exterior derivative on both sides of the second of the relations of (3.12),
we get another formula for :
(3.25)
Comparing this with (3.20), and again using (3.11), we find that
(3.26)
The first factor on the left-hand side is positive, for reasons explained above. The second factor is not obviously positive,
but is required to be so; this is the condition (1.24). To understand its significance, we return to the functions and . Because they are each other’s Legendre transform, we know that they are both convex; in fact strictly convex because is a strictly convex function of . Thus
(3.27)
This allows us to interpret (3.26) as the statement that the Hessian of is the inverse of the Hessian of
. This requires, of course, that both Hessians are non-zero and finite, which is equivalent to the statement
that both and are both strictly convex functions.
We now return to the significance of (3.24). We see from (1.21) that the curves of constant in the -plane are straight lines. Only the
half-lines in the ‘physical’ region of this plane are relevant; this is the wedge-shaped region bounded by the lines (the -axis) and the line (since by definition). Because , no two lines of constant can intersect in this region (for reasons identical to those explained in our discussion of
section 2 for in the context of lines of constant in the positive
quadrant). The lines of constant therefore foliate either the entire physical region or some connected subregion of it.
From (1.21) we see that all lines of constant intersect the -axis at , which implies (since ) that the lowest line is the one with ; this confirms that is required for an equivalence of the Lagrangian and Hamiltonian formulations. The slope of the lines is
(3.28)
where the inequality follows from the first causality condition of (3.23). The slope of the lowest line is therefore
. Assuming that has a power-series expansion about (which is equivalent to the assumption of a weak-field limit) we conclude (omitting the irrelevant constant term in the expansion) that
(3.29)
for some constant . The special case for which yields ModMax, with Maxwell as the free-field subcase. For Maxwell, the lines of constant foliate the entire wedge-shaped physical region in the -plane. For ModMax () they foliate the wedge-shaped subregion that is bounded from below by the line, which is
(3.30)
For both Maxwell and ModMax the lines of constant are parallel because is constant. For BI, the slope increases as increases because . These three cases are illustrated in Fig. 3.
(a)
(b)
(c)
Figure 3: Lines of constant , defined in (1.21), for (a) Maxwell, , (b) ModMax, , . and (c) Born-Infeld, (for ).
For the examples that we consider in the following section, there is no maximum value of , so the lines of constant foliate the wedge-shaped region bounded by the positive -axis and the line. We have found an example with an upper bound on but we do not discuss it here.
A further implication of is that there is a Hamiltonian counterpart to (2.4). The two equations of (1.21) may be combined into the one equation
(3.31)
where is a Lagrange multiplier imposing the constraint on , but the fields are an auxiliary pair. Varying we get the equation
, which is equivalent to when
. Varying we get the equation for , which has a unique solution when for reasons identical to those
explained in section 2 for in the context of the equation for . This is illustrated in Fig. 4.
Elimination of the auxiliary fields in (3.31) therefore yields precisely the Hamiltonian density
defined by (1.21).
(a)
(b)
Figure 4: Unique intersection (determining ) of the graph of (blue curve) with the graph of (red curve) for: a) Born-Infeld at . For any , , intersection occurs at . b) ModMaxBorn with . When the intersection occurs at , but when the intersection occurs at ().
As for the Lagrangian auxiliary-field formulation of (2.4),
we can make parity assignments for the Hamiltonian auxiliary fields such that the Hamiltonian density of (3.31) has even parity. Since both and are parity-even, this is achieved by assigning even parity to both and . A consequence of parity conservation is that the electromagnetic duality group is enhanced
to because parity acts by the transformation .
3.2 Simple examples
We now illustrate the construction of the Hamiltonian
from and the causality conditions on with a few examples.
ModMax
The ModMax Lagrangian and Hamiltonian densities are666Recall that
as defined in this paper differ from the definitions in [24]
by . [24]
(3.32)
Maxwell is included as the special case with . The Lagrangian function of one-variable for ModMax
is [17] (but we may ignore the constant term as it has
no effect on the field equations). The convexity/causality conditions of (1.13) require , as
expected since is a convex function of for but not for .
The function corresponds to . Its Legendre transform is
, which yields
As shown in [24], is a convex function of for only for those values of for which
(3.35)
Values of violating this bound do not correspond to any values of .
In other words, the bound is needed for a correspondence between the Lagrangian and Hamiltonian
formulations of ModMax. However, we have seen in section 3, for any self-dual NLED, that
this correspondence exists iff . It follows that the ModMax convexity bound (3.35)
must be equivalent to , and this conclusion is easily verified: from (1.21) we see that
(3.36)
but this constraint on the values of is equivalent to (3.35).
ModMaxBorn
The Born-Infeld-type generalization of ModMax, introduced in [24] in its Hamiltonian formulation, was called
ModMaxBorn in [23, 17]. The ModMaxBorn Lagrangian density was found by Legendre transform
in [25]. The Lagrangian and Hamiltonian densities are, respectively,
(3.37)
The Born-Infeld theory is included as the case. Here and in other examples to follow, there is a non-zero vacuum energy, which can be simply removed by the addition of a constant.
It was shown in [17] that is associated with the function
(3.38)
From the results of section 3 we find that the corresponding function is
(3.39)
Using this result in (1.21) we recover the ModMaxBorn Hamiltonian density, and we find that
(3.40)
For this allows , but for reasons already explained
we must impose , which is again equivalent to the bound (3.35).
-deformed
Consider the following choice:
(3.41)
for which
(3.42)
From this we see that the conditions and require .
We also have
(3.43)
which is positive, as required, if . Therefore, this class of self-dual NLED theories is causal for
(3.44)
However, the Hamiltonian density can be found explicitly only for special values of in this range; for example , which yields Born-Infeld.
Another special choice is , which will be discussed later.
3.3 Conformal Invariance Redux
The condition for conformal invariance of any Hamiltonian density is degree-2 homogeneity in the electric and magnetic fields fields . For a self-dual NLED with Hamiltonian density function this condition becomes degree-1 homogeneity in :
(3.45)
We recall here that our variables, defined in (1.18), differ (by the exchange )
from those used in [24]. The general solution of this equation can be expressed in the form
(3.46)
Notice that remains finite as since . The Lorentz invariance condition (1.19) then implies that . This equation is solved by (i) any linear function of and (ii)
, which yield the following solutions for [24]:
(3.47)
The first of these is ModMax if with . The second solution defines (for positive sign)
the Bialynicki-Birula (BB) electrodynamics theory [5, 6], which has no weak-field limit. ModMax is therefore the unique interacting causal ‘extension’ of Maxwell electrodynamics with the same symmetries [24].
The condition for conformal invariance of any Lagrangian density is the homogeneity condition
(3.48)
Any function linear in will satisfy this relation, but this does not include ModMax. If parity is assumed then, as observed in the Introduction, one may replace the variables by (we recall that ). The homogeneity condition is then solved by any function linear in and , and self-duality selects the
particular linear function that is the ModMax
Lagrangian density, found originally by Legendre transform of the
ModMax Hamiltonian density [24]. This observation was made in [32], but it does not exclude the possibility of other conformal self-dual NLED theories for which is a nonlinear homogeneous function of ; for this we need a general solution to the homogeneity condition (3.48).
One might expect to be able to express the general solution to (3.48) in terms of an arbitrary function of one dimensionless ratio of functions of , by analogy to the general solution of (3.46) to the Hamiltonian homogeneity condition (3.45). However, the fact that both and may have either sign, and may be zero for non-vacuum field configurations, prevents it. For example, the formula was suggested in [33] but even cannot be written in this form when . The alternative formula , suggested in [24], has a similar problem with . If parity invariance is assumed then we may use the variables , in which case (3.48) is replaced by
(3.49)
In this case we could attempt to solve the homogeneity condition by
setting . This is the natural Lagrangian analog of (3.46),
and imposing the self-duality condition leads formally to ; the different sign on the right-hand side now allows only a linear function of , which again leads uniquely to ModMax. However this is still unsatisfactory because is generically infinite at , so the initial expression for is not
well-defined for all .
It appears that the only way to establish directly that the ModMax Lagrangian density is the
unique possibility compatible with conformal invariance and self-duality is to first solve the self-duality condition, e.g. as in (1.8). We then impose the homogeneity condition (3.49). Using (1.10), this leads to
(3.50)
and hence
(3.51)
where the last equality uses the definition of in (1.8). We thus arrive at the conclusion, for self-dual NLED, that will satisfy the homogeneity condition
(3.49) iff satisfies the homogeneity condition
(3.52)
The general solution is for constant . Causality restricts to
, which yields ModMax.
4 Hamiltonian without Legendre transform
The solution (1.8) to the self-duality PDE (1.7) results from a choice of boundary conditions on the boundary of the positive quadrant: . However, we could equally well choose initial conditions on the boundary of the positive quadrant; i.e. for some new one-variable function (the minus sign is included for later convenience). The solution analogous to (1.8) is then
(4.1)
where
(4.2)
For the identity function these equations yield . To verify that they yield a solution for arbitrary , we proceed as before by taking the differential of both sides of both equations to find that
(4.3)
From the first of these equations we have
(4.4)
and hence , as required.
We now have two different ways in which the Lagrangian density function of any given self-dual NLED theory can be constructed from an associated one-variable function; in one case we call the function and in the other case we call it . By comparing (4.4) with (1.10) we see that these
two functions are such that777Recall that and .
(4.5)
Using this relation, a comparison of the equation (1.8) for with equation (4.1) for provides
an equation for as a function of , and vice versa:
(4.6)
If we use the relations (4.5) and (4.6) in the equation of (4.1) for we deduce that
(4.7)
which is the equation for of (1.8). Since the equations for the auxiliary variable ( or ) are equivalent in both solutions of the self-duality PDE, which yield the same Lagrangian density function, it follows that
(4.8)
or, equivalently,
(4.9)
A surprising feature of this ‘dual’ description of the Lagrangian density of a self-dual NLED is that the new one-variable function is same as the one-variable Hamiltonian function !
This can be seen as follows: replacing by in (4.5) and (4.6) we get
precisely the relations that determine in terms of , and vice versa. Furthermore, we know how the functions and are related (by a Legendre transform in terms of the variables and ), so and are related in the same way, which is
(4.10)
Comparing this with (4.9) we recover the equations for and for in terms of .
Returning to (4.1), let us replace by , since they are the same function, and
then relabel the independent variables of the function as follows:
(4.11)
We then get
(4.12)
Comparison with (1.21) shows that is the same function
as . Explicitly, given any Lagrangian density , we may find its Legendre dual
Hamiltonian density by the following procedure:
(4.13)
Given any Hamiltonian density we can similarly find its Legendre-dual Lagrangian density :
(4.14)
Notice that the change of variables (4.11) implies that
(4.15)
Using the expressions given in the Introduction for in terms of , and in terms of , we deduce that
(4.16)
For Maxwell, for example, we get , as expected. More generally, once is expressed in the form we get in the form by the following procedure:
(4.17)
which is the boxed equation (1.25) of the
Introduction. The converse formula is
(4.18)
As we shall see in the following section, these results enormously simplify the task of finding the Hamiltonian density associated to any known Lagrangian density of a self-dual NLED, and vice versa.
4.1 Further examples
No maximum- case
This possibility was illustrated in [17] by the choice which is defined for all and satisfies the causality conditions of (1.13). The
corresponding Lagrangian density is
(4.19)
with .
As expected, it is defined in the entire positive quadrant, and it reduces to ModMax
with coupling constant in the weak-field limit.
The function for this case is
(4.20)
The weak-field expansion is , as expected.
The standard method of computing the Hamiltonian as a Legendre transform of leads to complicated equations.
The Courant-Hilbert construction of the Hamiltonian based on (1.21)
using the above also leads
to complicated equations.
However, the Hamiltonian can be immediately written down by using (4.13). This gives
As in the previous case, in order to obtain the Hamiltonian, one can circumvent the long calculation through a Legendre transform and directly make use of (4.13).
This gives
(4.25)
where
(4.26)
5 -parity duality
The self-duality PDE (1.5) is invariant under the “-parity” transformation , which therefore takes a given solution into its -parity dual solution
(5.1)
The Hamiltonian density corresponding to can therefore be found by using the formula (4.17):
(5.2)
and hence
(5.3)
Obviously, the new solution is the same as the old solution whenever is -parity invariant; in this class of NLED theories the weak-field expansion of is a power-series expansion in and . Otherwise, and both have a weak-field expansion in powers of and , with odd powers of that cannot be rewritten as a sum of positive powers of and .
The transformation for fixed is equivalent to
(5.4)
Similarly, the transformation for fixed is equivalent to
(5.5)
The formula (5.2) connects to the original Lagrangian
density . Using the version of this formula for , i.e. (4.13), we have
(5.6)
Similarly, applying (4.14) to the -parity dual of yields
(5.7)
These results show how the Lagrangian and Hamiltonian densities of generic NLED theories are related to those of their -parity duals. We now turn to the special class of (electromagnetic) self-dual NLED theories.
5.1 The -parity dual of self-dual NLED
Using the CH constructions of and in (5.6) we see that
(5.8)
with
(5.9)
These equations imply that
(5.10)
and hence that
(5.11)
Similarly, using the CH constructions for and in (5.7), we conclude that
(5.12)
We mentioned in the Introduction that the existence of a weak-field limit implies that is defined for , even though only its values for are relevant to the CH formula for . Now we see from (5.11) that the function for is (minus) the one-parameter function for the “-parity” dual of the Hamiltonian density that is Legendre-dual to . Similarly, we see from (5.12) that the function for is (minus) the one-parameter function for the “-parity” dual of the Lagrangian density . The general picture is illustrated in fig. 5,
and we present some illustrative examples below.
For the special case of -parity invariant theories, and and the two relations (5.11) and (5.12)
reduce to the one relation
(5.13)
Born-Infeld provides a simple example, and we return to a study of -parity invariant self-dual NLED theories at the end of this section.
We now present examples that illustrate the general case.
Illustrative examples
Let us consider self-dual NLED theories defined by
(5.14)
According to the formula (5.11), the -function of the -dual theory is
(5.15)
For this is the same as the “-deformed” function of (3.41), which tells us that the case under consideration now is, for , the -parity dual of the “-deformed”
case of subsection (3.2)
For we have the ModMaxBorn theory. In this case
(5.16)
but -duality flips the sign of , so that
(5.17)
and both (5.11) and (5.12) are therefore satisfied.
Obviously, in the BI () case there is no distinction
between the dual (hatted) functions and the original functions, since BI theory is -parity self-dual.
The case
The Lagrangian and Hamiltonian densities can also be found explicitly in the case, where
Using the relations (3.11), (3.12), (3.13) we find the
corresponding -function:
(5.21)
The Hamiltonian can now be found via the CH construction of (1.21), but it is much simpler to use (4.13) to obtain
(5.22)
where
(5.23)
Notice that of (5.21) is different from of (5.18), even at . This tells us that this theory is not -parity self-dual, even at . The one-parameter CH functions of the -parity dual theory are easily found from (5.11) and (5.12):
(5.24)
(5.25)
Comparing with the example (3.41) for , we note that -duality has again flipped the sign of ; for weak fields the -parity dual theory becomes ModMax but with .
The -dual Lagrangian and Hamiltonian can now be found by the maps (5.4), (5.5). Alternatively, they can be obtained
from the CH construction using the above expressions for and . For example, from (1.21) and (5.24), we have the equations
(5.26)
This gives
(5.27)
which leads to
(5.28)
in accordance with the formula .
5.2 The alternative CH construction
In addition to the construction of (1.8) that gives the Lagrangian density of the general self-dual NLED in terms of a boundary function , Courant and Hilbert show that the solution to the partial differential equation (1.7) may also be expressed as [20]
(5.29)
where is defined for positive dimensionless variable , which is determined implicitly by the equation
(5.30)
To verify this we take the differentials of both sides of (5.29). Simplifying the result by using (5.30) we find that
(5.31)
and hence that . We also see, by comparison with (1.10), that the relation of to must be such that . In fact, the relation is given implicitly by
(5.32)
from which we find that
(5.33)
as expected. This alternative to the CH constructions described in the Introduction is useful when considering the implications of -parity; conversely,
consideration of -parity yields insights into the relation between and that are in some respects similar to what we have already found for and .
Recall that is equivalent to . Applying this to (5.29) we find that the -parity transform of is
(5.34)
where is now determined by the equation
(5.35)
If we now define a new variable and a new function
by
(5.36)
then the equations (5.34) and (5.35)
defining become, respectively
(5.37)
and
(5.38)
These are formally the same as the original equations
(5.29) and (5.30) that define ,
but the function determining is
generally different from the function determining , since . In the following
subsection we focus on the special class of -parity
invariant theories for which , and hence .
There is also a CH construction of in terms of a
function , with a relation of
to that is formally the same as the relation
of to expressed by the equations of (5.32). The relation of to
is different, however. In terms of the new variable and the new function , the equations of (5.32) become
(5.39)
This has a remarkably simple interpretation: it
tells us that is the Legendre transform of
with respect to :
This result tells us that is a strictly concave function iff is a strictly
convex function, as is required for causality, except that causality also allows ,
which is realized by ModMax and its Maxwell limit. These conformal NLED theories are therefore
not obviously included in the alternative CH construction of self-dual NLED theories.
To better understand why ModMax and Maxwell are special cases, we observe that the converse of (5.42) is
(5.43)
That is, is the Legendre transform of
, with respect to
(recall that , expressed as the function , is its Legendre transform with respect to ).
For the choice , we have
(5.44)
which is defined only for , and is zero at this one point in its domain. Using this function in (5.34) yields the ModMax Lagrangian density, and Maxwell for . Thus, (5.37) does include Modmax and Maxwell if the function is defined in terms of the CH function , as in (5.44), and a similar (dual) statement
applies if the function in (5.29) is defined as (minus) the Legendre transform of .
One utility of the alternative CH construction described above is that new explicit examples of self-dual NLED theories can be found that would otherwise be difficult to find. This is illustrated by the following example.
Generalized Logarithmic NLED
We start from the function
(5.45)
where is a parameter that can be chosen to arrange for zero vacuum energy,
and is a further real parameter. Using (5.29) we find the Lagrangian density:
(5.46)
with
(5.47)
For this reduces to ModMaxBorn.
Recalling again that -parity takes to , one sees from inspection
that this generalized logarithmic NLED theory is -parity invariant iff
and , in which case it reduces to Born-Infeld. For all other choices of these
parameters the -parity dual is found by changing the signs of both and .
5.3 The general “-parity” invariant self-dual theory
Comparing (5.34) to (5.37) we see that whenever , i.e. whenever .
In this case
(5.48)
which implies that . From the equation for in (5.32) we
see that is equivalent to , so the weak-field expansion
(5.49)
must be equivalent to an expansion of about . From the expressions for
and in (5.32), one finds that this expansion is
(5.50)
This result is a direct consequence of the fact that and the identity
(5.42) (for ). The corresponding weak-field expansion of is
(5.51)
A very simple choice for that is manifestly invariant under is
This yields the Born-Infeld theory. The weak-field expansions of and are exactly as above
in this case; more generally, a rescaling of the parameter in (5.49) and (5.52) will be necessary.
It is a simple matter to write down other functions that are invariant under , but any such function
must also have the property that the equation (5.30) has a unique solution for ,
and we must also impose causality conditions. For example, the condition requires .
Another aid to separating the causal from the acausal NLED theories is the relation (5.42),
which implies that is a concave function of whenever is a convex function of , as required.
The implications for are generically not obvious but for -parity invariant theories, and therefore
must also be concave (for ).
Consider, for example the following one-parameter generalization of Born-Infeld, defined by
(5.54)
where is a constant. One finds that
(5.55)
and
(5.56)
Using (5.55) in (5.30) we find the following equation for :
(5.57)
Inspection of the graphs of the functions of on both sides of this equation shows that a unique solution exists for all in the positive quadrant iff .
From (5.56) we see that this is also required for to be a concave function for .
For we have a one parameter self-dual deformation of Born-Infeld that preserves -parity invariance.
As (5.57) is a quartic equation for , which has an explicit and unique solution for , the Lagrangian density can still be found explicitly.
6 Legendre self-duality
So far we have considered the Lagrangian and Hamiltonian densities. In both cases, the NLED field equations
may be written in first-order form as the “macroscopic Maxwell equations”
(6.1)
together with constitutive relations which are either
(6.2)
or
(6.3)
However, we may also specify the constitutive relations in terms of a ‘dual’ Hamiltonian density
(6.4)
in which case
(6.5)
or by a ‘dual’ Lagrangian density
(6.6)
in which case
(6.7)
The possibility of a description in terms of one of four “fundamental functions” was observed by Born in [28] but here we use the
more standard terminology of Bialynicki-Birula [5], except for sign changes to ensure that the addition of a constant to
implies the addition of the same constant to , and
its subtraction from both and .
Another of Born’s observations was that, for Born-Infeld, and are identical functions of their respective scalar variables, appropriately defined; this has been called “Legendre self-duality”. As mentioned in the Introduction, this was shown by Gaillard and Zumino to be a property of any
(electromagnetically) self-dual NLED theory [18], and a later proof of Theisen and Kuzenko [29] showed that only a electromagnetic duality was needed. The starting point
of this proof was (in our notation) the Lagrangian density
(6.8)
where are the electric/magnetic components of , now an arbitrary 2-form field, and
are the electric/magnetic components of . The combined
field equations found from varying both and are equivalent to those of for
(since variation of yields the equation ). However, the equations found from varying , which are
(6.9)
may be used to eliminate ; this yields the dual Lagrangian density . Theisen and Kuzenko show that the functions and are the same for all NLED invariant under a discrete electromagnetic duality transformation. To state this result in our notation, we observe that
a further implication of the equations (6.9) is
(6.10)
In other words, Legendre self-duality can be restated as
the equivalence, for self-dual NLED, of the functions and ,
with defined in terms of according to (6.10).
Here we show that this result follows directly from the definition of the dual Lagrangian density whenever
the Hamiltonian density is invariant under the duality-rotation taking to ,
which implies that
(6.11)
This is obviously a property of any self-dual NLED, but it also a property of some other NLED theories
that are not self-dual.
We begin with the observation that
(6.12)
This relation implies the following two relations
(6.13)
The first of these is just the inverse of (6.12). The second follows by using (6.12)
to replace on the right-hand side; this yields the definition of (6.6) for .
We see from these relations that both and are Legendre transforms of ,
but one is with respect to the first 3-vector variable and the other is with respect to the second 3-vector variable.
Let us now rewrite (6.13) more abstractly as
(6.14)
for 3-vectors . From this, we see that the property (6.11)
of any self-dual theory implies that
(6.15)
Given Lorentz invariance of the function on the left-hand side, it may be expressed as a function of the Lorentz scalars
and . The same is true of the right-hand side but with replaced by , as we should expect from the minus signs in the definitions of in (6.10).
We thus conclude that and are the same, as functions, for any self-dual NLED theory.
As a simple illustration of Legendre self-duality,
we consider a class of NLED theories, introduced in [34], that may be defined by the following one-parameter family of Lagrangian densities in which are a pair of auxiliary fields:
(6.16)
The family parameter is , which we may assume to be non-negative. For we have the Roček-Tseytlin (RT) formulation of Born-Infeld [35]; elimination of the auxiliary fields yields . For , we get the original Born theory
[28]; the general case was discussed in detail in [23]. An advantage of this auxiliary-field formulation is that the 3-vector fields are now linear functions of :
(6.17)
This implies that
(6.18)
and hence that the dual RT Lagrangian density is
(6.19)
but with expressed as functions of .
Using (6.10) and (6.17), it is straightforward to show that
(6.20)
and hence that
(6.21)
where
(6.22)
This auxiliary-field redefinition is such that
(6.23)
where
(6.24)
We thus deduce for , that
(6.25)
As this is formally the same as the case of (6.16), elimination of the auxiliary fields now yields the BI Lagrangian density but with replaced by ; i.e.
(6.26)
which is formally identical to .
For all other values of , we have , but is a special case
because then the only -dependence is via the term of , which implies that
upon elimination of . Elimination of then yields , so Born’s original
theory is also Legendre self-dual. The reason for this is that satisfies the condition (6.11).
The results of [23] for the Hamiltonian density for arbitrary show that (6.11) is satisfied
only for and .
6.1 A proof from the CH formula
We shall now present a proof that (electromagnetic) self-duality implies Legendre self-duality,
by taking the CH formula (1.8) as our starting point. We know the first
derivatives of from (1.10), and we know how
depend on and hence on and . This allows us to compute the derivatives
of with respect to both and . Recalling the definitions of
as derivatives of , we find that
(6.27)
where, now,
(6.28)
From this result we may compute . One finds that
(6.29)
which allows us to determine in terms of and . The result is
We thus deduce that is given by the CH formula for the same function that we
used for . This implies that and are identical functions.
7 The NLED/Particle-mechanics correspondence
The starting point for section 3 was the obvious fact that setting the magnetic field to zero
in any Legendre pair of functions and yields functions and ,
respectively, that are a Legendre pair when viewed as functions of and , respectively. We then showed that
this remains true if and are viewed as one-variable functions, and we explained how they are related
to the one-variable functions and that determine and for a self-dual NLED.
We now provide a different interpretation of the functions
and , viewed as functions of and , respectively. Rather than set to zero the magnetic field, as we did in section 3, we replace the Euclidean 3-space with a flat 3-torus of 3-volume , and we truncate the Fourier expansion of fields
on this 3-torus by setting all space derivatives of the 1-form potential to zero.
We then have but also , where is the 3-vector potential, now a function only of the time coordinate . The Lagrangian obtained by integrating over is therefore
(7.1)
where is an arbitrary mass parameter needed to make dimensionless (for unit speed of light).
The Hamiltonian (obtained by Legendre transform of 𝕃) is
(7.2)
where is the Legendre dual of . We may interpret 𝕃 and ℍ
as the Lagrangian and Hamiltonian for a point particle with velocity and
momentum in a locally-Euclidean 3-space.
For Maxwell we have ( and )
(7.3)
which are the Lagrangian and Hamiltonian for a non-relativistic particle of mass .
For the Born-Infeld theory we may (since was an arbitrary mass parameter) set
(7.4)
in which case
(7.5)
which are the Lagrangian and Hamiltonian for a relativistic particle of mass . As a consistency check, we observe that
There is a string-theory interpretation of this correspondence between the Born-Infeld field theory and the massive relativistic particle, because (as mentioned in the Introduction) it is implied by the T-duality relation between the D3-brane and D0-brane of Type II superstring theory. However the correspondence obtained above is more general because it applies to any
self-dual NLED. For ModMaxBorn, for example, we find that
(7.8)
which are again the Lagrangian and Hamiltonian for a relativistic particle but with a “modified light-speed” of , which is subluminal for and superluminal for .
For this “relativistic” particle mechanics model, considered in isolation, we could redefine the “speed of light” to be . However, as derived above, the particle mechanics model describes the ‘corner’ of full self-dual NLED model for which and
is a space-independent but time-dependent 3-vector that we interpret as a particle velocity vector, and the speed of light is what it is in the full theory, i.e. unity. From this perspective, we should expect that the particle velocity can be superluminal only for an acausal NLED, and our ModMaxBorn example confirms this.
8 Summary and outlook
In any generalisation of an established physical theory, such as Maxwell electrodynamics, the question arises of which
features should be preserved and which may be discarded. The principal feature preserved by the Plebanski class
of nonlinear electrodynamics is the canonical structure, and hence the number of degrees of freedom
per space point. This means that small amplitude waves still have two distinct polarisations, but these waves
will typically interact with each other. In addition, they need not travel at light-speed, which leads to the possibility
of superluminal propagation in some backgrounds. This was initially investigated by considering shock waves in
generic smooth electromagnetic backgrounds, but equivalent results are found by considering
plane-wave perturbations of a generic constant uniform background, which can be viewed as a homogeneous
optical medium.
For weak-field backgrounds (typically defined in relation to a Born scale introduced by interactions) the
absence of superluminal propagation can be ensured by imposing simple convexity conditions on the Lagrangian density.
However, generic theories satisfying these conditions will still allow superluminal propagation for some strong-field
backgrounds. The systematic study of this possibility dates back to a 2016 paper by Schellstede et al. [21], whose results
we have confirmed, and explored in the context of models proposed for a variety of phenomenological reasons over the past few decades [23]. One lesson from this work is that the simplest way to find a causal model is to choose one that
is self-dual because weak-field causality implies strong-field causality (given the existence of a weak-field limit) [17].
Thus, one major reason for the study of self-dual NLED theories is that it is easy to separate the causal from the acausal cases.
In fact, this becomes even easier once it is appreciated that the Lagrangian density of any self-dual NLED theories (with a weak-field limit) can be constructed from a corresponding one-variable function . This function, defined on a half-line, provides the boundary condition needed to integrate the PDE that must satisfy for any self-dual theory; we have called this the Courant-Hilbert (CH) construction since the PDE and its solution can be found in [20]. The causality conditions then reduce to simple constraints on the first and second derivatives of the function [17].
The initial aim of this paper was to to extend these results to the Hamiltonian formulation. Self-duality in this context is trivially ensured by restricting the Hamiltonian density
to depend on duality-invariant variables, but now Lorentz invariance requires to satisfy a PDE, which (in appropriate variables) is formally the same as the one that must satisfy to ensure self-duality. This means that there is a CH construction for in terms of some other one-variable function , also defined on a half-line, and is a Legendre-dual pair for any causal self-dual NLED. We have shown that the one-variable “CH-functions” defined by and are also a Legendre-dual pair. This defines a
correspondence between any causal self-dual NLED and a particle-mechanics model, with Born-Infeld corresponding to the massive relativistic point particle.
The results just summarized also simplify the construction of from , and vice versa,
by reducing this problen to the Legendre transform of a one-variable function. However, a much greater
simplification is possible by taking advantage of a ‘dual’ CH construction of from . The fact that
both and can be constructed from implies a very simple relation between them.
A procedure for finding given , for example, is given in the one-line boxed equation
(1.25) of the Introduction. This is one of our main results, derived from an unexpected
‘duality’.
Since and are so simply related, it is natural to suppose that the CH-functions and must also be related in a way that is simpler than via Legendre transform of the associated one-variable functions . This is indeed true for some “simple” cases, such as Born-Infeld, but the general case requires consideration of what we have called “-parity”. The variables are linear combinations of variables , with , and the -parity dual of is
. The “simple” cases referred to above are those for which ; i.e. the -parity invariant NLED theories. For these cases the CH functions , both defined on a half-line, collectively define a single variable on a whole line; more precisely, they are related by the boxed equation (5.13) of section 5.
Generically, and we have a -parity pair of NLED theories with CH-functions and , which are related in a similar way to (5.13), but with a -parity twist; more precisely, the relations are those of the boxed equations (5.11) and (5.12) of section 5. In other words, the CH-function () of is simply related to the CH-function () of , and this reduces to the simple relation of
(5.13) for the -parity invariant cases.
A major theme of this paper has been that many interesting features of self-dual NLED theories are a corollary of simple features of their associated CH functions
. Examples are causality and conformal invariance, and the simple relation between Lagrangian and Hamiltonian densities summarised in the boxed equation (1.25) of the Introduction.
We have shown an alternative CH construction, again described by Courant and Hilbert, introduces a new
CH function, that in “simple” cases is (minus) the Legendre transform of the CH -function. More generally, it is the Legendre transform of the -function of a “-parity” dual theory. The “simple” cases are therefore those that are “-parity” invariant, and the simplest example is
Born-Infeld. We have thus uncovered a new special
property of Born-Infeld that may be of relevance in
its applications, e.g. in string theory.
Various other aspects of generic self-dual NLED theories deserve further investigation. It appears that only Born-Infeld is compatible with maximal (Minkowski spacetime)
supersymmetry, but the constraints of non-maximal supersymmetry are usually weaker. We have omitted coupling to electric and magnetic charges; results of [36], for example, may be generalizable. We also expect the CH construction of self-dual NLED theories to be useful in the exploration of NLED theories coupled to gravity. For example, the spacetime metric describing the analog of the Reissner-Nordstrom black hole might be expected to be invariant under an electromagnetic duality rotation of its parameters. We certainly
expect causality to be a significant issue, and our previous result that the stress-energy tensors of causal NLED theories obey the same energy conditions as the Maxwell stress-energy tensor [30] is
an indication that results for Maxwell-Einstein will
generalize simply to self-dual NLED theories.
To conclude, we remark that our Hamiltonian results can be applied directly to chiral 2-form electrodynamics in 6D Minkowski spacetime, for reasons spelled out in detail in [25]; essentially, one only has to re-interpret the variables . As the 4D NLED theory is then a dimensional reduction from 6D we expect that
the 4D NLED causality conditions on will still apply, and may still be sufficient as well as necessary conditions for causality in 6D. This may be relevant to the recently investigated -flows of 6D chiral
2-form theories [37].
Acknowledgements
PKT has been partially supported by STFC consolidated grant ST/T000694/1. JGR acknowledges financial support from grants 2021-SGR-249 (Generalitat de Catalunya) and a MINECO
grant PID2019-105614GB-C21.
References
[1]
M. Born and L. Infeld,
“Electromagnetic mass,”
Nature 132 (1933) no.3347, 970.1
[2]
M. Born and L. Infeld,
“Foundations of the new field theory,”
Proc. Roy. Soc. Lond. A 144 (1934) no.852, 425-451
[3]
J. Plebański, “Lectures on non-linear electrodynamics”, (The Niels Bohr Institute and NORDITA, Copenhagen, 1970).
[4]
G. Boillat,
“Nonlinear electrodynamics - Lagrangians and equations of motion,”
J. Math. Phys. 11 (1970) no.3, 941-951
[5]
I. Bialynicki-Birula,
“Nonlinear Electrodynamics: Variations on a theme by Born and Infeld”,
in Quantum Theory of Particles and Fields, eds. B. Jancewicz and
J. Lukierski, (World Scientific, 1983) pp. 31-48.
[6]
I. Bialynicki-Birula,
“Field theory of photon dust,”
Acta Phys. Polon. B 23 (1992), 553-559
[7]
J. G. Russo and P. K. Townsend,
“Nonlinear electrodynamics without birefringence,”
JHEP 01 (2023), 039
[arXiv:2211.10689 [hep-th]].
[8]
L. Mezincescu, J. G. Russo and P. K. Townsend,
“Hamiltonian birefringence and Born-Infeld limits,”
[arXiv:2311.04278 [hep-th]].
[9]
E. Schrödinger,
“Contributions to Born’s new theory of the electromagnetic field,”
Proc. Roy. Soc. Lond. A 150 (1935) no.870, 465-477
[10]
E. S. Fradkin and A. A. Tseytlin,
“Nonlinear Electrodynamics from Quantized Strings,”
Phys. Lett. B 163 (1985), 123-130
[11]
E. Bergshoeff, E. Sezgin, C. N. Pope and P. K. Townsend,
“The Born-Infeld Action From Conformal Invariance of the Open Superstring,”
Phys. Lett. B 188, 70 (1987)
[12]
R. G. Leigh,
“Dirac-Born-Infeld Action from Dirichlet Sigma Model,”
Mod. Phys. Lett. A 4 (1989), 2767
[13]
A. A. Tseytlin,
“Born-Infeld action, supersymmetry and string theory,”
[arXiv:hep-th/9908105 [hep-th]].
[14]
J. P. Pereira, J. G. Coelho and R. C. R. de Lima,
“Born–Infeld magnetars: larger than classical toroidal magnetic fields and implications for gravitational-wave astronomy,”
Eur. Phys. J. C 78 (2018) no.5, 361
[arXiv:1804.10182 [astro-ph.SR]].
[15]
V. I. Denisov and S. I. Svertilov,
“Vacuum nonlinear electrodynamic effects in hard emission of pulsars and magnetars,”
Astron. Astrophys. 399 (2003), L39-L42
[arXiv:astro-ph/0305557 [astro-ph]].
[16]
J. Ellis, N. E. Mavromatos, P. Roloff and T. You,
“Light-by-light scattering at future colliders,”
Eur. Phys. J. C 82 (2022) no.7, 634
[arXiv:2203.17111 [hep-ph]].
[17]
J. G. Russo and P. K. Townsend,
“Causal Self-Dual Electrodynamics,”
[arXiv:2401.06707 [hep-th]].
[18]
M. K. Gaillard and B. Zumino,
“Nonlinear electromagnetic selfduality and Legendre transformations,”
[arXiv:hep-th/9712103 [hep-th]].
[19]
G. W. Gibbons and D. A. Rasheed,
“Electric - magnetic duality rotations in nonlinear electrodynamics,”
Nucl. Phys. B 454 (1995), 185-206
[arXiv:hep-th/9506035 [hep-th]].
[20]
R. Courant and D. Hilbert, “Methods of Mathematical Physics”, Vol.II (Wiley Interscience, 1962) pp.91-94.
[21]
G. O. Schellstede, V. Perlick and C. Lämmerzahl,
“On causality in nonlinear vacuum electrodynamics of the Plebański class,”
Annalen Phys. 528 (2016) no.9-10, 738-749
[arXiv:1604.02545 [gr-qc]].
[22]
I. Bandos, K. Lechner, D. Sorokin and P. K. Townsend,
“ModMax meets Susy,”
JHEP 10 (2021), 031
[arXiv:2106.07547 [hep-th]].
[23]
J. G. Russo and P. K. Townsend,
“Born Again,”
SciPost Phys. 16 (2024), 124
[arXiv:2401.04167 [hep-th]].
[24]
I. Bandos, K. Lechner, D. Sorokin and P. K. Townsend,
“A non-linear duality-invariant conformal extension of Maxwell’s equations,”
Phys. Rev. D 102 (2020), 121703
[arXiv:2007.09092 [hep-th]].
[25]
I. Bandos, K. Lechner, D. Sorokin and P. K. Townsend,
“On p-form gauge theories and their conformal limits,”
JHEP 03 (2021), 022
[arXiv:2012.09286 [hep-th]].
[26]
E. Bergshoeff and M. De Roo,
“D-branes and T duality,”
Phys. Lett. B 380 (1996), 265-272
[arXiv:hep-th/9603123 [hep-th]].
[27]
M. B. Green, C. M. Hull and P. K. Townsend,
“D-brane Wess-Zumino actions, t duality and the cosmological constant,”
Phys. Lett. B 382 (1996), 65-72
[arXiv:hep-th/9604119 [hep-th]].
[28]
M. Born,
“Nonlinear theory of the electromagnetic field,”
Ann. Inst. Henri Poincare 7 (1937) no.4, 155-265
[29]
S. M. Kuzenko and S. Theisen,
“Nonlinear selfduality and supersymmetry,”
PoS tmr2000 (2000), 022
[30]
J. G. Russo and P. K. Townsend,
“Causality and Energy Conditions in Nonlinear Electrodynamics,”
[arXiv:2404.09994 [hep-th]].
[31]
M. Perry and J. H. Schwarz,
“Interacting chiral gauge fields in six-dimensions and Born-Infeld theory,”
Nucl. Phys. B 489 (1997), 47-64
[arXiv:hep-th/9611065 [hep-th]].
[32]
B. P. Kosyakov,
“Nonlinear electrodynamics with the maximum allowable symmetries,”
Phys. Lett. B 810 (2020), 135840
[arXiv:2007.13878 [hep-th]].
[33]
B. P. Kosyakov,
“Introduction to the classical theory of particles and fields,”
(Springer 2007).
[34]
S. I. Kruglov,
“On generalized Born-Infeld electrodynamics,”
J. Phys. A 43 (2010), 375402
[arXiv:0909.1032 [hep-th]].
[35]
M. Rocek and A. A. Tseytlin,
“Partial breaking of global D = 4 supersymmetry, constrained superfields, and three-brane actions,”
Phys. Rev. D 59 (1999), 106001
doi:10.1103/PhysRevD.59.106001
[arXiv:hep-th/9811232 [hep-th]].
[36]
K. Lechner, P. Marchetti, A. Sainaghi and D. P. Sorokin,
“Maximally symmetric nonlinear extension of electrodynamics and charged particles,”
Phys. Rev. D 106 (2022) no.1, 016009
[arXiv:2206.04657 [hep-th]].
[37]
C. Ferko, S. M. Kuzenko, K. Lechner, D. P. Sorokin and G. Tartaglino-Mazzucchelli,
“Interacting Chiral Form Field Theories and -like Flows in Six and Higher Dimensions,”
[arXiv:2402.06947 [hep-th]].