Upper and Lower Bounds on Phase-Space Rearrangements

E. J. Kolmes [email protected] Department of Astrophysical Sciences, Princeton University, Princeton, New Jersey 08544, USA N. J. Fisch Department of Astrophysical Sciences, Princeton University, Princeton, New Jersey 08544, USA

(May 9, 2024)

Abstract

Broad classes of plasma phenomena can be understood in terms of phase-space rearrangements. For example, the net effect of a wave-particle interaction may consist of moving populations of particles from one region of phase space to another. Different phenomena drive rearrangements that obey different rules. When those rules can be specified, it is possible to calculate bounds that limit the possible effects the rearrangement could have (such as limits on how much energy can be extracted from the particles). This leads to two problems. The first is to understand the mapping between the allowed class of rearrangements and the possible outcomes that these rearrangements can have on the overall distribution. The second is to understand which rules are appropriate for which physical systems. There has been recent progress on both fronts, but a variety of interesting questions remain.

I Introduction

Consider the interaction of a wave with a plasma. Depending on the details of the interaction, there can be a transfer of energy between the plasma and the wave, with the wave either being amplified or damped. If the wave is amplified, the maximum amplification is set by how much energy it can remove from the plasma. If, in particular, the wave is fed by the kinetic energy of the plasma particles, this means that the wave-particle interaction somehow rearranges those particles in phase space so as to liberate some of their energy.

Given basic rules for how that phase-space rearrangement takes place, it is possible to calculate bounds on what effects those rearrangements can have. For example, this can lead to limits on how much energy can be extracted from a given distribution of particles for very general classes of wave-particle interactions. This allows us to formalize and quantify the idea that some distributions have more accessible energy than others (for example, a bump-on-tail distribution compared with a Maxwellian).

There are different rules that we could pick for these rearrangements, appropriate for different situations, and the ways in which different rules change the possible outcomes are often nontrivial. One simple rule, sometimes called Gardner restacking, is to permit any rearrangement of phase space that respects Liouville’s theorem.[1] Another, sometimes called diffusive exchange, models phase-mixing processes by instead averaging the contents of phase-space elements.[2] Either basic rule can be modified by the imposition of further constraints such as conservation laws[3, 4, 5, 6] (for example, requiring that any rearrangements must preserve one or more adiabatic invariants). Different authors use different conventions, but the energy that can be extracted is sometimes called the free or available energy. We will use these terms interchangeably here. The energy that can be extracted using Gardner’s restacking operations is sometimes called the Gardner free or available energy; the energy that can be extracted using mixing operations is sometimes called the diffusive free or available energy.

This leads to two essential problems. The first is to understand the range of outcomes that a given class of rearrangement operations can bring about.[7, 8, 9, 10] The second is to determine which class of rearrangement operations most appropriately captures the physics of a given physical system.[3, 4, 11, 12, 13, 6]

If we can solve these two problems, then we can construct robust thermodynamic bounds on wave-particle interactions – indeed, on any phase-space rearrangements – for a variety of applications. These include efficiency limits for alpha channeling[14, 2, 15, 16] and models for turbulent transport.[11, 12, 13] In fact, these rearrangement problems are closely connected with (and, in some cases, formally identical to) a variety of other problems both inside and outside plasma physics, in fields ranging from physical chemistry to the quantification of income inequality.[17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33]

This paper is based on an invited talk at the 2023 APS-DPP meeting, and very roughly follows the structure of that talk. Part of the paper functions as an introduction to the subject, summarizing advances over the last few years in understanding and applying theories of free energy in plasma systems. However, several parts of this paper have not been presented elsewhere. In particular, the explicit free-energy calculations in Section IV are new; the application of free-energy theory to loss-cone modes was previously explored in Ref. 6, but that document was focused only on the thresholds at which the free energy vanishes. The proof of the maximum-energy ground state for the loss-cone-truncated Maxwellian in the same section is also new.

This paper is structured as follows. Section II defines the different free energies being considered. Section III discusses what is and is not known about the spectrum of ground states that can be accessed using diffusive exchange operations. Section IV discusses how these free-energy theories play out in the context of loss-cone instabilities in a centrifugal mirror trap, and describes how to constrain the free energy in order to take into account the flute-like nature of many of these modes. Section V is a discussion of the results.

II Defining the Problem

The key to quantifying the amount of energy that can be extracted from a system is to define the rules governing the ways in which that system may be rearranged. Different physical scenarios call for different rules for these rearrangements.

II.1 Two Basic Operations

The earliest version of this theory for plasma physics was introduced by Gardner in 1963.[1] If the process underlying the rearrangement is Hamiltonian, then it must conserve the volumes of elements in phase space. Gardner’s theory (sometimes called Gardner restacking) allows any rearrangement that conserves phase-space volumes. If the phase-space elements are rearranged such that the highest-population elements sit in the lowest-energy parts of phase space, then rearrangements of this kind can remove no further energy from the system. The energy that can be removed by transforming the initial state in this way is the Gardner free energy. The unique final state which has zero Gardner free energy is called the ground state.

For a simple example, consider a discrete phase space consisting of three states, associated with energies $\varepsilon_{0}=0$, $\varepsilon_{1}=1$, and $\varepsilon_{2}=2$, respectively. If these states have initial populations $f_{0}=0$, $f_{1}=2$, and $f_{2}=1$, then a Gardner restacking procedure that transforms the system to its unique Gardner ground state is as follows:

\displaystyle\begin{array}{|c|c|c|}\hline 0&2&1\\ \hline\end{array}\rightarrow\begin{array}{|c|c|c|}\hline 2&0&1\\ \hline\end{array}\rightarrow\begin{array}{|c|c|c|}\hline 2&1&0\\ \hline\end{array}\,.

This maps the system from a state with energy $\sum_{i}f_{i}\varepsilon_{i}=4$ to one with $\sum_{i}f_{i}\varepsilon_{i}=1$ , resulting in a release of 3 units of dimensionless energy.
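The restacking in this example can be reproduced with a short script. The following is a minimal sketch, assuming equal-volume cells and using the populations (0, 2, 1) shown in the first array; the function names `gardner_ground_state` and `total_energy` are illustrative choices and do not come from the references.

```python
def gardner_ground_state(populations, energies):
    """Return the restacked populations: the largest populations are
    assigned to the lowest-energy cells (equal-volume cells assumed)."""
    cells_by_energy = sorted(range(len(energies)), key=lambda i: energies[i])
    pops_descending = sorted(populations, reverse=True)
    ground = [0] * len(populations)
    for cell, pop in zip(cells_by_energy, pops_descending):
        ground[cell] = pop
    return ground


def total_energy(populations, energies):
    """Sum of f_i * epsilon_i over all cells."""
    return sum(f * e for f, e in zip(populations, energies))


energies = [0, 1, 2]
initial = [0, 2, 1]

ground = gardner_ground_state(initial, energies)
released = total_energy(initial, energies) - total_energy(ground, energies)
print(ground, released)  # prints [2, 1, 0] 3
```

Because restacking only permutes the populations, the ground state depends on the initial populations only through their sorted values.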

However, particularly for applications involving wave-particle interactions, the rearrangement of the distribution often involves phase mixing, wherein very fine-scale structures in phase space can lead to apparent smoothing effects. In other words, Liouville’s theorem may still be respected on sufficiently microscopic scales, but on larger scales the dynamics can appear to be diffusive. This motivates an alternative rule, first proposed by Fisch and Rax,[2] in which phase space elements’ contents are averaged rather than being exchanged. These two operations are illustrated in Figure 1.

Figure 1: Above: Gardner’s restacking operation consists of exchanging the populations in two elements of phase space. Below: the diffusive exchange operation consists of mixing the populations of two elements. Mixing of elements in phase space is commonly interpreted as the result of phase mixing on some small scale.

Under diffusive exchange, the maximum energy that can be extracted from the same three-box system considered above can be accessed as follows:

\displaystyle\begin{array}{|c|c|c|}\hline 0&2&1\\ \hline\end{array}\rightarrow\begin{array}{|c|c|c|}\hline 1/2&2&1/2\\ \hline\end{array}\rightarrow\begin{array}{|c|c|c|}\hline 5/4&5/4&1/2\\ \hline\end{array}\,.

This releases $7/4$ units of energy rather than the $3$ that Gardner restacking operations were able to extract. Note, however, that we could have averaged the boxes in a different order, leading to a different ground state:

\displaystyle\begin{array}{|c|c|c|}\hline 0&2&1\\ \hline\end{array}\rightarrow\begin{array}{|c|c|c|}\hline 1&1&1\\ \hline\end{array}\,.

This final state is clearly also a ground state, but it sits at a higher energy than the other, corresponding to an energy release of only 1. Unlike Gardner restacking, diffusive mixing may lead a given initial state to any of a spectrum of possible ground states. Even for a simple three-box system like this one, there may be infinitely many such diffusively accessible ground states. The maximum possible energy release is called the diffusively accessible free energy.
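The two mixing sequences above can be checked with exact rational arithmetic. This is a minimal sketch, assuming equal-volume cells so that a mixing operation is a plain average; the helper names `mix`, `total_energy`, and `is_ground_state` are illustrative.

```python
from fractions import Fraction


def mix(populations, i, j):
    """Diffusive exchange: replace the populations of cells i and j
    with their average (equal-volume cells assumed)."""
    out = list(populations)
    average = (out[i] + out[j]) / 2
    out[i] = out[j] = average
    return out


def total_energy(populations, energies):
    """Sum of f_i * epsilon_i over all cells."""
    return sum(f * e for f, e in zip(populations, energies))


def is_ground_state(populations, energies):
    """A state is a ground state if the population never increases with energy."""
    order = sorted(range(len(energies)), key=lambda i: energies[i])
    ordered = [populations[i] for i in order]
    return all(b <= a for a, b in zip(ordered, ordered[1:]))


energies = [0, 1, 2]
initial = [Fraction(0), Fraction(2), Fraction(1)]
w_initial = total_energy(initial, energies)

# First sequence: mix cells 0 and 2, then cells 0 and 1.
state_a = mix(mix(initial, 0, 2), 0, 1)
# Second sequence: mix cells 0 and 1 only.
state_b = mix(initial, 0, 1)

release_a = w_initial - total_energy(state_a, energies)
release_b = w_initial - total_energy(state_b, energies)
print(state_a, release_a)  # populations 5/4, 5/4, 1/2 and a release of 7/4
print(state_b, release_b)  # populations 1, 1, 1 and a release of 1
```

Both final states pass the `is_ground_state` check, which illustrates that the order of the mixing operations selects among several accessible ground states.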

For a discrete system – that is, a collection of $N$ boxes with populations – the Gardner free energy is always at least as large as the diffusively accessible free energy. Hay, Schiff, and Fisch showed[7, 8] that although an $N$-box system may have an infinite number of diffusively accessible ground states, only a finite number of these could possibly correspond to the maximum energy release. In principle, then, one can simply write down the list of possible candidates and check which is best. However, the number of such candidates goes like $\mathcal{O}(N^{N^{2}})$. This makes a direct search very difficult for larger values of $N$.

II.2 Discrete and Continuous Phase Spaces

There are some systems for which small $N$ is the case of greatest interest: for example, transitions between discrete atomic states.[2] However, for plasma physics applications, we are often interested in systems for which phase space is continuous. If we think of the discrete system as a coarse-grained approximation of a continuous system, this pushes us to the large- $N$ limit, where the approach of direct optimization is least tractable.

It is also possible to define the continuous Gardner and diffusive-exchange problems directly in terms of a continuous system. The Gardner restacking problem can be formulated directly in terms of continuous curves,[34] and can be posed formally in terms of the “symmetric decreasing rearrangement” of the initial state.[35, 36, 37, 38, 39] One way of doing this is to describe the Gardner ground state $f_{G}$ as the decreasing function of energy alone that also satisfies

\displaystyle\int\Theta[f_{G}(\varepsilon(\mathbf{x}))-\lambda]\,\mathrm{d}\mathbf{x}=\int\Theta[f(\mathbf{x})-\lambda]\,\mathrm{d}\mathbf{x}\quad\forall\lambda\in\mathbb{R}

(1)

for the initial distribution $f$ . Here $\mathbf{x}$ is the phase-space coordinate (which generally includes position and velocity) and $\varepsilon(\mathbf{x})$ is the energy of a particle at $\mathbf{x}$ .

The problem of maximum energy extraction under diffusive exchange, as it was originally posed by Fisch and Rax,[2] is to minimize

\displaystyle W_{\text{final}}\doteq\lim_{t\rightarrow\infty}\int\varepsilon(\mathbf{x})f(\mathbf{x},t)\,\mathrm{d}\mathbf{x},

(2)

where $f(\mathbf{x},t)$ is the distribution at time $t$ and phase-space coordinate $\mathbf{x}$ , and where $f$ evolves according to

\displaystyle\frac{\partial f}{\partial t}=\int K(\mathbf{x},\mathbf{x}^{\prime},t)\big[f(\mathbf{x}^{\prime},t)-f(\mathbf{x},t)\big]\,\mathrm{d}\mathbf{x}^{\prime},

(3)

and the optimization is over all kernels $K(\mathbf{x},\mathbf{x}^{\prime},t)$ that are nonnegative and symmetric with respect to exchange of $\mathbf{x}$ and $\mathbf{x}^{\prime}$. (Strictly speaking, the original formulation was in terms of a one-dimensional phase space in which $\mathbf{x}$ is a scalar velocity $v$, but the generalization is straightforward.)
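A discrete analog of Eq. (3) may help make the role of the kernel concrete. The sketch below is an illustration rather than the formulation used in the references: it assumes the integral is replaced by a sum over equal-volume cells, uses a forward-Euler time step, and picks a hypothetical time-independent kernel `K` that couples only cells 0 and 1 of the three-box example.

```python
def step(f, K, dt):
    """One forward-Euler step of df_i/dt = sum_j K[i][j] * (f[j] - f[i])."""
    n = len(f)
    return [
        f[i] + dt * sum(K[i][j] * (f[j] - f[i]) for j in range(n))
        for i in range(n)
    ]


def total_energy(populations, energies):
    """Sum of f_i * epsilon_i over all cells."""
    return sum(f * e for f, e in zip(populations, energies))


energies = [0.0, 1.0, 2.0]
f_initial = [0.0, 2.0, 1.0]

# Hypothetical kernel: symmetric, nonnegative, coupling only cells 0 and 1.
K = [
    [0.0, 1.0, 0.0],
    [1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0],
]

f_final = list(f_initial)
for _ in range(2000):
    f_final = step(f_final, K, dt=0.01)

# The symmetric kernel conserves the total population and relaxes
# the coupled cells toward their common average.
print(f_final, total_energy(f_final, energies))
```

With this kernel the populations of cells 0 and 1 relax toward their average while cell 2 is untouched, so the final state approaches the (1, 1, 1) ground state of the three-box example. Other choices of kernel lead to other final states, which is why the optimization over kernels is difficult.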

Although the original formulation is in terms of this space of kernels $K$ , the space of possible kernels is large and in practice a direct search to find the optimal $K$ is prohibitively difficult. When trying to determine the optimal kernel $K$ – whether to minimize the final energy, or with respect to any other metric – it is typically advantageous to seek an indirect approach. This will be discussed in greater detail in Section III.

II.3 Additional Constraints

Of course, real systems often obey a variety of constraints beyond, for example, phase-space volume conservation. For either of the two rearrangement operations discussed here, it is possible to impose additional constraints – for example, by limiting which phase-space elements can be restacked or mixed with which. This idea was first introduced in the context of Gardner restacking by Helander;[3, 4] it works in essentially the same way for the diffusive exchange problem.[5] Enforcing conservation of a given quantity for each particle means that exchanges are only allowed between pairs of elements with the same value of that quantity. This reduces the rearrangement problem to a set of independent rearrangement problems, each performed on the hyperplane of phase space on which the given quantity is constant. The inclusion of conservation laws for the appropriate adiabatic invariants turns out to be important in the application of the Gardner free energy for certain applications involving turbulence in magnetic confinement systems.[11, 12, 13] Constraints other than per-particle conservation laws are also possible;[6] this will be discussed in more detail in Section IV.
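The reduction to independent subproblems can be sketched for the discrete Gardner problem. The example below is illustrative only: the four-cell system, the labels in `invariants`, and the function name `constrained_restack` are all hypothetical, and equal-volume cells are assumed.

```python
from collections import defaultdict


def constrained_restack(populations, energies, invariants):
    """Gardner restacking in which populations may only be exchanged
    between cells that share the same value of a conserved quantity."""
    groups = defaultdict(list)
    for cell, label in enumerate(invariants):
        groups[label].append(cell)

    ground = list(populations)
    for cells in groups.values():
        # Independent restacking problem on each constant-invariant surface.
        cells_by_energy = sorted(cells, key=lambda i: energies[i])
        pops_descending = sorted((populations[i] for i in cells), reverse=True)
        for cell, pop in zip(cells_by_energy, pops_descending):
            ground[cell] = pop
    return ground


def total_energy(populations, energies):
    """Sum of f_i * epsilon_i over all cells."""
    return sum(f * e for f, e in zip(populations, energies))


energies = [0, 1, 2, 3]
initial = [0, 2, 1, 3]

# Hypothetical labels: cells 0-1 share one invariant value, cells 2-3 another.
constrained = constrained_restack(initial, energies, ["a", "a", "b", "b"])
# Giving every cell the same label recovers unconstrained restacking.
unconstrained = constrained_restack(initial, energies, ["a", "a", "a", "a"])

w0 = total_energy(initial, energies)
print(w0 - total_energy(constrained, energies))    # prints 4
print(w0 - total_energy(unconstrained, energies))  # prints 9
```

In this example the constraint lowers the extractable energy from 9 to 4, since the most populated cell can no longer be moved to the lowest-energy cell.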

III Characterizing the Spectrum of Diffusively Accessible Ground States

Originally, the diffusive-exchange problem was proposed in the context of alpha channeling, where waves are injected in order to remove energy from fusion products.[14, 2] For this reason, the early work on the subject largely focused on determining the upper bound on energy extraction. This upper bound is what we mean when we refer to the diffusively accessible free energy.

This upper-bound problem (corresponding to the lower bound for the energy of the final ground state) was solved in Ref. 9 for continuous systems. It turns out that in a continuous system, it is possible to construct a series of mixing operations that approaches the Gardner ground state arbitrarily closely. It is possible to show (and intuitively straightforward to see) that it is not possible to reach a lower-energy state than the Gardner ground state through mixing. Therefore, the Gardner free energy and the diffusively accessible free energy are identical in the continuous case.

However, there is increasing interest in the application of free-energy calculations to phenomena like turbulence. For these applications, it is desirable not only to understand the largest possible energy release, but also to understand the full range of possible outcomes that can be brought about by mixing operations. This motivates the identification of the minimum stabilizing energy release, which is the lower bound on the possible release of energy that maps the initial state to a ground state. This can be posed equivalently in terms of the highest-energy accessible ground state. Note that although the original formulation of the problem allows mixing operations that can either increase or decrease the energy of the system (and it can be shown that mixing operations that increase the system energy are never necessary in order to reach the maximum possible release of energy[7]), this minimum-energy-release problem is only interesting if “annealing operations” (those that deposit energy into the system) are prohibited. Annealing operations do not correspond to the expected behavior of natural modes, and if they are permitted it is typically possible to reach ground states with much higher energy than the initial state (infinitely higher, for most continuous systems, since one can simply mix the populated regions of phase space with empty regions at arbitrarily high velocity).

The minimum stabilizing energy release is not known for arbitrary initial conditions, but it is known in certain particular cases. For the case of a bump-on-tail distribution, the minimum stabilizing energy release corresponds precisely to the classical quasilinear plateau solution, in which the region of the distribution with the population inversion is simply flattened.[10] Similar plateau-like solutions can be shown to be optimal for certain close relatives of the bump-on-tail distribution. The minimum stabilizing energy release for loss-cone distributions will be discussed in Section IV. For a discrete system, it is possible to enumerate the solutions to this problem when the system is small. This was done explicitly for the three-box system in Ref. 10.

One corollary of the proof in Ref. 9 is that any state that can be reached through Gardner restacking – not just the Gardner ground state – is also accessible (or arbitrarily close to being accessible) through diffusive exchange operations. It follows that any weighted average of these restacked states is also diffusively accessible. The Lynden-Bell equilibrium appears in statistical descriptions of astrophysical systems,[40, 41, 42] and can be understood as an average over an ensemble of systems that individually satisfy Liouville’s theorem for some initial condition. Therefore, it also follows that the Lynden-Bell equilibrium is itself diffusively accessible.

IV Case Study: Flute-Like Loss-Cone Modes in Rotating Mirrors

The stabilization of flute-like loss-cone modes in rotating mirror configurations provides an interesting case study for how the theory of free energy may be applied in practice. There are two reasons for this. First, the intuition behind loss-cone modes revolves around rearrangement operations. The existence of an empty loss region alongside populated regions of phase space at higher energies means that it is possible to release energy by dropping particles from higher-energy trapped regions into lower-energy loss regions.

Second, this system illustrates the role of constraints in the free-energy theory. By first computing the stabilization thresholds directly from the dispersion relations, and then calculating how the free energy depends on the same parameters, it is possible to check how closely the suppression of the free energy corresponds to the stabilization of the various modes. As we will see, the basic form of the Gardner free energy does very poorly at explaining the stabilization thresholds of these modes. However, the inclusion of an additional constraint, taking into account the flute-like nature of the modes, leads to a much better-performing theory.

IV.1 Modeling the Effects of Rotation

To perform this calculation, it is necessary to write down a model for how the distribution function $f(\mathbf{v})$ depends on the mirror parameters. Consider a mirror-type configuration in which the magnetic field strength $B$ is minimized at the midplane and maximized at the edge, with the ratio of the maximum field strength to the minimum strength given by the mirror ratio $R$. Suppose the mirror is rotating, that is, suppose that a largely radial electric field crossed with the largely axial magnetic field causes the plasma to undergo drift in the azimuthal direction. Let $\Delta\Phi$ denote the difference in the combined centrifugal and electrostatic potentials along a field line between the edge and the midplane. Then, so long as the rotation frequency is small compared to the ion cyclotron frequency (so that corrections to the adiabatic invariants can be neglected),[43, 44] the condition for a particle to be trapped can be written as

\displaystyle(R-1)v_{\perp}^{2}-v_{||}^{2}+\frac{2\Delta\Phi}{m}\geq 0,

(4)

where $v_{||}$ and $v_{\perp}$ are the velocity components parallel and perpendicular to the magnetic field and $m$ is the particle mass. There is no unique mapping between $(R,\Delta\Phi)$ and $f(\mathbf{v})$; the details of the distribution will depend on the particle sources, heating terms, and so on. However, it is sensible to expect $f(\mathbf{v})$ to vanish in regions of phase space where Eq. (4) is not satisfied.
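The trapping condition in Eq. (4) translates directly into a predicate on a particle's velocity components. The following is a minimal sketch; the function name `is_trapped` and the sample values are illustrative, with units chosen so that $m=1$.

```python
def is_trapped(v_perp, v_par, R, delta_phi, m):
    """Evaluate the trapping condition of Eq. (4):
    (R - 1) v_perp^2 - v_par^2 + 2 delta_phi / m >= 0."""
    return (R - 1.0) * v_perp**2 - v_par**2 + 2.0 * delta_phi / m >= 0.0


# Illustrative values with mirror ratio R = 2 and m = 1.
print(is_trapped(v_perp=1.0, v_par=0.5, R=2.0, delta_phi=0.0, m=1.0))  # prints True
print(is_trapped(v_perp=0.5, v_par=1.0, R=2.0, delta_phi=0.0, m=1.0))  # prints False
# A positive potential difference traps the same particle.
print(is_trapped(v_perp=0.5, v_par=1.0, R=2.0, delta_phi=0.5, m=1.0))  # prints True
```

The last call shows how a positive $\Delta\Phi$ lifts the loss region to higher parallel velocities.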

A few models have been used for this problem in the literature.[43, 45, 6] One that is both reasonably simple and matches Fokker-Planck simulations reasonably well for some choices of source[6] is a Maxwellian, truncated so as to vanish inside the loss region:

\displaystyle f_{T}(\mathbf{v})=Ae^{-mv^{2}/2T}\,\Theta\bigg[(R-1)v_{\perp}^{2}-v_{||}^{2}+\frac{2\Delta\Phi}{m}\bigg].

(5)

Here $A$ is a normalization constant (which depends on $R$ and $\Delta\Phi$ ), $\Theta$ is the Heaviside step function, and $T$ is the temperature. Spatial variations in $f$ are neglected, and the magnetic field is taken to be a square well, so that it does not vary in the interior of the mirror. For the sake of simplicity, we will focus on this model in this paper. For further discussion of alternatives, advantages, disadvantages, and numerical validation of the model, see Ref. 6. Note that the choice of model for $f$ is an important part of this calculation, and can have a significant impact on the resulting stability thresholds. It can be understood as a kind of initial condition for the analysis.

It is often convenient to work with dimensionless coordinates. Let

\displaystyle\mathbf{u}\doteq\mathbf{v}\sqrt{\frac{m}{2T}}

(6)

and

\displaystyle\phi\doteq\frac{\Delta\Phi}{T}\,.

(7)

Then $f_{T}$ can be rewritten as

\displaystyle f_{T}(\mathbf{u})=Ae^{-u^{2}}\,\Theta\big[(R-1)u_{\perp}^{2}-u_{||}^{2}+\phi\big].

(8)

Let $R>1$ and $\phi\geq 0$ , as these are the cases of greatest practical interest. Then if $f_{T}$ is normalized to $N$ ,

\displaystyle A=N\bigg(\frac{m}{2\pi T}\bigg)^{3/2}\bigg\{\text{erf}\sqrt{\phi}-2\sqrt{\frac{\phi}{\pi}}e^{-\phi}+2\sqrt{\frac{R-1}{\pi R}}e^{\phi/(R-1)}\Gamma\bigg(\frac{3}{2},\frac{R\phi}{R-1}\bigg)\bigg\}^{-1},

(9)

where $\Gamma(a,b)$ is the upper incomplete gamma function.
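The factor in curly brackets in Eq. (9) is the fraction of a full Maxwellian that lies in the trapped region, and it can be checked against direct quadrature. The sketch below uses only the Python standard library; it relies on the identity $\Gamma(3/2,x)=(\sqrt{\pi}/2)\,\text{erfc}\sqrt{x}+\sqrt{x}\,e^{-x}$, and the function names, grid size, and cutoff `u_max` are illustrative choices.

```python
import math


def upper_gamma_three_halves(x):
    """Gamma(3/2, x) = (sqrt(pi)/2) erfc(sqrt(x)) + sqrt(x) exp(-x)."""
    return 0.5 * math.sqrt(math.pi) * math.erfc(math.sqrt(x)) + math.sqrt(x) * math.exp(-x)


def trapped_fraction_closed_form(R, phi):
    """Curly-bracket factor of Eq. (9), valid for R > 1 and phi >= 0."""
    x = R * phi / (R - 1.0)
    return (
        math.erf(math.sqrt(phi))
        - 2.0 * math.sqrt(phi / math.pi) * math.exp(-phi)
        + 2.0 * math.sqrt((R - 1.0) / (math.pi * R))
        * math.exp(phi / (R - 1.0)) * upper_gamma_three_halves(x)
    )


def trapped_fraction_numeric(R, phi, u_max=8.0, n=20000):
    """Midpoint-rule integral of the Maxwellian over the trapped region,
    normalized by the integral over all of velocity space."""
    du = u_max / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * du
        if u * u <= phi:
            solid_angle_fraction = 1.0  # every pitch angle is trapped
        else:
            solid_angle_fraction = math.sqrt((R - 1.0 + phi / (u * u)) / R)
        total += u * u * math.exp(-u * u) * solid_angle_fraction * du
    return 4.0 / math.sqrt(math.pi) * total


for R, phi in [(2.0, 0.0), (2.0, 0.5), (5.0, 1.0)]:
    print(R, phi, trapped_fraction_closed_form(R, phi), trapped_fraction_numeric(R, phi))
```

At $\phi=0$ the closed form reduces to $\sqrt{(R-1)/R}$, the familiar trapped fraction for an isotropic distribution in a simple mirror.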

IV.2 Unconstrained Bounds

To begin, it is instructive to consider the Gardner free energy theory without any further modification. Largely following the notation in Ref. 3, define the level-set volume function

\displaystyle H(\lambda)\doteq\int\Theta\big[f(\mathbf{u})-\lambda\big]\,\mathrm{d}^{3}\mathbf{u}.

(10)

Within the trapped region, $f_{T}(u_{\lambda})=\lambda$ when

\displaystyle u_{\lambda}=\sqrt{-\log\bigg{(}\frac{\lambda}{A}\bigg{)}}.

(11)

Then using spherical $(u,\theta,\varphi)$ coordinates in velocity space, we can calculate the level-set function for $f_{T}$ as follows:

\displaystyle H(\lambda)=2\pi\int_{0}^{u_{\lambda}}u^{2}\,\mathrm{d}u\int_{0}^{\pi}\Theta\bigg[R\sin^{2}\theta-1+\frac{\phi}{u^{2}}\bigg]\sin\theta\,\mathrm{d}\theta.

(12)

Define

\displaystyle\lambda_{*}\doteq Ae^{-\phi}.

(13)

Then

\displaystyle H(\lambda>\lambda_{*})=\frac{4\pi}{3}\bigg[-\log\bigg(\frac{\lambda}{A}\bigg)\bigg]^{3/2}

(14)

and

\displaystyle H(\lambda\leq\lambda_{*})=\frac{4\pi}{3}\phi^{3/2}+\frac{4\pi}{3}\frac{\big[-(R-1)\log(\lambda/A)+\phi\big]^{3/2}-(R\phi)^{3/2}}{\sqrt{R}(R-1)}\,.

(15)

The Gardner ground state $f_{G}$ can be computed by setting

\displaystyle H(f_{G})=\frac{4\pi}{3}\,u^{3}.

(16)

This leads to

\displaystyle f_{G}(\mathbf{u})=\begin{cases}Ae^{-u^{2}}&u<\sqrt{\phi}\\ A\exp\bigg\{\frac{1}{R-1}\bigg[-\big[\sqrt{R}(R-1)\big(u^{3}-\phi^{3/2}\big)+(R\phi)^{3/2}\big]^{2/3}+\phi\bigg]\bigg\}&u\geq\sqrt{\phi}\,.\end{cases}

(17)
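Equations (14), (15), and (17) can be cross-checked numerically: inserting $f_{G}(u)$ into the level-set function should return the volume $4\pi u^{3}/3$ required by Eq. (16). The following is a minimal sketch of that check; the function names and the sample parameters $R=3$, $\phi=1$ are illustrative, and $A$ is set to 1.

```python
import math


def level_set_volume(lam, R, phi, A=1.0):
    """H(lambda) for the truncated Maxwellian, from Eqs. (14) and (15)."""
    log_term = -math.log(lam / A)
    if lam > A * math.exp(-phi):
        return 4.0 * math.pi / 3.0 * log_term**1.5
    s = (R - 1.0) * log_term + phi
    return 4.0 * math.pi / 3.0 * (
        phi**1.5 + (s**1.5 - (R * phi) ** 1.5) / (math.sqrt(R) * (R - 1.0))
    )


def gardner_ground_state(u, R, phi, A=1.0):
    """f_G(u) from Eq. (17)."""
    if u < math.sqrt(phi):
        return A * math.exp(-u * u)
    bracket = math.sqrt(R) * (R - 1.0) * (u**3 - phi**1.5) + (R * phi) ** 1.5
    return A * math.exp((phi - bracket ** (2.0 / 3.0)) / (R - 1.0))


R, phi = 3.0, 1.0
for u in (0.3, 0.8, 1.0, 1.5, 3.0):
    volume = level_set_volume(gardner_ground_state(u, R, phi), R, phi)
    sphere = 4.0 * math.pi / 3.0 * u**3
    print(u, volume, sphere)
```

The two printed volumes agree to rounding error on both sides of $u=\sqrt{\phi}$, which is the matching condition that defines the ground state.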

In order to calculate the energy released when $f_{T}$ is transformed to $f_{G}$ , it is necessary to find the kinetic energy in each distribution.

The energy $W_{T}$ in the initial distribution can be found analytically.

\displaystyle W_{T}\doteq T\bigg(\frac{2T}{m}\bigg)^{3/2}\int\mathrm{d}^{3}\mathbf{u}\,u^{2}f_{T}.

(18)

This can be evaluated to get

\displaystyle W_{T}=\frac{3AT}{2}\bigg(\frac{2\pi T}{m}\bigg)^{3/2}\bigg[\text{erf}\sqrt{\phi}+\frac{R-1-(2/3)\phi}{\sqrt{R(R-1)}}e^{\phi/(R-1)}\,\text{erfc}\sqrt{\frac{R\phi}{R-1}}\,\bigg],

(19)

where erf is the error function and erfc is the complementary error function.

In the limit of $\phi\rightarrow 0$ , this is

\displaystyle W_{T}\big{|}_{\phi=0}=\frac{3NT}{2}\,.

(20)

In this same limit,

\displaystyle f_{G}=A\exp\bigg[-\bigg(\frac{R}{R-1}\bigg)^{1/3}u^{2}\bigg],

(21)

that is, the Gardner-restacked distribution is a Maxwellian with the lower temperature $[(R-1)/R]^{1/3}T$, and the Gardner ground state has energy

\displaystyle W_{G}\big|_{\phi=0}=\frac{3NT}{2}\bigg(\frac{R-1}{R}\bigg)^{1/3}.

(22)

The resulting fractional energy release is

\displaystyle\frac{W_{T}-W_{G}}{W_{T}}\bigg|_{\phi=0}=1-\bigg(\frac{R-1}{R}\bigg)^{1/3}.

(23)

Increasing $R$ reduces the available energy, as it narrows the loss cone and reduces the volume of empty phase space into which particles can be moved.

In the opposite limit of $\phi\rightarrow\infty$ , the loss cone vanishes, $f_{T}$ becomes a Maxwellian with temperature $T$ , $A\rightarrow(m/2\pi T)^{3/2}N$ , and $W_{T}\rightarrow 3NT/2$ . In this limit, $f_{G}$ also becomes a Maxwellian with temperature $T$ , so the Gardner free energy vanishes.

More generally, it is possible to calculate the kinetic energy in $f_{G}$ from Eq. (17) numerically and compare the value with $W_{T}$ to get the fraction of the initial energy released. This is shown in Figure 2 for several choices of $R$ and $\phi$ . The fraction of the initial energy that is released when the system is transformed to its Gardner ground state is a monotonically decreasing function of both $R$ and $\phi$ .
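The numerical comparison described here can be sketched with the standard library alone. The code below is an illustration and not the procedure used to produce Figure 2: it evaluates the bracket of Eq. (19) for the initial energy, integrates $u^{2}f_{G}$ from Eq. (17) with a midpoint rule, and sets $A=1$ because the ratio does not depend on the normalization. The function names, grid size, and cutoff `u_max` are assumptions.

```python
import math


def gardner_ground_state(u, R, phi):
    """f_G(u) from Eq. (17) with A = 1."""
    if u < math.sqrt(phi):
        return math.exp(-u * u)
    bracket = math.sqrt(R) * (R - 1.0) * (u**3 - phi**1.5) + (R * phi) ** 1.5
    return math.exp((phi - bracket ** (2.0 / 3.0)) / (R - 1.0))


def initial_energy(R, phi):
    """Dimensionless integral of u^2 f_T over velocity space (A = 1),
    using the closed form in Eq. (19)."""
    bracket = math.erf(math.sqrt(phi)) + (
        (R - 1.0 - 2.0 * phi / 3.0) / math.sqrt(R * (R - 1.0))
        * math.exp(phi / (R - 1.0))
        * math.erfc(math.sqrt(R * phi / (R - 1.0)))
    )
    return 1.5 * math.pi**1.5 * bracket


def ground_state_energy(R, phi, u_max=10.0, n=20000):
    """Midpoint-rule integral of u^2 f_G over velocity space (A = 1)."""
    du = u_max / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * du
        total += 4.0 * math.pi * u**4 * gardner_ground_state(u, R, phi) * du
    return total


def fractional_release(R, phi):
    """Fraction of the initial kinetic energy released by Gardner restacking."""
    w_initial = initial_energy(R, phi)
    return (w_initial - ground_state_energy(R, phi)) / w_initial


for R in (2.0, 4.0):
    for phi in (0.0, 1.0):
        print(R, phi, fractional_release(R, phi))
```

The printed values decrease as either $R$ or $\phi$ increases, in line with the monotonic trend described above.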

Recall from Section III that for a continuous system, the Gardner free energy also constitutes the least upper bound on the energy that can be released using diffusive mixing operations. However, we might also wish to know the minimum stabilizing energy release: that is, the least energy that can be released while still mapping $f_{T}$ to a ground state (or, more formally, the infimum of the accessible range). Interestingly, this bound is zero; it is possible to construct a sequence of mixing operations that results in a ground state while releasing vanishingly little energy.

To see this, consider the part of the loss region with energy $\varepsilon$ . Consider a series of mixing operations that mixes the contents of this shell with the part of the trapped region with energy $\varepsilon+\delta\varepsilon$ , where $\delta\varepsilon$ is taken to be very small. It is possible to use mixing operations to equalize the density of $f$ throughout the part of the loss region with energy $\varepsilon$ and the part of the trapped region with energy $\varepsilon+\delta\varepsilon$ , and to do so while releasing energy at every step. If every part of the loss region with energy $\varepsilon$ were homogenized with the part of the trapped region with energy $\varepsilon+\delta\varepsilon$ in this way, then in the limit where $\delta\varepsilon\rightarrow 0$ , we would map the initial state to a ground state at the same energy.

IV.3 Constraint for Flute-Like Modes

With these results in hand, we can return to the question of how to match the free-energy results with the behavior of the loss-cone instabilities. Many of the most important of these modes are flute-like: their wave-numbers vanish in the direction of the magnetic field $(k_{||}=0)$ . Physically, this is related to the large mobility of the electrons in the direction of the magnetic field.[46] Flute-like loss-cone modes include the high-frequency convective loss cone (HFCLC), drift-cyclotron loss cone (DCLC), and Dory-Guest-Harris (DGH) modes.[47, 48, 49, 46, 50]

In the limit of sufficiently large $\phi$ , these instabilities must be suppressed: at some point, the loss region has been lifted to such high energies that no particles are affected by it. The key question, for present purposes, is whether the suppression of the instabilities correlates closely with the suppression of the free energy. In other words: is the free energy a good way of predicting the behavior of these modes? This is interesting as a way of testing the free-energy theory. It is also interesting for the purposes of understanding this class of modes. After all, there are many different loss-cone modes, so it would be very convenient to evaluate stability thresholds using a single free-energy metric rather than needing to check each mode’s dispersion relation separately.

Calculations of the HFCLC rotational stabilization criteria can be found in Refs. 43, 45. Calculations of the HFCLC, DCLC, and DGH stabilization criteria can be found in Ref. 6. The details of the behavior of these modes are not the focus of this paper, but we will outline the key results as relevant for the comparison with the free-energy theory.

1. All three modes become stable when $\phi$ is sufficiently large and positive.
2. For distributions modeled by $f_{T}$, stabilizing values of $\phi$ are typically $\phi\leq 1$.
3. For distributions modeled by $f_{T}$, the HFCLC and DCLC stabilizing value of $\phi$ is higher when $R$ is higher. This behavior is not a unique feature of $f_{T}$, and also appears in other analytic and numerical models. (The DGH is typically only unstable for $f_{T}$ when $\phi\leq 0$.)
4. For any distribution $f(v_{||},v_{\perp})$, all three modes are stable if the projection $\int f\,\mathrm{d}v_{||}$ is a monotonically decreasing function of $v_{\perp}$. This is a sufficient condition for stability, but not a necessary condition.

These points are discussed in greater detail in Ref. 6.

Point by point, are these characteristics successfully captured by the unmodified free-energy theory discussed in Section IV.2?

1. Yes. We can see from Figure 2 that the available energy vanishes when $\phi$ is large.
2. No. When $\phi=1$, the available energy curves shown in Figure 2 may be reduced only to roughly $1/4$ of their values at $\phi=0$. This suggests that the mode might saturate at a lower level, but it does not suggest that the mode should vanish altogether.
3. No. The unmodified free-energy theory predicts the opposite trend, with lower free energy when $R$ is larger.
4. No. The unmodified free-energy theory predicts that a system is in a stable ground state when $f$ is a monotonically decreasing function of energy, not when its projection is monotonic.

The unmodified version of the free-energy theory does rather poorly at predicting when these modes will be stable. This is because it is missing a key constraint.[6]

A hint can be found in the appearance of the projected distribution in point (4). The quasilinear velocity-space diffusion operator can be written as[51]

\displaystyle\frac{\partial f}{\partial t}\bigg|_{\text{QL}}=\frac{\partial}{\partial\mathbf{v}}\cdot\mathsf{D}\cdot\frac{\partial}{\partial\mathbf{v}}\,f,

(24)

where

\displaystyle\mathsf{D}\doteq D_{0}\int\frac{\omega_{i}\mathcal{E}_{\mathbf{k}}}{(\mathbf{k}\cdot\mathbf{v}-\omega_{r})^{2}+\omega_{i}^{2}}\frac{\mathbf{k}\mathbf{k}}{k^{2}}\,\mathrm{d}\mathbf{k}.

(25)

Here $D_{0}$ is a species-dependent constant, $\mathcal{E}_{\mathbf{k}}$ is the spectral energy density, $\omega_{r}$ and $\omega_{i}$ are the real and imaginary parts of the wave frequency, and $\mathbf{k}$ is the wavenumber. Recall that the modes under consideration are all flute-like. If $k_{||}=0$, then the quasilinear diffusion operator does not drive velocity-space diffusion in the parallel direction, and the operator does not distinguish between different values of $v_{||}$.
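The last statement can be checked directly from Eqs. (24) and (25). As a brief sketch, let $\hat{\mathbf{b}}$ denote the unit vector along the magnetic field (a symbol introduced here only for illustration) and suppose that every mode in the spectrum has $k_{||}=\hat{\mathbf{b}}\cdot\mathbf{k}=0$. Then

$$\hat{\mathbf{b}}\cdot\mathbf{k}\mathbf{k}=0\;\Rightarrow\;\hat{\mathbf{b}}\cdot\mathsf{D}=\mathsf{D}\cdot\hat{\mathbf{b}}=0,\qquad\mathbf{k}\cdot\mathbf{v}=\mathbf{k}\cdot\mathbf{v}_{\perp},$$

so $\mathsf{D}$ has no parallel components and depends on $\mathbf{v}$ only through $\mathbf{v}_{\perp}$. Equation (24) then becomes

$$\frac{\partial f}{\partial t}\bigg|_{\text{QL}}=\frac{\partial}{\partial\mathbf{v}_{\perp}}\cdot\mathsf{D}(\mathbf{v}_{\perp})\cdot\frac{\partial f}{\partial\mathbf{v}_{\perp}},$$

in which $v_{||}$ enters only as a label. Integrating over $v_{||}$ shows that the projection $\int f\,\mathrm{d}v_{||}$ obeys the same equation.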

This motivates a constraint on the allowed rearrangement operations.[6] If the mixing is driven by flute-like modes, then we should allow only mixing in the perpendicular direction, and we should require that any rearrangement acting on a point $(v_{||},\mathbf{v}_{\perp})$ must act identically on all other points with the same $\mathbf{v}_{\perp}$. Intuitively, this means that flute-like rearrangements act on the projection $\int f\,\mathrm{d}v_{||}$ and cannot access any free energy associated with population inversions in the parallel direction.
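A discrete toy version may make the constraint concrete. The sketch below is an illustration under stated assumptions, not the calculation of Ref. 6: phase space is split into equal-volume cells `f[i][j]`, where `i` labels the perpendicular velocity and `j` the parallel velocity, and the function names are hypothetical. A flute-like rearrangement may only permute whole columns `f[i]`, so the releasable energy is the Gardner restacking energy of the column totals (the discrete projection).

```python
def flute_available_energy(f, eps_perp, eps_par):
    """Energy released by restacking whole columns f[i] (fixed perpendicular index).

    Assumes equal-volume cells with energy eps_perp[i] + eps_par[j].
    Returns (available energy, fraction of the total kinetic energy).
    """
    totals = [sum(column) for column in f]  # discrete projection over v_par
    w_perp = sum(e * t for e, t in zip(eps_perp, totals))
    # Ground state: largest column totals sit at the lowest perpendicular energies.
    w_perp_ground = sum(
        e * t for e, t in zip(sorted(eps_perp), sorted(totals, reverse=True))
    )
    # Parallel energy moves with each column and is therefore inaccessible.
    w_par = sum(column[j] * eps_par[j] for column in f for j in range(len(eps_par)))
    available = w_perp - w_perp_ground
    return available, available / (w_perp + w_par)


def gardner_available_energy(f, eps_perp, eps_par):
    """Unconstrained Gardner restacking on the same grid, for comparison."""
    cells = [
        (eps_perp[i] + eps_par[j], f[i][j])
        for i in range(len(eps_perp))
        for j in range(len(eps_par))
    ]
    energies = sorted(e for e, _ in cells)
    populations = sorted((p for _, p in cells), reverse=True)
    w_initial = sum(e * p for e, p in cells)
    w_ground = sum(e * p for e, p in zip(energies, populations))
    return w_initial - w_ground


# A distribution with a parallel population inversion but a monotone projection.
f = [[0.0, 3.0], [1.0, 1.0]]
print(flute_available_energy(f, [0.0, 1.0], [0.0, 1.0]))
print(gardner_available_energy(f, [0.0, 1.0], [0.0, 1.0]))
```

In this example the unconstrained restacking finds energy to release, while the flute-like restacking finds none, because the column totals already decrease with perpendicular energy.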

Continuing to take $\phi\geq 0$ and $R>1$,

$$\int_{-\infty}^{+\infty}f_{T}(\mathbf{u})\,\mathrm{d}u_{||}=\sqrt{\pi}Ae^{-u_{\perp}^{2}}\,\text{erf}\sqrt{(R-1)u_{\perp}^{2}+\phi}\,. \tag{26}$$

The projection is a monotonically decreasing function of $u_{\perp}$ if and only if

$$\sqrt{\pi\phi}\,e^{\phi}\,\text{erf}\sqrt{\phi}\geq R-1. \tag{27}$$
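Condition (27) is easy to evaluate numerically. The following is a minimal sketch, with hypothetical function names, that evaluates the left-hand side of Eq. (27), finds the smallest stabilizing $\phi$ for a given $R$ by bisection (the left-hand side increases monotonically with $\phi$), and cross-checks the criterion against a brute-force monotonicity test of the projection in Eq. (26), taken up to the constant factor $\sqrt{\pi}A$.

```python
import math


def flute_threshold_lhs(phi):
    """Left-hand side of Eq. (27)."""
    return math.sqrt(math.pi * phi) * math.exp(phi) * math.erf(math.sqrt(phi))


def projection_is_monotone(R, phi):
    """Analytic criterion, Eq. (27)."""
    return flute_threshold_lhs(phi) >= R - 1.0


def stabilizing_phi(R, tol=1e-12):
    """Smallest phi >= 0 satisfying Eq. (27), found by bisection (assumes R > 1)."""
    target = R - 1.0
    lo, hi = 0.0, 1.0
    while flute_threshold_lhs(hi) < target:
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if flute_threshold_lhs(mid) < target:
            lo = mid
        else:
            hi = mid
    return hi


def projection(u_perp_sq, R, phi):
    """Eq. (26) divided by sqrt(pi) * A, as a function of u_perp squared."""
    return math.exp(-u_perp_sq) * math.erf(math.sqrt((R - 1.0) * u_perp_sq + phi))


def projection_is_monotone_numeric(R, phi, x_max=10.0, n=20000):
    """Brute-force check that the projection never increases on a grid."""
    values = [projection(x_max * i / n, R, phi) for i in range(n + 1)]
    return all(b <= a for a, b in zip(values, values[1:]))


for R in (2.0, 5.0, 20.0):
    print(R, stabilizing_phi(R))
```

The threshold returned by `stabilizing_phi` grows with `R`, which is the trend described in point (3) above.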

The perpendicular energy that can be released by restacking operations with this “flute-like” constraint is equivalent to the original Gardner problem in two dimensions for the projected distribution $\int f_{T}\,\mathrm{d}u_{||}$. For the purposes of tracking the fraction of the total energy that is available, we also need to take into account the (entirely inaccessible) parallel component of the kinetic energy. This parallel component is

$$W_{T||}\doteq T\bigg(\frac{2T}{m}\bigg)^{3/2}\int\mathrm{d}^{3}\mathbf{u}\,u_{||}^{2}f_{T} \tag{28}$$

$$W_{T||}=\frac{AT}{2}\bigg(\frac{2\pi T}{m}\bigg)^{3/2}\bigg[-\frac{2}{R}\sqrt{\frac{\phi}{\pi}}e^{-\phi}+\text{erf}\sqrt{\phi}+\bigg(\frac{R-1}{R}\bigg)^{3/2}e^{\phi/(R-1)}\,\text{erfc}\sqrt{\frac{R\phi}{R-1}}\,\bigg]. \tag{29}$$
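As a sanity check on the closed form, the bracket in Eq. (29) can be compared with a direct quadrature. The sketch below makes two assumptions that should be read as such: it takes $f_{T}$ to be a Maxwellian $Ae^{-u^{2}}$ truncated to $u_{||}^{2}\leq(R-1)u_{\perp}^{2}+\phi$, which is the form consistent with the projection in Eq. (26), and it reads the exponent in Eq. (29) as $\phi/(R-1)$, which is what the direct integration gives. The function names are hypothetical.

```python
import math


def parallel_energy_bracket(R, phi):
    """Square bracket of Eq. (29), with the exponent read as phi / (R - 1)."""
    ratio = (R - 1.0) / R
    return (
        math.erf(math.sqrt(phi))
        - (2.0 / R) * math.sqrt(phi / math.pi) * math.exp(-phi)
        + ratio**1.5
        * math.exp(phi / (R - 1.0))
        * math.erfc(math.sqrt(phi / ratio))
    )


def parallel_energy_bracket_numeric(R, phi, u_max=8.0, n=40000):
    """Same quantity from the definition in Eq. (28), by the trapezoid rule.

    The perpendicular integral is done analytically: it equals pi for
    u_par^2 <= phi and pi * exp(-(u_par^2 - phi) / (R - 1)) otherwise.
    """

    def integrand(u):
        excess = max(u * u - phi, 0.0)
        return u * u * math.exp(-u * u - excess / (R - 1.0))

    h = u_max / n
    total = 0.5 * (integrand(0.0) + integrand(u_max))
    total += sum(integrand(i * h) for i in range(1, n))
    # Factor 2 for the symmetric range, 2 / sqrt(pi) for the normalization.
    return (2.0 / math.sqrt(math.pi)) * 2.0 * total * h


for R, phi in [(2.0, 0.0), (2.0, 0.5), (5.0, 1.0)]:
    print(R, phi, parallel_energy_bracket(R, phi), parallel_energy_bracket_numeric(R, phi))
```

For large $\phi/(R-1)$ the product of the growing exponential and the small complementary error function can lose precision or overflow, so a scaled form would be preferable in that regime.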

Taking this into account, Figure 3 shows the fraction of the energy that is available in $f_{T}$ under this new constraint. The constraint reduces the fraction of the energy that is available, but more importantly, it qualitatively changes the dependence of the free energy on the mirror ratio $R$. Before adding the constraint, the free energy was a monotonically decreasing function of $R$. After adding the constraint, it is generally non-monotonic, though it always becomes a decreasing function of $R$ when $R$ becomes sufficiently large. This is shown most clearly in the case highlighted in Figure 4. The resulting stability condition was first introduced in Ref. 6.

The non-monotonic dependence on $R$ is a surprise, but it is understandable. In fact, this non-monotonicity helps to resolve a discrepancy between the usual intuitions about the role of the mirror ratio and the actual behavior of the stabilization thresholds. If the free energy exists due to the loss cone, and larger $R$ means a smaller loss cone, then it would be reasonable to expect that larger $R$ should translate to less free energy. Indeed, this is what appears in Figure 2 for the unconstrained free energy. However, the $\phi$ thresholds for stabilizing flute-like loss-cone modes are higher when $R$ is larger. The reason is that as $R$ decreases, there is more and more free energy, but less and less of it is accessible to the kinds of rearrangements that flute-like modes can produce. The original intuition is recovered in the large-$R$ limit: the modes may not stabilize, but the amount of free energy available to them eventually drops as $R$ increases (likely corresponding to modes that are unstable but saturate at a very low level). This happens because, as $R$ increases, the loss-cone modes can access greater and greater fractions of the free energy, but there is less and less of it to begin with.

IV.4 Modification to Account for Lost Particles

The analysis thus far has treated loss-cone modes as being those modes which are associated with loss-cone distributions. In other words, the presence of a loss cone is associated with a particular class of kinetic distributions (those which vanish inside the loss cone), and distributions with this structure generally have population inversions which can drive instabilities. However, there is a meaningful distinction between the free energy in a system with an actual loss region and the free energy in a system for which the initial distribution simply happens to vanish in a given region. If a phase-space element is moved into a loss region, it promptly exits the system, and the loss region does not remain occupied. In the simplest case, it can be modeled as leaving the system without any additional interaction, so that whatever energy it has in its final phase-space position (inside the loss region) is carried away and not counted as available. This idea was first explored in Ref. 6, and is illustrated in Figure 5.

In many cases, this can substantially increase the quantity of energy that can be extracted. For example, in a non-rotating mirror, the loss cone includes a region at vanishingly small energy, so in the absence of any additional constraints, Gardner restacking operations can release the entire energy content of the system.

The effects of the loss-cone sink are perhaps most interesting in the case of the free energy with the “flute-like” constraint discussed in Section IV.3. Without the loss region, the constraint prevents the system from performing rearrangements that distinguish between different values of $v_{||}$; entire columns of phase-space elements with a given $\mathbf{v}_{\perp}$ must be moved together. With the loss region, it is possible to partially circumvent this constraint by moving the column of elements so that only part of it falls into the loss region. Then a segment of the column can be left behind while the surviving phase-space elements can be moved elsewhere.

In other forms of the Gardner and diffusive-exchange theories, it is never necessary to perform so-called “annealing” operations (that is, those that raise the energy of the system) in order to reach the minimum-energy ground state.[7] However, when we include both a loss region and the flute-like constraint, there are scenarios in which the minimum-energy state can only be reached using sequences of rearrangements that include annealing (because these operations are sometimes necessary in order to drop off part of a column of phase-space elements in the loss region).

Interestingly, this implies that a configuration can be a ground state if the loss region is treated as unoccupied space, but have nonzero available energy if the loss region is instead treated as a sink. In other words, it suggests the existence of instabilities which rely on the loss of particles that get into the loss region, and it suggests that these instabilities would not be captured by an analysis which treated the loss region only as an initially unoccupied part of phase space (as, indeed, analytic treatments of loss-cone modes typically do).

The appearance of these annealing operations motivates a distinction between strong and weak ground states. In a strong ground state, no sequence of rearrangements can possibly lead to a lower-energy state. In a weak ground state, no single rearrangement operation can reduce the energy of the state. A state that would be a ground state in the absence of any loss-cone sink is always a weak ground state in the presence of the sink, but may not be a strong ground state. The stability conditions that appear in linear analyses of the HFCLC, DCLC, and DGH modes appear to more closely match the weak ground-state condition.[6] This makes sense, given that these linear analyses did not include any particle sinks in the loss-cone region, instead treating the loss-cone as simply a region of phase space which happens to be unoccupied by the leading-order kinetic distribution.
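The two definitions can be stated compactly in code. The sketch below is an abstract toy, not a plasma model: states are nodes of a small hand-built graph, `moves` lists the states reachable in one rearrangement, and all names and numbers are hypothetical. A weak ground state has no single move that lowers the energy; a strong ground state has no reachable state of lower energy, even via intermediate energy-raising (annealing) moves.

```python
from collections import deque


def is_weak_ground_state(state, energy, moves):
    """True if no single allowed move lowers the energy."""
    return all(energy[nxt] >= energy[state] for nxt in moves.get(state, ()))


def is_strong_ground_state(state, energy, moves):
    """True if no sequence of allowed moves reaches a lower-energy state."""
    seen = {state}
    queue = deque([state])
    while queue:
        current = queue.popleft()
        if energy[current] < energy[state]:
            return False
        for nxt in moves.get(current, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return True


# Toy example: reaching "c" from "a" requires passing through the
# higher-energy state "b", which plays the role of an annealing step.
# The move into "c" is irreversible, as it would be for lost particles.
energy = {"a": 1.0, "b": 1.5, "c": 0.5}
moves = {"a": ["b"], "b": ["a", "c"], "c": []}

print(is_weak_ground_state("a", energy, moves))
print(is_strong_ground_state("a", energy, moves))
```

In this toy graph, state `"a"` passes the weak test but fails the strong one, which mirrors the situation described above for configurations that are ground states only when the loss region is treated as unoccupied space.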

V Discussion

Theories of free or available energy offer a way to calculate very general bounds on the possible behavior of plasma systems without the need to resolve (or even specify) all of the details of the dynamics of those systems. This can be useful in cases where the bound is computationally less expensive than the detailed calculation, as is often the case for applications involving turbulence.[11, 12, 13] It can also be intrinsically useful to be able to draw conclusions about entire classes of systems with a single calculation. For example, there are many flute-like loss-cone modes,[46] and the calculations discussed in Section IV should be applicable to all of them. The approach elaborated upon here, namely considering phase-space rearrangements as a series of steps operating on a finite set of phase-space volumes, has been shown to be particularly powerful despite its apparent simplicity. In the limit of many volumes, we were able to recover the surprising result that Gardner restacking could be achieved through diffusive operations. We were also able to recover theorems in a variety of constrained problems.

For processes described by the diffusive-exchange operator, one major problem of interest is to characterize the spectrum of possible states that can be reached from a given initial condition.[2, 7, 8, 5, 9, 10] When phase space is continuous, the Gardner ground state is the lower bound for the final energy, and it has been proved that it is possible to get arbitrarily close to that state diffusively.[9] The highest-energy accessible ground state (when only energy-releasing operations are allowed) is known in certain cases but not others. Ref. 10 shows that the quasilinear plateau solution is the highest-energy accessible ground state for the bump-on-tail distribution. This paper shows that the highest-energy ground state for a loss-cone-truncated Maxwellian is simply an isotropic Maxwellian at the same temperature.
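A small discrete example illustrates how the Gardner ground state bounds the diffusively accessible states. The sketch assumes that a diffusive-exchange operation replaces the populations of two equal-volume cells by their average, and it uses a greedy ordering (always the pair with the largest energy release), which is only one of many possible orderings and is in general neither the best nor the worst. Function names are hypothetical.

```python
def energy(f, eps):
    """Total energy of populations f on cells with energies eps."""
    return sum(fi * ei for fi, ei in zip(f, eps))


def gardner_ground_energy(f, eps):
    """Energy after restacking: largest populations at the lowest energies."""
    return sum(fi * ei for fi, ei in zip(sorted(f, reverse=True), sorted(eps)))


def greedy_diffusive_relaxation(f, eps, max_steps=10000, tol=1e-12):
    """Repeatedly average the pair of cells that releases the most energy."""
    f = list(f)
    for _ in range(max_steps):
        best_release, best_pair = tol, None
        for i in range(len(f)):
            for j in range(i + 1, len(f)):
                # Averaging cells i and j releases (eps_j - eps_i)(f_j - f_i)/2.
                release = 0.5 * (eps[j] - eps[i]) * (f[j] - f[i])
                if release > best_release:
                    best_release, best_pair = release, (i, j)
        if best_pair is None:
            break  # no energy-releasing exchange remains
        i, j = best_pair
        f[i] = f[j] = 0.5 * (f[i] + f[j])
    return f


eps = [0.0, 1.0, 2.0]
f0 = [0.0, 0.0, 1.0]
f_final = greedy_diffusive_relaxation(f0, eps)
print(energy(f0, eps), energy(f_final, eps), gardner_ground_energy(f0, eps))
```

With only three cells the greedy diffusive sequence stops well above the Gardner ground-state energy; the result cited above concerns the continuous limit, where the gap can be made arbitrarily small.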

Another major problem of interest – both for Gardner restacking and for diffusive exchange – is to identify and understand the appropriate additional constraints to impose. On its own, the Gardner free energy is often a significant overestimate of how much energy is realistically extractable from a given system. One reason for this is that real systems often obey a variety of constraints in addition to Liouville’s theorem. However, it is possible to formulate a constrained rearrangement problem in which other constraints are also considered. This was first done for the case of constraints that take the form of conservation laws applied to each phase-space element.[3, 4] However, there are also systems for which the relevant constraints take other forms.

One example of this is the free energy associated with the loss region of a mirror configuration (rotating or otherwise). Many of the major loss-cone instabilities are flute-like, which means that they do not rearrange phase-space elements in the velocity direction parallel to the magnetic field, and moreover that they cannot separately rearrange phase-space elements with different values of $v_{||}$. This leads to a more restrictive additional constraint on the allowed rearrangements. It turns out that this constraint greatly improves how well the Gardner free energy tracks the actual stability thresholds of the modes.

Despite progress on these problems, there is work yet to be done. There remain open questions regarding the characterization of the spectrum of diffusively accessible states. In addition, there are many systems for which the question of which constraints are necessary (and how those constraints behave) has not yet been explored. As our understanding of these rearrangement processes improves, there are good reasons to hope that theories of free or available energy can be an increasingly practical tool with which to understand the behavior of plasma systems.

Acknowledgements.

The authors thank Robbie Ewart, Alex Glasser, Mike Mlodik, Ian Ochs, Jean-Marcel Rax, Tal Rubin, and Alex Schekochihin for helpful conversations. This work was supported by ARPA-E Grant No. DE-AR0001554. This work was also supported by the DOE Fusion Energy Sciences Postdoctoral Research Program, administered by the Oak Ridge Institute for Science and Education (ORISE) and managed by Oak Ridge Associated Universities (ORAU) under DOE Contract No. DE-SC0014664.

Data Availability Statement

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

References

Gardner [1963] C. S. Gardner, Phys. Fluids 6, 839 (1963).
Fisch and Rax [1993] N. J. Fisch and J.-M. Rax, Phys. Fluids B 5, 1754 (1993).
Helander [2017] P. Helander, J. Plasma Phys. 83, 715830401 (2017).
Helander [2020] P. Helander, J. Plasma Phys. 86, 905860201 (2020).
Kolmes et al. [2020] E. J. Kolmes, P. Helander, and N. J. Fisch, Phys. Plasmas 27, 062110 (2020).
Kolmes et al. [2024] E. J. Kolmes, I. E. Ochs, and N. J. Fisch, J. Plasma Phys. 90, 905900203 (2024).
Hay et al. [2015] M. J. Hay, J. Schiff, and N. J. Fisch, Phys. Plasmas 22, 102108 (2015).
Hay et al. [2017] M. J. Hay, J. Schiff, and N. J. Fisch, Physica A 473, 225 (2017).
Kolmes and Fisch [2020] E. J. Kolmes and N. J. Fisch, Phys. Rev. E 102, 063209 (2020).
Kolmes and Fisch [2022] E. J. Kolmes and N. J. Fisch, Phys. Rev. E 106, 055209 (2022).
Mackenbach et al. [2022] R. J. J. Mackenbach, J. H. E. Proll, and P. Helander, Phys. Rev. Lett. 128, 175001 (2022).
Mackenbach et al. [2023a] R. J. J. Mackenbach, J. H. E. Proll, R. Wakelkamp, and P. Helander, J. Plasma Phys. 89, 905890513 (2023a).
Mackenbach et al. [2023b] R. J. J. Mackenbach, J. H. E. Proll, G. Snoep, and P. Helander, J. Plasma Phys. 89, 905890522 (2023b).
Fisch and Rax [1992] N. J. Fisch and J.-M. Rax, Phys. Rev. Lett. 69, 612 (1992).
Fisch and Herrmann [1995] N. J. Fisch and M. C. Herrmann, Nucl. Fusion 35, 1753 (1995).
Fetterman and Fisch [2008] A. J. Fetterman and N. J. Fisch, Phys. Rev. Lett. 101, 205003 (2008).
Dalton [1920] H. Dalton, Econ. J. 30, 348 (1920).
Horn [1964] F. Horn, Attainable and non-attainable regions in chemical reaction techniques, in Proceedings of the 3rd European Symposium on Chemical Reaction Engineering (Pergamon, 1964) pp. 1–10.
Atkinson [1970] A. B. Atkinson, J. Econ. Theory 2, 244 (1970).
Berk et al. [1970] H. L. Berk, C. E. Nielsen, and K. V. Roberts, Phys. Fluids 13, 980 (1970).
Bartholomew [1971] P. Bartholomew, Mon. Not. R. Astron. Soc. 151, 333 (1971).
Zylka [1985] C. Zylka, Theor. Chim. Acta 68, 363 (1985).
Morrison and Pfirsch [1989] P. J. Morrison and D. Pfirsch, Phys. Rev. A 40, 3898 (1989).
Morrison [1998] P. J. Morrison, Rev. Mod. Phys. 70, 467 (1998).
Thon and Wallace [2004] D. Thon and S. W. Wallace, Soc. Choice Welfare 22, 447 (2004).
Aboudi et al. [2010] R. Aboudi, D. Thon, and S. W. Wallace, J. Econ. Inequal. 8, 47 (2010).
Lemou et al. [2012] M. Lemou, F. Méhats, and P. Raphaël, Invent. Math. 187, 145 (2012).
Levy et al. [2014] M. C. Levy, S. C. Wilks, M. Tabak, S. B. Libby, and M. G. Baring, Nat. Commun. 5, 4149 (2014).
Lostaglio et al. [2015] M. Lostaglio, K. Korzekwa, D. Jennings, and T. Rudolph, Phys. Rev. X 5, 021001 (2015).
Brandão and Gour [2015] F. G. S. L. Brandão and G. Gour, Phys. Rev. Lett. 115, 070503 (2015).
Baldovin et al. [2016] F. Baldovin, A. Cappellaro, E. Orlandini, and L. Salasnich, J. Stat. Mech. 2016, 063303 (2016).
Korzekwa et al. [2019] K. Korzekwa, C. T. Chubb, and M. Tomamichel, Phys. Rev. Lett. 122, 110403 (2019).
Hosking et al. [2024] D. N. Hosking, D. Wasserman, and S. C. Cowley, Metastability of stratified magnetohydrodynamic equilibria and their relaxation, arXiv:2401.01336 (2024).
Dodin and Fisch [2005] I. Y. Dodin and N. J. Fisch, Phys. Lett. A 341, 187 (2005).
Riesz [1930] F. Riesz, J. London Math. Soc. 5, 162 (1930).
Hardy et al. [1934] G. H. Hardy, J. E. Littlewood, and G. Pólya, Inequalities (Cambridge University Press, Cambridge, UK, 1934).
Brascamp et al. [1974] H. J. Brascamp, E. H. Lieb, and J. M. Luttinger, J. Funct. Anal. 17, 227 (1974).
Almgren and Lieb [1989] F. J. Almgren, Jr. and E. H. Lieb, J. Am. Math. Soc. 2, 683 (1989).
Baernstein et al. [2019] A. Baernstein, II, D. Drasin, and R. S. Laugesen, Symmetrization in Analysis (Cambridge University Press, Cambridge, UK, 2019).
Lynden-Bell [1967] D. Lynden-Bell, Mon. Not. R. Astron. Soc. 136, 101 (1967).
Ewart et al. [2022] R. J. Ewart, A. Brown, T. Adkins, and A. A. Schekochihin, J. Plasma Phys. 88, 925880501 (2022).
Ewart et al. [2023] R. J. Ewart, M. L. Nastac, and A. A. Schekochihin, J. Plasma Phys. 89, 905890516 (2023).
Volosov et al. [1969] V. I. Volosov, V. E. Pal’chikov, and F. A. Tsel’nik, Sov. Phys. – Dokl. 13, 691 (1969).
Thyagaraja and McClements [2009] A. Thyagaraja and K. G. McClements, Phys. Plasmas 16, 092506 (2009).
Turikov [1973] V. A. Turikov, Sov. Phys. – Tech. Phys. 18, 48 (1973).
Post [1987] R. F. Post, Nucl. Fusion 27, 1579 (1987).
Dory et al. [1965] R. A. Dory, G. E. Guest, and E. G. Harris, Phys. Rev. Lett. 14, 131 (1965).
Rosenbluth and Post [1965] M. N. Rosenbluth and R. F. Post, Phys. Fluids 8, 547 (1965).
Post and Rosenbluth [1966] R. F. Post and M. N. Rosenbluth, Phys. Fluids 9, 730 (1966).
Kotelnikov et al. [2017] I. A. Kotelnikov, I. S. Chernoshtanov, and V. V. Prikhodko, Phys. Plasmas 24, 122512 (2017).
Krall and Trivelpiece [1973] N. A. Krall and A. W. Trivelpiece, Principles of Plasma Physics (McGraw-Hill, New York, 1973).