Multiparameter Fuss–Catalan numbers with application to algebraic equations

S. R. Mane [email protected] Convergent Computing Inc., P. O. Box 561, Shoreham, NY 11786, USA
Abstract

We present an exposition on the Fuss–Catalan numbers, which are a generalization of the well known Catalan numbers. The literature on the subject is scattered (especially for the case of multiple independent parameters, as will be explained in the text), with overlap** definitions by different authors and duplication of proofs. This paper collects the main theorems and identities, with a consistent notation. Contact is made with the works of numerous authors, including the early works of Lambert and Euler. We demonstrate the application of the formalism to solve algebraic equations by infinite series. Our main result in this context is a new necessary and sufficient formula for the domain of absolute convergence of the series solutions of algebraic equations, which corrects and extends previous work in the field. Some historical material is placed in an Appendix.

keywords:
Fuss–Catalan numbers , generating functions , solutions of algebraic equations by infinite series , complete Reinhardt domain , domain of absolute convergence
MSC:
[2010] primary 05-02 , 32A05; secondary 30B10 , 32A07
journal: Expositiones Mathematicae

1 Introduction

We employ the standard notation \mathbb{C}blackboard_C for the complex numbers, \mathbb{R}blackboard_R for the reals and \mathbb{N}blackboard_N for the natural numbers {0,1,2,}012\{0,1,2,\dots\}{ 0 , 1 , 2 , … }. The Catalan numbers are defined, for t𝑡t\in\mathbb{N}italic_t ∈ blackboard_N, as

Ct=1t+1(2tt)=(2t)!(t+1)!t!.subscript𝐶𝑡1𝑡1binomial2𝑡𝑡2𝑡𝑡1𝑡C_{t}=\frac{1}{t+1}\binom{2t}{t}=\frac{(2t)!}{(t+1)!t!}\,.italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_t + 1 end_ARG ( FRACOP start_ARG 2 italic_t end_ARG start_ARG italic_t end_ARG ) = divide start_ARG ( 2 italic_t ) ! end_ARG start_ARG ( italic_t + 1 ) ! italic_t ! end_ARG . (1.1)

(It is more usual to write Cnsubscript𝐶𝑛C_{n}italic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT instead of Ctsubscript𝐶𝑡C_{t}italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, but there are too many other meanings for n𝑛nitalic_n later in this paper, so to avoid confusion I shall employ t𝑡titalic_t not n𝑛nitalic_n.) Catalan numbers have been claimed to be the most ubiquitions numbers in combinatorics, second only to the binomial coefficients themselves, e.g. see the text by Stanley [41]. It is also shown in [41] that Catalan numbers are the solutions to numerous counting problems. For example, Euler showed that Ctsubscript𝐶𝑡C_{t}italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT gives the number of triangulations of a convex (t+2)𝑡2(t+2)( italic_t + 2 )-gon. (See [41] for an extensive historical description, including quotes from the correspondence of Euler and other authors.) A generalization of the Catalan numbers, known as the Fuss–Catalan numbers, are the principal objects of interest in this paper. (They are named after Nicolas Fuss and Eugène Charles Catalan; see the text by Graham et al. [23] for a historical discussion.) First let m,t𝑚𝑡m,t\in\mathbb{N}italic_m , italic_t ∈ blackboard_N and define

At(m)=1(m1)t+1(mtt).subscript𝐴𝑡𝑚1𝑚1𝑡1binomial𝑚𝑡𝑡A_{t}(m)=\frac{1}{(m-1)t+1}\binom{mt}{t}\,.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_m ) = divide start_ARG 1 end_ARG start_ARG ( italic_m - 1 ) italic_t + 1 end_ARG ( FRACOP start_ARG italic_m italic_t end_ARG start_ARG italic_t end_ARG ) . (1.2)

The Catalan numbers are the special case where m=2𝑚2m=2italic_m = 2. Then At(m+1,1)subscript𝐴𝑡𝑚11A_{t}(m+1,1)italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_m + 1 , 1 ) counts the number of dissections of a convex (mt+2)𝑚𝑡2(mt+2)( italic_m italic_t + 2 )-gon into regions that are (m+2)𝑚2(m+2)( italic_m + 2 )-gons [41, exercise A14]. The term ‘dissection’ means the diagonals joining the vertices of the (mt+2)𝑚𝑡2(mt+2)( italic_m italic_t + 2 )-gon, to form the (m+2)𝑚2(m+2)( italic_m + 2 )-gon, do not intersect in their interiors. See [41] for details. However, our interest extends beyond combinatorics. We require a definition not restricted to integers. We define the Fuss–Catalan numbers, for μ,r𝜇𝑟\mu,r\in\mathbb{C}italic_μ , italic_r ∈ blackboard_C and t𝑡t\in\mathbb{N}italic_t ∈ blackboard_N, as A0(μ,r):=1assignsubscript𝐴0𝜇𝑟1A_{0}(\mu,r):=1italic_A start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_μ , italic_r ) := 1 and for t1𝑡1t\geq 1italic_t ≥ 1 via

At(μ,r):=rt!j=1t1(tμ+rj).assignsubscript𝐴𝑡𝜇𝑟𝑟𝑡superscriptsubscriptproduct𝑗1𝑡1𝑡𝜇𝑟𝑗A_{t}(\mu,r):=\frac{r}{t!}\prod_{j=1}^{t-1}(t\mu+r-j)\,.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) := divide start_ARG italic_r end_ARG start_ARG italic_t ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ( italic_t italic_μ + italic_r - italic_j ) . (1.3)

The above expression is well-defined for all μ,r𝜇𝑟\mu,r\in\mathbb{C}italic_μ , italic_r ∈ blackboard_C. We can employ the Gamma function to write

At(μ,r)=rΓ(tμ+r)Γ(t+1)Γ(t(μ1)+r+1).subscript𝐴𝑡𝜇𝑟𝑟Γ𝑡𝜇𝑟Γ𝑡1Γ𝑡𝜇1𝑟1A_{t}(\mu,r)=r\,\frac{\Gamma(t\mu+r)}{\Gamma(t+1)\Gamma(t(\mu-1)+r+1)}\,.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) = italic_r divide start_ARG roman_Γ ( italic_t italic_μ + italic_r ) end_ARG start_ARG roman_Γ ( italic_t + 1 ) roman_Γ ( italic_t ( italic_μ - 1 ) + italic_r + 1 ) end_ARG . (1.4)

However, this expression contains potential 0/0000/00 / 0 problems if the arguments of the Gamma functions equal zero or a negative integer. We shall employ eq. (1.3) in this paper. There are other equivalent definitions of the Fuss–Catalan numbers; for example the text by Graham et al. [23] employs generalized binomial coefficients. All of the applications in this paper will in fact treat only μ,r𝜇𝑟\mu,r\in\mathbb{R}italic_μ , italic_r ∈ blackboard_R. Note that eq. (1.2) is the special case μ=m𝜇𝑚\mu=mitalic_μ = italic_m and r=1𝑟1r=1italic_r = 1.

Concomitant with the Fuss–Catalan numbers is their generating function, and in fact we shall mostly work with the generating function below (here z𝑧z\in\mathbb{C}italic_z ∈ blackboard_C)

Bμ(r;z)=t=0At(μ,r)zt.subscript𝐵𝜇𝑟𝑧superscriptsubscript𝑡0subscript𝐴𝑡𝜇𝑟superscript𝑧𝑡B_{\mu}(r;z)=\sum_{t=0}^{\infty}A_{t}(\mu,r)z^{t}\,.italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r ; italic_z ) = ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) italic_z start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT . (1.5)

It is proved in [23] that Bμ(r;z)subscript𝐵𝜇𝑟𝑧B_{\mu}(r;z)italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r ; italic_z ) has the remarkable property Bμ(1;z)r=Bμ(r;z)subscript𝐵𝜇superscript1𝑧𝑟subscript𝐵𝜇𝑟𝑧B_{\mu}(1;z)^{r}=B_{\mu}(r;z)italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r ; italic_z ). Also, and very importantly, Bμ(1;z)subscript𝐵𝜇1𝑧B_{\mu}(1;z)italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) satisfies the following equation for f(z)𝑓𝑧f(z)italic_f ( italic_z ) (again, see [23])

f=1+zfμ.𝑓1𝑧superscript𝑓𝜇f=1+zf^{\mu}\,.italic_f = 1 + italic_z italic_f start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT . (1.6)

Variations of this equation were solved, using power series, by Lambert [28, 29] and Euler [16]. In both cases, their solutions are now known to be Fuss–Catalan series; this will be shown below. (I shall use the term ‘Fuss–Catalan series’ as a shorthand for ‘power series whose coefficients are Fuss–Catalan numbers.’) It is then very natural to extend eq. (1.6) to functions of multiple k>1𝑘1k>1italic_k > 1 complex variables

f=1+z1fμ1++zkfμk.𝑓1subscript𝑧1superscript𝑓subscript𝜇1subscript𝑧𝑘superscript𝑓subscript𝜇𝑘f=1+z_{1}f^{\mu_{1}}+\cdots+z_{k}f^{\mu_{k}}\,.italic_f = 1 + italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + ⋯ + italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (1.7)

Here z1,,zksubscript𝑧1subscript𝑧𝑘z_{1},\dots,z_{k}\in\mathbb{C}italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_C and also μ1,,μksubscript𝜇1subscript𝜇𝑘\mu_{1},\dots,\mu_{k}\in\mathbb{C}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_C. Analogous to eq. (1.6), the solution of eq. (1.7) is also given by a generating function, a multinomial power series in z1,,zksubscript𝑧1subscript𝑧𝑘z_{1},\dots,z_{k}italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, where the series coefficients are ‘multiparameter Fuss–Catalan numbers.’

This brings us to the heart of this paper. The multiparameter Fuss–Catalan numbers will be defined below. However, it turns out that the literature on the multiparameter Fuss–Catalan numbers is scattered. As can be seen from above, there are two broad threads, i.e. combinatorics and the theory of several complex variables. Different authors have published overlap** (not always equivalent) definitions, with duplication of theorems and proofs. It is the purpose of this paper to collect together the literature on the multiparameter Fuss–Catalan numbers, with a consistent notation and references to the various theorems and proofs by diverse authors. In particular, consider the general algebraic equation of degree n𝑛nitalic_n, with x𝑥x\in\mathbb{C}italic_x ∈ blackboard_C and complex coefficients a0,,ansubscript𝑎0subscript𝑎𝑛a_{0},\dots,a_{n}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT

a0+a1x++anxn=0.subscript𝑎0subscript𝑎1𝑥subscript𝑎𝑛superscript𝑥𝑛0a_{0}+a_{1}x+\cdots+a_{n}x^{n}=0\,.italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + ⋯ + italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = 0 . (1.8)

It is known that eq. (1.8) can be solved by expressing x𝑥xitalic_x in a multivariate series (more accurately, a Laurent–Puiseux series) in the coefficients a0,,ansubscript𝑎0subscript𝑎𝑛a_{0},\dots,a_{n}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. This can be accomplished by an application of the Lagrange Inversion Theorem; indeed Lagrange himself did so as a demonstration of his theorem, for the special case of the trinomial [27]. Clearly, eq. (1.8) can be cast in the form of eq. (1.7), and the solution is a (multiparameter) Fuss–Catalan series. This highlights two broad themes in this paper: the literature on combinatorics treats integer-valued parameters, whereas that on complex variables treats algebraic equations (polynomials), but both are subsumed into a general framework of Fuss–Catalan series. Note that the exponents μ1,,μksubscript𝜇1subscript𝜇𝑘\mu_{1},\dots,\mu_{k}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in eq. (1.7) are arbitrary real (or in principle complex) numbers, and are not restricted to be integers (or rational numbers). Some authors, such as Mohanty [34], recognize this fact, but most do not. Mohanty’s work will be important below. Contact will also be made with the works of numerous other authors such as Euler, Lagrange and Lambert (mentioned above), Klein (solution of the quintic), Gould, Mellin and Raney, to name a few. Mellin [33] employed his eponymous transform to solve eq. (1.8); it will be shown below that his solution is a Fuss–Catalan series. Ramanujan [5] also published briefly on the subject, the equation and his series solution will be cited below; it is of course a Fuss–Catalan series. Significantly, Ramanujan derived a bound for the radius of convergence of his series (many other authors did not). We shall derive two bounds for the absolute convergence of the series solution of eq. (1.7). The first is necessary but in general not sufficient and the second is sufficient but in general not necessary. A necessary and sufficient bound for absolute convergence is not known at this time. However, for the special case of an algebraic equation, we shall present a new necessary and sufficient bound for the absolute convergence of the series solution of eq. (1.8) in Sec. 8. The new bound is based on earlier work by Passare and Tsikh [36]. Some counterexamples to their results will be displayed below; this indicates the need for a more careful treatment of the problem.

On a more personal level, in a recent paper [13], Dilworth and this author derived the analytical solution for the probability mass function of the geometric distribution of order k𝑘kitalic_k [18]. The roots of the associated recurrence relation were obtained as series in Fuss–Catalan numbers. It was recognized that Fuss–Catalan series are a potentially powerful tool to solve related problems, and in a follow-up paper [14], they were applied to solve additional problems for success runs of Bernoulli trials. The title of [14] was deliberately worded “Applications of Fuss–Catalan Numbers to Success Runs of Bernoulli Trials.” This paper will not treat problems of probability and statistics, but it is this author’s personal belief that (multiparameter) Fuss–Catalan series offer great promise to solve problems in numerous subfields of mathematics. This motivates the desire to collect the literature on the subject in one place, with a consistent notation and to assemble together the various duplicated theorems and proofs.

The structure of this paper is as follows. The basic definitions of Fuss–Catalan numbers, their generating functions and relevant theorems are presented in Sec. 2. Bounds for the absolute convergence of Fuss–Catalan series are derived in Sec. 3. The application to algebraic equations is presented in Sec. 4. The quintic is sufficiently important that it is placed in a separate section in Sec. 5. The trinomial equation is also sufficiently important that it is placed in a separate section in Sec. 6. The domain of absolute convergence for the solutions of algebraic equations by infinite series is discussed in Sec. 7, where it is shown that a new, more careful treatment is required, and a new necessary and sufficient bound is presented in Sec. 8. A sample nontrivial application of the new bound is presented in Sec. 9, for the principal and Brioschi quintics. Some material, including historical material, is relegated to A. In B, contact is made with the work of Sturmfels [44] on the solutions of algebraic equations via so-called 𝒜𝒜\mathscr{A}script_A-hypergeometric series.

A few disclaimers and words of caution follow. First, it is important to note that there are complex roots in many of the series, hence branch cuts are required to obtain well-defined expressions. Overall, this detail is not clearly (or explicitly) addressed in the literature, but it is important. The claimed series ‘solution’ of eq. (1.8) may be erroneous (or meaningless) if an appropriate branch cut is not specified. The series may converge, but not to the root of the original equation. The subject of branch cuts will be discussed below.

Next, no claim is made here that the use of a series to solve eq. (1.7) is a computationally efficient algorithm, nor that a series solution of the algebraic equation eq. (1.8) converges rapidly to a root of the polynomial. McClintock did make such a claim [32], but in 1895 modern digital computers and the concomitant numerical algorithms did not exist. Indeed, a power series will not converge rapidly close to its circle of convergence. Nevertheless, an analytical expression can indicate properties of a function not evident from a purely numerical solution. For example, no alternative analytical expression is known, at the present time, for the probability mass function of the geometric distribution of order k𝑘kitalic_k [13].

Finally, this paper is not intended to be an encyclopedia. There is a vast literature on the solution of algebraic equations by infinite series, as well as on combinatorics using Catalan and Fuss–Catalan numbers. Any omissions are inadvertent and not deliberate. For example, the text by Appell and Kampé de Fériet [1] derives solutions of algebraic equations using generalized hypergeometric functions. The general sextic equation can be solved using Kampé de Fériet functions. It is beyond the scope of this paper to discuss such functions. The paper by Kamber [25] contains interesting material on the coefficients of certain inverse power series, but is also beyond the scope of this paper.

2 Basic definitions and theorems

For ease of reference, some of the equations displayed in the introduction will be repeated below. The Fuss–Catalan numbers are defined, for μ,r𝜇𝑟\mu,r\in\mathbb{C}italic_μ , italic_r ∈ blackboard_C and t𝑡t\in\mathbb{N}italic_t ∈ blackboard_N, as A0(μ,r):=1assignsubscript𝐴0𝜇𝑟1A_{0}(\mu,r):=1italic_A start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_μ , italic_r ) := 1 and for t1𝑡1t\geq 1italic_t ≥ 1 via

At(μ,r):=rt!j=1t1(tμ+rj).assignsubscript𝐴𝑡𝜇𝑟𝑟𝑡superscriptsubscriptproduct𝑗1𝑡1𝑡𝜇𝑟𝑗A_{t}(\mu,r):=\frac{r}{t!}\prod_{j=1}^{t-1}(t\mu+r-j)\,.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) := divide start_ARG italic_r end_ARG start_ARG italic_t ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ( italic_t italic_μ + italic_r - italic_j ) . (2.1)

As stated in the introduction, all of the applications in this paper will treat μ,r𝜇𝑟\mu,r\in\mathbb{R}italic_μ , italic_r ∈ blackboard_R. The above numbers are also known as Raney numbers, at least when μ𝜇\muitalic_μ and r𝑟ritalic_r are nonnegative integers, in which case At(μ,r)subscript𝐴𝑡𝜇𝑟A_{t}(\mu,r)italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) is itself a nonnegative integer. Raney’s work [37] will be cited below. The generating function of the Fuss–Catalan numbers is (where z𝑧z\in\mathbb{C}italic_z ∈ blackboard_C)

Bμ(r;z)=t=0At(μ,r)zt.subscript𝐵𝜇𝑟𝑧superscriptsubscript𝑡0subscript𝐴𝑡𝜇𝑟superscript𝑧𝑡B_{\mu}(r;z)=\sum_{t=0}^{\infty}A_{t}(\mu,r)z^{t}\,.italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r ; italic_z ) = ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) italic_z start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT . (2.2)

The following results are known:

Theorem 2.1.
  1. 1.

    The generating function Bμ(1;z)subscript𝐵𝜇1𝑧B_{\mu}(1;z)italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) satisfies the following equation for f(z)𝑓𝑧f(z)italic_f ( italic_z )

    f=1+zfμ.𝑓1𝑧superscript𝑓𝜇f=1+zf^{\mu}\,.italic_f = 1 + italic_z italic_f start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT . (2.3)
  2. 2.

    The generating function Bμ(r;z)subscript𝐵𝜇𝑟𝑧B_{\mu}(r;z)italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r ; italic_z ) also has the property

    Bμ(1;z)r=Bμ(r;z).subscript𝐵𝜇superscript1𝑧𝑟subscript𝐵𝜇𝑟𝑧B_{\mu}(1;z)^{r}=B_{\mu}(r;z)\,.italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r ; italic_z ) . (2.4)

    Let s𝑠s\in\mathbb{C}italic_s ∈ blackboard_C and using Bμ(1;z)r+s=Bμ(1;z)rBμ(1;z)ssubscript𝐵𝜇superscript1𝑧𝑟𝑠subscript𝐵𝜇superscript1𝑧𝑟subscript𝐵𝜇superscript1𝑧𝑠B_{\mu}(1;z)^{r+s}=B_{\mu}(1;z)^{r}B_{\mu}(1;z)^{s}italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r + italic_s end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT, eq. (2.4) is equivalent to the statement

    Bμ(r+s;z)=Bμ(r;z)Bμ(s;z).subscript𝐵𝜇𝑟𝑠𝑧subscript𝐵𝜇𝑟𝑧subscript𝐵𝜇𝑠𝑧B_{\mu}(r+s;z)=B_{\mu}(r;z)B_{\mu}(s;z)\,.italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r + italic_s ; italic_z ) = italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r ; italic_z ) italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_s ; italic_z ) . (2.5)
  3. 3.

    The Fuss–Catalan numbers satisfy the following convolution identity

    At(μ,s+r)=u=0tAu(μ,r)Atu(μ,s).subscript𝐴𝑡𝜇𝑠𝑟superscriptsubscript𝑢0𝑡subscript𝐴𝑢𝜇𝑟subscript𝐴𝑡𝑢𝜇𝑠A_{t}(\mu,s+r)=\sum_{u=0}^{t}A_{u}(\mu,r)A_{t-u}(\mu,s)\,.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_s + italic_r ) = ∑ start_POSTSUBSCRIPT italic_u = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_μ , italic_r ) italic_A start_POSTSUBSCRIPT italic_t - italic_u end_POSTSUBSCRIPT ( italic_μ , italic_s ) . (2.6)
  4. 4.

    The Fuss–Catalan numbers satisfy the recurrence relation (there are other equivalent ways to express the recurrence)

    At(μ,r+1)=At(μ,r)+At1(μ,r+μ).subscript𝐴𝑡𝜇𝑟1subscript𝐴𝑡𝜇𝑟subscript𝐴𝑡1𝜇𝑟𝜇A_{t}(\mu,r+1)=A_{t}(\mu,r)+A_{t-1}(\mu,r+\mu)\,.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r + 1 ) = italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) + italic_A start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_μ , italic_r + italic_μ ) . (2.7)

Note that Theorems 2.1(b) and (c) are equivalent. Write out eq. (2.5) in full, then

t=0At(μ,r+s)zt=(t=0At(μ,r)zt)(t′′=0At′′(μ,s)zt′′).superscriptsubscript𝑡0subscript𝐴𝑡𝜇𝑟𝑠superscript𝑧𝑡superscriptsubscriptsuperscript𝑡0subscript𝐴superscript𝑡𝜇𝑟superscript𝑧superscript𝑡superscriptsubscriptsuperscript𝑡′′0subscript𝐴superscript𝑡′′𝜇𝑠superscript𝑧superscript𝑡′′\sum_{t=0}^{\infty}A_{t}(\mu,r+s)z^{t}=\biggl{(}\sum_{t^{\prime}=0}^{\infty}A_% {t^{\prime}}(\mu,r)z^{t^{\prime}}\biggr{)}\biggl{(}\sum_{t^{\prime\prime}=0}^{% \infty}A_{t^{\prime\prime}}(\mu,s)z^{t^{\prime\prime}}\biggr{)}\,.∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r + italic_s ) italic_z start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = ( ∑ start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_μ , italic_r ) italic_z start_POSTSUPERSCRIPT italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ) ( ∑ start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_μ , italic_s ) italic_z start_POSTSUPERSCRIPT italic_t start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ) . (2.8)

Selecting a particular value of t𝑡titalic_t on the left hand side and equating terms, we obtain a sum of terms t+t′′=tsuperscript𝑡superscript𝑡′′𝑡t^{\prime}+t^{\prime\prime}=titalic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_t start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT = italic_t on the right-hand side and eq. (2.6) follows. Reversing the steps proves the converse. Also, using eqs. (2.2) and (2.4), eq. (2.7) can easily be employed to show that

Bμ(1;z)r+1=Bμ(1;z)r+zBμ(1;z)r+μ.subscript𝐵𝜇superscript1𝑧𝑟1subscript𝐵𝜇superscript1𝑧𝑟𝑧subscript𝐵𝜇superscript1𝑧𝑟𝜇B_{\mu}(1;z)^{r+1}=B_{\mu}(1;z)^{r}+zB_{\mu}(1;z)^{r+\mu}\,.italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT + italic_z italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r + italic_μ end_POSTSUPERSCRIPT . (2.9)

This is simply eq. (2.3) multiplied through by Bμ(1;z)rsubscript𝐵𝜇superscript1𝑧𝑟B_{\mu}(1;z)^{r}italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( 1 ; italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT.

To generalize to k1𝑘1k\geq 1italic_k ≥ 1 multiple parameters, we employ a vector notaton and introduce the k𝑘kitalic_k-tuples 𝒕=(t1,,tk)k𝒕subscript𝑡1subscript𝑡𝑘superscript𝑘\bm{t}=(t_{1},\dots,t_{k})\in\mathbb{N}^{k}bold_italic_t = ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∈ blackboard_N start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, 𝝁=(μ1,,μk)k𝝁subscript𝜇1subscript𝜇𝑘superscript𝑘\bm{\mu}=(\mu_{1},\dots,\mu_{k})\in\mathbb{C}^{k}bold_italic_μ = ( italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT and 𝒛=(z1,,zk)k𝒛subscript𝑧1subscript𝑧𝑘superscript𝑘\bm{z}=(z_{1},\dots,z_{k})\in\mathbb{C}^{k}bold_italic_z = ( italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT (and recall r,s𝑟𝑠r,s\in\mathbb{C}italic_r , italic_s ∈ blackboard_C). Also define |𝒕|=|t1|++|tk|𝒕subscript𝑡1subscript𝑡𝑘|\bm{t}|=|t_{1}|+\cdots+|t_{k}|| bold_italic_t | = | italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | + ⋯ + | italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT |. For brevity we shall frequently write t=|𝒕|𝑡𝒕t=|\bm{t}|italic_t = | bold_italic_t | below. Also, for |𝒕|>0𝒕0|\bm{t}|>0| bold_italic_t | > 0, define the ‘unit vector’ 𝒕^=𝒕/|𝒕|^𝒕𝒕𝒕\hat{\bm{t}}=\bm{t}/|\bm{t}|over^ start_ARG bold_italic_t end_ARG = bold_italic_t / | bold_italic_t |. We also define the zero vector 𝟎=(0,,0)000\bm{0}=(0,\dots,0)bold_0 = ( 0 , … , 0 ) and the ‘basis vectors’ 𝒆j=(0,,0,1,0,,0)=(δ1j,,δkj)subscript𝒆𝑗00100subscript𝛿1𝑗subscript𝛿𝑘𝑗\bm{e}_{j}=(0,\dots,0,1,0,\dots,0)=(\delta_{1j},\dots,\delta_{kj})bold_italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( 0 , … , 0 , 1 , 0 , … , 0 ) = ( italic_δ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , … , italic_δ start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT ).

Definition 2.2 (multiparameter Fuss–Catalan numbers).

We define the multiparameter Fuss–Catalan numbers 𝒜𝐭(𝛍,r)subscript𝒜𝐭𝛍𝑟\mathscr{A}_{\bm{t}}(\bm{\mu},r)script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) via 𝒜𝟎(𝛍,r):=1assignsubscript𝒜0𝛍𝑟1\mathscr{A}_{\bm{0}}(\bm{\mu},r):=1script_A start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) := 1 for 𝐭=𝟎𝐭0\bm{t}=\bm{0}bold_italic_t = bold_0 and for |𝐭|>0𝐭0|\bm{t}|>0| bold_italic_t | > 0 via

𝒜𝒕(𝝁,r):=rt1!tk!j=1|𝒕|1(𝒕𝝁+rj).assignsubscript𝒜𝒕𝝁𝑟𝑟subscript𝑡1subscript𝑡𝑘superscriptsubscriptproduct𝑗1𝒕1𝒕𝝁𝑟𝑗\mathscr{A}_{\bm{t}}(\bm{\mu},r):=\frac{r}{t_{1}!\cdots t_{k}!}\prod_{j=1}^{|% \bm{t}|-1}(\bm{t}\cdot\bm{\mu}+r-j)\,.script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) := divide start_ARG italic_r end_ARG start_ARG italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! ⋯ italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | bold_italic_t | - 1 end_POSTSUPERSCRIPT ( bold_italic_t ⋅ bold_italic_μ + italic_r - italic_j ) . (2.10)

If k=1𝑘1k=1italic_k = 1 this reduces to the single-parameter definition eq. (2.1). Equivalently, for all |𝐭|0𝐭0|\bm{t}|\geq 0| bold_italic_t | ≥ 0,

𝒜𝒕(𝝁,r)=(tt1,,tk)At(𝒕^𝝁,r).subscript𝒜𝒕𝝁𝑟binomial𝑡subscript𝑡1subscript𝑡𝑘subscript𝐴𝑡^𝒕𝝁𝑟\mathscr{A}_{\bm{t}}(\bm{\mu},r)=\binom{t}{t_{1},\dots,t_{k}}\,A_{t}(\hat{\bm{% t}}\cdot\bm{\mu},r)\,.script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) = ( FRACOP start_ARG italic_t end_ARG start_ARG italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ , italic_r ) . (2.11)

Note that a 0/0000/00 / 0 indeterminate expression for 𝐭^^𝐭\hat{\bm{t}}over^ start_ARG bold_italic_t end_ARG does not arise in eq. (2.11) because of the definition A0():=1assignsubscript𝐴01A_{0}(\cdot):=1italic_A start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ⋅ ) := 1.

Definition 2.3 (multiparameter generating function).

The multiparameter Fuss–Catalan generating function is defined as

(𝝁;r;𝒛):=𝒕k𝒜𝒕(𝝁,r)z1t1zktk.assign𝝁𝑟𝒛subscript𝒕superscript𝑘subscript𝒜𝒕𝝁𝑟superscriptsubscript𝑧1subscript𝑡1superscriptsubscript𝑧𝑘subscript𝑡𝑘\mathcal{B}(\bm{\mu};r;\bm{z}):=\sum_{\bm{t}\in\mathbb{N}^{k}}\mathscr{A}_{\bm% {t}}(\bm{\mu},r)\,z_{1}^{t_{1}}\cdots z_{k}^{t_{k}}\,.caligraphic_B ( bold_italic_μ ; italic_r ; bold_italic_z ) := ∑ start_POSTSUBSCRIPT bold_italic_t ∈ blackboard_N start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_POSTSUBSCRIPT script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋯ italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (2.12)

Technically, the above expression is not well-defined because the answer can depend on the order of summation. In all the applications in this paper, we collect the terms in level sets in t=|𝐭|𝑡𝐭t=|\bm{t}|italic_t = | bold_italic_t |

(𝝁;r;𝒛)=t=0t1++tk=t(tt1,,tk)At(𝒕^𝝁,r)z1t1zktk.𝝁𝑟𝒛superscriptsubscript𝑡0subscriptsubscript𝑡1subscript𝑡𝑘𝑡binomial𝑡subscript𝑡1subscript𝑡𝑘subscript𝐴𝑡^𝒕𝝁𝑟superscriptsubscript𝑧1subscript𝑡1superscriptsubscript𝑧𝑘subscript𝑡𝑘\mathcal{B}(\bm{\mu};r;\bm{z})=\sum_{t=0}^{\infty}\sum_{t_{1}+\cdots+t_{k}=t}% \binom{t}{t_{1},\dots,t_{k}}A_{t}(\hat{\bm{t}}\cdot\bm{\mu},r)\,z_{1}^{t_{1}}% \cdots z_{k}^{t_{k}}\,.caligraphic_B ( bold_italic_μ ; italic_r ; bold_italic_z ) = ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_t end_POSTSUBSCRIPT ( FRACOP start_ARG italic_t end_ARG start_ARG italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ , italic_r ) italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋯ italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (2.13)

However, to obtain rigorous results, we must specify a domain of absolute convergence. Then the answer will not depend on the order of summation. The topic of absolute convergence will be discussed below.

Theorem 2.4.
  1. 1.

    The generating function (𝝁;1;𝒛)𝝁1𝒛\mathcal{B}(\bm{\mu};1;\bm{z})caligraphic_B ( bold_italic_μ ; 1 ; bold_italic_z ) satisfies the following equation for f(𝒛)𝑓𝒛f(\bm{z})italic_f ( bold_italic_z )

    f=1+z1fμ1++zkfμk.𝑓1subscript𝑧1superscript𝑓subscript𝜇1subscript𝑧𝑘superscript𝑓subscript𝜇𝑘f=1+z_{1}f^{\mu_{1}}+\cdots+z_{k}f^{\mu_{k}}\,.italic_f = 1 + italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + ⋯ + italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (2.14)
  2. 2.

    Analogous to eq. (2.4), the generating function (𝝁;r;𝒛)𝝁𝑟𝒛\mathcal{B}(\bm{\mu};r;\bm{z})caligraphic_B ( bold_italic_μ ; italic_r ; bold_italic_z ) has the property

    (𝝁;1;𝒛)r=(𝝁;r;𝒛).superscript𝝁1𝒛𝑟𝝁𝑟𝒛\mathcal{B}(\bm{\mu};1;\bm{z})^{r}=\mathcal{B}(\bm{\mu};r;\bm{z})\,.caligraphic_B ( bold_italic_μ ; 1 ; bold_italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT = caligraphic_B ( bold_italic_μ ; italic_r ; bold_italic_z ) . (2.15)

    Analogous to eq. (2.5), it follows that

    (𝝁;r+s;𝒛)=(𝝁;r;𝒛)(𝝁;s;𝒛).𝝁𝑟𝑠𝒛𝝁𝑟𝒛𝝁𝑠𝒛\mathcal{B}(\bm{\mu};r+s;\bm{z})=\mathcal{B}(\bm{\mu};r;\bm{z})\mathcal{B}(\bm% {\mu};s;\bm{z})\,.caligraphic_B ( bold_italic_μ ; italic_r + italic_s ; bold_italic_z ) = caligraphic_B ( bold_italic_μ ; italic_r ; bold_italic_z ) caligraphic_B ( bold_italic_μ ; italic_s ; bold_italic_z ) . (2.16)
  3. 3.

    The multiparameter convolution identity analogous to eq. (2.6) is (the allowed values of 𝒖𝒖\bm{u}bold_italic_u are obvious)

    𝒜𝒕(𝝁,r+s)=𝒖k𝒜𝒖(𝝁,r)𝒜𝒕𝒖(𝝁,s).subscript𝒜𝒕𝝁𝑟𝑠subscript𝒖superscript𝑘subscript𝒜𝒖𝝁𝑟subscript𝒜𝒕𝒖𝝁𝑠\mathscr{A}_{\bm{t}}(\bm{\mu},r+s)=\sum_{\bm{u}\in\mathbb{N}^{k}}\mathscr{A}_{% \bm{u}}(\bm{\mu},r)\mathscr{A}_{\bm{t}-\bm{u}}(\bm{\mu},s)\,.script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r + italic_s ) = ∑ start_POSTSUBSCRIPT bold_italic_u ∈ blackboard_N start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_POSTSUBSCRIPT script_A start_POSTSUBSCRIPT bold_italic_u end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) script_A start_POSTSUBSCRIPT bold_italic_t - bold_italic_u end_POSTSUBSCRIPT ( bold_italic_μ , italic_s ) . (2.17)
  4. 4.

    Analogous to eq. (2.7), the multiparameter recurrence is (again, there are other equivalent ways to express the recurrence)

    A𝒕(𝝁,r+1)=A𝒕(𝝁,r)+j=1kA𝒕𝒆j(𝝁,r+μj).subscript𝐴𝒕𝝁𝑟1subscript𝐴𝒕𝝁𝑟superscriptsubscript𝑗1𝑘subscript𝐴𝒕subscript𝒆𝑗𝝁𝑟subscript𝜇𝑗A_{\bm{t}}(\bm{\mu},r+1)=A_{\bm{t}}(\bm{\mu},r)+\sum_{j=1}^{k}A_{\bm{t}-\bm{e}% _{j}}(\bm{\mu},r+\mu_{j})\,.italic_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r + 1 ) = italic_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT bold_italic_t - bold_italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_μ , italic_r + italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) . (2.18)

Theorems 2.4(b) and (c) are equivalent; the proof follows the same steps as for the case k=1𝑘1k=1italic_k = 1. Also, using eqs. (2.12) (or (2.13)) and (2.15), eq. (2.18) yields

(𝝁;1;𝒛)r+1=(𝝁;1;𝒛)r+j=1kzj(𝝁;1;𝒛)r+μj.superscript𝝁1𝒛𝑟1superscript𝝁1𝒛𝑟superscriptsubscript𝑗1𝑘subscript𝑧𝑗superscript𝝁1𝒛𝑟subscript𝜇𝑗\mathcal{B}(\bm{\mu};1;\bm{z})^{r+1}=\mathcal{B}(\bm{\mu};1;\bm{z})^{r}+\sum_{% j=1}^{k}z_{j}\mathcal{B}(\bm{\mu};1;\bm{z})^{r+\mu_{j}}\,.caligraphic_B ( bold_italic_μ ; 1 ; bold_italic_z ) start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT = caligraphic_B ( bold_italic_μ ; 1 ; bold_italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT caligraphic_B ( bold_italic_μ ; 1 ; bold_italic_z ) start_POSTSUPERSCRIPT italic_r + italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (2.19)

This is eq. (2.14) multiplied through by (𝝁;1;𝒛)rsuperscript𝝁1𝒛𝑟\mathcal{B}(\bm{\mu};1;\bm{z})^{r}caligraphic_B ( bold_italic_μ ; 1 ; bold_italic_z ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT. A search of the literature revealed that all of the results in Theorem 2.4 have already been proved. Unfortunately, the proofs are scattered (and rediscovered) in the literature. Unlike the single-parameter case, where the relations are explicitly stated as properties of Fuss–Catalan numbers (see Theorem 2.1), for k>1𝑘1k>1italic_k > 1 there is a variety of notations and not all authors mention Fuss and Catalan. (This should not be misinterpreted as a criticism; see the comment at the beginning of A.) For the multiparameter case, the most comprehensive references I have found were by Raney [37], Chu [11] and Mohanty [34]. I summarize their works in turn. Raney [37] presented proofs of all the results in Theorem 2.4. Raney’s expression is as follows. Let a1,a2,subscript𝑎1subscript𝑎2a_{1},a_{2},\dotsitalic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … be an infinite sequence of natural numbers of which at most a finite number of terms are different from zero. Then define m=n+i=1iai𝑚𝑛superscriptsubscript𝑖1𝑖subscript𝑎𝑖m=n+\sum_{i=1}^{\infty}ia_{i}italic_m = italic_n + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_i italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and a0=n+i=1(i1)aisubscript𝑎0𝑛superscriptsubscript𝑖1𝑖1subscript𝑎𝑖a_{0}=n+\sum_{i=1}^{\infty}(i-1)a_{i}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_n + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_i - 1 ) italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Raney defined the multinomial coefficient

M(a0,a1,a2,)=(a0+a1+a2+)!a0!a1!a2!.𝑀subscript𝑎0subscript𝑎1subscript𝑎2subscript𝑎0subscript𝑎1subscript𝑎2subscript𝑎0subscript𝑎1subscript𝑎2M(a_{0},a_{1},a_{2},\dots)=\frac{(a_{0}+a_{1}+a_{2}+\cdots)!}{a_{0}!a_{1}!a_{2% }!\cdots}\,.italic_M ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … ) = divide start_ARG ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ⋯ ) ! end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ! italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ! ⋯ end_ARG . (2.20)

Then [37, Theorem 2.2] states

mL(n;a1,a2,)=nM(a0,a1,a2,).𝑚𝐿𝑛subscript𝑎1subscript𝑎2𝑛𝑀subscript𝑎0subscript𝑎1subscript𝑎2mL(n;a_{1},a_{2},\dots)=nM(a_{0},a_{1},a_{2},\dots)\,.italic_m italic_L ( italic_n ; italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … ) = italic_n italic_M ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … ) . (2.21)

Let us reexpress this in our notation. We know only finitely many of the aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are nonzero. Suppose there are k𝑘kitalic_k nonzero aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are they are indexed by the set (μ1,,μk)subscript𝜇1subscript𝜇𝑘(\mu_{1},\dots,\mu_{k})( italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). Also define tj=aμjsubscript𝑡𝑗subscript𝑎subscript𝜇𝑗t_{j}=a_{\mu_{j}}italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT and replace n𝑛nitalic_n by r𝑟ritalic_r, then m=r+𝒕𝝁𝑚𝑟𝒕𝝁m=r+\bm{t}\cdot\bm{\mu}italic_m = italic_r + bold_italic_t ⋅ bold_italic_μ and a0=r+𝒕𝝁|𝒕|=m|𝒕|subscript𝑎0𝑟𝒕𝝁𝒕𝑚𝒕a_{0}=r+\bm{t}\cdot\bm{\mu}-|\bm{t}|=m-|\bm{t}|italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_r + bold_italic_t ⋅ bold_italic_μ - | bold_italic_t | = italic_m - | bold_italic_t |. Then

L(n;a1,a2,)=nm(a0+a1+a2+)!a0!a1!a2!=rr+𝒕𝝁(r+𝒕𝝁)!(r+𝒕𝝁|𝒕|)!t1!tk!=rt1!tk!j=1|𝒕|1(𝒕𝝁+rj)=𝒜𝒕(𝝁,r).𝐿𝑛subscript𝑎1subscript𝑎2𝑛𝑚subscript𝑎0subscript𝑎1subscript𝑎2subscript𝑎0subscript𝑎1subscript𝑎2𝑟𝑟𝒕𝝁𝑟𝒕𝝁𝑟𝒕𝝁𝒕subscript𝑡1subscript𝑡𝑘𝑟subscript𝑡1subscript𝑡𝑘superscriptsubscriptproduct𝑗1𝒕1𝒕𝝁𝑟𝑗subscript𝒜𝒕𝝁𝑟\begin{split}L(n;a_{1},a_{2},\dots)&=\frac{n}{m}\,\frac{(a_{0}+a_{1}+a_{2}+% \cdots)!}{a_{0}!a_{1}!a_{2}!\cdots}\\ &=\frac{r}{r+\bm{t}\cdot\bm{\mu}}\,\frac{(r+\bm{t}\cdot\bm{\mu})!}{(r+\bm{t}% \cdot\bm{\mu}-|\bm{t}|)!t_{1}!\cdots t_{k}!}\\ &=\frac{r}{t_{1}!\cdots t_{k}!}\,\prod_{j=1}^{|\bm{t}|-1}(\bm{t}\cdot\bm{\mu}+% r-j)\\ &=\mathscr{A}_{\bm{t}}(\bm{\mu},r)\,.\end{split}start_ROW start_CELL italic_L ( italic_n ; italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … ) end_CELL start_CELL = divide start_ARG italic_n end_ARG start_ARG italic_m end_ARG divide start_ARG ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ⋯ ) ! end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ! italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ! ⋯ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG italic_r end_ARG start_ARG italic_r + bold_italic_t ⋅ bold_italic_μ end_ARG divide start_ARG ( italic_r + bold_italic_t ⋅ bold_italic_μ ) ! end_ARG start_ARG ( italic_r + bold_italic_t ⋅ bold_italic_μ - | bold_italic_t | ) ! italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! ⋯ italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ! end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG italic_r end_ARG start_ARG italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! ⋯ italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | bold_italic_t | - 1 end_POSTSUPERSCRIPT ( bold_italic_t ⋅ bold_italic_μ + italic_r - italic_j ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , italic_r ) . end_CELL end_ROW (2.22)

Notice that Raney’s expression for L𝐿Litalic_L is a solution, not a definition. Raney posed and solved many combinatorial problems in [37]. Then [37, Theorems 2.3, 2,4, 4.1] yield respectively eqs. (2.17), (2.18) and (2.15). Also [37, eqs. (6.1) and (6.2)] yield eq. (2.14). Note that Raney [37] took the μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT (in my notation) to be integers; this is common also in the derivations by other authors (see below). However, it is straightforward to generalize from integer to complex-valued parameters. The relevant steps are given by Graham et al. [23] (for the single-parameter case k=1𝑘1k=1italic_k = 1, but the same reasoning works also for multiple parameters k>1𝑘1k>1italic_k > 1). Chu [11] also published a proof of the solution of eq. (2.14) (citing Raney [37]). Chu remarked that eq. (2.14) can also be derived using the multi-variable version of the Lagrange inversion formula [19]. (Numerous authors have stated that eq. (2.14) can be derived using Lagrange inversion. Raney gave an example of Lagrange inversion in [37].) Chu treated only integer-valued parameters. Chu defined ‘higher Catalan numbers’ and ‘generalized Catalan numbers’ as follows. Chu employed vectors v𝑣\vec{v}over→ start_ARG italic_v end_ARG and 𝒏𝒏\bm{n}bold_italic_n, which are k𝑘kitalic_k-tuples of integers. The ‘higher Catalan numbers’ are [11, eq. (1)]

Ck(n)=1nk+1(nk+1n).subscript𝐶𝑘𝑛1𝑛𝑘1binomial𝑛𝑘1𝑛C_{k}(n)=\frac{1}{nk+1}\binom{nk+1}{n}\,.italic_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_n ) = divide start_ARG 1 end_ARG start_ARG italic_n italic_k + 1 end_ARG ( FRACOP start_ARG italic_n italic_k + 1 end_ARG start_ARG italic_n end_ARG ) . (2.23)

This is equivalent to An(k,1)subscript𝐴𝑛𝑘1A_{n}(k,1)italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_k , 1 ) in eq. (2.1) The ‘generalized Catalan numbers’ are [11, eq. (2)]

Cv(n)=1i=1kniv1+1(i=1kniv1+1n1,n2,,nk,1+i=1kni(v11)).subscript𝐶𝑣𝑛1superscriptsubscript𝑖1𝑘subscript𝑛𝑖subscript𝑣11binomialsuperscriptsubscript𝑖1𝑘subscript𝑛𝑖subscript𝑣11subscript𝑛1subscript𝑛2subscript𝑛𝑘1superscriptsubscript𝑖1𝑘subscript𝑛𝑖subscript𝑣11C_{\vec{v}}(\vec{n})=\frac{1}{\sum_{i=1}^{k}n_{i}v_{1}+1}\binom{\sum_{i=1}^{k}% n_{i}v_{1}+1}{n_{1},n_{2},\dots,n_{k},1+\sum_{i=1}^{k}n_{i}(v_{1}-1)}\,.italic_C start_POSTSUBSCRIPT over→ start_ARG italic_v end_ARG end_POSTSUBSCRIPT ( over→ start_ARG italic_n end_ARG ) = divide start_ARG 1 end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_ARG ( FRACOP start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_ARG start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , 1 + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - 1 ) end_ARG ) . (2.24)

This is equivalent to 𝒜𝒏(𝒗,1)subscript𝒜𝒏𝒗1\mathscr{A}_{\bm{n}}(\bm{v},1)script_A start_POSTSUBSCRIPT bold_italic_n end_POSTSUBSCRIPT ( bold_italic_v , 1 ) in eq. (2.10). Beware of the slightly inconsistent use of k𝑘kitalic_k by Chu, as quoted in eqs. (2.23) and (2.24). Curiously, Chu [11] restricted his definitions only to r=1𝑟1r=1italic_r = 1, even though unnamed expressions with r>1𝑟1r>1italic_r > 1 appear in his paper (Chu wrote t𝑡titalic_t for what I call r𝑟ritalic_r). Mohanty [34] explicitly treated complex-valued parameters. He defined the multinomial coefficient as follows [34, eq. (3)]

(xj1,,jk)=x(x1)(xj=1kj+1)i=1kji!.binomial𝑥subscript𝑗1subscript𝑗𝑘𝑥𝑥1𝑥superscriptsubscript𝑗1𝑘𝑗1superscriptsubscriptproduct𝑖1𝑘subscript𝑗𝑖\binom{x}{j_{1},\dots,j_{k}}=\frac{x(x-1)\cdots(x-\sum_{j=1}^{k}j+1)}{\prod_{i% =1}^{k}j_{i}!}\,.( FRACOP start_ARG italic_x end_ARG start_ARG italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_j start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) = divide start_ARG italic_x ( italic_x - 1 ) ⋯ ( italic_x - ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_j + 1 ) end_ARG start_ARG ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_j start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ! end_ARG . (2.25)

Then Mohanty defined (without assigning a name) [34, eq. (4)]

A(a;b1,,bk;n1,,nk)=aa+i=1kbini(a+i=1kbinin1,,nk).𝐴𝑎subscript𝑏1subscript𝑏𝑘subscript𝑛1subscript𝑛𝑘𝑎𝑎superscriptsubscript𝑖1𝑘subscript𝑏𝑖subscript𝑛𝑖binomial𝑎superscriptsubscript𝑖1𝑘subscript𝑏𝑖subscript𝑛𝑖subscript𝑛1subscript𝑛𝑘A(a;b_{1},\dots,b_{k};n_{1},\dots,n_{k})=\frac{a}{a+\sum_{i=1}^{k}b_{i}n_{i}}% \binom{a+\sum_{i=1}^{k}b_{i}n_{i}}{n_{1},\dots,n_{k}}\,.italic_A ( italic_a ; italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ; italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = divide start_ARG italic_a end_ARG start_ARG italic_a + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ( FRACOP start_ARG italic_a + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) . (2.26)

Here a,b1,,bk𝑎subscript𝑏1subscript𝑏𝑘a,b_{1},\dots,b_{k}\in\mathbb{C}italic_a , italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_C are all complex. This is equivalent to 𝒜𝒏(𝒃,a)subscript𝒜𝒏𝒃𝑎\mathscr{A}_{\bm{n}}(\bm{b},a)script_A start_POSTSUBSCRIPT bold_italic_n end_POSTSUBSCRIPT ( bold_italic_b , italic_a ) in eq. (2.10). Mohanty proved several multiparameter convolution identities in [34], in particular eq. (2.17). (Additional results are given in A below.) Mohanty defined a generating function [34, unnumbered before eq. (13)] and proved that it satisfies the following equation for z𝑧zitalic_z [34, eq. (22)]

szb+tzdz+1=0.𝑠superscript𝑧𝑏𝑡superscript𝑧𝑑𝑧10sz^{b}+tz^{d}-z+1=0\,.italic_s italic_z start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT + italic_t italic_z start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT - italic_z + 1 = 0 . (2.27)

Here all of b𝑏bitalic_b, d𝑑ditalic_d, s𝑠sitalic_s and t𝑡titalic_t are complex. Note that Mohanty displayed explicit derivations for the case k=2𝑘2k=2italic_k = 2 and pointed out that the extension to more parameters merely requires additional bookkee**, hence eq. (2.27) generalizes to k>2𝑘2k>2italic_k > 2 parameters and is effectively eq. (2.14). Similarly [34, eq. (25)] yields the identity eq. (2.16). Strehl [43] also gave a proof of the solution of eq. (2.14), where [43, eq.(21)] is an algebraic equation with complex coefficients. In fact [43, eq.(21)] is the equation solved by Mellin [33] and displayed in eq. (4.12) below. Strehl also provides some historical background, citing both Chu [11] and Raney [37]. More recently, eq. (2.14) was solved by Schuetz and Whieldon [40, Theorem 4.2], who treated integer valued coefficients and exponents only. The series coefficients were identified as Fuss–Catalan numbers, multiplied by multinomial coefficients (see eq. (2.11)); this is the only reference I have found to mention Fuss–Catalan explicitly for the multiparameter case (but see Chu [11] above). Banderier and Drmota [3] also derived a series solution for an algebraic equation where [3, Theorem 3.3] is termed the ‘Flajolet-Soria formula for coefficients of an algebraic function.’ See [3, eq. (3.3)].

Remark 2.5.

The exponents μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in eq. (2.14) need not be distinct, although from a practical viewpoint it may be pointless if they are not. Consider the extreme case where they are all equal μ1==μk=μsubscript𝜇1subscript𝜇𝑘𝜇\mu_{1}=\cdots=\mu_{k}=\muitalic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ⋯ = italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_μ so 𝐭^𝛍=μ^𝐭𝛍𝜇\hat{\bm{t}}\cdot\bm{\mu}=\muover^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ = italic_μ. Then eq. (2.14) simplifies to

f=1+(z1++zk)fμ.𝑓1subscript𝑧1subscript𝑧𝑘superscript𝑓𝜇f=1+(z_{1}+\cdots+z_{k})f^{\mu}\,.italic_f = 1 + ( italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) italic_f start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT . (2.28)

This is simply eq. (2.3) with z=j=1kzj𝑧superscriptsubscript𝑗1𝑘subscript𝑧𝑗z=\sum_{j=1}^{k}z_{j}italic_z = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. Then in eq. (2.14), At(μ,1)subscript𝐴𝑡𝜇1A_{t}(\mu,1)italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , 1 ) does not depend on the individual tjsubscript𝑡𝑗t_{j}italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT so

f=t=0At(μ,1)[t1++tk=t(tt1,,tk)z1t1zktk]=t=0At(μ,1)(z1++zk)t.𝑓superscriptsubscript𝑡0subscript𝐴𝑡𝜇1delimited-[]subscriptsubscript𝑡1subscript𝑡𝑘𝑡binomial𝑡subscript𝑡1subscript𝑡𝑘superscriptsubscript𝑧1subscript𝑡1superscriptsubscript𝑧𝑘subscript𝑡𝑘superscriptsubscript𝑡0subscript𝐴𝑡𝜇1superscriptsubscript𝑧1subscript𝑧𝑘𝑡\begin{split}f&=\sum_{t=0}^{\infty}A_{t}(\mu,1)\biggl{[}\sum_{t_{1}+\cdots+t_{% k}=t}\binom{t}{t_{1},\dots,t_{k}}z_{1}^{t_{1}}\cdots z_{k}^{t_{k}}\biggr{]}\\ &=\sum_{t=0}^{\infty}A_{t}(\mu,1)(z_{1}+\cdots+z_{k})^{t}\,.\end{split}start_ROW start_CELL italic_f end_CELL start_CELL = ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , 1 ) [ ∑ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_t end_POSTSUBSCRIPT ( FRACOP start_ARG italic_t end_ARG start_ARG italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋯ italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , 1 ) ( italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT . end_CELL end_ROW (2.29)

This is precisely the Fuss–Catalan series which is the known solution of eq. (2.28).

Remark 2.6 (branch cuts).

If some of the μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in eq. (2.14) are nonintegers, a branch cut is required in the complex plane. The classic example is the square root μj=12subscript𝜇𝑗12\mu_{j}=\frac{1}{2}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG and f1/2superscript𝑓12f^{1/2}italic_f start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT. A specific sheet of the complex plane must be selected, to render equations such as eq. (2.14) well defined (although, as pointed out above, the domain of absolute convergence will not depend on branch cuts). In all of the numerical work reported in this paper, the branch cut was placed along the positive real axis, so 0arg(zj)<2π0subscript𝑧𝑗2𝜋0\leq\arg(z_{j})<2\pi0 ≤ roman_arg ( italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) < 2 italic_π for j=1,,k𝑗1𝑘j=1,\dots,kitalic_j = 1 , … , italic_k and similarly for f𝑓fitalic_f and all other complex variables to appear below. This is necessary to obtain meaningful sums for the various series in this paper. Mellin [33] placed the branch cut along the negative real axis. The essential fact is that a branch cut is required; one must make a specific choice and adhere to it consistently.

3 Domain of convergence

In general, the convergence of an infinite series depends on the order of summation. In this paper, we take ‘convergence’ to mean exclusively absolute convergence. In that case, the answer does not depend on the order of summation. In general, the series in eq. (2.13) has a finite domain of absolute convergence. We present two sets of conditions for the series in eq. (2.13) to converge absolutely. The first is necessary, but in general not sufficient, and the second is sufficient, but in general not necessary. A more detailed analysis for the special case of algebraic equations will be presented in Secs. 7 and 8. The derivations below assume the μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are real and are ordered μ1μ2μksubscript𝜇1subscript𝜇2subscript𝜇𝑘\mu_{1}\leq\mu_{2}\leq\cdots\leq\mu_{k}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ ⋯ ≤ italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. We begin with the following lemma for the asymptotic value of the Fuss–Catalan numbers.

Lemma 3.1.

Asymptotically for t1much-greater-than𝑡1t\gg 1italic_t ≫ 1 and real μ,r𝜇𝑟\mu,ritalic_μ , italic_r,

At(μ,r)r2πt3/2|μ|r12|1μ|r+12(|μ|μ|1μ|1μ)t.similar-tosubscript𝐴𝑡𝜇𝑟𝑟2𝜋superscript𝑡32superscript𝜇𝑟12superscript1𝜇𝑟12superscriptsuperscript𝜇𝜇superscript1𝜇1𝜇𝑡A_{t}(\mu,r)\sim\frac{r}{\sqrt{2\pi}\,t^{3/2}}\,\frac{|\mu|^{r-\frac{1}{2}}}{|% 1-\mu|^{r+\frac{1}{2}}}\,(|\mu|^{\mu}|1-\mu|^{1-\mu})^{t}\,.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) ∼ divide start_ARG italic_r end_ARG start_ARG square-root start_ARG 2 italic_π end_ARG italic_t start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT end_ARG divide start_ARG | italic_μ | start_POSTSUPERSCRIPT italic_r - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG | 1 - italic_μ | start_POSTSUPERSCRIPT italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT end_ARG ( | italic_μ | start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT | 1 - italic_μ | start_POSTSUPERSCRIPT 1 - italic_μ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT . (3.1)

The above is an application of Stirling’s formula and the proof is omitted. We require μ0𝜇0\mu\neq 0italic_μ ≠ 0 and μ1𝜇1\mu\neq 1italic_μ ≠ 1 to justify the intermediate steps in the derivation. To determine the radius of convergence using d’Alembert’s ratio test, note that asymptotically

At(μ,r)At1(μ,r)|μ|μ|1μ|1μ.similar-tosubscript𝐴𝑡𝜇𝑟subscript𝐴𝑡1𝜇𝑟superscript𝜇𝜇superscript1𝜇1𝜇\frac{A_{t}(\mu,r)}{A_{t-1}(\mu,r)}\sim|\mu|^{\mu}|1-\mu|^{1-\mu}\,.divide start_ARG italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ , italic_r ) end_ARG start_ARG italic_A start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_μ , italic_r ) end_ARG ∼ | italic_μ | start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT | 1 - italic_μ | start_POSTSUPERSCRIPT 1 - italic_μ end_POSTSUPERSCRIPT . (3.2)
Proposition 3.2 (necessary, not sufficient).

For the series in eq. (2.13) to converge absolutely, it is necessary that

|zj||zj|max1|μj|μj|1μj|1μj(j=1,,k).formulae-sequencesubscript𝑧𝑗subscriptsubscript𝑧𝑗1superscriptsubscript𝜇𝑗subscript𝜇𝑗superscript1subscript𝜇𝑗1subscript𝜇𝑗𝑗1𝑘|z_{j}|\leq|z_{j}|_{\max}\equiv\frac{1}{|\mu_{j}|^{\mu_{j}}|1-\mu_{j}|^{1-\mu_% {j}}}\qquad(j=1,\dots,k)\,.| italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ≡ divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ( italic_j = 1 , … , italic_k ) . (3.3)

Then all points of the following form lie in the domain of convergence

𝒛~j=(0,,0,|zj|=|zj|max,0,,0)(j=1,,k).\tilde{\bm{z}}_{j}=(0,\dots,0,|z_{j}|=|z_{j}|_{\max},0,\dots,0)\qquad(j=1,% \dots,k)\,.over~ start_ARG bold_italic_z end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( 0 , … , 0 , | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | = | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT , 0 , … , 0 ) ( italic_j = 1 , … , italic_k ) . (3.4)
Proof.

Fix a value of j𝑗jitalic_j and set all the other zjsubscript𝑧superscript𝑗z_{j^{\prime}}italic_z start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT to zero, where jjsuperscript𝑗𝑗j^{\prime}\neq jitalic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_j. Then the series in eq. (2.13) reduces to a sum in powers of single variable zjsubscript𝑧𝑗z_{j}italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. Then eq. (3.3) follows from eq. (3.2) and d’Alembert’s ratio test. From the asymptotic form of the Fuss–Catalan numbers in eq. (3.1), the series converges also on its circle of convergence, justifying the ‘\leq’ in eq. (3.3). Then eq. (3.4) follows immediately. ∎

Proposition 3.3 (sufficient, not necessary).

The series in eq. (2.13) converges absolutely if

j=1k|zj|min(1|μ1|μ1|1μ1|1μ1,1|μk|μk|1μk|1μk).superscriptsubscript𝑗1𝑘subscript𝑧𝑗1superscriptsubscript𝜇1subscript𝜇1superscript1subscript𝜇11subscript𝜇11superscriptsubscript𝜇𝑘subscript𝜇𝑘superscript1subscript𝜇𝑘1subscript𝜇𝑘\sum_{j=1}^{k}|z_{j}|\leq\min\biggl{(}\frac{1}{|\mu_{1}|^{\mu_{1}}|1-\mu_{1}|^% {1-\mu_{1}}}\,,\frac{1}{|\mu_{k}|^{\mu_{k}}|1-\mu_{k}|^{1-\mu_{k}}}\biggr{)}\,.∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ roman_min ( divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG , divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) . (3.5)

The above condition is sufficient, but in general not necessary.

Proof.

We employ eq. (2.13) and eq. (2.14). Let us define α=j=1k|zj|𝛼superscriptsubscript𝑗1𝑘subscript𝑧𝑗\alpha=\sum_{j=1}^{k}|z_{j}|italic_α = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | and pj=|zj|/αsubscript𝑝𝑗subscript𝑧𝑗𝛼p_{j}=|z_{j}|/\alphaitalic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | / italic_α, for j=1,,k𝑗1𝑘j=1,\dots,kitalic_j = 1 , … , italic_k. Then 0pj10subscript𝑝𝑗10\leq p_{j}\leq 10 ≤ italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ 1 and j=1kpj=1superscriptsubscript𝑗1𝑘subscript𝑝𝑗1\sum_{j=1}^{k}p_{j}=1∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1. Then from eq. (2.14)

|f|t=0αt{t1++tk=t|At(𝒕^𝝁,1)|(tt1,,tk)p1t1pktk}.𝑓superscriptsubscript𝑡0superscript𝛼𝑡conditional-setsubscriptsubscript𝑡1subscript𝑡𝑘𝑡conditionalsubscript𝐴𝑡^𝒕𝝁1binomial𝑡subscript𝑡1subscript𝑡𝑘superscriptsubscript𝑝1subscript𝑡1superscriptsubscript𝑝𝑘subscript𝑡𝑘|f|\leq\sum_{t=0}^{\infty}\alpha^{t}\biggl{\{}\sum_{t_{1}+\cdots+t_{k}=t}|A_{t% }(\hat{\bm{t}}\cdot\bm{\mu},1)|\binom{t}{t_{1},\dots,t_{k}}p_{1}^{t_{1}}\cdots p% _{k}^{t_{k}}\biggr{\}}\,.| italic_f | ≤ ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT { ∑ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_t end_POSTSUBSCRIPT | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ , 1 ) | ( FRACOP start_ARG italic_t end_ARG start_ARG italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋯ italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT } . (3.6)

Let us suppose that |At(𝒕^𝝁,1)|subscript𝐴𝑡^𝒕𝝁1|A_{t}(\hat{\bm{t}}\cdot\bm{\mu},1)|| italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ , 1 ) | is majorized by setting 𝒕^𝝁=μ^𝒕𝝁subscript𝜇\hat{\bm{t}}\cdot\bm{\mu}=\mu_{*}over^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ = italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT, where μsubscript𝜇\mu_{*}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT does not depend on the tjsubscript𝑡𝑗t_{j}italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. (This will be discussed in more detail below.) Actually, to establish convergence of the series, it is sufficient if |At(𝒕^𝝁,1)|<|At(μ,1)|subscript𝐴𝑡^𝒕𝝁1subscript𝐴𝑡subscript𝜇1|A_{t}(\hat{\bm{t}}\cdot\bm{\mu},1)|<|A_{t}(\mu_{*},1)|| italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ , 1 ) | < | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , 1 ) | only asymptotically, say for tT𝑡𝑇t\geq Titalic_t ≥ italic_T. Then

|f|const+t=T|At(μ,1)|αt{t1++tk=t(tt1,,tk)p1t1pktk}=const+t=T|At(μ,1)|αt.𝑓constsuperscriptsubscript𝑡𝑇subscript𝐴𝑡subscript𝜇1superscript𝛼𝑡subscriptsubscript𝑡1subscript𝑡𝑘𝑡binomial𝑡subscript𝑡1subscript𝑡𝑘superscriptsubscript𝑝1subscript𝑡1superscriptsubscript𝑝𝑘subscript𝑡𝑘constsuperscriptsubscript𝑡𝑇subscript𝐴𝑡subscript𝜇1superscript𝛼𝑡\begin{split}|f|&\leq\textrm{const}+\sum_{t=T}^{\infty}|A_{t}(\mu_{*},1)|\,% \alpha^{t}\biggl{\{}\sum_{t_{1}+\cdots+t_{k}=t}\binom{t}{t_{1},\dots,t_{k}}p_{% 1}^{t_{1}}\dots p_{k}^{t_{k}}\biggr{\}}\\ &=\textrm{const}+\sum_{t=T}^{\infty}|A_{t}(\mu_{*},1)|\,\alpha^{t}\,.\end{split}start_ROW start_CELL | italic_f | end_CELL start_CELL ≤ const + ∑ start_POSTSUBSCRIPT italic_t = italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , 1 ) | italic_α start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT { ∑ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_t end_POSTSUBSCRIPT ( FRACOP start_ARG italic_t end_ARG start_ARG italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT … italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT } end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = const + ∑ start_POSTSUBSCRIPT italic_t = italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , 1 ) | italic_α start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT . end_CELL end_ROW (3.7)

Using eq. (3.2) and d’Alembert’s ratio test, we obtain the following sufficient (but not always necessary) condition for convergence:

j=1k|zj|1|μ|μ|1μ|1μ.superscriptsubscript𝑗1𝑘subscript𝑧𝑗1superscriptsubscript𝜇subscript𝜇superscript1subscript𝜇1subscript𝜇\sum_{j=1}^{k}|z_{j}|\leq\frac{1}{|\mu_{*}|^{\mu_{*}}|1-\mu_{*}|^{1-\mu_{*}}}\,.∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG . (3.8)

The essential step to complete the proof is to specify the value of μsubscript𝜇\mu_{*}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT. Since the μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are ordered, μ1𝒕^𝝁μksubscript𝜇1^𝒕𝝁subscript𝜇𝑘\mu_{1}\leq\hat{\bm{t}}\cdot\bm{\mu}\leq\mu_{k}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ over^ start_ARG bold_italic_t end_ARG ⋅ bold_italic_μ ≤ italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Now the graph of |μ|μ|1μ|1μsuperscriptsubscript𝜇subscript𝜇superscript1subscript𝜇1subscript𝜇|\mu_{*}|^{\mu_{*}}|1-\mu_{*}|^{1-\mu_{*}}| italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT attains a minimum at μ=12subscript𝜇12\mu_{*}=\frac{1}{2}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG (and is symmetric around μ=12subscript𝜇12\mu_{*}=\frac{1}{2}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG) and increases monotonically in either direction away from the minimum. Hence the value of |μ|μ|1μ|1μsuperscriptsubscript𝜇subscript𝜇superscript1subscript𝜇1subscript𝜇|\mu_{*}|^{\mu_{*}}|1-\mu_{*}|^{1-\mu_{*}}| italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is maximized by setting μ=μ1subscript𝜇subscript𝜇1\mu_{*}=\mu_{1}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT or μ=μksubscript𝜇subscript𝜇𝑘\mu_{*}=\mu_{k}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Either value will do if they are equidistant from 1212\frac{1}{2}divide start_ARG 1 end_ARG start_ARG 2 end_ARG. This proves eq. (3.5). Admittedly, this may not be an optimal criterion: it is sufficient, but may not be necessary. Numerical tests indicate that the domain of convergence using the above value of μsubscript𝜇\mu_{*}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT can be very conservative. ∎

Corollary 3.4 (trinomial).

For the special case k=1𝑘1k=1italic_k = 1, there is only one summand, and so μ=μ1=μsubscript𝜇subscript𝜇1𝜇\mu_{*}=\mu_{1}=\muitalic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_μ and we may write z1=zsubscript𝑧1𝑧z_{1}=zitalic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_z. Then the criterion for absolute convergence is necessary and sufficient. The series in eq. (2.2) converges if and only if

|z|1|μ|μ|1μ|1μ.𝑧1superscript𝜇𝜇superscript1𝜇1𝜇|z|\leq\frac{1}{|\mu|^{\mu}|1-\mu|^{1-\mu}}\,.| italic_z | ≤ divide start_ARG 1 end_ARG start_ARG | italic_μ | start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT | 1 - italic_μ | start_POSTSUPERSCRIPT 1 - italic_μ end_POSTSUPERSCRIPT end_ARG . (3.9)

The proof is immediate from taking Propositions 3.2 and 3.3 together. From Proposition 3.2, the series converges everywhere on its circle of convergence. This corollary will be important below.

Remark 3.5.

It is clear that the conditions in eq. (3.3) are individually necessary, but, even taken together, they are not sufficient to guarantee absolute convergence of the full sum in eq. (2.13). Hence an upper bound for the measure of the domain of absolute convergence, for (|z1|,,|zk|)ksubscript𝑧1subscript𝑧𝑘superscript𝑘(|z_{1}|,\dots,|z_{k}|)\in\mathbb{R}^{k}( | italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , … , | italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, is given by the finite product

μj=1k1|μj|μj|1μj|1μj<.𝜇superscriptsubscriptproduct𝑗1𝑘1superscriptsubscript𝜇𝑗subscript𝜇𝑗superscript1subscript𝜇𝑗1subscript𝜇𝑗\mu\leq\prod_{j=1}^{k}\frac{1}{|\mu_{j}|^{\mu_{j}}|1-\mu_{j}|^{1-\mu_{j}}}<% \infty\,.italic_μ ≤ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG < ∞ . (3.10)

The use of μ𝜇\muitalic_μ on the left hand side to denote measure should not be confused with other uses of μ𝜇\muitalic_μ in this paper. The true domain of absolute convergence is a set of smaller measure. This justifies the claim at the beginning of this section that the series in eq. (2.13) has a ‘finite domain’ of absolute convergence, i.e. finite measure. Similarly, using the sufficient condition in eq. (3.5) and μsubscript𝜇\mu_{*}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT from eq. (3.8), a lower bound for the measure of the domain of absolute convergence is

μ1k!(1|μ|μ|1μ|1μ)k>0.𝜇1𝑘superscript1superscriptsubscript𝜇subscript𝜇superscript1subscript𝜇1subscript𝜇𝑘0\mu\geq\frac{1}{k!}\biggl{(}\frac{1}{|\mu_{*}|^{\mu_{*}}|1-\mu_{*}|^{1-\mu_{*}% }}\biggr{)}^{k}>0\,.italic_μ ≥ divide start_ARG 1 end_ARG start_ARG italic_k ! end_ARG ( divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT > 0 . (3.11)

The measure is positive: for sufficiently small |zj|subscript𝑧𝑗|z_{j}|| italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |, j=1,,k𝑗1𝑘j=1,\dots,kitalic_j = 1 , … , italic_k, the series in eq. (2.13) converges in an open neighborhood of the origin for 𝐳k𝐳superscript𝑘\bm{z}\in\mathbb{C}^{k}bold_italic_z ∈ blackboard_C start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT. Of course this latter fact could be deduced directly using eq. (2.14), but eq. (3.11) supplies a quantitative lower bound. For the special case of a trinomial, where k=1𝑘1k=1italic_k = 1, the two bounds in eqs. (3.10) and (3.11) coincide.

Remark 3.6.

A complete derivation of a necessary and sufficient condition for the absolute convergence of the series in eq. (2.13) has not yet been discovered. However, the situation is different for an algebraic equation. As stated in the introduction, the necessary and sufficient bound for the convergence of the series solution of eq. (1.8) will be presented in Sec. 8.

4 Algebraic equations

4.1 Preliminary remarks

We now treat some applications of the above formalism. In this paper we shall treat algebraic equations, i.e. series solutions for roots of polynomials. Consider the general algebraic equation of degree n𝑛nitalic_n with x𝑥x\in\mathbb{C}italic_x ∈ blackboard_C and complex coefficients a0,,ansubscript𝑎0subscript𝑎𝑛a_{0},\dots,a_{n}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT

0=a0+a1x++anxn.0subscript𝑎0subscript𝑎1𝑥subscript𝑎𝑛superscript𝑥𝑛0=a_{0}+a_{1}x+\cdots+a_{n}x^{n}\,.0 = italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + ⋯ + italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT . (4.1)

We require a00subscript𝑎00a_{0}\neq 0italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≠ 0 else we factor out a root x=0𝑥0x=0italic_x = 0. We also require an0subscript𝑎𝑛0a_{n}\neq 0italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≠ 0. We begin with an obvious, but necessary, caveat. It is possible that some or all of the remaining ajsubscript𝑎𝑗a_{j}italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT could vanish. To avoid cluttering the presentation, it is to be understood that in all of the multinomial sums below, the sums extend only over the nonzero ajsubscript𝑎𝑗a_{j}italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. We now note two elementary transformations of eq. (4.1), which do not affect the fundamental properties of the roots. First, we can multiply all the coefficients by a constant λ0𝜆0\lambda\neq 0italic_λ ≠ 0. This does not change the roots of eq. (4.1). Next, we can replace x𝑥xitalic_x by μy𝜇𝑦\mu yitalic_μ italic_y, where μ0𝜇0\mu\neq 0italic_μ ≠ 0. The roots for x𝑥xitalic_x are simply those for y𝑦yitalic_y, multiplied by μ𝜇\muitalic_μ. The resulting equation is j=0najλμjyj=0superscriptsubscript𝑗0𝑛subscript𝑎𝑗𝜆superscript𝜇𝑗superscript𝑦𝑗0\sum_{j=0}^{n}a_{j}\lambda\mu^{j}y^{j}=0∑ start_POSTSUBSCRIPT italic_j = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_λ italic_μ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = 0. Define bj=ajλμjsubscript𝑏𝑗subscript𝑎𝑗𝜆superscript𝜇𝑗b_{j}=a_{j}\lambda\mu^{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_λ italic_μ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT. We can select two integers p𝑝pitalic_p and q𝑞qitalic_q such that 0p<qn0𝑝𝑞𝑛0\leq p<q\leq n0 ≤ italic_p < italic_q ≤ italic_n and find values for λ𝜆\lambdaitalic_λ and μ𝜇\muitalic_μ such that bp=bq=1subscript𝑏𝑝subscript𝑏𝑞1b_{p}=b_{q}=1italic_b start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_b start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = 1, yielding

0=b0+b1y++yp++yq++bnyn.0subscript𝑏0subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞subscript𝑏𝑛superscript𝑦𝑛0=b_{0}+b_{1}y+\cdots+y^{p}+\cdots+y^{q}+\cdots+b_{n}y^{n}\,.0 = italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y + ⋯ + italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ + italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT . (4.2)

Clearly both apsubscript𝑎𝑝a_{p}italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and aqsubscript𝑎𝑞a_{q}italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT must be nonzero to do this. It is easily derived that μ=(ap/aq)1/(qp)𝜇superscriptsubscript𝑎𝑝subscript𝑎𝑞1𝑞𝑝\mu=(a_{p}/a_{q})^{1/(q-p)}italic_μ = ( italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT and

bj=ajap(qj)/(qp)aq(jp)/(qp).subscript𝑏𝑗subscript𝑎𝑗superscriptsubscript𝑎𝑝𝑞𝑗𝑞𝑝superscriptsubscript𝑎𝑞𝑗𝑝𝑞𝑝b_{j}=\frac{a_{j}}{a_{p}^{(q-j)/(q-p)}a_{q}^{(j-p)/(q-p)}}\,.italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = divide start_ARG italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_q - italic_j ) / ( italic_q - italic_p ) end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_j - italic_p ) / ( italic_q - italic_p ) end_POSTSUPERSCRIPT end_ARG . (4.3)

Technically, bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT depends on p𝑝pitalic_p and q𝑞qitalic_q also, but we consider this to be understood below. Clearly a branch cut is required to derive the above expressions. There are actually qp𝑞𝑝q-pitalic_q - italic_p solutions for μ𝜇\muitalic_μ and bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, indexed by the qp𝑞𝑝q-pitalic_q - italic_p roots of unity 11/(qp)superscript11𝑞𝑝1^{1/(q-p)}1 start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT (actually (1)1/(qp)superscript11𝑞𝑝(-1)^{1/(q-p)}( - 1 ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT, we shall see this below). For brevity we define the set 𝒩npq={0,,n}{p,q}subscript𝒩𝑛𝑝𝑞0𝑛𝑝𝑞\mathscr{N}_{npq}=\{0,\dots,n\}\setminus\{p,q\}script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT = { 0 , … , italic_n } ∖ { italic_p , italic_q }. We divide eq. (4.2) through by ypsuperscript𝑦𝑝y^{p}italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT and rearrange terms to obtain the equation

yqp=1+j𝒩npqbjyjp.superscript𝑦𝑞𝑝1subscript𝑗subscript𝒩𝑛𝑝𝑞subscript𝑏𝑗superscript𝑦𝑗𝑝-y^{q-p}=1+\sum_{j\in\mathscr{N}_{npq}}b_{j}y^{j-p}\,.- italic_y start_POSTSUPERSCRIPT italic_q - italic_p end_POSTSUPERSCRIPT = 1 + ∑ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT italic_j - italic_p end_POSTSUPERSCRIPT . (4.4)

Note that if we set all the bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT to zero, the equation reduces to yqp=1superscript𝑦𝑞𝑝1y^{q-p}=-1italic_y start_POSTSUPERSCRIPT italic_q - italic_p end_POSTSUPERSCRIPT = - 1. The solution is any of the radicals y=(1)1/(qp)𝑦superscript11𝑞𝑝y=(-1)^{1/(q-p)}italic_y = ( - 1 ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT. By the implicit function theorem, an absolutely convergent solution for y𝑦yitalic_y exists for sufficienly small amplitudes of the |bj|subscript𝑏𝑗|b_{j}|| italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |. Hence the domain of absolute convergence of the series solution of eq. (4.4) is nonempty, expressing y𝑦yitalic_y in a power series in the bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. (We already know this from Sec. 3.) Now set ζ=yqp𝜁superscript𝑦𝑞𝑝\zeta=-y^{q-p}italic_ζ = - italic_y start_POSTSUPERSCRIPT italic_q - italic_p end_POSTSUPERSCRIPT, so y=(1)1/(qp)ζ1/(qp)𝑦superscript11𝑞𝑝superscript𝜁1𝑞𝑝y=(-1)^{1/(q-p)}\zeta^{1/(q-p)}italic_y = ( - 1 ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT italic_ζ start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT. We append a subscript \ellroman_ℓ on x𝑥xitalic_x, y𝑦yitalic_y and ζ𝜁\zetaitalic_ζ to index the qp𝑞𝑝q-pitalic_q - italic_p choices of radicals (1)1/(qp)superscript11𝑞𝑝(-1)^{1/(q-p)}( - 1 ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT. Employing a branch cut along the positive real axis, they are eiπ(2+1)/(qp)superscript𝑒𝑖𝜋21𝑞𝑝e^{i\pi(2\ell+1)/(q-p)}italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / ( italic_q - italic_p ) end_POSTSUPERSCRIPT, where =0,,qp10𝑞𝑝1\ell=0,\dots,q-p-1roman_ℓ = 0 , … , italic_q - italic_p - 1. Set μj=(jp)/(qp)subscript𝜇𝑗𝑗𝑝𝑞𝑝\mu_{j}=(j-p)/(q-p)italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_j - italic_p ) / ( italic_q - italic_p ), then bj=aj/(ap1μjaqμj)subscript𝑏𝑗subscript𝑎𝑗superscriptsubscript𝑎𝑝1subscript𝜇𝑗superscriptsubscript𝑎𝑞subscript𝜇𝑗b_{j}=a_{j}/(a_{p}^{1-\mu_{j}}a_{q}^{\mu_{j}})italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT / ( italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) and ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT satisfies

ζ=1+j𝒩npqeiπ(2+1)μjajap1μjaqμjζμj.subscript𝜁1subscript𝑗subscript𝒩𝑛𝑝𝑞superscript𝑒𝑖𝜋21subscript𝜇𝑗subscript𝑎𝑗superscriptsubscript𝑎𝑝1subscript𝜇𝑗superscriptsubscript𝑎𝑞subscript𝜇𝑗superscriptsubscript𝜁subscript𝜇𝑗\zeta_{\ell}=1+\sum_{j\in\mathscr{N}_{npq}}e^{i\pi(2\ell+1)\mu_{j}}\frac{a_{j}% }{a_{p}^{1-\mu_{j}}a_{q}^{\mu_{j}}}\,\zeta_{\ell}^{\mu_{j}}\,.italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 + ∑ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT divide start_ARG italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (4.5)

This has the form of eq. (2.14), with k=Card(𝒩npq)𝑘Cardsubscript𝒩𝑛𝑝𝑞k=\textrm{Card}(\mathscr{N}_{npq})italic_k = Card ( script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT ) parameters (the bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT). The expressions for t𝑡titalic_t and 𝒕𝝁𝒕𝝁\bm{t}\cdot\bm{\mu}bold_italic_t ⋅ bold_italic_μ are, in this case,

t=j𝒩npqtj,𝒕𝝁=j𝒩npqtjμj.formulae-sequence𝑡subscript𝑗subscript𝒩𝑛𝑝𝑞subscript𝑡𝑗𝒕𝝁subscript𝑗subscript𝒩𝑛𝑝𝑞subscript𝑡𝑗subscript𝜇𝑗t=\sum_{j\in\mathscr{N}_{npq}}t_{j}\,,\qquad\bm{t}\cdot\bm{\mu}=\sum_{j\in% \mathscr{N}_{npq}}t_{j}\mu_{j}\,.italic_t = ∑ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , bold_italic_t ⋅ bold_italic_μ = ∑ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT . (4.6)

The solution for ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is given by eq. (2.13) and xsubscript𝑥x_{\ell}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is obtained from ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT via

x=eiπ2+1qp(apaq)1/(qp)ζ1/(qp).subscript𝑥superscript𝑒𝑖𝜋21𝑞𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑞1𝑞𝑝superscriptsubscript𝜁1𝑞𝑝x_{\ell}=e^{i\pi\frac{2\ell+1}{q-p}}\Bigl{(}\frac{a_{p}}{a_{q}}\Bigr{)}^{1/(q-% p)}\zeta_{\ell}^{1/(q-p)}\,.italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π divide start_ARG 2 roman_ℓ + 1 end_ARG start_ARG italic_q - italic_p end_ARG end_POSTSUPERSCRIPT ( divide start_ARG italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT . (4.7)

It is conventional to solve for the rthsuperscript𝑟𝑡r^{th}italic_r start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT powers of the roots. From Theorem 2.4(a) and (b) and eq. (4.7), we obtain

xr=eiπ(2+1)rqp(apaq)r/(qp)𝒕n1𝒜𝒕(𝝁,rqp)eiπ(2+1)𝒕𝝁(j𝒩npqbjtj)=eiπ(2+1)rqp(apaq)1/(qp)𝒕n1𝒜𝒕(𝝁,rqp)eiπ(2+1)𝒕𝝁apt𝒕𝝁aq𝒕𝝁(j𝒩npqajtj).superscriptsubscript𝑥𝑟superscript𝑒𝑖𝜋21𝑟𝑞𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑞𝑟𝑞𝑝subscript𝒕superscript𝑛1subscript𝒜𝒕𝝁𝑟𝑞𝑝superscript𝑒𝑖𝜋21𝒕𝝁subscriptproduct𝑗subscript𝒩𝑛𝑝𝑞superscriptsubscript𝑏𝑗subscript𝑡𝑗superscript𝑒𝑖𝜋21𝑟𝑞𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑞1𝑞𝑝subscript𝒕superscript𝑛1subscript𝒜𝒕𝝁𝑟𝑞𝑝superscript𝑒𝑖𝜋21𝒕𝝁superscriptsubscript𝑎𝑝𝑡𝒕𝝁superscriptsubscript𝑎𝑞𝒕𝝁subscriptproduct𝑗subscript𝒩𝑛𝑝𝑞superscriptsubscript𝑎𝑗subscript𝑡𝑗\begin{split}x_{\ell}^{r}&=e^{i\pi\frac{(2\ell+1)r}{q-p}}\Bigl{(}\frac{a_{p}}{% a_{q}}\Bigr{)}^{r/(q-p)}\,\sum_{\bm{t}\in\mathbb{N}^{n-1}}\mathscr{A}_{\bm{t}}% \Bigl{(}\bm{\mu},\frac{r}{q-p}\Bigr{)}\,e^{i\pi(2\ell+1)\bm{t}\cdot\bm{\mu}}% \Bigl{(}\prod_{j\in\mathscr{N}_{npq}}b_{j}^{t_{j}}\Bigr{)}\\ &=e^{i\pi\frac{(2\ell+1)r}{q-p}}\Bigl{(}\frac{a_{p}}{a_{q}}\Bigr{)}^{1/(q-p)}% \,\sum_{\bm{t}\in\mathbb{N}^{n-1}}\mathscr{A}_{\bm{t}}\Bigl{(}\bm{\mu},\frac{r% }{q-p}\Bigr{)}\,\frac{e^{i\pi(2\ell+1)\bm{t}\cdot\bm{\mu}}}{a_{p}^{t-\bm{t}% \cdot\bm{\mu}}a_{q}^{\bm{t}\cdot\bm{\mu}}}\Bigl{(}\prod_{j\in\mathscr{N}_{npq}% }a_{j}^{t_{j}}\Bigr{)}\,.\end{split}start_ROW start_CELL italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT end_CELL start_CELL = italic_e start_POSTSUPERSCRIPT italic_i italic_π divide start_ARG ( 2 roman_ℓ + 1 ) italic_r end_ARG start_ARG italic_q - italic_p end_ARG end_POSTSUPERSCRIPT ( divide start_ARG italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_r / ( italic_q - italic_p ) end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT bold_italic_t ∈ blackboard_N start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , divide start_ARG italic_r end_ARG start_ARG italic_q - italic_p end_ARG ) italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT ( ∏ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_e start_POSTSUPERSCRIPT italic_i italic_π divide start_ARG ( 2 roman_ℓ + 1 ) italic_r end_ARG start_ARG italic_q - italic_p end_ARG end_POSTSUPERSCRIPT ( divide start_ARG italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT bold_italic_t ∈ blackboard_N start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT script_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , divide start_ARG italic_r end_ARG start_ARG italic_q - italic_p end_ARG ) divide start_ARG italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT end_ARG ( ∏ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) . end_CELL end_ROW (4.8)

In the first line of eq. (4.8), xrsuperscriptsubscript𝑥𝑟x_{\ell}^{r}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT is a sum over products of positive integral powers of the bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, i.e. a power series. In the second line, apsubscript𝑎𝑝a_{p}italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and aqsubscript𝑎𝑞a_{q}italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT appear with fractional (and possibly negative) exponents, whereas the other ajsubscript𝑎𝑗a_{j}italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT appear with positive integral powers. Hence in general the series solution for the roots is a Laurent–Puiseux series in the coefficients of the polynomial, i.e. eq. (4.1). This is of course a known fact, not connected with Fuss–Catalan numbers.

Note that eq. (4.1) has n𝑛nitalic_n roots, counting multiplicities, but eq. (4.8) yields qp𝑞𝑝q-pitalic_q - italic_p roots. If p=0𝑝0p=0italic_p = 0 and q=n𝑞𝑛q=nitalic_q = italic_n, so qp=n𝑞𝑝𝑛q-p=nitalic_q - italic_p = italic_n, then eq. (4.8) yields expressions for all the n𝑛nitalic_n roots of eq. (4.1). If qp<n𝑞𝑝𝑛q-p<nitalic_q - italic_p < italic_n then we require multiple series to obtain all the n𝑛nitalic_n roots of eq. (4.1). It is simplest to explain with an example. Choose p=0𝑝0p=0italic_p = 0 and q=1𝑞1q=1italic_q = 1, this yields only one root. Next choose p=1𝑝1p=1italic_p = 1 and q=n𝑞𝑛q=nitalic_q = italic_n, this yields n1𝑛1n-1italic_n - 1 roots. One must try different selections for p𝑝pitalic_p and q𝑞qitalic_q to verify that expressions for all the roots of eq. (4.1) have been found. We shall see this in connection with the trinomial below.

It is also possible to choose q<p𝑞𝑝q<pitalic_q < italic_p. Doing so yields the same set of roots of eq. (4.1) obtained by interchanging p𝑝pitalic_p and q𝑞qitalic_q. This can be seen with some elementary transformations and relabelling of indices. The details are left to the reader. Hence without loss of generality we may assume p<q𝑝𝑞p<qitalic_p < italic_q.

For all the nonvanishing ajsubscript𝑎𝑗a_{j}italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, from eq. (3.3), for absolute convergence we require (necessary, not sufficient)

|bj|=|aj||ap|1μj|aq|μj1|μj|μj|1μj|1μj.subscript𝑏𝑗subscript𝑎𝑗superscriptsubscript𝑎𝑝1subscript𝜇𝑗superscriptsubscript𝑎𝑞subscript𝜇𝑗1superscriptsubscript𝜇𝑗subscript𝜇𝑗superscript1subscript𝜇𝑗1subscript𝜇𝑗|b_{j}|=\frac{|a_{j}|}{|a_{p}|^{1-\mu_{j}}|a_{q}|^{\mu_{j}}}\leq\frac{1}{|\mu_% {j}|^{\mu_{j}}|1-\mu_{j}|^{1-\mu_{j}}}\,.| italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | = divide start_ARG | italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | end_ARG start_ARG | italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ≤ divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG . (4.9)

Let the lowest and highest indices of the nonzero ajsubscript𝑎𝑗a_{j}italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in 𝒩npqsubscript𝒩𝑛𝑝𝑞\mathscr{N}_{npq}script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT be jminsubscript𝑗j_{\min}italic_j start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT and jmaxsubscript𝑗j_{\max}italic_j start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, respectively. From eq. (3.5), the sufficient (but not necessary) criterion for absolute convergence is

j𝒩npq|bj|=j𝒩npq|aj||ap|1μj|aq|μjmin(1|μjmin|μjmin|1μjmin|1μjmin,1|μjmax|μjmax|1μjmax|1μjmax).\begin{split}\sum_{j\in\mathscr{N}_{npq}}|b_{j}|=\sum_{j\in\mathscr{N}_{npq}}% \frac{|a_{j}|}{|a_{p}|^{1-\mu_{j}}|a_{q}|^{\mu_{j}}}&\leq\min\biggl{(}\frac{1}% {|\mu_{j_{\min}}|^{\mu_{j_{\min}}}|1-\mu_{j_{\min}}|^{1-\mu_{j_{\min}}}}\,,\\ &\qquad\qquad\qquad\frac{1}{|\mu_{j_{\max}}|^{\mu_{j_{\max}}}|1-\mu_{j_{\max}}% |^{1-\mu_{j_{\max}}}}\biggr{)}\,.\end{split}start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | = ∑ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG | italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | end_ARG start_ARG | italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG end_CELL start_CELL ≤ roman_min ( divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) . end_CELL end_ROW (4.10)

As noted in Sec. 3, the domain of absolute convergence depends only on the amplitudes |aj|subscript𝑎𝑗|a_{j}|| italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | and hence the domain is the same for all the choices of radicals for (1)1/(qp)superscript11𝑞𝑝(-1)^{1/(q-p)}( - 1 ) start_POSTSUPERSCRIPT 1 / ( italic_q - italic_p ) end_POSTSUPERSCRIPT, i.e. all the values of \ellroman_ℓ.

4.2 Comment on McClintock’s series

McClintock in 1895 published a paper [32] deriving series expressions for all the roots of a polynomial of arbitrary degree. He began with the illustrative example x6=1xsuperscript𝑥61𝑥x^{6}=-1-xitalic_x start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT = - 1 - italic_x. McClintock obtained [32, eq. (1)]

x=ωω2a32ω3a283ω4a3𝑥𝜔superscript𝜔2𝑎32superscript𝜔3superscript𝑎283superscript𝜔4superscript𝑎3x=\omega-\omega^{2}a-\frac{3}{2}\omega^{3}a^{2}-\frac{8}{3}\omega^{4}a^{3}-\cdotsitalic_x = italic_ω - italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a - divide start_ARG 3 end_ARG start_ARG 2 end_ARG italic_ω start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG 8 end_ARG start_ARG 3 end_ARG italic_ω start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_a start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT - ⋯ (4.11)

where a=16𝑎16a=-\frac{1}{6}italic_a = - divide start_ARG 1 end_ARG start_ARG 6 end_ARG and “ω𝜔\omegaitalic_ω is any one of sixth-roots of 11-1- 1.” This is essentially exactly the procedure I employed above: in eq. (4.7) I took an nthsuperscript𝑛𝑡n^{th}italic_n start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT root of the highest power xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and solved for ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT in eq. (4.5). Note that eiπ(2+1)/nsuperscript𝑒𝑖𝜋21𝑛e^{i\pi(2\ell+1)/n}italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_n end_POSTSUPERSCRIPT is an nthsuperscript𝑛𝑡n^{th}italic_n start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT root of 11-1- 1. McClintock examined many other polynomials, on a case by case basis; the analysis in the previous subsection gives a general expression for the roots of an arbitrary polynomial. McClintock’s solutions are multiparameter Fuss–Catalan series. McClintock noted that his series had finite radii of convergence but did not derive a general expression for the radius of convergence. McClintock also noted the use of the Lagrange inversion theorem in his derivations.

4.3 Comment on Mellin’s solution

Mellin derived a series solution for the following algebraic equation [33]

zn+x1zn1+x2zn2++xpznp1=0.superscript𝑧𝑛subscript𝑥1superscript𝑧subscript𝑛1subscript𝑥2superscript𝑧subscript𝑛2subscript𝑥𝑝superscript𝑧subscript𝑛𝑝10z^{n}+x_{1}z^{n_{1}}+x_{2}z^{n_{2}}+\cdots+x_{p}z^{n_{p}}-1=0\,.italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + ⋯ + italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - 1 = 0 . (4.12)

Here n>ns1𝑛subscript𝑛𝑠1n>n_{s}\geq 1italic_n > italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ≥ 1, s=1,,p𝑠1𝑝s=1,\dots,pitalic_s = 1 , … , italic_p (see [33]). Hence all the coefficients xssubscript𝑥𝑠x_{s}italic_x start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT in eq. (4.12) are nonzero by definition. Mellin derived a series solution for the ‘Hauptlösung’ or principal root, which is the unique branch which equals 1 for x1==xp=0subscript𝑥1subscript𝑥𝑝0x_{1}=\cdots=x_{p}=0italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ⋯ = italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = 0, and where α𝛼\alphaitalic_α is a positive number [33]

zα=1+αk=1(1)knkν1++νp=kμ=1k1(α+n1ν1+npνpnμ)Γ(ν1+1)Γ(ν2+1)Γ(νp+1)x1ν1xpνp.superscript𝑧𝛼1𝛼superscriptsubscript𝑘1superscript1𝑘superscript𝑛𝑘subscriptsubscript𝜈1subscript𝜈𝑝𝑘superscriptsubscriptproduct𝜇1𝑘1𝛼subscript𝑛1subscript𝜈1subscript𝑛𝑝subscript𝜈𝑝𝑛𝜇Γsubscript𝜈11Γsubscript𝜈21Γsubscript𝜈𝑝1superscriptsubscript𝑥1subscript𝜈1superscriptsubscript𝑥𝑝subscript𝜈𝑝z^{\alpha}=1+\alpha\sum_{k=1}^{\infty}\frac{(-1)^{k}}{n^{k}}\sum_{\nu_{1}+% \cdots+\nu_{p}=k}\frac{\prod_{\mu=1}^{k-1}(\alpha+n_{1}\nu_{1}+\cdots n_{p}\nu% _{p}-n\mu)}{\Gamma(\nu_{1}+1)\Gamma(\nu_{2}+1)\cdots\Gamma(\nu_{p}+1)}\,x_{1}^% {\nu_{1}}\cdots x_{p}^{\nu_{p}}\,.italic_z start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT = 1 + italic_α ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG ( - 1 ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_n start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_ν start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_k end_POSTSUBSCRIPT divide start_ARG ∏ start_POSTSUBSCRIPT italic_μ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT ( italic_α + italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - italic_n italic_μ ) end_ARG start_ARG roman_Γ ( italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 ) roman_Γ ( italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + 1 ) ⋯ roman_Γ ( italic_ν start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + 1 ) end_ARG italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋯ italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (4.13)

It is easily verified that this equals the Fuss–Catalan series with k=p𝑘𝑝k=pitalic_k = italic_p, r=α/n𝑟𝛼𝑛r=\alpha/nitalic_r = italic_α / italic_n, μj=nj/nsubscript𝜇𝑗subscript𝑛𝑗𝑛\mu_{j}=n_{j}/nitalic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_n start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT / italic_n, tj=νjsubscript𝑡𝑗subscript𝜈𝑗t_{j}=\nu_{j}italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_ν start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and zj=xjsubscript𝑧𝑗subscript𝑥𝑗z_{j}=-x_{j}italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = - italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, for j=1,,p𝑗1𝑝j=1,\dots,pitalic_j = 1 , … , italic_p, so

zα=((n1n,,npn);αn;(x1,,xp)).superscript𝑧𝛼subscript𝑛1𝑛subscript𝑛𝑝𝑛𝛼𝑛subscript𝑥1subscript𝑥𝑝z^{\alpha}=\mathcal{B}\Bigl{(}\Bigl{(}\frac{n_{1}}{n},\dots,\frac{n_{p}}{n}% \Bigr{)};\frac{\alpha}{n};(-x_{1},\dots,-x_{p})\Bigr{)}\,.italic_z start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT = caligraphic_B ( ( divide start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG , … , divide start_ARG italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG ) ; divide start_ARG italic_α end_ARG start_ARG italic_n end_ARG ; ( - italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , - italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ) . (4.14)

In fact α𝛼\alphaitalic_α is not constrained to be positive. Mellin also specified the following bound for the domain of convergence of his series; it is clearly sufficient but not always necessary [33]

|x1|,,|xp|<1pmin(1|μ1|μ1|1μ1|1μ1,1|μp|μp|1μp|1μp).subscript𝑥1subscript𝑥𝑝1𝑝1superscriptsubscript𝜇1subscript𝜇1superscript1subscript𝜇11subscript𝜇11superscriptsubscript𝜇𝑝subscript𝜇𝑝superscript1subscript𝜇𝑝1subscript𝜇𝑝|x_{1}|,\dots,|x_{p}|<\frac{1}{p}\min\biggl{(}\frac{1}{|\mu_{1}|^{\mu_{1}}|1-% \mu_{1}|^{1-\mu_{1}}}\,,\frac{1}{|\mu_{p}|^{\mu_{p}}|1-\mu_{p}|^{1-\mu_{p}}}% \biggr{)}\,.| italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , … , | italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | < divide start_ARG 1 end_ARG start_ARG italic_p end_ARG roman_min ( divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG , divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) . (4.15)

For ease of comparison with my work, I have written μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and μpsubscript𝜇𝑝\mu_{p}italic_μ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT on the right hand side. This is a more conservative bound and is superseded by eq. (3.5) or eq. (4.10).

4.4 Comment on Birkeland’s series

Birkeland published papers on the solutions of algebraic equations using hypergeometric series [6, 7, 8], culminating in his 1927 paper [9]. We study the latter paper (which largely subsumes his earlier work). Birkeland treated the general algebraic equation with complex coefficients [9, eq. (1)]

a0xn+aaxn1++an1x+an=0.subscript𝑎0superscript𝑥𝑛subscript𝑎𝑎superscript𝑥𝑛1subscript𝑎𝑛1𝑥subscript𝑎𝑛0a_{0}x^{n}+a_{a}x^{n-1}+\cdots+a_{n-1}x+a_{n}=0\,.italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT + italic_a start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT + ⋯ + italic_a start_POSTSUBSCRIPT italic_n - 1 end_POSTSUBSCRIPT italic_x + italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0 . (4.16)

Hence his indexing is the opposite of that in eq. (4.1). Birkeland also selected two integers, with p>q𝑝𝑞p>qitalic_p > italic_q, such that 0q<pn0𝑞𝑝𝑛0\leq q<p\leq n0 ≤ italic_q < italic_p ≤ italic_n. He then obtained the scaled equation [9, eq. (1superscript11^{\prime}1 start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT)] (see his paper for the definitions of z𝑧zitalic_z, lisubscript𝑙𝑖l_{i}italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT)

zp=zq+l1zm1+l2zm2++lszms.superscript𝑧𝑝superscript𝑧𝑞subscript𝑙1superscript𝑧subscript𝑚1subscript𝑙2superscript𝑧subscript𝑚2subscript𝑙𝑠superscript𝑧subscript𝑚𝑠z^{p}=z^{q}+l_{1}z^{m_{1}}+l_{2}z^{m_{2}}+\cdots+l_{s}z^{m_{s}}\,.italic_z start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT = italic_z start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + ⋯ + italic_l start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (4.17)

This is very similar to eq. (4.4). Birkeland then derived a series solution of eq. (4.17). First define ε𝜀\varepsilonitalic_ε as a primitive root of unity satisfying the equation xpq=1superscript𝑥𝑝𝑞1x^{p-q}=1italic_x start_POSTSUPERSCRIPT italic_p - italic_q end_POSTSUPERSCRIPT = 1. Then Birkeland obtained for the root zjsubscript𝑧𝑗z_{j}italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT (raised to the power γ𝛾\gammaitalic_γ), where j=1,2,,pq𝑗12𝑝𝑞j=1,2,\dots,p-qitalic_j = 1 , 2 , … , italic_p - italic_q [9, eq. (5)]

zjγ=εjγ[ 1+γpqα1,,αs=0εjv(τ,r1)α1!α2!αs!l1α1l2α2lsαs].superscriptsubscript𝑧𝑗𝛾superscript𝜀𝑗𝛾delimited-[]1𝛾𝑝𝑞superscriptsubscriptsubscript𝛼1subscript𝛼𝑠0superscript𝜀𝑗𝑣𝜏𝑟1subscript𝛼1subscript𝛼2subscript𝛼𝑠superscriptsubscript𝑙1subscript𝛼1superscriptsubscript𝑙2subscript𝛼2superscriptsubscript𝑙𝑠subscript𝛼𝑠z_{j}^{\gamma}=\varepsilon^{j\gamma}\biggl{[}\,1+\frac{\gamma}{p-q}\sum_{% \alpha_{1},\dots,\alpha_{s}=0}^{\infty}\varepsilon^{jv}\,\frac{(\tau,r-1)}{% \alpha_{1}!\alpha_{2}!\dots\alpha_{s}!}\,l_{1}^{\alpha_{1}}l_{2}^{\alpha_{2}}% \dots l_{s}^{\alpha_{s}}\,\biggr{]}\,.italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_γ end_POSTSUPERSCRIPT = italic_ε start_POSTSUPERSCRIPT italic_j italic_γ end_POSTSUPERSCRIPT [ 1 + divide start_ARG italic_γ end_ARG start_ARG italic_p - italic_q end_ARG ∑ start_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_α start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_ε start_POSTSUPERSCRIPT italic_j italic_v end_POSTSUPERSCRIPT divide start_ARG ( italic_τ , italic_r - 1 ) end_ARG start_ARG italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ! … italic_α start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ! end_ARG italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT … italic_l start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] . (4.18)

(N.B.: I changed i𝑖iitalic_i to j𝑗jitalic_j in Birkeland’s equation to avoid confusion with i=1𝑖1i=\sqrt{-1}italic_i = square-root start_ARG - 1 end_ARG.) With elementary changes of notation, eq. (4.18) is equivalent to the first line of eq. (4.8). See in particular [9, eq. (4)] for his definitions of τ𝜏\tauitalic_τ and v𝑣vitalic_v. Birkeland did not recognize his series coefficients as Fuss–Catalan numbers. Birkeland then expressed the series solution in eq. (4.18) in terms of sums of hypergeometric series. It would take us too far afield to discuss hypergeometric series in this paper. Birkeland derived the same convergence criteria as in Sec. 3. He derived the necessary (but not sufficient) bound [9, unnumbered, §2]

|ζ1|<1,|ζ2|<1,,|ζs|<1.formulae-sequencesubscript𝜁11formulae-sequencesubscript𝜁21subscript𝜁𝑠1|\zeta_{1}|<1\,,\quad|\zeta_{2}|<1\,,\quad\dots,\quad|\zeta_{s}|<1\,.| italic_ζ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | < 1 , | italic_ζ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | < 1 , … , | italic_ζ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT | < 1 . (4.19)

This matches eq. (4.9), after working through the details of his notation: his ζjsubscript𝜁𝑗\zeta_{j}italic_ζ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT equals my bj|μj|μj|1μj|1μjsubscript𝑏𝑗superscriptsubscript𝜇𝑗subscript𝜇𝑗superscript1subscript𝜇𝑗1subscript𝜇𝑗b_{j}|\mu_{j}|^{\mu_{j}}|1-\mu_{j}|^{1-\mu_{j}}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, with obvious allowances for differences in his indexing. Birkeland did not recognize that one can write ‘\leq’ instead of strict inequalities ‘<<<’ in the bound. As for the sufficient (but not necessary) bound, Birkeland obtained [9, eq. 12]

|l1|+|l2|++|ls|<pqm+pq1(1+pqm)mpq.subscript𝑙1subscript𝑙2subscript𝑙𝑠𝑝𝑞𝑚𝑝𝑞1superscript1𝑝𝑞𝑚𝑚𝑝𝑞|l_{1}|+|l_{2}|+\cdots+|l_{s}|<\frac{p-q}{m+p-q}\frac{1}{\displaystyle\Bigl{(}% 1+\frac{p-q}{m}\Bigr{)}^{\frac{m}{p-q}}}\,.| italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | + | italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | + ⋯ + | italic_l start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT | < divide start_ARG italic_p - italic_q end_ARG start_ARG italic_m + italic_p - italic_q end_ARG divide start_ARG 1 end_ARG start_ARG ( 1 + divide start_ARG italic_p - italic_q end_ARG start_ARG italic_m end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_m end_ARG start_ARG italic_p - italic_q end_ARG end_POSTSUPERSCRIPT end_ARG . (4.20)

This is clearly similar to eq. (4.10). From eq. (4.17), Birkeland’s ljsubscript𝑙𝑗l_{j}italic_l start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are my bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, and on the right hand side of eq. (4.20), he wrote out the bound explicitly in terms of integers m𝑚mitalic_m, p𝑝pitalic_p and q𝑞qitalic_q. Contrary to Mellin [33] (see eq. (4.15)) and myself (see eq. (4.10)), Birkeland did not write a ‘min’ of two possible choices for the best bound. Here Birkeland made an error of algebra: Birkeland defined m𝑚mitalic_m in eq. (4.20) as the value of mνsubscript𝑚𝜈m_{\nu}italic_m start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT which maximizes the value of |mνp|subscript𝑚𝜈𝑝|m_{\nu}-p|| italic_m start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - italic_p |. Quoting from [9], “Wir wollen mit m𝑚mitalic_m die größte der Zahlen |mνp|subscript𝑚𝜈𝑝|m_{\nu}-p|| italic_m start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - italic_p | …” However, as was seen in Sec. 3 for the parameter μsubscript𝜇\mu_{*}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT, we must choose μsubscript𝜇\mu_{*}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT to be the value of μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT which maximizes the value of |μj12|subscript𝜇𝑗12|\mu_{j}-\frac{1}{2}|| italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG |. Working through Birkeland’s notation, we must choose m𝑚mitalic_m to maximize the value of |mν(p+q)/2|subscript𝑚𝜈𝑝𝑞2|m_{\nu}-(p+q)/2|| italic_m start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - ( italic_p + italic_q ) / 2 |. Birkeland then applied his formalism to derive the solution of the trinomial, which he had also treated in an earlier paper [6]. The trinomial is sufficiently important that it will be studied in a section of its own in Sec. 6.

4.5 Comment on Lewis’s series

In 1939, Lewis [31] published a paper on the solution of algebraic equations by infinite series. He treated the trinomial, then the quadrinomial and finally general multinomial equations. We discuss only the general case here. Lewis treated the general algebraic equation with complex coefficients [31, eq. (39)]

anznakzkagzgabzba0=0.subscript𝑎𝑛superscript𝑧𝑛subscript𝑎𝑘superscript𝑧𝑘subscript𝑎𝑔superscript𝑧𝑔subscript𝑎𝑏superscript𝑧𝑏subscript𝑎00a_{n}z^{n}-a_{k}z^{k}-a_{g}z^{g}-\cdots-a_{b}z^{b}-a_{0}=0\,.italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT - italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - italic_a start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_g end_POSTSUPERSCRIPT - ⋯ - italic_a start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT - italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 . (4.21)

(The above corrects a misprint in [31].) The notation suggests that all the coefficients are nonzero. Lewis treated only the case we denoted above by p=0𝑝0p=0italic_p = 0 and q=n𝑞𝑛q=nitalic_q = italic_n. He wrote [31, eq. (40)]

zn=a0/an+(1/an)(akzk+agzg++abzb).superscript𝑧𝑛subscript𝑎0subscript𝑎𝑛1subscript𝑎𝑛subscript𝑎𝑘superscript𝑧𝑘subscript𝑎𝑔superscript𝑧𝑔subscript𝑎𝑏superscript𝑧𝑏z^{n}=a_{0}/a_{n}+(1/a_{n})(a_{k}z^{k}+a_{g}z^{g}+\cdots+a_{b}z^{b})\,.italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + ( 1 / italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ( italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT + italic_a start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_g end_POSTSUPERSCRIPT + ⋯ + italic_a start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ) . (4.22)

Lewis employed Lagrange inversion to derive his solution. Following Lewis, we write a0/an=reiθsubscript𝑎0subscript𝑎𝑛𝑟superscript𝑒𝑖𝜃a_{0}/a_{n}=re^{i\theta}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_r italic_e start_POSTSUPERSCRIPT italic_i italic_θ end_POSTSUPERSCRIPT and the n𝑛nitalic_n roots of anzna0=0subscript𝑎𝑛superscript𝑧𝑛subscript𝑎00a_{n}z^{n}-a_{0}=0italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT - italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 are denoted by αh=r1/nei(2hπ+θ)/nsubscript𝛼superscript𝑟1𝑛superscript𝑒𝑖2𝜋𝜃𝑛\alpha_{h}=r^{1/n}e^{i(2h\pi+\theta)/n}italic_α start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = italic_r start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_i ( 2 italic_h italic_π + italic_θ ) / italic_n end_POSTSUPERSCRIPT, where h=1,,n1𝑛h=1,\dots,nitalic_h = 1 , … , italic_n. The solution for the root zhsubscript𝑧z_{h}italic_z start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is given as [31, eq. (41)]

zh=p,q,,v=0akpagqabvp!q!v!(a0n)p+q++v(1+pk++vbnp+q++v1)αh1+pk++vb(h=1,,n).subscript𝑧superscriptsubscript𝑝𝑞𝑣0superscriptsubscript𝑎𝑘𝑝superscriptsubscript𝑎𝑔𝑞superscriptsubscript𝑎𝑏𝑣𝑝𝑞𝑣superscriptsubscript𝑎0𝑛𝑝𝑞𝑣binomial1𝑝𝑘𝑣𝑏𝑛𝑝𝑞𝑣1superscriptsubscript𝛼1𝑝𝑘𝑣𝑏1𝑛z_{h}=\sum_{p,q,\dots,v=0}^{\infty}\frac{a_{k}^{p}a_{g}^{q}\dots a_{b}^{v}}{p!% q!\dots v!(a_{0}n)^{p+q+\cdots+v}}\binom{1+pk+\cdots+vb-n}{p+q+\cdots+v-1}\,% \alpha_{h}^{1+pk+\cdots+vb}\qquad(h=1,\dots,n)\,.italic_z start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_p , italic_q , … , italic_v = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT … italic_a start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT end_ARG start_ARG italic_p ! italic_q ! … italic_v ! ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_n ) start_POSTSUPERSCRIPT italic_p + italic_q + ⋯ + italic_v end_POSTSUPERSCRIPT end_ARG ( FRACOP start_ARG 1 + italic_p italic_k + ⋯ + italic_v italic_b - italic_n end_ARG start_ARG italic_p + italic_q + ⋯ + italic_v - 1 end_ARG ) italic_α start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 + italic_p italic_k + ⋯ + italic_v italic_b end_POSTSUPERSCRIPT ( italic_h = 1 , … , italic_n ) . (4.23)

Lewis did not derive a series for powers of the roots. With some effort, eq. (4.23) can be equated to the solution in eq. (4.8). Lewis derived the following sufficient condition for absolute convergence [31, eq. (42)]

{|akαhk|++|abαhb||a0|}nnnkk(nk)nk.superscriptsubscript𝑎𝑘superscriptsubscript𝛼𝑘subscript𝑎𝑏superscriptsubscript𝛼𝑏subscript𝑎0𝑛superscript𝑛𝑛superscript𝑘𝑘superscript𝑛𝑘𝑛𝑘\biggl{\{}\frac{|a_{k}\alpha_{h}^{k}|+\cdots+|a_{b}\alpha_{h}^{b}|}{|a_{0}|}% \biggr{\}}^{n}\leq\frac{n^{n}}{k^{k}(n-k)^{n-k}}\,.{ divide start_ARG | italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT | + ⋯ + | italic_a start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT | end_ARG start_ARG | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | end_ARG } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ≤ divide start_ARG italic_n start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_k start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( italic_n - italic_k ) start_POSTSUPERSCRIPT italic_n - italic_k end_POSTSUPERSCRIPT end_ARG . (4.24)

Unlike Mellin [33] and Birkeland [9], Lewis [31] recognized that equality ‘\leq’ is permitted in the bound. However, like Birkeland, Lewis failed to recognize that the bound on the right hand side is given by the minimum of multiple possibilities, and the expression he derived is not always the correct choice.

4.6 Comment on Raney’s series

Raney [37] employed his formalism to demonstrate the use of Lagrange inversion for a power series z¯=n=0anx¯n¯𝑧superscriptsubscript𝑛0subscript𝑎𝑛superscript¯𝑥𝑛\bar{z}=\sum_{n=0}^{\infty}a_{n}\bar{x}^{n}over¯ start_ARG italic_z end_ARG = ∑ start_POSTSUBSCRIPT italic_n = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT over¯ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT [37, eqs. (5,4), (5,7) and (5.8)]. Then in [37, Sec. 6], Raney applied his formalism to derive series solutions for algebraic equations. As mentioned earlier, [37, eqs. (6.1) and (6.2)] yield eq. (2.14). Raney took the μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT to be integers; this yields an algebraic equation. Raney displayed the example of the trinomial w¯=1+x¯w¯n¯𝑤1¯𝑥superscript¯𝑤𝑛\bar{w}=1+\bar{x}\bar{w}^{n}over¯ start_ARG italic_w end_ARG = 1 + over¯ start_ARG italic_x end_ARG over¯ start_ARG italic_w end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT [37, eq. (6.3)], with the series solution [37, eq. (6.4)]

w¯=k=011+(n1)k(nkk)x¯k.¯𝑤superscriptsubscript𝑘011𝑛1𝑘binomial𝑛𝑘𝑘superscript¯𝑥𝑘\bar{w}=\sum_{k=0}^{\infty}\frac{1}{1+(n-1)k}\binom{nk}{k}\bar{x}^{k}\,.over¯ start_ARG italic_w end_ARG = ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 1 + ( italic_n - 1 ) italic_k end_ARG ( FRACOP start_ARG italic_n italic_k end_ARG start_ARG italic_k end_ARG ) over¯ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT . (4.25)

This matches the series coefficients in eq. (1.2), replacing m𝑚mitalic_m by n𝑛nitalic_n and t𝑡titalic_t by k𝑘kitalic_k. Raney did not discuss questions of convergence. Unlike the other authors cited earlier in this section, Raney took the coefficients in his equations to be elements in a commutative ring, not just complex numbers.

5 Quintic

The quintic is sufficiently important that it is placed in a separate section. It is known that by means of a Tschirnhaus transformation, a general quintic may be brought to the Bring-Jerrard normal form

x5x+γ=0.superscript𝑥5𝑥𝛾0x^{5}-x+\gamma=0\,.italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT - italic_x + italic_γ = 0 . (5.1)

This algebraic equation (of degree n=5𝑛5n=5italic_n = 5) lends itself naturally to a solution using Fuss–Catalan series. Using the formalism in Sec. 5, we set p=0𝑝0p=0italic_p = 0 and q=5𝑞5q=5italic_q = 5. From eq. (4.7), x=eiπ(2+1)/5γ1/5ζ1/5subscript𝑥superscript𝑒𝑖𝜋215superscript𝛾15superscriptsubscript𝜁15x_{\ell}=e^{i\pi(2\ell+1)/5}\gamma^{1/5}\zeta_{\ell}^{1/5}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / 5 end_POSTSUPERSCRIPT italic_γ start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT with =0,,404\ell=0,\dots,4roman_ℓ = 0 , … , 4. Then ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT satisfies the equation

ζ=1eiπ(2+1)/5γ4/5ζ1/5.subscript𝜁1superscript𝑒𝑖𝜋215superscript𝛾45superscriptsubscript𝜁15\zeta_{\ell}=1-\frac{e^{i\pi(2\ell+1)/5}}{\gamma^{4/5}}\,\zeta_{\ell}^{1/5}\,.italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 - divide start_ARG italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / 5 end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT end_ARG italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT . (5.2)

There is only one summand, so k=1𝑘1k=1italic_k = 1 and μ=1/5𝜇15\mu=1/5italic_μ = 1 / 5. The roots xsubscript𝑥x_{\ell}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT are given by

x=eiπ(2+1)/5γ1/5(15;15;eiπ(2+1)/5γ4/5).subscript𝑥superscript𝑒𝑖𝜋215superscript𝛾151515superscript𝑒𝑖𝜋215superscript𝛾45\begin{split}x_{\ell}=e^{i\pi(2\ell+1)/5}\gamma^{1/5}\mathcal{B}\Bigl{(}\frac{% 1}{5};\,\frac{1}{5};\,-\frac{e^{i\pi(2\ell+1)/5}}{\gamma^{4/5}}\Bigr{)}\,.\end% {split}start_ROW start_CELL italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / 5 end_POSTSUPERSCRIPT italic_γ start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT caligraphic_B ( divide start_ARG 1 end_ARG start_ARG 5 end_ARG ; divide start_ARG 1 end_ARG start_ARG 5 end_ARG ; - divide start_ARG italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / 5 end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT end_ARG ) . end_CELL end_ROW (5.3)

From Corollary 3.4, the condition for convergence is necessary and sufficient:

1|γ|4/51(15)1/5(45)4/5=544/5,|γ|455/40.534992.formulae-sequence1superscript𝛾451superscript1515superscript45455superscript445𝛾4superscript554similar-to-or-equals0.534992\frac{1}{|\gamma|^{4/5}}\leq\frac{1}{(\frac{1}{5})^{1/5}(\frac{4}{5})^{4/5}}=% \frac{5}{4^{4/5}}\,,\qquad|\gamma|\geq\frac{4}{5^{5/4}}\simeq 0.534992\,.divide start_ARG 1 end_ARG start_ARG | italic_γ | start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT end_ARG ≤ divide start_ARG 1 end_ARG start_ARG ( divide start_ARG 1 end_ARG start_ARG 5 end_ARG ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ( divide start_ARG 4 end_ARG start_ARG 5 end_ARG ) start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT end_ARG = divide start_ARG 5 end_ARG start_ARG 4 start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT end_ARG , | italic_γ | ≥ divide start_ARG 4 end_ARG start_ARG 5 start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT end_ARG ≃ 0.534992 . (5.4)

We can say more. What if the value of γ𝛾\gammaitalic_γ does not satisfy the above bound? There are alternative series we can derive. Rewrite eq. (5.1) as x=γ+x5𝑥𝛾superscript𝑥5x=\gamma+x^{5}italic_x = italic_γ + italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT and set ζ=x/γ𝜁𝑥𝛾\zeta=x/\gammaitalic_ζ = italic_x / italic_γ. This corresponds to p=0𝑝0p=0italic_p = 0 and q=1𝑞1q=1italic_q = 1. Then ζ𝜁\zetaitalic_ζ satisfies ζ=1+γ4ζ5𝜁1superscript𝛾4superscript𝜁5\zeta=1+\gamma^{4}\zeta^{5}italic_ζ = 1 + italic_γ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_ζ start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT. Once again k=1𝑘1k=1italic_k = 1 and now z1=γ4subscript𝑧1superscript𝛾4z_{1}=\gamma^{4}italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_γ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT and μ=5𝜇5\mu=5italic_μ = 5. There is only one root and it is

x=γζ=γ(5;1;γ4).𝑥𝛾𝜁𝛾51superscript𝛾4x=\gamma\zeta=\gamma\mathcal{B}(5;1;\gamma^{4})\,.italic_x = italic_γ italic_ζ = italic_γ caligraphic_B ( 5 ; 1 ; italic_γ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) . (5.5)

Comment: eq. (5.5) is equivalent to the solution published by Eisenstein [17] (see also [42], in English). Eisenstein expressed the Bring-Jerrard normal form differently and solved the equation x5+x+γ=0superscript𝑥5𝑥𝛾0x^{5}+x+\gamma=0italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT + italic_x + italic_γ = 0. The necessary and sufficient condition for convergence is

|γ|415544=4455,|γ|455/4.formulae-sequencesuperscript𝛾41superscript55superscript44superscript44superscript55𝛾4superscript554|\gamma|^{4}\leq\frac{1}{5^{5}4^{-4}}=\frac{4^{4}}{5^{5}}\,,\qquad|\gamma|\leq% \frac{4}{5^{5/4}}\,.| italic_γ | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 5 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT 4 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT end_ARG = divide start_ARG 4 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG start_ARG 5 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT end_ARG , | italic_γ | ≤ divide start_ARG 4 end_ARG start_ARG 5 start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT end_ARG . (5.6)

This is the inverse of the condition in eq. (5.4). However, we have found only one root. We obtain the other four roots as follows. We divide eq. (5.1) through by x𝑥xitalic_x to obtain x4=1γ/xsuperscript𝑥41𝛾𝑥x^{4}=1-\gamma/xitalic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT = 1 - italic_γ / italic_x. This corresponds to p=1𝑝1p=1italic_p = 1 and q=5𝑞5q=5italic_q = 5. Set ζ=x4𝜁superscript𝑥4\zeta=x^{4}italic_ζ = italic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT or x=ei2π/4ζ1/4𝑥superscript𝑒𝑖2𝜋4superscript𝜁14x=e^{i2\pi\ell/4}\zeta^{1/4}italic_x = italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π roman_ℓ / 4 end_POSTSUPERSCRIPT italic_ζ start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT, for l=0,,3𝑙03l=0,\dots,3italic_l = 0 , … , 3, so ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT satisfies

ζ=1γeiπ/2ζ1/4.subscript𝜁1𝛾superscript𝑒𝑖𝜋2superscriptsubscript𝜁14\zeta_{\ell}=1-\gamma e^{-i\pi\ell/2}\zeta_{\ell}^{-1/4}\,.italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 - italic_γ italic_e start_POSTSUPERSCRIPT - italic_i italic_π roman_ℓ / 2 end_POSTSUPERSCRIPT italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 / 4 end_POSTSUPERSCRIPT . (5.7)

Once again k=1𝑘1k=1italic_k = 1 and now z1=γeiπ/2subscript𝑧1𝛾superscript𝑒𝑖𝜋2z_{1}=-\gamma e^{-i\pi\ell/2}italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = - italic_γ italic_e start_POSTSUPERSCRIPT - italic_i italic_π roman_ℓ / 2 end_POSTSUPERSCRIPT and μ=14𝜇14\mu=-\frac{1}{4}italic_μ = - divide start_ARG 1 end_ARG start_ARG 4 end_ARG. There are four roots, indexed by =0,,303\ell=0,\dots,3roman_ℓ = 0 , … , 3

x=eiπ/2(14;14;eiπ/2γ).subscript𝑥superscript𝑒𝑖𝜋21414superscript𝑒𝑖𝜋2𝛾\begin{split}x_{\ell}=e^{i\pi\ell/2}\mathcal{B}\Bigl{(}-\frac{1}{4};\,\frac{1}% {4};\,-e^{-i\pi\ell/2}\gamma\Bigr{)}\,.\end{split}start_ROW start_CELL italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π roman_ℓ / 2 end_POSTSUPERSCRIPT caligraphic_B ( - divide start_ARG 1 end_ARG start_ARG 4 end_ARG ; divide start_ARG 1 end_ARG start_ARG 4 end_ARG ; - italic_e start_POSTSUPERSCRIPT - italic_i italic_π roman_ℓ / 2 end_POSTSUPERSCRIPT italic_γ ) . end_CELL end_ROW (5.8)

The necessary and sufficient condition for convergence is

|γ|1(14)1/5(54)5/4=455/4.𝛾1superscript1415superscript54544superscript554|\gamma|\leq\frac{1}{(\frac{1}{4})^{1/5}(\frac{5}{4})^{5/4}}=\frac{4}{5^{5/4}}\,.| italic_γ | ≤ divide start_ARG 1 end_ARG start_ARG ( divide start_ARG 1 end_ARG start_ARG 4 end_ARG ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ( divide start_ARG 5 end_ARG start_ARG 4 end_ARG ) start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT end_ARG = divide start_ARG 4 end_ARG start_ARG 5 start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT end_ARG . (5.9)

This is the same as eq. (5.6). As noted earlier for the case k=1𝑘1k=1italic_k = 1, all the series converge on their respective circles of convergence.

The series in eqs. (5.5) and (5.8) do not yield the same root. If we set γ=0𝛾0\gamma=0italic_γ = 0, eq. (5.1) reduces to x(x41)=0𝑥superscript𝑥410x(x^{4}-1)=0italic_x ( italic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT - 1 ) = 0. One root is zero and the others are the fourth roots of unity. The roots in the series in eqs. (5.5) and (5.8) lie on the branches which respectively approach zero and the fourth roots of unity. Hence the two series, taken together, yield all five roots of eq. (5.1).

There are multiple ways to find the roots of a polynomial using Fuss–Catalan series. The series in eq. (5.3) converges for |γ|4/55/4𝛾4superscript554|\gamma|\geq 4/5^{5/4}| italic_γ | ≥ 4 / 5 start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT whereas those in eqs. (5.5) and (5.8) converge for |γ|4/55/4𝛾4superscript554|\gamma|\leq 4/5^{5/4}| italic_γ | ≤ 4 / 5 start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT. In all cases one obtains convergent solutions for all the five roots of eq. (5.1), thence the general quintic.

The Bring-Jerrard normal form has been solved using hypergeometric functions, e.g. see [35]. Klein’s solution of the quintic [26] also employed hypergeometric series. The series have finite radii of convergence (actually the same radius for all the series). Analytic continuation is required to treat all coefficients of the general quintic. Using Fuss–Catalan series, the series in eqs. (5.5) and (5.8) are the explicit analytic continuations of the series in eq. (5.3) across the ‘boundary’ |γ|=4/55/4𝛾4superscript554|\gamma|=4/5^{5/4}| italic_γ | = 4 / 5 start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT. Together they cover the whole parameter space, i.e. all values of γ𝛾\gammaitalic_γ in eq. (5.1). The solution of the Bring-Jerrard normal form using Fuss–Catalan series is arguably ‘cleaner’ than that using hypergeometric series.

6 Trinomial

6.1 General solution

The trinomial is also sufficiently important that it is placed in a separate section. The Bring-Jerrard normal form of the quintic and Lambert’s trinomial, to be discussed in Sec. 6.2, are particular examples of the general trinomial equation

xm+n+axn+b=0.superscript𝑥𝑚𝑛𝑎superscript𝑥𝑛𝑏0x^{m+n}+ax^{n}+b=0\,.italic_x start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT + italic_a italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT + italic_b = 0 . (6.1)

The general solution of the trinomial, for arbitrary values of the coefficients, was derived by Birkeland (1920,1927) [6, 9], Lewis (1935) [31] and Eagle (1939) [15] (who employed McClintock’s [32] formalism). The general solution of the trinomial was also derived in 1908 by P. A. Lambert [30], not to be confused with Johann Lambert. P. A. Lambert in fact also presented a series solution for the general algebraic equation, but his analysis contained some technical errors and was not discussed in Sec. 4. The derivation below follows Eagle (1939) [15], and eq. (6.1) is taken from his paper. We have already noted that the solutions are Fuss–Catalan series. All four authors cited above derived correct expressions for the radii of convergence of their series. The derivation below may be considered as an independent validation of their results.

As was seen in Sec. 5, there are three series. To systematize the derivation, to give a more panoramic overview of the results, we proceed as follows. Here \ellroman_ℓ takes values as appropriate to index the roots of unity.

  • 1.

    Set p=0𝑝0p=0italic_p = 0 and q=m+n𝑞𝑚𝑛q=m+nitalic_q = italic_m + italic_n and x=eiπ(2+1)/(m+n)b1/(m+n)ζ1/(m+n)subscript𝑥superscript𝑒𝑖𝜋21𝑚𝑛superscript𝑏1𝑚𝑛superscriptsubscript𝜁1𝑚𝑛x_{\ell}=e^{i\pi(2\ell+1)/(m+n)}b^{1/(m+n)}\zeta_{\ell}^{1/(m+n)}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / ( italic_m + italic_n ) end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT 1 / ( italic_m + italic_n ) end_POSTSUPERSCRIPT italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / ( italic_m + italic_n ) end_POSTSUPERSCRIPT then ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT satisfies

    ζ=1+eiπ(2+1)n/(m+n)abm/(m+n)ζn/(m+n).subscript𝜁1superscript𝑒𝑖𝜋21𝑛𝑚𝑛𝑎superscript𝑏𝑚𝑚𝑛superscriptsubscript𝜁𝑛𝑚𝑛\zeta_{\ell}=1+e^{i\pi(2\ell+1)n/(m+n)}\frac{a}{b^{m/(m+n)}}\zeta_{\ell}^{n/(m% +n)}\,.italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 + italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) italic_n / ( italic_m + italic_n ) end_POSTSUPERSCRIPT divide start_ARG italic_a end_ARG start_ARG italic_b start_POSTSUPERSCRIPT italic_m / ( italic_m + italic_n ) end_POSTSUPERSCRIPT end_ARG italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n / ( italic_m + italic_n ) end_POSTSUPERSCRIPT . (6.2)

    Then μ=n/(m+n)𝜇𝑛𝑚𝑛\mu=n/(m+n)italic_μ = italic_n / ( italic_m + italic_n ). This series yields m+n𝑚𝑛m+nitalic_m + italic_n roots.

  • 2.

    Set p=0𝑝0p=0italic_p = 0 and q=n𝑞𝑛q=nitalic_q = italic_n and x=eiπ(2+1)/n(bζ/a)1/nsubscript𝑥superscript𝑒𝑖𝜋21𝑛superscript𝑏subscript𝜁𝑎1𝑛x_{\ell}=e^{i\pi(2\ell+1)/n}(b\zeta_{\ell}/a)^{1/n}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_n end_POSTSUPERSCRIPT ( italic_b italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT / italic_a ) start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT then ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT satisfies

    ζ=1+eiπ(2+1)(m+n)/nbm/na(m+n)/nζ(m+n)/n.subscript𝜁1superscript𝑒𝑖𝜋21𝑚𝑛𝑛superscript𝑏𝑚𝑛superscript𝑎𝑚𝑛𝑛superscriptsubscript𝜁𝑚𝑛𝑛\zeta_{\ell}=1+e^{i\pi(2\ell+1)(m+n)/n}\frac{b^{m/n}}{a^{(m+n)/n}}\zeta_{\ell}% ^{(m+n)/n}\,.italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 + italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT divide start_ARG italic_b start_POSTSUPERSCRIPT italic_m / italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT end_ARG italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT . (6.3)

    Then μ=(m+n)/n𝜇𝑚𝑛𝑛\mu=(m+n)/nitalic_μ = ( italic_m + italic_n ) / italic_n. This series yields n𝑛nitalic_n roots.

  • 3.

    Set p=n𝑝𝑛p=nitalic_p = italic_n and q=m+n𝑞𝑚𝑛q=m+nitalic_q = italic_m + italic_n and divide eq. (6.1) through by xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Set x=eiπ(2+1)/m(aζ)1/msubscript𝑥superscript𝑒𝑖𝜋21𝑚superscript𝑎subscript𝜁1𝑚x_{\ell}=e^{i\pi(2\ell+1)/m}(a\zeta_{\ell})^{1/m}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_m end_POSTSUPERSCRIPT ( italic_a italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / italic_m end_POSTSUPERSCRIPT then ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT satisfies

    ζ=1+eiπ(2+1)n/mba(m+n)/mζn/m.subscript𝜁1superscript𝑒𝑖𝜋21𝑛𝑚𝑏superscript𝑎𝑚𝑛𝑚superscriptsubscript𝜁𝑛𝑚\zeta_{\ell}=1+e^{-i\pi(2\ell+1)n/m}\frac{b}{a^{(m+n)/m}}\zeta_{\ell}^{-n/m}\,.italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 + italic_e start_POSTSUPERSCRIPT - italic_i italic_π ( 2 roman_ℓ + 1 ) italic_n / italic_m end_POSTSUPERSCRIPT divide start_ARG italic_b end_ARG start_ARG italic_a start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_m end_POSTSUPERSCRIPT end_ARG italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_n / italic_m end_POSTSUPERSCRIPT . (6.4)

    Then μ=n/m𝜇𝑛𝑚\mu=-n/mitalic_μ = - italic_n / italic_m. This series yields m𝑚mitalic_m roots.

The respective series solutions are

xsubscript𝑥\displaystyle x_{\ell}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT =eiπ(2+1)/(m+n)b1/(m+n)(nm+n;1m+n;eiπ(2+1)nm+nabm/(m+n))absentsuperscript𝑒𝑖𝜋21𝑚𝑛superscript𝑏1𝑚𝑛𝑛𝑚𝑛1𝑚𝑛superscript𝑒𝑖𝜋21𝑛𝑚𝑛𝑎superscript𝑏𝑚𝑚𝑛\displaystyle=e^{i\pi(2\ell+1)/(m+n)}b^{1/(m+n)}\mathcal{B}\Bigl{(}\frac{n}{m+% n};\,\frac{1}{m+n};\,e^{i\pi\frac{(2\ell+1)n}{m+n}}\frac{a}{b^{m/(m+n)}}\Bigr{)}= italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / ( italic_m + italic_n ) end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT 1 / ( italic_m + italic_n ) end_POSTSUPERSCRIPT caligraphic_B ( divide start_ARG italic_n end_ARG start_ARG italic_m + italic_n end_ARG ; divide start_ARG 1 end_ARG start_ARG italic_m + italic_n end_ARG ; italic_e start_POSTSUPERSCRIPT italic_i italic_π divide start_ARG ( 2 roman_ℓ + 1 ) italic_n end_ARG start_ARG italic_m + italic_n end_ARG end_POSTSUPERSCRIPT divide start_ARG italic_a end_ARG start_ARG italic_b start_POSTSUPERSCRIPT italic_m / ( italic_m + italic_n ) end_POSTSUPERSCRIPT end_ARG ) (0m+n1),0𝑚𝑛1\displaystyle(0\leq\ell\leq m+n-1)\,,( 0 ≤ roman_ℓ ≤ italic_m + italic_n - 1 ) , (6.5a)
xsubscript𝑥\displaystyle x_{\ell}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT =eiπ(2+1)/nb1/na1/n(m+nn;1n;eiπ(2+1)(m+n)/nbm/na(m+n)/n)absentsuperscript𝑒𝑖𝜋21𝑛superscript𝑏1𝑛superscript𝑎1𝑛𝑚𝑛𝑛1𝑛superscript𝑒𝑖𝜋21𝑚𝑛𝑛superscript𝑏𝑚𝑛superscript𝑎𝑚𝑛𝑛\displaystyle=e^{i\pi(2\ell+1)/n}\frac{b^{1/n}}{a^{1/n}}\mathcal{B}\Bigl{(}% \frac{m+n}{n};\,\frac{1}{n};\,e^{i\pi(2\ell+1)(m+n)/n}\frac{b^{m/n}}{a^{(m+n)/% n}}\Bigr{)}= italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_n end_POSTSUPERSCRIPT divide start_ARG italic_b start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT end_ARG caligraphic_B ( divide start_ARG italic_m + italic_n end_ARG start_ARG italic_n end_ARG ; divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ; italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT divide start_ARG italic_b start_POSTSUPERSCRIPT italic_m / italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT end_ARG ) (0n1),0𝑛1\displaystyle(0\leq\ell\leq n-1)\,,( 0 ≤ roman_ℓ ≤ italic_n - 1 ) , (6.5b)
xsubscript𝑥\displaystyle x_{\ell}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT =eiπ(2+1)/ma1/m(nm;1m;eiπ(2+1)n/mba(m+n)/m)absentsuperscript𝑒𝑖𝜋21𝑚superscript𝑎1𝑚𝑛𝑚1𝑚superscript𝑒𝑖𝜋21𝑛𝑚𝑏superscript𝑎𝑚𝑛𝑚\displaystyle=e^{i\pi(2\ell+1)/m}a^{1/m}\mathcal{B}\Bigl{(}-\frac{n}{m};\,% \frac{1}{m};\,e^{-i\pi(2\ell+1)n/m}\frac{b}{a^{(m+n)/m}}\Bigr{)}= italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_m end_POSTSUPERSCRIPT italic_a start_POSTSUPERSCRIPT 1 / italic_m end_POSTSUPERSCRIPT caligraphic_B ( - divide start_ARG italic_n end_ARG start_ARG italic_m end_ARG ; divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ; italic_e start_POSTSUPERSCRIPT - italic_i italic_π ( 2 roman_ℓ + 1 ) italic_n / italic_m end_POSTSUPERSCRIPT divide start_ARG italic_b end_ARG start_ARG italic_a start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_m end_POSTSUPERSCRIPT end_ARG ) (0m1).0𝑚1\displaystyle(0\leq\ell\leq m-1)\,.( 0 ≤ roman_ℓ ≤ italic_m - 1 ) . (6.5c)

The respective domains of convergence are as follows.

|a||b|m/(m+n)𝑎superscript𝑏𝑚𝑚𝑛\displaystyle\frac{|a|}{|b|^{m/(m+n)}}divide start_ARG | italic_a | end_ARG start_ARG | italic_b | start_POSTSUPERSCRIPT italic_m / ( italic_m + italic_n ) end_POSTSUPERSCRIPT end_ARG (m+n)nn/(m+n)mm/(m+n),absent𝑚𝑛superscript𝑛𝑛𝑚𝑛superscript𝑚𝑚𝑚𝑛\displaystyle\leq\frac{(m+n)}{n^{n/(m+n)}m^{m/(m+n)}}\,,≤ divide start_ARG ( italic_m + italic_n ) end_ARG start_ARG italic_n start_POSTSUPERSCRIPT italic_n / ( italic_m + italic_n ) end_POSTSUPERSCRIPT italic_m start_POSTSUPERSCRIPT italic_m / ( italic_m + italic_n ) end_POSTSUPERSCRIPT end_ARG , |b|m|a|m+nsuperscript𝑏𝑚superscript𝑎𝑚𝑛\displaystyle\qquad\frac{|b|^{m}}{|a|^{m+n}}divide start_ARG | italic_b | start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_ARG start_ARG | italic_a | start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG mmnn(m+n)m+n,absentsuperscript𝑚𝑚superscript𝑛𝑛superscript𝑚𝑛𝑚𝑛\displaystyle\geq\frac{m^{m}n^{n}}{(m+n)^{m+n}}\,,≥ divide start_ARG italic_m start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_m + italic_n ) start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG , (6.6a)
|bm/n||a|(m+n)/nsuperscript𝑏𝑚𝑛superscript𝑎𝑚𝑛𝑛\displaystyle\frac{|b^{m/n}|}{|a|^{(m+n)/n}}divide start_ARG | italic_b start_POSTSUPERSCRIPT italic_m / italic_n end_POSTSUPERSCRIPT | end_ARG start_ARG | italic_a | start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT end_ARG n(m+n)mm/n,absent𝑛𝑚𝑛superscript𝑚𝑚𝑛\displaystyle\leq\frac{n}{(m+n)m^{-m/n}}\,,≤ divide start_ARG italic_n end_ARG start_ARG ( italic_m + italic_n ) italic_m start_POSTSUPERSCRIPT - italic_m / italic_n end_POSTSUPERSCRIPT end_ARG , |b|m|a|m+nsuperscript𝑏𝑚superscript𝑎𝑚𝑛\displaystyle\qquad\frac{|b|^{m}}{|a|^{m+n}}divide start_ARG | italic_b | start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_ARG start_ARG | italic_a | start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG mmnn(m+n)m+n,absentsuperscript𝑚𝑚superscript𝑛𝑛superscript𝑚𝑛𝑚𝑛\displaystyle\leq\frac{m^{m}n^{n}}{(m+n)^{m+n}}\,,≤ divide start_ARG italic_m start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_m + italic_n ) start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG , (6.6b)
|b||a|(m+n)/m𝑏superscript𝑎𝑚𝑛𝑚\displaystyle\frac{|b|}{|a|^{(m+n)/m}}divide start_ARG | italic_b | end_ARG start_ARG | italic_a | start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_m end_POSTSUPERSCRIPT end_ARG mnn/m(m+n)(m+n)/m,absent𝑚superscript𝑛𝑛𝑚superscript𝑚𝑛𝑚𝑛𝑚\displaystyle\leq\frac{m}{n^{-n/m}(m+n)^{(m+n)/m}}\,,≤ divide start_ARG italic_m end_ARG start_ARG italic_n start_POSTSUPERSCRIPT - italic_n / italic_m end_POSTSUPERSCRIPT ( italic_m + italic_n ) start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_m end_POSTSUPERSCRIPT end_ARG , |b|m|a|m+nsuperscript𝑏𝑚superscript𝑎𝑚𝑛\displaystyle\qquad\frac{|b|^{m}}{|a|^{m+n}}divide start_ARG | italic_b | start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_ARG start_ARG | italic_a | start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG mmnn(m+n)m+n.absentsuperscript𝑚𝑚superscript𝑛𝑛superscript𝑚𝑛𝑚𝑛\displaystyle\leq\frac{m^{m}n^{n}}{(m+n)^{m+n}}\,.≤ divide start_ARG italic_m start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_m + italic_n ) start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG . (6.6c)

If we set b0𝑏0b\to 0italic_b → 0 then xn(xm+a)0superscript𝑥𝑛superscript𝑥𝑚𝑎0x^{n}(x^{m}+a)\to 0italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT + italic_a ) → 0. Hence n𝑛nitalic_n roots approach zero and m𝑚mitalic_m roots approach the respective mthsuperscript𝑚𝑡m^{th}italic_m start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT roots of a𝑎-a- italic_a. The roots of the second and third series respectively lie on the branches which approach zero and the mthsuperscript𝑚𝑡m^{th}italic_m start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT roots of a𝑎-a- italic_a as b0𝑏0b\to 0italic_b → 0. The second and third series have the same domain of convergence and hence together yield all the m+n𝑚𝑛m+nitalic_m + italic_n roots of eq. (6.1). The above set of three series are those found by Eagle [15] and together yield all the roots of the general trinomial for arbitrary values of the coefficients. They are equivalent to the solutions derived by P. A. Lambert [30], Birkeland [6, 9] and Lewis [31].

Consider also the following. Divide eq. (6.1) through by xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT as above, but now set x=eiπ(2+1)/n(aζ/b)1/nsubscript𝑥superscript𝑒𝑖𝜋21𝑛superscript𝑎subscript𝜁𝑏1𝑛x_{\ell}=e^{i\pi(2\ell+1)/n}(a\zeta_{\ell}/b)^{-1/n}italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_n end_POSTSUPERSCRIPT ( italic_a italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT / italic_b ) start_POSTSUPERSCRIPT - 1 / italic_n end_POSTSUPERSCRIPT. This corresponds to setting p=n𝑝𝑛p=nitalic_p = italic_n and q=0𝑞0q=0italic_q = 0, i.e. q<p𝑞𝑝q<pitalic_q < italic_p. Then ζsubscript𝜁\zeta_{\ell}italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT satisfies

ζ=1+eiπ(2+1)m/nbm/na(m+n)/nζm/n.subscript𝜁1superscript𝑒𝑖𝜋21𝑚𝑛superscript𝑏𝑚𝑛superscript𝑎𝑚𝑛𝑛superscriptsubscript𝜁𝑚𝑛\zeta_{\ell}=1+e^{i\pi(2\ell+1)m/n}\frac{b^{m/n}}{a^{(m+n)/n}}\zeta_{\ell}^{-m% /n}\,.italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 + italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) italic_m / italic_n end_POSTSUPERSCRIPT divide start_ARG italic_b start_POSTSUPERSCRIPT italic_m / italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT end_ARG italic_ζ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_m / italic_n end_POSTSUPERSCRIPT . (6.7)

Compare this to eq. (6.4) Now μ=m/n𝜇𝑚𝑛\mu=-m/nitalic_μ = - italic_m / italic_n. This series yields n𝑛nitalic_n roots. It converges if and only if

|b|m/n|a|(m+n)/nnmm/n(m+n)(m+n)/n,|b|m|a|m+nmmnn(m+n)m+n.formulae-sequencesuperscript𝑏𝑚𝑛superscript𝑎𝑚𝑛𝑛𝑛superscript𝑚𝑚𝑛superscript𝑚𝑛𝑚𝑛𝑛superscript𝑏𝑚superscript𝑎𝑚𝑛superscript𝑚𝑚superscript𝑛𝑛superscript𝑚𝑛𝑚𝑛\frac{|b|^{m/n}}{|a|^{(m+n)/n}}\leq\frac{n}{m^{-m/n}(m+n)^{(m+n)/n}}\,,\qquad% \qquad\frac{|b|^{m}}{|a|^{m+n}}\leq\frac{m^{m}n^{n}}{(m+n)^{m+n}}\,.divide start_ARG | italic_b | start_POSTSUPERSCRIPT italic_m / italic_n end_POSTSUPERSCRIPT end_ARG start_ARG | italic_a | start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT end_ARG ≤ divide start_ARG italic_n end_ARG start_ARG italic_m start_POSTSUPERSCRIPT - italic_m / italic_n end_POSTSUPERSCRIPT ( italic_m + italic_n ) start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT end_ARG , divide start_ARG | italic_b | start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_ARG start_ARG | italic_a | start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG ≤ divide start_ARG italic_m start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_m + italic_n ) start_POSTSUPERSCRIPT italic_m + italic_n end_POSTSUPERSCRIPT end_ARG . (6.8)

The solution is

x=eiπ(2+1)/nb1/na1/n(mn;1n;eiπ(2+1)m/nbm/na(m+n)/n)(0n1).subscript𝑥superscript𝑒𝑖𝜋21𝑛superscript𝑏1𝑛superscript𝑎1𝑛𝑚𝑛1𝑛superscript𝑒𝑖𝜋21𝑚𝑛superscript𝑏𝑚𝑛superscript𝑎𝑚𝑛𝑛0𝑛1x_{\ell}=e^{i\pi(2\ell+1)/n}\frac{b^{1/n}}{a^{1/n}}\mathcal{B}\Bigl{(}-\frac{m% }{n};\,-\frac{1}{n};\,e^{i\pi(2\ell+1)m/n}\frac{b^{m/n}}{a^{(m+n)/n}}\Bigr{)}% \qquad(0\leq\ell\leq n-1)\,.italic_x start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_n end_POSTSUPERSCRIPT divide start_ARG italic_b start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT end_ARG caligraphic_B ( - divide start_ARG italic_m end_ARG start_ARG italic_n end_ARG ; - divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ; italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) italic_m / italic_n end_POSTSUPERSCRIPT divide start_ARG italic_b start_POSTSUPERSCRIPT italic_m / italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT ( italic_m + italic_n ) / italic_n end_POSTSUPERSCRIPT end_ARG ) ( 0 ≤ roman_ℓ ≤ italic_n - 1 ) . (6.9)

Hence this series yields the same n𝑛nitalic_n roots as the second series above, for which μ=(m+n)/n𝜇𝑚𝑛𝑛\mu=(m+n)/nitalic_μ = ( italic_m + italic_n ) / italic_n, with the same domain of convergence. It is therefore the same series as in eq. (6.5b). Even the permutations of the roots are identical, because the first terms of both series are eiπ(2+1)/n(b/a)1/nsuperscript𝑒𝑖𝜋21𝑛superscript𝑏𝑎1𝑛e^{i\pi(2\ell+1)/n}(b/a)^{1/n}italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / italic_n end_POSTSUPERSCRIPT ( italic_b / italic_a ) start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT. It was remarked in Sec. 4 that choosing q<p𝑞𝑝q<pitalic_q < italic_p yields the same solutions as the series obtained by interchanging p𝑝pitalic_p and q𝑞qitalic_q.

  • 1.

    Johann Lambert [28] solved the equation xm+px=qsuperscript𝑥𝑚𝑝𝑥𝑞x^{m}+px=qitalic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT + italic_p italic_x = italic_q in 1758. Corless et al. [12] stated “In 1758, Lambert solved the trinomial equation x=q+xm𝑥𝑞superscript𝑥𝑚x=q+x^{m}italic_x = italic_q + italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT by giving a series development for x𝑥xitalic_x in powers of q𝑞qitalic_q.” I found that the equation x=q+xm𝑥𝑞superscript𝑥𝑚x=q+x^{m}italic_x = italic_q + italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT appears in Lambert’s 1770 paper [29, §8], where Lambert stated “auquel on peut toujous donner la forme plus simple” (“to which we can always provide the simpler form”) and where Lambert derived a series for the nthsuperscript𝑛𝑡n^{th}italic_n start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT power xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Lambert’s solutions are Fuss–Catalan series, for the branch which approaches zero when q0𝑞0q\to 0italic_q → 0. Lambert’s series solution for xnsuperscript𝑥𝑛x^{n}italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, where x=q+xm𝑥𝑞superscript𝑥𝑚x=q+x^{m}italic_x = italic_q + italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, may be written as [29, §8]

    xn=q(m;n;qm1).superscript𝑥𝑛𝑞𝑚𝑛superscript𝑞𝑚1x^{n}=q\mathcal{B}(m;n;q^{m-1})\,.italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = italic_q caligraphic_B ( italic_m ; italic_n ; italic_q start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT ) . (6.10)

    Lambert did not specify the radius of convergence of his series. It converges if and only if

    |q|m1mm/(m1).𝑞𝑚1superscript𝑚𝑚𝑚1|q|\leq\frac{m-1}{m^{m/(m-1)}}\,.| italic_q | ≤ divide start_ARG italic_m - 1 end_ARG start_ARG italic_m start_POSTSUPERSCRIPT italic_m / ( italic_m - 1 ) end_POSTSUPERSCRIPT end_ARG . (6.11)
  • 2.

    Ramanujan also solved the trinomial via a series. The equation he treated was [5, first quarterly report, 1.6 (iv), eq. (1.15)]

    aqxp+xq=1.𝑎𝑞superscript𝑥𝑝superscript𝑥𝑞1aqx^{p}+x^{q}=1\,.italic_a italic_q italic_x start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + italic_x start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT = 1 . (6.12)

    Ramanujan derived the following solution for any power n𝑛nitalic_n [5, first quarterly report, 1.6 (iv), eq. (1.16)]

    xn=nqk=0Γ({n+pk}/q)(qa)kΓ({n+pk}/qk+1)k!.superscript𝑥𝑛𝑛𝑞superscriptsubscript𝑘0Γ𝑛𝑝𝑘𝑞superscript𝑞𝑎𝑘Γ𝑛𝑝𝑘𝑞𝑘1𝑘x^{n}=\frac{n}{q}\sum_{k=0}^{\infty}\frac{\Gamma(\{n+pk\}/q)(-qa)^{k}}{\Gamma(% \{n+pk\}/q-k+1)k!}\,.italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = divide start_ARG italic_n end_ARG start_ARG italic_q end_ARG ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG roman_Γ ( { italic_n + italic_p italic_k } / italic_q ) ( - italic_q italic_a ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG roman_Γ ( { italic_n + italic_p italic_k } / italic_q - italic_k + 1 ) italic_k ! end_ARG . (6.13)

    This is the branch which approaches unity for a0𝑎0a\to 0italic_a → 0. The above expression is stated in [5] to be valid for all real numbers n𝑛nitalic_n, p𝑝pitalic_p, q𝑞qitalic_q and for complex a𝑎aitalic_a satisfying

    |a||p|p/q|pq|(pq)/q.𝑎superscript𝑝𝑝𝑞superscript𝑝𝑞𝑝𝑞𝑞|a|\leq|p|^{-p/q}|p-q|^{(p-q)/q}\,.| italic_a | ≤ | italic_p | start_POSTSUPERSCRIPT - italic_p / italic_q end_POSTSUPERSCRIPT | italic_p - italic_q | start_POSTSUPERSCRIPT ( italic_p - italic_q ) / italic_q end_POSTSUPERSCRIPT . (6.14)

    Let us verify Ramanujan’s solution. The expression in eq. (6.13) is tricky if n=0𝑛0n=0italic_n = 0. The first term in the sum is actually unity

    xn=(n/q)Γ(n/q)Γ(n/q+1)+nqk=1(qa)kk!u=1k1(kp/q+n/qu)=1+nqk=1(qa)kk!u=1k1(kp/q+n/qu).superscript𝑥𝑛𝑛𝑞Γ𝑛𝑞Γ𝑛𝑞1𝑛𝑞superscriptsubscript𝑘1superscript𝑞𝑎𝑘𝑘superscriptsubscriptproduct𝑢1𝑘1𝑘𝑝𝑞𝑛𝑞𝑢1𝑛𝑞superscriptsubscript𝑘1superscript𝑞𝑎𝑘𝑘superscriptsubscriptproduct𝑢1𝑘1𝑘𝑝𝑞𝑛𝑞𝑢\begin{split}x^{n}&=\frac{(n/q)\Gamma(n/q)}{\Gamma(n/q+1)}+\frac{n}{q}\sum_{k=% 1}^{\infty}\frac{(-qa)^{k}}{k!}\prod_{u=1}^{k-1}(kp/q+n/q-u)\\ &=1+\frac{n}{q}\sum_{k=1}^{\infty}\frac{(-qa)^{k}}{k!}\prod_{u=1}^{k-1}(kp/q+n% /q-u)\,.\end{split}start_ROW start_CELL italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_CELL start_CELL = divide start_ARG ( italic_n / italic_q ) roman_Γ ( italic_n / italic_q ) end_ARG start_ARG roman_Γ ( italic_n / italic_q + 1 ) end_ARG + divide start_ARG italic_n end_ARG start_ARG italic_q end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG ( - italic_q italic_a ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT ( italic_k italic_p / italic_q + italic_n / italic_q - italic_u ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = 1 + divide start_ARG italic_n end_ARG start_ARG italic_q end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG ( - italic_q italic_a ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT ( italic_k italic_p / italic_q + italic_n / italic_q - italic_u ) . end_CELL end_ROW (6.15)

    We need to perform the cancellations before setting n=0𝑛0n=0italic_n = 0 on the right hand side. To solve eq. (6.12) using a Fuss–Catalan series (for the branch treated by Ramanujan), put ζ=xq𝜁superscript𝑥𝑞\zeta=x^{q}italic_ζ = italic_x start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT, so x=ζ1/q𝑥superscript𝜁1𝑞x=\zeta^{1/q}italic_x = italic_ζ start_POSTSUPERSCRIPT 1 / italic_q end_POSTSUPERSCRIPT, then ζ=1aqζp/q𝜁1𝑎𝑞superscript𝜁𝑝𝑞\zeta=1-aq\zeta^{p/q}italic_ζ = 1 - italic_a italic_q italic_ζ start_POSTSUPERSCRIPT italic_p / italic_q end_POSTSUPERSCRIPT. Hence μ=p/q𝜇𝑝𝑞\mu=p/qitalic_μ = italic_p / italic_q and z=qa𝑧𝑞𝑎z=-qaitalic_z = - italic_q italic_a in eq. (2.3). The solution is (using k𝑘kitalic_k as a summation variable)

    xn=ζn/q=k=0Ak(p/q,n/q)(qa)k=1+nqk=1(qa)kk!u=1k1(kp/q+n/qu).superscript𝑥𝑛superscript𝜁𝑛𝑞superscriptsubscript𝑘0subscript𝐴𝑘𝑝𝑞𝑛𝑞superscript𝑞𝑎𝑘1𝑛𝑞superscriptsubscript𝑘1superscript𝑞𝑎𝑘𝑘superscriptsubscriptproduct𝑢1𝑘1𝑘𝑝𝑞𝑛𝑞𝑢\begin{split}x^{n}=\zeta^{n/q}&=\sum_{k=0}^{\infty}A_{k}(p/q,n/q)(-qa)^{k}\\ &=1+\frac{n}{q}\sum_{k=1}^{\infty}\frac{(-qa)^{k}}{k!}\prod_{u=1}^{k-1}(kp/q+n% /q-u)\,.\end{split}start_ROW start_CELL italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = italic_ζ start_POSTSUPERSCRIPT italic_n / italic_q end_POSTSUPERSCRIPT end_CELL start_CELL = ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_p / italic_q , italic_n / italic_q ) ( - italic_q italic_a ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = 1 + divide start_ARG italic_n end_ARG start_ARG italic_q end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG ( - italic_q italic_a ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT ( italic_k italic_p / italic_q + italic_n / italic_q - italic_u ) . end_CELL end_ROW (6.16)

    This equals the expression in eq. (6.13). The series converges if and only if

    |a|1|q|1|p/q|p/q|1p/q|1p/q,=|p|p/q|pq|(pq)/q.\begin{split}|a|&\leq\frac{1}{|q|}\,\frac{1}{|p/q|^{p/q}|1-p/q|^{1-p/q}}\,,\\ &=|p|^{-p/q}|p-q|^{(p-q)/q}\,.\end{split}start_ROW start_CELL | italic_a | end_CELL start_CELL ≤ divide start_ARG 1 end_ARG start_ARG | italic_q | end_ARG divide start_ARG 1 end_ARG start_ARG | italic_p / italic_q | start_POSTSUPERSCRIPT italic_p / italic_q end_POSTSUPERSCRIPT | 1 - italic_p / italic_q | start_POSTSUPERSCRIPT 1 - italic_p / italic_q end_POSTSUPERSCRIPT end_ARG , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = | italic_p | start_POSTSUPERSCRIPT - italic_p / italic_q end_POSTSUPERSCRIPT | italic_p - italic_q | start_POSTSUPERSCRIPT ( italic_p - italic_q ) / italic_q end_POSTSUPERSCRIPT . end_CELL end_ROW (6.17)

    This confirms the bound in eq. (6.14).

6.2 Lambert and Euler trinomial equations

At stated above, in 1758 Lambert [28] gave a series solution for the trinomial equation xm+px=qsuperscript𝑥𝑚𝑝𝑥𝑞x^{m}+px=qitalic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT + italic_p italic_x = italic_q and later in 1770, Lambert [29, §8] revisited the equation in the form

x=q+xm.𝑥𝑞superscript𝑥𝑚x=q+x^{m}\,.italic_x = italic_q + italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT . (6.18)

The treatment below follows Corless et al. [12]. In 1779 Euler [16] derived the following equation from Lambert’s trinomial (I have changed Euler’s ‘x𝑥xitalic_x’ to ‘z𝑧zitalic_z’ to avoid confusion as to which equation x𝑥xitalic_x satisfies)

zαzβ=(αβ)vzα+β.superscript𝑧𝛼superscript𝑧𝛽𝛼𝛽𝑣superscript𝑧𝛼𝛽z^{\alpha}-z^{\beta}=(\alpha-\beta)vz^{\alpha+\beta}\,.italic_z start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT - italic_z start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT = ( italic_α - italic_β ) italic_v italic_z start_POSTSUPERSCRIPT italic_α + italic_β end_POSTSUPERSCRIPT . (6.19)

This is obtained from eq. (6.18) via the substitutions x=zβ𝑥superscript𝑧𝛽x=z^{-\beta}italic_x = italic_z start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT, m=α/β𝑚𝛼𝛽m=\alpha/\betaitalic_m = italic_α / italic_β (this corrects a misprint in [12], which stated m=αβ𝑚𝛼𝛽m=\alpha\betaitalic_m = italic_α italic_β) and q=(αβ)v𝑞𝛼𝛽𝑣q=(\alpha-\beta)vitalic_q = ( italic_α - italic_β ) italic_v. Euler’s solution of eq. (6.19), for znsuperscript𝑧𝑛z^{n}italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, was [16]

zn=1+nv+12!n(n+α+β)v2+13!n(n+α+2β)(n+2α+β)v3+14!n(n+α+3β)(n+2α+2β)(n+3α+β)v4+superscript𝑧𝑛1𝑛𝑣12𝑛𝑛𝛼𝛽superscript𝑣213𝑛𝑛𝛼2𝛽𝑛2𝛼𝛽superscript𝑣314𝑛𝑛𝛼3𝛽𝑛2𝛼2𝛽𝑛3𝛼𝛽superscript𝑣4\begin{split}z^{n}=1+nv&+\frac{1}{2!}\,n(n+\alpha+\beta)\,v^{2}\\ &+\frac{1}{3!}\,n(n+\alpha+2\beta)(n+2\alpha+\beta)\,v^{3}\\ &+\frac{1}{4!}\,n(n+\alpha+3\beta)(n+2\alpha+2\beta)(n+3\alpha+\beta)\,v^{4}+% \cdots\end{split}start_ROW start_CELL italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = 1 + italic_n italic_v end_CELL start_CELL + divide start_ARG 1 end_ARG start_ARG 2 ! end_ARG italic_n ( italic_n + italic_α + italic_β ) italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + divide start_ARG 1 end_ARG start_ARG 3 ! end_ARG italic_n ( italic_n + italic_α + 2 italic_β ) ( italic_n + 2 italic_α + italic_β ) italic_v start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + divide start_ARG 1 end_ARG start_ARG 4 ! end_ARG italic_n ( italic_n + italic_α + 3 italic_β ) ( italic_n + 2 italic_α + 2 italic_β ) ( italic_n + 3 italic_α + italic_β ) italic_v start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT + ⋯ end_CELL end_ROW (6.20)

Clearly, eq. (6.18) can be solved using Fuss–Catalan series. There are m𝑚mitalic_m roots, of which one approaches 0 and m1𝑚1m-1italic_m - 1 approach the roots of unity as q0𝑞0q\to 0italic_q → 0. Lambert’s solution is the unique branch which vanishes for q=0𝑞0q=0italic_q = 0 and was displayed as a Fuss–Catalan series in eq. (6.10). Euler’s solution in eq. (6.20) does not vanish for α=β𝛼𝛽\alpha=\betaitalic_α = italic_β, i.e. q=0𝑞0q=0italic_q = 0, and is the unique solution of eq. (6.18) which is real (if q𝑞qitalic_q is real) and approaches 1 as q0𝑞0q\to 0italic_q → 0. We can derive it as follows. Divide eq. (6.18) through by xmsuperscript𝑥𝑚x^{m}italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and set x=ζ1/(m1)𝑥superscript𝜁1𝑚1x=\zeta^{-1/(m-1)}italic_x = italic_ζ start_POSTSUPERSCRIPT - 1 / ( italic_m - 1 ) end_POSTSUPERSCRIPT, then ζ=1+qζm/(m1)𝜁1𝑞superscript𝜁𝑚𝑚1\zeta=1+q\zeta^{m/(m-1)}italic_ζ = 1 + italic_q italic_ζ start_POSTSUPERSCRIPT italic_m / ( italic_m - 1 ) end_POSTSUPERSCRIPT. Hence μ=m/(m1)𝜇𝑚𝑚1\mu=m/(m-1)italic_μ = italic_m / ( italic_m - 1 ) and the solution is

x=(mm1;1m1;q).𝑥𝑚𝑚11𝑚1𝑞x=\mathcal{B}\Bigl{(}\frac{m}{m-1};\,-\frac{1}{m-1};\,q\Bigr{)}\,.italic_x = caligraphic_B ( divide start_ARG italic_m end_ARG start_ARG italic_m - 1 end_ARG ; - divide start_ARG 1 end_ARG start_ARG italic_m - 1 end_ARG ; italic_q ) . (6.21)

Put x=zβ𝑥superscript𝑧𝛽x=z^{-\beta}italic_x = italic_z start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT, m=α/β𝑚𝛼𝛽m=\alpha/\betaitalic_m = italic_α / italic_β and q=(αβ)v𝑞𝛼𝛽𝑣q=(\alpha-\beta)vitalic_q = ( italic_α - italic_β ) italic_v, then zn=xn/βsuperscript𝑧𝑛superscript𝑥𝑛𝛽z^{n}=x^{-n/\beta}italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = italic_x start_POSTSUPERSCRIPT - italic_n / italic_β end_POSTSUPERSCRIPT

zn=(ααβ;nαβ;(αβ)v).superscript𝑧𝑛𝛼𝛼𝛽𝑛𝛼𝛽𝛼𝛽𝑣z^{n}=\mathcal{B}\Bigl{(}\frac{\alpha}{\alpha-\beta};\,\frac{n}{\alpha-\beta};% \,(\alpha-\beta)v\Bigr{)}\,.italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = caligraphic_B ( divide start_ARG italic_α end_ARG start_ARG italic_α - italic_β end_ARG ; divide start_ARG italic_n end_ARG start_ARG italic_α - italic_β end_ARG ; ( italic_α - italic_β ) italic_v ) . (6.22)

Let us verify from eq. (6.20) that this equals Euler’s solution:

zn=1+nt=1vtt!j=1t1(tα+nj(αβ))=1+nαβt=1(αβ)tvtt!j=1t1(tααβ+nαβj)=t=0At(ααβ,nαβ)(αβ)tvt=(ααβ;nαβ;(αβ)v).superscript𝑧𝑛1𝑛superscriptsubscript𝑡1superscript𝑣𝑡𝑡superscriptsubscriptproduct𝑗1𝑡1𝑡𝛼𝑛𝑗𝛼𝛽1𝑛𝛼𝛽superscriptsubscript𝑡1superscript𝛼𝛽𝑡superscript𝑣𝑡𝑡superscriptsubscriptproduct𝑗1𝑡1𝑡𝛼𝛼𝛽𝑛𝛼𝛽𝑗superscriptsubscript𝑡0subscript𝐴𝑡𝛼𝛼𝛽𝑛𝛼𝛽superscript𝛼𝛽𝑡superscript𝑣𝑡𝛼𝛼𝛽𝑛𝛼𝛽𝛼𝛽𝑣\begin{split}z^{n}&=1+n\sum_{t=1}^{\infty}\frac{v^{t}}{t!}\,\prod_{j=1}^{t-1}(% t\alpha+n-j(\alpha-\beta))\\ &=1+\frac{n}{\alpha-\beta}\sum_{t=1}^{\infty}\frac{(\alpha-\beta)^{t}v^{t}}{t!% }\,\prod_{j=1}^{t-1}\Bigl{(}\frac{t\alpha}{\alpha-\beta}+\frac{n}{\alpha-\beta% }-j\Bigr{)}\\ &=\sum_{t=0}^{\infty}A_{t}\Bigl{(}\frac{\alpha}{\alpha-\beta},\,\frac{n}{% \alpha-\beta}\Bigr{)}\,(\alpha-\beta)^{t}v^{t}\\ &=\mathcal{B}\Bigl{(}\frac{\alpha}{\alpha-\beta};\,\frac{n}{\alpha-\beta};\,(% \alpha-\beta)v\Bigr{)}\,.\end{split}start_ROW start_CELL italic_z start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_CELL start_CELL = 1 + italic_n ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG italic_v start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG start_ARG italic_t ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ( italic_t italic_α + italic_n - italic_j ( italic_α - italic_β ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = 1 + divide start_ARG italic_n end_ARG start_ARG italic_α - italic_β end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG ( italic_α - italic_β ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_v start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG start_ARG italic_t ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_t italic_α end_ARG start_ARG italic_α - italic_β end_ARG + divide start_ARG italic_n end_ARG start_ARG italic_α - italic_β end_ARG - italic_j ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( divide start_ARG italic_α end_ARG start_ARG italic_α - italic_β end_ARG , divide start_ARG italic_n end_ARG start_ARG italic_α - italic_β end_ARG ) ( italic_α - italic_β ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_v start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = caligraphic_B ( divide start_ARG italic_α end_ARG start_ARG italic_α - italic_β end_ARG ; divide start_ARG italic_n end_ARG start_ARG italic_α - italic_β end_ARG ; ( italic_α - italic_β ) italic_v ) . end_CELL end_ROW (6.23)

This agrees with eq. (6.22). The paper by Corless et al. [12] discusses various applications of the Lambert W𝑊Witalic_W function, which is the real solution (for real x1/e𝑥1𝑒x\geq-1/eitalic_x ≥ - 1 / italic_e) of the equation W(x)eW(x)=x𝑊𝑥superscript𝑒𝑊𝑥𝑥W(x)e^{W(x)}=xitalic_W ( italic_x ) italic_e start_POSTSUPERSCRIPT italic_W ( italic_x ) end_POSTSUPERSCRIPT = italic_x, but that is beyond the scope of this paper.

7 Algebraic equations: convergence of series I

7.1 General remarks

A necessary and sufficient bound for the domain of absolute convergence is available for the important special case of the solutions of algebraic equations by infinite series. We begin with some known theorems from the theory of power series in several complex variables.

Definition 7.1 (multicircular or Reinhardt domain).

A multi-circular or Reinhardt domain in ksuperscript𝑘\mathbb{C}^{k}blackboard_C start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT has the property that for k𝑘kitalic_k complex variables 𝐳=(z1,,zk)𝐳subscript𝑧1subscript𝑧𝑘\bm{z}=(z_{1},\dots,z_{k})bold_italic_z = ( italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ), if a point 𝐳subscript𝐳\bm{z}_{*}bold_italic_z start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT lies in the domain, then so does every point 𝐳𝐳\bm{z}bold_italic_z such that |zj|=|zj|subscript𝑧𝑗subscript𝑧absent𝑗|z_{j}|=|z_{*j}|| italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | = | italic_z start_POSTSUBSCRIPT ∗ italic_j end_POSTSUBSCRIPT | for j=1,,k𝑗1𝑘j=1,\dots,kitalic_j = 1 , … , italic_k. A multi-circular domain with the property that if a point 𝐳subscript𝐳\bm{z}_{*}bold_italic_z start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT lies in the domain, then so does the polydisc given by {𝐳:|zj||zj|,j=1,,k}conditional-set𝐳formulae-sequencesubscript𝑧𝑗subscript𝑧absent𝑗𝑗1𝑘\{\bm{z}:|z_{j}|\leq|z_{*j}|,\;j=1,\dots,k\}{ bold_italic_z : | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ | italic_z start_POSTSUBSCRIPT ∗ italic_j end_POSTSUBSCRIPT | , italic_j = 1 , … , italic_k } is known as a complete Reinhardt domain. A polydisc is a Cartesian product of discs, in general with different radii.

The convergence domain of a power series in multiple variables is a union of polydiscs centered at the origin and is a complete Reinhardt domain. The following is also known. Using a vector notation, with coefficients 𝒄𝜶subscript𝒄𝜶\bm{c}_{\bm{\alpha}}bold_italic_c start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT indexed by a k𝑘kitalic_k-tuple 𝜶𝜶\bm{\alpha}bold_italic_α, if both 𝜶|𝒄𝜶𝒛𝜶|subscript𝜶subscript𝒄𝜶superscript𝒛𝜶\sum_{\bm{\alpha}}|\bm{c}_{\bm{\alpha}}\bm{z}^{\bm{\alpha}}|∑ start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT | bold_italic_c start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT bold_italic_z start_POSTSUPERSCRIPT bold_italic_α end_POSTSUPERSCRIPT | and 𝜶|𝒄𝜶𝒘𝜶|subscript𝜶subscript𝒄𝜶superscript𝒘𝜶\sum_{\bm{\alpha}}|\bm{c}_{\bm{\alpha}}\bm{w}^{\bm{\alpha}}|∑ start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT | bold_italic_c start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT bold_italic_w start_POSTSUPERSCRIPT bold_italic_α end_POSTSUPERSCRIPT | converge, then so does 𝜶|𝒄𝜶||𝒛𝜶|t|𝒘𝜶|1tsubscript𝜶subscript𝒄𝜶superscriptsuperscript𝒛𝜶𝑡superscriptsuperscript𝒘𝜶1𝑡\sum_{\bm{\alpha}}|\bm{c}_{\bm{\alpha}}||\bm{z}^{\bm{\alpha}}|^{t}|\bm{w}^{\bm% {\alpha}}|^{1-t}∑ start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT | bold_italic_c start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT | | bold_italic_z start_POSTSUPERSCRIPT bold_italic_α end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT | bold_italic_w start_POSTSUPERSCRIPT bold_italic_α end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 1 - italic_t end_POSTSUPERSCRIPT for 0t10𝑡10\leq t\leq 10 ≤ italic_t ≤ 1. This property of a Reinhardt domain is called logarithmic convexity. Define a map Log:({0})kk:Logsuperscript0𝑘superscript𝑘\textrm{Log}:(\mathbb{C}\setminus\{0\})^{k}\to\mathbb{R}^{k}Log : ( blackboard_C ∖ { 0 } ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT where zjln|zj|maps-tosubscript𝑧𝑗subscript𝑧𝑗z_{j}\mapsto\ln|z_{j}|italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ↦ roman_ln | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | for j=1,,k𝑗1𝑘j=1,\dots,kitalic_j = 1 , … , italic_k. Let the image of the domain of convergence 𝒟𝒟\mathcal{D}caligraphic_D be Log(𝒟)kLog𝒟superscript𝑘\textrm{Log}(\mathcal{D})\subset\mathbb{R}^{k}Log ( caligraphic_D ) ⊂ blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT. If Log(z),Log(w)Log(𝒟)Log𝑧Log𝑤Log𝒟\textrm{Log}(z),\textrm{Log}(w)\in\textrm{Log}(\mathcal{D})Log ( italic_z ) , Log ( italic_w ) ∈ Log ( caligraphic_D ), then also tLog(z)+(1t)Log(w)Log(𝒟)𝑡Log𝑧1𝑡Log𝑤Log𝒟t\textrm{Log}(z)+(1-t)\textrm{Log}(w)\in\textrm{Log}(\mathcal{D})italic_t Log ( italic_z ) + ( 1 - italic_t ) Log ( italic_w ) ∈ Log ( caligraphic_D ) for 0t10𝑡10\leq t\leq 10 ≤ italic_t ≤ 1, i.e.

(tln|z1|+(1t)ln|w1|,,tln|zk|+(1t)ln|wk|)Log(𝒟).𝑡subscript𝑧11𝑡subscript𝑤1𝑡subscript𝑧𝑘1𝑡subscript𝑤𝑘Log𝒟(t\ln|z_{1}|+(1-t)\ln|w_{1}|,\dots,t\ln|z_{k}|+(1-t)\ln|w_{k}|)\in\textrm{Log}% (\mathcal{D})\,.( italic_t roman_ln | italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | + ( 1 - italic_t ) roman_ln | italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , … , italic_t roman_ln | italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | + ( 1 - italic_t ) roman_ln | italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ) ∈ Log ( caligraphic_D ) . (7.1)

A complete Reinhardt domain in ksuperscript𝑘\mathbb{C}^{k}blackboard_C start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT is the domain of absolute convergence of a power series if and only if the domain is logarithmically convex. The power series converges uniformly in every compact subset of the domain 𝒟𝒟\mathcal{D}caligraphic_D. Note that logarithmic convexity does not imply convexity. For k=1𝑘1k=1italic_k = 1, the domain of convergence of a univariate power series is a disc in \mathbb{C}blackboard_C centered on the origin, and is convex. However, for k2𝑘2k\geq 2italic_k ≥ 2 variables, a complete Reinhardt domain in not in general convex. However, from the foregoing remarks about polydiscs, the following is true. If a point 𝒛subscript𝒛\bm{z}_{*}bold_italic_z start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT lies in the domain of convergence, then so does every point on the ray joining the origin to 𝒛subscript𝒛\bm{z}_{*}bold_italic_z start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT, i.e. 𝒛=λ𝒛𝒛𝜆subscript𝒛\bm{z}=\lambda\bm{z}_{*}bold_italic_z = italic_λ bold_italic_z start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT for 0λ10𝜆10\leq\lambda\leq 10 ≤ italic_λ ≤ 1.

The above theory is general. In this section, we are concerned with the domain of absolute convergence of the series in eq. (4.8), which is the solution of eq. (4.1). Passare and Tsikh [36] claimed to offer a necessary and sufficient bound for absolute convergence in this case. We summarize their work below. We also display some counterexamples to their bound, and offer a more detailed analysis. For ease of contact with the formalism in [36], we write

a0+a1x++xp++xq++anxn=0.subscript𝑎0subscript𝑎1𝑥superscript𝑥𝑝superscript𝑥𝑞subscript𝑎𝑛superscript𝑥𝑛0a_{0}+a_{1}x+\cdots+x^{p}+\cdots+x^{q}+\cdots+a_{n}x^{n}=0\,.italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + ⋯ + italic_x start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ + italic_x start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = 0 . (7.2)

This is effectively eq. (4.1) (or eq. (4.2)) where we have set ap=aq=1subscript𝑎𝑝subscript𝑎𝑞1a_{p}=a_{q}=1italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = 1. This is the equation treated in [36]. The series solution is given by eq. (4.8), with obvious changes of notation. Passare and Tsikh employed the notation [p]delimited-[]𝑝[p][ italic_p ] to denote that the index p𝑝pitalic_p is excluded from a list of the form (α0,α1,,[p],,αn)subscript𝛼0subscript𝛼1delimited-[]𝑝subscript𝛼𝑛(\alpha_{0},\alpha_{1},\dots,[p],\dots,\alpha_{n})( italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , [ italic_p ] , … , italic_α start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). The solution of eq. (7.2) is a power series in the n1𝑛1n-1italic_n - 1 variables (a0,a1,,[p],,[q],,an)subscript𝑎0subscript𝑎1delimited-[]𝑝delimited-[]𝑞subscript𝑎𝑛(a_{0},a_{1},\dots,[p],\dots,[q],\dots,a_{n})( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , [ italic_p ] , … , [ italic_q ] , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). Passare and Tsikh studied the discriminant Δpq(a0,a1,,[p],,[q],,an)subscriptΔ𝑝𝑞subscript𝑎0subscript𝑎1delimited-[]𝑝delimited-[]𝑞subscript𝑎𝑛\Delta_{pq}(a_{0},a_{1},\dots,[p],\dots,[q],\dots,a_{n})roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , [ italic_p ] , … , [ italic_q ] , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) which is the discriminant of the polynomial in eq. (7.2). Then Passare and Tsikh [36] claimed that the domain of absolute convergence 𝒟pqsubscript𝒟𝑝𝑞\mathcal{D}_{pq}caligraphic_D start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT of the series solution of eq. (7.2) is a complete Reinhardt domain whose boundary is (a segment of) the zero locus Δpq(a0,a1,,[p],,[q],,an)=0subscriptΔ𝑝𝑞subscript𝑎0subscript𝑎1delimited-[]𝑝delimited-[]𝑞subscript𝑎𝑛0\Delta_{pq}(a_{0},a_{1},\dots,[p],\dots,[q],\dots,a_{n})=0roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , [ italic_p ] , … , [ italic_q ] , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) = 0. Specifically, for absolute convergence, they derived equations of the form Δpq(±|a0|,,[p],,[q],,±|an|)=0subscriptΔ𝑝𝑞plus-or-minussubscript𝑎0delimited-[]𝑝delimited-[]𝑞plus-or-minussubscript𝑎𝑛0\Delta_{pq}(\pm|a_{0}|,\dots,[p],\dots,[q],\dots,\pm|a_{n}|)=0roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( ± | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , … , [ italic_p ] , … , [ italic_q ] , … , ± | italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ) = 0. See [36, Thm. 3] for a precise statement of their result.

7.2 Application to cubics

Passare and Tsikh employed their formalism to display the domains of convergence for the series solutions of a cubic [36, Sec. 5.2]. The general cubic equation with complex coefficients (a0,a1,a2,a3)subscript𝑎0subscript𝑎1subscript𝑎2subscript𝑎3(a_{0},a_{1},a_{2},a_{3})( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) is

a0+a1x+a2x2+a3x3=0.subscript𝑎0subscript𝑎1𝑥subscript𝑎2superscript𝑥2subscript𝑎3superscript𝑥30a_{0}+a_{1}x+a_{2}x^{2}+a_{3}x^{3}=0\,.italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT = 0 . (7.3)

There are six choices for p𝑝pitalic_p and q𝑞qitalic_q, and the respective domains of convergence 𝒟pqsubscript𝒟𝑝𝑞\mathcal{D}_{pq}caligraphic_D start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT were given as follows [36, unnumbered before eq. (17)]

𝒟01subscript𝒟01\displaystyle\mathcal{D}_{01}caligraphic_D start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT ={Δ01(|a2|,|a3|)<0},absentsubscriptΔ01subscript𝑎2subscript𝑎30\displaystyle=\{\Delta_{01}(|a_{2}|,-|a_{3}|)<0\}\,,= { roman_Δ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) < 0 } , (7.4a)
𝒟02superscriptsubscript𝒟02\displaystyle\mathcal{D}_{02}^{*}caligraphic_D start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ={Δ02(|a1|,|a3|)>0}{Δ02(|a1|,|a3|)<0},absentsubscriptΔ02subscript𝑎1subscript𝑎30subscriptΔ02subscript𝑎1subscript𝑎30\displaystyle=\{\Delta_{02}(|a_{1}|,|a_{3}|)>0\}\cap\{\Delta_{02}(|a_{1}|,-|a_% {3}|)<0\}\,,= { roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) > 0 } ∩ { roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) < 0 } , (7.4b)
𝒟03subscript𝒟03\displaystyle\mathcal{D}_{03}caligraphic_D start_POSTSUBSCRIPT 03 end_POSTSUBSCRIPT ={Δ03(|a1|,|a2|)>0},absentsubscriptΔ03subscript𝑎1subscript𝑎20\displaystyle=\{\Delta_{03}(-|a_{1}|,-|a_{2}|)>0\}\,,= { roman_Δ start_POSTSUBSCRIPT 03 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) > 0 } , (7.4c)
𝒟12subscript𝒟12\displaystyle\mathcal{D}_{12}caligraphic_D start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ={Δ12(|a0|,|a3|)<0}{Δ12(|a0|,|a3|)<0},absentsubscriptΔ12subscript𝑎0subscript𝑎30subscriptΔ12subscript𝑎0subscript𝑎30\displaystyle=\{\Delta_{12}(|a_{0}|,-|a_{3}|)<0\}\cap\{\Delta_{12}(-|a_{0}|,|a% _{3}|)<0\}\,,= { roman_Δ start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) < 0 } ∩ { roman_Δ start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) < 0 } , (7.4d)
𝒟13superscriptsubscript𝒟13\displaystyle\mathcal{D}_{13}^{*}caligraphic_D start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ={Δ13(|a0|,|a2|)>0}{Δ13(|a0|,|a2|)<0},absentsubscriptΔ13subscript𝑎0subscript𝑎20subscriptΔ13subscript𝑎0subscript𝑎20\displaystyle=\{\Delta_{13}(|a_{0}|,|a_{2}|)>0\}\cap\{\Delta_{13}(|a_{0}|,-|a_% {2}|)<0\}\,,= { roman_Δ start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) > 0 } ∩ { roman_Δ start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) < 0 } , (7.4e)
𝒟23superscriptsubscript𝒟23\displaystyle\mathcal{D}_{23}^{*}caligraphic_D start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ={Δ23(|a0|,|a1|)<0}.absentsubscriptΔ23subscript𝑎0subscript𝑎10\displaystyle=\{\Delta_{23}(|a_{0}|,-|a_{1}|)<0\}\,.= { roman_Δ start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) < 0 } . (7.4f)

Three of the above six cases, marked with asterisks, are wrong. The cases 𝒟02subscript𝒟02\mathcal{D}_{02}caligraphic_D start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT and 𝒟13subscript𝒟13\mathcal{D}_{13}caligraphic_D start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT contain fundamental errors, while 𝒟23subscript𝒟23\mathcal{D}_{23}caligraphic_D start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT can be explained as a misprint. I have attempted to resolve these issues privately with Tsikh, but regrettably have not received a reply of scientific substance. (Passare is deceased.) We begin with the case p=0𝑝0p=0italic_p = 0 and q=2𝑞2q=2italic_q = 2. The relevant cubic equation is

1+a1x+x2+a3x3=0.1subscript𝑎1𝑥superscript𝑥2subscript𝑎3superscript𝑥301+a_{1}x+x^{2}+a_{3}x^{3}=0\,.1 + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT = 0 . (7.5)

Setting a3=0subscript𝑎30a_{3}=0italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 0 in eq. (7.4b) yields the self-contradictory conditions

Δ02(|a1|,0)>0andΔ02(|a1|,0)<0.formulae-sequencesubscriptΔ02subscript𝑎100andsubscriptΔ02subscript𝑎100\Delta_{02}(|a_{1}|,0)>0\qquad\textrm{and}\qquad\Delta_{02}(|a_{1}|,0)<0\,.roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , 0 ) > 0 and roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , 0 ) < 0 . (7.6)

These conditions imply that for a3=0subscript𝑎30a_{3}=0italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 0, the series does not converge for any a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and in particular the origin (a1,a3)=(0,0)subscript𝑎1subscript𝑎300(a_{1},a_{3})=(0,0)( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) = ( 0 , 0 ) is not in the domain of convergence, which is false. For a3=0subscript𝑎30a_{3}=0italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 0, eq. (7.5) reduces to the quadratic 1+a1x+x2=01subscript𝑎1𝑥superscript𝑥201+a_{1}x+x^{2}=01 + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0 and the series solution converges for 4|a1|2>04superscriptsubscript𝑎1204-|a_{1}|^{2}>04 - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT > 0 or |a1|<2subscript𝑎12|a_{1}|<2| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | < 2. We now show that the error in eq. (7.4b) is fundamental and cannot be explained as a misprint in [36].

  • 1.

    First, the expressions for the discriminants are, for all four ±plus-or-minus\pm± sign assignments (±|a1|,±|a3|)plus-or-minussubscript𝑎1plus-or-minussubscript𝑎3(\pm|a_{1}|,\pm|a_{3}|)( ± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ),

    Δ02(|a1|,|a3|)subscriptΔ02subscript𝑎1subscript𝑎3\displaystyle\Delta_{02}(|a_{1}|,|a_{3}|)roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|a3|2+4|a1|3|a3|+418|a1||a3||a1|2,absent27superscriptsubscript𝑎324superscriptsubscript𝑎13subscript𝑎3418subscript𝑎1subscript𝑎3superscriptsubscript𝑎12\displaystyle=27|a_{3}|^{2}+4|a_{1}|^{3}|a_{3}|+4-18|a_{1}||a_{3}|-|a_{1}|^{2}\,,= 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | + 4 - 18 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (7.7a)
    Δ02(|a1|,|a3|)subscriptΔ02subscript𝑎1subscript𝑎3\displaystyle\Delta_{02}(|a_{1}|,-|a_{3}|)roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|a3|24|a1|3|a3|+4+18|a1||a3||a1|2.absent27superscriptsubscript𝑎324superscriptsubscript𝑎13subscript𝑎3418subscript𝑎1subscript𝑎3superscriptsubscript𝑎12\displaystyle=27|a_{3}|^{2}-4|a_{1}|^{3}|a_{3}|+4+18|a_{1}||a_{3}|-|a_{1}|^{2}\,.= 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | + 4 + 18 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (7.7b)
    Δ02(|a1|,|a3|)subscriptΔ02subscript𝑎1subscript𝑎3\displaystyle\Delta_{02}(-|a_{1}|,|a_{3}|)roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =Δ02(|a1|,|a3|),absentsubscriptΔ02subscript𝑎1subscript𝑎3\displaystyle=\Delta_{02}(|a_{1}|,-|a_{3}|)\,,= roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) , (7.7c)
    Δ02(|a1|,|a3|)subscriptΔ02subscript𝑎1subscript𝑎3\displaystyle\Delta_{02}(-|a_{1}|,-|a_{3}|)roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =Δ02(|a1|,|a3|).absentsubscriptΔ02subscript𝑎1subscript𝑎3\displaystyle=\Delta_{02}(|a_{1}|,|a_{3}|)\,.= roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) . (7.7d)

    Hence there are only two independent expressions, viz. Δ02(|a1|,|a3|)subscriptΔ02subscript𝑎1subscript𝑎3\Delta_{02}(|a_{1}|,|a_{3}|)roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) and Δ02(|a1|,|a3|)subscriptΔ02subscript𝑎1subscript𝑎3\Delta_{02}(|a_{1}|,-|a_{3}|)roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ). Hence the problem with eq. (7.4b) cannot be explained as a misprint in the assignment of ±plus-or-minus\pm± signs for ±|a1|plus-or-minussubscript𝑎1\pm|a_{1}|± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | and/or ±|a3|plus-or-minussubscript𝑎3\pm|a_{3}|± | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT |.

  • 2.

    Putting a1=a3=0subscript𝑎1subscript𝑎30a_{1}=a_{3}=0italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 0 yields Δ02(0,0)=4subscriptΔ02004\Delta_{02}(0,0)=4roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( 0 , 0 ) = 4, i.e. a positive number. Let us therefore tentatively reverse the second inequality in eq. (7.4b) as follows

    𝒟02=?{Δ02(|a1|,|a3|)>0}{Δ02(|a1|,|a3|)>0}.subscript𝒟02?subscriptΔ02subscript𝑎1subscript𝑎30subscriptΔ02subscript𝑎1subscript𝑎30\mathcal{D}_{02}\ =?\ \{\Delta_{02}(|a_{1}|,|a_{3}|)>0\}\cap\{\Delta_{02}(|a_{% 1}|,-|a_{3}|)>0\}\,.caligraphic_D start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT = ? { roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) > 0 } ∩ { roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) > 0 } . (7.8)

    Putting a1=0subscript𝑎10a_{1}=0italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0 yields Δ02(0,|a3|)=Δ02(0,|a3|)=27|a3|2+4subscriptΔ020subscript𝑎3subscriptΔ020subscript𝑎327superscriptsubscript𝑎324\Delta_{02}(0,|a_{3}|)=\Delta_{02}(0,-|a_{3}|)=27|a_{3}|^{2}+4roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( 0 , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) = roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( 0 , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) = 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4, i.e. both expressions are equal and positive definite, so eq. (7.8) is satisfied for all a3subscript𝑎3a_{3}italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT. Hence, if eq. (7.8) is taken seriously, it implies that for a1=0subscript𝑎10a_{1}=0italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0, the series solution of eq. (7.5) converges absolutely for all a3subscript𝑎3a_{3}italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT. We know this is false. If a1=0subscript𝑎10a_{1}=0italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0, then eq. (7.5) reduces to the following trinomial equation 1+x2+a3x3=01superscript𝑥2subscript𝑎3superscript𝑥301+x^{2}+a_{3}x^{3}=01 + italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT = 0. We proved in Sec. 6 that for such a situation the series solution converges absolutely for |a3|24/27superscriptsubscript𝑎32427|a_{3}|^{2}\leq 4/27| italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ 4 / 27.

Hence there is no assignment of ±plus-or-minus\pm± signs for ±|a1|plus-or-minussubscript𝑎1\pm|a_{1}|± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | and/or ±|a3|plus-or-minussubscript𝑎3\pm|a_{3}|± | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT |, nor any reversal of the inequalities in eq. (7.4b), which leads to a correct formula for the domain of convergence 𝒟02subscript𝒟02\mathcal{D}_{02}caligraphic_D start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT. The error in eq. (7.4b) cannot be explained as a misprint in [36]. A more careful treatment is therefore required, and will be presented in the next section. We shall also deal with the other cases 𝒟13subscript𝒟13\mathcal{D}_{13}caligraphic_D start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT and 𝒟23subscript𝒟23\mathcal{D}_{23}caligraphic_D start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT in Sec. 8.

8 Algebraic equations: convergence of series II

8.1 Revised formalism

We present a more careful analysis of the problem of the domain of absolute convergence of the series solution of an algebraic equation below. To make the exposition self-contained, we begin from scratch, although we shall attempt to minimize repetition of material already presented earlier in this paper. The original polynomial is

𝒫(x)=a0+a1x++anxn.𝒫𝑥subscript𝑎0subscript𝑎1𝑥subscript𝑎𝑛superscript𝑥𝑛\mathscr{P}(x)=a_{0}+a_{1}x+\cdots+a_{n}x^{n}\,.script_P ( italic_x ) = italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + ⋯ + italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT . (8.1)

For the purposes of determining domains of convergence, we assume all the coefficients are nonzero in general. The specialization to cases such as a trinomial is obvious. We fix two integers p𝑝pitalic_p and q𝑞qitalic_q such that 0p<qn0𝑝𝑞𝑛0\leq p<q\leq n0 ≤ italic_p < italic_q ≤ italic_n and derive a transformed polynomial, whose roots are proportional to those of 𝒫(x)𝒫𝑥\mathscr{P}(x)script_P ( italic_x ). Employing (nonzero) constants λ𝜆\lambdaitalic_λ and μ𝜇\muitalic_μ, where x=μy𝑥𝜇𝑦x=\mu yitalic_x = italic_μ italic_y, we obtain

𝒫pq(y)λ𝒫(μy)=b0+b1y++yp++yq++bnyn.subscript𝒫𝑝𝑞𝑦𝜆𝒫𝜇𝑦subscript𝑏0subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞subscript𝑏𝑛superscript𝑦𝑛\mathscr{P}_{pq}(y)\equiv\lambda\mathscr{P}(\mu y)=b_{0}+b_{1}y+\cdots+y^{p}+% \cdots+y^{q}+\cdots+b_{n}y^{n}\,.script_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_y ) ≡ italic_λ script_P ( italic_μ italic_y ) = italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_y + ⋯ + italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ + italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT . (8.2)

Here bj=aj/(ap1μjaqμj)subscript𝑏𝑗subscript𝑎𝑗superscriptsubscript𝑎𝑝1subscript𝜇𝑗superscriptsubscript𝑎𝑞subscript𝜇𝑗b_{j}=a_{j}/(a_{p}^{1-\mu_{j}}a_{q}^{\mu_{j}})italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT / ( italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ), where μj=(jp)/(qp)subscript𝜇𝑗𝑗𝑝𝑞𝑝\mu_{j}=(j-p)/(q-p)italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_j - italic_p ) / ( italic_q - italic_p ). Then bp=bq=1subscript𝑏𝑝subscript𝑏𝑞1b_{p}=b_{q}=1italic_b start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_b start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = 1, by construction. For brevity below, we define the tuples 𝒂=(a0,,an)𝒂subscript𝑎0subscript𝑎𝑛\bm{a}=(a_{0},\dots,a_{n})bold_italic_a = ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) and 𝒃=(b0,,[p],,[q],,bn)𝒃subscript𝑏0delimited-[]𝑝delimited-[]𝑞subscript𝑏𝑛\bm{b}=(b_{0},\dots,[p],\dots,[q],\dots,b_{n})bold_italic_b = ( italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , [ italic_p ] , … , [ italic_q ] , … , italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). Then 𝒂𝒂\bm{a}bold_italic_a and 𝒃𝒃\bm{b}bold_italic_b contain respectively n+1𝑛1n+1italic_n + 1 and n1𝑛1n-1italic_n - 1 components. We solve for a root ymsubscript𝑦𝑚y_{m}italic_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT where 𝒫pq(ym)=0subscript𝒫𝑝𝑞subscript𝑦𝑚0\mathscr{P}_{pq}(y_{m})=0script_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) = 0. Note that ymsubscript𝑦𝑚y_{m}italic_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT also depends on p𝑝pitalic_p and q𝑞qitalic_q, but we omit this for brevity. We express ymsubscript𝑦𝑚y_{m}italic_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT as a multivariate power series in the n1𝑛1n-1italic_n - 1 scaled coefficients bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, where j𝒩npq𝑗subscript𝒩𝑛𝑝𝑞j\in\mathscr{N}_{npq}italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT. Recall 𝒩npq={0,1,,n}{p,q}subscript𝒩𝑛𝑝𝑞01𝑛𝑝𝑞\mathscr{N}_{npq}=\{0,1,\dots,n\}\setminus\{p,q\}script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT = { 0 , 1 , … , italic_n } ∖ { italic_p , italic_q }. We saw previously that this procedure yields qp𝑞𝑝q-pitalic_q - italic_p roots, so m=0,,qp1𝑚0𝑞𝑝1m=0,\dots,q-p-1italic_m = 0 , … , italic_q - italic_p - 1. We know the domain of absolute convergence of the resulting power series for ymsubscript𝑦𝑚y_{m}italic_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT includes a nonempty open neighborhood of the origin 𝟎pqsubscript0𝑝𝑞\bm{0}_{pq}bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT, where bj=0subscript𝑏𝑗0b_{j}=0italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 for all j𝒩npq𝑗subscript𝒩𝑛𝑝𝑞j\in\mathscr{N}_{npq}italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT. We also know the domain of absolute convergence is a complete Reinhardt domain and depends only on the amplitudes |bj|subscript𝑏𝑗|b_{j}|| italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |, i.e. the domain is the same for all the qp𝑞𝑝q-pitalic_q - italic_p roots ymsubscript𝑦𝑚y_{m}italic_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.

Next note that just as the original polynomial 𝒫(x)𝒫𝑥\mathscr{P}(x)script_P ( italic_x ) can always be transformed to 𝒫pq(y)subscript𝒫𝑝𝑞𝑦\mathscr{P}_{pq}(y)script_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_y ), the same procedure also transforms the discriminant Δ(𝒂)Δ𝒂\Delta(\bm{a})roman_Δ ( bold_italic_a ) of 𝒫(x)𝒫𝑥\mathscr{P}(x)script_P ( italic_x ) to the scaled discriminant Δpq(𝒃)subscriptΔ𝑝𝑞𝒃\Delta_{pq}(\bm{b})roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) of 𝒫pq(y)subscript𝒫𝑝𝑞𝑦\mathscr{P}_{pq}(y)script_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_y ). Then Δpq(𝒃)=λ2(n1)μn(n1)Δ(𝒂)subscriptΔ𝑝𝑞𝒃superscript𝜆2𝑛1superscript𝜇𝑛𝑛1Δ𝒂\Delta_{pq}(\bm{b})=\lambda^{2(n-1)}\mu^{n(n-1)}\Delta(\bm{a})roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) = italic_λ start_POSTSUPERSCRIPT 2 ( italic_n - 1 ) end_POSTSUPERSCRIPT italic_μ start_POSTSUPERSCRIPT italic_n ( italic_n - 1 ) end_POSTSUPERSCRIPT roman_Δ ( bold_italic_a ). Thus far, the argument is correct. It is, however, false to conclude that the boundary of the domain of absolute convergence is determined by the scaled discriminants given by Δpq(±|b0|,,[p],,[q],,±|bn|)subscriptΔ𝑝𝑞plus-or-minussubscript𝑏0delimited-[]𝑝delimited-[]𝑞plus-or-minussubscript𝑏𝑛\Delta_{pq}(\pm|b_{0}|,\dots,[p],\dots,[q],\dots,\pm|b_{n}|)roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( ± | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , … , [ italic_p ] , … , [ italic_q ] , … , ± | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ), specifically, by solving for the hypersurfaces given by Δpq(±|b0|,,[p],,[q],,±|bn|)=0subscriptΔ𝑝𝑞plus-or-minussubscript𝑏0delimited-[]𝑝delimited-[]𝑞plus-or-minussubscript𝑏𝑛0\Delta_{pq}(\pm|b_{0}|,\dots,[p],\dots,[q],\dots,\pm|b_{n}|)=0roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( ± | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , … , [ italic_p ] , … , [ italic_q ] , … , ± | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ) = 0. For example, we saw above that this led to erroneous results for the domains of convergence for the series solutions of a cubic.

The weak point is that there are additional discriminants, which are also required to determine the boundary of the domain of absolute convergence. To see this, let us review the key steps. We employ fresh notation to avoid confusion with the above symbols. For brevity below, define an (n1)𝑛1(n-1)( italic_n - 1 )-tuple of ±plus-or-minus\pm± signs

𝝈=(σ0,,[p],,[q],,σn).𝝈subscript𝜎0delimited-[]𝑝delimited-[]𝑞subscript𝜎𝑛\bm{\sigma}=(\sigma_{0},\dots,[p],\dots,[q],\dots,\sigma_{n})\,.bold_italic_σ = ( italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , [ italic_p ] , … , [ italic_q ] , … , italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) . (8.3)

Here σj=±1subscript𝜎𝑗plus-or-minus1\sigma_{j}=\pm 1italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ± 1 for j𝒩npq𝑗subscript𝒩𝑛𝑝𝑞j\in\mathscr{N}_{npq}italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT. The dependence of 𝝈𝝈\bm{\sigma}bold_italic_σ and σjsubscript𝜎𝑗\sigma_{j}italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT on p𝑝pitalic_p and q𝑞qitalic_q is taken as understood. We also define the set ΣpqsubscriptΣ𝑝𝑞\Sigma_{pq}roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT of all the distinct tuples 𝝈𝝈\bm{\sigma}bold_italic_σ. Then ΣpqsubscriptΣ𝑝𝑞\Sigma_{pq}roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT has cardinality 2n1superscript2𝑛12^{n-1}2 start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. The power series solutions for the roots of the following 2n+1superscript2𝑛12^{n+1}2 start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT algebraic equations all have the same domain of absolute convergence

σ0|b0|+σ1|b1|y+±yp±yq++σn|bn|yn=0.plus-or-minussubscript𝜎0subscript𝑏0subscript𝜎1subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞subscript𝜎𝑛subscript𝑏𝑛superscript𝑦𝑛0\sigma_{0}|b_{0}|+\sigma_{1}|b_{1}|y+\cdots\pm y^{p}\cdots\pm y^{q}+\cdots+% \sigma_{n}|b_{n}|y^{n}=0\,.italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | + italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_y + ⋯ ± italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ⋯ ± italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = 0 . (8.4)

The coefficient of yjsuperscript𝑦𝑗y^{j}italic_y start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT is permitted to be ±|bj|plus-or-minussubscript𝑏𝑗\pm|b_{j}|± | italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | only. We can always divide through by 11-1- 1, if necessary, so that the coefficient of ypsuperscript𝑦𝑝y^{p}italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT is unity. This yields 2nsuperscript2𝑛2^{n}2 start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT distinct equations. The discriminant of the associated polynomial is Δ(σ0|b0|,,1,,±1,,σn|bn|)Δsubscript𝜎0subscript𝑏01plus-or-minus1subscript𝜎𝑛subscript𝑏𝑛\Delta(\sigma_{0}|b_{0}|,\cdots,1,\cdots,\pm 1,\cdots,\sigma_{n}|b_{n}|)roman_Δ ( italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ⋯ , 1 , ⋯ , ± 1 , ⋯ , italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ). Let us now define the following two families (or sets) of discriminants. We employ the symbol ΨΨ\Psiroman_Ψ to avoid confusion with ΔpqsubscriptΔ𝑝𝑞\Delta_{pq}roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT above. Then, with 1111 in the pthsuperscript𝑝𝑡p^{th}italic_p start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT slot and ±1plus-or-minus1\pm 1± 1 in the qthsuperscript𝑞𝑡q^{th}italic_q start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT slot, we define

Ψpq+(𝒃,𝝈)subscriptsuperscriptΨ𝑝𝑞𝒃𝝈\displaystyle\Psi^{+}_{pq}(\bm{b},\bm{\sigma})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b , bold_italic_σ ) =Δ(σ0|b0|,,1,,1,,σn|bn|),absentΔsubscript𝜎0subscript𝑏011subscript𝜎𝑛subscript𝑏𝑛\displaystyle=\Delta(\sigma_{0}|b_{0}|,\cdots,1,\cdots,\phantom{-}1,\cdots,% \sigma_{n}|b_{n}|)\,,= roman_Δ ( italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ⋯ , 1 , ⋯ , 1 , ⋯ , italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ) , (8.5a)
Ψpq(𝒃,𝝈)subscriptsuperscriptΨ𝑝𝑞𝒃𝝈\displaystyle\Psi^{-}_{pq}(\bm{b},\bm{\sigma})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b , bold_italic_σ ) =Δ(σ0|b0|,,1,,1,,σn|bn|),absentΔsubscript𝜎0subscript𝑏011subscript𝜎𝑛subscript𝑏𝑛\displaystyle=\Delta(\sigma_{0}|b_{0}|,\cdots,1,\cdots,-1,\cdots,\sigma_{n}|b_% {n}|)\,,= roman_Δ ( italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ⋯ , 1 , ⋯ , - 1 , ⋯ , italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ) , (8.5b)
Ψpq+(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\displaystyle\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) ={Ψpq+(𝒃,𝝈)|𝝈Σpq},absentconditional-setsubscriptsuperscriptΨ𝑝𝑞𝒃𝝈𝝈subscriptΣ𝑝𝑞\displaystyle=\{\Psi^{+}_{pq}(\bm{b},\bm{\sigma})\,|\,\bm{\sigma}\in\Sigma_{pq% }\}\,,= { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b , bold_italic_σ ) | bold_italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT } , (8.5c)
Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\displaystyle\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) ={Ψpq(𝒃,𝝈)|𝝈Σpq}.absentconditional-setsubscriptsuperscriptΨ𝑝𝑞𝒃𝝈𝝈subscriptΣ𝑝𝑞\displaystyle=\{\Psi^{-}_{pq}(\bm{b},\bm{\sigma})\,|\,\bm{\sigma}\in\Sigma_{pq% }\}\,.= { roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b , bold_italic_σ ) | bold_italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT } . (8.5d)

Each set has at most 2n1superscript2𝑛12^{n-1}2 start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT distinct elements. The following lemma shows that the family Ψpq(𝒃)superscriptsubscriptΨ𝑝𝑞𝒃\Psi_{pq}^{-}(\bm{b})roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b ) is nontrivial.

Lemma 8.1.

If qp𝑞𝑝q-pitalic_q - italic_p is odd, the sets Ψpq+(𝐛)subscriptsuperscriptΨ𝑝𝑞𝐛\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) and Ψpq(𝐛)subscriptsuperscriptΨ𝑝𝑞𝐛\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) are identical. If qp𝑞𝑝q-pitalic_q - italic_p is even, the sets Ψpq+(𝐛)subscriptsuperscriptΨ𝑝𝑞𝐛\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) and Ψpq(𝐛)subscriptsuperscriptΨ𝑝𝑞𝐛\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) are disjoint.

Proof.

Introduce two additional tuples 𝝈superscript𝝈\bm{\sigma}^{\prime}bold_italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT where σj=(1)jσjsuperscriptsubscript𝜎𝑗superscript1𝑗subscript𝜎𝑗\sigma_{j}^{\prime}=(-1)^{j}\sigma_{j}italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( - 1 ) start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and 𝝈superscript𝝈-\bm{\sigma}^{\prime}- bold_italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT where obviously the components are (1)jσjsuperscript1𝑗subscript𝜎𝑗-(-1)^{j}\sigma_{j}- ( - 1 ) start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. Then consider the polynomials

P±(y,𝝈)=σ0|b0|+σ1|b1|y++yp+±yq++σn|bn|yn.subscript𝑃plus-or-minus𝑦𝝈plus-or-minussubscript𝜎0subscript𝑏0subscript𝜎1subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞subscript𝜎𝑛subscript𝑏𝑛superscript𝑦𝑛P_{\pm}(y,\bm{\sigma})=\sigma_{0}|b_{0}|+\sigma_{1}|b_{1}|y+\cdots+y^{p}+% \cdots\pm y^{q}+\cdots+\sigma_{n}|b_{n}|y^{n}\,.italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT ( italic_y , bold_italic_σ ) = italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | + italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_y + ⋯ + italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ ± italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT . (8.6)

The only permitted transformations of P+subscript𝑃P_{+}italic_P start_POSTSUBSCRIPT + end_POSTSUBSCRIPT and Psubscript𝑃P_{-}italic_P start_POSTSUBSCRIPT - end_POSTSUBSCRIPT are to reverse the sign of y𝑦yitalic_y and/or to multiply P±subscript𝑃plus-or-minusP_{\pm}italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT by 11-1- 1, because the coefficient of yjsuperscript𝑦𝑗y^{j}italic_y start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT must be ±|bj|plus-or-minussubscript𝑏𝑗\pm|b_{j}|± | italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | only. By construction, the discriminant of P+(y,𝝈)subscript𝑃𝑦𝝈P_{+}(y,\bm{\sigma})italic_P start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( italic_y , bold_italic_σ ) is an element of Ψpq+(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) and that of P(y,𝝈)subscript𝑃𝑦𝝈P_{-}(y,\bm{\sigma})italic_P start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( italic_y , bold_italic_σ ) is an element of Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ). First suppose qp𝑞𝑝q-pitalic_q - italic_p is odd. If p𝑝pitalic_p is even and q𝑞qitalic_q is odd, then

P±(y,𝝈)=σ0|b0|σ1|b1|y++yp+yq++(1)nσn|bn|yn=P(y,𝝈).subscript𝑃plus-or-minus𝑦𝝈minus-or-plussubscript𝜎0subscript𝑏0subscript𝜎1subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞superscript1𝑛subscript𝜎𝑛subscript𝑏𝑛superscript𝑦𝑛subscript𝑃minus-or-plus𝑦superscript𝝈\begin{split}P_{\pm}(-y,\bm{\sigma})&=\sigma_{0}|b_{0}|-\sigma_{1}|b_{1}|y+% \cdots+y^{p}+\cdots\mp y^{q}+\cdots+(-1)^{n}\sigma_{n}|b_{n}|y^{n}\\ &=P_{\mp}(y,\bm{\sigma}^{\prime})\,.\end{split}start_ROW start_CELL italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT ( - italic_y , bold_italic_σ ) end_CELL start_CELL = italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | - italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_y + ⋯ + italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ ∓ italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + ( - 1 ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_P start_POSTSUBSCRIPT ∓ end_POSTSUBSCRIPT ( italic_y , bold_italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . end_CELL end_ROW (8.7)

If p𝑝pitalic_p is odd and q𝑞qitalic_q is even, then

P±(y,𝝈)=σ0|b0|+σ1|b1|y++yp+yq++(1)n+1σn|bn|yn=P(y,𝝈).subscript𝑃plus-or-minus𝑦𝝈minus-or-plussubscript𝜎0subscript𝑏0subscript𝜎1subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞superscript1𝑛1subscript𝜎𝑛subscript𝑏𝑛superscript𝑦𝑛subscript𝑃minus-or-plus𝑦superscript𝝈\begin{split}-P_{\pm}(-y,\bm{\sigma})&=-\sigma_{0}|b_{0}|+\sigma_{1}|b_{1}|y+% \cdots+y^{p}+\cdots\mp y^{q}+\cdots+(-1)^{n+1}\sigma_{n}|b_{n}|y^{n}\\ &=P_{\mp}(y,-\bm{\sigma}^{\prime})\,.\end{split}start_ROW start_CELL - italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT ( - italic_y , bold_italic_σ ) end_CELL start_CELL = - italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | + italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_y + ⋯ + italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ ∓ italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + ( - 1 ) start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_P start_POSTSUBSCRIPT ∓ end_POSTSUBSCRIPT ( italic_y , - bold_italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . end_CELL end_ROW (8.8)

Cycling through all values of 𝝈𝝈\bm{\sigma}bold_italic_σ shows that the sets Ψpq+(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) and Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) are identical, if qp𝑞𝑝q-pitalic_q - italic_p is odd. Now suppose qp𝑞𝑝q-pitalic_q - italic_p is even. Suppose p𝑝pitalic_p and q𝑞qitalic_q are both even. Then

P±(y,𝝈)=σ0|b0|σ1|b1|y++yp+±yq++(1)nσn|bn|yn=P±(y,𝝈).subscript𝑃plus-or-minus𝑦𝝈plus-or-minussubscript𝜎0subscript𝑏0subscript𝜎1subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞superscript1𝑛subscript𝜎𝑛subscript𝑏𝑛superscript𝑦𝑛subscript𝑃plus-or-minus𝑦superscript𝝈\begin{split}P_{\pm}(-y,\bm{\sigma})&=\sigma_{0}|b_{0}|-\sigma_{1}|b_{1}|y+% \cdots+y^{p}+\cdots\pm y^{q}+\cdots+(-1)^{n}\sigma_{n}|b_{n}|y^{n}\\ &=P_{\pm}(y,\bm{\sigma}^{\prime})\,.\end{split}start_ROW start_CELL italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT ( - italic_y , bold_italic_σ ) end_CELL start_CELL = italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | - italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_y + ⋯ + italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ ± italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + ( - 1 ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT ( italic_y , bold_italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . end_CELL end_ROW (8.9)

The discriminant of P+(y,𝝈)subscript𝑃𝑦𝝈P_{+}(-y,\bm{\sigma})italic_P start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( - italic_y , bold_italic_σ ) is an element of Ψpq+(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ). The other transformations P+(y,𝝈)subscript𝑃𝑦𝝈-P_{+}(y,\bm{\sigma})- italic_P start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( italic_y , bold_italic_σ ) and P+(y,𝝈)subscript𝑃𝑦𝝈-P_{+}(-y,\bm{\sigma})- italic_P start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( - italic_y , bold_italic_σ ) also fail to yield a polynomial with a discriminant which is an element of Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ). Similarly the discriminant of ±P(±y,𝝈)plus-or-minussubscript𝑃plus-or-minus𝑦𝝈\pm P_{-}(\pm y,\bm{\sigma})± italic_P start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( ± italic_y , bold_italic_σ ) is always an element of Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ). Next suppose p𝑝pitalic_p and q𝑞qitalic_q are both odd. Then

P±(y,𝝈)=σ0|b0|+σ1|b1|y++yp+±yq++(1)n+1σn|bn|yn=P±(y,𝝈).subscript𝑃plus-or-minus𝑦𝝈plus-or-minussubscript𝜎0subscript𝑏0subscript𝜎1subscript𝑏1𝑦superscript𝑦𝑝superscript𝑦𝑞superscript1𝑛1subscript𝜎𝑛subscript𝑏𝑛superscript𝑦𝑛subscript𝑃plus-or-minus𝑦superscript𝝈\begin{split}-P_{\pm}(-y,\bm{\sigma})&=-\sigma_{0}|b_{0}|+\sigma_{1}|b_{1}|y+% \cdots+y^{p}+\cdots\pm y^{q}+\cdots+(-1)^{n+1}\sigma_{n}|b_{n}|y^{n}\\ &=P_{\pm}(y,-\bm{\sigma}^{\prime})\,.\end{split}start_ROW start_CELL - italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT ( - italic_y , bold_italic_σ ) end_CELL start_CELL = - italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | + italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_y + ⋯ + italic_y start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT + ⋯ ± italic_y start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT + ⋯ + ( - 1 ) start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_P start_POSTSUBSCRIPT ± end_POSTSUBSCRIPT ( italic_y , - bold_italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . end_CELL end_ROW (8.10)

As was the case when p𝑝pitalic_p and q𝑞qitalic_q were both even, the discriminant of ±P+(±y,𝝈)plus-or-minussubscript𝑃plus-or-minus𝑦𝝈\pm P_{+}(\pm y,\bm{\sigma})± italic_P start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( ± italic_y , bold_italic_σ ) is always an element of Ψpq+(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) and the discriminant of ±P(±y,𝝈)plus-or-minussubscript𝑃plus-or-minus𝑦𝝈\pm P_{-}(\pm y,\bm{\sigma})± italic_P start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( ± italic_y , bold_italic_σ ) is always an element of Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ). Hence if qp𝑞𝑝q-pitalic_q - italic_p is even, the sets Ψpq+(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) and Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) are disjoint. ∎

Armed with this additional information, we return to eq. (7.5) and the case p=0𝑝0p=0italic_p = 0 and q=2𝑞2q=2italic_q = 2. There are four distinct discriminants which can contribute to the boundary of the domain of convergence, viz.

Ψ02+(|a1|,|a3|)subscriptsuperscriptΨ02subscript𝑎1subscript𝑎3\displaystyle\Psi^{+}_{02}(|a_{1}|,|a_{3}|)roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|a3|2+4|a1|3|a3|+418|a1||a3||a1|2,absent27superscriptsubscript𝑎324superscriptsubscript𝑎13subscript𝑎3418subscript𝑎1subscript𝑎3superscriptsubscript𝑎12\displaystyle=27|a_{3}|^{2}+4|a_{1}|^{3}|a_{3}|+4-18|a_{1}||a_{3}|-|a_{1}|^{2}\,,= 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | + 4 - 18 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.11a)
Ψ02+(|a1|,|a3|)subscriptsuperscriptΨ02subscript𝑎1subscript𝑎3\displaystyle\Psi^{+}_{02}(|a_{1}|,-|a_{3}|)roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|a3|24|a1|3|a3|+4+18|a1||a3||a1|2,absent27superscriptsubscript𝑎324superscriptsubscript𝑎13subscript𝑎3418subscript𝑎1subscript𝑎3superscriptsubscript𝑎12\displaystyle=27|a_{3}|^{2}-4|a_{1}|^{3}|a_{3}|+4+18|a_{1}||a_{3}|-|a_{1}|^{2}\,,= 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | + 4 + 18 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.11b)
Ψ02(|a1|,|a3|)subscriptsuperscriptΨ02subscript𝑎1subscript𝑎3\displaystyle\Psi^{-}_{02}(|a_{1}|,|a_{3}|)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|a3|2+4|a1|3|a3|4+18|a1||a3||a1|2,absent27superscriptsubscript𝑎324superscriptsubscript𝑎13subscript𝑎3418subscript𝑎1subscript𝑎3superscriptsubscript𝑎12\displaystyle=27|a_{3}|^{2}+4|a_{1}|^{3}|a_{3}|-4+18|a_{1}||a_{3}|-|a_{1}|^{2}\,,= 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - 4 + 18 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.11c)
Ψ02(|a1|,|a3|)subscriptsuperscriptΨ02subscript𝑎1subscript𝑎3\displaystyle\Psi^{-}_{02}(|a_{1}|,-|a_{3}|)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|a3|24|a1|3|a3|418|a1||a3||a1|2.absent27superscriptsubscript𝑎324superscriptsubscript𝑎13subscript𝑎3418subscript𝑎1subscript𝑎3superscriptsubscript𝑎12\displaystyle=27|a_{3}|^{2}-4|a_{1}|^{3}|a_{3}|-4-18|a_{1}||a_{3}|-|a_{1}|^{2}\,.= 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - 4 - 18 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (8.11d)

The expressions for Ψ02+(|a1|,±|a3|)subscriptsuperscriptΨ02subscript𝑎1plus-or-minussubscript𝑎3\Psi^{+}_{02}(|a_{1}|,\pm|a_{3}|)roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) are the same as for Δ02(|a1|,±|a3|)subscriptΔ02subscript𝑎1plus-or-minussubscript𝑎3\Delta_{02}(|a_{1}|,\pm|a_{3}|)roman_Δ start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) in eq. (7.7). Observe that Ψ02+(0,0)=4subscriptsuperscriptΨ02004\Psi^{+}_{02}(0,0)=4roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( 0 , 0 ) = 4 and Ψ02(0,0)=4subscriptsuperscriptΨ02004\Psi^{-}_{02}(0,0)=-4roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( 0 , 0 ) = - 4. If we set |a1|=0subscript𝑎10|a_{1}|=0| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = 0 then Ψ02(0,±|a3|)=27|a3|24subscriptsuperscriptΨ020plus-or-minussubscript𝑎327superscriptsubscript𝑎324\Psi^{-}_{02}(0,\pm|a_{3}|)=27|a_{3}|^{2}-4roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( 0 , ± | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) = 27 | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4, which yields the correct upper bound for |a3|subscript𝑎3|a_{3}|| italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT |. If we set |a3|=0subscript𝑎30|a_{3}|=0| italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | = 0 then Ψ02+(|a1|,0)=4|a1|2subscriptsuperscriptΨ02subscript𝑎104superscriptsubscript𝑎12\Psi^{+}_{02}(|a_{1}|,0)=4-|a_{1}|^{2}roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , 0 ) = 4 - | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, which yields the correct upper bound for |a1|subscript𝑎1|a_{1}|| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT |. The correct answer requires both Ψ02+subscriptsuperscriptΨ02\Psi^{+}_{02}roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT and Ψ02subscriptsuperscriptΨ02\Psi^{-}_{02}roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT. Some further algebra yields the correct expression for the domain of convergence to be 𝒟02={Ψ02+(|a1|,|a3|)0}{Ψ02(|a1|,|a3|)0}subscript𝒟02subscriptsuperscriptΨ02subscript𝑎1subscript𝑎30subscriptsuperscriptΨ02subscript𝑎1subscript𝑎30\mathcal{D}_{02}=\{\Psi^{+}_{02}(|a_{1}|,|a_{3}|)\geq 0\}\cap\{\Psi^{-}_{02}(|% a_{1}|,|a_{3}|)\leq 0\}caligraphic_D start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT = { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≥ 0 } ∩ { roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≤ 0 }. Note that the series converges on the boundary of its domain of convergence.

The corrected expressions for the domains of absolute convergence for the series solutions for the roots of a cubic as follows. For clarity, we distinguish between the coefficients a0,,a3subscript𝑎0subscript𝑎3a_{0},\dots,a_{3}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT of the original cubic in eq. (7.3) and the coefficients bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in the scaled polynomial 𝒫pq(y)subscript𝒫𝑝𝑞𝑦\mathscr{P}_{pq}(y)script_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_y ). Recall that ‘bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT’ depends also on p𝑝pitalic_p and q𝑞qitalic_q but this is considered to be understood. The domains of convergence 𝒟pqsubscript𝒟𝑝𝑞\mathcal{D}_{pq}caligraphic_D start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT are given by

𝒟01subscript𝒟01\displaystyle\mathcal{D}_{01}caligraphic_D start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT ={Ψ01+(|b2|,|b3|)0},absentsubscriptsuperscriptΨ01subscript𝑏2subscript𝑏30\displaystyle=\{\Psi^{+}_{01}(|b_{2}|,-|b_{3}|)\leq 0\}\,,= { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≤ 0 } , (8.12a)
𝒟02subscript𝒟02\displaystyle\mathcal{D}_{02}caligraphic_D start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ={Ψ02+(|b1|,|b3|)0}{Ψ02(|b1|,|b3|)0},absentsubscriptsuperscriptΨ02subscript𝑏1subscript𝑏30subscriptsuperscriptΨ02subscript𝑏1subscript𝑏30\displaystyle=\{\Psi^{+}_{02}(|b_{1}|,|b_{3}|)\geq 0\}\cap\{\Psi^{-}_{02}(|b_{% 1}|,|b_{3}|)\leq 0\}\,,= { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≥ 0 } ∩ { roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≤ 0 } , (8.12b)
𝒟03subscript𝒟03\displaystyle\mathcal{D}_{03}caligraphic_D start_POSTSUBSCRIPT 03 end_POSTSUBSCRIPT ={Ψ03+(|b1|,|b2|)0},absentsubscriptsuperscriptΨ03subscript𝑏1subscript𝑏20\displaystyle=\{\Psi^{+}_{03}(-|b_{1}|,-|b_{2}|)\geq 0\}\,,= { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 03 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) ≥ 0 } , (8.12c)
𝒟12subscript𝒟12\displaystyle\mathcal{D}_{12}caligraphic_D start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ={Ψ12+(|b0|,|b3|)0}{Ψ12+(|b0|,|b3|)0},absentsubscriptsuperscriptΨ12subscript𝑏0subscript𝑏30subscriptsuperscriptΨ12subscript𝑏0subscript𝑏30\displaystyle=\{\Psi^{+}_{12}(|b_{0}|,-|b_{3}|)\leq 0\}\cap\{\Psi^{+}_{12}(-|b% _{0}|,|b_{3}|)\leq 0\}\,,= { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≤ 0 } ∩ { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≤ 0 } , (8.12d)
𝒟13subscript𝒟13\displaystyle\mathcal{D}_{13}caligraphic_D start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT ={Ψ13+(|b0|,|b2|)0}{Ψ13(|b0|,|b2|)0},absentsubscriptsuperscriptΨ13subscript𝑏0subscript𝑏20subscriptsuperscriptΨ13subscript𝑏0subscript𝑏20\displaystyle=\{\Psi^{+}_{13}(|b_{0}|,|b_{2}|)\geq 0\}\cap\{\Psi^{-}_{13}(|b_{% 0}|,|b_{2}|)\leq 0\}\,,= { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) ≥ 0 } ∩ { roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) ≤ 0 } , (8.12e)
𝒟23subscript𝒟23\displaystyle\mathcal{D}_{23}caligraphic_D start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT ={Ψ23+(|b0|,|b1|)0}.absentsubscriptsuperscriptΨ23subscript𝑏0subscript𝑏10\displaystyle=\{\Psi^{+}_{23}(-|b_{0}|,|b_{1}|)\leq 0\}\,.= { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) ≤ 0 } . (8.12f)

The series converge on the boundaries of their respective domains of convergence. For the case 𝒟23subscript𝒟23\mathcal{D}_{23}caligraphic_D start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT, note that qp=1𝑞𝑝1q-p=1italic_q - italic_p = 1 is odd, hence Ψpq+(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{+}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) and Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) are identical, so the solution is expressed using purely Ψ23+(𝒃)subscriptsuperscriptΨ23𝒃\Psi^{+}_{23}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT ( bold_italic_b ). We can therefore consider the expression in eq. (7.4f) to be a misprint. For the cases 𝒟01subscript𝒟01\mathcal{D}_{01}caligraphic_D start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT and 𝒟13subscript𝒟13\mathcal{D}_{13}caligraphic_D start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT, where qp=2𝑞𝑝2q-p=2italic_q - italic_p = 2 is even, the need for Ψpq(𝒃)subscriptsuperscriptΨ𝑝𝑞𝒃\Psi^{-}_{pq}(\bm{b})roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b ) is essential.

There is an additional caveat, which is that the formula for the domain of convergence cannot always be expressed using purely inequalities. Consider the quartic equation with no term in x3superscript𝑥3x^{3}italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT

a0+a1x+x2+x4=0.subscript𝑎0subscript𝑎1𝑥superscript𝑥2superscript𝑥40a_{0}+a_{1}x+x^{2}+x^{4}=0\,.italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT = 0 . (8.13)

We choose p=2𝑝2p=2italic_p = 2 and q=4𝑞4q=4italic_q = 4 and we have set a2=a4=1subscript𝑎2subscript𝑎41a_{2}=a_{4}=1italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = 1 so that the scaled coefficients are simply bj=ajsubscript𝑏𝑗subscript𝑎𝑗b_{j}=a_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. The boundary of the domain of convergence in this case is determined solely by Ψ24(|a0|,|a1|)subscriptsuperscriptΨ24subscript𝑎0subscript𝑎1\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ). The derivation is omitted. Then

Ψ24(|a0|,|a1|)=16|a0|(14|a0|)2+4(136|a0|)|a1|227|a1|4.superscriptsubscriptΨ24subscript𝑎0subscript𝑎116subscript𝑎0superscript14subscript𝑎024136subscript𝑎0superscriptsubscript𝑎1227superscriptsubscript𝑎14\Psi_{24}^{-}(-|a_{0}|,|a_{1}|)=16|a_{0}|(1-4|a_{0}|)^{2}+4(1-36|a_{0}|)|a_{1}% |^{2}-27|a_{1}|^{4}\,.roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ( 1 - 4 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 ( 1 - 36 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 27 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT . (8.14)
  • 1.

    The discriminant vanishes at the origin: Ψ24(0,0)=0superscriptsubscriptΨ24000\Psi_{24}^{-}(0,0)=0roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( 0 , 0 ) = 0. The significance of this will be discussed below. For now we seek nonzero solutions of the equation Ψ24(|a0|,|a1|)=0superscriptsubscriptΨ24subscript𝑎0subscript𝑎10\Psi_{24}^{-}(-|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0.

  • 2.

    Put a1=0subscript𝑎10a_{1}=0italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0, then Ψ24(|a0|,0)=16|a0|(14|a0|)2subscriptsuperscriptΨ24subscript𝑎0016subscript𝑎0superscript14subscript𝑎02\Psi^{-}_{24}(-|a_{0}|,0)=16|a_{0}|(1-4|a_{0}|)^{2}roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , 0 ) = 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ( 1 - 4 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. This is (proportional to) a perfect square, which equals zero at |a0|=14subscript𝑎014|a_{0}|=\frac{1}{4}| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | = divide start_ARG 1 end_ARG start_ARG 4 end_ARG. The necessary upper bound on |a0|subscript𝑎0|a_{0}|| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | for this problem is known to be |a0|14subscript𝑎014|a_{0}|\leq\frac{1}{4}| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG 4 end_ARG.

  • 3.

    Next put a0=0subscript𝑎00a_{0}=0italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0, then Ψ24(0,|a1|)=(427|a1|2)|a1|2subscriptsuperscriptΨ240subscript𝑎1427superscriptsubscript𝑎12superscriptsubscript𝑎12\Psi^{-}_{24}(0,|a_{1}|)=(4-27|a_{1}|^{2})|a_{1}|^{2}roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( 0 , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = ( 4 - 27 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. For nonzero a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, this vanishes at |a1|=4/27subscript𝑎1427|a_{1}|=\sqrt{4/27}| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = square-root start_ARG 4 / 27 end_ARG. The necessary upper bound on |a1|subscript𝑎1|a_{1}|| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | for this problem is known to be |a1|4/27subscript𝑎1427|a_{1}|\leq\sqrt{4/27}| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ≤ square-root start_ARG 4 / 27 end_ARG.

  • 4.

    Next let us put |a0|=asubscript𝑎0𝑎|a_{0}|=a| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | = italic_a and |a1|=12asubscript𝑎112𝑎|a_{1}|=\frac{1}{2}a| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_a, where a+𝑎subscripta\in\mathbb{R}_{+}italic_a ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT. The graph of Ψ24(a,12a)subscriptsuperscriptΨ24𝑎12𝑎\Psi^{-}_{24}(-a,\frac{1}{2}a)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - italic_a , divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_a ) is plotted against a𝑎aitalic_a in Fig. 1. As the value of a𝑎aitalic_a increases from zero, initially Ψ24(0,0)=0subscriptsuperscriptΨ24000\Psi^{-}_{24}(0,0)=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( 0 , 0 ) = 0 for a=0𝑎0a=0italic_a = 0, then the value of Ψ24(a,12a)subscriptsuperscriptΨ24𝑎12𝑎\Psi^{-}_{24}(-a,\frac{1}{2}a)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - italic_a , divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_a ) is positive, reaches a maximum, then it changes sign and becomes negative, reaches a minimum and then becomes positive again and increases to ++\infty+ ∞ thereafter. The value of Ψ24(a,12a)subscriptsuperscriptΨ24𝑎12𝑎\Psi^{-}_{24}(-a,\frac{1}{2}a)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - italic_a , divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_a ) is thus not monotonic in a𝑎aitalic_a.

  • 5.

    All of the above facts demonstrate that an unconditional inequality Ψ24(|a0|,|a1|)0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)\geq 0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) ≥ 0 is insufficient to determine the domain of convergence. First, the discriminant vanishes at the origin. We need to exclude the origin as a solution, because we know the domain of convergence has positive measure. Even after doing so, we require an additional stipulation “the domain of convergence includes only the component which satisfies the inequality and is connected to the origin.”

  • 6.

    It is implicit in [36, Thm. 3] that the domain of convergence includes only the component connected to the origin. What is not clear is that the formula for the domain of convergence cannot always be expressed using only unconditional inequalities on the values of the discriminants. The stipulation “the component connected to the origin” is necessary.

  • 7.

    We remark in passing that for this problem, the domain of convergence is determined solely by a discriminant of the form ΨpqsubscriptsuperscriptΨ𝑝𝑞\Psi^{-}_{pq}roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT. A discriminant of the form Ψpq+subscriptsuperscriptΨ𝑝𝑞\Psi^{+}_{pq}roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT, i.e. ΔpqsubscriptΔ𝑝𝑞\Delta_{pq}roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT in the formalism in [36], does not appear.

One source of the difficulty is that if a1=0subscript𝑎10a_{1}=0italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0 in eq. (8.13), it becomes an algebraic equation in x2superscript𝑥2x^{2}italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, viz. a0+x2+x4=0subscript𝑎0superscript𝑥2superscript𝑥40a_{0}+x^{2}+x^{4}=0italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT = 0. For a polynomial of degree n𝑛nitalic_n such as in eq. (8.1), the discriminant of 𝒫(xm)𝒫superscript𝑥𝑚\mathscr{P}(x^{m})script_P ( italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ), for a positive integer m𝑚mitalic_m, is given by

Δ(𝒫(xm))=(1)nm(m1)/2mmn(a0an)m1(Δ(𝒫(x)))m.Δ𝒫superscript𝑥𝑚superscript1𝑛𝑚𝑚12superscript𝑚𝑚𝑛superscriptsubscript𝑎0subscript𝑎𝑛𝑚1superscriptΔ𝒫𝑥𝑚\Delta(\mathscr{P}(x^{m}))=(-1)^{nm(m-1)/2}m^{mn}(a_{0}a_{n})^{m-1}\bigl{(}% \Delta(\mathscr{P}(x))\bigr{)}^{m}\,.roman_Δ ( script_P ( italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) ) = ( - 1 ) start_POSTSUPERSCRIPT italic_n italic_m ( italic_m - 1 ) / 2 end_POSTSUPERSCRIPT italic_m start_POSTSUPERSCRIPT italic_m italic_n end_POSTSUPERSCRIPT ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT ( roman_Δ ( script_P ( italic_x ) ) ) start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT . (8.15)

Hence if m𝑚mitalic_m is even, the discriminant Δ(𝒫(xm))Δ𝒫superscript𝑥𝑚\Delta(\mathscr{P}(x^{m}))roman_Δ ( script_P ( italic_x start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) ) will not change sign as the (absolute values of the) coefficients are varied. Hence, in general, an unconditional inequality on the value of the discriminant(s) is insufficient to determine the domain of convergence. This feature will occur generically (or at least, cannot be ruled out) for a quartic and algebraic equations of all higher degrees, for example if the coefficients of all the odd powers of x𝑥xitalic_x are set to zero.

8.2 General formula

We have seen that the formalism in [36] must be augmented by the inclusion of an extra set of discriminants. Although this yields the correct result for a cubic, as in eq. (8.12), the procedure in [36] becomes tedious for polynomials of high degree, and we have seen that it is prone to error. We seek a procedure that yields a single ‘general formula’ valid for arbitrary n𝑛nitalic_n, which is simpler to state and to compute, for practical work. This can be accomplished via the use of hyperplanes and foliations, as will be explained below. (N.B. the word ‘single’ was employed informally above; we shall require at least two formulas.)

Still speaking informally, given an algebraic equation of degree n𝑛nitalic_n with a coefficient tuple 𝒂𝒂\bm{a}bold_italic_a and a choice for p𝑝pitalic_p and q𝑞qitalic_q, hence a scaled tuple 𝒃𝒃\bm{b}bold_italic_b, the equations Ψpq+(𝒃,𝝈)=0subscriptsuperscriptΨ𝑝𝑞𝒃𝝈0\Psi^{+}_{pq}(\bm{b},\bm{\sigma})=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b , bold_italic_σ ) = 0 and Ψpq(𝒃,𝝈)=0subscriptsuperscriptΨ𝑝𝑞𝒃𝝈0\Psi^{-}_{pq}(\bm{b},\bm{\sigma})=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b , bold_italic_σ ) = 0, taken over all 𝝈Σpq𝝈subscriptΣ𝑝𝑞\bm{\sigma}\in\Sigma_{pq}bold_italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT, specify a set of hyperplanes in the amplitudes |bj|subscript𝑏𝑗|b_{j}|| italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |. The domain of convergence in +n1superscriptsubscript𝑛1\mathbb{R}_{+}^{n-1}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT is given by the set of hyperplanes closest to the origin and which together bound a region which is connected to the origin 𝟎pqsubscript0𝑝𝑞\bm{0}_{pq}bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT. The domain of convergence for 𝒃n1𝒃superscript𝑛1\bm{b}\in\mathbb{C}^{n-1}bold_italic_b ∈ blackboard_C start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT is the inverse image of the above domain in +n1superscriptsubscript𝑛1\mathbb{R}_{+}^{n-1}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. The domain of absolute convergence is clearly unique. If there were two or more sets of such hyperplanes, the full domain of absolute convergence would simply be the union of the individual domains. However, one reason the above discussion is informal is that we saw that the discriminant can vanish at the origin. Hence to write an equation such as ‘Ψpq±(𝒃,𝝈)=0subscriptsuperscriptΨplus-or-minus𝑝𝑞𝒃𝝈0\Psi^{\pm}_{pq}(\bm{b},\bm{\sigma})=0roman_Ψ start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( bold_italic_b , bold_italic_σ ) = 0’ is not precise enough for our needs.

We now sharpen the above ideas. Clearly, the domain of absolute convergence is determined solely by the amplitudes |bj|subscript𝑏𝑗|b_{j}|| italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |, j𝒩npq𝑗subscript𝒩𝑛𝑝𝑞j\in\mathscr{N}_{npq}italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT. We previously denoted the doman of absolute convergence by 𝒟𝒟\mathcal{D}caligraphic_D and introduced its image Log(𝒟)Log𝒟\textrm{Log}(\mathcal{D})Log ( caligraphic_D ). Here we define a second image via an ‘amplitude map’ k+ksuperscript𝑘superscriptsubscript𝑘\mathbb{C}^{k}\to\mathbb{R}_{+}^{k}blackboard_C start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT where zj|zj|maps-tosubscript𝑧𝑗subscript𝑧𝑗z_{j}\mapsto|z_{j}|italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ↦ | italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | for j=1,,k𝑗1𝑘j=1,\dots,kitalic_j = 1 , … , italic_k:

𝒟={(|z1|,,|zk|)|𝒛𝒟}.𝒟conditionalsubscript𝑧1subscript𝑧𝑘𝒛𝒟\mathscr{D}=\{(|z_{1}|,\dots,|z_{k}|)\,|\,\bm{z}\in\mathcal{D}\}\,.script_D = { ( | italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , … , | italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ) | bold_italic_z ∈ caligraphic_D } . (8.16)

From the previous discussion of polydiscs and a complete Reinhardt domain, 𝒛𝒟𝒛𝒟\bm{z}\in\mathcal{D}bold_italic_z ∈ caligraphic_D if and only if (|z1|,,|zk|)𝒟subscript𝑧1subscript𝑧𝑘𝒟(|z_{1}|,\dots,|z_{k}|)\in\mathscr{D}( | italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , … , | italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ) ∈ script_D. Clearly also Log(𝒟)=Log(𝒟)Log𝒟Log𝒟\textrm{Log}(\mathscr{D})=\textrm{Log}(\mathcal{D})Log ( script_D ) = Log ( caligraphic_D ). Our interest is the case of an algebraic equation of degree n𝑛nitalic_n, so k=n1𝑘𝑛1k=n-1italic_k = italic_n - 1 and 𝒛=𝒃𝒛𝒃\bm{z}=\bm{b}bold_italic_z = bold_italic_b. We shall derive a formula to determine the domain 𝒟𝒟\mathscr{D}script_D in this case. Obviously 𝟎pq𝒟subscript0𝑝𝑞𝒟\bm{0}_{pq}\in\mathscr{D}bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ∈ script_D. Recall that for an algebraic equation of degree n𝑛nitalic_n and fixed p,q𝑝𝑞p,qitalic_p , italic_q, then μj=(jp)/(qp)subscript𝜇𝑗𝑗𝑝𝑞𝑝\mu_{j}=(j-p)/(q-p)italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_j - italic_p ) / ( italic_q - italic_p ). From eq. (3.4), let us define, for algebraic equations,

b^j=1|μj|μj|1μj|1μj.subscript^𝑏𝑗1superscriptsubscript𝜇𝑗subscript𝜇𝑗superscript1subscript𝜇𝑗1subscript𝜇𝑗\hat{b}_{j}=\frac{1}{|\mu_{j}|^{\mu_{j}}|1-\mu_{j}|^{1-\mu_{j}}}\,.over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG . (8.17)

Recall one must have |bj|b^jsubscript𝑏𝑗subscript^𝑏𝑗|b_{j}|\leq\hat{b}_{j}| italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for j𝒩npq𝑗subscript𝒩𝑛𝑝𝑞j\in\mathscr{N}_{npq}italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT. It follows that 𝒟𝒟^𝒟^𝒟\mathscr{D}\subset\mathscr{\hat{D}}script_D ⊂ over^ start_ARG script_D end_ARG where the ‘hypercuboid’ is

𝒟^={(|b1|,,[p],,[q],,|bn|)||bj|b^j,j𝒩npq}.\mathscr{\hat{D}}=\biggl{\{}(|b_{1}|,\dots,[p],\dots,[q],\dots,|b_{n}|)\;% \biggl{|}\;|b_{j}|\leq\hat{b}_{j},\,j\in\mathscr{N}_{npq}\biggr{\}}\,.over^ start_ARG script_D end_ARG = { ( | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , … , [ italic_p ] , … , [ italic_q ] , … , | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ) | | italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT } . (8.18)

The following n1𝑛1n-1italic_n - 1 vertices of the hypercuboid lie in the domain of convergence, viz. (b^0,0,,0)subscript^𝑏000(\hat{b}_{0},0,\dots,0)( over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 0 , … , 0 ), (0,b^1,0,,0)0subscript^𝑏100(0,\hat{b}_{1},0,\dots,0)( 0 , over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , 0 , … , 0 ), …, (0,,b^n)0subscript^𝑏𝑛(0,\dots,\hat{b}_{n})( 0 , … , over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). We also know that 𝒟𝒟\mathscr{D}script_D has positive measure and 𝒟𝒟ˇˇ𝒟𝒟\mathscr{D}\supset\mathscr{\check{D}}script_D ⊃ overroman_ˇ start_ARG script_D end_ARG, where

𝒟ˇ={(|b1|,,[p],,[q],,|bn|)|j𝒩npq|bj|1|μ|μ|1μ|1μ}.\mathscr{\check{D}}=\biggl{\{}(|b_{1}|,\dots,[p],\dots,[q],\dots,|b_{n}|)\,% \biggl{|}\,\sum_{j\in\mathscr{N}_{npq}}|b_{j}|\leq\frac{1}{|\mu_{*}|^{\mu_{*}}% |1-\mu_{*}|^{1-\mu_{*}}}\biggr{\}}\,.overroman_ˇ start_ARG script_D end_ARG = { ( | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , … , [ italic_p ] , … , [ italic_q ] , … , | italic_b start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ) | ∑ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT | italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG | italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 1 - italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG } . (8.19)

Recall eq. (3.8) and the definition of μsubscript𝜇\mu_{*}italic_μ start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT. We require the following lemma.

Lemma 8.2.

At the origin, exactly one of the two following mutually exclusive possibilities is true: (i) Ψpq+(𝟎pq)=Ψpq(𝟎pq)=0superscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞superscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞0\Psi_{pq}^{+}(\bm{0}_{pq})=\Psi_{pq}^{-}(\bm{0}_{pq})=0roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) = roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) = 0, or (ii) Ψpq+(𝟎pq)=±Ψpq(𝟎pq)0superscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞plus-or-minussuperscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞0\Psi_{pq}^{+}(\bm{0}_{pq})=\pm\Psi_{pq}^{-}(\bm{0}_{pq})\neq 0roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) = ± roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) ≠ 0. (Explicit mention of 𝛔𝛔\bm{\sigma}bold_italic_σ has been omitted since it is irrelevant at the origin.)

Proof.

The values of Ψpq+(𝟎pq)superscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞\Psi_{pq}^{+}(\bm{0}_{pq})roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) and Ψpq(𝟎pq)superscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞\Psi_{pq}^{-}(\bm{0}_{pq})roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) can be nonzero if and only if the unscaled discriminant Δ(a0,,an)Δsubscript𝑎0subscript𝑎𝑛\Delta(a_{0},\dots,a_{n})roman_Δ ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) contains a term of the form cpqapαaqβsubscript𝑐𝑝𝑞superscriptsubscript𝑎𝑝𝛼superscriptsubscript𝑎𝑞𝛽c_{pq}a_{p}^{\alpha}a_{q}^{\beta}italic_c start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT for some coefficient cpqsubscript𝑐𝑝𝑞c_{pq}italic_c start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT and exponents α𝛼\alphaitalic_α and β𝛽\betaitalic_β. From the homogeneity properties of the discriminant, we must have α+β=2n2𝛼𝛽2𝑛2\alpha+\beta=2n-2italic_α + italic_β = 2 italic_n - 2 and pα+qβ=n(n1)𝑝𝛼𝑞𝛽𝑛𝑛1p\alpha+q\beta=n(n-1)italic_p italic_α + italic_q italic_β = italic_n ( italic_n - 1 ). Hence, given p𝑝pitalic_p and q𝑞qitalic_q, then α𝛼\alphaitalic_α and β𝛽\betaitalic_β are uniquely determined, so there is at most one monomial of this form in the discriminant. We say ‘at most one’ because α=(n1)(2qn)/(qp)𝛼𝑛12𝑞𝑛𝑞𝑝\alpha=(n-1)(2q-n)/(q-p)italic_α = ( italic_n - 1 ) ( 2 italic_q - italic_n ) / ( italic_q - italic_p ) and β=(n1)(n2p)/(qp)𝛽𝑛1𝑛2𝑝𝑞𝑝\beta=(n-1)(n-2p)/(q-p)italic_β = ( italic_n - 1 ) ( italic_n - 2 italic_p ) / ( italic_q - italic_p ) and these values may not be integers. Even if they are integers, the relevant monomial may not appear in the discriminant. After scaling, this term (if it exists) maps to cpqbpαbqβsubscript𝑐𝑝𝑞superscriptsubscript𝑏𝑝𝛼superscriptsubscript𝑏𝑞𝛽c_{pq}b_{p}^{\alpha}b_{q}^{\beta}italic_c start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT. Then at the origin )pq\bm{)}_{pq}bold_) start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT we obtain Ψpq+(𝟎pq)=cpqsuperscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞subscript𝑐𝑝𝑞\Psi_{pq}^{+}(\bm{0}_{pq})=c_{pq}roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) = italic_c start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT and Ψpq(𝟎pq)=(1)βcpqsuperscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞superscript1𝛽subscript𝑐𝑝𝑞\Psi_{pq}^{-}(\bm{0}_{pq})=(-1)^{\beta}c_{pq}roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) = ( - 1 ) start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT. Hence either (i) holds, if cpq=0subscript𝑐𝑝𝑞0c_{pq}=0italic_c start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT = 0, or else (ii) holds, with Ψpq+(𝟎pq)=cpq=±Ψpq(𝟎pq)superscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞subscript𝑐𝑝𝑞plus-or-minussuperscriptsubscriptΨ𝑝𝑞subscript0𝑝𝑞\Psi_{pq}^{+}(\bm{0}_{pq})=c_{pq}=\pm\Psi_{pq}^{-}(\bm{0}_{pq})roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) = italic_c start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT = ± roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ). ∎

The two cases (i) and (ii) in Lemma 8.2 require separate treatments. In practice, it is convenient to introduce the notion of a ‘reduced’ discriminant. If Ψpq+(𝒃,𝝈)superscriptsubscriptΨ𝑝𝑞𝒃𝝈\Psi_{pq}^{+}(\bm{b},\bm{\sigma})roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) contains a common factor, we divide out that common factor. A common factor in a discriminant clearly cannot contribute to the determination of the domain boundary in an equation such as Ψpq+(𝒃,𝝈)=0superscriptsubscriptΨ𝑝𝑞𝒃𝝈0\Psi_{pq}^{+}(\bm{b},\bm{\sigma})=0roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) = 0. We denote the reduced discriminant by Ψ~pq+(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{+}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ). By definition, it does not vanish if any single component bjsubscript𝑏𝑗b_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in 𝒃𝒃\bm{b}bold_italic_b is set to zero. Next Ψpq(𝒃,𝝈)superscriptsubscriptΨ𝑝𝑞𝒃𝝈\Psi_{pq}^{-}(\bm{b},\bm{\sigma})roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) clearly contains the same common factor as Ψpq+(𝒃,𝝈)superscriptsubscriptΨ𝑝𝑞𝒃𝝈\Psi_{pq}^{+}(\bm{b},\bm{\sigma})roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) because flip** ±plus-or-minus\pm± signs in the coefficients of the polynomial does not affect common factors in the discriminant. Hence by an obvious analogy we define the reduced discriminant Ψ~pq(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{-}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ). We work with Ψ~pq+(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{+}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) and Ψ~pq(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{-}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) below. Note that Lemma 8.2 holds true also for the reduced discriminants.

The next key idea is that of foliation. For fixed tuples 𝒃𝒃\bm{b}bold_italic_b and 𝝈𝝈\bm{\sigma}bold_italic_σ, the level sets of Ψ~pq+(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{+}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) foliate the parameter space +n1superscriptsubscript𝑛1\mathbb{R}_{+}^{n-1}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. The level sets of Ψ~pq(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{-}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) also foliate +n1superscriptsubscript𝑛1\mathbb{R}_{+}^{n-1}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. Hence both families of level sets foliate the domain 𝒟^^𝒟\mathscr{\hat{D}}over^ start_ARG script_D end_ARG. For our purposes, the foliation is a map** +n1superscriptsubscript𝑛1\mathbb{R}_{+}^{n-1}\to\mathbb{R}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT → blackboard_R, because both Ψ~pq+(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{+}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) and Ψ~pq(𝒃,𝝈)superscriptsubscript~Ψ𝑝𝑞𝒃𝝈\tilde{\Psi}_{pq}^{-}(\bm{b},\bm{\sigma})over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , bold_italic_σ ) are real valued.

8.3 Ψ~pq±(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞plus-or-minus𝒃𝜎\tilde{\Psi}_{pq}^{\pm}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) nonzero at origin

We begin with the simpler case (ii) in Lemma 8.2, where Ψ~pq+(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) and Ψ~pq(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) are nonzero at the origin. First fix the values of p𝑝pitalic_p and q𝑞qitalic_q. Then for any tuple 𝝈𝝈\bm{\sigma}bold_italic_σ, for any 𝒃superscript𝒃\bm{b}^{\prime}bold_italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT whose image is in 𝒟ˇˇ𝒟\mathscr{\check{D}}overroman_ˇ start_ARG script_D end_ARG, both Ψ~pq+(𝟎pq,𝝈)Ψ~pq+(𝒃,𝝈)>0superscriptsubscript~Ψ𝑝𝑞subscript0𝑝𝑞𝝈superscriptsubscript~Ψ𝑝𝑞superscript𝒃𝝈0\tilde{\Psi}_{pq}^{+}(\bm{0}_{pq},\bm{\sigma})\tilde{\Psi}_{pq}^{+}(\bm{b}^{% \prime},\bm{\sigma})>0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT , bold_italic_σ ) over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , bold_italic_σ ) > 0 and Ψ~pq(𝟎pq,𝝈)Ψ~pq(𝒃,𝝈)>0superscriptsubscript~Ψ𝑝𝑞subscript0𝑝𝑞𝝈superscriptsubscript~Ψ𝑝𝑞superscript𝒃𝝈0\tilde{\Psi}_{pq}^{-}(\bm{0}_{pq},\bm{\sigma})\tilde{\Psi}_{pq}^{-}(\bm{b}^{% \prime},\bm{\sigma})>0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT , bold_italic_σ ) over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , bold_italic_σ ) > 0. To determine the boundary of the domain of convergence, we solve for 𝒃subscript𝒃\bm{b}_{*}bold_italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT where Ψ~pq+(𝒃,𝝈)=0superscriptsubscript~Ψ𝑝𝑞subscript𝒃𝝈0\tilde{\Psi}_{pq}^{+}(\bm{b}_{*},\bm{\sigma})=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , bold_italic_σ ) = 0 or Ψ~pq(𝒃,𝝈)=0superscriptsubscript~Ψ𝑝𝑞subscript𝒃𝝈0\tilde{\Psi}_{pq}^{-}(\bm{b}_{*},\bm{\sigma})=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , bold_italic_σ ) = 0 for any σΣpq𝜎subscriptΣ𝑝𝑞\sigma\in\Sigma_{pq}italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT. This can be encapsulated in a single formula

𝝈ΣpqΨ~pq+(𝒃,𝝈)Ψ~pq(𝒃,𝝈)=0.subscriptproduct𝝈subscriptΣ𝑝𝑞superscriptsubscript~Ψ𝑝𝑞subscript𝒃𝝈superscriptsubscript~Ψ𝑝𝑞subscript𝒃𝝈0\prod_{\bm{\sigma}\in\Sigma_{pq}}\tilde{\Psi}_{pq}^{+}(\bm{b}_{*},\bm{\sigma})% \tilde{\Psi}_{pq}^{-}(\bm{b}_{*},\bm{\sigma})=0\,.∏ start_POSTSUBSCRIPT bold_italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , bold_italic_σ ) over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , bold_italic_σ ) = 0 . (8.20)

The domain of convergence 𝒟𝒟\mathscr{D}script_D is the set connected to the origin, bounded by the hyperplanes which satisfy eq. (8.20). Although technically there are 2nsuperscript2𝑛2^{n}2 start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT discriminants in the product in eq. (8.20), in practice many of them are identical and the number of distinct discriminants is much fewer. However, I do not have a definitive estimate of the number of distinct discriminants. If qp𝑞𝑝q-pitalic_q - italic_p is odd, we need consider only Ψ~pq+superscriptsubscript~Ψ𝑝𝑞\tilde{\Psi}_{pq}^{+}over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT and we can simplify eq. (8.20) to

𝝈ΣpqΨ~pq+(𝒃,𝝈)=0.subscriptproduct𝝈subscriptΣ𝑝𝑞superscriptsubscript~Ψ𝑝𝑞subscript𝒃𝝈0\prod_{\bm{\sigma}\in\Sigma_{pq}}\tilde{\Psi}_{pq}^{+}(\bm{b}_{*},\bm{\sigma})% =0\,.∏ start_POSTSUBSCRIPT bold_italic_σ ∈ roman_Σ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , bold_italic_σ ) = 0 . (8.21)

As an illustrative example, consider the quartic equation a0+x+x2+a4x4=0subscript𝑎0𝑥superscript𝑥2subscript𝑎4superscript𝑥40a_{0}+x+x^{2}+a_{4}x^{4}=0italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_x + italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT = 0. We choose p=1𝑝1p=1italic_p = 1 and q=2𝑞2q=2italic_q = 2 and we have set a1=a2=1subscript𝑎1subscript𝑎21a_{1}=a_{2}=1italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1 so that the scaled coefficients are simply bj=ajsubscript𝑏𝑗subscript𝑎𝑗b_{j}=a_{j}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. Because qp=1𝑞𝑝1q-p=1italic_q - italic_p = 1 is odd, we require only Ψpq+superscriptsubscriptΨ𝑝𝑞\Psi_{pq}^{+}roman_Ψ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT. The discriminant has a common factor of |a4|subscript𝑎4|a_{4}|| italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT |:

Ψ12+(|a0|,|a4|)=|a4|(256|a0|3|a4|2128|a0|2|a4|+144|a0||a4|+16|a0|27|a4|4).superscriptsubscriptΨ12subscript𝑎0subscript𝑎4subscript𝑎4256superscriptsubscript𝑎03superscriptsubscript𝑎42128superscriptsubscript𝑎02subscript𝑎4144subscript𝑎0subscript𝑎416subscript𝑎027subscript𝑎44\Psi_{12}^{+}(|a_{0}|,|a_{4}|)=|a_{4}|\Bigl{(}256|a_{0}|^{3}|a_{4}|^{2}-128|a_% {0}|^{2}|a_{4}|+144|a_{0}||a_{4}|+16|a_{0}|-27|a_{4}|-4\Bigr{)}\,.roman_Ψ start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ( 256 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 128 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | + 144 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | + 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | - 27 | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 4 ) . (8.22)

We divide out the common factor |a4|subscript𝑎4|a_{4}|| italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | and obtain the reduced discriminants

Ψ~12+(|a0|,|a4|)superscriptsubscript~Ψ12subscript𝑎0subscript𝑎4\displaystyle\tilde{\Psi}_{12}^{+}(|a_{0}|,|a_{4}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) =256|a0|3|a4|2128|a0|2|a4|+144|a0||a4|+16|a0|27|a4|4,absent256superscriptsubscript𝑎03superscriptsubscript𝑎42128superscriptsubscript𝑎02subscript𝑎4144subscript𝑎0subscript𝑎416subscript𝑎027subscript𝑎44\displaystyle=\phantom{-}256|a_{0}|^{3}|a_{4}|^{2}-128|a_{0}|^{2}|a_{4}|+144|a% _{0}||a_{4}|+16|a_{0}|-27|a_{4}|-4\,,= 256 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 128 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | + 144 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | + 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | - 27 | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 4 , (8.23a)
Ψ~12+(|a0|,|a4|)superscriptsubscript~Ψ12subscript𝑎0subscript𝑎4\displaystyle\tilde{\Psi}_{12}^{+}(|a_{0}|,-|a_{4}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) =256|a0|3|a4|2+128|a0|2|a4|144|a0||a4|+16|a0|+27|a4|4,absent256superscriptsubscript𝑎03superscriptsubscript𝑎42128superscriptsubscript𝑎02subscript𝑎4144subscript𝑎0subscript𝑎416subscript𝑎027subscript𝑎44\displaystyle=\phantom{-}256|a_{0}|^{3}|a_{4}|^{2}+128|a_{0}|^{2}|a_{4}|-144|a% _{0}||a_{4}|+16|a_{0}|+27|a_{4}|-4\,,= 256 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 128 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 144 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | + 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | + 27 | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 4 , (8.23b)
Ψ~12+(|a0|,|a4|)superscriptsubscript~Ψ12subscript𝑎0subscript𝑎4\displaystyle\tilde{\Psi}_{12}^{+}(-|a_{0}|,|a_{4}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) =256|a0|3|a4|2128|a0|2|a4|144|a0||a4|16|a0|27|a4|4,absent256superscriptsubscript𝑎03superscriptsubscript𝑎42128superscriptsubscript𝑎02subscript𝑎4144subscript𝑎0subscript𝑎416subscript𝑎027subscript𝑎44\displaystyle=-256|a_{0}|^{3}|a_{4}|^{2}-128|a_{0}|^{2}|a_{4}|-144|a_{0}||a_{4% }|-16|a_{0}|-27|a_{4}|-4\,,= - 256 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 128 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 144 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | - 27 | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 4 , (8.23c)
Ψ~12+(|a0|,|a4|)superscriptsubscript~Ψ12subscript𝑎0subscript𝑎4\displaystyle\tilde{\Psi}_{12}^{+}(-|a_{0}|,-|a_{4}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) =256|a0|3|a4|2+128|a0|2|a4|+144|a0||a4|16|a0|+27|a4|4.absent256superscriptsubscript𝑎03superscriptsubscript𝑎42128superscriptsubscript𝑎02subscript𝑎4144subscript𝑎0subscript𝑎416subscript𝑎027subscript𝑎44\displaystyle=-256|a_{0}|^{3}|a_{4}|^{2}+128|a_{0}|^{2}|a_{4}|+144|a_{0}||a_{4% }|-16|a_{0}|+27|a_{4}|-4\,.= - 256 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 128 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | + 144 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | + 27 | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | - 4 . (8.23d)

The reduced discriminants all equal 44-4- 4 at the origin. The necessary bounds for convergence yield |a0|14subscript𝑎014|a_{0}|\leq\frac{1}{4}| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG 4 end_ARG and |a4|4/27subscript𝑎4427|a_{4}|\leq 4/27| italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ≤ 4 / 27. Note that the above expressions are quadratics in |a4|subscript𝑎4|a_{4}|| italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT |. Thus to solve for Ψ12+(±|a0|,±|a4|)=0subscriptsuperscriptΨ12plus-or-minussubscript𝑎0plus-or-minussubscript𝑎40\Psi^{+}_{12}(\pm|a_{0}|,\pm|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( ± | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0, we fix a value of |a0|subscript𝑎0|a_{0}|| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | and solve the resulting quadratic in |a4|subscript𝑎4|a_{4}|| italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT |. Note that this procedure will not always yield a real solution for |a4|subscript𝑎4|a_{4}|| italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT |; the discriminants which fail to do so do not contribute to the boundary of the domain of convergence. The discriminant Ψ~12+(|a0|,|a4|)superscriptsubscript~Ψ12subscript𝑎0subscript𝑎4\tilde{\Psi}_{12}^{+}(-|a_{0}|,|a_{4}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) is such a case. Setting the other discriminants to zero yields valid hyperplanes. The resulting curves in the (|a0|,|a4|)subscript𝑎0subscript𝑎4(|a_{0}|,|a_{4}|)( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) parameter space are displayed in Fig. 2, for Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(|a_{0}|,|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 (dashed), Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(|a_{0}|,-|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 (dotdash) and Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(-|a_{0}|,-|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 (solid). The shaded area indicates the domain 𝒟12subscript𝒟12\mathscr{D}_{12}script_D start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT, which is determined by the two hyperplanes given by the level sets Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(|a_{0}|,|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 and Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(-|a_{0}|,-|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0. The level set Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(|a_{0}|,-|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 does not contribute. The domain of convergence is therefore

𝒟12={Ψ12+(|a0|,|a4|)0}{Ψ12+(|a0|,|a4|)0}.subscript𝒟12subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\mathcal{D}_{12}=\{\Psi^{+}_{12}(|a_{0}|,|a_{4}|)\leq 0\}\cap\{\Psi^{+}_{12}(-% |a_{0}|,-|a_{4}|)\leq 0\}\,.caligraphic_D start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT = { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) ≤ 0 } ∩ { roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) ≤ 0 } . (8.24)

Recall that technically, the domain 𝒟12subscript𝒟12\mathcal{D}_{12}caligraphic_D start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT is the component which satisfies the above conditions and is connected to the origin. Observe from the curvature of the upper boundary in Fig. 2, i.e. the level set Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(-|a_{0}|,-|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0, that the domain of convergence is not convex. A complete Reinhardt domain is logarithmically convex, but is not necessarily convex.

8.4 Ψ~pq±(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞plus-or-minus𝒃𝜎\tilde{\Psi}_{pq}^{\pm}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) vanishes at origin

The case (i) in Lemma 8.2 is more difficult. Now Ψ~pq+(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) and Ψ~pq(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) vanish at the origin, hence solving for Ψ~pq+(𝒃,σ)=0superscriptsubscript~Ψ𝑝𝑞𝒃𝜎0\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) = 0 or Ψ~pq(𝒃,σ)=0superscriptsubscript~Ψ𝑝𝑞𝒃𝜎0\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) = 0 yields the origin as an unwanted solution. Recall the example of the quartic eq. (8.13). Hence we must proceed more carefully. As always, we first fix the values of p𝑝pitalic_p and q𝑞qitalic_q. Next, fix a tuple 𝝈𝝈\bm{\sigma}bold_italic_σ. Then the discriminants will exhibit one of three mutually exclusive properties: either Ψ~pq+(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) has a local maximum, or a local minimum, or a saddle point at the origin. The same is true for Ψ~pq(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ). We can also say ‘a local extremum or a saddle point’ at the origin. The concept of ‘local extremum’ must be understood carefully, because it is really a constrained extremization. It is simplest to illustrate with an example. Consider a cubic equation 1+x+b2x2+b3x3=01𝑥subscript𝑏2superscript𝑥2subscript𝑏3superscript𝑥301+x+b_{2}x^{2}+b_{3}x^{3}=01 + italic_x + italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT = 0 with p=0𝑝0p=0italic_p = 0 and q=1𝑞1q=1italic_q = 1. Since qp=1𝑞𝑝1q-p=1italic_q - italic_p = 1 is odd, it suffices to treat Ψ~pq+(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) only. The expressions for the discriminants in this case are (note that they all vanish at the origin)

Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\displaystyle\tilde{\Psi}_{01}^{+}(|b_{2}|,|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|b3|2+4|b3|+4|b2|318|b2||b3||b2|2,absent27superscriptsubscript𝑏324subscript𝑏34superscriptsubscript𝑏2318subscript𝑏2subscript𝑏3superscriptsubscript𝑏22\displaystyle=27|b_{3}|^{2}+4|b_{3}|+4|b_{2}|^{3}-18|b_{2}||b_{3}|-|b_{2}|^{2}\,,= 27 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | + 4 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT - 18 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.25a)
Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\displaystyle\tilde{\Psi}_{01}^{+}(|b_{2}|,-|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|b3|24|b3|+4|b2|3+18|b2||b3||b2|2,absent27superscriptsubscript𝑏324subscript𝑏34superscriptsubscript𝑏2318subscript𝑏2subscript𝑏3superscriptsubscript𝑏22\displaystyle=27|b_{3}|^{2}-4|b_{3}|+4|b_{2}|^{3}+18|b_{2}||b_{3}|-|b_{2}|^{2}\,,= 27 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | + 4 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + 18 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.25b)
Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\displaystyle\tilde{\Psi}_{01}^{+}(-|b_{2}|,|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|b3|2+4|b3|4|b2|3+18|b2||b3||b2|2,absent27superscriptsubscript𝑏324subscript𝑏34superscriptsubscript𝑏2318subscript𝑏2subscript𝑏3superscriptsubscript𝑏22\displaystyle=27|b_{3}|^{2}+4|b_{3}|-4|b_{2}|^{3}+18|b_{2}||b_{3}|-|b_{2}|^{2}\,,= 27 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - 4 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + 18 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.25c)
Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\displaystyle\tilde{\Psi}_{01}^{+}(-|b_{2}|,-|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) =27|b3|24|b3|4|b2|318|b2||b3||b2|2.absent27superscriptsubscript𝑏324subscript𝑏34superscriptsubscript𝑏2318subscript𝑏2subscript𝑏3superscriptsubscript𝑏22\displaystyle=27|b_{3}|^{2}-4|b_{3}|-4|b_{2}|^{3}-18|b_{2}||b_{3}|-|b_{2}|^{2}\,.= 27 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - 4 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT - 18 | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (8.25d)
  • 1.

    Then Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\tilde{\Psi}_{01}^{+}(|b_{2}|,-|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) has a local maximum at the origin. Put |b2|=0subscript𝑏20|b_{2}|=0| italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | = 0, then Ψ~01+(0,|b3|)4|b3|similar-to-or-equalssuperscriptsubscript~Ψ010subscript𝑏34subscript𝑏3\tilde{\Psi}_{01}^{+}(0,-|b_{3}|)\simeq-4|b_{3}|over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( 0 , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≃ - 4 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | for sufficiently small |b3|subscript𝑏3|b_{3}|| italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT |. This is negative definite because |b3|>0subscript𝑏30|b_{3}|>0| italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | > 0 only, for b30subscript𝑏30b_{3}\neq 0italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ≠ 0. However, the partial derivative Ψ~01+/|b3|superscriptsubscript~Ψ01subscript𝑏3\partial\tilde{\Psi}_{01}^{+}/\partial|b_{3}|∂ over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT / ∂ | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | does not vanish at |b1|=0subscript𝑏10|b_{1}|=0| italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = 0. Next put |b3|=0subscript𝑏30|b_{3}|=0| italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | = 0, then Ψ~01+(|b2|,0)|b2|2similar-to-or-equalssuperscriptsubscript~Ψ01subscript𝑏20superscriptsubscript𝑏22\tilde{\Psi}_{01}^{+}(|b_{2}|,0)\simeq-|b_{2}|^{2}over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , 0 ) ≃ - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for sufficiently small |b2|subscript𝑏2|b_{2}|| italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT |. This is also negative definite for b20subscript𝑏20b_{2}\neq 0italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≠ 0. One can show that Ψ~01+(|b2|,|b3|)<0superscriptsubscript~Ψ01subscript𝑏2subscript𝑏30\tilde{\Psi}_{01}^{+}(|b_{2}|,-|b_{3}|)<0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) < 0 for all sufficiently small |b2|>0subscript𝑏20|b_{2}|>0| italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | > 0 and |b3|>0subscript𝑏30|b_{3}|>0| italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | > 0. Hence for our purposes, a ‘local maximum’ is a constrained local maximum. Similarly, the concept of ‘local minimum’ is a constrained local minimum.

  • 2.

    Similarly Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\tilde{\Psi}_{01}^{+}(-|b_{2}|,-|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) also has a local maximum at the origin.

  • 3.

    However Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\tilde{\Psi}_{01}^{+}(|b_{2}|,|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) and Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\tilde{\Psi}_{01}^{+}(-|b_{2}|,|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) both have saddle points at the origin. Put |b2|=0subscript𝑏20|b_{2}|=0| italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | = 0, then Ψ~01+(0,|b3|)4|b3|similar-to-or-equalssuperscriptsubscript~Ψ010subscript𝑏34subscript𝑏3\tilde{\Psi}_{01}^{+}(0,|b_{3}|)\simeq 4|b_{3}|over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( 0 , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≃ 4 | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | for sufficiently small |b3|subscript𝑏3|b_{3}|| italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT |, and is positive for |b3|>0subscript𝑏30|b_{3}|>0| italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | > 0. Next put |b3|=0subscript𝑏30|b_{3}|=0| italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | = 0, then Ψ~01+(|b2|,0)|b2|2similar-to-or-equalssuperscriptsubscript~Ψ01subscript𝑏20superscriptsubscript𝑏22\tilde{\Psi}_{01}^{+}(|b_{2}|,0)\simeq-|b_{2}|^{2}over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , 0 ) ≃ - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and Ψ~01+(|b2|,0)|b2|2similar-to-or-equalssuperscriptsubscript~Ψ01subscript𝑏20superscriptsubscript𝑏22\tilde{\Psi}_{01}^{+}(-|b_{2}|,0)\simeq-|b_{2}|^{2}over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , 0 ) ≃ - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for sufficiently small |b2|subscript𝑏2|b_{2}|| italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT |, and are both negative for |b2|>0subscript𝑏20|b_{2}|>0| italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | > 0. This establishes that both Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\tilde{\Psi}_{01}^{+}(|b_{2}|,|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) and Ψ~01+(|b2|,|b3|)superscriptsubscript~Ψ01subscript𝑏2subscript𝑏3\tilde{\Psi}_{01}^{+}(-|b_{2}|,|b_{3}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) are of indefinite sign in the vicinity of the origin, i.e. they have saddle points at the origin.

Returning to the general theory, a discriminant Ψ~pq+(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) or Ψ~pq(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) which has a saddle point at the origin does not contribute to the determination of the boundary of the domain of convergence. Such a discriminant has a nontrivial level set Ψ~pq+(𝒃,σ)=0superscriptsubscript~Ψ𝑝𝑞𝒃𝜎0\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) = 0 or Ψ~pq(𝒃,σ)=0superscriptsubscript~Ψ𝑝𝑞𝒃𝜎0\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) = 0 which includes the origin, and as such, the hyperplane cannot form part of a set which encloses any open neighborhood of the origin in +n1superscriptsubscript𝑛1\mathbb{R}_{+}^{n-1}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT.

We must therefore define a set of ±plus-or-minus\pm± sign assignments Σpq+subscriptsuperscriptΣ𝑝𝑞\Sigma^{+}_{pq}roman_Σ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT (resp. ΣpqsubscriptsuperscriptΣ𝑝𝑞\Sigma^{-}_{pq}roman_Σ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT) consisting only of those 𝝈𝝈\bm{\sigma}bold_italic_σ such that the discriminants Ψ~pq+(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) (resp. Ψ~pq(𝒃,σ)superscriptsubscript~Ψ𝑝𝑞𝒃𝜎\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ )) have a (constrained) local extremum at the origin. (It is possible that the sets Σpq+subscriptsuperscriptΣ𝑝𝑞\Sigma^{+}_{pq}roman_Σ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT and ΣpqsubscriptsuperscriptΣ𝑝𝑞\Sigma^{-}_{pq}roman_Σ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT are identical, but I do not have a proof of this.) For these values of 𝝈𝝈\bm{\sigma}bold_italic_σ, the origin is a one-element level set of the equations Ψ~pq+(𝒃,σ)=0superscriptsubscript~Ψ𝑝𝑞𝒃𝜎0\tilde{\Psi}_{pq}^{+}(\bm{b},\sigma)=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) = 0 or Ψ~pq(𝒃,σ)=0superscriptsubscript~Ψ𝑝𝑞𝒃𝜎0\tilde{\Psi}_{pq}^{-}(\bm{b},\sigma)=0over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b , italic_σ ) = 0. We exclude the origin as an unwanted solution. Recall the example of the quartic eq. (8.13) and the discriminant Ψ~24(|a0|,|a1|)subscriptsuperscript~Ψ24subscript𝑎0subscript𝑎1\tilde{\Psi}^{-}_{24}(-|a_{0}|,|a_{1}|)over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ). We then proceed as in the previous section. To determine the boundary of the domain of convergence, we solve for 𝒃𝟎pqsubscript𝒃absentsubscript0𝑝𝑞\bm{b}_{**}\neq\bm{0}_{pq}bold_italic_b start_POSTSUBSCRIPT ∗ ∗ end_POSTSUBSCRIPT ≠ bold_0 start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT where

(𝝈Σpq+Ψ~pq+(𝒃,𝝈))(𝝈ΣpqΨ~pq(𝒃,𝝈))=0.subscriptproduct𝝈subscriptsuperscriptΣ𝑝𝑞superscriptsubscript~Ψ𝑝𝑞subscript𝒃absent𝝈subscriptproduct𝝈subscriptsuperscriptΣ𝑝𝑞superscriptsubscript~Ψ𝑝𝑞subscript𝒃absent𝝈0\biggl{(}\prod_{\bm{\sigma}\in\Sigma^{+}_{pq}}\tilde{\Psi}_{pq}^{+}(\bm{b}_{**% },\bm{\sigma})\biggr{)}\biggl{(}\prod_{\bm{\sigma}\in\Sigma^{-}_{pq}}\tilde{% \Psi}_{pq}^{-}(\bm{b}_{**},\bm{\sigma})\biggr{)}=0\,.( ∏ start_POSTSUBSCRIPT bold_italic_σ ∈ roman_Σ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ ∗ end_POSTSUBSCRIPT , bold_italic_σ ) ) ( ∏ start_POSTSUBSCRIPT bold_italic_σ ∈ roman_Σ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ ∗ end_POSTSUBSCRIPT , bold_italic_σ ) ) = 0 . (8.26)

The domain of convergence 𝒟𝒟\mathscr{D}script_D is the set connected to the origin, bounded by the hyperplanes which satisfy eq. (8.26). Hence we require a preliminary calculation to exclude those values of 𝝈𝝈\bm{\sigma}bold_italic_σ for which the discriminants have saddle points at the origin. As before, if qp𝑞𝑝q-pitalic_q - italic_p is odd, we can restrict attention only to Ψ~pq+superscriptsubscript~Ψ𝑝𝑞\tilde{\Psi}_{pq}^{+}over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT and write the simpler formula

𝝈Σpq+Ψ~pq+(𝒃,𝝈)=0.subscriptproduct𝝈subscriptsuperscriptΣ𝑝𝑞superscriptsubscript~Ψ𝑝𝑞subscript𝒃absent𝝈0\prod_{\bm{\sigma}\in\Sigma^{+}_{pq}}\tilde{\Psi}_{pq}^{+}(\bm{b}_{**},\bm{% \sigma})=0\,.∏ start_POSTSUBSCRIPT bold_italic_σ ∈ roman_Σ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG roman_Ψ end_ARG start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( bold_italic_b start_POSTSUBSCRIPT ∗ ∗ end_POSTSUBSCRIPT , bold_italic_σ ) = 0 . (8.27)

As an illustrative example, consider the quartic eq. (8.13). Recall we choose p=2𝑝2p=2italic_p = 2 and q=4𝑞4q=4italic_q = 4 and we have set a2=a4=1subscript𝑎2subscript𝑎41a_{2}=a_{4}=1italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = 1 so that the scaled coefficients are simply b0=a0subscript𝑏0subscript𝑎0b_{0}=a_{0}italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and b1=a1subscript𝑏1subscript𝑎1b_{1}=a_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Then

Ψ24+(|a0|,±|a1|)superscriptsubscriptΨ24subscript𝑎0plus-or-minussubscript𝑎1\displaystyle\Psi_{24}^{+}(|a_{0}|,\pm|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) =27|a1|44(136|a0|)|a1|2+16|a0|(14|a0|)2,absent27superscriptsubscript𝑎144136subscript𝑎0superscriptsubscript𝑎1216subscript𝑎0superscript14subscript𝑎02\displaystyle=-27|a_{1}|^{4}-4(1-36|a_{0}|)|a_{1}|^{2}+16|a_{0}|(1-4|a_{0}|)^{% 2}\,,= - 27 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT - 4 ( 1 - 36 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ( 1 - 4 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.28a)
Ψ24+(|a0|,±|a1|)superscriptsubscriptΨ24subscript𝑎0plus-or-minussubscript𝑎1\displaystyle\Psi_{24}^{+}(-|a_{0}|,\pm|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) =27|a1|44(1+36|a0|)|a1|2+16|a0|(1+4|a0|)2,absent27superscriptsubscript𝑎144136subscript𝑎0superscriptsubscript𝑎1216subscript𝑎0superscript14subscript𝑎02\displaystyle=-27|a_{1}|^{4}-4(1+36|a_{0}|)|a_{1}|^{2}+16|a_{0}|(1+4|a_{0}|)^{% 2}\,,= - 27 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT - 4 ( 1 + 36 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ( 1 + 4 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.28b)
Ψ24(|a0|,±|a1|)superscriptsubscriptΨ24subscript𝑎0plus-or-minussubscript𝑎1\displaystyle\Psi_{24}^{-}(|a_{0}|,\pm|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) =27|a1|4+4(1+36|a0|)|a1|216|a0|(1+4|a0|)2,absent27superscriptsubscript𝑎144136subscript𝑎0superscriptsubscript𝑎1216subscript𝑎0superscript14subscript𝑎02\displaystyle=-27|a_{1}|^{4}+4(1+36|a_{0}|)|a_{1}|^{2}-16|a_{0}|(1+4|a_{0}|)^{% 2}\,,= - 27 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT + 4 ( 1 + 36 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ( 1 + 4 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8.28c)
Ψ24(|a0|,±|a1|)superscriptsubscriptΨ24subscript𝑎0plus-or-minussubscript𝑎1\displaystyle\Psi_{24}^{-}(-|a_{0}|,\pm|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) =27|a1|4+4(136|a0|)|a1|2+16|a0|(14|a0|)2.absent27superscriptsubscript𝑎144136subscript𝑎0superscriptsubscript𝑎1216subscript𝑎0superscript14subscript𝑎02\displaystyle=-27|a_{1}|^{4}+4(1-36|a_{0}|)|a_{1}|^{2}+16|a_{0}|(1-4|a_{0}|)^{% 2}\,.= - 27 | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT + 4 ( 1 - 36 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 16 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ( 1 - 4 | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (8.28d)

Hence there are four distinct discriminants, viz. Ψ24+(|a0|,|a1|)superscriptsubscriptΨ24subscript𝑎0subscript𝑎1\Psi_{24}^{+}(|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ), Ψ24+(|a0|,|a1|)superscriptsubscriptΨ24subscript𝑎0subscript𝑎1\Psi_{24}^{+}(-|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ), Ψ24(|a0|,|a1|)superscriptsubscriptΨ24subscript𝑎0subscript𝑎1\Psi_{24}^{-}(|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) and Ψ24(|a0|,|a1|)superscriptsubscriptΨ24subscript𝑎0subscript𝑎1\Psi_{24}^{-}(-|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ). The necessary bounds for convergence yield |a0|14subscript𝑎014|a_{0}|\leq\frac{1}{4}| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG 4 end_ARG and |a1|4/27subscript𝑎1427|a_{1}|\leq\sqrt{4/27}| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ≤ square-root start_ARG 4 / 27 end_ARG. The above expressions are all quadratics in |a1|2superscriptsubscript𝑎12|a_{1}|^{2}| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Thus to solve for Ψ24±(±|a0|,±|a1|)=0subscriptsuperscriptΨplus-or-minus24plus-or-minussubscript𝑎0plus-or-minussubscript𝑎10\Psi^{\pm}_{24}(\pm|a_{0}|,\pm|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT ± end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( ± | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , ± | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0, we fix a value of |a0|subscript𝑎0|a_{0}|| italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | and solve the resulting quadratic in |a1|2superscriptsubscript𝑎12|a_{1}|^{2}| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. This procedure will not always yield a real positive solution for |a1|subscript𝑎1|a_{1}|| italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT |; the discriminants which fail to do so do not contribute to the boundary of the domain of convergence. The discriminant Ψ24+(|a0|,|a1|)superscriptsubscriptΨ24subscript𝑎0subscript𝑎1\Psi_{24}^{+}(-|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) is such an example. Setting the other discriminants in eq. (8.28) to zero yields valid hyperplanes. The resulting curves in the (|a0|,|a1|)subscript𝑎0subscript𝑎1(|a_{0}|,|a_{1}|)( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) parameter space are displayed in Fig. 3, for Ψ24+(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{+}_{24}(|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0 (dashed), Ψ24(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{-}_{24}(|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0 (dotdash) and Ψ24(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0 (solid). The first two level sets pass through the origin, because the discriminants have saddle points at the origin, and they do not contribute to the boundary of the domain of convergence. The domain of convergence is determined solely by the level set Ψ24(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0. However, that level set (the solid curve) bounds two domains in Fig. 3. The domain 𝒟24subscript𝒟24\mathscr{D}_{24}script_D start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT is given by the shaded area only, because that is the region connected to the origin. The cross-hatched region is not connected to the origin and is not part of the domain of convergence. Because Ψ24(|a0|,|a1|)subscriptsuperscriptΨ24subscript𝑎0subscript𝑎1\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) has a local minimum at the origin, the domain of convergence is the component connected to the origin such that

𝒟24={Ψ24(|a0|,|a1|)0}.subscript𝒟24subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\mathcal{D}_{24}=\{\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)\geq 0\}\,.caligraphic_D start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT = { roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) ≥ 0 } . (8.29)

The caveat about ‘connectedness to the origin’ is essential in this case. The domain 𝒟24subscript𝒟24\mathcal{D}_{24}caligraphic_D start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT is also not convex. This is demonstrated in Fig. 4. The domain 𝒟𝒟\mathscr{D}script_D is the shaded area and is bounded by the solid curve. The dashed line is the straight line which joins the vertices (b^0,0)=(14,0)subscript^𝑏00140(\hat{b}_{0},0)=(\frac{1}{4},0)( over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 0 ) = ( divide start_ARG 1 end_ARG start_ARG 4 end_ARG , 0 ) and (0,b^1)=(0,4/27)0subscript^𝑏10427(0,\hat{b}_{1})=(0,\sqrt{4/27})( 0 , over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = ( 0 , square-root start_ARG 4 / 27 end_ARG ) to form a right-angled triangle with the origin. Observe that 𝒟24subscript𝒟24\mathcal{D}_{24}caligraphic_D start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT is not convex, hence the full domain of convergence 𝒟24subscript𝒟24\mathcal{D}_{24}caligraphic_D start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT is not convex.

8.5 Normalization of discriminant

The literature in fact contains multiple normalization conventions for discriminants. For example for the quadratic ax2+bx+c𝑎superscript𝑥2𝑏𝑥𝑐ax^{2}+bx+citalic_a italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_b italic_x + italic_c, some authors define the discriminant to be Δ=4acb2Δ4𝑎𝑐superscript𝑏2\Delta=4ac-b^{2}roman_Δ = 4 italic_a italic_c - italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and others prefer Δ=b24acΔsuperscript𝑏24𝑎𝑐\Delta=b^{2}-4acroman_Δ = italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4 italic_a italic_c. Hence the directions of the inequalities in expressions such as eq. (8.12) could be reversed, depending on the normalization convention employed for the discriminant. The reader should beware of this important detail. However, the formulas in both eqs. (8.20) and (8.26) are independent of the normalization of the discriminant, which is an advantage of the above formalism.

9 Applications: principal and Brioschi quintics

The principal and Brioschi forms of the quintic are tetranomials, and furnish nontrivial applications of the more sophisticated formalism of Sec. 8, to determine the domains of convergence of their solutions by infinite series. We treat them in turn. The principal quintic form is

a0+a1x+a2x2+x5=0.subscript𝑎0subscript𝑎1𝑥subscript𝑎2superscript𝑥2superscript𝑥50a_{0}+a_{1}x+a_{2}x^{2}+x^{5}=0\,.italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT = 0 . (9.1)

The discriminant is

Δprin=a52(3125a04a52+2250a02a1a22a51600a0a13a2a5+108a0a25+256a15a527a12a24).subscriptΔprinsuperscriptsubscript𝑎523125superscriptsubscript𝑎04superscriptsubscript𝑎522250superscriptsubscript𝑎02subscript𝑎1superscriptsubscript𝑎22subscript𝑎51600subscript𝑎0superscriptsubscript𝑎13subscript𝑎2subscript𝑎5108subscript𝑎0superscriptsubscript𝑎25256superscriptsubscript𝑎15subscript𝑎527superscriptsubscript𝑎12superscriptsubscript𝑎24\begin{split}\Delta_{\rm prin}=a_{5}^{2}\Bigl{(}3125a_{0}^{4}a_{5}^{2}+2250a_{% 0}^{2}a_{1}a_{2}^{2}a_{5}-1600a_{0}a_{1}^{3}a_{2}a_{5}+108a_{0}a_{2}^{5}+256a_% {1}^{5}a_{5}-27a_{1}^{2}a_{2}^{4}\Bigr{)}\,.\end{split}start_ROW start_CELL roman_Δ start_POSTSUBSCRIPT roman_prin end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 3125 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 2250 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT - 1600 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT + 108 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT + 256 italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT - 27 italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) . end_CELL end_ROW (9.2)

We factor out a52superscriptsubscript𝑎52a_{5}^{2}italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and employ reduced discriminants for the formulas for the domains of convergence, expressed in terms of the scaled coefficients (b0,b1,b2,b5)subscript𝑏0subscript𝑏1subscript𝑏2subscript𝑏5(b_{0},b_{1},b_{2},b_{5})( italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) for each choice of p𝑝pitalic_p and q𝑞qitalic_q:

𝒟01subscript𝒟01\displaystyle\mathcal{D}_{01}caligraphic_D start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT ={Ψ~01+(|b2|,|b5|)0},absentsubscriptsuperscript~Ψ01subscript𝑏2subscript𝑏50\displaystyle=\{\tilde{\Psi}^{+}_{01}(|b_{2}|,-|b_{5}|)\leq 0\}\,,= { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ) ≤ 0 } , (9.3a)
𝒟02subscript𝒟02\displaystyle\mathcal{D}_{02}caligraphic_D start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ={Ψ~02+(|b1|,|b5|)0}{Ψ~02(|b1|,|b5|)0},absentsubscriptsuperscript~Ψ02subscript𝑏1subscript𝑏50subscriptsuperscript~Ψ02subscript𝑏1subscript𝑏50\displaystyle=\{\tilde{\Psi}^{+}_{02}(|b_{1}|,|b_{5}|)\geq 0\}\cap\{\tilde{% \Psi}^{-}_{02}(|b_{1}|,|b_{5}|)\leq 0\}\,,= { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ) ≥ 0 } ∩ { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 02 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ) ≤ 0 } , (9.3b)
𝒟05subscript𝒟05\displaystyle\mathcal{D}_{05}caligraphic_D start_POSTSUBSCRIPT 05 end_POSTSUBSCRIPT ={Ψ~05+(|b1|,|b2|)0},absentsubscriptsuperscript~Ψ05subscript𝑏1subscript𝑏20\displaystyle=\{\tilde{\Psi}^{+}_{05}(-|b_{1}|,-|b_{2}|)\geq 0\}\,,= { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 05 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) ≥ 0 } , (9.3c)
𝒟12subscript𝒟12\displaystyle\mathcal{D}_{12}caligraphic_D start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ={Ψ~12+(|b0|,|b5|)0}{Ψ~12+(|b0|,|b5|)0},absentsubscriptsuperscript~Ψ12subscript𝑏0subscript𝑏50subscriptsuperscript~Ψ12subscript𝑏0subscript𝑏50\displaystyle=\{\tilde{\Psi}^{+}_{12}(|b_{0}|,-|b_{5}|)\leq 0\}\cap\{\tilde{% \Psi}^{+}_{12}(-|b_{0}|,|b_{5}|)\leq 0\}\,,= { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ) ≤ 0 } ∩ { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ) ≤ 0 } , (9.3d)
𝒟15subscript𝒟15\displaystyle\mathcal{D}_{15}caligraphic_D start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT ={Ψ~15+(|b0|,|b2|)0}{Ψ~15(|b0|,|b2|)0},absentsubscriptsuperscript~Ψ15subscript𝑏0subscript𝑏20subscriptsuperscript~Ψ15subscript𝑏0subscript𝑏20\displaystyle=\{\tilde{\Psi}^{+}_{15}(|b_{0}|,|b_{2}|)\geq 0\}\cap\{\tilde{% \Psi}^{-}_{15}(|b_{0}|,|b_{2}|)\leq 0\}\,,= { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) ≥ 0 } ∩ { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) ≤ 0 } , (9.3e)
𝒟25subscript𝒟25\displaystyle\mathcal{D}_{25}caligraphic_D start_POSTSUBSCRIPT 25 end_POSTSUBSCRIPT ={Ψ~25+(|b0|,|b1|)0}.absentsubscriptsuperscript~Ψ25subscript𝑏0subscript𝑏10\displaystyle=\{\tilde{\Psi}^{+}_{25}(-|b_{0}|,|b_{1}|)\leq 0\}\,.= { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 25 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) ≤ 0 } . (9.3f)

As always, the domains of convergence consist only of the components which are connected to the origin. Next, the Brioschi normal form of the quintic is [10]

x510Cx3+45C2xC2=0.superscript𝑥510𝐶superscript𝑥345superscript𝐶2𝑥superscript𝐶20x^{5}-10Cx^{3}+45C^{2}x-C^{2}=0\,.italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT - 10 italic_C italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + 45 italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_x - italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0 . (9.4)

The coefficients are all functions of a single parameter C𝐶Citalic_C and are hence not independent. There is a real root for all real C𝐶Citalic_C. If C=0𝐶0C=0italic_C = 0 there is a repeated root of multiplicity five at x=0𝑥0x=0italic_x = 0. We write eq. (9.4) as a0+a1x+a3x3+a5x5=0subscript𝑎0subscript𝑎1𝑥subscript𝑎3superscript𝑥3subscript𝑎5superscript𝑥50a_{0}+a_{1}x+a_{3}x^{3}+a_{5}x^{5}=0italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x + italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT = 0 where a0=C2subscript𝑎0superscript𝐶2a_{0}=-C^{2}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = - italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, a1=45C2subscript𝑎145superscript𝐶2a_{1}=45C^{2}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 45 italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, a3=10Csubscript𝑎310𝐶a_{3}=-10Citalic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = - 10 italic_C and a5=1subscript𝑎51a_{5}=1italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT = 1. The discriminant is

ΔBr=a5(3125a04a53+2000a02a12a3a52900a02a1a33a5+108a02a35+256a15a52128a14a32a5+16a13a34).subscriptΔBrsubscript𝑎53125superscriptsubscript𝑎04superscriptsubscript𝑎532000superscriptsubscript𝑎02superscriptsubscript𝑎12subscript𝑎3superscriptsubscript𝑎52900superscriptsubscript𝑎02subscript𝑎1superscriptsubscript𝑎33subscript𝑎5108superscriptsubscript𝑎02superscriptsubscript𝑎35256superscriptsubscript𝑎15superscriptsubscript𝑎52128superscriptsubscript𝑎14superscriptsubscript𝑎32subscript𝑎516superscriptsubscript𝑎13superscriptsubscript𝑎34\begin{split}\Delta_{\rm Br}&=a_{5}\Bigl{(}3125a_{0}^{4}a_{5}^{3}+2000a_{0}^{2% }a_{1}^{2}a_{3}a_{5}^{2}-900a_{0}^{2}a_{1}a_{3}^{3}a_{5}+108a_{0}^{2}a_{3}^{5}% +256a_{1}^{5}a_{5}^{2}-128a_{1}^{4}a_{3}^{2}a_{5}+16a_{1}^{3}a_{3}^{4}\Bigr{)}% \,.\end{split}start_ROW start_CELL roman_Δ start_POSTSUBSCRIPT roman_Br end_POSTSUBSCRIPT end_CELL start_CELL = italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ( 3125 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + 2000 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 900 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT + 108 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT + 256 italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 128 italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT + 16 italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) . end_CELL end_ROW (9.5)

We factor out a5subscript𝑎5a_{5}italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT and employ reduced discriminants for the formulas for the domains of convergence, now expressed in terms of the scaled coefficients (b0,b1,b3,b5)subscript𝑏0subscript𝑏1subscript𝑏3subscript𝑏5(b_{0},b_{1},b_{3},b_{5})( italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ).

  • 1.

    Setting p=0𝑝0p=0italic_p = 0 and q=1𝑞1q=1italic_q = 1 yields one root. Put x=(a0/a1)z=z/45𝑥subscript𝑎0subscript𝑎1𝑧𝑧45x=(a_{0}/a_{1})z=-z/45italic_x = ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_z = - italic_z / 45, then

    0=1+z10453Cz3+1455C2z5.01𝑧10superscript453𝐶superscript𝑧31superscript455superscript𝐶2superscript𝑧50=1+z-\frac{10}{45^{3}C}\,z^{3}+\frac{1}{45^{5}C^{2}}\,z^{5}\,.0 = 1 + italic_z - divide start_ARG 10 end_ARG start_ARG 45 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_C end_ARG italic_z start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 45 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_z start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (9.6)

    The domain of convergence is given by

    𝒟01={Ψ~01+(|b3|,|b5|)0}.subscript𝒟01subscriptsuperscript~Ψ01subscript𝑏3subscript𝑏50\mathcal{D}_{01}=\{\tilde{\Psi}^{+}_{01}(-|b_{3}|,-|b_{5}|)\geq 0\}\,.caligraphic_D start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT = { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ) ≥ 0 } . (9.7)

    This yields the condition

    1+29376|C|36578304|C|20.129376𝐶36578304superscript𝐶201+29376|C|-36578304|C|^{2}\leq 0\,.1 + 29376 | italic_C | - 36578304 | italic_C | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ 0 . (9.8)

    This is satisfied for

    |C|17+1323227498.358104.𝐶17132322749similar-to-or-equals8.358superscript104|C|\geq\frac{17+13\sqrt{2}}{32\cdot 27\cdot 49}\simeq 8.358\cdot 10^{-4}\,.| italic_C | ≥ divide start_ARG 17 + 13 square-root start_ARG 2 end_ARG end_ARG start_ARG 32 ⋅ 27 ⋅ 49 end_ARG ≃ 8.358 ⋅ 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT . (9.9)
  • 2.

    Setting p=1𝑝1p=1italic_p = 1 and q=5𝑞5q=5italic_q = 5 yields four roots. Put x=(a1/a5)1/4z=(45C2)1/4z𝑥superscriptsubscript𝑎1subscript𝑎514𝑧superscript45superscript𝐶214𝑧x=(a_{1}/a_{5})^{1/4}z=(45C^{2})^{1/4}zitalic_x = ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT italic_z = ( 45 italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT italic_z, then

    0=1455/4C1/2+z10451/2z3+z5.01superscript4554superscript𝐶12𝑧10superscript4512superscript𝑧3superscript𝑧50=-\frac{1}{45^{5/4}C^{1/2}}+z-\frac{10}{45^{1/2}}z^{3}+z^{5}\,.0 = - divide start_ARG 1 end_ARG start_ARG 45 start_POSTSUPERSCRIPT 5 / 4 end_POSTSUPERSCRIPT italic_C start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG + italic_z - divide start_ARG 10 end_ARG start_ARG 45 start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG italic_z start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_z start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (9.10)

    The domain of convergence is given by

    𝒟15={Ψ~15+(|b0|,|b3|)0}{Ψ~15(|b0|,|b3|)0}.subscript𝒟15subscriptsuperscript~Ψ15subscript𝑏0subscript𝑏30subscriptsuperscript~Ψ15subscript𝑏0subscript𝑏30\mathcal{D}_{15}=\{\tilde{\Psi}^{+}_{15}(|b_{0}|,-|b_{3}|)\geq 0\}\cap\{\tilde% {\Psi}^{-}_{15}(|b_{0}|,-|b_{3}|)\leq 0\}\,.caligraphic_D start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT = { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≥ 0 } ∩ { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 15 end_POSTSUBSCRIPT ( | italic_b start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≤ 0 } . (9.11)

    This also yields the condition eq. (9.8) and hence is also satisfied for the bound in eq. (9.9).

  • 3.

    Setting p=0𝑝0p=0italic_p = 0 and q=5𝑞5q=5italic_q = 5 yields five roots. Put x=(a0/a5)1/5z=C2/5z𝑥superscriptsubscript𝑎0subscript𝑎515𝑧superscript𝐶25𝑧x=(a_{0}/a_{5})^{1/5}z=-C^{2/5}zitalic_x = ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT italic_z = - italic_C start_POSTSUPERSCRIPT 2 / 5 end_POSTSUPERSCRIPT italic_z, then

    0=1+45C2/5x10C1/5z3+z5.0145superscript𝐶25𝑥10superscript𝐶15superscript𝑧3superscript𝑧50=1+45C^{2/5}x-10C^{1/5}z^{3}+z^{5}\,.0 = 1 + 45 italic_C start_POSTSUPERSCRIPT 2 / 5 end_POSTSUPERSCRIPT italic_x - 10 italic_C start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT italic_z start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_z start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (9.12)

    The domain of convergence is given by

    𝒟05={Ψ~05+(|b1|,|b3|)0}.subscript𝒟05subscriptsuperscript~Ψ05subscript𝑏1subscript𝑏30\mathcal{D}_{05}=\{\tilde{\Psi}^{+}_{05}(-|b_{1}|,-|b_{3}|)\geq 0\}\,.caligraphic_D start_POSTSUBSCRIPT 05 end_POSTSUBSCRIPT = { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 05 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | ) ≥ 0 } . (9.13)

    This yields the condition

    129376|C|36578304|C|20.129376𝐶36578304superscript𝐶201-29376|C|-36578304|C|^{2}\geq 0\,.1 - 29376 | italic_C | - 36578304 | italic_C | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 0 . (9.14)

    This is satisfied for

    |C|17+1323227490.327104.𝐶17132322749similar-to-or-equals0.327superscript104|C|\leq\frac{-17+13\sqrt{2}}{32\cdot 27\cdot 49}\simeq 0.327\cdot 10^{-4}\,.| italic_C | ≤ divide start_ARG - 17 + 13 square-root start_ARG 2 end_ARG end_ARG start_ARG 32 ⋅ 27 ⋅ 49 end_ARG ≃ 0.327 ⋅ 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT . (9.15)
  • 4.

    Next set p=0𝑝0p=0italic_p = 0 and q=3𝑞3q=3italic_q = 3. Put x=(a0/a3)1/3z=(C/10)1/3z𝑥superscriptsubscript𝑎0subscript𝑎313𝑧superscript𝐶1013𝑧x=(a_{0}/a_{3})^{1/3}z=(C/10)^{1/3}zitalic_x = ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT italic_z = ( italic_C / 10 ) start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT italic_z then

    0=145C1/3101/3z+z31105/3C1/3z5.0145superscript𝐶13superscript1013𝑧superscript𝑧31superscript1053superscript𝐶13superscript𝑧50=1-\frac{45C^{1/3}}{10^{1/3}}z+z^{3}-\frac{1}{10^{5/3}C^{1/3}}z^{5}\,.0 = 1 - divide start_ARG 45 italic_C start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT end_ARG start_ARG 10 start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT end_ARG italic_z + italic_z start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 10 start_POSTSUPERSCRIPT 5 / 3 end_POSTSUPERSCRIPT italic_C start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT end_ARG italic_z start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (9.16)

    The domain of convergence is given by

    𝒟03={Ψ~03+(|b1|,|b5|)0}.subscript𝒟03subscriptsuperscript~Ψ03subscript𝑏1subscript𝑏50\mathcal{D}_{03}=\{\tilde{\Psi}^{+}_{03}(-|b_{1}|,-|b_{5}|)\geq 0\}\,.caligraphic_D start_POSTSUBSCRIPT 03 end_POSTSUBSCRIPT = { over~ start_ARG roman_Ψ end_ARG start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 03 end_POSTSUBSCRIPT ( - | italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | , - | italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ) ≥ 0 } . (9.17)

    This yields the condition

    (11728|C|)20.superscript11728𝐶20-(1-1728|C|)^{2}\geq 0\,.- ( 1 - 1728 | italic_C | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 0 . (9.18)

    This is only satisfied by the single value |C|=1/1728𝐶11728|C|=1/1728| italic_C | = 1 / 1728, but |C|=1/1728𝐶11728|C|=1/1728| italic_C | = 1 / 1728 lies in a domain not connected to the origin. Hence this scenario yields no roots. We see that the stipulation ‘connected to the origin’ is essential.

  • 5.

    Next set p=1𝑝1p=1italic_p = 1 and q=3𝑞3q=3italic_q = 3. Put x=(a1/a3)1/2z=i(45C/10)1/2z𝑥superscriptsubscript𝑎1subscript𝑎312𝑧𝑖superscript45𝐶1012𝑧x=(a_{1}/a_{3})^{1/2}z=i(45C/10)^{1/2}zitalic_x = ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_z = italic_i ( 45 italic_C / 10 ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_z, then

    0=i101/2453/2C1/2+z+z3+45100z5.0𝑖superscript1012superscript4532superscript𝐶12𝑧superscript𝑧345100superscript𝑧50=\frac{i10^{1/2}}{45^{3/2}C^{1/2}}+z+z^{3}+\frac{45}{100}z^{5}\,.0 = divide start_ARG italic_i 10 start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_ARG 45 start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT italic_C start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG + italic_z + italic_z start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + divide start_ARG 45 end_ARG start_ARG 100 end_ARG italic_z start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (9.19)

    The necessary upper bound for convergence is |b5|14subscript𝑏514|b_{5}|\leq\frac{1}{4}| italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG 4 end_ARG. However, b5=0.45subscript𝑏50.45b_{5}=0.45italic_b start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT = 0.45, which exceeds the above bound. Hence the series solution does not converge for any value of C𝐶Citalic_C, for this scenario.

  • 6.

    Next set p=3𝑝3p=3italic_p = 3 and q=5𝑞5q=5italic_q = 5. Put x=(a3/a5)1/2z=i(10C)1/2z𝑥superscriptsubscript𝑎3subscript𝑎512𝑧𝑖superscript10𝐶12𝑧x=(a_{3}/a_{5})^{1/2}z=i(10C)^{1/2}zitalic_x = ( italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT / italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_z = italic_i ( 10 italic_C ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_z, then

    0=i105/2C1/2+45100z+z3+z5.0𝑖superscript1052superscript𝐶1245100𝑧superscript𝑧3superscript𝑧50=\frac{i}{10^{5/2}C^{1/2}}+\frac{45}{100}z+z^{3}+z^{5}\,.0 = divide start_ARG italic_i end_ARG start_ARG 10 start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT italic_C start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG + divide start_ARG 45 end_ARG start_ARG 100 end_ARG italic_z + italic_z start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_z start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (9.20)

    The necessary upper bound for convergence is |b1|14subscript𝑏114|b_{1}|\leq\frac{1}{4}| italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG 4 end_ARG. However, b1=0.45subscript𝑏10.45b_{1}=0.45italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.45, which exceeds the above bound. Hence the series solution does not converge for any value of C𝐶Citalic_C for this scenario.

The Brioschi quintic normal form yields some instructive insights. First, for three of the six possible choices of p𝑝pitalic_p and q𝑞qitalic_q, the series solutions do not converge for any value of C𝐶Citalic_C. Second, observe that the choices p=0𝑝0p=0italic_p = 0, q=1𝑞1q=1italic_q = 1 and p=1𝑝1p=1italic_p = 1, q=5𝑞5q=5italic_q = 5 together yield five roots, but only if |C|8.358104greater-than-or-equivalent-to𝐶8.358superscript104|C|\gtrsim 8.358\cdot 10^{-4}| italic_C | ≳ 8.358 ⋅ 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT (see eq. (9.9)), whereas the choice p=0𝑝0p=0italic_p = 0, q=5𝑞5q=5italic_q = 5 also yields five roots, but only if |C|0.327104less-than-or-similar-to𝐶0.327superscript104|C|\lesssim 0.327\cdot 10^{-4}| italic_C | ≲ 0.327 ⋅ 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT (see eq. (9.15)). Hence there is a gap of values for which there is no convergent series solution of the Brioschi quintic, for any choice of p𝑝pitalic_p and q𝑞qitalic_q, given by

17+132322749|C|17+132322749.17132322749𝐶17132322749\frac{-17+13\sqrt{2}}{32\cdot 27\cdot 49}\leq|C|\leq\frac{17+13\sqrt{2}}{32% \cdot 27\cdot 49}\,.divide start_ARG - 17 + 13 square-root start_ARG 2 end_ARG end_ARG start_ARG 32 ⋅ 27 ⋅ 49 end_ARG ≤ | italic_C | ≤ divide start_ARG 17 + 13 square-root start_ARG 2 end_ARG end_ARG start_ARG 32 ⋅ 27 ⋅ 49 end_ARG . (9.21)

The Brioschi quintic demonstrates that the formulas in Sec. 8 do not imply that the domains of convergence for an algebraic equation of degree n𝑛nitalic_n span all values of the coefficients, i.e. all of n+1superscript𝑛1\mathbb{C}^{n+1}blackboard_C start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT. The criterion for convergence may not be satisfied for any of the choices of p𝑝pitalic_p and q𝑞qitalic_q. Nevertheless, it was shown in Sec. 5 that convergent series solutions do exist for all values of the coefficients of a general quintic, via the use of the Bring-Jerrand normal form. Hence one must examine every algebraic equation on its merits; some transformations may work better than others.

10 Conclusion

This author was led to the main ideas of this paper because they are required to prove results in probability and statistics (not reported here). The papers by numerous authors were cited and the various notations, definitions, identities and nomenclature were collected in a common setting. Note that although most of the derivations in the literature treat only integer valued parameters, Theorem 2.4 is applicable for arbitrary complex coefficients and real (or even complex) exponents. The early works by Lambert [28, 29] and Euler [16] and were shown to be Fuss–Catalan series. An important application of the formalism was the solution of algebraic equations by infinite series. This is a heavily studied problem and contact was made with the works of numerous authors [5, 9, 15, 31, 32, 33]. An example was to present convergent Fuss–Catalan series solutions for all the roots the Bring-Jerrard normal form, thence the roots of a general quintic, for arbitrary values of the quintic coefficients. Two bounds for the absolute convergence of general Fuss–Catalan series were derived (necessary but not sufficient and sufficient but not necessary). For the important special case of the solutions of algebraic equations by infinite series, a new necessary and sufficient bound for absolute convergence was presented in Sec. 8, correcting and extending earlier work in the field [36].

Acknowledgements

I am deeply indebted to numerous individuals who helped me generously with their time and enthusiasm. However, special mention must go to Professors R. E. Borcherds and S. J. Dilworth, and to Dr. P. M. Strickland, without whose unflagging support this work simply would not have seen fruition. It is my enormous pleasure to thank them, and also, in alphabetical order, Professors I. M. Gessel, H. W. Gould, R. L. Graham, O. Patashnik, T. J. Ransford, R. P. Stanley and D. Zeilberger.
Addendum: I thank Drs. F. de Sousa Coelho and D. Rubine for pointing out misprints in previous versions of this note.

Appendix A Miscellaneous items

This Appendix lists various items which were not used in the main body of the paper, essentially for completeness of the exposition. According to the information in Appendix B of Stanley’s text [41] (the Appendix was written by Pak), the name ‘Catalan numbers’ only came into prominent use after the publication of Riordan’s monograph [38] (in 1968, first edition). Hence it is understandable if authors such as Gould [20, 21] and Raney [37], also earlier authors such as Mellin [33] and Schläfli [39], did not mention Fuss or Catalan. Belardinelli’s memoir [4] contains an overview of the solutions of algebraic equations using hypergeometric series. His extensive bibliography lists several papers on functions of several complex variables, but not papers on combinatorics such as by Raney [37]. There is evidently a diversity of notations and terminology, and duplication of proofs.

The ‘diversity of notations’ leads to an immediate caveat: different authors employ the same symbols, such as n𝑛nitalic_n, p𝑝pitalic_p or q𝑞qitalic_q, to mean different things and it is impractical to disambiguate all the notations in the equations below. The reader is warned to consult the original literature for the precise meanings of all symbols displayed below.

Turning to technical matters, Mohanty [34] derived additional convolution identities not mentioned in the main text above. I list only one, viz. [34, eq. (11)], because it subsumes the others as special cases. In the notation of this paper, [34, eq. (11)] is

𝒋k(p+𝒒𝒋)𝒜𝒋(𝒃,a)𝒜𝒏𝒋(𝒃,c)=p(a+c)+a𝒒𝒏a+c𝒜𝒏(𝒃,a+c).subscript𝒋superscript𝑘𝑝𝒒𝒋subscript𝒜𝒋𝒃𝑎subscript𝒜𝒏𝒋𝒃𝑐𝑝𝑎𝑐𝑎𝒒𝒏𝑎𝑐subscript𝒜𝒏𝒃𝑎𝑐\sum_{\bm{j}\in\mathbb{N}^{k}}(p+\bm{q}\cdot\bm{j})\mathscr{A}_{\bm{j}}(\bm{b}% ,a)\mathscr{A}_{\bm{n}-\bm{j}}(\bm{b},c)=\frac{p(a+c)+a\bm{q}\cdot\bm{n}}{a+c}% \mathscr{A}_{\bm{n}}(\bm{b},a+c)\,.∑ start_POSTSUBSCRIPT bold_italic_j ∈ blackboard_N start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_p + bold_italic_q ⋅ bold_italic_j ) script_A start_POSTSUBSCRIPT bold_italic_j end_POSTSUBSCRIPT ( bold_italic_b , italic_a ) script_A start_POSTSUBSCRIPT bold_italic_n - bold_italic_j end_POSTSUBSCRIPT ( bold_italic_b , italic_c ) = divide start_ARG italic_p ( italic_a + italic_c ) + italic_a bold_italic_q ⋅ bold_italic_n end_ARG start_ARG italic_a + italic_c end_ARG script_A start_POSTSUBSCRIPT bold_italic_n end_POSTSUBSCRIPT ( bold_italic_b , italic_a + italic_c ) . (1.1)

All of a𝑎aitalic_a, c𝑐citalic_c, p𝑝pitalic_p, 𝒃𝒃\bm{b}bold_italic_b and 𝒒𝒒\bm{q}bold_italic_q are complex valued and 𝒃𝒃\bm{b}bold_italic_b, 𝒋𝒋\bm{j}bold_italic_j, 𝒏𝒏\bm{n}bold_italic_n and 𝒒𝒒\bm{q}bold_italic_q are k𝑘kitalic_k-tuples. Put 𝒒=0𝒒0\bm{q}=0bold_italic_q = 0 then eq. (1.1) yields [34, eq. (9)], which is eq. (2.17) above. Put p=c+𝒃𝒏𝑝𝑐𝒃𝒏p=c+\bm{b}\cdot\bm{n}italic_p = italic_c + bold_italic_b ⋅ bold_italic_n and 𝒒=𝒃𝒒𝒃\bm{q}=-\bm{b}bold_italic_q = - bold_italic_b then eq. (1.1) yields [34, eq. (10)]. Mohanty [34] actually cited Gould [21] for the single-parameter convolution identities; Mohanty generalized them to multiparameter versions. Gould [20, 21] proved several convolution identities and suggested that they are all special cases of a single general formula. The exposition below follows Raney’s [37] summary of Gould’s work. Define the numbers [37, eq. (7.7)]

G(α,0;β,γ)=1,G(α,n;β,γ)=αn!m=1n1(α+βnγm).formulae-sequence𝐺𝛼0𝛽𝛾1𝐺𝛼𝑛𝛽𝛾𝛼𝑛superscriptsubscriptproduct𝑚1𝑛1𝛼𝛽𝑛𝛾𝑚\begin{split}G(\alpha,0;\beta,\gamma)&=1\,,\\ G(\alpha,n;\beta,\gamma)&=\frac{\alpha}{n!}\prod_{m=1}^{n-1}(\alpha+\beta n-% \gamma m)\,.\end{split}start_ROW start_CELL italic_G ( italic_α , 0 ; italic_β , italic_γ ) end_CELL start_CELL = 1 , end_CELL end_ROW start_ROW start_CELL italic_G ( italic_α , italic_n ; italic_β , italic_γ ) end_CELL start_CELL = divide start_ARG italic_α end_ARG start_ARG italic_n ! end_ARG ∏ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT ( italic_α + italic_β italic_n - italic_γ italic_m ) . end_CELL end_ROW (1.2)

See also the polynomial Pk(p)subscript𝑃𝑘𝑝P_{k}(p)italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_p ) by Gould [21, Sec. 5] and associated comments therein about the work of Schläfli [39]. Then [37, eq. (7.8)]

G(α1+α2,n;β,γ)=n1+n2=nG(α1,n1;β,γ)G(α2,n2;β,γ).𝐺subscript𝛼1subscript𝛼2𝑛𝛽𝛾subscriptsubscript𝑛1subscript𝑛2𝑛𝐺subscript𝛼1subscript𝑛1𝛽𝛾𝐺subscript𝛼2subscript𝑛2𝛽𝛾G(\alpha_{1}+\alpha_{2},n;\beta,\gamma)=\sum_{n_{1}+n_{2}=n}G(\alpha_{1},n_{1}% ;\beta,\gamma)G(\alpha_{2},n_{2};\beta,\gamma)\,.italic_G ( italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_n ; italic_β , italic_γ ) = ∑ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_n end_POSTSUBSCRIPT italic_G ( italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ; italic_β , italic_γ ) italic_G ( italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ; italic_β , italic_γ ) . (1.3)

Here α1,α2,β,γ𝔸subscript𝛼1subscript𝛼2𝛽𝛾𝔸\alpha_{1},\alpha_{2},\beta,\gamma\in\mathbb{A}italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_β , italic_γ ∈ blackboard_A where 𝔸𝔸\mathbb{A}blackboard_A is a commutative ring and n,n1,n2𝑛subscript𝑛1subscript𝑛2n,n_{1},n_{2}\in\mathbb{N}italic_n , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ blackboard_N. Gould [21, Sec. 5] also proved that the convolution identity derived by Schläfli [39], in the latter’s 1847 paper on Lambert series, was equivalent to [20, eq. (10)]. See also Riordan [38] for additional combinatorial identities and Strehl [43] for an overview of numerous multiparameter identities. If γ=0𝛾0\gamma=0italic_γ = 0 then

G(α,n;β,0)=αα+βn(α+βn)nn!.𝐺𝛼𝑛𝛽0𝛼𝛼𝛽𝑛superscript𝛼𝛽𝑛𝑛𝑛G(\alpha,n;\beta,0)=\frac{\alpha}{\alpha+\beta n}\frac{(\alpha+\beta n)^{n}}{n% !}\,.italic_G ( italic_α , italic_n ; italic_β , 0 ) = divide start_ARG italic_α end_ARG start_ARG italic_α + italic_β italic_n end_ARG divide start_ARG ( italic_α + italic_β italic_n ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_n ! end_ARG . (1.4)

If γ0𝛾0\gamma\neq 0italic_γ ≠ 0 then the above is proportional to a Fuss–Catalan number

G(α,n;β,γ)=(α/γ)γnn!m=1n1(α+βnγm)=γnAn(β/γ,α/γ).𝐺𝛼𝑛𝛽𝛾𝛼𝛾superscript𝛾𝑛𝑛superscriptsubscriptproduct𝑚1𝑛1𝛼𝛽𝑛𝛾𝑚superscript𝛾𝑛subscript𝐴𝑛𝛽𝛾𝛼𝛾\begin{split}G(\alpha,n;\beta,\gamma)&=\frac{(\alpha/\gamma)\gamma^{n}}{n!}% \prod_{m=1}^{n-1}\Bigl{(}\frac{\alpha+\beta n}{\gamma}-m\Bigr{)}\\ &=\gamma^{n}A_{n}(\beta/\gamma,\alpha/\gamma)\,.\end{split}start_ROW start_CELL italic_G ( italic_α , italic_n ; italic_β , italic_γ ) end_CELL start_CELL = divide start_ARG ( italic_α / italic_γ ) italic_γ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_n ! end_ARG ∏ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_α + italic_β italic_n end_ARG start_ARG italic_γ end_ARG - italic_m ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_γ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_β / italic_γ , italic_α / italic_γ ) . end_CELL end_ROW (1.5)

Note that this relation works for n=0𝑛0n=0italic_n = 0 also. A multiparameter generalization might be as follows

G(α,𝟎;𝜷,γ)=1,G(α,𝒏;𝜷,γ)=αn1!nk!m=1|𝒏|1(α+𝜷𝒏γm).formulae-sequence𝐺𝛼0𝜷𝛾1𝐺𝛼𝒏𝜷𝛾𝛼subscript𝑛1subscript𝑛𝑘superscriptsubscriptproduct𝑚1𝒏1𝛼𝜷𝒏𝛾𝑚\begin{split}G(\alpha,\bm{0};\bm{\beta},\gamma)&=1\,,\\ G(\alpha,\bm{n};\bm{\beta},\gamma)&=\frac{\alpha}{n_{1}!\cdots n_{k}!}\prod_{m% =1}^{|\bm{n}|-1}(\alpha+\bm{\beta}\cdot\bm{n}-\gamma m)\,.\end{split}start_ROW start_CELL italic_G ( italic_α , bold_0 ; bold_italic_β , italic_γ ) end_CELL start_CELL = 1 , end_CELL end_ROW start_ROW start_CELL italic_G ( italic_α , bold_italic_n ; bold_italic_β , italic_γ ) end_CELL start_CELL = divide start_ARG italic_α end_ARG start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! ⋯ italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ! end_ARG ∏ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | bold_italic_n | - 1 end_POSTSUPERSCRIPT ( italic_α + bold_italic_β ⋅ bold_italic_n - italic_γ italic_m ) . end_CELL end_ROW (1.6)

Gould also published a later paper [22] with additional formulas, but its contents are beyond the scope of this paper. The work of Gould may lead to a more general set of multiparameter identities and generating functions. The matter will be left to future work.

Kahkeshani [24] has defined so-called ‘generalized Catalan numbers’ via

C(m,n)=1n(m1)+1(2n(m1)n,,nm1,n(m1)).𝐶𝑚𝑛1𝑛𝑚11binomial2𝑛𝑚1subscript𝑛𝑛𝑚1𝑛𝑚1C(m,n)=\frac{1}{n(m-1)+1}\binom{2n(m-1)}{\underbrace{n,\dots,n}_{m-1},n(m-1)}\,.italic_C ( italic_m , italic_n ) = divide start_ARG 1 end_ARG start_ARG italic_n ( italic_m - 1 ) + 1 end_ARG ( FRACOP start_ARG 2 italic_n ( italic_m - 1 ) end_ARG start_ARG under⏟ start_ARG italic_n , … , italic_n end_ARG start_POSTSUBSCRIPT italic_m - 1 end_POSTSUBSCRIPT , italic_n ( italic_m - 1 ) end_ARG ) . (1.7)

Let us process this as follows. Set k=m1𝑘𝑚1k=m-1italic_k = italic_m - 1 and r=1𝑟1r=1italic_r = 1 in eq. (2.1). Also set t1==tk=nsubscript𝑡1subscript𝑡𝑘𝑛t_{1}=\cdots=t_{k}=nitalic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ⋯ = italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_n and μ1==μk=2subscript𝜇1subscript𝜇𝑘2\mu_{1}=\cdots=\mu_{k}=2italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ⋯ = italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 2, so |𝒕|=n(m1)𝒕𝑛𝑚1|\bm{t}|=n(m-1)| bold_italic_t | = italic_n ( italic_m - 1 ) and 𝒕𝝁=2n(m1)𝒕𝝁2𝑛𝑚1\bm{t}\cdot\bm{\mu}=2n(m-1)bold_italic_t ⋅ bold_italic_μ = 2 italic_n ( italic_m - 1 ). Then

C(m,n)=1n(m1)+11(n!)m1j=0n(m1)1(2n(m1)j)=1(n!)m1j=1n(m1)1(2n(m1)+1j)=𝒜(n,,n)((2,,2),1).𝐶𝑚𝑛1𝑛𝑚111superscript𝑛𝑚1superscriptsubscriptproduct𝑗0𝑛𝑚112𝑛𝑚1𝑗1superscript𝑛𝑚1superscriptsubscriptproduct𝑗1𝑛𝑚112𝑛𝑚11𝑗subscript𝒜𝑛𝑛221\begin{split}C(m,n)&=\frac{1}{n(m-1)+1}\frac{1}{(n!)^{m-1}}\prod_{j=0}^{n(m-1)% -1}(2n(m-1)-j)\\ &=\frac{1}{(n!)^{m-1}}\prod_{j=1}^{n(m-1)-1}(2n(m-1)+1-j)\\ &=\mathscr{A}_{(n,\dots,n)}((2,\dots,2),1)\,.\end{split}start_ROW start_CELL italic_C ( italic_m , italic_n ) end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG italic_n ( italic_m - 1 ) + 1 end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_n ! ) start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT end_ARG ∏ start_POSTSUBSCRIPT italic_j = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n ( italic_m - 1 ) - 1 end_POSTSUPERSCRIPT ( 2 italic_n ( italic_m - 1 ) - italic_j ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG ( italic_n ! ) start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n ( italic_m - 1 ) - 1 end_POSTSUPERSCRIPT ( 2 italic_n ( italic_m - 1 ) + 1 - italic_j ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = script_A start_POSTSUBSCRIPT ( italic_n , … , italic_n ) end_POSTSUBSCRIPT ( ( 2 , … , 2 ) , 1 ) . end_CELL end_ROW (1.8)

Hence Kahkeshani’s definition is a special case of the multiparameter Fuss–Catalan numbers defined in this paper. Note that Chu’s [11] and Kahkeshani’s [24] nomenclature ‘generalized Catalan numbers’ should not be confused with each other.

We close with a comment on the paper by Aval [2], who defined so-called ‘multivariate Fuss–Catalan numbers’ via [2, remark 3.2]

Bp(n,k1,k2,,kp1)=(i=1p1(n+ki1ki))ni=1p1kin.subscript𝐵𝑝𝑛subscript𝑘1subscript𝑘2subscript𝑘𝑝1superscriptsubscriptproduct𝑖1𝑝1binomial𝑛subscript𝑘𝑖1subscript𝑘𝑖𝑛superscriptsubscript𝑖1𝑝1subscript𝑘𝑖𝑛B_{p}(n,k_{1},k_{2},\dots,k_{p-1})=\biggl{(}\prod_{i=1}^{p-1}\binom{n+k_{i}-1}% {k_{i}}\biggr{)}\,\frac{n-\sum_{i=1}^{p-1}k_{i}}{n}\,.italic_B start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_n , italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_k start_POSTSUBSCRIPT italic_p - 1 end_POSTSUBSCRIPT ) = ( ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p - 1 end_POSTSUPERSCRIPT ( FRACOP start_ARG italic_n + italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1 end_ARG start_ARG italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) ) divide start_ARG italic_n - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p - 1 end_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG . (1.9)

Clearly Bp()=1subscript𝐵𝑝1B_{p}(\cdot)=1italic_B start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( ⋅ ) = 1 for p=0𝑝0p=0italic_p = 0 and p=1𝑝1p=1italic_p = 1. For p2𝑝2p\geq 2italic_p ≥ 2 we have

Bp(n,k1,k2,,kp1)=ni=1p1kini=1p1(1k1!j=0ki1(n+ki1j))=np2(n|𝒌|)i=1p1(1k1!j=1ki1(n+kij))=np2(n|𝒌|)i=1p1Aki(1,n).subscript𝐵𝑝𝑛subscript𝑘1subscript𝑘2subscript𝑘𝑝1𝑛superscriptsubscript𝑖1𝑝1subscript𝑘𝑖𝑛superscriptsubscriptproduct𝑖1𝑝11subscript𝑘1superscriptsubscriptproduct𝑗0subscript𝑘𝑖1𝑛subscript𝑘𝑖1𝑗superscript𝑛𝑝2𝑛𝒌superscriptsubscriptproduct𝑖1𝑝11subscript𝑘1superscriptsubscriptproduct𝑗1subscript𝑘𝑖1𝑛subscript𝑘𝑖𝑗superscript𝑛𝑝2𝑛𝒌superscriptsubscriptproduct𝑖1𝑝1subscript𝐴subscript𝑘𝑖1𝑛\begin{split}B_{p}(n,k_{1},k_{2},\dots,k_{p-1})&=\frac{n-\sum_{i=1}^{p-1}k_{i}% }{n}\prod_{i=1}^{p-1}\biggl{(}\frac{1}{k_{1}!}\prod_{j=0}^{k_{i}-1}(n+k_{i}-1-% j)\biggr{)}\\ &=n^{p-2}(n-|\bm{k}|)\prod_{i=1}^{p-1}\biggl{(}\frac{1}{k_{1}!}\prod_{j=1}^{k_% {i}-1}(n+k_{i}-j)\biggr{)}\\ &=n^{p-2}(n-|\bm{k}|)\prod_{i=1}^{p-1}A_{k_{i}}(1,n)\,.\end{split}start_ROW start_CELL italic_B start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_n , italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_k start_POSTSUBSCRIPT italic_p - 1 end_POSTSUBSCRIPT ) end_CELL start_CELL = divide start_ARG italic_n - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p - 1 end_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p - 1 end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_n + italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1 - italic_j ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_n start_POSTSUPERSCRIPT italic_p - 2 end_POSTSUPERSCRIPT ( italic_n - | bold_italic_k | ) ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p - 1 end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ! end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_n + italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_j ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_n start_POSTSUPERSCRIPT italic_p - 2 end_POSTSUPERSCRIPT ( italic_n - | bold_italic_k | ) ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p - 1 end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 1 , italic_n ) . end_CELL end_ROW (1.10)

Hence for p2𝑝2p\geq 2italic_p ≥ 2, Aval’s definition equals a product of p1𝑝1p-1italic_p - 1 single-parameter Fuss–Catalan numbers, with a prefactor. This is different from the multiparameter Fuss–Catalan numbers defined in this paper.

Appendix B 𝒜𝒜\mathscr{A}script_A-hypergeometric series

Sturmfels [44] published an elegant analysis employing so-called 𝒜𝒜\mathscr{A}script_A-hypergeometric series to solve for the roots of the general algebraic equation of degree n𝑛nitalic_n. A brief comparison with his work is presented here. His first example is for the quintic. Let us write the quintic in the form

x=a0a1a2a1x2a3a1x3a4a1x4a5a1x5.𝑥subscript𝑎0subscript𝑎1subscript𝑎2subscript𝑎1superscript𝑥2subscript𝑎3subscript𝑎1superscript𝑥3subscript𝑎4subscript𝑎1superscript𝑥4subscript𝑎5subscript𝑎1superscript𝑥5x=-\frac{a_{0}}{a_{1}}-\frac{a_{2}}{a_{1}}x^{2}-\frac{a_{3}}{a_{1}}x^{3}-\frac% {a_{4}}{a_{1}}x^{4}-\frac{a_{5}}{a_{1}}x^{5}\,.italic_x = - divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG - divide start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (2.1)

This corresponds to p=0𝑝0p=0italic_p = 0 and q=1𝑞1q=1italic_q = 1 in my terminology, so qp=1𝑞𝑝1q-p=1italic_q - italic_p = 1 and the Fuss–Catalan series yields one root, which is

xroot=a0a1[𝒕4A𝒕(𝝁,1)eiπ𝒕𝝁a0t𝒕𝝁a1𝒕𝝁(j𝒩npqajtj)]=a0a1[ 1+a0a2a12a02a3a13+a03a4a14a04a5a15+2a02a22a145a03a2a3a15+].subscript𝑥rootsubscript𝑎0subscript𝑎1delimited-[]subscript𝒕superscript4subscript𝐴𝒕𝝁1superscript𝑒𝑖𝜋𝒕𝝁superscriptsubscript𝑎0𝑡𝒕𝝁superscriptsubscript𝑎1𝒕𝝁subscriptproduct𝑗subscript𝒩𝑛𝑝𝑞superscriptsubscript𝑎𝑗subscript𝑡𝑗subscript𝑎0subscript𝑎1delimited-[]1subscript𝑎0subscript𝑎2superscriptsubscript𝑎12superscriptsubscript𝑎02subscript𝑎3superscriptsubscript𝑎13superscriptsubscript𝑎03subscript𝑎4superscriptsubscript𝑎14superscriptsubscript𝑎04subscript𝑎5superscriptsubscript𝑎152superscriptsubscript𝑎02superscriptsubscript𝑎22superscriptsubscript𝑎145superscriptsubscript𝑎03subscript𝑎2subscript𝑎3superscriptsubscript𝑎15\begin{split}x_{\rm root}&=-\frac{a_{0}}{a_{1}}\biggl{[}\sum_{\bm{t}\in\mathbb% {N}^{4}}A_{\bm{t}}(\bm{\mu},1)\,\frac{e^{-i\pi\bm{t}\cdot\bm{\mu}}}{a_{0}^{t-% \bm{t}\cdot\bm{\mu}}a_{1}^{\bm{t}\cdot\bm{\mu}}}\Bigl{(}\prod_{j\in\mathscr{N}% _{npq}}a_{j}^{t_{j}}\Bigr{)}\biggr{]}\\ &=-\frac{a_{0}}{a_{1}}\biggl{[}\,1+\frac{a_{0}a_{2}}{a_{1}^{2}}-\frac{a_{0}^{2% }a_{3}}{a_{1}^{3}}+\frac{a_{0}^{3}a_{4}}{a_{1}^{4}}-\frac{a_{0}^{4}a_{5}}{a_{1% }^{5}}+\frac{2a_{0}^{2}a_{2}^{2}}{a_{1}^{4}}-\frac{5a_{0}^{3}a_{2}a_{3}}{a_{1}% ^{5}}+\cdots\biggr{]}\,.\end{split}start_ROW start_CELL italic_x start_POSTSUBSCRIPT roman_root end_POSTSUBSCRIPT end_CELL start_CELL = - divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG [ ∑ start_POSTSUBSCRIPT bold_italic_t ∈ blackboard_N start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , 1 ) divide start_ARG italic_e start_POSTSUPERSCRIPT - italic_i italic_π bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT end_ARG ( ∏ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = - divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG [ 1 + divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG - divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG - divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT end_ARG + divide start_ARG 2 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG - divide start_ARG 5 italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT end_ARG + ⋯ ] . end_CELL end_ROW (2.2)

This equals the root X1,1subscript𝑋11X_{1,-1}italic_X start_POSTSUBSCRIPT 1 , - 1 end_POSTSUBSCRIPT of Sturmfels [44]. Next let us select p=0𝑝0p=0italic_p = 0 and q=5𝑞5q=5italic_q = 5 and write

x5=a0a5a1a5xa2a5x2a3a5x3a4a5x4.superscript𝑥5subscript𝑎0subscript𝑎5subscript𝑎1subscript𝑎5𝑥subscript𝑎2subscript𝑎5superscript𝑥2subscript𝑎3subscript𝑎5superscript𝑥3subscript𝑎4subscript𝑎5superscript𝑥4x^{5}=-\frac{a_{0}}{a_{5}}-\frac{a_{1}}{a_{5}}x-\frac{a_{2}}{a_{5}}x^{2}-\frac% {a_{3}}{a_{5}}x^{3}-\frac{a_{4}}{a_{5}}x^{4}\,.italic_x start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT = - divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG - divide start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG italic_x - divide start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT - divide start_ARG italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT . (2.3)

The series yields five roots. Following Sturmfels, we define ξ=eiπ(2+1)/5𝜉superscript𝑒𝑖𝜋215\xi=e^{i\pi(2\ell+1)/5}italic_ξ = italic_e start_POSTSUPERSCRIPT italic_i italic_π ( 2 roman_ℓ + 1 ) / 5 end_POSTSUPERSCRIPT as a root of 11-1- 1. Then the roots of the quintic are given by

xξ=ξa01/5a51/5𝒕4A𝒕(𝝁,15)ξ5𝒕𝝁a0t𝒕𝝁a5𝒕𝝁(j𝒩npqajtj)=ξa01/5a51/5+15(ξ2a1a03/5a52/5+ξ3a2a04/5a53/5+ξ4a3a01/5a54/5a4a5)+subscript𝑥𝜉𝜉superscriptsubscript𝑎015superscriptsubscript𝑎515subscript𝒕superscript4subscript𝐴𝒕𝝁15superscript𝜉5𝒕𝝁superscriptsubscript𝑎0𝑡𝒕𝝁superscriptsubscript𝑎5𝒕𝝁subscriptproduct𝑗subscript𝒩𝑛𝑝𝑞superscriptsubscript𝑎𝑗subscript𝑡𝑗𝜉superscriptsubscript𝑎015superscriptsubscript𝑎51515superscript𝜉2subscript𝑎1superscriptsubscript𝑎035superscriptsubscript𝑎525superscript𝜉3subscript𝑎2superscriptsubscript𝑎045superscriptsubscript𝑎535superscript𝜉4subscript𝑎3superscriptsubscript𝑎015superscriptsubscript𝑎545subscript𝑎4subscript𝑎5\begin{split}x_{\xi}&=\xi\,\frac{a_{0}^{1/5}}{a_{5}^{1/5}}\,\sum_{\bm{t}\in% \mathbb{N}^{4}}A_{\bm{t}}\Bigl{(}\bm{\mu},\frac{1}{5}\Bigr{)}\,\frac{\xi^{5\bm% {t}\cdot\bm{\mu}}}{a_{0}^{t-\bm{t}\cdot\bm{\mu}}a_{5}^{\bm{t}\cdot\bm{\mu}}}% \Bigl{(}\prod_{j\in\mathscr{N}_{npq}}a_{j}^{t_{j}}\Bigr{)}\\ &=\xi\,\frac{a_{0}^{1/5}}{a_{5}^{1/5}}+\frac{1}{5}\biggl{(}\frac{\xi^{2}a_{1}}% {a_{0}^{3/5}a_{5}^{2/5}}+\frac{\xi^{3}a_{2}}{a_{0}^{4/5}a_{5}^{3/5}}+\frac{\xi% ^{4}a_{3}}{a_{0}^{1/5}a_{5}^{4/5}}-\frac{a_{4}}{a_{5}}\biggr{)}+\cdots\end{split}start_ROW start_CELL italic_x start_POSTSUBSCRIPT italic_ξ end_POSTSUBSCRIPT end_CELL start_CELL = italic_ξ divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT bold_italic_t ∈ blackboard_N start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT ( bold_italic_μ , divide start_ARG 1 end_ARG start_ARG 5 end_ARG ) divide start_ARG italic_ξ start_POSTSUPERSCRIPT 5 bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT bold_italic_t ⋅ bold_italic_μ end_POSTSUPERSCRIPT end_ARG ( ∏ start_POSTSUBSCRIPT italic_j ∈ script_N start_POSTSUBSCRIPT italic_n italic_p italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_ξ divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT end_ARG + divide start_ARG 1 end_ARG start_ARG 5 end_ARG ( divide start_ARG italic_ξ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 / 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 / 5 end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_ξ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 / 5 end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_ξ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT end_ARG - divide start_ARG italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG ) + ⋯ end_CELL end_ROW (2.4)

These are the leading terms of the 𝒜𝒜\mathscr{A}script_A-hypergeometric series for the roots X5,ξsubscript𝑋5𝜉X_{5,\xi}italic_X start_POSTSUBSCRIPT 5 , italic_ξ end_POSTSUBSCRIPT of Sturmfels (see [44] for details of his notation)

X5,ξ=ξ[a01/5a51/5]+15(ξ2[a1a03/5a52/5]+ξ3[a2a02/5a53/5]+ξ4[a3a01/5a54/5][a4a5]).subscript𝑋5𝜉𝜉delimited-[]superscriptsubscript𝑎015superscriptsubscript𝑎51515superscript𝜉2delimited-[]subscript𝑎1superscriptsubscript𝑎035superscriptsubscript𝑎525superscript𝜉3delimited-[]subscript𝑎2superscriptsubscript𝑎025superscriptsubscript𝑎535superscript𝜉4delimited-[]subscript𝑎3superscriptsubscript𝑎015superscriptsubscript𝑎545delimited-[]subscript𝑎4subscript𝑎5X_{5,\xi}=\xi\,\biggl{[}\frac{a_{0}^{1/5}}{a_{5}^{1/5}}\biggr{]}+\frac{1}{5}% \biggl{(}\xi^{2}\,\biggl{[}\frac{a_{1}}{a_{0}^{3/5}a_{5}^{2/5}}\biggr{]}+\xi^{% 3}\,\biggl{[}\frac{a_{2}}{a_{0}^{2/5}a_{5}^{3/5}}\biggr{]}+\xi^{4}\,\biggl{[}% \frac{a_{3}}{a_{0}^{1/5}a_{5}^{4/5}}\biggr{]}-\biggl{[}\frac{a_{4}}{a_{5}}% \biggr{]}\biggr{)}\,.italic_X start_POSTSUBSCRIPT 5 , italic_ξ end_POSTSUBSCRIPT = italic_ξ [ divide start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT end_ARG ] + divide start_ARG 1 end_ARG start_ARG 5 end_ARG ( italic_ξ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ divide start_ARG italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 / 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 / 5 end_POSTSUPERSCRIPT end_ARG ] + italic_ξ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT [ divide start_ARG italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 / 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 / 5 end_POSTSUPERSCRIPT end_ARG ] + italic_ξ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT [ divide start_ARG italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 / 5 end_POSTSUPERSCRIPT end_ARG ] - [ divide start_ARG italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_ARG ] ) . (2.5)
  • 1.

    This illustrates a difference between the use of 𝒜𝒜\mathscr{A}script_A-hypergeometric series and Fuss–Catalan series. In general, for a polynomial of degree n𝑛nitalic_n, a total of n𝑛nitalic_n 𝒜𝒜\mathscr{A}script_A-hypergeometric series are required to derive solutions for all the roots. In contrast, a Fuss–Catalan series encapsulates all the roots in one series, cycling through the roots of unity. There is a single formula for all the terms in a Fuss–Catalan series.

  • 2.

    A similar remark applies to the work of Birkeland [9]. In general, a total of |qp|n1superscript𝑞𝑝𝑛1|q-p|^{n-1}| italic_q - italic_p | start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT hypergeometric series are required to express the solutions for all the roots of an algebraic equation of degree n𝑛nitalic_n.

  • 3.

    For the ‘triangulation of unit length’ of the quintic, Sturmfels obtained expressions for the five roots Xj,1subscript𝑋𝑗1X_{j,-1}italic_X start_POSTSUBSCRIPT italic_j , - 1 end_POSTSUBSCRIPT, j=1,,5𝑗15j=1,\dots,5italic_j = 1 , … , 5 (see [44] for details). The example X1,1subscript𝑋11X_{1,-1}italic_X start_POSTSUBSCRIPT 1 , - 1 end_POSTSUBSCRIPT was displayed above. If all of the coefficients of the quintic are real then all of the series for the roots Xj,1subscript𝑋𝑗1X_{j,-1}italic_X start_POSTSUBSCRIPT italic_j , - 1 end_POSTSUBSCRIPT are real. However, a quintic with all real coefficients may not have all real roots. As Sturmfels noted, the 𝒜𝒜\mathscr{A}script_A-hypergeometric series have finite radii of convergence. Sturmfels offered a convergence criterion for the 𝒜𝒜\mathscr{A}script_A-hypergeometric series in his Theorem 3.2, reproduced here for ease of reference. (Consult [44] for definitions and notation).

    Theorem ([44] Theorem 3.2).

    The n𝑛nitalic_n series Xj,ξsubscript𝑋𝑗𝜉X_{j,\xi}italic_X start_POSTSUBSCRIPT italic_j , italic_ξ end_POSTSUBSCRIPT are roots of the general equation of order n𝑛nitalic_n; that is; f(Xj,ξ)=0𝑓subscript𝑋𝑗𝜉0f(X_{j,\xi})=0italic_f ( italic_X start_POSTSUBSCRIPT italic_j , italic_ξ end_POSTSUBSCRIPT ) = 0. There exists a constant M𝑀Mitalic_M such that all n𝑛nitalic_n series Xj,ξsubscript𝑋𝑗𝜉X_{j,\xi}italic_X start_POSTSUBSCRIPT italic_j , italic_ξ end_POSTSUBSCRIPT converge whenever

    |aij1|ijk|aij|kij1M|ak|djfor all 1jr and k{ij1,ij}.superscriptsubscript𝑎subscript𝑖𝑗1subscript𝑖𝑗𝑘superscriptsubscript𝑎subscript𝑖𝑗𝑘subscript𝑖𝑗1𝑀superscriptsubscript𝑎𝑘subscript𝑑𝑗for all 1jr and k{ij1,ij}|a_{i_{j-1}}|^{i_{j}-k}|a_{i_{j}}|^{k-i_{j-1}}\geq M|a_{k}|^{d_{j}}\qquad% \textrm{for all $1\leq j\leq r$ and $k\not\in\{i_{j-1},i_{j}\}$}\,.| italic_a start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_k end_POSTSUPERSCRIPT | italic_a start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_k - italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ≥ italic_M | italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT for all 1 ≤ italic_j ≤ italic_r and italic_k ∉ { italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT , italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } . (2.6)

    The above corrects a misprint in the direction of the inequality in [44, Thm. 3.2]. (I thank Sturmfels [45] for confirming the correct direction of the inequality.)

  • 4.

    Sturmfels also stated (last paragraph in [44, Section 3]) “First, no good bound for M𝑀Mitalic_M seems to be currently known, and, second, for many concrete instances the inequalities (3.2) [this is reproduced as eq. (2.6) above] will not hold for any triangulation. In this case one has to carry out analytic continuation: …” From eq. (3.3), we can supply a value for M𝑀Mitalic_M above. First define

    Mk,ij,ij1=|kij1|kij1|ijk|ijk|ijij1|ijij1.subscript𝑀𝑘subscript𝑖𝑗subscript𝑖𝑗1superscript𝑘subscript𝑖𝑗1𝑘subscript𝑖𝑗1superscriptsubscript𝑖𝑗𝑘subscript𝑖𝑗𝑘superscriptsubscript𝑖𝑗subscript𝑖𝑗1subscript𝑖𝑗subscript𝑖𝑗1M_{k,i_{j},i_{j-1}}=\frac{|k-i_{j-1}|^{k-i_{j-1}}|i_{j}-k|^{i_{j}-k}}{|i_{j}-i% _{j-1}|^{i_{j}-i_{j-1}}}\,.italic_M start_POSTSUBSCRIPT italic_k , italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = divide start_ARG | italic_k - italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_k - italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_k | start_POSTSUPERSCRIPT italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_k end_POSTSUPERSCRIPT end_ARG start_ARG | italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG . (2.7)

    Then M=min(Mk,ij,ij1)/(n1)𝑀subscript𝑀𝑘subscript𝑖𝑗subscript𝑖𝑗1𝑛1M=\min(M_{k,i_{j},i_{j-1}})/(n-1)italic_M = roman_min ( italic_M start_POSTSUBSCRIPT italic_k , italic_i start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_i start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) / ( italic_n - 1 ). Sturmfels was however correct to note that a convergent series solution might not exist for any triangulation. We saw some examples earlier in this paper.

References

  • [1] P. Appell and Kampé de Fériet, Fonctions hypergéométriques et hypersphériques: polynomes d’Hermite, Gauthier-Villars, Paris (1926).
  • [2] J.-C. Aval, “Multivariate Fuss–Catalan numbers,” Discrete Mathematics, 308 (2008) 4660–4669.
  • [3] C. Banderier and M. Drmota, “Formulae and Asymptotics for Coefficients of Algebraic Functions,” Combinatorics, Probability and Computing, 24 (2015) 1–53.
  • [4] G. Belardinelli, Fonctions hypergéométriques de plusieurs variables et résolution analytique des équations algébriques genérales, Gauthier-Villars, Paris (1960).
  • [5] B. C. Berndt, Ramanujan’s Notebooks, Pt. 1, Springer-Verlag, New York, USA, (1985).
  • [6] R. Birkeland, “Résolution de l’equation algebrique trinome par des fonctions hypergéométriques supérieurs,” Comptes Rendus Acad. Sci., 171 (1920) 778–781.
  • [7] R. Birkeland, “Résolution de l’equation genérale du cinquième degré,” Comptes Rendus Acad. Sci., 171 (1920) 1047–1049.
  • [8] R. Birkeland, “Résolution de l’equation algebrique genérale par des fonctions hypergéométriques des plusieurs variables,” Comptes Rendus Acad. Sci., 171 (1920) 1370–1372 and 172 (1921) 309–311.
  • [9] R. Birkeland, “Über die Auflösung algebraischer Gleichungen durch hypergeometrische Funktionen,” Mathematische Zeitschrift, 26 (1927) 1047–1049.
  • [10] F. Brioschi, “Sulla risoluzione delle equazioni di quinto grado,” Ann. Mat. Pura Appl. 1 (1858) 256–259.
  • [11] W. Chu, “A new combinatorial interpretation for generalized Catalan number,” Discrete Mathematics, 65 (1987) 91–94.
  • [12] R. M. Corless, G. H. Gonnet, D. E. G. Hare, D. J. Jeffrey and D. E. Knuth, “On the Lambert W𝑊Witalic_W function,” Advances in Computational Mathematics, 5 (1996) 329–359.
  • [13] S. J. Dilworth and S. R. Mane, “Success Run Waiting Times and Fuss–Catalan Numbers,” Journal of Probability and Statistics, 2015 482462 (2015).
  • [14] S. J. Dilworth and S. R. Mane, “Applications of Fuss–Catalan Numbers to Success Runs of Bernoulli Trials,” Journal of Probability and Statistics, 2016 2071582 (2016).
  • [15] A. Eagle, “Series for all the Roots of a Trinomial Equation,” American Mathematical Monthly, 46 (1939) 422–425.
  • [16] L. Euler, “De serie Lambertina plurimisque eius insignibus proprietatibus,” Acta Academiae Scientarum Imperialis Petropolitinae, 2 (1783) (original date 1779) 29–51. Reprinted in L. Euler, Opera Omnia, Series Prima in “Commentationes Algebraicae,” Teubner, Leipzig, Germany, 6 (1921) 350–369.
  • [17] F. G. M. Eisenstein, “Allgemeine Auflösung der Gleichungen von den ersten vier Geraden,” J. Reine Angew. Math. 27, (1844) 81–83. Mathematische Werke I, 7–9. (in German).
  • [18] W. Feller, An Introduction to Probability Theory and its Applications, Wiley, New York, USA, 3rd Edition, (1968).
  • [19] I. J. Good, “Generalizations to several variables of Lagrange’s expansion, with applications to stochastic processes,” Mathematical Proceedings of the Cambridge Philosophical Society, 56 (1960) 367–380.
  • [20] H. W. Gould, “Some Generalizations of Vandermonde’s Convolution,” American Mathematical Monthly, 63 (1956) 84–91.
  • [21] H. W. Gould, “Final Analysis of Vandermonde’s Convolution,” American Mathematical Monthly, 64 (1957) 409–415.
  • [22] H. W. Gould, “Coefficient Identities for Powers of Taylor and Dirichlet Series,” American Mathematical Monthly, 81 (1974) 3–14.
  • [23] R. L. Graham, D. E. Knuth and O. Patashnik, Concrete Mathematics, Addison-Wesley, New York, USA, 2nd Edition, (1994).
  • [24] R. Kahkeshani, “A Generalization of the Catalan Numbers,” Journal of Integer Sequences, 16 (2013) 13.6.8.
  • [25] F. Kamber, “Formules exprimant les valeurs des coefficients des séries de puissances inverse,” Acta Mathematica, 78 (1946) 193–204.
  • [26] F. Klein, “Vorlesungen über das Ikosaeder und die Auflösung der Gleichungen vom fünften Grade,” Teubner, Leipzig, Germany (1884).
  • [27] M. J. A. Serret, Oeuvres de Lagrange, Gauthier-Villars, Paris (1869).
  • [28] J. Lambert, “Observationes variae in mathesin puram,” Acta Helvetica, 3 (1758) 128–168.
  • [29] J. Lambert, “Observations analytiques,” Nouveaux Mémoires de l’Académie Royale des Sciences et Belles-Lettres, Berlin, (1770) pp. 225–244.
  • [30] P. A. Lambert, “On the Solution of Algebraic Equations in Infinite Series,” Bull. Amer. Math. Soc., 14 (1908) 467–477.
  • [31] A. J. Lewis, “The Solution of Algebraic Equations by Infinite Series,” National Mathematics Magazine, 10 (1935) 80–95.
  • [32] E. McClintock, “A Method for Calculating Simultaneously all the Roots of an Equation.” American Journal of Mathematics, 17 (1895) 89–110.
  • [33] H. Mellin, “Ein allgemeiner Satz über algebraische Gleichungen,” Ann. Soc. Fennicae, (A) 7 (1915) 44S.
  • [34] S. G. Mohanty, “Identities of Rothe-Abel-Schläfli-Hurwitz-type,” SIAM Review, 8 (1966) 501–509.
  • [35] O. Nash, “On Klein’s icosahedral solution of the quintic,” Expositiones Mathematicae, 32 (2014) 99–120.
  • [36] M. Passare and A. Tsikh, “Algebraic Equations and Hypergeometric Series” in The Legacy of Neils Hendrik Abel, Springer-Verlag, Berlin, (2002) 653–672.
  • [37] G. N. Raney, “Functional Composition Patterns and Power Series Reversion,” Transactions of the American Mathematical Society, 94 (1960) 441–451.
  • [38] J. Riordan, Combinatorial Identities, Wiley, New York, USA, 2nd Edition, (1979).
  • [39] L. Schläfli, “Gesammelte Mathematische Abhandlungen,” vol. 1, (reprinted) Springer, Basel, Switzerland, (1950).
  • [40] A. Schuetz and G. Whieldon, “Polygonal Dissections and Reversions of Series,” Involve, 9 (2016) 223–236.
  • [41] R. P. Stanley, Catalan Numbers, Cambridge University Press, New York, USA, (2015).
  • [42] J. Stillwell, “Eisenstein’s Footnote,” The Mathematical Intelligencer, 17, (1995) 58–62.
  • [43] V. Strehl, “Identities of Rothe-Abel-Schläfli-Hurwitz-type,” Discrete Mathematics, 99 (1992) 321–340.
  • [44] B. Sturmfels, “Solving algebraic equations in terms of 𝒜𝒜\mathscr{A}script_A-hypergeometric series,” Discrete Mathematics, 210 (2000) 171–181.
  • [45] B. Sturmfels, private communication (2016).
Refer to caption
Figure 1: Graph of the discriminant Ψ24(|a0|,|a1|)subscriptsuperscriptΨ24subscript𝑎0subscript𝑎1\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) for (|a0|,|a1|)=(a,12a)subscript𝑎0subscript𝑎1𝑎12𝑎(|a_{0}|,|a_{1}|)=(a,\frac{1}{2}a)( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = ( italic_a , divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_a ), plotted as a function of a𝑎aitalic_a.
Refer to caption
Figure 2: Graphs of the discriminant level sets Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(|a_{0}|,|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 (dashed), Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(|a_{0}|,-|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 (dotdash) and Ψ12+(|a0|,|a4|)=0subscriptsuperscriptΨ12subscript𝑎0subscript𝑎40\Psi^{+}_{12}(-|a_{0}|,-|a_{4}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , - | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) = 0 (solid) plotted in the (|a0|,|a4|)subscript𝑎0subscript𝑎4(|a_{0}|,|a_{4}|)( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | ) parameter space. Points which map into the shaded area lie in the domain of convergence.
Refer to caption
Figure 3: Graphs of the discriminant level sets Ψ24+(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{+}_{24}(|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0 (dashed), Ψ24(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{-}_{24}(|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0 (dotdash) and Ψ24(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0 (solid) plotted in the (|a0|,|a1|)subscript𝑎0subscript𝑎1(|a_{0}|,|a_{1}|)( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) parameter space. Points which map into the shaded area lie in the domain of convergence. Points which map into the cross-hatched area are not in the domain of convergence.
Refer to caption
Figure 4: Magnified view of Fig. 3. The solid curve is the level set Ψ24(|a0|,|a1|)=0subscriptsuperscriptΨ24subscript𝑎0subscript𝑎10\Psi^{-}_{24}(-|a_{0}|,|a_{1}|)=0roman_Ψ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 24 end_POSTSUBSCRIPT ( - | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ) = 0. The dashed line is a straight line which demarcates a right-angled triangle in the parameter space (|a0|,|a1|)subscript𝑎0subscript𝑎1(|a_{0}|,|a_{1}|)( | italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , | italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ).