On Hoffman polynomials of $\lambda$ -doubly stochastic irreducible matrices and commutative association schemes

Giusy Monzillo
Faculty of Mathematics, Natural Sciences
and Information Technologies
University of Primorska
Muzejski trg 2, 6000 Koper, Slovenia
[email protected] Safet Penjić
Faculty of Mathematics, Natural Sciences
and Information Technologies; and
Andrej Marušič Institute
University of Primorska
Muzejski trg 2, 6000 Koper, Slovenia
[email protected]

Abstract

Let $\Gamma$ denote a finite (strongly) connected regular (di)graph with adjacency matrix $A$ . The Hoffman polynomial $h(t)$ of $\Gamma=\Gamma(A)$ is the unique polynomial of smallest degree satisfying $h(A)=J$ , where $J$ denotes the all-ones matrix. Let $X$ denote a nonempty finite set. A nonnegative matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ is called $\lambda$ -doubly stochastic if $\sum_{z\in X}(B)_{yz}=\sum_{z\in X}(B)_{zy}=\lambda$ for each $y\in X$ . In this paper we first show that there exists a polynomial $h(t)$ such that $h(B)=J$ if and only if $B$ is a $\lambda$ -doubly stochastic irreducible matrix. This result allows us to define the Hoffman polynomial of a $\lambda$ -doubly stochastic irreducible matrix.

Now, let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal irreducible nonnegative matrix, and ${\mathcal{B}}=\{p(B)\mid p\in{\mathbb{C}}[t]\}$ denote the vector space over ${\mathbb{C}}$ of all polynomials in $B$ . Let us define a $01$ -matrix $\widehat{A}$ in the following way: $(\widehat{A})_{xy}=1$ if and only if $(B)_{xy}>0$ $(x,y\in X)$ . Let $\Gamma=\Gamma(\widehat{A})$ denote a (di)graph with adjacency matrix $\widehat{A}$ , diameter $D$ , and let $A_{D}$ denote the distance- $D$ matrix of $\Gamma$ . We show that ${\mathcal{B}}$ is the Bose–Mesner algebra of a commutative $D$ -class association scheme if and only if $B$ is a normal $\lambda$ -doubly stochastic matrix with $D+1$ distinct eigenvalues and $A_{D}$ is a polynomial in $B$ .

MSC: 05E30, 05C75, 05C50, 05C12, 05C20

Keywords: Hoffman polynomial, doubly stochastic matrix, commutative association schemes, Bose-Mesner algebra.

1 Introduction

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal $\lambda$ -doubly stochastic irreducible matrix, $\Gamma$ denote the underlying weighted digraph of $B$ , and ${\mathcal{B}}=\{p(B)\mid p\in{\mathbb{C}}[t]\}$ denote the vector space over ${\mathbb{C}}$ of all polynomials in $B$ . In this paper, we study connections between commutative association schemes and $\lambda$ -doubly stochastic matrices by considering the following question: under which combinatorial or algebraic restriction on $\Gamma$ is the vector space ${\mathcal{B}}$ the Bose–Mesner algebra of a commutative association scheme? Formal definitions are given in Section 2.

We first give the relevant background before presenting our main results. Let $X$ denote a finite set, and $\mbox{\rm Mat}_{X}({\mathbb{C}})$ the set of complex matrices with rows and columns indexed by $X$ . Let ${\mathcal{R}}=\{R_{0},R_{1},\ldots,R_{d}\}$ denote a set of cardinality $d+1$ of nonempty subsets of $X\times X$ . The elements of the set ${\mathcal{R}}$ are called relations (or classes) on $X$ . For each integer $i$ $(0\leq i\leq d)$ , let $B_{i}\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ denote the adjacency matrix of the graph $(X,R_{i})$ (directed, in general). The pair ${{\mathfrak{X}}}=(X,{\mathcal{R}})$ is a commutative $d$ -class association scheme if the relation matrices $B_{i}$ satisfy the following properties

(AS1)

$B_{0}=I$ , the identity matrix.
(AS2)

$\displaystyle{\sum_{i=0}^{d}B_{i}=J}$ , the all-ones matrix.
(AS3)

${B_{i}}^{\top}\in\{B_{0},B_{1},\ldots,B_{d}\}$ for $0\leq i\leq d$ .
(AS4)

$B_{i}B_{j}$ is a linear combination of $B_{0},B_{1},\ldots,B_{d}$ for $0\leq i,j\leq d$ (i.e., for every $i,j$ $(0\leq i,j\leq d)$ there exist positive integers $p^{h}_{ij}$ $(0\leq h\leq d)$ , known as intersection numbers, such that $B_{i}B_{j}=\sum_{h=0}^{d}p^{h}_{ij}B_{h}$ ).
(AS5)

$B_{i}B_{j}=B_{j}B_{i}$ for every $i,j$ $(0\leq i,j\leq d)$ (i.e., for the intersection numbers $p^{h}_{ij}$ , $0\leq i,j,h\leq d$ , from (AS4) we have that $p^{h}_{ij}=p^{h}_{ji}$ ).

By (AS1)–(AS5) the vector space ${\mathcal{M}}=\operatorname{span}\{B_{0},B_{1},\ldots,B_{d}\}$ is a commutative algebra; it is known as the Bose–Mesner algebra of ${\mathfrak{X}}$ . We say that a matrix $B$ generates ${\mathcal{M}}$ if every element in ${\mathcal{M}}$ can be written as a polynomial in $B$ . We say that ${\mathfrak{X}}$ is symmetric if the $B_{i}$ ’s are symmetric matrices.

A nonnegative matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ such that $\sum_{z\in X}(B)_{yz}=\sum_{z\in X}(B)_{zy}=\lambda$ for each $y\in X$ is called a $\lambda$ -doubly stochastic matrix. If $\lambda=1$ , the matrix is simply called doubly stochastic. The following result was proved by Birkhoff (1946) and independently by von Neumann (1953): Each doubly stochastic matrix $B$ can be represented as a convex combination of permutation matrices, that is,

B=c_{1}P_{1}+c_{2}P_{2}+\cdots+c_{m}P_{m}

(1)

where the $c_{i}$ ’s are positive real numbers with $\sum_{i=1}^{m}c_{i}=1$ , and $P_{1},P_{2},\ldots,P_{m}$ are distinct permutation matrices. It is well known that the convex representation (1) of the doubly stochastic matrix $B$ is unique (up to reordering the terms) if and only if the graph $\Gamma$ is uniquely edge colourable, where $\Gamma$ is a bipartite graph with bipartition $(V_{1},V_{2})$ , where $V_{1}=\{x_{1},x_{2},\ldots,x_{|X|}\}$ , $V_{2}=\{y_{1},y_{2},\ldots,y_{|X|}\}$ and two vertices $x_{i}$ and $y_{j}$ are joined by $\left(\sum_{i=1}^{m}P_{i}\right)_{ij}$ edges $(1\leq i,j\leq|X|)$ (see, for example, [1, Subchapter 9.2]). In [19], Dufossé and Uçar showed that determining the minimal number of permutation matrices needed in (1) is strongly NP-complete. Some interesting papers that study doubly stochastic matrices are, for example, [4, 6, 7, 35, 36]. With respect to representation (1), from our point of view, it would be interesting to study the combinatorial structure of a (di)graph with adjacency matrix $\sum_{i=1}^{m}P_{i}$ . In this paper we study representation (1) in the set-up of Problem 1.1.

Problem 1.1

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a non-negative matrix. Assume that the matrix $B$ has exactly $m+1$ distinct entries $\{0,c_{1},\ldots,c_{m}\}$ , so that we can write $B$ as a linear combination of $01$ -matrices $F_{i}$ $(1\leq i\leq m)$ as follows

B=c_{1}F_{1}+c_{2}F_{2}+\cdots+c_{m}F_{m}

(note that the $c_{i}$ ’s are positive real numbers and our $F_{i}$ ’s are not necessarily permutation matrices). We also assume that $F_{i}$ ’s are $\circ$ -idempotents, i.e., $F_{i}\circ F_{j}=\operatorname{\boldsymbol{O}}$ whenever $i\neq j$ , where $\circ$ denotes the elementwise-Hadamard product. Let $A=\sum_{i=0}^{m}F_{i}$ . Can we describe the combinatorial structure (or give some algebraic properties) of the digraph $\Gamma=\Gamma(A)$ , so that the vector space ${\mathcal{B}}=\{p(B)\mid p\in{\mathbb{C}}[t]\}$ over ${\mathbb{C}}$ of all polynomials in $B$ is the Bose–Mesner algebra of a commutative association scheme? Also, what can we say about the entries of the matrix $B$ ?

Since the all- $1$ matrix $J$ belongs to every commutative association scheme, as a first sub-problem of Problem 1.1, we are interested in the case when $J$ is a polynomial in $B$ . This property implies that $B$ is a $\lambda$ -doubly stochastic matrix (see Theorem 1.1), so we obtain an answer to the second part of the problem. Let us give some background in this direction. For the moment, let $\Gamma$ denote an undirected graph with vertex set $X$ , adjacency matrix $A$ , and let $J$ denote the all-ones matrix of order $|X|$ . In [26], Hoffman proved that there exists a polynomial $p(x)$ such that

p(A)=J

(2)

if and only if $\Gamma$ is connected and regular. In [27], Hoffman and McAndrew studied the case of a directed graph, and obtained a similar result: there exists a polynomial $p(x)$ such that (2) holds if and only if $\Gamma$ is strongly connected and regular. Moreover, they showed that the unique polynomial of smallest degree satisfying (2) is $h(t)=\frac{|X|}{q(k)}q(t)$ , where $\Gamma=\Gamma(A)$ is a regular digraph of valency $k$ , and $(t-k)q(t)$ is the minimal polynomial of $A$ . Next, it is well known that a digraph $\Gamma$ is strongly connected if and only if its adjacency matrix $A$ is irreducible (see, for example, [31, Section 8.3]). For the moment, let $C\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a nonnegative matrix. In [41], Wu and Deng study a polynomial that sends a nonnegative irreducible matrix to a positive rank one matrix; they showed that there is a polynomial $p(t)\in{\mathbb{R}}[t]$ such that $p(C)$ is a positive matrix of rank one if and only if $C$ is irreducible. Moreover, they show that the lowest degree of such a polynomial $p(t)$ with $\operatorname{trace}p(C)=|X|$ is unique. The first main result of our paper is Theorem 1.1, which is in the same spirit as that of Hoffman and McAndrew from [27] (note that one direction of our theorem also follows from [41, Theorem 2.2]).

Figure 1: A doubly stochastic matrix

B

and its underlying weighted digraph. The Hoffman polynomial of

B

h(t)=\frac{8}{q(1)}q(t)

, where

q(t)=t^{7}-\frac{1}{3}t^{6}+\frac{1}{3}t^{5}+\frac{5}{27}t^{4}-\frac{8}{27}t^{% 3}+\frac{8}{27}t^{2}-\frac{32}{243}t

Theorem 1.1

For a nonnegative matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ there exists a polynomial $p\in{\mathbb{C}}[t]$ such that

p(B)=J

(3)

if and only if $B$ is a $\lambda$ -doubly stochastic irreducible matrix. Moreover, the unique polynomial of smallest degree satisfying (3) is $h(t)=\frac{|X|}{q(\lambda)}q(t)$ , where $q(\lambda)\neq 0$ and $(t-\lambda)q(t)$ is the minimal polynomial of $B$ .

We call the polynomial $h(t)$ from the Theorem 1.1 Hoffman polynomial of $B$ . We use Theorem 1.1 to prove Theorem 1.2, giving an algebraic-combinatorial characterization when the Bose–Mesner algebra of a commutative association scheme is generated by a normal $\lambda$ -doubly stochastic matrix (for results about when the Bose–Mesner algebra of a commutative association scheme is generated by a (directed) graph, see [21, 32, 43]). For a normal $\lambda$ -doubly stochastic irreducible matrix $B$ with $d+1$ distinct eigenvalues $\{\lambda,\lambda_{1},\ldots,\lambda_{d}\}$ the Hoffman polynomial is $h(t)=\frac{|X|}{\pi_{0}}\prod_{i=1}^{d}(t-\lambda_{i})$ , where $\pi_{0}=\prod_{i=1}^{d}(\lambda-\lambda_{i})$ . In Section 5, using the inner product $\langle p,q\rangle=\frac{1}{|X|}\operatorname{trace}(p(B)\overline{q(B)}^{\top})$ on the ring ${\mathbb{R}}_{d}[t]$ , we define the so-called “predistance polynomials” $\{p_{i}(t)\}_{i=0}^{d}$ , and we show that $\sum_{i=0}^{d}p_{i}(A)=J$ (see Lemma 4.5). The term “predistance polynomials” is taken from the theory of distance-regular graphs (see, for example, [15, 16, 17, 20, 38]). For the moment Problem 1.1 seems to be a hard problem, so we make one restriction on it: we assume that the number of distinct eigenvalues of $B$ is $D+1$ , where $D$ is the diameter of a graph $\Gamma=\Gamma(A)$ . The motivation for this restriction (again) arises from the theory of distance-regular graphs (see, for example, [12, 20, 22, 33, 34]). As a consequence of our restriction, the second main result of this paper is the following theorem:

Theorem 1.2

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a nonnegative irreducible matrix. Let ${\mathcal{B}}=\{p(B)\mid p\in{\mathbb{C}}[t]\}$ denote the vector space of all polynomials in $B$ . Define a $01$ -matrix $A$ in the following way: $(A)_{xy}=1$ if and only if $(B_{i})_{xy}>0$ . Let $\Gamma=\Gamma(A)$ denote a digraph with adjacency matrix $A$ , diameter $D$ , and let $A_{D}$ denote the distance- $D$ matrix of $\Gamma$ . Then, ${\mathcal{B}}$ is the Bose–Mesner algebra of a commutative $D$ -class association scheme if and only if $B$ is a normal $\lambda$ -doubly stochastic matrix with $D+1$ distinct eigenvalues and $A_{D}$ is a polynomial in $B$ .

Our Theorem 1.2 is an analogue of a result from algebraic graph theory, see for example [22, Proposition 2] or [24], where the authors considered an undirected graph (symmetric adjacency matrix) and proved the following claim: An undirected regular graph $\Gamma=\Gamma(A)$ with diameter $D$ and $d+1$ distinct eigenvalues is a distance-regular if and only if $D=d$ and the distance- $D$ matrix $A_{D}$ is a polynomial in $A$ .

Figure 2: A normal doubly stochastic matrix

B

and its underlying weighted digraph. The Hoffman polynimal of

B

h(t)=16t^{3}-16t^{2}+8t-2

, and the predistance polynomials are

p_{0}(t)=1

p_{1}(t)=4t-2

p_{2}(t)=8t^{2}-8t+2

and

p_{3}(t)=16t^{3}-24t^{2}+12t-3

. By Lemma 4.5,

\sum_{i=0}^{3}p_{i}(B)=J

. Moreover,

B

generates a

3

-class association scheme.

Our paper is organized as follows. In Section 2, we recall basic concepts from algebraic graph theory (experts from the field can skip this section). Our paper then starts from Section 3, where we prove Theorem 1.1. In Section 4 we define predistance-polynomials, the polynomials that we use later in the paper. In Section 5, we prove Theorem 1.2.

2 Preliminaries

A digraph with vertex set $X$ and arc set ${\cal E}$ is a pair $\Gamma=(X,{\cal E})$ which consists of a finite set $X=X(\Gamma)$ of vertices and a set ${\cal E}={\cal E}(\Gamma)$ of arcs (directed edges) between vertices of $\Gamma$ . As the initial and final vertices of an arc are not necessarily different, digraphs may have loops (arcs from a vertex to itself) and multiple arcs, that is, there can be more than one arc from each vertex to any other. If $e=(x,y)\in{\cal E}$ is an arc from $x$ to $y$ , then the vertex $x$ (and the arc $e$ ) is adjacent to the vertex $y$ , and the vertex $y$ (and the arc $e$ ) is adjacent from $x$ . The converse directed graph $\overline{\Gamma}$ is obtained from $\Gamma$ by reversing the direction of each arc. For a vertex $x$ , let $\Gamma_{1}^{\leftarrow}(x)$ (resp. $\Gamma_{1}^{\rightarrow}(x)$ ) denote the set of vertices adjacent to (resp. from) the vertex $x$ . In other words,

\Gamma_{1}^{\rightarrow}(x)=\{z\mid(x,z)\in{\cal E}(\Gamma)\}\qquad\mbox{ and % }\qquad\Gamma_{1}^{\leftarrow}(x)=\{z\mid(z,x)\in{\cal E}(\Gamma)\}.

Two small comments about the above notation: (i) drawing a directed edge from $x$ to $z$ , we have $x\rightarrow z$ , which yields the idea of using the notation $\Gamma_{1}^{\rightarrow}(x)$ ; (ii) drawing a directed edge from $z$ to $x$ , we have $x\leftarrow z$ (or $z\rightarrow x$ ), which yields the idea of using the notation $\Gamma_{1}^{\leftarrow}(x)$ . The elements of $\Gamma_{1}^{\rightarrow}(x)$ are called neighbors of $x$ . Instead of a set of vertices, we can consider a set of arcs: for a vertex $y$ , let $\delta_{1}^{\leftarrow}(y)$ (resp. $\delta_{1}^{\rightarrow}(y)$ ) denote the set of arcs adjacent to (resp. from) the vertex $y$ . The number $|\delta_{1}^{\rightarrow}(y)|$ is called the out-degree of $y$ and is equal to the number of edges leaving $y$ . The number $|\delta_{1}^{\leftarrow}(y)|$ is called the in-degree of $y$ and is equal to the number of edges going to $y$ . A digraph $\Gamma$ is $k$ -regular (of valency $k$ ) if $|\delta^{\rightarrow}_{1}(y)|=|\delta_{1}^{\leftarrow}(y)|=k$ for all $y\in X$ . We call $\Gamma$ simple if $\Gamma$ contains neither loops nor multiple edges.

Let $\Gamma=(X,{\cal E})$ denote a digraph. For any two vertices $x,y\in X$ , a directed walk of length $h$ from $x$ to $y$ is a sequence $[x_{0},x_{1},x_{2},\ldots,x_{h}]$ $(x_{i}\in X,\,0\leq i\leq h)$ such that $x_{0}=x$ , $x_{h}=y$ , and $x_{i}$ is adjacent to $x_{i+1}$ (i.e. $x_{i+1}\in\Gamma^{\rightarrow}_{1}(x_{i})$ ) for $0\leq i\leq h-1$ . We say that $\Gamma$ is strongly connected if for any $x,y\in X$ there is a directed walk from $x$ to $y$ . A closed directed walk is a directed walk from a vertex to itself. A directed path is a directed walk such that all vertices of the directed walk are distinct. A cycle is a closed directed path. The girth of $\Gamma$ is the length of a shortest cycle in $\Gamma$ .

For any $x,y\in X$ , the distance from $x$ to $y$ (or between $x$ and $y$ ), denoted by $\partial(x,y)$ , is the length of a shortest directed path from $x$ to $y$ . The diameter $D=D(\Gamma)$ of a strongly connected digraph $\Gamma$ is defined to be

D=\max\{\partial(y,z)\,|\,y,z\in X\}.

For a vertex $x\in X$ and any nonnegative integer $i$ not exceeding $D$ , let $\Gamma^{\rightarrow}_{i}(x)$ (or $\Gamma_{i}(x)$ ) denote the subset of vertices in $X$ that are at distance $i$ from $x$ , i.e.,

\Gamma^{\rightarrow}_{i}(x)=\{z\in X\mid\partial(x,z)=i\}.

We also define the set $\Gamma^{\leftarrow}_{i}(x)$ as $\Gamma^{\leftarrow}_{i}(x)=\{z\in X\mid\partial(z,x)=i\}$ . Let $\Gamma_{-1}(x)=\Gamma_{D+1}(x):=\emptyset$ . The eccentricity of $x$ , denoted by $\varepsilon=\varepsilon(x)$ , is the maximum distance between $x$ and any other vertex of $\Gamma$ . Note that the diameter of $\Gamma$ equals $\max\{\varepsilon(x)\mid x\in X\}$ .

All undirected graphs in this paper can be understood as digraphs in which an undirected edge between two vertices $x$ and $y$ represents two arcs, an arc from $x$ to $y$ , and an arc from $y$ to $x$ . In diagrams, instead of drawing two arcs, we draw one undirected edge between vertices $x$ and $y$ . For a basic introduction to the theory of undirected graphs we refer to [25, Section 2]. With the word graph we refer to a finite simple digraph.

2.1 Doubly stochastic matrix

Let $X$ denote a nonempty finite set and let ${\mathbb{R}}$ (resp. ${\mathbb{C}}$ ) denote the real number field (resp. the complex number field). Let $\mbox{\rm Mat}_{X}({\mathbb{R}})$ (resp. $\mbox{\rm Mat}_{X}({\mathbb{C}})$ ) denote the ${\mathbb{R}}$ -algebra (resp. the ${\mathbb{C}}$ -algebra) consisting of all matrices whose rows and columns are indexed by $X$ and whose entries are in ${\mathbb{R}}$ (resp. ${\mathbb{C}}$ ).

A square matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ is said to be a $\lambda$ -doubly stochastic if $B\geq\operatorname{\boldsymbol{O}}$ and $B\operatorname{\boldsymbol{j}}=B^{\top}\operatorname{\boldsymbol{j}}=\lambda% \operatorname{\boldsymbol{j}}$ , where $\operatorname{\boldsymbol{O}}$ is the zero square matrix of order $|X|$ , $B\geq\operatorname{\boldsymbol{O}}$ is a shortcut for $(B)_{xy}\geq 0$ (for all $x,y\in X$ ), $\operatorname{\boldsymbol{j}}$ is the $|X|$ -dimensional column-vector with $1$ in all entries, and $B^{\top}$ is the transpose of $B$ . If $\lambda=1$ , the matrix is called doubly stochastic. A permutation matrix $P$ is a square matrix with exactly one $1$ in each row and column, and the rest of the entries being zero.

2.2 Elementary algebraic graph theory

In this section, we recall some definitions and basic concepts from algebraic graph theory.

The adjacency matrix $A\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ of a digraph $\Gamma$ (with vertex set $X$ ) is indexed by the vertices from $X$ , and defined in the following way

\mbox{$(A)_{yz}=$ the number of arcs from $y$ to $z$}\qquad(y,z\in X)

(4)

(note that $(A)_{yz}\in{\mathbb{Z}}^{+}_{0}$ ). The distance- $i$ matrix $A_{i}$ $(2\leq i\leq D)$ of a digraph $\Gamma$ with diameter $D$ and vertex set $X$ is defined by

(A_{i})_{zy}=\left\{\begin{matrix}1&\mbox{ if $\partial(z,y)=i$},\\ 0&\mbox{ otherwise.~{}~{}~{}~{}}\end{matrix}\right.\qquad(z,y\in X,~{}2\leq i% \leq D).

We also define $A_{0}=I$ and $A_{1}=A$ . A matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ is said to be reducible when there exists a permutation matrix $P$ such that $P^{\top}BP=\left(\begin{matrix}X&Y\\ \operatorname{\boldsymbol{O}}&Z\end{matrix}\right)$ , where $X$ and $Z$ are both square, and $\operatorname{\boldsymbol{O}}$ is a zero matrix of suitable size. Otherwise, $B$ is said to be irreducible.

Theorem 2.1 (Perron–Frobenius Theorem)

Let $B$ denote an irreducible nonnegative matrix, and let $\operatorname{eig}(B)$ denote the set of distinct eigenvalues of $B$ . If $\theta=\max\limits_{\lambda\in\operatorname{eig}(B)}|\lambda|$ , then the following hold.

(i)

$\theta\in\operatorname{eig}(B)$ and $\theta>0$ .
(ii)

The algebraic multiplicity of $\theta$ is equal to $1$ .
(iii)

There exists an eigenvector ${\boldsymbol{\nu}}$ with all positive entries, such that $B{\boldsymbol{\nu}}=\theta{\boldsymbol{\nu}}$ .

Sometimes it is useful to normalize a vector ${\boldsymbol{\nu}}$ from (iii) in such a way that the smallest entry is equal to $1$ . Such a vector ${\boldsymbol{\nu}}$ is called a Perron–Frobenius eigenvector.

Proof. See, for example, [31, Section 8.3].

Lemma 2.2 (see, for example, [31, Section 8.3])

A digraph $\Gamma$ with adjacency matrix $A$ is strongly connected if and only if $A$ is an irreducible matrix.

Corollary 2.3

Let $\Gamma=\Gamma(A)$ denote a simple strongly connected digraph, and let $\operatorname{eig}(\Gamma)$ denote the set of distinct eigenvalues of $\Gamma$ . If $\theta=\max_{\lambda\in\operatorname{spec}(\Gamma)}|\lambda|$ , then the following hold.

(i)

$\theta\in\operatorname{eig}(\Gamma)$ and $\theta>0$ .
(ii)

The algebraic multiplicity of $\theta$ is equal to $1$ .
(iii)

There exists an eigenvector ${\boldsymbol{\nu}}$ with all positive entries, such that $A{\boldsymbol{\nu}}=\theta{\boldsymbol{\nu}}$ .

Proof. Routine using Lemma 2.2. (See, for example, [31, Section 8.3].)

A matrix $A\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ is called normal if it commutes with its adjoint, i.e. if $A\overline{A}^{\top}=\overline{A}^{\top}A$ .

Theorem 2.4 (see, for example, [2, Chapter 7])

Let $A\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ denote a matrix over ${\mathbb{C}}$ , with rows and columns indexed by $X$ . Then, the following are equivalent.

(i)

$A$ is normal.
(ii)

${\mathbb{C}}^{|X|}$ has an orthonormal basis consisting of eigenvectors of $A$ .
(iii)

$A$ is a diagonalizable matrix.
(iv)

The algebraic multiplicity of $\lambda$ is equal to the geometric multiplicity of $\lambda$ , for every eigenvalue $\lambda$ of $A$ .

Two matrices $A,B\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ are said to be simultaneously diagonalizable if there is a nonsingular $S\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ such that $S^{-1}AS$ and $S^{-1}BS$ are both diagonal.

Lemma 2.5 ([28, Theorem 1.3.12])

Two diagonalizable matrices are simultaneously diagonalizable if and only if they commute.

Theorem 2.6

Let ${\mathcal{M}}$ denote a space of commutative normal matrices. Then, there exists a unitary matrix $U\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ which diagonalizes ${\mathcal{M}}$ .

Proof. Immediate from Theorem 2.4, Lemma 2.5 and [28, Subsection 1.3].

2.3 Underlying digraph of a nonnegative matrix $B$

The underlying digraph of a nonnegative matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ is defined as a pair $\Gamma=(X,E)$ , in which $X$ denotes the set of vertices (nodes), and $E$ stands for the set of arcs such that $(x,y)\in E$ if and only if $(B)_{xy}>0$ . With other words, the adjacency matrix $A$ of an underlying digraph of a nonnegative matrix $B$ is defined in the following way:

(A)_{xy}=\left\{\begin{array}[]{ll}1&\mbox{ if $B_{xy}>0$},\\ 0&\mbox{ otherwise.}\end{array}\right.\qquad(x,y\in X).

Lemma 2.7

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a nonnegative matrix, and let $A$ denote the adjacency matrix of the underlying digraph of $B$ . Then $B$ is irreducible if and only if $A$ is irreducible.

Proof. Routine.

2.4 Underlying weighted digraph of a nonnegative matrix $B$

A weighted digraph is a digraph whose arcs are assigned values, known as weights. An underlying weighted digraph of a nonnegative matrix $B$ is defined as a triplet $\Delta=(X,E,\omega)$ for which the following (i)–(iii) holds.

(i)

$X$ denotes the set of vertices.
(ii)

$E$ stands for the set of arcs such that $(x,y)\in E$ if and only if $B_{x,y}>0$ .
(iii)

$\omega:E\rightarrow{\mathbb{R}}^{+}_{0}$ stands for a function that weights each arc of the graph, which is defined in the following way: $\omega(x,y)=(B)_{xy}$ $((x,y)\in E)$ .

Note that $\Delta$ is the underlying digraph of a nonnegative matrix $B$ such that each arc $(x,y)$ of $\Delta$ has weight $(B)_{xy}$ .

2.5 Number of walks in $\Gamma$ and $B^{\ell}$

Lemma 2.8 is well known for undirected graphs. We give a corresponding claim for directed graphs, in particular, for our definition of the adjacency matrix $A$ of a digraph $\Gamma$ as given in (4).

Lemma 2.8

Let $\Gamma=\Gamma(A)$ denote a strongly connected digraph with vertex set $X$ , diameter $D$ , and adjacency matrix $A$ . The number of walks of length $\ell\in{\mathbb{N}}$ in $\Gamma$ from $x$ to $y$ is equal to $(x,y)$ -entry of the matrix $A^{\ell}$ .

Proof. Pick $x,y\in X$ . We prove the claim by mathematical induction on $h=\partial(x,y)$ (the distance from $x$ to $y$ ).

Base of induction. For $\ell=1$ the claim is trivial.

Induction step. Assume that, if $\partial(x,y)=h\in\{1,\ldots,m-1\}$ $(m\geq 2)$ , then the number of walks of length $h$ is equal to $(x,y)$ -entry of the matrix $A^{h}$ . We prove that the claim is true for $m$ .

	$\displaystyle(A^{m})_{xy}$	$\displaystyle=(A^{m-1}A)_{xy}$
		$\displaystyle=\sum_{z\in X}(A^{m-1})_{xz}\cdot(A)_{zy}.$		(5)

By the induction assumption, $(A^{m-1})_{xz}$ is the number of walks of length $m-1$ from $x$ to $z$ . The entry $(A)_{zy}$ is nonzero (i.e., equal to the number of walks of length $1$ from $z$ to $y$ ) if and only if in $\Gamma$ there is at least one arc from $z$ to $y$ . So, in (5), we start from the sum $S=0$ , and for every $z\in X$ if there is at least one arc $(z,y)$ we add the number $(A^{m-1})_{xz}\cdot(A)_{zy}$ to $S$ . Thus, the sum (5) represents the number of walks of length $m$ from $x$ to $y$ , and the result follows.

Lemma 2.9

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a nonnegative matrix and $\Gamma=\Gamma(A)$ denote underlying digraph of $B$ with adjacency matrix $A$ . Then, for any $\ell\in{\mathbb{N}}$ ,

(B^{\ell})_{zy}\neq 0\qquad\mbox{ if and only if }\qquad(A^{\ell})_{zy}\neq 0% \qquad(y,z\in X).

Proof. Immediate from the definition of the underlying digraph and the underlying weighted digraph of $B$ .

2.6 A vector space of all polynomials in a normal nonnegative matrix

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal nonnegative matrix. By Theorem 2.4, $B$ has $|X|$ linearly independent eigenvectors ${\cal U}=\{u_{1},u_{2},...,u_{|X|}\}$ which form an orthonormal basis for $\mathbb{C}^{|X|}$ . Let $V_{i}$ denote the eigenspace $V_{i}=\ker(B-\lambda_{i}I)$ and $\dim(V_{i})=m_{i},$ for $0\leq i\leq d$ . For every vector $u_{i}\in{{\cal U}}$ there exists exactly one eigenspace $V_{j}$ such that $u_{i}\in V_{j}$ , and since $V_{i}\cap V_{j}=\{{\boldsymbol{0}}\}$ for $i\not=j$ , we can partition the set ${\cal U}$ into sets ${\cal U}_{0},$ ${\cal U}_{1},$ …, ${\cal U}_{d}$ such that

{\cal U}_{i}\mbox{ is a basis for }V_{i},\qquad{\cal U}={\cal U}_{0}\cup{\cal U% }_{1}\cup\ldots\cup{\cal U}_{d}\qquad\mbox{ and }\qquad{\cal U}_{i}\cap{\cal U% }_{j}=\emptyset.

Note that

{\mathbb{C}}^{|X|}=V_{0}\oplus V_{1}\oplus\cdots\oplus V_{d}\quad(\mbox{% orthogonal direct sum})

and

m_{0}+m_{1}+\cdots+m_{d}=|X|.

(6)

Definition 2.10

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal nonnegative matrix. For each eigenvalue $\lambda_{i}$ $(0\leq i\leq d)$ of $B$ , let $U_{i}$ denote the matrix whose columns form an orthonormal basis of its eigenspace $V_{i}=\ker(B-\lambda_{i}I)\subseteq{\mathbb{C}}^{|X|}$ . The primitive idempotents of $B$ are matrices $E_{i}$ defined in the following way:

E_{i}:=U_{i}\overline{U_{i}}^{\top}\qquad(0\leq i\leq d).

Proposition 2.11

With reference to Definition 2.10, let ${\mathcal{B}}=\{p(B)\mid p\in{\mathbb{C}}[x]\}$ denote the vector space over ${\mathbb{C}}$ of all polynomials in $B$ . Then, the following hold.

(i)

Any power of $B$ can be expressed as a linear combination of the idempotents $E_{i}$ $(0\leq i\leq d)$ , i.e.,

B^{h}=\sum_{i=0}^{d}\lambda_{i}^{h}E_{i}\qquad(h\in{\mathbb{N}}).

(ii)

$\{E_{0},E_{1},\ldots,E_{d}\}$ is an orthogonal basis of ${\mathcal{B}}$ .
(iii)

$\{I,B,B^{2},\ldots,B^{d}\}$ is a basis of ${\mathcal{B}}$ .
(iv)

$\overline{B}^{\top}=B^{\top}=p(B)$ for some polynomial $p\in{\mathbb{C}}[t]$ .

Proof. (i) With respect to Definition 2.10, abbreviate $m_{i}=m(\lambda_{i})$ $(0\leq i\leq d)$ . Pick $i$ $(0\leq i\leq d)$ , and note that $BU_{i}=\lambda_{i}U_{i}.$ If $U=[U_{1}|U_{2}|\ldots|U_{d}]$ , then

B=U\Lambda\overline{U}^{\top},\qquad\mbox{ where }\qquad\Lambda=\left[\begin{% matrix}\lambda_{0}I_{m_{0}}&0&\ldots&0\\ 0&\lambda_{1}I_{m_{1}}&\ldots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\ldots&\lambda_{d}I_{m_{d}}\end{matrix}\right].

Now, it is routine to see that

	$\displaystyle B$	$\displaystyle=U\Lambda\overline{U}^{\top}=[U_{0}\|U_{1}\|\ldots\|U_{d}]\left[% \begin{matrix}\lambda_{0}I_{m_{0}}&0&\cdots&0\\ 0&\lambda_{1}I_{m_{1}}&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\ldots&\lambda_{d}I_{m_{d}}\\ \end{matrix}\right]\left[\begin{matrix}\underline{\overline{U_{0}}^{\top}}\\ \underline{\overline{U_{1}}^{\top}}\\ \underline{\,\,\,\vdots\,\,\,}\\ \overline{U_{d}}^{\top}\\ \end{matrix}\right]$
		$\displaystyle=[\lambda_{0}U_{0}\|\lambda_{1}U_{1}\|\cdots\|\lambda_{d}U_{d}]\left% [\begin{matrix}\underline{\overline{U_{0}}^{\top}}\\ \underline{\overline{U_{1}}^{\top}}\\ \underline{\,\,\,\vdots\,\,\,}\\ \overline{U_{d}}^{\top}\\ \end{matrix}\right]$
		$\displaystyle=\lambda_{0}U_{0}\overline{U_{0}}^{\top}+\lambda_{1}U_{1}% \overline{U_{1}}^{\top}+\cdots+\lambda_{d}U_{d}\overline{U_{d}}^{\top}$
		$\displaystyle=\underbrace{\lambda_{0}}_{\in{\mathbb{C}}}E_{0}+\underbrace{% \lambda_{1}}_{\in{\mathbb{C}}}E_{1}+\cdots+\underbrace{\lambda_{d}}_{\in{% \mathbb{C}}}E_{d}\qquad(\mbox{where }E_{i}:=U_{i}\overline{U_{i}}^{\top}).$

Since $E_{i}E_{j}=\delta_{ij}E_{i}$ , $\overline{E_{i}}^{\top}=E_{i}$ and $\operatorname{trace}(E_{i})=m_{i}$ $(0\leq i\leq d)$ , for any polynomial $p\in{\mathbb{C}}[t]$ we have

\displaystyle p(B)

\displaystyle=\underbrace{p(\lambda_{0})}_{\in{\mathbb{C}}}E_{0}+\cdots+% \underbrace{p(\lambda_{d})}_{\in{\mathbb{C}}}E_{d}.

The result follows.

(ii)–(iv) Routine. For (ii) and (iii), see, for example, [34, Chapter 2]. For claim (iv), see, for example, [12, Theorem 1], or note that $\overline{E_{i}}^{\top}=E_{i}$ .

2.7 Commutative association schemes

In Section 1, we have already provided the definition of a commutative $d$ -class association scheme ${\mathfrak{X}}=\{X,\{R_{i}\}_{i=0}^{d}\}$ together with definitions of relations $\{R_{i}\}_{i=0}^{d}$ , relation matrices $\{B_{i}\}_{i=0}^{d}$ and Bose–Mesner algebra ${\mathcal{M}}$ of ${\mathfrak{X}}$ . The meaning of a “matrix generates ${\mathfrak{X}}$ ” has been given in Section 1 as well. Note that relation matrices $\{B_{i}\}_{i=0}^{d}$ form a standard basis of the Bose-Mesner algebra ${\mathcal{M}}$ . We say that the relation $R_{i}$ generates the association scheme ${\mathfrak{X}}$ if every element of the Bose–Mesner algebra ${\mathcal{M}}$ of ${\mathfrak{X}}$ can be writen as a polynomial in $B_{i}$ , where $B_{i}$ is adjacency matrix of the (di)graph $(X,R_{i})$ . We say that the association scheme ${\mathfrak{X}}$ is $P$ -polynomial with respect to $B_{1}$ , if it is generated by $B_{1}$ and there exists an ordering $(B_{0},B_{1},\ldots,B_{d})$ and polynomials $p_{j}(t)$ of degree $j$ , such that $B_{j}=p_{j}(B_{1})$ $(0\leq j\leq d)$ . It is well known that if $(B_{0},B_{1},\ldots,B_{d})$ is a $P$ -polynomial ordering of ${\mathfrak{X}}$ then $\Gamma=\Gamma(B_{1})$ is a distance-regular (di)graph. Recall that the association scheme ${\mathfrak{X}}$ is polynomial (with respect to $R_{i}$ ) if it is generated by a relation $R_{i}$ for some $i$ (we recommend articles [32, 43] for interesting results in that direction).

3 The Hoffman polynomial of a nonnegative matrix

In this section, we prove Theorem 1.1. Our proof is within the lines of [27, Theorem 1]. Theorem 1.1 states: For a nonnegative matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ there exists a polynomial $p\in{\mathbb{C}}[t]$ such that $p(B)=J$ if and only if $B$ is a $\lambda$ -doubly stochastic irreducible matrix. Furthermore, it states that the unique polynomial of smallest degree satisfying $p(B)=J$ is $h(t)=\frac{|X|}{q(\lambda)}q(t)$ , where $q(\lambda)\neq 0$ and $(t-\lambda)q(t)$ is the minimal polynomial of $B$ .

Hoffman polynomial is well-known polynomial in algebraic graph theory and can be found in many textbooks (see, for example, [3, 5, 8, 14, 39, 40]). There has been a great deal of work following this concept, for example: polynomials that sends a nonnegative irreducible matrix to a positive rank one matrix [41], some Hoffman-type identities for the class of harmonic and semiharmonic graphs [18], Hoffman identities of non-regular graphs through the use of the Laplacian [37], Hoffman identities by means of main eigenvalues [29], Hoffman polynomial of the tensor product of a cycle [11], Hoffman polynomial of cycle prefix digraphs [13], Hoffmn polynomials of some more general regular strongly connected digraphs [42].

Proof of Theorem 1.1. $(\Leftarrow)$ Assume that $B$ is a $\lambda$ -doubly stochastic irreducible matrix. We use this assumption to show that there exists a polynomial $p\in{\mathbb{C}}[t]$ such that $p(B)=J$ . Note that, by assumption, $B\operatorname{\boldsymbol{j}}=\lambda\operatorname{\boldsymbol{j}}=B^{\top}% \operatorname{\boldsymbol{j}}$ .

A square matrix is stochastic if all of its entries are nonnegative, and the entries of each column sum to $1$ . From, for example, [30, Subsection 5.6], if $M$ is a stochastic matrix, then $1$ is an eigenvalue of $M$ ; and if $\theta$ is a (real or complex) eigenvalue of $M$ , then $|\theta|\leq 1$ . In our case, we have that $\lambda^{-1}B$ is a stochastic matrix. It is routine to show that $\operatorname{eig}(B)=\{\lambda,\lambda_{1},\ldots,\lambda_{d}\}$ are eigenvalues of $B$ if and only if $\lambda^{-1}\operatorname{eig}(B)$ are eigenvalues of $\lambda^{-1}B$ . It follows that $\lambda=\max_{\theta\in\operatorname{eig}(B)}|\theta|$ . By Theorem 2.1, the algebraic multiplicity of $\lambda$ is equal to $1$ and consequently

the geometric multiplicity of

\lambda

is equal to

1

(7)

also. This implies that the minimal polynomial of $B$ is in the following form $m(t)=(t-\lambda)s(t)$ for some polynomial $s(t)\in{\mathbb{C}}[t]$ , where $s(\lambda)\neq 0$ . Since $(B-\lambda I)s(B)=\operatorname{\boldsymbol{O}}$ , for any $v\in{\mathbb{C}}^{|X|}$ we have $(B-\lambda I)s(B)v={\boldsymbol{0}}$ , which yields $s(B)v\in\ker(B-\lambda I)$ . Thus,

\mbox{for every $v\in{\mathbb{C}}^{|X|}$ there exists $\alpha_{v}\in{\mathbb{C% }}$ such that $s(B)v=\alpha_{v}\operatorname{\boldsymbol{j}}$}.

(8)

Next we consider when equation (8) is possible. For the moment, assume that $\langle v,\operatorname{\boldsymbol{j}}\rangle=0$ , where $\langle\cdot,\cdot\rangle$ stands for standard Hermitian inner product $\langle u,v\rangle=u\overline{v}^{\top}$ . For every $\ell\in{\mathbb{N}}$ ,

	$\displaystyle\langle B^{\ell}v,\operatorname{\boldsymbol{j}}\rangle$	$\displaystyle=\langle v,\overline{B^{\ell}}^{\top}\operatorname{\boldsymbol{j}}\rangle$
		$\displaystyle=\lambda^{\ell}\langle v,\operatorname{\boldsymbol{j}}\rangle$
		$\displaystyle=0.$

This yields that $\langle s(B)v,j\rangle=0$ , and by (8), $\langle\alpha_{v}\operatorname{\boldsymbol{j}},\operatorname{\boldsymbol{j}}% \rangle=0=\alpha_{v}\|\operatorname{\boldsymbol{j}}\|^{2}$ , which implies $\alpha_{v}=0$ . Thus, for every vector $v\in{\mathbb{C}}^{|X|}$ for which $\langle v,\operatorname{\boldsymbol{j}}\rangle=0$ , we also have $s(B)v={\boldsymbol{0}}$ . Since $s(B)\operatorname{\boldsymbol{j}}=s(\lambda)\operatorname{\boldsymbol{j}}$ (as well as $J\operatorname{\boldsymbol{j}}=|X|\operatorname{\boldsymbol{j}}$ ), and ${\mathbb{C}}^{|X|}=\langle\operatorname{\boldsymbol{j}}\rangle\oplus\langle% \operatorname{\boldsymbol{j}}\rangle^{\bot}$ (orthogonal direct sum), we can conclude that the polynomial

h(t)=\frac{|X|}{s(\lambda)}s(t)

has the property that $h(B)=J$ .

$(\Rightarrow)$ Assume now that for a nonnegative matrix $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ , there exists a polynomial $p\in{\mathbb{C}}[t]$ such that $p(B)=J$ . We use this assumption to show that $B$ is a $\lambda$ -doubly stochastic irreducible matrix, for some $\lambda\in{\mathbb{R}}$ .

Let $y,z\in X$ denote two arbitrary elements of $X$ , and $A$ denote the adjacency matrix of the underlying digraph of $B$ . Since $p(B)=J$ , there exists $\ell$ such that $(B^{\ell})_{zy}\neq 0$ , which is true if and only if $(A^{\ell})_{zy}\neq 0$ (see Lemma 2.9). Thus the digraph $\Gamma=\Gamma(A)$ (digraph with adjacency matrix $A$ ) is strongly connected. Note that $\Gamma$ is strongly connected if and only if $A$ is an irreducible matrix (see Lemma 2.2). Therefore, our non-negative matrix $B$ is also irreducible (see Lemma 2.7).

To prove that $B$ is $\lambda$ -doubly stochastic matrix, for some $\lambda\in{\mathbb{R}}$ , we pick $x\in X$ , and we consider the out-edge-weight-sum $\Sigma_{1}^{\rightarrow}(x)$ of the vertex $x$ , as well as the in-edge-weight-sum $\Sigma_{1}^{\leftarrow}(x)$ of the same vertex:

\Sigma_{1}^{\rightarrow}(x)=\sum_{z\in X}(B)_{xz}=(B\operatorname{\boldsymbol{% j}})_{x},

\Sigma_{1}^{\leftarrow}(x)=\sum_{z\in X}(B)_{zx}=(B^{\top}\operatorname{% \boldsymbol{j}})_{x}.

Since $J=p(B)$ , we have $JB=BJ$ , and with it, for any $y\in X$ , we have

	$\displaystyle\theta$	$\displaystyle=\Sigma_{1}^{\rightarrow}(x)=(B\operatorname{\boldsymbol{j}})_{x}$
		$\displaystyle=\sum_{z\in X}(B)_{xz}(J)_{zy}=(BJ)_{xy}$
		$\displaystyle=(JB)_{xy}=\sum_{z\in X}(J)_{xz}(B)_{zy}$
		$\displaystyle=\sum_{z\in X}(B^{\top})_{yz}(J)_{zx}=(B^{\top}\operatorname{% \boldsymbol{j}})_{y}$
		$\displaystyle=\Sigma_{1}^{\leftarrow}(y).$

From above, we have that every $y\in X$ (including our fixed $x$ ) has the same in-edge-weight-sum, i.e., $\Sigma_{1}^{\leftarrow}(y)=\theta$ $(\forall y\in X)$ . Again, considering $JB=BJ$ , for any $y\in X$ , we have

	$\displaystyle\theta$	$\displaystyle=\Sigma_{1}^{\leftarrow}(x)=(B^{\top}\operatorname{\boldsymbol{j}% })_{x}$
		$\displaystyle=\sum_{z\in X}(B^{\top})_{xz}(J)_{zy}=\sum_{z\in X}(J)_{yz}(B)_{zx}$
		$\displaystyle=(JB)_{yx}=(BJ)_{yx}$
		$\displaystyle=\sum_{z\in X}(B)_{yz}(J)_{zx}=(B\operatorname{\boldsymbol{j}})_{y}$
		$\displaystyle=\Sigma_{1}^{\rightarrow}(y).$

Thus, every $y\in X$ has the same out-edge-weight-sum $\theta$ also, i.e., $\Sigma_{1}^{\rightarrow}(y)=\theta$ $(\forall y\in X)$ . This conclude the proof that $B$ is a $\lambda$ -doubly stochastic irreducible matrix, for some $\lambda\in{\mathbb{R}}$ .

It is left to prove that the unique polynomial of smallest degree satisfying $p(B)=J$ is $h(t)=\frac{|X|}{q(\lambda)}q(t)$ where $q(\lambda)\neq 0$ and $(t-\lambda)q(t)$ is the minimal polynomial of $B$ .

We had already shown that $h(B)=J$ for $h(t)=\frac{|X|}{q(\lambda)}q(t)$ , where $q(\lambda)\neq 0$ and $m(t)=(t-\lambda)q(t)$ is the minimal polynomial of $B$ . Assume that degree of $q(t)$ is $\ell$ , and note that $\ell$ is smaller than the degree of the minimal polynomial $m(t)$ of $B$ . We first prove that $h(t)$ is the unique polynomial of degree $\ell$ such that $h(B)=J$ ; our proof is by a contradiction. Assume that $r(t)$ is a polynomial of degree $m$ (in particular for this case $m=\ell$ ) such that $r(B)=J$ , and that $r(t)\neq h(t)$ . We have $(r-h)(B)=\operatorname{\boldsymbol{O}}$ , and since degree of $r(t)-h(t)$ is less or equal to $\ell$ , this is possible if and only if $r(t)-h(t)=0$ , a contradiction. Thus, $h(t)$ is the unique such polynomial of degree $\ell$ . In a similar way we show that there are no polynomials $r(t)$ of degree smaller than $\ell$ that satisfy $r(B)=J$ . The result follows.

Corollary 3.1

Assume that $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ is a normal $\lambda$ -doubly stochastic irreducible matrix, and let $\operatorname{eig}(B)=\{\lambda,\lambda_{1},\ldots,\lambda_{d}\}$ denote the set of distinct eigenvalues of $B$ . Then, the Hoffman polynomial of $B$ is

h(t)=\frac{|X|}{\pi_{0}}\prod_{i=1}^{d}(t-\lambda_{i}),

where $\pi_{0}=\prod_{i=1}^{d}(\lambda-\lambda_{i})$ .

Proof. Define $\theta=\max\limits_{\mu\in\operatorname{eig}(B)}|\mu|$ . Since $B$ is a nonnegative irreducible matrix, by Theorem 2.1, $\theta\in\operatorname{eig}(B)$ , $\theta>0$ , the algebraic multiplicity of $\theta$ is $1$ , and there exists an eigenvector ${\boldsymbol{\nu}}$ with all positive entries (normalized in such a way that the smallest entry is equal to $1$ ), such that $B{\boldsymbol{\nu}}=\theta{\boldsymbol{\nu}}$ . On the other hand, since $B$ is $\lambda$ -doubly stochastic, we have $B\operatorname{\boldsymbol{j}}=\lambda\operatorname{\boldsymbol{j}}$ . By definition of $\theta$ we have $\lambda\leq\theta$ . Next, we prove that $\theta\leq\lambda$ . Let ${\boldsymbol{\nu}}=(\nu_{x},\ldots,\nu_{w})^{\top}$ and define $\nu_{y}=\max\limits_{z\in X}\{\nu_{z}\}$ . We have

\theta\nu_{y}=(\theta{\boldsymbol{\nu}})_{y}=(B{\boldsymbol{\nu}})_{y}=\sum_{z% \in X}(B)_{yz}\nu_{z}\leq\nu_{y}\sum_{z\in X}(B)_{yz}=\nu_{y}\lambda.

As consequence, $\theta=\lambda$ and ${\boldsymbol{\nu}}=\operatorname{\boldsymbol{j}}$ .

From the above observation, now we can let $m(t)=(t-\lambda)q(t)$ denote the minimal polynomial of $B$ . In the end, since $B$ is a normal matrix, $m(t)=(t-\lambda)\prod_{i=1}^{d}(t-\lambda_{i})$ (see, for example, [31, Section 7.11]). The result follows from Theorem 1.1.

4 On predistance polynomials

In this section, we define predistance polynomials, the set of orthogonal polynomials that we use for the rest of the paper. The term “predistance polynomial” is taken from the theory of distance-regular graphs (see, for example, [15, 16, 17, 20, 38]).

We define an inner product on $\mbox{\rm Mat}_{X}({\mathbb{C}})$ in the following way:

\langle B,C\rangle=\frac{1}{|X|}\operatorname{trace}(B\overline{C}^{\top})% \qquad(B,C\in\mbox{\rm Mat}_{X}({\mathbb{C}})).

(9)

Let

\|C\|^{2}=\langle C,C\rangle\qquad\mbox{ for all $C\in\mbox{\rm Mat}_{X}({% \mathbb{C}})$,}

and note that for any $R,S\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ ,

\langle R,S\rangle=\frac{1}{|X|}\sum\limits_{u\in X}(R\overline{S}^{\top})_{uu% }=\frac{1}{|X|}\sum\limits_{u\in X}\sum\limits_{v\in X}(R)_{uv}(\overline{S})_% {uv}=\frac{1}{|X|}\sum\limits_{u,v\in X}(R\circ\overline{S})_{uv},

(10)

(where $\circ$ is the elementwise-Hadamard product).

Now, let $B\in\mbox{\rm Mat}_{X}({\mathbb{C}})$ denote a normal matrix with $d+1$ distinct eigenvalues. Let $\mathbb{C}_{d}[t]=\{a_{0}+a_{1}t+\ldots+a_{d}t^{d}\mid a_{i}\in\mathbb{C},\,0% \leq i\leq d\}$ denote the ring of all polynomials of degree at most $d$ with coefficients in $\mathbb{C}$ . For every $p,q\in{\mathbb{C}}_{d}[t]$ we define

\langle p,q\rangle={1\over|X|}\operatorname{trace}(p(B)\overline{q(B)}^{\top}),

(11)

and let $\|p\|^{2}=\langle p,p\rangle$ . We also have that (11) is an inner product in $\mathbb{C}_{d}[t]$ .

Lemma 4.1

Proof. Since $B$ is diagonalizable (see Theorem 2.4), the minimal polynomial $m(t)$ of $B$ is

m(t)=(t-\lambda)(t-\lambda_{1})\cdot\cdots\cdot(t-\lambda_{d})

(12)

(see, for example, [31, Subchapter 7.11]). As $B$ has real entries, its characteristic polynomial $c(t)=\det(B-tI)$ will only have real coefficients. The complex roots of $c(t)=0$ come in conjugate complex pairs, yielding $m(t)\in{\mathbb{R}}_{d}[t]$ from (12).

Lemma 4.2

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal real matrix with $d+1$ distinct eigenvalues $\{\lambda,\lambda_{1},\ldots,\lambda_{d}\}\subseteq{\mathbb{C}}$ (real or complex), and let $\mathbb{R}_{d}[t]=\{a_{0}+a_{1}t+\ldots+a_{d}t^{d}\mid a_{i}\in\mathbb{R},\,0% \leq i\leq d\}$ denote the ring of all polynomials of degree at most $d$ with coefficients in $\mathbb{R}$ . For every $p,q\in{\mathbb{R}}_{d}[t]$ we define an inner product $\langle p,q\rangle$ on ${\mathbb{R}}_{d}[x]$ as in (11). If $\lambda\neq 0$ , then there exists an orthogonal system of polynomials $\{q_{0}(t),q_{1}(t),\ldots,q_{d}(t)\}\subseteq{\mathbb{R}}_{d}[t]$ such that every $q_{i}(t)$ $(0\leq i\leq d)$ has degree $i$ and $q_{i}(\lambda)\neq 0$ $(0\leq i\leq d)$ (i.e., $\lambda$ is not a root of $q_{i}(t)$ $(0\leq i\leq d)$ ).

Proof. Our proof is by construction. Using the inner product (11), we apply the Gram–Schmidt orthogonalization algorithm to the set $\{s_{0}(t)=1,s_{1}(t)=t,\ldots,s_{d}(t)=t^{d}\}$ , modifying it in such a way to meet our conditions.

Note that, since $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ and $p,q\in{\mathbb{R}}_{d}[t]$ , by (11) we have $\langle p,q\rangle\in{\mathbb{R}}$ . To construct the $q_{i}$ ’s, we use mathematical induction on $i$ .

Base of induction. Since $s_{0}(t)=1$ is a constant function, $\lambda$ is not a root of $s_{0}(t)$ . So, we can define $q_{0}(t):=s_{0}(t)$ . Next, define $r_{1}(t)$ in the following way

	$\displaystyle r_{1}(t)$	$\displaystyle:=s_{1}(t)-\sum_{\ell=0}^{0}\frac{\langle q_{\ell},s_{1}\rangle}{% \\|q_{\ell}\\|^{2}}q_{\ell}(t)$
		$\displaystyle=s_{1}(t)-\frac{\langle q_{0},s_{1}\rangle}{\\|q_{0}\\|^{2}}q_{0}(t).$

For the polynomial $r_{1}(t)$ two cases are possible: either $r_{1}(\lambda)\neq 0$ or $r_{1}(\lambda)=0$ . In the first case (i.e., $r_{1}(\lambda)\neq 0$ ), we let $q_{1}(t):=r_{1}(t)$ . In the second case (i.e., $r_{1}(\lambda)=0$ ), we have $\lambda=s_{1}(\lambda)=\frac{\langle q_{0},s_{1}\rangle}{\|q_{0}\|^{2}}q_{0}(\lambda)$ , which yields, for example, $2\lambda\neq\frac{\langle q_{0},s_{1}\rangle}{\|q_{0}\|^{2}}q_{0}(\lambda)$ . Thus, we define $q_{1}(t)$ in the following way

q_{1}(t):=2s_{1}(t)-\frac{\langle q_{0},s_{1}\rangle}{\|q_{0}\|^{2}}q_{0}(t).

In both cases, $q_{1}(\lambda)\neq 0$ and $q_{1}(t)$ is a polynomial of degree $1$ .

Induction step. Assume that we found an orthogonal set of polynomials $\{q_{0}(t),q_{1}(t),\ldots,q_{j-1}(t)\}\subseteq{\mathbb{R}}_{d}[t]$ such that $q_{i}(\lambda)\neq 0$ ( $0\leq i\leq j-1$ , $j\geq 2$ ), i.e., such that $\lambda$ is not a root of $q_{i}(t)$ (and assume that each $q_{i}$ has degree $i$ ). Now, we define $r_{j}(t)$ in the following way

\displaystyle r_{j}(t)

\displaystyle:=s_{j}(t)-\sum_{\ell=0}^{j-1}\frac{\langle q_{\ell},s_{j}\rangle% }{\|q_{\ell}\|^{2}}q_{\ell}(t)

For the polynomial $r_{j}(t)$ two cases are possible: either $r_{j}(\lambda)\neq 0$ or $r_{j}(\lambda)=0$ . In the first case (i.e., $r_{j}(\lambda)\neq 0$ ), we let $q_{j}(t):=r_{j}(t)$ . In the second case (i.e., $r_{j}(\lambda)=0$ ), we have

\lambda^{j}=s_{j}(\lambda)=\sum_{\ell=0}^{j-1}\frac{\langle q_{\ell},s_{j}% \rangle}{\|q_{\ell}\|^{2}}q_{\ell}(\lambda),

which yields, for example, $2\lambda^{j}\neq\sum_{\ell=0}^{j-1}\frac{\langle q_{\ell},s_{j}\rangle}{\|q_{% \ell}\|^{2}}q_{\ell}(\lambda)$ . Thus, we define $q_{j}(t)$ in the following way

q_{j}(t):=2s_{j}(t)-\sum_{\ell=0}^{j-1}\frac{\langle q_{\ell},s_{j}\rangle}{\|% q_{\ell}\|^{2}}q_{\ell}(t)

In both cases, $q_{j}(\lambda)\neq 0$ and $q_{j}(t)$ is a polynomial of degree $j$ (by construction).

Lemma 4.3

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal irreducible matrix with $d+1$ distinct eigenvalues $\{\lambda,\lambda_{1},\ldots,\lambda_{d}\}$ , such that $B\operatorname{\boldsymbol{j}}=\lambda\operatorname{\boldsymbol{j}}$ $(\lambda\neq 0)$ . For every $p,q\in{\mathbb{R}}_{d}[t]$ we define an inner product $\langle p,q\rangle$ on ${\mathbb{R}}_{d}[x]$ as in (11), and let $\|p\|^{2}=\langle p,p\rangle$ . Then, there exists a set of orthogonal polynomials with respect to (11), such that $\deg(p_{i})=i$ $(0\leq i\leq d)$ and they are normalized in such a way that $\|p_{i}\|^{2}=p_{i}(\lambda)\in{\mathbb{R}}^{+}$ $(0\leq i\leq d)$ .

Proof. Our proof is by construction. By Lemma 4.2, we always can find an orthogonal system of polynomials $\{q_{0}(t),q_{1}(t),\ldots,q_{d}(t)\}\subseteq{\mathbb{R}}_{d}[t]$ such that $q_{i}(\lambda)\neq 0$ $(0\leq i\leq d)$ (i.e., that $\lambda$ is not a root of any $q_{i}(t)$ ) and that each $q_{i}$ is of degree $i$ . Next, we first define polynomial $r_{i}(t)$ $(0\leq i\leq d)$ on the following way

\displaystyle r_{i}(t)

\displaystyle=\frac{1}{\|q_{i}\|}q_{i}(t)\qquad(0\leq i\leq d).

Note that $\{r_{0}(t),r_{1}(t),\ldots,r_{d}(t)\}\subseteq{\mathbb{R}}_{d}[t]$ orthonormal system of polynomials, and that we have $\operatorname{deg}r_{i}=i$ , $\|r_{i}\|=1$ $(0\leq i\leq d)$ , and by our choice of the $q_{i}$ ’s, we also have $r_{i}(\lambda)\neq 0$ $(0\leq i\leq d)$ . For arbitrary nonzero real numbers $\alpha_{0},\alpha_{1},\ldots,\alpha_{d}$ , the set $\{\alpha_{0}r_{0},\alpha_{1}r_{1},\ldots,\alpha_{d}r_{r}\}$ is again an orthogonal set. For any $r_{i}(t)\in{\mathbb{R}}_{d}[t]$ $(0\leq i\leq d)$ , define $c:=r_{i}(\lambda)$ and $p_{i}(t):=cr_{i}(t)$ (note that $c\neq 0$ by our construction). We have

\|p_{i}\|^{2}=\langle cr_{i},cr_{i}\rangle=c^{2}\underbrace{\|r_{i}\|}_{=1}=c% \cdot c=cr_{i}(\lambda)=p_{i}(\lambda).

Thus, $\|p_{i}\|^{2}=p_{i}(\lambda)$ $(0\leq i\leq d)$ . Note that the set $\{p_{0}(t),p_{1}(t),\ldots,p_{d}(t)\}$ is an orthogonal system and $p_{0}(t)=1$ .

Definition 4.4

Let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal irreducible matrix with $d+1$ distinct eigenvalues $\{\lambda,\lambda_{1},\ldots,\lambda_{d}\}$ , such that $B\operatorname{\boldsymbol{j}}=\lambda\operatorname{\boldsymbol{j}}$ . For every $p,q\in{\mathbb{R}}_{d}[t]$ we define $\langle p,q\rangle$ as in (11) (i.e., $\langle p,q\rangle={1\over|X|}\operatorname{trace}(p(B)\overline{q(B)}^{\top})$ ), and let $\|p\|^{2}=\langle p,p\rangle$ . With reference to Lemma 4.3, the set of so-called predistance polynomials $\{p_{0},p_{1},\ldots,p_{d}\}\subseteq{\mathbb{R}}_{d}[t]$ , is a set of orthogonal polynomials with respect to the inner product (11) (defined on the vector space ${\mathbb{R}}_{d}[t]$ ), such that $\deg(p_{i})=i$ $(0\leq i\leq d)$ and they are normalized in such a way that $\|p_{i}\|^{2}=p_{i}(\lambda)$ . Note that $p_{i}(\lambda)\in{\mathbb{R}}^{+}$ $(0\leq i\leq d)$ .

Lemma 4.5

\sum_{i=0}^{d}p_{i}(B)=J.

Proof. Our proof uses the same technique that implicitly can be found, for example, in [9, 10, 23, 26].

Let $h(t)$ denote the Hoffman polynomial, i.e., $h(B)=J$ (see Theorem 1.1). If we denote by $\{\lambda,\lambda_{1},\lambda_{2},\ldots,\lambda_{d}\}$ the set of the distinct eigenvalues of $B$ , then $h(\lambda)=|X|$ and $h(\mu)=0$ for $\mu\in\{\lambda_{1},\lambda_{2},...,\lambda_{d}\}$ (see Theorem 1.1). Since $B$ is a normal matrix, there exists a unitary matrix $U$ such that $B=U\Lambda\overline{U}^{\top}$ , where $\Lambda$ is diagonal matrix in which the diagonal entries are the eigenvalues of $B$ . Let $\operatorname{diag}(\Lambda)$ denote the list of all diagonal entries of $\Lambda$ . Then, we have

	$\displaystyle\langle h,p_{j}\rangle$	$\displaystyle={1\over\|X\|}\operatorname{trace}(h(B)\overline{p_{j}(B)}^{\top})$
		$\displaystyle={1\over\|X\|}\operatorname{trace}(Uh(\Lambda)\overline{p_{j}(% \Lambda)}^{\top}\overline{U}^{\top})$
		$\displaystyle={1\over\|X\|}\operatorname{trace}(h(\Lambda)\overline{p_{j}(% \Lambda)}^{\top})$
		$\displaystyle={1\over\|X\|}\operatorname{trace}(h(\Lambda)\overline{p_{j}(% \Lambda)})$
		$\displaystyle={1\over\|X\|}\sum_{\mu\in\operatorname{diag}(\Lambda)}h(\mu)% \overline{p_{j}(\mu)}$
		$\displaystyle={1\over\|X\|}\cdot h(\lambda)\cdot{p_{j}(\lambda)}$

which yields

\langle h,p_{j}\rangle=\|p_{j}\|^{2}\qquad(0\leq j\leq d).

(13)

On the other hand, the Fourier expansion of $h$ is

h=\frac{\langle h,p_{0}\rangle}{\|p_{0}\|^{2}}p_{0}+\frac{\langle h,p_{1}% \rangle}{\|p_{1}\|^{2}}p_{1}+\cdots+\frac{\langle h,p_{d}\rangle}{\|p_{d}\|^{2% }}p_{d}.

(14)

By (13) and (14), the result follows.

Proposition 4.6

With reference to Definition 4.4, let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal $\lambda$ -doubly stochastic irreducible matrix with $d+1$ distinct eigenvalues. Let $\Gamma=\Gamma(A)$ denote the underlying digraph of $B$ , with adjacency matrix $A$ , diameter $D$ , and let $A_{D}$ denote the distance- $D$ matrix of $\Gamma$ . Assume that $d=D$ , and let $\{p_{0},p_{1},\ldots,p_{D}\}$ denote the set of the predistance polynomials. If there exists a polynomial $q(t)\in{\mathbb{R}}_{D}[t]$ such that $A_{D}=q(B)$ , then

q(t)=p_{D}(t).

Proof. By our assumption, $B\operatorname{\boldsymbol{j}}=\lambda\operatorname{\boldsymbol{j}}$ . We first show that $\|q\|^{2}=q(\lambda)$ . Note that $A_{D}\operatorname{\boldsymbol{j}}=q(B)\operatorname{\boldsymbol{j}}=q(\lambda% )\operatorname{\boldsymbol{j}}$ , and from (10)

	$\displaystyle\\|q\\|^{2}$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{q(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}(A_{D})_{xy}=\frac{1}{\|X\|}\sum_{x\in X% }(A_{D}\operatorname{\boldsymbol{j}})_{x}$
		$\displaystyle=\frac{1}{\|X\|}\cdot\|X\|\cdot q(\lambda)$
		$\displaystyle=q(\lambda).$

Recall that $\{p_{0},p_{1},\ldots,p_{D}\}$ is a set of orthogonal polynomials such that $\deg(p_{i})=i$ and $\|p_{i}\|^{2}=p_{i}(\lambda)$ $(0\leq i\leq D)$ . Next note that

	$\displaystyle\langle q,p_{i}\rangle$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{p_{i}(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}\left(A_{D}\circ\overline{p_{i}(B)}% \right)_{xy}$
		$\displaystyle=\left\{\begin{array}[]{ll}0,&\mbox{ if $0\leq i\leq D-1$},\\ \frac{1}{\|X\|}\displaystyle\sum_{x,y\in X}\left(A_{D}\circ\overline{p_{D}(B)}% \right)_{xy},&\mbox{ if $i=D$}\end{array}\right.(\mbox{by Lemmas~{}\ref{hB}, % \ref{hC}}).$

This yields that the Fourier expansion of $q=\sum_{i=0}^{D}\frac{\langle q,p_{i}\rangle}{\|p_{i}\|^{2}}p_{i}$ is equal to

q=\frac{\langle q,p_{D}\rangle}{\|p_{D}\|^{2}}p_{D}\qquad\mbox{and}\qquad% \langle q,p_{D}\rangle\neq 0

which implies $p_{D}=c\cdot q$ where $c=\frac{\|p_{D}\|^{2}}{\langle q,p_{D}\rangle}$ . To conclude, we show that $c=1$ :

	$\displaystyle q(\lambda)$	$\displaystyle=\frac{\langle q,p_{D}\rangle}{\\|p_{D}\\|^{2}}\underbrace{p_{D}(% \lambda)}_{=\\|p_{D}\\|^{2}}$
		$\displaystyle=\langle q,p_{D}\rangle=\langle q,cq\rangle$
		$\displaystyle=\overline{c}\langle q,q\rangle=\overline{c}\\|q\\|^{2}$
		$\displaystyle=\overline{c}q(\lambda).$

The result follows.

Lemma 4.7

With reference to Definition 4.4, let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal $\lambda$ -doubly stochastic irreducible matrix with $d+1$ distinct eigenvalues. Let $\Gamma=\Gamma(A)$ denote the underlying digraph of $B$ with diameter $D$ , and let $A_{D}$ denote the distance- $D$ matrix of $\Gamma$ . Assume that $D\geq 3$ . For any $x,y\in X$ , if $\partial(x,y)\leq D-2$ in $\Gamma$ , then

(A_{D}B^{\top})_{xy}=0.

Proof. For any $x,y\in X$ , we have

	$\displaystyle(A_{D}B^{\top})_{xy}$	$\displaystyle=\sum_{z\in X}(A_{D})_{xz}(B^{\top})_{zy}$
		$\displaystyle=\sum_{z\in\Gamma_{D}^{\rightarrow}(x)}(B)_{yz}.$

Our proof is by a contradiction. Assume that there exists $x,y\in X$ such that $\partial(x,y)\leq D-2$ , and $(A_{D}B^{\top})_{xy}\neq 0$ . This yields $\sum_{z\in\Gamma_{D}^{\rightarrow}(x)}(B)_{yz}\neq 0$ , i.e., there exists $z\in\Gamma^{\rightarrow}_{D}(x)$ such that $(B)_{yz}\neq 0$ , or equivalently $(A)_{yz}=1$ . Now consider the distance- $i$ partition $\{\Gamma^{\rightarrow}_{i}(x)\}_{i=0}^{D}$ of the vertex set $X$ (for our choice of $x\in X$ ). Since $\partial(x,y)\leq D-2$ and $(A)_{yz}=1$ , it follows that $\partial(x,z)\leq D-1$ , a contradiction with $z\in\Gamma^{\rightarrow}_{D}(x)$ . The result follows.

5 Case when $\boldsymbol{A_{D}}$ is polynomial in $\boldsymbol{B}$

In this section, we prove Theorem 1.2. For this purpose, we need Proposition 5.1 and Lemma 5.2.

Proposition 5.1

With reference to Definition 4.4, let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal $\lambda$ -doubly stochastic irreducible matrix with $d+1$ distinct eigenvalues. Let $\Gamma=\Gamma(A)$ denote the underlying digraph of $B$ , with adjacency matrix $A$ , diameter $D$ , and let $\{A_{0},A_{1},\ldots,A_{D}\}$ denote the distance- $i$ matrices of $\Gamma$ . Assume that $d=D$ , and let $\{p_{0},p_{1},\ldots,p_{D}\}$ denote the set of predistance polynomials. If $A_{D}=p_{D}(B)$ , $A_{D-1}=p_{D-1}(B),\ldots,A_{i+1}=p_{i+1}(B)$ , and if there exists a polynomial $q(t)\in{\mathbb{R}}_{i}[t]$ such that $A_{i}=q(B)$ , then

q(t)=p_{i}(t).

Proof. The proof is similar to the proof of Proposition 4.6. By assumption $B\operatorname{\boldsymbol{j}}=\lambda\operatorname{\boldsymbol{j}}$ . We first show that $\|q\|^{2}=q(\lambda)$ . Note that $A_{i}\operatorname{\boldsymbol{j}}=q(B)\operatorname{\boldsymbol{j}}=q(\lambda% )\operatorname{\boldsymbol{j}}$ , and from (10)

	$\displaystyle\\|q\\|^{2}$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{q(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}(A_{i})_{xy}$
		$\displaystyle=\frac{1}{\|X\|}\cdot\|X\|\cdot q(\lambda)$
		$\displaystyle=q(\lambda).$

Recall that $\{p_{0},p_{1},\ldots,p_{D}\}$ is a set of orthogonal polynomials such that $\deg(p_{j})=j$ and $\|p_{j}\|^{2}=p_{j}(\lambda)$ $(0\leq j\leq D)$ . Next note that

	$\displaystyle\langle q,p_{j}\rangle$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{p_{j}(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}\left(A_{i}\circ\overline{p_{j}(B)}% \right)_{xy}$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}\left(A_{i}\circ{p_{j}(B)}\right)_{xy}$
		$\displaystyle=\left\{\begin{array}[]{ll}0,&\mbox{ if $0\leq j\leq i-1$},\\ \displaystyle\frac{1}{\|X\|}\displaystyle\sum_{x,y\in X}\left(A_{i}\circ% \overline{p_{i}(B)}\right)_{xy},&\mbox{ if $j=i$,}\\ 0,&\mbox{ if $i+1\leq j\leq D$}\\ \end{array}\right.(\mbox{by Lemmas~{}\ref{hB}, \ref{hC}}).$

This yields that the Fourier expansion of $q=\sum_{j=0}^{D}\frac{\langle q,p_{j}\rangle}{\|p_{j}\|^{2}}p_{j}$ is equal to

q=\frac{\langle q,p_{i}\rangle}{\|p_{i}\|^{2}}p_{i}\qquad\mbox{and}\qquad% \langle q,p_{i}\rangle\neq 0

which implies $p_{i}=c\cdot q$ where $c=\frac{\|p_{i}\|^{2}}{\langle q,p_{i}\rangle}$ . To conclude, we show that $c=1$ :

	$\displaystyle q(\lambda)$	$\displaystyle=\frac{\langle q,p_{i}\rangle}{\\|p_{i}\\|^{2}}\underbrace{p_{i}(% \lambda)}_{=\\|p_{i}\\|^{2}}$
		$\displaystyle=\langle q,p_{i}\rangle=\langle q,cq\rangle$
		$\displaystyle=\overline{c}\langle q,q\rangle=\overline{c}\\|q\\|^{2}$
		$\displaystyle=\overline{c}q(\lambda).$

The result follows.

Lemma 5.2

(A_{D-j}B^{\top})_{xy}=0.

Proof. The proof is similar to the proof of Lemma 4.7. For any $x,y\in X$ , we have

	$\displaystyle(A_{D-j}B^{\top})_{xy}$	$\displaystyle=\sum_{z\in X}(A_{D-j})_{xz}(B^{\top})_{zy}$
		$\displaystyle=\sum_{z\in\Gamma_{D-j}^{\rightarrow}(x)}(B)_{yz}.$

Our proof is by a contradiction. Assume that there exists $x,y\in X$ such that $\partial(x,y)\leq D-j-2$ , and $(A_{D-j}B^{\top})_{xy}\neq 0$ . This yields $\sum_{z\in\Gamma_{D-j}^{\rightarrow}(x)}(B)_{yz}\neq 0$ , i.e., there exists $z\in\Gamma^{\rightarrow}_{D-j}(x)$ such that $(B)_{yz}\neq 0$ , or equivalently $(A)_{yz}=1$ . Now consider the distance- $i$ partition $\{\Gamma^{\rightarrow}_{i}(x)\}_{i=0}^{D}$ of the vertex set $X$ (for our choice of $x\in X$ ). Since $\partial(x,y)\leq D-j-2$ and $(A)_{yz}=1$ , we have $\partial(x,z)\leq D-j-1$ , a contradiction with $z\in\Gamma^{\rightarrow}_{D-j}(x)$ . The result follows.

Proposition 5.3

With reference to Definition 4.4, let $B\in\mbox{\rm Mat}_{X}({\mathbb{R}})$ denote a normal $\lambda$ -doubly stochastic irreducible matrix with $d+1$ distinct eigenvalues. Let $\Gamma=\Gamma(A)$ denote the underlying digraph of $B$ , with adjacency matrix $A$ , diameter $D$ , and let $A_{D}$ denote the distance- $D$ matrix of $\Gamma$ . Assume that $d=D$ , and let $\{p_{0},p_{1},\ldots,p_{D}\}$ denote the set of predistance polynomials. If $A_{D}$ is a polynomial in $B$ , then

A_{i}=p_{i}(B)\qquad(0\leq i\leq D).

Proof. For the moment, let $q_{m}(t)=\sum_{i=0}^{m}p_{i}(t)$ $(0\leq m\leq D)$ , and note that $\deg(q_{m})=m$ $(0\leq m\leq D)$ . Since $A_{D}$ is a polynomial in $B$ , by Proposition 4.6, $p_{D}(B)=A_{D}$ .

We first prove that $A_{D-1}=p_{D-1}(B)$ . If $D=2$ , the result follows (because $p_{0}(t)=1$ , by assumption $p_{2}(B)=A_{2}$ , and $J=\sum_{i=0}^{2}p_{i}(B)$ yields $p_{1}(B)=A$ ). Assume that $D\geq 3$ . Pick $x,y\in X$ such that $\partial(x,y)=D-1$ , and note that $(q_{D-2}(B))_{xy}=0$ . By Lemma 4.5, we have

$\displaystyle 1=(J)_{xy}$	$\displaystyle=(q_{D}(B))_{xy}$
	$\displaystyle=\left(q_{D-2}(B)+p_{D-1}(B)+p_{D}(B)\right)_{xy}$
	$\displaystyle=\underbrace{(q_{D-2}(B))_{xy}}_{=0}+(p_{D-1}(B))_{xy}+% \underbrace{(A_{D})_{xy}}_{=0}$
	$\displaystyle=(p_{D-1}(B))_{xy}.$	(15)

We can conclude that,

\partial(x,y)=D-1

, then

(p_{D-1}(B))_{xy}=1

(16)

Using (16), we show that $A_{D-1}$ is a polynomial in $B$ .

Since $A_{D}$ is a polynomial in $B$ , by Proposition 4.6, $p_{D}(B)=A_{D}$ . By Lemma 4.7, for any $x,y\in X$ ,

(A_{D}B^{\top})_{xy}=\left\{\begin{array}[]{ll}0,&\mbox{if }\partial(x,y)\leq D% -2,\\ \sum_{z\in\Gamma_{D}^{\rightarrow}(x)}(B)_{yz},&\mbox{if }\partial(x,y)\in\{D-% 1,D\}\end{array}\right.\quad(x,y\in X).

(17)

Let ${\mathcal{B}}$ denote the vector space of all polynomials in $B$ . Since $\{p_{i}(B)\}_{i=0}^{D}$ is a basis of ${\mathcal{B}}$ , and $B^{\top}\in{\mathcal{B}}$ (see Proposition 2.11(iv)), there exist complex scalars $\alpha_{h}$ $(0\leq h\leq D)$ such that

A_{D}B^{\top}=\sum_{h=0}^{D}\alpha_{h}p_{h}(B)=\sum_{h=0}^{D-1}\alpha_{h}p_{h}% (B)+\alpha_{D}\underbrace{p_{D}(B)}_{A_{D}}.

(18)

By (17) and (18), note that

\left(\sum_{h=0}^{D}\alpha_{h}p_{h}(B)\right)_{xy}=0\qquad\mbox{if }\partial(x% ,y)\leq D-2.

(19)

Now, from (15), (16) and (19), we have

\left(\sum_{h=0}^{D-1}\alpha_{h}p_{h}(B)+\alpha_{D}A_{D}\right)_{xy}=\left\{% \begin{array}[]{ll}0,&\mbox{if }\partial(x,y)\leq D-2,\\ \alpha_{D-1},&\mbox{if }\partial(x,y)=D-1\\ \alpha_{D},&\mbox{if }\partial(x,y)=D\end{array}\right.\quad(x,y\in X),

or, in other words,

\sum_{h=0}^{D-1}\alpha_{h}p_{h}(B)=\alpha_{D-1}A_{D-1}.

(20)

If $\alpha_{D-1}=0$ then by (18) and (20), $A_{D}B^{\top}=\alpha_{D}A_{D}$ , a contradiction (see (17)). Thus (20) yields that $A_{D-1}$ is a polynomial in $B$ . By Proposition 5.1, $p_{D-1}(B)=A_{D-1}$ .

If $D=3$ , the result follows. Assume that $D\geq 4$ , and that we executed $j\geq 1$ steps from above (where $D-j-1\geq 1$ ), i.e.,

\displaystyle A_{D}

\displaystyle=p_{D}(B),\qquad A_{D-1}=p_{D-1}(B),\qquad\ldots,\qquad A_{D-j}=p% _{D-j}(B).

We now show that $A_{D-j-1}=p_{D-j-1}(B)$ . Pick $x,y\in X$ such that $\partial(x,y)=D-j-1$ , and note that $(q_{D-j-2}(B))_{xy}=0$ (by Lemma 2.8). By Lemma 4.5,

	$\displaystyle 1=(J)_{xy}$	$\displaystyle=(q_{D}(B))_{xy}$
		$\displaystyle=\left(q_{D-j-2}(B)+p_{D-j-1}(B)+\sum_{i=D-j}^{D}p_{i}(B)\right)_% {xy}$
		$\displaystyle=\underbrace{(q_{D-j-2}(B))_{xy}}_{=0}+(p_{D-j-1}(B))_{xy}+% \underbrace{\left(\sum_{i=D-j}^{D}p_{i}(B)\right)_{xy}}_{(A_{D-j}+\cdots+A_{D}% )_{xy}=0}$
		$\displaystyle=(p_{D-j-1}(B))_{xy}.$

We can conclude that,

\partial(x,y)=D-j-1

, then

(p_{D-j-1}(B))_{xy}=1

(21)

Using (21), we prove that $A_{D-j-1}$ is a polynomial in $B$ .

Consider the product $A_{D-j}B^{\top}$ . For arbitrary $x,y\in X$ , by Lemma 5.2, we have

(A_{D-j}B^{\top})_{xy}=0,\qquad\mbox{if }\partial(x,y)<D-j-1.

(22)

Since $\{p_{i}(B)\}_{i=0}^{D}$ is a linearly independent set and $B^{\top}\in{\mathcal{B}}$ , there exist complex scalars $\beta_{h}$ $(0\leq h\leq D)$ such that

A_{D-j}B^{\top}=\sum_{h=0}^{D}\beta_{h}p_{h}(B)=\sum_{h=0}^{D-j-1}\beta_{h}p_{% h}(B)+\beta_{D-j}A_{D-j}+\cdots+\beta_{D}A_{D}.

(23)

By (21), (22) and (23) we have

\sum_{h=0}^{D-j-1}\beta_{h}p_{h}(B)=\beta_{D-j-1}A_{D-j-1}.

Next we prove that $\beta_{D-j-1}\neq 0$ . The proof is by contradiction. If $\beta_{D-j-1}=0$ , then (23) becomes $A_{D-j}B^{\top}=\beta_{D-j}A_{D-j}+\beta_{D-j+1}A_{D-j+1}+\cdots+\beta_{D}A_{D}.$ This yields that $(A_{D-j}B^{\top})_{xy}=0$ for all $x\in X$ and $y\in\Gamma^{\rightarrow}_{D-j-1}(x)$ , a contradiction (note that $(A_{D-j}B^{\top})_{xy}=\sum_{z_{\in}\Gamma_{D-j}^{\rightarrow}(x)}(B)_{yz}$ ). Thus, $A_{D-j-1}$ is a polynomial in $B$ . By Proposition 5.1, the result follows.

In some sense, our Theorem 1.2 is similar to the following result from the theory of distance-regular graphs.

Proposition 5.4 ([22, Proposition 2] or [24])

An undirected regular graph $\Gamma=\Gamma(A)$ with diameter $D$ and $d+1$ distinct eigenvalues is distance-regular if and only if $D=d$ and the distance- $D$ matrix $A_{D}$ is a polynomial in $A$ .

5.1 Proof of Theorem 1.2

In this subsection we prove Theorem 1.2.

$(\Leftarrow)$ Assume that $B$ is a normal $\lambda$ -doubly stochastic matrix with $D+1$ distinct eigenvalues and that $A_{D}$ is a polynomial in $B$ . We use this assumption to show that ${\mathcal{B}}$ is the Bose–Mesner algebra of a commutative $D$ -class association scheme.

Since $B$ is a normal matrix with $D+1$ distinct eigenvalues, by Proposition 2.11, $\{I,B,B^{2},\ldots,\break B^{D}\}$ is a basis of ${\mathcal{B}}$ . Since $A_{D}\in{\mathcal{B}}$ , from Proposition 5.3 it follows that the distance- $i$ matrices $A_{i}$ $(0\leq i\leq D)$ belong to ${\mathcal{B}}$ . Furthermore since $\{A_{i}\}_{i=0}^{D}$ is a linearly independent set of matrices, it also forms a basis of ${\mathcal{B}}$ . As $B$ is a normal matrix, note that $\overline{B}^{\top}\in{\mathcal{B}}$ by Proposition 2.11(iv). Now it is routine to check that the distance- $i$ matrices satisfy all properties (AS1)–(AS5) of a commutative association scheme.

$(\Rightarrow)$ Assume that ${\mathcal{B}}$ is the Bose–Mesner algebra of a commutative $D$ -class association scheme. We use this assumption to show that $B$ is a normal $\lambda$ -doubly stochastic matrix (for some $\lambda$ ) with $D+1$ distinct eigenvalues and that the distance- $D$ matrix $A_{D}$ of $\Gamma=\Gamma(A)$ is a polynomial in $B$ , where $A$ is the adjacency matrix of the underlying digraph of $B$ .

The fact that $B$ belongs to a commutative association scheme implies that $B$ is a normal matrix. Since $B$ generates a commutative association scheme and $J$ belongs to the algebra of this scheme, by Theorem 1.1, $B$ is a (normal) $\lambda$ -doubly stochastic matrix (for some $\lambda$ ). For a moment, assume that $B$ has $d+1$ distinct eigenvalues; then, by Proposition 2.11, $\{I,B,B^{2},\ldots,B^{d}\}$ is a basis of ${\mathcal{B}}$ . This is possible only if $d=D$ , and thus $B$ has $D+1$ distinct eigenvalues (see also, for example, [32, Corollary 3.5]).

It is left to show that the distance- $D$ matrix $A_{D}$ is polynomial in $B$ . Let $\{B_{0},B_{1},\ldots,B_{D}\}$ denote the standard basis of ${\mathcal{B}}$ (the basis of $\circ$ -idempotent $01$ -matrices), and let $\theta_{i}$ ’s denote the nonzero scalars such that

B=\sum_{i\in\Phi}\theta_{i}B_{i}.

for some nonempty set of indices $\Phi$ . Note that, by definition, $A=\sum_{i\in\Phi}B_{i}$ is the adjacency matrix of the underlying digraph of $B$ . Since $B$ is an irreducible matrix, the matrix $A$ is irreducible too (Lemma 2.7). Then, by Lemma 2.2, $\Gamma$ is a strongly connected digraph, it is also a regular graph since $A\in{\mathcal{B}}$ .

We consider $\Gamma=\Gamma(A)$ and we finish the proof by proving Claims 1 and 2 below.

Claim 1. For any $i$ $(0\leq i\leq d)$ and $y,z,u,v\in X$ , if $(B_{i})_{zy}=(B_{i})_{uv}=1$ , then $\partial(z,y)=\partial(u,v)$ in $\Gamma$ .

Proof of Claim 1. For every $\ell\in{\mathbb{N}}$ , there exist complex scalars $\alpha^{(\ell)}_{i}$ $(0\leq i\leq d)$ such that $A^{\ell}=\sum_{i=0}^{d}\alpha^{(\ell)}_{i}B_{i}$ . Recall that $\sum_{i=0}^{d}B_{i}=J$ and $B_{i}\circ B_{j}=\delta_{ij}B_{i}$ $(0\leq i,j\leq d)$ . This yields that, for any $y,z,u,v\in X$ and $i$ $(0\leq i\leq d)$ , if $(B_{i})_{zy}\neq 0$ and $(B_{i})_{uv}\neq 0$ , then $(A^{\ell})_{zy}=(A^{\ell})_{uv}=\alpha^{(\ell)}_{i}$ , i.e., the number of walks of length $\ell$ from $z$ to $y$ is equal to the the number of walks of length $\ell$ from $u$ to $v$ (see Lemma 2.8). Moreover, $(A^{\ell})_{zy}=(A^{\ell})_{uv}$ holds for any $\ell$ $(\ell\in{\mathbb{N}})$ . We proceed by a contradiction, in the same spirit as in [21, Lemma 2.3], where the author has an undirected graph. Assume that $\partial(z,y)>\partial(u,v)=m$ . Then, $(A^{m})_{uv}\neq 0$ and $(A^{m})_{zy}=0$ , a contradiction. The claim follows.

Next we show that Claim 2 holds.

Claim 2. Every distance- $i$ matrix $A_{i}$ of $\Gamma=\Gamma(A)$ belongs to ${\mathcal{B}}$ , i.e., $A_{i}\in{\mathcal{B}}$ $(0\leq i\leq D)$ .

Proof of Claim 2. From the proof of Claim 1 it follows that, if $y,z\in X$ are two arbitrary vertices such that $\partial(z,y)=i$ , then there exists $B_{j}$ (for some $0\leq j\leq d$ ) such that $(B_{j})_{zy}=1$ . Recall also that $(A_{i})_{zy}=1$ . In fact, for such an index $j$ and any nonzero $(u,v)$ -entry of $B_{j}$ , we have $\partial(u,v)=i$ . This yields

A_{i}=\sum_{j:A_{i}\circ B_{j}\neq{\boldsymbol{O}}}B_{j}\qquad(0\leq i\leq D).

The result follows.

Acknowledgments

This work is supported in part by the Slovenian Research Agency (research program P1-0285 and research projects J1-3001 and N1-0353).

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

[1] A. S. Asratian, T. M. J. Denley and R. Häggkvist, Bipartite graphs and their applications, volume 131 of Cambridge Tracts in Mathematics, Cambridge University Press, Cambridge, 1998, doi:10.1017/CBO9780511984068, https://doi.org/10.1017/CBO9780511984068.
[2] S. Axler, Linear algebra done right, Undergraduate Texts in Mathematics, Springer, Cham, 3rd edition, 2015, doi:10.1007/978-3-319-11080-6, https://doi.org/10.1007/978-3-319-11080-6.
[3] N. Biggs, Algebraic graph theory, Cambridge Mathematical Library, Cambridge University Press, Cambridge, 2nd edition, 1993.
[4] G. Birkhoff, Three observations on linear algebra, Univ. Nac. Tucumán. Revista A. 5 (1946), 147–151.
[5] B. Bollobás, Modern graph theory, volume 184 of Graduate Texts in Mathematics, Springer-Verlag, New York, 1998, doi:10.1007/978-1-4612-0619-4, https://doi.org/10.1007/978-1-4612-0619-4.
[6] R. A. Brualdi and G. Dahl, Diagonal sums of doubly stochastic matrices, Linear Multilinear Algebra 70 (2022), 4946–4972, doi:10.1080/03081087.2021.1901844, https://doi.org/10.1080/03081087.2021.1901844.
[7] R. A. Brualdi and P. M. Gibson, Convex polyhedra of doubly stochastic matrices. I. Applications of the permanent function, J. Combinatorial Theory Ser. A 22 (1977), 194–230, doi:10.1016/0097-3165(77)90051-6, https://doi.org/10.1016/0097-3165(77)90051-6.
[8] R. A. Brualdi and H. J. Ryser, Combinatorial matrix theory, volume 39 of Encyclopedia of Mathematics and its Applications, Cambridge University Press, Cambridge, 1991, doi:10.1017/CBO9781107325708, https://doi.org/10.1017/CBO9781107325708.
[9] M. Cámara, J. Fàbrega, M. A. Fiol and E. Garriga, Some families of orthogonal polynomials of a discrete variable and their applications to graphs and codes, Electron. J. Combin. 16 (2009), Research Paper 83, 30, doi:10.37236/172, https://doi.org/10.37236/172.
[10] T. S. Chihara, An introduction to orthogonal polynomials, volume Vol. 13 of Mathematics and its Applications, Gordon and Breach Science Publishers, New York-London-Paris, 1978.
[11] F. Comellas, M. A. Fiol, J. Gimbert and M. Mitjana, The spectra of wrapped butterfly digraphs, Networks 42 (2003), 15–19, doi:10.1002/net.10085, https://doi.org/10.1002/net.10085.
[12] F. Comellas, M. A. Fiol, J. Gimbert and M. Mitjana, Weakly distance-regular digraphs, J. Combin. Theory Ser. B 90 (2004), 233–255, doi:10.1016/j.jctb.2003.07.003, https://doi.org/10.1016/j.jctb.2003.07.003.
[13] F. Comellas and M. Mitjana, The spectra of cycle prefix digraphs, SIAM J. Discrete Math. 16 (2003), 418–421, doi:10.1137/S0895480100380604, https://doi.org/10.1137/S0895480100380604.
[14] D. M. Cvetković, M. Doob and H. Sachs, Spectra of graphs, Johann Ambrosius Barth, Heidelberg, 3rd edition, 1995, theory and applications.
[15] C. Dalfó, M. A. Fiol and E. Garriga, Characterizing $(\ell,m)$ -walk-regular graphs, Linear Algebra Appl. 433 (2010), 1821–1826, doi:10.1016/j.laa.2010.06.042, https://doi.org/10.1016/j.laa.2010.06.042.
[16] C. Dalfó, E. R. van Dam, M. A. Fiol, E. Garriga and B. L. Gorissen, On almost distance-regular graphs, J. Combin. Theory Ser. A 118 (2011), 1094–1113, doi:10.1016/j.jcta.2010.10.005, https://doi.org/10.1016/j.jcta.2010.10.005.
[17] V. Diego, J. Fàbrega and M. A. Fiol, Equivalent characterizations of the spectra of graphs and applications to measures of distance-regularity, Electron. J. Linear Algebra 36 (2020), 629–644.
[18] A. Dress and D. Stevanović, Hoffman-type identities, Appl. Math. Lett. 16 (2003), 297–302, doi:10.1016/S0893-9659(03)80047-2, https://doi.org/10.1016/S0893-9659(03)80047-2.
[19] F. Dufossé and B. Uçar, Notes on Birkhoff–von Neumann decomposition of doubly stochastic matrices, Linear Algebra Appl. 497 (2016), 108–115, doi:10.1016/j.laa.2016.02.023, https://doi.org/10.1016/j.laa.2016.02.023.
[20] M. A. Fiol, Algebraic characterizations of distance-regular graphs, volume 246, pp. 111–129, 2002, doi:10.1016/S0012-365X(01)00255-2, formal power series and algebraic combinatorics (Barcelona, 1999), https://doi.org/10.1016/S0012-365X(01)00255-2.
[21] M. A. Fiol, Quotient-polynomial graphs, Linear Algebra Appl. 488 (2016), 363–376, doi:10.1016/j.laa.2015.09.053, https://doi.org/10.1016/j.laa.2015.09.053.
[22] M. A. Fiol, S. Gago and E. Garriga, A simple proof of the spectral excess theorem for distance-regular graphs, Linear Algebra Appl. 432 (2010), 2418–2422, doi:10.1016/j.laa.2009.07.030, https://doi.org/10.1016/j.laa.2009.07.030.
[23] M. A. Fiol and E. Garriga, From local adjacency polynomials to locally pseudo-distance-regular graphs, J. Combin. Theory Ser. B 71 (1997), 162–183, doi:10.1006/jctb.1997.1778, https://doi.org/10.1006/jctb.1997.1778.
[24] M. A. Fiol, E. Garriga and J. L. A. Yebra, Locally pseudo-distance-regular graphs, J. Combin. Theory Ser. B 68 (1996), 179–205, doi:10.1006/jctb.1996.0063, https://doi.org/10.1006/jctb.1996.0063.
[25] M. A. Fiol and S. Penjić, On symmetric association schemes and associated quotient-polynomial graphs, Algebr. Comb. 4 (2021), 947–969, doi:10.5802/alco, https://doi.org/10.5802/alco.
[26] A. J. Hoffman, On the polynomial of a graph, Amer. Math. Monthly 70 (1963), 30–36, doi:10.2307/2312780, https://doi.org/10.2307/2312780.
[27] A. J. Hoffman and M. H. McAndrew, The polynomial of a directed graph, Proc. Amer. Math. Soc. 16 (1965), 303–309, doi:10.2307/2033868, https://doi.org/10.2307/2033868.
[28] R. A. Horn and C. R. Johnson, Matrix analysis, Cambridge University Press, Cambridge, 2nd edition, 2013.
[29] Y. Hou and F. Tian, A note on Hoffman-type identities of graphs, Linear Algebra Appl. 402 (2005), 143–149, doi:10.1016/j.laa.2004.12.017, https://doi.org/10.1016/j.laa.2004.12.017.
[30] D. Margalit, J. Rabinoff and B. Williams, Interactive Linear Algebra: UBC edition, University of British Columbia, 2023, https://personal.math.ubc.ca/~tbjw/ila/index2.html.
[31] C. Meyer, Matrix analysis and applied linear algebra, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 2000, doi:10.1137/1.9780898719512, with 1 CD-ROM (Windows, Macintosh and UNIX) and a solutions manual (iv+171 pp.), https://doi.org/10.1137/1.9780898719512.
[32] G. Monzillo and S. Penjić, On commutative association schemes and associated (directed) graphs, 2023., https://arxiv.longhoe.net/abs/2307.11680.
[33] S. Penjić, Algebraic characterizations of distance-regular graphs, Master’s thesis, University of Sarajevo, 2013.
[34] S. Penjić, On the Terwilliger algebra of bipartite distance-regular graphs, University of Primorska, 2019, thesis (Ph.D.) – University of Primorska, Koper, https://osebje.famnit.upr.si/~penjic/research/.
[35] H. Perfect and L. Mirsky, The distribution of positive elements in doubly-stochastic matrices, J. London Math. Soc. 40 (1965), 689–698, doi:10.1112/jlms/s1-40.1.689, https://doi.org/10.1112/jlms/s1-40.1.689.
[36] R. Sinkhorn and P. Knopp, Concerning nonnegative matrices and doubly stochastic matrices, Pacific J. Math. 21 (1967), 343–348, http://projecteuclid.org/euclid.pjm/1102992505.
[37] Y. Teranishi, The Hoffman number of a graph, Discrete Math. 260 (2003), 255–265, doi:10.1016/S0012-365X(02)00764-1, https://doi.org/10.1016/S0012-365X(02)00764-1.
[38] E. R. van Dam and M. A. Fiol, A short proof of the odd-girth theorem, Electron. J. Combin. 19 (2012), Paper 12, 5, doi:10.37236/2289, https://doi.org/10.37236/2289.
[39] J. H. van Lint and R. M. Wilson, A course in combinatorics, Cambridge University Press, Cambridge, 2nd edition, 2001, doi:10.1017/CBO9780511987045, https://doi.org/10.1017/CBO9780511987045.
[40] D. B. West, Introduction to graph theory, Prentice Hall, Inc., Upper Saddle River, NJ, 1996.
[41] Y. Wu and A. Deng, Hoffman polynomials of nonnegative irreducible matrices and strongly connected digraphs, Linear Algebra Appl. 414 (2006), 138–171, doi:10.1016/j.laa.2005.09.012, https://doi.org/10.1016/j.laa.2005.09.012.
[42] Y. Wu and A. Deng, Hoffman polynomials of nonnegative irreducible matrices and strongly connected digraphs, Linear Algebra Appl. 414 (2006), 138–171, doi:10.1016/j.laa.2005.09.012, https://doi.org/10.1016/j.laa.2005.09.012.
[43] T.-T. Xia, Y.-Y. Tan, X. Liang and J. H. Koolen, On association schemes generated by a relation or an idempotent, Linear Algebra Appl. 670 (2023), 1–18, doi:10.1016/j.laa.2023.03.029, https://doi.org/10.1016/j.laa.2023.03.029.

	$\displaystyle\langle h,p_{j}\rangle$	$\displaystyle={1\over\|X\|}\operatorname{trace}(h(B)\overline{p_{j}(B)}^{\top})$
		$\displaystyle={1\over\|X\|}\operatorname{trace}(Uh(\Lambda)\overline{p_{j}(% \Lambda)}^{\top}\overline{U}^{\top})$
		$\displaystyle={1\over\|X\|}\operatorname{trace}(h(\Lambda)\overline{p_{j}(% \Lambda)}^{\top})$
		$\displaystyle={1\over\|X\|}\operatorname{trace}(h(\Lambda)\overline{p_{j}(% \Lambda)})$
		$\displaystyle={1\over\|X\|}\sum_{\mu\in\operatorname{diag}(\Lambda)}h(\mu)% \overline{p_{j}(\mu)}$
		$\displaystyle={1\over\|X\|}\cdot h(\lambda)\cdot{p_{j}(\lambda)}$

	$\displaystyle\\|q\\|^{2}$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{q(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}(A_{D})_{xy}=\frac{1}{\|X\|}\sum_{x\in X% }(A_{D}\operatorname{\boldsymbol{j}})_{x}$
		$\displaystyle=\frac{1}{\|X\|}\cdot\|X\|\cdot q(\lambda)$
		$\displaystyle=q(\lambda).$

	$\displaystyle\langle q,p_{i}\rangle$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{p_{i}(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}\left(A_{D}\circ\overline{p_{i}(B)}% \right)_{xy}$
		$\displaystyle=\left\{\begin{array}[]{ll}0,&\mbox{ if $0\leq i\leq D-1$},\\ \frac{1}{\|X\|}\displaystyle\sum_{x,y\in X}\left(A_{D}\circ\overline{p_{D}(B)}% \right)_{xy},&\mbox{ if $i=D$}\end{array}\right.(\mbox{by Lemmas~{}\ref{hB}, % \ref{hC}}).$

	$\displaystyle\\|q\\|^{2}$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{q(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}(A_{i})_{xy}$
		$\displaystyle=\frac{1}{\|X\|}\cdot\|X\|\cdot q(\lambda)$
		$\displaystyle=q(\lambda).$

	$\displaystyle\langle q,p_{j}\rangle$	$\displaystyle=\frac{1}{\|X\|}\operatorname{trace}(q(B)\overline{p_{j}(B)}^{\top})$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}\left(A_{i}\circ\overline{p_{j}(B)}% \right)_{xy}$
		$\displaystyle=\frac{1}{\|X\|}\sum_{x,y\in X}\left(A_{i}\circ{p_{j}(B)}\right)_{xy}$
		$\displaystyle=\left\{\begin{array}[]{ll}0,&\mbox{ if $0\leq j\leq i-1$},\\ \displaystyle\frac{1}{\|X\|}\displaystyle\sum_{x,y\in X}\left(A_{i}\circ% \overline{p_{i}(B)}\right)_{xy},&\mbox{ if $j=i$,}\\ 0,&\mbox{ if $i+1\leq j\leq D$}\\ \end{array}\right.(\mbox{by Lemmas~{}\ref{hB}, \ref{hC}}).$

On Hoffman polynomials of λ𝜆\lambdaitalic_λ-doubly stochastic irreducible matrices and commutative association schemes

Abstract

1 Introduction

Problem 1.1

Theorem 1.1

Theorem 1.2

2 Preliminaries

2.1 Doubly stochastic matrix

2.2 Elementary algebraic graph theory

Theorem 2.1 (Perron–Frobenius Theorem)

Lemma 2.2 (see, for example, [31, Section 8.3])

Corollary 2.3

Theorem 2.4 (see, for example, [2, Chapter 7])

Lemma 2.5 ([28, Theorem 1.3.12])

Theorem 2.6

2.3 Underlying digraph of a nonnegative matrix B𝐵Bitalic_B

Lemma 2.7

2.4 Underlying weighted digraph of a nonnegative matrix B𝐵Bitalic_B

2.5 Number of walks in ΓΓ\Gammaroman_Γ and Bℓsuperscript𝐵ℓB^{\ell}italic_B start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT

Lemma 2.8

Lemma 2.9

2.6 A vector space of all polynomials in a normal nonnegative matrix

Definition 2.10

Proposition 2.11

2.7 Commutative association schemes

3 The Hoffman polynomial of a nonnegative matrix

Corollary 3.1

4 On predistance polynomials

Lemma 4.1

Lemma 4.2

Lemma 4.3

Definition 4.4

Lemma 4.5

Proposition 4.6

Lemma 4.7

5 Case when 𝑨𝑫subscript𝑨𝑫\boldsymbol{A_{D}}bold_italic_A start_POSTSUBSCRIPT bold_italic_D end_POSTSUBSCRIPT is polynomial in 𝑩𝑩\boldsymbol{B}bold_italic_B

Proposition 5.1

Lemma 5.2

Proposition 5.3

Proposition 5.4 ([22, Proposition 2] or [24])

5.1 Proof of Theorem 1.2

Acknowledgments

Declaration of competing interest

References

On Hoffman polynomials of $\lambda$ -doubly stochastic irreducible matrices and commutative association schemes

2.3 Underlying digraph of a nonnegative matrix $B$

2.4 Underlying weighted digraph of a nonnegative matrix $B$

2.5 Number of walks in $\Gamma$ and $B^{\ell}$

5 Case when $\boldsymbol{A_{D}}$ is polynomial in $\boldsymbol{B}$