-
The G-invariant graph Laplacian
Authors:
Eitan Rosen,
Paulina Hoyos,
Xiuyuan Cheng,
Joe Kileel,
Yoel Shkolnisky
Abstract:
Graph Laplacian based algorithms for data lying on a manifold have been proven effective for tasks such as dimensionality reduction, clustering, and denoising. In this work, we consider data sets whose data points lie on a manifold that is closed under the action of a known unitary matrix Lie group G. We propose to construct the graph Laplacian by incorporating the distances between all the pairs…
▽ More
Graph Laplacian based algorithms for data lying on a manifold have been proven effective for tasks such as dimensionality reduction, clustering, and denoising. In this work, we consider data sets whose data points lie on a manifold that is closed under the action of a known unitary matrix Lie group G. We propose to construct the graph Laplacian by incorporating the distances between all the pairs of points generated by the action of G on the data set. We deem the latter construction the ``G-invariant Graph Laplacian'' (G-GL). We show that the G-GL converges to the Laplace-Beltrami operator on the data manifold, while enjoying a significantly improved convergence rate compared to the standard graph Laplacian which only utilizes the distances between the points in the given data set. Furthermore, we show that the G-GL admits a set of eigenfunctions that have the form of certain products between the group elements and eigenvectors of certain matrices, which can be estimated from the data efficiently using FFT-type algorithms. We demonstrate our construction and its advantages on the problem of filtering data on a noisy manifold closed under the action of the special unitary group SU(2).
△ Less
Submitted 28 June, 2024; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Diffusion Maps for Group-Invariant Manifolds
Authors:
Paulina Hoyos,
Joe Kileel
Abstract:
In this article, we consider the manifold learning problem when the data set is invariant under the action of a compact Lie group $K$. Our approach consists in augmenting the data-induced graph Laplacian by integrating over the $K$-orbits of the existing data points, which yields a $K$-invariant graph Laplacian $L$. We prove that $L$ can be diagonalized by using the unitary irreducible representat…
▽ More
In this article, we consider the manifold learning problem when the data set is invariant under the action of a compact Lie group $K$. Our approach consists in augmenting the data-induced graph Laplacian by integrating over the $K$-orbits of the existing data points, which yields a $K$-invariant graph Laplacian $L$. We prove that $L$ can be diagonalized by using the unitary irreducible representation matrices of $K$, and we provide an explicit formula for computing its eigenvalues and eigenfunctions. In addition, we show that the normalized Laplacian operator $L_N$ converges to the Laplace-Beltrami operator of the data manifold with an improved convergence rate, where the improvement grows with the dimension of the symmetry group $K$. This work extends the steerable graph Laplacian framework of Landa and Shkolnisky from the case of $\operatorname{SO}(2)$ to arbitrary compact Lie groups.
△ Less
Submitted 3 April, 2023; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Discrete diffusion-type equation on regular graphs and its applications
Authors:
Carlos A. Cadavid,
Paulina Hoyos,
Jay Jorgenson,
Lejla Smajlović,
Juan D. Vélez
Abstract:
We derive an explicit formula for the fundamental solution $K_{T_{q+1}}(x,x_{0};t)$ to the discrete-time diffusion equation on the $(q+1)$-regular tree $T_{q+1}$ in terms of the discrete $I$-Bessel function. We then use the formula to derive an explicit expression for the fundamental solution $K_{X}(x,x_{0};t)$ to the discrete-time diffusion equation on any $(q+1)$-regular graph $X$. Going further…
▽ More
We derive an explicit formula for the fundamental solution $K_{T_{q+1}}(x,x_{0};t)$ to the discrete-time diffusion equation on the $(q+1)$-regular tree $T_{q+1}$ in terms of the discrete $I$-Bessel function. We then use the formula to derive an explicit expression for the fundamental solution $K_{X}(x,x_{0};t)$ to the discrete-time diffusion equation on any $(q+1)$-regular graph $X$. Going further, we develop three applications. The first one is to derive a general trace formula that relates the spectral data on $X$ to its topological data. Though we emphasize the results in the case when $X$ is finite, our method also applies when $X$ has a countably infinite number of vertices. As a second application, we obtain a closed-form expression for the return time probability distribution of the uniform random walk on any $(q+1)$-regular graph. The expression is obtained by relating $K_{X}(x,x_{0};t)$ to the uniform random walk on a $(q+1)$-regular graph. We then show that if $\{X_{h}\}$ is a sequence of $(q+1)$-regular graphs whose number of vertices goes to infinity and which satisfies a certain natural geometric condition, then the limit of the return time probability distributions from $\{X_{h}\}$ is equal to the return time probability distribution on the tree $T_{q+1}$. As a third application, we derive formulas which express the number of distinct closed irreducible walks without tails on a finite graph $X$ in terms of moments of the spectrum of its adjacency matrix.
△ Less
Submitted 23 March, 2023; v1 submitted 24 August, 2022;
originally announced August 2022.
-
On an approach for evaluating certain trigonometric character sums using the discrete time heat kernel
Authors:
Carlos A. Cadavid,
Paulina Hoyos,
Jay Jorgenson,
Lejla Smajlović,
Juan D. Vélez
Abstract:
In this article we develop a general method by which one can explicitly evaluate certain sums of $n$-th powers of products of $d\geq 1$ elementary trigonometric functions evaluated at $\mathbf{m}=(m_1,\ldots,m_d)$-th roots of unity. Our approach is to first identify the individual terms in the expression under consideration as eigenvalues of a discrete Laplace operator associated to a graph whose…
▽ More
In this article we develop a general method by which one can explicitly evaluate certain sums of $n$-th powers of products of $d\geq 1$ elementary trigonometric functions evaluated at $\mathbf{m}=(m_1,\ldots,m_d)$-th roots of unity. Our approach is to first identify the individual terms in the expression under consideration as eigenvalues of a discrete Laplace operator associated to a graph whose vertices form a $d$-dimensional discrete torus $G_{\mathbf{m}}$ which depends on $\mathbf{m}$. The sums in question are then related to the $n$-th step of a Markov chain on $G_{\mathbf{m}}$. The Markov chain admits the interpretation as a particular random walk, also viewed as a discrete time and discrete space heat diffusion, so then the sum in question is related to special values of the associated heat kernel. Our evaluation follows by deriving a combinatorial expression for the heat kernel, which is obtained by periodizing the heat kernel on the infinite lattice $\mathbb{Z}^{d}$ which covers $G_{\mathbf{m}}$.
△ Less
Submitted 23 October, 2022; v1 submitted 19 January, 2022;
originally announced January 2022.
-
An integer factorization algorithm which uses diffusion as a computational engine
Authors:
Carlos A. Cadavid,
Paulina Hoyos,
Jay Jorgenson,
Lejla Smajlović,
Juan D. Vélez
Abstract:
In this article we develop an algorithm which computes a divisor of an integer $N$, which is assumed to be neither prime nor the power of a prime. The algorithm uses discrete time heat diffusion on a finite graph. If $N$ has $m$ distinct prime factors, then the probability that our algorithm runs successfully is at least $p(m) = 1-(m+1)/2^{m}$. We compute the computational complexity of the algori…
▽ More
In this article we develop an algorithm which computes a divisor of an integer $N$, which is assumed to be neither prime nor the power of a prime. The algorithm uses discrete time heat diffusion on a finite graph. If $N$ has $m$ distinct prime factors, then the probability that our algorithm runs successfully is at least $p(m) = 1-(m+1)/2^{m}$. We compute the computational complexity of the algorithm in terms of classical, or digital, steps and in terms of diffusion steps, which is a concept that we define here. As we will discuss below, we assert that a diffusion step can and should be considered as being comparable to a quantum step for an algorithm which runs on a quantum computer. With this, we prove that our factorization algorithm uses at most $O((\log N)^{2})$ deterministic steps and at most $O((\log N)^{2})$ diffusion steps with an implied constant which is effective. By comparison, Shor's algorithm is known to use at most $O((\log N)^{2}\log (\log N) \log (\log \log N))$ quantum steps on a quantum computer.
As an example of our algorithm, we simulate the diffusion computer algorithm on a desktop computer and obtain factorizations of $N=33$ and $N=1363$.
△ Less
Submitted 23 January, 2023; v1 submitted 23 April, 2021;
originally announced April 2021.
-
Composite Higgs models
Authors:
Juan Pablo Hoyos Daza
Abstract:
One of the solutions to the hierarchy problem of the Standard Model is the composite Higgs scenario, where the Higgs emerges as a composite pseudo-Nambu-Goldstone boson. In this work we present and study the basic characteristics of the composite Higgs scenario, based on the $SO(5)/SO(4)$ and $SO(6)/SO(5)$ cosets. We construct their effective Lagrangians through the Callan-Coleman-Wess-Zumino cons…
▽ More
One of the solutions to the hierarchy problem of the Standard Model is the composite Higgs scenario, where the Higgs emerges as a composite pseudo-Nambu-Goldstone boson. In this work we present and study the basic characteristics of the composite Higgs scenario, based on the $SO(5)/SO(4)$ and $SO(6)/SO(5)$ cosets. We construct their effective Lagrangians through the Callan-Coleman-Wess-Zumino construction. The first coset does not differ much from the Standard Model and the second contains a singlet scalar in addition to the Higgs doublet. In these models we study the gauge sector, the fermion sector and we estimate the composite Higgs potential.
△ Less
Submitted 24 August, 2019;
originally announced August 2019.
-
Adaptative significance levels in linear regression models with known variance
Authors:
Alejandra Estefanía Patiño Hoyos,
Victor Fossaluza
Abstract:
The Full Bayesian Significance Test (FBST) for precise hypotheses was presented by Pereira and Stern [Entropy 1(4) (1999) 99-110] as a Bayesian alternative instead of the traditional significance test using p-value. The FBST is based on the evidence in favor of the null hypothesis (H). An important practical issue for the implementation of the FBST is the determination of how large the evidence mu…
▽ More
The Full Bayesian Significance Test (FBST) for precise hypotheses was presented by Pereira and Stern [Entropy 1(4) (1999) 99-110] as a Bayesian alternative instead of the traditional significance test using p-value. The FBST is based on the evidence in favor of the null hypothesis (H). An important practical issue for the implementation of the FBST is the determination of how large the evidence must be in order to decide for its rejection. In the Classical significance tests, it is known that p-value decreases as sample size increases, so by setting a single significance level, it usually leads H rejection. In the FBST procedure, the evidence in favor of H exhibits the same behavior as the p-value when the sample size increases. This suggests that the cut-off point to define the rejection of H in the FBST should be a sample size function. In this work, the scenario of Linear Regression Models with known variance under the Bayesian approach is considered, and a method to find a cut-off value for the evidence in the FBST is presented by minimizing the linear combination of the averaged type I and type II error probabilities for a given sample size and also for a given dimension of the parametric space.
△ Less
Submitted 10 June, 2019;
originally announced June 2019.
-
Adaptative significance levels in normal mean hypothesis testing
Authors:
Alejandra Estefanía Patiño Hoyos,
Victor Fossaluza
Abstract:
The Full Bayesian Significance Test (FBST) for precise hypotheses was presented by Pereira and Stern (1999) as a Bayesian alternative instead of the traditional significance test based on p-value. The FBST uses the evidence in favor of the null hypothesis ($H_0$) calculated as the complement of the posterior probability of the highest posterior density region, which is tangent to the set defined b…
▽ More
The Full Bayesian Significance Test (FBST) for precise hypotheses was presented by Pereira and Stern (1999) as a Bayesian alternative instead of the traditional significance test based on p-value. The FBST uses the evidence in favor of the null hypothesis ($H_0$) calculated as the complement of the posterior probability of the highest posterior density region, which is tangent to the set defined by $H_0$. An important practical issue for the implementation of the FBST is the determination of how large the evidence must be in order to decide for its rejection. In the Classical significance tests, the most used measure for rejecting a hypothesis is p-value. It is known that p-value decreases as sample size increases, so by setting a single significance level, it usually leads $H_0$ rejection. In the FBST procedure, the evidence in favor of $H_0$ exhibits the same behavior as the p-value when the sample size increases. This suggests that the cut-off point to define the rejection of $H_0$ in the FBST should be a sample size function. In this work, we focus on the case of two-sided normal mean hypothesis testing and present a method to find a cut-off value for the evidence in the FBST by minimizing the linear combination of the type I error probability and the expected type II error probability for a given sample size.
△ Less
Submitted 29 August, 2018;
originally announced August 2018.