-
Diffusion Models for Generative Artificial Intelligence: An Introduction for Applied Mathematicians
Authors:
Catherine F. Higham,
Desmond J. Higham,
Peter Grindrod
Abstract:
Generative artificial intelligence (AI) refers to algorithms that create synthetic but realistic output. Diffusion models currently offer state of the art performance in generative AI for images. They also form a key component in more general tools, including text-to-image generators and large language models. Diffusion models work by adding noise to the available training data and then learning h…
▽ More
Generative artificial intelligence (AI) refers to algorithms that create synthetic but realistic output. Diffusion models currently offer state of the art performance in generative AI for images. They also form a key component in more general tools, including text-to-image generators and large language models. Diffusion models work by adding noise to the available training data and then learning how to reverse the process. The reverse operation may then be applied to new random data in order to produce new outputs. We provide a brief introduction to diffusion models for applied mathematicians and statisticians. Our key aims are (a) to present illustrative computational examples, (b) to give a careful derivation of the underlying mathematical formulas involved, and (c) to draw a connection with partial differential equation (PDE) diffusion models. We provide code for the computational experiments. We hope that this topic will be of interest to advanced undergraduate students and postgraduate students. Portions of the material may also provide useful motivational examples for those who teach courses in stochastic processes, inference, machine learning, PDEs or scientific computing.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Estimating Network Dimension When the Spectrum Struggles
Authors:
Peter Grindrod,
Desmond John Higham,
Henry-Louis de Kergorlay
Abstract:
What is the dimension of a network? Here, we view it as the smallest dimension of Euclidean space into which nodes can be embedded so that pairwise distances accurately reflect the connectivity structure. We show that a recently proposed and extremely efficient algorithm for data clouds, based on computing first and second nearest neighbour distances, can be used as the basis of an approach for es…
▽ More
What is the dimension of a network? Here, we view it as the smallest dimension of Euclidean space into which nodes can be embedded so that pairwise distances accurately reflect the connectivity structure. We show that a recently proposed and extremely efficient algorithm for data clouds, based on computing first and second nearest neighbour distances, can be used as the basis of an approach for estimating the dimension of a network with weighted edges. We also show how the algorithm can be extended to unweighted networks when combined with spectral embedding. We illustrate the advantages of this technique over the widely-used approach of characterising dimension by visually searching for a suitable gap in the spectrum of the Laplacian.
△ Less
Submitted 25 June, 2023;
originally announced June 2023.
-
A roadmap for the computation of persistent homology
Authors:
Nina Otter,
Mason A. Porter,
Ulrike Tillmann,
Peter Grindrod,
Heather A. Harrington
Abstract:
Persistent homology (PH) is a method used in topological data analysis (TDA) to study qualitative features of data that persist across multiple scales. It is robust to perturbations of input data, independent of dimensions and coordinates, and provides a compact representation of the qualitative features of the input. The computation of PH is an open area with numerous important and fascinating ch…
▽ More
Persistent homology (PH) is a method used in topological data analysis (TDA) to study qualitative features of data that persist across multiple scales. It is robust to perturbations of input data, independent of dimensions and coordinates, and provides a compact representation of the qualitative features of the input. The computation of PH is an open area with numerous important and fascinating challenges. The field of PH computation is evolving rapidly, and new algorithms and software implementations are being updated and released at a rapid pace. The purposes of our article are to (1) introduce theory and computational methods for PH to a broad range of computational scientists and (2) provide benchmarks of state-of-the-art implementations for the computation of PH. We give a friendly introduction to PH, navigate the pipeline for the computation of PH with an eye towards applications, and use a range of synthetic and real-world data sets to evaluate currently available open-source implementations for the computation of PH. Based on our benchmarking, we indicate which algorithms and implementations are best suited to different types of data sets. In an accompanying tutorial, we provide guidelines for the computation of PH. We make publicly available all scripts that we wrote for the tutorial, and we make available the processed version of the data sets used in the benchmarking.
△ Less
Submitted 12 September, 2017; v1 submitted 29 June, 2015;
originally announced June 2015.
-
Opportunities at the Mathematics/Future Cities Interface
Authors:
Peter Grindrod,
Desmond J. Higham,
Robert S. MacKay
Abstract:
We make the case for mathematicians and statisticians to stake their claim in the fast-moving and high-impact research field that is becoming known as Future Cities. After assessing the Future Cities arena, we provide some illustrative challenges where mathematical scientists can make an impact.
We make the case for mathematicians and statisticians to stake their claim in the fast-moving and high-impact research field that is becoming known as Future Cities. After assessing the Future Cities arena, we provide some illustrative challenges where mathematical scientists can make an impact.
△ Less
Submitted 5 September, 2014;
originally announced September 2014.
-
Anticipating Activity in Social Media Spikes
Authors:
Desmond J. Higham,
Peter Grindrod,
Alexander V. Mantzaris,
Amanda Otley,
Peter Laflin
Abstract:
We propose a novel mathematical model for the activity of microbloggers during an external, event-driven spike. The model leads to a testable prediction of who would become most active if a spike were to take place. This type of information is of great interest to commercial organisations, governments and charities, as it identifies key players who can be targeted with information in real time whe…
▽ More
We propose a novel mathematical model for the activity of microbloggers during an external, event-driven spike. The model leads to a testable prediction of who would become most active if a spike were to take place. This type of information is of great interest to commercial organisations, governments and charities, as it identifies key players who can be targeted with information in real time when the network is most receptive. The model takes account of the fact that dynamic interactions evolve over an underlying, static network that records who listens to whom. The model is based on the assumption that, in the case where the entire community has become aware of an external news event, a key driver of activity is the motivation to participate by responding to incoming messages. We test the model on a large scale Twitter conversation concerning the appointment of a UK Premier League football club manager. We also present further results for a Bundesliga football match, a marketing event and a television programme. In each case we find that exploiting the underlying connectivity structure improves the prediction of who will be active during a spike. We also show how the half-life of a spike in activity can be quantified in terms of the network size and the typical response rate.
△ Less
Submitted 8 June, 2014;
originally announced June 2014.
-
Dynamical Systems to Monitor Complex Networks in Continuous Time
Authors:
Peter Grindrod,
Desmond Higham
Abstract:
In many settings it is appropriate to treat the evolution of pairwise interactions over continuous time. We show that new Katz-style centrality measures can be derived in this context via solutions to a nonautonomous ODE driven by the network dynamics. This allows us to identify and track, at any resolution, the most influential nodes in terms of broadcasting and receiving information through time…
▽ More
In many settings it is appropriate to treat the evolution of pairwise interactions over continuous time. We show that new Katz-style centrality measures can be derived in this context via solutions to a nonautonomous ODE driven by the network dynamics. This allows us to identify and track, at any resolution, the most influential nodes in terms of broadcasting and receiving information through time dependent links. In addition to the classical notion of attenuation across edges used in the static Katz centrality measure, the ODE also allows for attenuation over time, so that real time "running measures" can be computed. With regard to computational efficiency, we explain why it is cheaper to track good receivers of information than good broadcasters. We illustrate the new measures on a large scale voice call network, where key features are discovered that are not evident from snapshots or aggregates.
△ Less
Submitted 31 May, 2013;
originally announced May 2013.
-
Aperiodic dynamics in a deterministic model of attitude formation in social groups
Authors:
Jonathan Ward,
Peter Grindrod
Abstract:
Homophily and social influence are the fundamental mechanisms that drive the evolution of attitudes, beliefs and behaviour within social groups. Homophily relates the similarity between pairs of individuals' attitudinal states to their frequency of interaction, and hence structural tie strength, while social influence causes the convergence of individuals' states during interaction. Building on th…
▽ More
Homophily and social influence are the fundamental mechanisms that drive the evolution of attitudes, beliefs and behaviour within social groups. Homophily relates the similarity between pairs of individuals' attitudinal states to their frequency of interaction, and hence structural tie strength, while social influence causes the convergence of individuals' states during interaction. Building on these basic elements, we propose a new mathematical modelling framework to describe the evolution of attitudes within a group of interacting agents. Specifically, our model describes sub-conscious attitudes that have an activator-inhibitor relationship. We consider a homogeneous population using a deterministic, continuous-time dynamical system. Surprisingly, the combined effects of homophily and social influence do not necessarily lead to group consensus or global monoculture. We observe that sub-group formation and polarisation-like effects may be transient, the long-time dynamics being quasi-periodic with sensitive dependence to initial conditions. This is due to the interplay between the evolving interaction network and Turing instability associated with the attitudinal state dynamics.
△ Less
Submitted 1 February, 2013;
originally announced February 2013.