-
Codes with Hierarchical Locality on Artin-Schreier Surfaces
Authors:
Jennifer Berg,
Beth Malmskog,
Mckenzie West
Abstract:
In this article, we construct codes with hierarchical locality using natural geometric structures in Artin-Schreier surfaces of the form $y^p-y=f(x,z)$. Our main theorem describes the codes, their hierarchical structure and recovery algorithms, and gives parameters. We also develop a family of examples using codes defined over $\mathbb{F}_{p^2}$ on the surface $y^p-y=x^{p+1}z^2+x^2z^{p+1}$. We count the $\mathbb{F}_{p^2}$-rational points on the surface, a topic of more general number theoretic interest, and provide more explicit parameters and a better bound on minimum distance for these codes. An additional example and some generalizations are also considered.
Submitted 29 May, 2024;
originally announced May 2024.
-
Learning from Integral Losses in Physics Informed Neural Networks
Authors:
Ehsan Saleh,
Saba Ghaffari,
Timothy Bretl,
Luke Olson,
Matthew West
Abstract:
This work proposes a solution for the problem of training physics-informed networks under partial integro-differential equations. These equations require an infinite or a large number of neural evaluations to construct a single residual for training. As a result, accurate evaluation may be impractical, and we show that naive approximations that replace these integrals with unbiased estimates lead to biased loss functions and solutions. To overcome this bias, we investigate three types of potential solutions: the deterministic sampling approaches, the double-sampling trick, and the delayed target method. We consider three classes of PDEs for benchmarking: one defining Poisson problems with singular charges and weak solutions of up to 10 dimensions, another involving weak solutions on electromagnetic fields and a Maxwell equation, and a third one defining a Smoluchowski coagulation problem. Our numerical results confirm the existence of the aforementioned bias in practice and also show that our proposed delayed target approach can lead to accurate solutions with comparable quality to ones estimated with a large sample size integral. Our implementation is open-source and available at https://github.com/ehsansaleh/btspinn.
Submitted 11 June, 2024; v1 submitted 27 May, 2023;
originally announced May 2023.
-
Generalizing Lloyd's algorithm for graph clustering
Authors:
Tareq Zaman,
Nicolas Nytko,
Ali Taghibakhshi,
Scott MacLachlan,
Luke Olson,
Matthew West
Abstract:
Clustering is a commonplace problem in many areas of data science, with applications in biology and bioinformatics, understanding chemical structure, image segmentation, building recommender systems, and many more fields. While there are many different clustering variants (based on given distance or graph structure, probability distributions, or data density), we consider here the problem of clustering nodes in a graph, motivated by the problem of aggregating discrete degrees of freedom in multigrid and domain decomposition methods for solving sparse linear systems. Specifically, we consider the challenge of forming balanced clusters in the graph of a sparse matrix for use in algebraic multigrid, although the algorithm has general applicability. Based on an extension of the Bellman-Ford algorithm, we generalize Lloyd's algorithm for partitioning subsets of $\mathbb{R}^n$ to balance the number of nodes in each cluster; this is accompanied by a rebalancing algorithm that reduces the overall energy in the system. The algorithm provides control over the number of clusters and leads to "well centered" partitions of the graph. Theoretical results are provided to establish linear complexity and numerical results in the context of algebraic multigrid highlight the benefits of improved clustering.
Submitted 22 December, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
A Pair of Non-Isometric Potentials With the Same Semiclassical Invariants
Authors:
Matthew West
Abstract:
We show that there exist pairs of non-isometric potentials for the 1D semiclassical Schrödinger operator whose spectra agree up to $O(h^\infty)$, yet their corresponding eigenvalues differ no less than exponentially. This result was conjectured by Guillemin and Hezari in [GH12], where they prove a very similar result, yet cannot remove the possibility of a subsequence $h_k\to 0$ where the ground state eigenvalues may agree.
Submitted 2 March, 2023;
originally announced March 2023.
-
MG-GNN: Multigrid Graph Neural Networks for Learning Multilevel Domain Decomposition Methods
Authors:
Ali Taghibakhshi,
Nicolas Nytko,
Tareq Uz Zaman,
Scott MacLachlan,
Luke Olson,
Matthew West
Abstract:
Domain decomposition methods (DDMs) are popular solvers for discretized systems of partial differential equations (PDEs), with one-level and multilevel variants. These solvers rely on several algorithmic and mathematical parameters, prescribing overlap, subdomain boundary conditions, and other properties of the DDM. While some work has been done on optimizing these parameters, it has mostly focused on the one-level setting or special cases such as structured-grid discretizations with regular subdomain construction. In this paper, we propose multigrid graph neural networks (MG-GNN), a novel GNN architecture for learning optimized parameters in two-level DDMs. We train MG-GNN using a new unsupervised loss function, enabling effective training on small problems that yields robust performance on unstructured grids that are orders of magnitude larger than those in the training set. We show that MG-GNN outperforms popular hierarchical graph network architectures for this optimization and that our proposed loss function is critical to achieving this improved performance.
Submitted 1 March, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
Generalizing Reduction-Based Algebraic Multigrid
Authors:
Tareq Zaman,
Nicolas Nytko,
Ali Taghibakhshi,
Scott MacLachlan,
Luke Olson,
Matthew West
Abstract:
Algebraic Multigrid (AMG) methods are often robust and effective solvers for the large and sparse linear systems that arise from discretized PDEs and other problems, relying on heuristic graph algorithms to achieve their performance. Reduction-based AMG (AMGr) algorithms attempt to formalize these heuristics by providing two-level convergence bounds that depend concretely on properties of the partitioning of the given matrix into its fine- and coarse-grid degrees of freedom. MacLachlan and Saad (SISC 2007) proved that the AMGr method yields provably robust two-level convergence for symmetric and positive-definite matrices that are diagonally dominant, with a convergence factor bounded as a function of a coarsening parameter. However, when applying AMGr algorithms to matrices that are not diagonally dominant, not only do the convergence factor bounds not hold, but measured performance is notably degraded. Here, we present modifications to the classical AMGr algorithm that improve its performance on matrices that are not diagonally dominant, making use of strength of connection, sparse approximate inverse (SPAI) techniques, and interpolation truncation and rescaling, to improve robustness while maintaining control of the algorithmic costs. We present numerical results demonstrating the robustness of this approach for both classical isotropic diffusion problems and for non-diagonally dominant systems coming from anisotropic diffusion.
Submitted 22 August, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation
Authors:
Nicolas Nytko,
Ali Taghibakhshi,
Tareq Uz Zaman,
Scott MacLachlan,
Luke N. Olson,
Matt West
Abstract:
Sparse matrix representations are ubiquitous in computational science and machine learning, leading to significant reductions in compute time, in comparison to dense representation, for problems that have local connectivity. The adoption of sparse representation in leading ML frameworks such as PyTorch is incomplete, however, with support for both automatic differentiation and GPU acceleration missing. In this work, we present an implementation of a CSR-based sparse matrix wrapper for PyTorch with CUDA acceleration for basic matrix operations, as well as automatic differentiability. We also present several applications of the resulting sparse kernels to optimization problems, demonstrating ease of implementation and performance measurements versus their dense counterparts.
Submitted 9 November, 2023; v1 submitted 9 December, 2022;
originally announced December 2022.
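To make the idea of a differentiable CSR operation in the abstract above concrete, here is a minimal pure-Python sketch of a CSR matrix-vector product and its reverse-mode (adjoint) pass. It is only an illustration of the underlying calculus, not the authors' PyTorch/CUDA implementation; the function names `csr_matvec` and `csr_matvec_backward` are hypothetical.

```python
def csr_matvec(data, indices, indptr, x):
    """Forward pass: y = A @ x for A stored in CSR form."""
    y = [0.0] * (len(indptr) - 1)
    for row in range(len(indptr) - 1):
        for k in range(indptr[row], indptr[row + 1]):
            y[row] += data[k] * x[indices[k]]
    return y


def csr_matvec_backward(data, indices, indptr, x, y_bar):
    """Reverse pass: given y_bar = dL/dy, return (data_bar, x_bar)."""
    data_bar = [0.0] * len(data)  # gradient w.r.t. the stored nonzeros only
    x_bar = [0.0] * len(x)        # x_bar = A^T @ y_bar
    for row in range(len(indptr) - 1):
        for k in range(indptr[row], indptr[row + 1]):
            col = indices[k]
            data_bar[k] = y_bar[row] * x[col]
            x_bar[col] += data[k] * y_bar[row]
    return data_bar, x_bar


# Illustrative matrix (an assumption for this example):
# A = [[1, 0, 2],
#      [0, 3, 0]]
data, indices, indptr = [1.0, 2.0, 3.0], [0, 2, 1], [0, 2, 3]
x = [1.0, 2.0, 3.0]
y = csr_matvec(data, indices, indptr, x)
data_bar, x_bar = csr_matvec_backward(data, indices, indptr, x, [1.0, 1.0])
```

The gradient with respect to the matrix is kept in the same sparsity pattern as `data`, so the backward pass never materialises a dense matrix.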
-
On Entropic Tilting and Predictive Conditioning
Authors:
Emily Tallman,
Mike West
Abstract:
Entropic tilting (ET) is a Bayesian decision-analytic method for constraining distributions to satisfy defined targets or bounds for sets of expectations. This report recapitulates the foundations and basic theory of ET for conditioning predictive distributions on such constraints, recognising the increasing interest in ET in several application areas. Contributions include new results related to connections with regular exponential families of distributions, and the extension of ET to relaxed entropic tilting (RET) where specified values for expectations define bounds rather than exact targets. Additional new developments include theory and examples that condition on quantile constraints for modified predictive distributions and examples relevant to Bayesian forecasting applications.
Submitted 14 August, 2022; v1 submitted 20 July, 2022;
originally announced July 2022.
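As a rough illustration of the tilting idea in the abstract above, the sketch below tilts a discrete distribution so that its mean hits a target value. It assumes the standard exponential-tilting form $q_i \propto p_i e^{\tau x_i}$ with $\tau$ found by bisection; the function name `entropic_tilt`, the bracket `[-50, 50]`, and the toy data are assumptions for the example, not taken from the report.

```python
import math


def entropic_tilt(values, probs, target, lo=-50.0, hi=50.0, iters=200):
    """Return (tau, q) with q_i proportional to p_i * exp(tau * x_i) and mean(q) ~= target.

    Assumes all probs are strictly positive and min(values) < target < max(values).
    """
    if not (min(values) < target < max(values)):
        raise ValueError("target must lie strictly inside the range of values")

    def tilted(tau):
        logw = [math.log(p) + tau * x for x, p in zip(values, probs)]
        m = max(logw)  # subtract the max for numerical stability
        w = [math.exp(l - m) for l in logw]
        s = sum(w)
        return [wi / s for wi in w]

    def mean(tau):
        return sum(q * x for q, x in zip(tilted(tau), values))

    # The tilted mean is increasing in tau, so bisection applies.
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if mean(mid) < target:
            lo = mid
        else:
            hi = mid
    tau = 0.5 * (lo + hi)
    return tau, tilted(tau)


# Toy example: a fair coin on {0, 1} tilted to have mean 0.75.
tau, q = entropic_tilt([0.0, 1.0], [0.5, 0.5], 0.75)
```

For this toy case the tilted distribution is (0.25, 0.75) and the tilting parameter is $\log 3$. A relaxed variant that treats the target as a bound would first check whether the untilted mean already satisfies it.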
-
Bayesian Predictive Decision Synthesis
Authors:
Emily Tallman,
Mike West
Abstract:
Decision-guided perspectives on model uncertainty expand traditional statistical thinking about managing, comparing and combining inferences from sets of models. Bayesian predictive decision synthesis (BPDS) advances conceptual and theoretical foundations, and defines new methodology that explicitly integrates decision-analytic outcomes into the evaluation, comparison and potential combination of candidate models. BPDS extends recent theoretical and practical advances based on both Bayesian predictive synthesis and empirical goal-focused model uncertainty analysis. This is enabled by the development of a novel subjective Bayesian perspective on model weighting in predictive decision settings. Illustrations come from applied contexts including optimal design for regression prediction and sequential time series forecasting for financial portfolio decisions.
Submitted 5 May, 2023; v1 submitted 8 June, 2022;
originally announced June 2022.
-
How does a Rational Agent Act in an Epidemic?
Authors:
S. Yagiz Olmez,
Shubham Aggarwal,
Jin Won Kim,
Erik Miehling,
Tamer Başar,
Matthew West,
Prashant G. Mehta
Abstract:
Evolution of disease in a large population is a function of the top-down policy measures from a centralized planner, as well as the self-interested decisions (to be socially active) of individual agents in a large heterogeneous population. This paper is concerned with understanding the latter based on a mean-field type optimal control model. Specifically, the model is used to investigate the role of partial information on an agent's decision-making, and study the impact of such decisions by a large number of agents on the spread of the virus in the population. The motivation comes from the presymptomatic and asymptomatic spread of the COVID-19 virus where an agent unwittingly spreads the virus. We show that even in a setting with fully rational agents, limited information on the viral state can result in epidemic growth.
Submitted 5 June, 2022;
originally announced June 2022.
-
Learning Interface Conditions in Domain Decomposition Solvers
Authors:
Ali Taghibakhshi,
Nicolas Nytko,
Tareq Zaman,
Scott MacLachlan,
Luke Olson,
Matthew West
Abstract:
Domain decomposition methods are widely used and effective in the approximation of solutions to partial differential equations. Yet the optimal construction of these methods requires tedious analysis and is often available only in simplified, structured-grid settings, limiting their use for more complex problems. In this work, we generalize optimized Schwarz domain decomposition methods to unstructured-grid problems, using Graph Convolutional Neural Networks (GCNNs) and unsupervised learning to learn optimal modifications at subdomain interfaces. A key ingredient in our approach is an improved loss function, enabling effective training on relatively small problems, but robust performance on arbitrarily large problems, with computational cost linear in problem size. The performance of the learned linear solvers is compared with both classical and optimized domain decomposition algorithms, for both structured- and unstructured-grid problems.
Submitted 17 October, 2022; v1 submitted 19 May, 2022;
originally announced May 2022.
-
Minimum Distance and Parameter Ranges of Locally Recoverable Codes with Availability from Fiber Products of Curves
Authors:
María Chara,
Sam Kottler,
Beth Malmskog,
Bianca Thompson,
Mckenzie West
Abstract:
We construct families of locally recoverable codes with availability $t\geq 2$ using fiber products of curves, determine the exact minimum distance of many families, and prove a general theorem for minimum distance of such codes. The paper concludes with an exploration of parameters of codes from these families and the fiber product construction more generally. We show that fiber product codes can achieve arbitrarily large rate and arbitrarily small relative defect, and compare to known bounds and important constructions from the literature.
Submitted 7 April, 2022;
originally announced April 2022.
-
Modeling Presymptomatic Spread in Epidemics via Mean-Field Games
Authors:
S. Yagiz Olmez,
Shubham Aggarwal,
Jin Won Kim,
Erik Miehling,
Tamer Başar,
Matthew West,
Prashant G. Mehta
Abstract:
This paper is concerned with developing mean-field game models for the evolution of epidemics. Specifically, an agent's decision -- to be socially active in the midst of an epidemic -- is modeled as a mean-field game with health-related costs and activity-related rewards. By considering the fully and partially observed versions of this problem, the role of information in guiding an agent's rational decision is highlighted. The main contributions of the paper are to derive the equations for the mean-field game in both fully and partially observed settings of the problem, to present a complete analysis of the fully observed case, and to present some analytical results for the partially observed case.
Submitted 19 November, 2021;
originally announced November 2021.
-
Coarse-Grid Selection Using Simulated Annealing
Authors:
Tareq. U. Zaman,
Scott P. MacLachlan,
Luke N. Olson,
Matt West
Abstract:
Multilevel techniques are efficient approaches for solving the large linear systems that arise from discretized partial differential equations and other problems. While geometric multigrid requires detailed knowledge about the underlying problem and its discretization, algebraic multigrid aims to be less intrusive, requiring less knowledge about the origin of the linear system. A key step in algebraic multigrid is the choice of the coarse/fine partitioning, aiming to balance the convergence of the iteration with its cost. In work by MacLachlan and Saad, a constrained combinatorial optimization problem is used to define the "best" coarse grid within the setting of a two-level reduction-based algebraic multigrid method and is shown to be NP-complete. Here, we develop a new coarsening algorithm based on simulated annealing to approximate solutions to this problem, which yields improved results over the greedy algorithm developed previously. We present numerical results for test problems on both structured and unstructured meshes, demonstrating the ability to exploit knowledge about the underlying grid structure if it is available.
Submitted 19 January, 2023; v1 submitted 27 May, 2021;
originally announced May 2021.
-
Verifying Stochastic Hybrid Systems with Temporal Logic Specifications via Model Reduction
Authors:
Yu Wang,
Nima Roohi,
Matthew West,
Mahesh Viswanathan,
Geir E. Dullerud
Abstract:
We present a scalable methodology to verify stochastic hybrid systems. Using the Mori-Zwanzig reduction method, we construct a finite state Markov chain reduction of a given stochastic hybrid system and prove that this reduced Markov chain is approximately equivalent to the original system in a distributional sense. Approximate equivalence of the stochastic hybrid system and its Markov chain reduction means that analyzing the Markov chain with respect to a suitably strengthened property allows us to conclude whether the original stochastic hybrid system meets its temporal logic specifications. We present the first statistical model checking algorithms to verify stochastic hybrid systems against correctness properties, expressed in the linear inequality linear temporal logic (iLTL) or the metric interval temporal logic (MITL).
Submitted 16 September, 2020;
originally announced September 2020.
-
Perspectives on Constrained Forecasting
Authors:
Mike West
Abstract:
This expository paper discusses Bayesian decision analysis perspectives on problems of constrained forecasting. Foundational and pedagogic discussion contrasts decision analytic approaches with the traditional, but typically inappropriate, inferential approach. Illustrative examples include development of novel constrained point forecasting and entropic tilting methodology to explore consistency of a predictive distribution with an imposed or hypothesized constraint. Linear, aggregate constraints define illuminating examples that relate to broadly important problems involving aggregate and hierarchical constraints in commercial and economic forecasting. Discussion explores the impact of different loss functions, questions of how constrained forecasting is impacted by dependencies among outcomes being predicted, and promotes the broader use of decision analysis including routine evaluation of predictive distributions of loss under chosen forecasts/decisions. Extensions to more general constrained forecasting problems, connections with broader interests in forecast reconciliation and other considerations are noted.
Submitted 30 November, 2021; v1 submitted 21 July, 2020;
originally announced July 2020.
-
Periodic intermediate $β$-expansions of Pisot numbers
Authors:
Blaine Quackenbush,
Tony Samuel,
Matthew A. West
Abstract:
The subshift of finite type property (also known as the Markov property) is ubiquitous in dynamical systems and the simplest and most widely studied class of dynamical systems are $β$-shifts, namely transformations of the form $T_{β, α} \colon x \mapsto βx + α\bmod{1}$ acting on $[-α/(β- 1), (1-α)/(β- 1)]$, where $(β, α) \in Δ$ is fixed and where $Δ= \{ (β, α) \in \mathbb{R}^{2} \colon β\in (1,2) \; \text{and} \; 0 \leq α\leq 2-β\}$. Recently, it was shown, by Li et al. (Proc. Amer. Math. Soc. 147(5): 2045-2055, 2019), that the set of $(β, α)$ such that $T_{β, α}$ has the subshift of finite type property is dense in the parameter space $Δ$. Here, they proposed the following question. Given a fixed $β\in (1, 2)$ which is the $n$-th root of a Perron number, does there exist a dense set of $α$ in the fiber $\{β\} \times (0, 2- β)$, so that $T_{β, α}$ has the subshift of finite type property?
We answer this question in the positive for a class of Pisot numbers. Further, we investigate if this question holds true when replacing the subshift of finite type property by the property of being sofic (that is, a factor of a subshift of finite type). In doing so, we generalise a classical result of Schmidt (Bull. London Math. Soc., 12(4): 269-278, 1980) from the case when $α= 0$ to the case when $α\in (0, 2 - β)$. That is, we examine the structure of the set of eventually periodic points of $T_{β, α}$ when $β$ is a Pisot number and when $β$ is the $n$-th root of a Pisot number.
Submitted 28 April, 2020;
originally announced April 2020.
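The map $T_{β, α} \colon x \mapsto βx + α \bmod 1$ from the abstract above can be iterated numerically to produce the digits of an intermediate $β$-expansion. The following is a minimal floating-point sketch restricted to starting points in $[0,1)$; the function names `beta_step` and `orbit_digits` and the sample parameters are assumptions for illustration.

```python
import math


def beta_step(x, beta, alpha):
    """Apply T(x) = beta * x + alpha mod 1 once; return (digit, T(x))."""
    if not (1.0 < beta < 2.0 and 0.0 <= alpha <= 2.0 - beta):
        raise ValueError("(beta, alpha) must satisfy 1 < beta < 2 and 0 <= alpha <= 2 - beta")
    y = beta * x + alpha
    digit = math.floor(y)  # the integer part that the mod 1 discards
    return digit, y - digit


def orbit_digits(x, beta, alpha, steps):
    """Iterate T for `steps` steps; return the emitted digits and the final point."""
    digits = []
    for _ in range(steps):
        d, x = beta_step(x, beta, alpha)
        digits.append(d)
    return digits, x


# Sample parameters chosen so every intermediate value is exact in binary floating point.
digits, last = orbit_digits(0.5, 1.5, 0.25, 4)
```

Floating-point iteration like this is only suitable for exploration. Deciding whether a point is eventually periodic for an algebraic $β$ would need exact arithmetic in the corresponding number field.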
-
Restrictions on Weil polynomials of Jacobians of hyperelliptic curves
Authors:
Edgar Costa,
Ravi Donepudi,
Ravi Fernando,
Valentijn Karemaker,
Caleb Springer,
Mckenzie West
Abstract:
Inspired by experimental data, this paper investigates which isogeny classes of abelian varieties defined over a finite field of odd characteristic contain the Jacobian of a hyperelliptic curve. We provide a necessary condition by demonstrating that the Weil polynomial of a hyperelliptic Jacobian must have a particular form modulo 2. For fixed ${g\geq1}$, the proportion of isogeny classes of $g$-dimensional abelian varieties defined over $\mathbb{F}_q$ which fail this condition is $1 - Q(2g + 2)/2^g$ as $q\to\infty$ ranges over odd prime powers, where $Q(n)$ denotes the number of partitions of $n$ into odd parts.
Submitted 25 November, 2020; v1 submitted 5 February, 2020;
originally announced February 2020.
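The limiting proportion $1 - Q(2g+2)/2^g$ quoted in the abstract above is easy to tabulate. The sketch below counts partitions into odd parts with a standard dynamic program and evaluates the formula for small $g$; the function names `odd_partitions` and `failing_proportion` are hypothetical.

```python
def odd_partitions(n: int) -> int:
    """Q(n): the number of partitions of n into odd parts."""
    ways = [1] + [0] * n  # ways[m] = partitions of m using the odd parts processed so far
    for part in range(1, n + 1, 2):
        for m in range(part, n + 1):
            ways[m] += ways[m - part]
    return ways[n]


def failing_proportion(g: int) -> float:
    """Limiting proportion 1 - Q(2g + 2) / 2^g from the abstract."""
    return 1 - odd_partitions(2 * g + 2) / 2 ** g


for g in range(1, 6):
    print(g, odd_partitions(2 * g + 2), failing_proportion(g))
```

For $g = 1$ and $g = 2$ the formula gives 0, and the first nonzero value is 0.25 at $g = 3$.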
-
Bayesian forecasting of multivariate time series: Scalability, structure uncertainty and decisions
Authors:
Mike West
Abstract:
I overview recent research advances in Bayesian state-space modeling of multivariate time series. A main focus is on the decouple/recouple concept that enables application of state-space models to increasingly large-scale data, applying to continuous or discrete time series outcomes. The scope includes large-scale dynamic graphical models for forecasting and multivariate volatility analysis in areas such as economics and finance, multi-scale approaches for forecasting discrete/count time series in areas such as commercial sales and demand forecasting, and dynamic network flow models for areas including internet traffic monitoring. In applications, explicit forecasting, monitoring and decision goals are paramount and should factor into model assessment and comparison, a perspective that is highlighted.
Submitted 11 December, 2019; v1 submitted 21 November, 2019;
originally announced November 2019.
-
A tree-based radial basis function method for noisy parallel surrogate optimization
Authors:
Chenchao Shou,
Matthew West
Abstract:
Parallel surrogate optimization algorithms have proven to be efficient methods for solving expensive noisy optimization problems. In this work we develop a new parallel surrogate optimization algorithm (ProSRS), using a novel tree-based "zoom strategy" to improve the efficiency of the algorithm. We prove that if ProSRS is run for sufficiently long, with probability converging to one there will be at least one point among all the evaluations that will be arbitrarily close to the global minimum. We compare our algorithm to several state-of-the-art Bayesian optimization algorithms on a suite of standard benchmark functions and two real machine learning hyperparameter-tuning problems. We find that our algorithm not only achieves significantly faster optimization convergence, but is also 1-4 orders of magnitude cheaper in computational cost.
Submitted 21 August, 2019;
originally announced August 2019.
-
A robust implementation for solving the $S$-unit equation and several applications
Authors:
Alejandra Alvarado,
Angelos Koutsianas,
Beth Malmskog,
Christopher Rasmussen,
Christelle Vincent,
Mckenzie West
Abstract:
Let $K$ be a number field, and $S$ a finite set of places in $K$ containing all infinite places. We present an implementation for solving the $S$-unit equation $x + y = 1$, $x,y \in\mathscr{O}_{K,S}^\times$ in the computer algebra package SageMath. This paper outlines the mathematical basis for the implementation. We discuss and reference the results of extensive computations, including exponent bounds for solutions in many fields of small degree for small sets $S$. As an application, we prove an asymptotic version of Fermat's Last Theorem for totally real cubic number fields with bounded discriminant where 2 is totally ramified. In addition, we use the implementation to find all solutions to some cubic Ramanujan-Nagell equations.
Submitted 8 July, 2020; v1 submitted 3 March, 2019;
originally announced March 2019.
-
On the continuity of entropy of Lorenz maps
Authors:
Zoe Cooperband,
Erin P. J. Pearse,
Blaine Quackenbush,
Jordan M. Rowley,
Tony Samuel,
Matthew A. West
Abstract:
We consider a one-parameter family of Lorenz maps indexed by their point of discontinuity $p$ and constructed from a pair of bilipschitz functions. We prove that their topological entropies vary continuously as a function of $p$ and discuss Milnor's monotonicity conjecture in this setting.
Submitted 12 March, 2018;
originally announced March 2018.
-
On the arithmetic of a family of degree-two K3 surfaces
Authors:
Florian Bouyer,
Edgar Costa,
Dino Festi,
Christopher Nicholls,
Mckenzie West
Abstract:
Let $\mathbb{P}$ denote the weighted projective space with weights $(1,1,1,3)$ over the rationals, with coordinates $x,y,z,$ and $w$; let $\mathcal{X}$ be the generic element of the family of surfaces in $\mathbb{P}$ given by \begin{equation*}
X\colon w^2=x^6+y^6+z^6+tx^2y^2z^2. \end{equation*} The surface $\mathcal{X}$ is a K3 surface over the function field $\mathbb{Q}(t)$. In this paper, we explicitly compute the geometric Picard lattice of $\mathcal{X}$, together with its Galois module structure, as well as derive more results on the arithmetic of $\mathcal{X}$ and other elements of the family $X$.
Submitted 24 February, 2018; v1 submitted 6 March, 2017;
originally announced March 2017.
-
Homotopy Decompositions of Gauge Groups over Real Surfaces
Authors:
Michael West
Abstract:
We analyse the homotopy types of gauge groups of principal U(n)-bundles associated to pseudo Real vector bundles in the sense of Atiyah. We provide satisfactory homotopy decompositions of these gauge groups into factors in which the homotopy groups are well known. Therefore, we substantially build upon the low dimensional homotopy groups as provided in a paper by I. Biswas, J. Huisman, and J. Hurtubise.
Submitted 2 January, 2017;
originally announced January 2017.
-
Spectrally arbitrary pattern extensions
Authors:
In-Jae Kim,
Bryan L. Shader,
Kevin N. Vander Meulen,
Matthew West
Abstract:
A matrix pattern is often either a sign pattern with entries in {0,+,-} or, more simply, a nonzero pattern with entries in {0,*}. A matrix pattern A is spectrally arbitrary if for any choice of a real matrix spectrum, there is a real matrix having the pattern A and the chosen spectrum. We describe a graphical technique, a triangle extension, for constructing spectrally arbitrary patterns out of some known lower order spectrally arbitrary patterns. These methods provide a new way of viewing some known spectrally arbitrary patterns, as well as providing many new families of spectrally arbitrary patterns. We also demonstrate how the technique can be applied to certain inertially arbitrary patterns to obtain larger inertially arbitrary patterns. We then provide an additional extension method for zero-nonzero patterns.
Submitted 9 December, 2016;
originally announced December 2016.
-
On Birch and Swinnerton-Dyer's cubic surfaces
Authors:
Mckenzie West
Abstract:
In a 1975 paper of Birch and Swinnerton-Dyer, a number of explicit norm form cubic surfaces are shown to fail the Hasse Principle. They make a correspondence between this failure and the Brauer--Manin obstruction, recently discovered by Manin. We generalize their work, making use of modern computer algebra software to show that a larger set of cubic surfaces have a Brauer--Manin obstruction to the Hasse principle, thus verifying the Colliot-Thélène--Sansuc conjecture for infinitely many cubic surfaces.
Submitted 2 March, 2017; v1 submitted 13 October, 2015;
originally announced October 2015.
-
Sequential Monte Carlo with Adaptive Weights for Approximate Bayesian Computation
Authors:
Fernando V. Bonassi,
Mike West
Abstract:
Methods of approximate Bayesian computation (ABC) are increasingly used for analysis of complex models. A major challenge for ABC is overcoming the often inherent problem of high rejection rates in the accept/reject methods based on prior:predictive sampling. A number of recent developments aim to address this with extensions based on sequential Monte Carlo (SMC) strategies. We build on this here, introducing an ABC SMC method that uses data-based adaptive weights. This easily implemented and computationally trivial extension of ABC SMC can very substantially improve acceptance rates, as is demonstrated in a series of examples with simulated and real data sets, including a currently topical example from dynamic modelling in systems biology applications.
Submitted 26 March, 2015;
originally announced March 2015.
-
Discrete Routh Reduction
Authors:
Sameer M. Jalnapurkar,
Melvin Leok,
Jerrold E. Marsden,
Matthew West
Abstract:
This paper develops the theory of abelian Routh reduction for discrete mechanical systems and applies it to the variational integration of mechanical systems with abelian symmetry. The reduction of variational Runge-Kutta discretizations is considered, as well as the extent to which symmetry reduction and discretization commute. These reduced methods allow the direct simulation of dynamical features such as relative equilibria and relative periodic orbits that can be obscured or difficult to identify in the unreduced dynamics. The methods are demonstrated for the dynamics of an Earth-orbiting satellite with a non-spherical $J_2$ correction, as well as the double spherical pendulum. The $J_2$ problem is interesting because in the unreduced picture, geometric phases inherent in the model and those due to numerical discretization can be hard to distinguish, but this issue does not appear in the reduced algorithm, where one can directly observe interesting dynamical structures in the reduced phase space (the cotangent bundle of shape space), in which the geometric phases have been removed. The main feature of the double spherical pendulum example is that it has a nontrivial magnetic term in its reduced symplectic form. Our method is still efficient as it can directly handle the essential non-canonical nature of the symplectic structure. In contrast, a traditional symplectic method for canonical systems could require repeated coordinate changes if one is invoking Darboux's theorem to transform the symplectic structure into canonical form, thereby incurring additional computational cost. Our method allows one to design reduced symplectic integrators in a natural way, despite the non-canonical nature of the symplectic structure.
Submitted 5 January, 2006; v1 submitted 17 August, 2005;
originally announced August 2005.
-
Variational methods, multisymplectic geometry and continuum mechanics
Authors:
Jerrold E. Marsden,
Sergey Pekarsky,
Steve Shkoller,
Matthew West
Abstract:
This paper presents a variational and multisymplectic formulation of both compressible and incompressible models of continuum mechanics on general Riemannian manifolds. A general formalism is developed for non-relativistic first-order multisymplectic field theories with constraints, such as the incompressibility constraint. The results obtained in this paper set the stage for multisymplectic reduction and for the further development of Veselov-type multisymplectic discretizations and numerical algorithms. The latter will be the subject of a companion paper.
Submitted 3 May, 2000;
originally announced May 2000.