Showing 1–50 of 336 results for author: Turner, E

  1. arXiv:2406.13493  [pdf, other]

    cs.LG stat.ML

    In-Context In-Context Learning with Transformer Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Adrian Weller, Richard E. Turner

    Abstract: Neural processes (NPs) are a powerful family of meta-learning models that seek to approximate the posterior predictive map of the ground-truth stochastic process from which each dataset in a meta-dataset is sampled. There are many cases in which practitioners, besides having access to the dataset of interest, may also have access to other datasets that share similarities with it. In this case, int…

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.13488  [pdf, other]

    stat.ML cs.LG

    Approximately Equivariant Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Adrian Weller, Wessel Bruinsma, Richard E. Turner

    Abstract: Equivariant deep learning architectures exploit symmetries in learning problems to improve the sample efficiency of neural-network-based models and their ability to generalise. However, when modelling real-world data, learning problems are often not exactly equivariant, but only approximately. For example, when estimating the global temperature field from weather station observations, local topogr…

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.13151  [pdf, other]

    stat.ML cs.LG stat.CO

    von Mises Quasi-Processes for Bayesian Circular Regression

    Authors: Yarden Cohen, Alexandre Khae Wu Navarro, Jes Frellsen, Richard E. Turner, Raziel Riemer, Ari Pakman

    Abstract: The need for regression models to predict circular values arises in many scientific fields. In this work we explore a family of expressive and interpretable distributions over circle-valued random functions related to Gaussian processes targeting two Euclidean dimensions conditioned on the unit circle. The resulting probability model has connections with continuous spin models in statistical physi…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Contribution to the Structured Probabilistic Inference & Generative Modeling workshop of ICML 2024

  4. arXiv:2406.12409  [pdf, other]

    stat.ML cs.LG

    Translation Equivariant Transformer Neural Processes

    Authors: Matthew Ashman, Cristiana Diaconu, Junhyuck Kim, Lakee Sivaraya, Stratis Markou, James Requeima, Wessel P. Bruinsma, Richard E. Turner

    Abstract: The effectiveness of neural processes (NPs) in modelling posterior prediction maps -- the mapping from data to posterior predictive distributions -- has significantly improved since their inception. This improvement can be attributed to two principal factors: (1) advancements in the architecture of permutation invariant set functions, which are intrinsic to all NPs; and (2) leveraging symmetries p…

    Submitted 18 June, 2024; originally announced June 2024.

  5. arXiv:2406.08569  [pdf, other]

    cs.LG cs.CR stat.ML

    Noise-Aware Differentially Private Regression via Meta-Learning

    Authors: Ossi Räisä, Stratis Markou, Matthew Ashman, Wessel P. Bruinsma, Marlon Tobaben, Antti Honkela, Richard E. Turner

    Abstract: Many high-stakes applications require machine learning models that protect user privacy and provide well-calibrated, accurate predictions. While Differential Privacy (DP) is the gold standard for protecting user privacy, standard DP mechanisms typically significantly impair performance. One approach to mitigating this issue is pre-training models on simulated data before DP learning on the private…

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.01801  [pdf, other]

    stat.ML cs.LG

    Fearless Stochasticity in Expectation Propagation

    Authors: Jonathan So, Richard E. Turner

    Abstract: Expectation propagation (EP) is a family of algorithms for performing approximate inference in probabilistic models. The updates of EP involve the evaluation of moments -- expectations of certain functions -- which can be estimated from Monte Carlo (MC) samples. However, the updates are not robust to MC noise when performed naively, and various prior works have attempted to address this issue in d…

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.19434  [pdf, other]

    astro-ph.HE

    The Pulsar Science Collaboratory: Multi-Epoch Scintillation Studies of Pulsars

    Authors: Jacob E. Turner, Juan G. Lebron Medina, Zachary Zelensky, Kathleen A. Gustavso, Jeffrey Marx, Manvith Kothapalli, Luis D. Cruz Vega, Alexander Lee, Caryelis B. Figueroa, Daniel E. Reichart, Joshua B. Haislip, Vladimir V. Kouprianov, Steve White, Frank Ghigo, Sue Ann Heatherly, Maura A. McLaughlin

    Abstract: We report on findings from scintillation analyses using high-cadence observations of nine canonical pulsars with observing baselines ranging from one to three years. We obtain scintillation bandwidth and timescale measurements for all pulsars in our survey and obtain scintillation arc curvature measurements for four pulsars, detecting multiple arcs for two of them. Using updated pulsar distance es…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Submitted to ApJ

  8. arXiv:2405.16541  [pdf, other]

    stat.ML cs.LG

    Variance-Reducing Couplings for Random Features: Perspectives from Optimal Transport

    Authors: Isaac Reid, Stratis Markou, Krzysztof Choromanski, Richard E. Turner, Adrian Weller

    Abstract: Random features (RFs) are a popular technique to scale up kernel methods in machine learning, replacing exact kernel evaluations with stochastic Monte Carlo estimates. They underpin models as diverse as efficient transformers (by approximating attention) to sparse spectrum Gaussian processes (by approximating the covariance function). Efficiency can be further improved by speeding up the convergen…

    Submitted 26 May, 2024; originally announced May 2024.

  9. arXiv:2405.13063  [pdf, other]

    physics.ao-ph cs.LG

    Aurora: A Foundation Model of the Atmosphere

    Authors: Cristian Bodnar, Wessel P. Bruinsma, Ana Lucic, Megan Stanley, Johannes Brandstetter, Patrick Garvan, Maik Riechert, Jonathan Weyn, Haiyu Dong, Anna Vaughan, Jayesh K. Gupta, Kit Tambiratnam, Alex Archibald, Elizabeth Heider, Max Welling, Richard E. Turner, Paris Perdikaris

    Abstract: Deep learning foundation models are revolutionizing many facets of science by leveraging vast amounts of data to learn general-purpose representations that can be adapted to tackle diverse downstream tasks. Foundation models hold the promise to also transform our ability to model our planet and its subsystems by exploiting the vast expanse of Earth system data. Here we introduce Aurora, a large-sc…

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  10. arXiv:2405.12856  [pdf, other]

    stat.ML cs.CL cs.LG

    LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language

    Authors: James Requeima, John Bronskill, Dami Choi, Richard E. Turner, David Duvenaud

    Abstract: Machine learning practitioners often face significant challenges in formally integrating their prior knowledge and beliefs into predictive models, limiting the potential for nuanced and context-aware analyses. Moreover, the expertise needed to integrate this prior knowledge into probabilistic modeling typically limits the application of these models to specialists. Our goal is to build a regressio…

    Submitted 25 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  11. arXiv:2404.13796  [pdf, other]

    astro-ph.HE

    A Cyclic Spectroscopy Scintillation Study of PSR B1937+21 I. Demonstration of Improved Scintillometry

    Authors: Jacob E. Turner, Timothy Dolch, James M. Cordes, Stella K. Ocker, Daniel R. Stinebring, Shami Chatterjee, Maura A. McLaughlin, Victoria E. Catlett, Cody Jessup, Nathaniel Jones, Christopher Scheithauer

    Abstract: We use cyclic spectroscopy to perform high frequency-resolution analyses of multi-hour baseband Arecibo observations of the millisecond pulsar PSR B1937+21. This technique allows for the examination of scintillation features in far greater detail than is otherwise possible under most pulsar timing array observing setups. We measure scintillation bandwidths and timescales in each of eight subbands…

    Submitted 21 June, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted to ApJ

  12. arXiv:2404.07020  [pdf, other]

    astro-ph.HE astro-ph.GA

    The NANOGrav 15 yr Data Set: Looking for Signs of Discreteness in the Gravitational-wave Background

    Authors: Gabriella Agazie, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Lucas Brown, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Megan E. DeCesar, Paul B. Demorest, Heling Deng, Timothy Dolch, Elizabeth C. Ferrara, William Fiore, Emmanuel Fonseca, Gabriel E. Freedman, Nate Garver-Daniels , et al. (58 additional authors not shown)

    Abstract: The cosmic merger history of supermassive black hole binaries (SMBHBs) is expected to produce a low-frequency gravitational wave background (GWB). Here we investigate how signs of the discrete nature of this GWB can manifest in pulsar timing arrays through excursions from, and breaks in, the expected $f_{\mathrm{GW}}^{-2/3}$ power-law of the GWB strain spectrum. To do this, we create a semi-analyt…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 10 pages, 8 figures, 1 appendix, submitted to ApJ

  13. arXiv:2404.00411  [pdf, other]

    physics.ao-ph cs.LG

    End-to-end data-driven weather forecasting

    Authors: Anna Vaughan, Stratis Markou, Will Tebbutt, James Requeima, Wessel P. Bruinsma, Tom R. Andersson, Michael Herzog, Nicholas D. Lane, Matthew Chantry, J. Scott Hosking, Richard E. Turner

    Abstract: Weather forecasting is critical for a range of human activities including transportation, agriculture, industry, as well as the safety of the general public. Machine learning models have the potential to transform the complex weather prediction pipeline, but current approaches still rely on numerical weather prediction (NWP) systems, limiting forecast speed and accuracy. Here we demonstrate that a…

    Submitted 10 July, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  14. arXiv:2403.12977  [pdf, other]

    cs.CV cs.LG eess.IV stat.AP

    SportsNGEN: Sustained Generation of Multi-player Sports Gameplay

    Authors: Lachlan Thorpe, Lewis Bawden, Karanjot Vendal, John Bronskill, Richard E. Turner

    Abstract: We present a transformer decoder based model, SportsNGEN, that is trained on sports player and ball tracking sequences that is capable of generating realistic and sustained gameplay. We train and evaluate SportsNGEN on a large database of professional tennis tracking data and demonstrate that by combining the generated simulations with a shot classifier and logic to start and end rallies, the syst…

    Submitted 9 February, 2024; originally announced March 2024.

  15. arXiv:2403.01946  [pdf, other]

    cs.LG

    A Generative Model of Symmetry Transformations

    Authors: James Urquhart Allingham, Bruno Kacper Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger, Richard E. Turner, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: Correctly capturing the symmetry transformations of data can lead to efficient models with strong generalization capabilities, though methods incorporating symmetries often require prior knowledge. While recent advancements have been made in learning those symmetries directly from the dataset, most of this work has focused on the discriminative setting. In this paper, we take inspiration from grou…

    Submitted 20 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  16. arXiv:2402.04384  [pdf, other]

    cs.LG stat.ML

    Denoising Diffusion Probabilistic Models in Six Simple Steps

    Authors: Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) are a very popular class of deep generative model that have been successfully applied to a diverse range of problems including image and video generation, protein and material synthesis, weather forecasting, and neural surrogates of partial differential equations. Despite their ubiquity it is hard to find an introduction to DDPMs which is simple, co…

    Submitted 10 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  17. arXiv:2402.03496  [pdf, other]

    cs.LG math.OC

    Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

    Authors: Wu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani

    Abstract: Adaptive gradient optimizers like Adam(W) are the default training algorithms for many deep learning architectures, such as transformers. Their diagonal preconditioner is based on the gradient outer product which is incorporated into the parameter update via a square root. While these methods are often motivated as approximate second-order methods, the square root represents a fundamental differen…

    Submitted 20 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: A long version of the ICML 2024 paper. Updated the abstract and Sec 4 to emphasize the concept of preconditioner invariance

  18. arXiv:2401.01855  [pdf, other]

    cs.LG

    Transformer Neural Autoregressive Flows

    Authors: Massimiliano Patacchiola, Aliaksandra Shysheya, Katja Hofmann, Richard E. Turner

    Abstract: Density estimation, a central problem in machine learning, can be performed using Normalizing Flows (NFs). NFs comprise a sequence of invertible transformations, that turn a complex target distribution into a simple one, by exploiting the change of variables theorem. Neural Autoregressive Flows (NAFs) and Block Neural Autoregressive Flows (B-NAFs) are arguably the most performant members of the NF…

    Submitted 3 January, 2024; originally announced January 2024.

  19. arXiv:2312.05705  [pdf, other]

    cs.LG stat.ML

    Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC

    Authors: Wu Lin, Felix Dangel, Runa Eschenhagen, Kirill Neklyudov, Agustinus Kristiadi, Richard E. Turner, Alireza Makhzani

    Abstract: Second-order methods such as KFAC can be useful for neural net training. However, they are often memory-inefficient since their preconditioning Kronecker factors are dense, and numerically unstable in low precision as they require matrix inversion or decomposition. These limitations render such methods unpopular for modern mixed-precision training. We address them by (i) formulating an inverse-fre…

    Submitted 15 June, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: A long version of the ICML 2024 paper

  20. arXiv:2311.16849  [pdf, other]

    stat.ML cs.LG

    Identifiable Feature Learning for Spatial Data with Nonlinear ICA

    Authors: Hermanni Hälvä, Jonathan So, Richard E. Turner, Aapo Hyvärinen

    Abstract: Recently, nonlinear ICA has surfaced as a popular alternative to the many heuristic models used in deep representation learning and disentanglement. An advantage of nonlinear ICA is that a sophisticated identifiability theory has been developed; in particular, it has been proven that the original components can be recovered under sufficiently strong latent dependencies. Despite this general theory…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Work under review

  21. arXiv:2311.09848  [pdf, other]

    cs.LG

    Diffusion-Augmented Neural Processes

    Authors: Lorenzo Bonito, James Requeima, Aliaksandra Shysheya, Richard E. Turner

    Abstract: Over the last few years, Neural Processes have become a useful modelling tool in many application areas, such as healthcare and climate sciences, in which data are scarce and prediction uncertainty estimates are indispensable. However, the current state of the art in the field (AR CNPs; Bruinsma et al., 2023) presents a few issues that prevent its widespread deployment. This work proposes an alter…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to the NeurIPS 2023 Workshop on Diffusion Models

    ACM Class: I.2.6

  22. arXiv:2311.04369  [pdf, other]

    q-bio.PE

    The context-specificity of virulence evolution revealed through evolutionary invasion analysis

    Authors: Sudam Surasinghe, Ketty Kabengele, Paul E. Turner, C. Brandon Ogbunugafor

    Abstract: Models are often employed to integrate knowledge about epidemics across scales and simulate disease dynamics. While these approaches have played a central role in studying the mechanics underlying epidemics, we lack ways to reliably predict how the relationship between virulence (the harm to hosts caused by an infection) and transmission will evolve in certain virus-host contexts. In this study, w…

    Submitted 7 November, 2023; originally announced November 2023.

  23. arXiv:2311.00636  [pdf, other]

    cs.LG stat.ML

    Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures

    Authors: Runa Eschenhagen, Alexander Immer, Richard E. Turner, Frank Schneider, Philipp Hennig

    Abstract: The core components of many modern neural network architectures, such as transformers, convolutional, or graph neural networks, can be expressed as linear layers with $\textit{weight-sharing}$. Kronecker-Factored Approximate Curvature (K-FAC), a second-order optimisation method, has shown promise to speed up neural network training and thereby reduce computational costs. However, there is currentl…

    Submitted 11 January, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  24. arXiv:2310.19932  [pdf, other]

    cs.LG physics.ao-ph

    Sim2Real for Environmental Neural Processes

    Authors: Jonas Scholz, Tom R. Andersson, Anna Vaughan, James Requeima, Richard E. Turner

    Abstract: Machine learning (ML)-based weather models have recently undergone rapid improvements. These models are typically trained on gridded reanalysis data from numerical data assimilation systems. However, reanalysis data comes with limitations, such as assumptions about physical laws and low spatiotemporal resolution. The gap between reanalysis and reality has sparked growing interest in training ML mo…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 4 pages, 3 figures, To be published in Tackling Climate Change with Machine Learning workshop at NeurIPS

  25. arXiv:2310.12138  [pdf, other]

    gr-qc astro-ph.GA astro-ph.HE

    The NANOGrav 15-year data set: Search for Transverse Polarization Modes in the Gravitational-Wave Background

    Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Jeremy Baier, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Dallas DeGan, Paul B. Demorest , et al. (74 additional authors not shown)

    Abstract: Recently we found compelling evidence for a gravitational wave background with Hellings and Downs (HD) correlations in our 15-year data set. These correlations describe gravitational waves as predicted by general relativity, which has two transverse polarization modes. However, more general metric theories of gravity can have additional polarization modes which produce different interpulsar correl…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 11 pages, 5 figures

  26. arXiv:2310.11837  [pdf, other]

    stat.ML cs.LG

    Optimising Distributions with Natural Gradient Surrogates

    Authors: Jonathan So, Richard E. Turner

    Abstract: Natural gradient methods have been used to optimise the parameters of probability distributions in a variety of settings, often resulting in fast-converging procedures. Unfortunately, for many distributions of interest, computing the natural gradient has a number of challenges. In this work we propose a novel technique for tackling such issues, which involves reframing the optimisation as one with…

    Submitted 4 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Journal ref: PMLR 238 (2024):2224-2232

  27. arXiv:2309.17438  [pdf, other]

    astro-ph.HE gr-qc

    The NANOGrav 12.5-year data set: A computationally efficient eccentric binary search pipeline and constraints on an eccentric supermassive binary candidate in 3C 66B

    Authors: Gabriella Agazie, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Harsha Blumer, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Belinda D. Cheeseboro, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Megan E. DeCesar, Paul B. Demorest, Lankeswar Dey, Timothy Dolch, Justin A. Ellis, Robert D. Ferdman, Elizabeth C. Ferrara , et al. (63 additional authors not shown)

    Abstract: The radio galaxy 3C 66B has been hypothesized to host a supermassive black hole binary (SMBHB) at its center based on electromagnetic observations. Its apparent 1.05-year period and low redshift ($\sim0.02$) make it an interesting testbed to search for low-frequency gravitational waves (GWs) using Pulsar Timing Array (PTA) experiments. This source has been subjected to multiple searches for contin…

    Submitted 15 January, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 27 Pages, 10 Figures, 1 Table, Accepted for publication in ApJ

  28. arXiv:2309.04443  [pdf, other]

    gr-qc astro-ph.HE

    How to Detect an Astrophysical Nanohertz Gravitational-Wave Background

    Authors: Bence Bécsy, Neil J. Cornish, Patrick M. Meyers, Luke Zoltan Kelley, Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Katerina Chatziioannou, Tyler Cohen, James M. Cordes, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Timothy Dolch , et al. (71 additional authors not shown)

    Abstract: Analysis of pulsar timing data has provided evidence for a stochastic gravitational wave background in the nHz frequency band. The most plausible source of such a background is the superposition of signals from millions of supermassive black hole binaries. The standard statistical techniques used to search for such a background and assess its significance make several simplifying assumptions, nam…

    Submitted 1 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: 14 pages, 8 figures, version matching published paper

    Journal ref: ApJ 959 9 (2023)

  29. arXiv:2309.00693  [pdf, other]

    astro-ph.HE gr-qc

    Comparing recent PTA results on the nanohertz stochastic gravitational wave background

    Authors: The International Pulsar Timing Array Collaboration, G. Agazie, J. Antoniadis, A. Anumarlapudi, A. M. Archibald, P. Arumugam, S. Arumugam, Z. Arzoumanian, J. Askew, S. Babak, M. Bagchi, M. Bailes, A. -S. Bak Nielsen, P. T. Baker, C. G. Bassa, A. Bathula, B. Bécsy, A. Berthereau, N. D. R. Bhat, L. Blecha, M. Bonetti, E. Bortolas, A. Brazier, P. R. Brook, M. Burgay , et al. (220 additional authors not shown)

    Abstract: The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTA…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 21 pages, 9 figures, submitted to ApJ

  30. arXiv:2308.05732  [pdf, other]

    cs.LG cs.AI

    PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers

    Authors: Phillip Lippe, Bastiaan S. Veeling, Paris Perdikaris, Richard E. Turner, Johannes Brandstetter

    Abstract: Time-dependent partial differential equations (PDEs) are ubiquitous in science and engineering. Recently, mostly due to the high computational cost of traditional solution techniques, deep neural network based surrogates have gained increased interest. The practical utility of such neural PDE solvers relies on their ability to provide accurate, stable predictions over long time horizons, which is…

    Submitted 21 October, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: Project website: https://phlippe.github.io/PDERefiner/

  31. arXiv:2307.13797  [pdf, other]

    gr-qc astro-ph.IM

    The NANOGrav 12.5-year Data Set: Search for Gravitational Wave Memory

    Authors: Gabriella Agazie, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Harsha Blumer, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Megan E. DeCesar, Dallas DeGan, Paul B. Demorest, Timothy Dolch, Brendan Drachler, Justin A. Ellis , et al. (65 additional authors not shown)

    Abstract: We present the results of a Bayesian search for gravitational wave (GW) memory in the NANOGrav 12.5-yr data set. We find no convincing evidence for any gravitational wave memory signals in this data set (Bayes factor = 2.8). As such, we go on to place upper limits on the strain amplitude of GW memory events as a function of sky location and event epoch. These upper limits are computed using a sign…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: 29 pages, 5 figures

  32. arXiv:2307.05431  [pdf, other]

    stat.ML cs.LG

    Geometric Neural Diffusion Processes

    Authors: Emile Mathieu, Vincent Dutordoir, Michael J. Hutchinson, Valentin De Bortoli, Yee Whye Teh, Richard E. Turner

    Abstract: Denoising diffusion models have proven to be a flexible and effective paradigm for generative modelling. Their recent extension to infinite dimensional Euclidean spaces has allowed for the modelling of stochastic processes. However, many problems in the natural sciences incorporate symmetries and involve data living in non-Euclidean spaces. In this work, we extend the framework of diffusion models…

    Submitted 11 July, 2023; originally announced July 2023.

  33. arXiv:2307.03093  [pdf, other]

    cs.LG stat.ML

    Beyond Intuition, a Framework for Applying GPs to Real-World Data

    Authors: Kenza Tazi, Jihao Andreas Lin, Ross Viljoen, Alex Gardner, ST John, Hong Ge, Richard E. Turner

    Abstract: Gaussian Processes (GPs) offer an attractive method for regression over small, structured and correlated datasets. However, their deployment is hindered by computational costs and limited guidelines on how to apply GPs beyond simple low-dimensional datasets. We propose a framework to identify the suitability of GPs to a given problem and how to set up a robust and well-specified GP model. The guid…

    Submitted 17 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at the ICML Workshop on Structured Probabilistic Inference and Generative Modelling (2023)

  34. arXiv:2306.16223  [pdf, other]

    astro-ph.HE astro-ph.IM gr-qc

    The NANOGrav 15-year Gravitational-Wave Background Analysis Pipeline

    Authors: Aaron D. Johnson, Patrick M. Meyers, Paul T. Baker, Neil J. Cornish, Jeffrey S. Hazboun, Tyson B. Littenberg, Joseph D. Romano, Stephen R. Taylor, Michele Vallisneri, Sarah J. Vigeland, Ken D. Olum, Xavier Siemens, Justin A. Ellis, Rutger van Haasteren, Sophie Hourihane, Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Bence Bécsy, J. Andrew Casey-Clyde , et al. (71 additional authors not shown)

    Abstract: This paper presents rigorous tests of pulsar timing array methods and software, examining their consistency across a wide range of injected parameters and signal strength. We discuss updates to the 15-year isotropic gravitational-wave background analyses and their corresponding code representations. Descriptions of the internal structure of the flagship algorithms \texttt{Enterprise} and \texttt{P…

    Submitted 7 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 30 pages, 10 figures, 1 table; Companion paper to "The NANOGrav 15-year Data Set: Evidence for a Gravitational-Wave Background"; For questions or comments, please email [email protected]

  35. arXiv:2306.16222  [pdf, other]

    astro-ph.HE gr-qc

    The NANOGrav 15-year Data Set: Bayesian Limits on Gravitational Waves from Individual Supermassive Black Hole Binaries

    Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan DeCesar, Paul B. Demorest, Matthew C. Digman, Timothy Dolch, Brendan Drachler , et al. (74 additional authors not shown)

    Abstract: Evidence for a low-frequency stochastic gravitational wave background has recently been reported based on analyses of pulsar timing array data. The most likely source of such a background is a population of supermassive black hole binaries, the loudest of which may be individually detected in these datasets. Here we present the search for individual supermassive black hole binaries in the NANOGrav…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 23 pages, 13 figures, 2 tables. Accepted for publication in Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email [email protected]

  36. arXiv:2306.16221  [pdf, other]

    astro-ph.HE gr-qc

    The NANOGrav 15-year Data Set: Search for Anisotropy in the Gravitational-Wave Background

    Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Timothy Dolch, Brendan Drachler, Elizabeth C. Ferrara, William Fiore , et al. (68 additional authors not shown)

    Abstract: The North American Nanohertz Observatory for Gravitational Waves (NANOGrav) has reported evidence for the presence of an isotropic nanohertz gravitational wave background (GWB) in its 15 yr dataset. However, if the GWB is produced by a population of inspiraling supermassive black hole binary (SMBHB) systems, then the background is predicted to be anisotropic, depending on the distribution of these…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 19 pages, 11 figures; submitted to Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email [email protected]

  37. arXiv:2306.16220  [pdf, other]

    astro-ph.HE astro-ph.CO gr-qc

    The NANOGrav 15-year Data Set: Constraints on Supermassive Black Hole Binaries from the Gravitational Wave Background

    Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Paul T. Baker, Bence Bécsy, Laura Blecha, Alexander Bonilla, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Katerina Chatziioannou, Belinda D. Cheeseboro, Siyuan Chen, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Curt J. Cutler, et al. (89 additional authors not shown)

    Abstract: The NANOGrav 15-year data set shows evidence for the presence of a low-frequency gravitational-wave background (GWB). While many physical processes can source such low-frequency gravitational waves, here we analyze the signal as coming from a population of supermassive black hole (SMBH) binaries distributed throughout the Universe. We show that astrophysically motivated models of SMBH binary popul…

    Submitted 18 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted by Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email [email protected]. Edited to fix two equation typos (Eq.13 & 21), and minor text typos

  38. arXiv:2306.16219  [pdf, other]

    astro-ph.HE astro-ph.CO gr-qc hep-ph

    The NANOGrav 15-year Data Set: Search for Signals from New Physics

    Authors: Adeela Afzal, Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Jose Juan Blanco-Pillado, Laura Blecha, Kimberly K. Boddy, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, Maria Charisi, Shami Chatterjee, Katerina Chatziioannou, Belinda D. Cheeseboro, Siyuan Chen, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, et al. (98 additional authors not shown)

    Abstract: The 15-year pulsar timing data set collected by the North American Nanohertz Observatory for Gravitational Waves (NANOGrav) shows positive evidence for the presence of a low-frequency gravitational-wave (GW) background. In this paper, we investigate potential cosmological interpretations of this signal, specifically cosmic inflation, scalar-induced GWs, first-order phase transitions, cosmic string…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 74 pages, 31 figures, 4 tables; published in Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email [email protected]

  39. arXiv:2306.16218  [pdf, other]

    astro-ph.HE astro-ph.CO astro-ph.GA astro-ph.IM gr-qc

    The NANOGrav 15-Year Data Set: Detector Characterization and Noise Budget

    Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Timothy Dolch, Brendan Drachler, Elizabeth C. Ferrara, William Fiore, Emmanuel Fonseca, et al. (66 additional authors not shown)

    Abstract: Pulsar timing arrays (PTAs) are galactic-scale gravitational wave detectors. Each individual arm, composed of a millisecond pulsar, a radio telescope, and a kiloparsecs-long path, differs in its properties but, in aggregate, can be used to extract low-frequency gravitational wave (GW) signals. We present a noise and sensitivity analysis to accompany the NANOGrav 15-year data release and associated…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 67 pages, 73 figures, 3 tables; published in Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email [email protected]

  40. arXiv:2306.16217  [pdf, other]

    astro-ph.HE astro-ph.IM

    The NANOGrav 15-year Data Set: Observations and Timing of 68 Millisecond Pulsars

    Authors: Gabriella Agazie, Md Faisal Alam, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Laura Blecha, Victoria Bonidie, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Bence Bécsy, Christopher Chapman, Maria Charisi, Shami Chatterjee, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Megan E. DeCesar, Paul B. Demorest, Timothy Dolch, Brendan Drachler, et al. (75 additional authors not shown)

    Abstract: We present observations and timing analyses of 68 millisecond pulsars (MSPs) comprising the 15-year data set of the North American Nanohertz Observatory for Gravitational Waves (NANOGrav). NANOGrav is a pulsar timing array (PTA) experiment that is sensitive to low-frequency gravitational waves. This is NANOGrav's fifth public data release, including both "narrowband" and "wideband" time-of-arrival…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 90 pages, 74 figures, 6 tables; published in Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email [email protected]

  41. arXiv:2306.16213  [pdf, other]

    astro-ph.HE gr-qc

    The NANOGrav 15-year Data Set: Evidence for a Gravitational-Wave Background

    Authors: Gabriella Agazie, Akash Anumarlapudi, Anne M. Archibald, Zaven Arzoumanian, Paul T. Baker, Bence Bécsy, Laura Blecha, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Rand Burnette, Robin Case, Maria Charisi, Shami Chatterjee, Katerina Chatziioannou, Belinda D. Cheeseboro, Siyuan Chen, Tyler Cohen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Kathryn Crowter, Curt J. Cutler, Megan E. DeCesar, et al. (89 additional authors not shown)

    Abstract: We report multiple lines of evidence for a stochastic signal that is correlated among 67 pulsars from the 15-year pulsar-timing data set collected by the North American Nanohertz Observatory for Gravitational Waves. The correlations follow the Hellings-Downs pattern expected for a stochastic gravitational-wave background. The presence of such a gravitational-wave background with a power-law-spectr…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 30 pages, 18 figures. Published in Astrophysical Journal Letters as part of Focus on NANOGrav's 15-year Data Set and the Gravitational Wave Background. For questions or comments, please email [email protected]

  42. arXiv:2306.13554  [pdf, other]

    cs.LG cs.AI

    Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

    Authors: Massimiliano Patacchiola, Mingfei Sun, Katja Hofmann, Richard E. Turner

    Abstract: In this paper we explore few-shot imitation learning for control problems, which involves learning to imitate a target policy by accessing a limited set of offline rollouts. This setting has been relatively under-explored despite its relevance to robotics and control applications. State-of-the-art methods developed to tackle few-shot imitation rely on meta-learning, which is expensive to train as…

    Submitted 23 June, 2023; originally announced June 2023.

  43. arXiv:2304.10557  [pdf, other]

    cs.LG cs.AI

    An Introduction to Transformers

    Authors: Richard E. Turner

    Abstract: The transformer is a neural network component that can be used to learn useful representations of sequences or sets of data-points. The transformer has driven recent advances in natural language processing, computer vision, and spatio-temporal modelling. There are many introductions to transformers, but most do not contain precise mathematical descriptions of the architecture and the intuitions be…

    Submitted 8 February, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

  44. arXiv:2303.14468  [pdf, other]

    stat.ML cs.LG

    Autoregressive Conditional Neural Processes

    Authors: Wessel P. Bruinsma, Stratis Markou, James Requeima, Andrew Y. K. Foong, Tom R. Andersson, Anna Vaughan, Anthony Buonomo, J. Scott Hosking, Richard E. Turner

    Abstract: Conditional neural processes (CNPs; Garnelo et al., 2018a) are attractive meta-learning models which produce well-calibrated predictions and are trainable via a simple maximum likelihood procedure. Although CNPs have many advantages, they are unable to model dependencies in their predictions. Various works propose solutions to this, but these come at the cost of either requiring approximate infere…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 57 pages; accepted to the 11th International Conference on Learning Representations (ICLR 2023)

  45. arXiv:2303.13199  [pdf, other]

    cs.CV

    First Session Adaptation: A Strong Replay-Free Baseline for Class-Incremental Learning

    Authors: Aristeidis Panos, Yuriko Kobe, Daniel Olmeda Reino, Rahaf Aljundi, Richard E. Turner

    Abstract: In Class-Incremental Learning (CIL) an image classification system is exposed to new classes in each learning session and must be updated incrementally. Methods approaching this problem have updated both the classification head and the feature extractor body at each session of CIL. In this work, we develop a baseline method, First Session Adaptation (FSA), that sheds light on the efficacy of exist…

    Submitted 12 January, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at ICCV 2023

  46. arXiv:2302.01190  [pdf, other]

    stat.ML cs.CR cs.LG

    On the Efficacy of Differentially Private Few-shot Image Classification

    Authors: Marlon Tobaben, Aliaksandra Shysheya, John Bronskill, Andrew Paverd, Shruti Tople, Santiago Zanella-Beguelin, Richard E Turner, Antti Honkela

    Abstract: There has been significant recent progress in training differentially private (DP) models which achieve accuracy that approaches the best non-private models. These DP models are typically pretrained on large public datasets and then fine-tuned on private downstream datasets that are relatively large and similar in distribution to the pretraining data. However, in many applications including person…

    Submitted 19 December, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: 49 pages, 24 figures; published in TMLR 12/2023 https://openreview.net/forum?id=hFsr59Imzm

    Journal ref: Transactions on Machine Learning Research, ISSN 2835-8856, 2023

  47. arXiv:2301.12089  [pdf, other]

    astro-ph.HE astro-ph.IM

    Scattering Delay Mitigation in High Accuracy Pulsar Timing: Cyclic Spectroscopy Techniques

    Authors: Jacob E. Turner, Daniel R. Stinebring, Maura A. McLaughlin, Anne M. Archibald, Timothy Dolch, Ryan S. Lynch

    Abstract: We simulate scattering delays from the interstellar medium to examine the effectiveness of three estimators in recovering these delays in pulsar timing data. Two of these estimators use the more traditional process of fitting autocorrelation functions to pulsar dynamic spectra to extract scintillation bandwidths, while the third estimator uses the newer technique of cyclic spectroscopy on baseband…

    Submitted 27 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted to ApJ

  48. arXiv:2301.05306  [pdf, other]

    astro-ph.HE

    A Simultaneous Dual-Frequency Scintillation Arc Survey of Six Bright Canonical Pulsars Using the Upgraded Giant Metrewave Radio Telescope

    Authors: Jacob E. Turner, Bhal Chandra Joshi, Maura A. McLaughlin, Daniel R. Stinebring

    Abstract: We use the upgraded Giant Metrewave Radio Telescope to measure scintillation arc properties in six bright canonical pulsars with simultaneous dual frequency coverage. These observations at frequencies from 300 to 750 MHz allowed for detailed analysis of arc evolution across frequency and epoch. We perform more robust determinations of frequency dependence for arc curvature, scintillation bandwidth…

    Submitted 20 October, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

  49. arXiv:2301.03608  [pdf, other]

    astro-ph.GA astro-ph.HE gr-qc

    The NANOGrav 12.5-year Data Set: Bayesian Limits on Gravitational Waves from Individual Supermassive Black Hole Binaries

    Authors: Zaven Arzoumanian, Paul T. Baker, Laura Blecha, Harsha Blumer, Adam Brazier, Paul R. Brook, Sarah Burke-Spolaor, Bence Bécsy, J. Andrew Casey-Clyde, Maria Charisi, Shami Chatterjee, Siyuan Chen, James M. Cordes, Neil J. Cornish, Fronefield Crawford, H. Thankful Cromartie, Megan E. DeCesar, Paul B. Demorest, Timothy Dolch, Brendan Drachler, Justin A. Ellis, E. C. Ferrara, William Fiore, Emmanuel Fonseca, Gabriel E. Freedman, et al. (53 additional authors not shown)

    Abstract: Pulsar timing array collaborations, such as the North American Nanohertz Observatory for Gravitational Waves (NANOGrav), are seeking to detect nanohertz gravitational waves emitted by supermassive black hole binaries formed in the aftermath of galaxy mergers. We have searched for continuous waves from individual circular supermassive black hole binaries using NANOGrav's recent 12.5-year data s…

    Submitted 6 June, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: 22 pages, 12 figures. Accepted by ApJL

  50. arXiv:2211.12990  [pdf, other]

    cs.LG cs.CR

    Adversarial Attacks are a Surprisingly Strong Baseline for Poisoning Few-Shot Meta-Learners

    Authors: Elre T. Oldewage, John Bronskill, Richard E. Turner

    Abstract: This paper examines the robustness of deployed few-shot meta-learning systems when they are fed an imperceptibly perturbed few-shot dataset. We attack amortized meta-learners, which allows us to craft colluding sets of inputs that are tailored to fool the system's learning algorithm when used as training data. Jointly crafted adversarial inputs might be expected to synergistically manipulate a cla…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted at the I Can't Believe It's Not Better Workshop, NeurIPS 2022