Search | arXiv e-print repository

Reconciling Kaplan and Chinchilla Scaling Laws

Abstract: Kaplan et al. [2020] (`Kaplan') and Hoffmann et al. [2022] (`Chinchilla') studied the scaling behavior of transformers trained on next-token language prediction. These studies produced different estimates for how the number of parameters ($N$) and training tokens ($D$) should be set to achieve the lowest possible loss for a given compute budget ($C$). Kaplan: $N_\text{optimal} \propto C^{0.73}$, C… ▽ More Kaplan et al. [2020] (`Kaplan') and Hoffmann et al. [2022] (`Chinchilla') studied the scaling behavior of transformers trained on next-token language prediction. These studies produced different estimates for how the number of parameters ($N$) and training tokens ($D$) should be set to achieve the lowest possible loss for a given compute budget ($C$). Kaplan: $N_\text{optimal} \propto C^{0.73}$, Chinchilla: $N_\text{optimal} \propto C^{0.50}$. This note finds that much of this discrepancy can be attributed to Kaplan counting non-embedding rather than total parameters, combined with their analysis being performed at small scale. Simulating the Chinchilla study under these conditions produces biased scaling coefficients close to Kaplan's. Hence, this note reaffirms Chinchilla's scaling coefficients, by explaining the cause of Kaplan's original overestimation. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.09528 [pdf, other]

JWST/NIRCam 4-5 $μ$m Imaging of the Giant Planet AF Lep b

Authors: Kyle Franson, William O. Balmer, Brendan P. Bowler, Laurent Pueyo, Yifan Zhou, Emily Rickman, Zhoujian Zhang, Sagnick Mukherjee, Tim D. Pearce, Daniella C. Bardalez Gagliuffi, Lauren I. Biddle, Timothy D. Brandt, Rachel Bowens-Rubin, Justin R. Crepp, James W. Davidson, Jr., Jacqueline Faherty, Christian Ginski, Elliott P. Horch, Marvin Morgan, Caroline V. Morley, Marshall D. Perrin, Aniket Sanghi, Maissa Salama, Christopher A. Theissen, Quang H. Tran , et al. (1 additional authors not shown)

Abstract: With a dynamical mass of $3 \, M_\mathrm{Jup}$, the recently discovered giant planet AF Lep b is the lowest-mass imaged planet with a direct mass measurement. Its youth and spectral type near the L/T transition make it a promising target to study the impact of clouds and atmospheric chemistry at low surface gravities. In this work, we present JWST/NIRCam imaging of AF Lep b. Across two epochs, we… ▽ More With a dynamical mass of $3 \, M_\mathrm{Jup}$, the recently discovered giant planet AF Lep b is the lowest-mass imaged planet with a direct mass measurement. Its youth and spectral type near the L/T transition make it a promising target to study the impact of clouds and atmospheric chemistry at low surface gravities. In this work, we present JWST/NIRCam imaging of AF Lep b. Across two epochs, we detect AF Lep b in F444W ($4.4 \, \mathrm{μm}$) with S/N ratios of 9.6 and 8.7, respectively. At the planet's separation of $320 \, \mathrm{mas}$ during the observations, the coronagraphic throughput is ${\approx}7\%$, demonstrating that NIRCam's excellent sensitivity persists down to small separations. The F444W photometry of AF Lep b affirms the presence of disequilibrium carbon chemistry and enhanced atmospheric metallicity. These observations also place deep limits on wider-separation planets in the system, ruling out $1.1 \, M_\mathrm{Jup}$ planets beyond $15.6 \, \mathrm{au}$ (0.58 arcsec), $1.1 \, M_\mathrm{Sat}$ planets beyond $27 \, \mathrm{au}$ (1 arcsec), and $2.8 \, M_\mathrm{Nep}$ planets beyond $67 \, \mathrm{au}$ (2.5 arcsec). We also present new Keck/NIRC2 $L'$ imaging of AF Lep b; combining this with the two epochs of F444W photometry and previous Keck $L'$ photometry provides limits on the long-term 3-$5 \, \mathrm{μm}$ variability of AF Lep b on months-to-years timescales. AF Lep b is the closest-separation planet imaged with JWST to date, demonstrating that planets can be recovered well inside the nominal (50% throughput) NIRCam coronagraph inner working angle. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: 17 pages, 4 figures, submitted to ApJL

arXiv:2406.04105 [pdf, other]

From Tissue Plane to Organ World: A Benchmark Dataset for Multimodal Biomedical Image Registration using Deep Co-Attention Networks

Authors: Yifeng Wang, Weipeng Li, Thomas Pearce, Haohan Wang

Abstract: Correlating neuropathology with neuroimaging findings provides a multiscale view of pathologic changes in the human organ spanning the meso- to micro-scales, and is an emerging methodology expected to shed light on numerous disease states. To gain the most information from this multimodal, multiscale approach, it is desirable to identify precisely where a histologic tissue section was taken from w… ▽ More Correlating neuropathology with neuroimaging findings provides a multiscale view of pathologic changes in the human organ spanning the meso- to micro-scales, and is an emerging methodology expected to shed light on numerous disease states. To gain the most information from this multimodal, multiscale approach, it is desirable to identify precisely where a histologic tissue section was taken from within the organ in order to correlate with the tissue features in exactly the same organ region. Histology-to-organ registration poses an extra challenge, as any given histologic section can capture only a small portion of a human organ. Making use of the capabilities of state-of-the-art deep learning models, we unlock the potential to address and solve such intricate challenges. Therefore, we create the ATOM benchmark dataset, sourced from diverse institutions, with the primary objective of transforming this challenge into a machine learning problem and delivering outstanding outcomes that enlighten the biomedical community. The performance of our RegisMCAN model demonstrates the potential of deep learning to accurately predict where a subregion extracted from an organ image was obtained from within the overall 3D volume. The code and dataset can be found at: https://github.com/haizailache999/Image-Registration/tree/main △ Less

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.03947 [pdf, other]

Weight-based Decomposition: A Case for Bilinear MLPs

Authors: Michael T. Pearce, Thomas Dooms, Alice Rigg

Abstract: Gated Linear Units (GLUs) have become a common building block in modern foundation models. Bilinear layers drop the non-linearity in the "gate" but still have comparable performance to other GLUs. An attractive quality of bilinear layers is that they can be fully expressed in terms of a third-order tensor and linear operations. Leveraging this, we develop a method to decompose the bilinear tensor… ▽ More Gated Linear Units (GLUs) have become a common building block in modern foundation models. Bilinear layers drop the non-linearity in the "gate" but still have comparable performance to other GLUs. An attractive quality of bilinear layers is that they can be fully expressed in terms of a third-order tensor and linear operations. Leveraging this, we develop a method to decompose the bilinear tensor into a set of sparsely interacting eigenvectors that show promising interpretability properties in preliminary experiments for shallow image classifiers (MNIST) and small language models (Tiny Stories). Since the decomposition is fully equivalent to the model's original computations, bilinear layers may be an interpretability-friendly architecture that helps connect features to the model weights. Application of our method may not be limited to pretrained bilinear models since we find that language models such as TinyLlama-1.1B can be finetuned into bilinear variants. △ Less

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2405.12399 [pdf, other]

Diffusion for World Modeling: Visual Details Matter in Atari

Authors: Eloi Alonso, Adam Jelley, Vincent Micheli, Anssi Kanervisto, Amos Storkey, Tim Pearce, François Fleuret

Abstract: World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffus… ▽ More World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffusion models have become a dominant approach for image generation, challenging well-established methods modeling discrete latents. Motivated by this paradigm shift, we introduce DIAMOND (DIffusion As a Model Of eNvironment Dreams), a reinforcement learning agent trained in a diffusion world model. We analyze the key design choices that are required to make diffusion suitable for world modeling, and demonstrate how improved visual details can lead to improved agent performance. DIAMOND achieves a mean human normalized score of 1.46 on the competitive Atari 100k benchmark; a new best for agents trained entirely within a world model. To foster future research on diffusion for world modeling, we release our code, agents and playable world models at https://github.com/eloialonso/diamond. △ Less

Submitted 20 May, 2024; originally announced May 2024.

Comments: 25 pages, 11 figures, 10 tables

arXiv:2403.11804 [pdf, other]

Debris disks around main-sequence stars

Authors: Tim D. Pearce

Abstract: 'Debris disks' are collections of small bodies around stars, such as the Asteroid Belt and Kuiper Belt in our Solar System. These disks are composed of objects smaller than planets, including asteroids, comets, dust, and dwarf planets. We detect debris disks around a significant fraction of stars, and these disks appear to be common components of planetary systems. Extrasolar debris disks have a b… ▽ More 'Debris disks' are collections of small bodies around stars, such as the Asteroid Belt and Kuiper Belt in our Solar System. These disks are composed of objects smaller than planets, including asteroids, comets, dust, and dwarf planets. We detect debris disks around a significant fraction of stars, and these disks appear to be common components of planetary systems. Extrasolar debris disks have a broad range of locations, shapes and features. This chapter provides an introduction to debris disks around main-sequence stars. It summarises our understanding of the field, and covers a wide range of concepts from observations and theory. It describes how we detect extrasolar debris disks, what we see, and what these observations tell us. It also describes how debris disks evolve, and how they interact with planets. The chapter concludes by discussing several unsolved questions in debris-disk science. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: Introductory review, aimed as a first-entry point for undergraduates and early postgraduates. Provides a concise overview of debris-disk observations and theory. Preprint of a chapter for the 'Encyclopedia of Astrophysics' (Editor-in-Chief Ilya Mandel, Section Editor Dimitri Veras) to be published by Elsevier as a Reference Module. The number of references was capped

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2402.16349 [pdf, other]

C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

Authors: Tianjiao Luo, Tim Pearce, Huayu Chen, Jianfei Chen, Jun Zhu

Abstract: Generative Adversarial Imitation Learning (GAIL) trains a generative policy to mimic a demonstrator. It uses on-policy Reinforcement Learning (RL) to optimize a reward signal derived from a GAN-like discriminator. A major drawback of GAIL is its training instability - it inherits the complex training dynamics of GANs, and the distribution shift introduced by RL. This can cause oscillations during… ▽ More Generative Adversarial Imitation Learning (GAIL) trains a generative policy to mimic a demonstrator. It uses on-policy Reinforcement Learning (RL) to optimize a reward signal derived from a GAN-like discriminator. A major drawback of GAIL is its training instability - it inherits the complex training dynamics of GANs, and the distribution shift introduced by RL. This can cause oscillations during training, harming its sample efficiency and final policy performance. Recent work has shown that control theory can help with the convergence of a GAN's training. This paper extends this line of work, conducting a control-theoretic analysis of GAIL and deriving a novel controller that not only pushes GAIL to the desired equilibrium but also achieves asymptotic stability in a 'one-step' setting. Based on this, we propose a practical algorithm 'Controlled-GAIL' (C-GAIL). On MuJoCo tasks, our controlled variant is able to speed up the rate of convergence, reduce the range of oscillation and match the expert's distribution more closely both for vanilla GAIL and GAIL-DAC. △ Less

Submitted 26 February, 2024; originally announced February 2024.

arXiv:2311.10461 [pdf, other]

Increasing planet-stirring efficiency of debris disks by "projectile stirring" and "resonant stirring"

Authors: Tyson Costa, Tim D. Pearce, Alexander V. Krivov

Abstract: Extrasolar debris disks are detected by observing dust, which is thought to be released during planetesimal collisions. This implies that planetesimals are dynamically excited ("stirred"), such that collisions are sufficiently common and violent. The most frequently considered stirring mechanisms are self-stirring by disk self-gravity, and planet-stirring via secular interactions. However, these m… ▽ More Extrasolar debris disks are detected by observing dust, which is thought to be released during planetesimal collisions. This implies that planetesimals are dynamically excited ("stirred"), such that collisions are sufficiently common and violent. The most frequently considered stirring mechanisms are self-stirring by disk self-gravity, and planet-stirring via secular interactions. However, these models face problems when considering disk mass, self-gravity, and planet eccentricity, leading to the possibility that other, unexplored mechanisms instead stir debris. We hypothesize that planet-stirring could be more efficient than the traditional secular model implies, due to two additional mechanisms. First, a planet at the inner edge of a debris disk can scatter massive bodies onto eccentric, disk-crossing orbits, which then excite debris ("projectile stirring"). Second, a planet can stir debris over a wide region via broad mean-motion resonances, both at and between nominal resonance locations ("resonant stirring"). Both mechanisms can be effective even for low-eccentricity planets, unlike secular-planet-stirring. We run N-body simulations across a broad parameter space, to determine the viability of these new stirring mechanisms. We quantify stirring levels using a bespoke program for assessing Rebound debris simulations, which we make publicly available. We find that even low-mass projectiles can stir disks, and verify this with a simple analytic criterion. We also show that resonant stirring is effective for planets above ~0.5 MJup. By proving that these mechanisms can increase planet-stirring efficiency, we demonstrate that planets could still be stirring debris disks even in cases where conventional (secular) planet-stirring is insufficient. △ Less

Submitted 17 November, 2023; originally announced November 2023.

Comments: 21 pages, 16 figures, accepted for publication in MNRAS

arXiv:2311.04265 [pdf, other]

The effect of sculpting planets on the steepness of debris-disc inner edges

Authors: Tim D. Pearce, Alexander V. Krivov, Antranik A. Sefilian, Marija R. Jankovic, Torsten Löhne, Tobias Morgner, Mark C. Wyatt, Mark Booth, Sebastian Marino

Abstract: Debris discs are our best means to probe the outer regions of planetary systems. Many studies assume that planets lie at the inner edges of debris discs, akin to Neptune and the Kuiper Belt, and use the disc morphologies to constrain those otherwise-undetectable planets. However, this produces a degeneracy in planet mass and semimajor axis. We investigate the effect of a sculpting planet on the ra… ▽ More Debris discs are our best means to probe the outer regions of planetary systems. Many studies assume that planets lie at the inner edges of debris discs, akin to Neptune and the Kuiper Belt, and use the disc morphologies to constrain those otherwise-undetectable planets. However, this produces a degeneracy in planet mass and semimajor axis. We investigate the effect of a sculpting planet on the radial surface-density profile at the disc inner edge, and show that this degeneracy can be broken by considering the steepness of the edge profile. Like previous studies, we show that a planet on a circular orbit ejects unstable debris and excites surviving material through mean-motion resonances. For a non-migrating, circular-orbit planet, in the case where collisions are negligible, the steepness of the disc inner edge depends on the planet-to-star mass ratio and the initial-disc excitation level. We provide a simple analytic model to infer planet properties from the steepness of ALMA-resolved disc edges. We also perform a collisional analysis, showing that a purely planet-sculpted disc would be distinguishable from a purely collisional disc and that, whilst collisions flatten planet-sculpted edges, they are unlikely to fully erase a planet's signature. Finally, we apply our results to ALMA-resolved debris discs and show that, whilst many inner edges are too steep to be explained by collisions alone, they are too flat to arise through completed sculpting by non-migrating, circular-orbit planets. We discuss implications of this for the architectures, histories and dynamics in the outer regions of planetary systems. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: 24 pages, 18 figures, accepted for publication in MNRAS

arXiv:2310.17485 [pdf, other]

doi 10.1016/j.trc.2023.104376

Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

Authors: Stephen Mak, Liming Xu, Tim Pearce, Michael Ostroumov, Alexandra Brintrup

Abstract: Collaborative vehicle routing occurs when carriers collaborate through sharing their transportation requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solutio… ▽ More Collaborative vehicle routing occurs when carriers collaborate through sharing their transportation requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solution concepts are expensive to calculate as the characteristic function scales exponentially with the number of agents. This would require solving the vehicle routing problem (NP-hard) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game solved using deep multi-agent reinforcement learning, where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function; thus, when deployed in production, we only need to evaluate the expensive post-collaboration vehicle routing problem once. Our contribution is that we are the first to consider both the route allocation problem and gain sharing problem simultaneously - without access to the expensive characteristic function. Through decentralised machine learning, our agents bargain with each other and agree to outcomes that correlate well with the Shapley value - a fair profit allocation mechanism. Importantly, we are able to achieve a reduction in run-time of 88%. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: Final, published version can be found here: https://www.sciencedirect.com/science/article/pii/S0968090X23003662

Journal ref: Volume 157, December 2023, 104376

arXiv:2310.17458 [pdf, other]

Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing

Authors: Stephen Mak, Liming Xu, Tim Pearce, Michael Ostroumov, Alexandra Brintrup

Abstract: Collaborative Vehicle Routing is where delivery companies cooperate by sharing their delivery information and performing delivery requests on behalf of each other. This achieves economies of scale and thus reduces cost, greenhouse gas emissions, and road congestion. But which company should partner with whom, and how much should each company be compensated? Traditional game theoretic solution conc… ▽ More Collaborative Vehicle Routing is where delivery companies cooperate by sharing their delivery information and performing delivery requests on behalf of each other. This achieves economies of scale and thus reduces cost, greenhouse gas emissions, and road congestion. But which company should partner with whom, and how much should each company be compensated? Traditional game theoretic solution concepts, such as the Shapley value or nucleolus, are difficult to calculate for the real-world problem of Collaborative Vehicle Routing due to the characteristic function scaling exponentially with the number of agents. This would require solving the Vehicle Routing Problem (an NP-Hard problem) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function, and thus eliminate the need to evaluate the VRP an exponential number of times - we only need to evaluate it once. Our contribution is that our decentralised approach is both scalable and considers the self-interested nature of companies. The agents learn using a modified Independent Proximal Policy Optimisation. Our RL agents outperform a strong heuristic bot. The agents correctly identify the optimal coalitions 79% of the time with an average optimality gap of 4.2% and reduction in run-time of 62%. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: Accepted to NeurIPS 2021 Workshop on Cooperative AI

arXiv:2309.06994 [pdf, other]

doi 10.1093/mnras/stad2827

Self-gravity of debris discs can strongly change the outcomes of interactions with inclined planets

Authors: Pedro P. Poblete, Torsten Löhne, Tim D. Pearce, Antranik A. Sefilian

Abstract: Drastic changes in protoplanets' orbits could occur in the early stages of planetary systems through interactions with other planets and their surrounding protoplanetary or debris discs. The resulting planetary system could exhibit orbits with moderate to high eccentricities and/or inclinations, causing planets to perturb one another as well as the disc significantly. The present work studies the… ▽ More Drastic changes in protoplanets' orbits could occur in the early stages of planetary systems through interactions with other planets and their surrounding protoplanetary or debris discs. The resulting planetary system could exhibit orbits with moderate to high eccentricities and/or inclinations, causing planets to perturb one another as well as the disc significantly. The present work studies the evolution of systems composed of an initially inclined planet and a debris disc. We perform N-body simulations of a narrow, self-gravitating debris disc and a single interior Neptune-like planet. We simulate systems with various initial planetary inclinations, from coplanar to polar configurations considering different separations between the planet and the disc. We find that except when the planet is initially on a polar orbit, the planet-disc system tends to reach a quasi-coplanar configuration with low vertical dispersion in the disc. When present, the Zeipel--Kozai--Lidov oscillations induced by the disc pump the planet's eccentricity and, in turn, affect the disc structure. We also find that the resulting disc morphology in most of the simulations looks very similar in both radial and vertical directions once the simulations are converged. This contrasts strongly with massless disc simulations, where vertical disc dispersion is set by the initial disc-planet inclination and can be high for initially highly inclined planets. The results suggest caution in interpreting an unseen planet's dynamical history based only on the disc's appearance. △ Less

Submitted 13 September, 2023; originally announced September 2023.

Comments: 15 pages, 6 figures. Accepted for publication in MNRAS

arXiv:2308.10391 [pdf, other]

doi 10.1051/0004-6361/202346109

How much large dust could be present in hot exozodiacal dust systems?

Authors: T. A. Stuber, F. Kirchschlager, T. D. Pearce, S. Ertel, A. V. Krivov, S. Wolf

Abstract: An infrared excess over the stellar photospheric emission of main-sequence stars has been found in interferometric surveys, commonly attributed to the presence of hot exozodiacal dust (HEZD). While submicrometer-sized grains in close vicinity to their host star have been inferred to be responsible for the found near-infrared excesses, the presence and amount of larger grains as part of the dust di… ▽ More An infrared excess over the stellar photospheric emission of main-sequence stars has been found in interferometric surveys, commonly attributed to the presence of hot exozodiacal dust (HEZD). While submicrometer-sized grains in close vicinity to their host star have been inferred to be responsible for the found near-infrared excesses, the presence and amount of larger grains as part of the dust distributions are weakly constrained. We quantify how many larger grains (above-micrometer-sized) could be present in addition to submicrometer-sized grains, while being consistent with observational constraints. This is important in order to distinguish between various scenarios for the origin of HEZD and to better estimate its observational appearance when observed with future instruments. We extended a model suitable to reproduce current observations of HEZD to investigate a bimodal size distribution. By deriving the characteristics of dust distributions whose observables are consistent with observational limits from interferometric measurements in the $K$ and $N$ bands we constrained the radii of sub- and above-micrometer-sized grains as well as their mass, number, and flux density ratios. In the most extreme cases of some of the investigated systems, large grains $\gtrsim 10\,μ$m might dominate the mass budget of HEZD while contributing up to 25$\,$% of the total flux density originating from the dust at a wavelength of 2.13$\,μ$m and up to 50$\,$% at a wavelength of 4.1$\,μ$m; at a wavelength of 11.1$\,μ$m their emission might clearly dominate over the emission of small grains. While it is not possible to detect such hot-dust distributions using ALMA, the ngVLA might allow us to detect HEZD at millimeter wavelengths. Large dust grains might have a more important impact on the observational appearance of HEZD than previously assumed, especially at longer wavelengths. △ Less

Submitted 20 August, 2023; originally announced August 2023.

Comments: Accepted for publication in Astronomy & Astrophysics. 18 pages, 7 figures

Journal ref: A&A 678, A121 (2023)

arXiv:2308.03822 [pdf, other]

Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi , et al. (1750 additional authors not shown)

Abstract: Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect… ▽ More Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 24 pages, 5 figures

Report number: LIGO-P2300080

arXiv:2303.13584 [pdf, other]

doi 10.1093/mnras/stad938

The clumpy structure of $ε$ Eridani's debris disc revisited by ALMA

Authors: Mark Booth, Tim D. Pearce, Alexander V. Krivov, Mark C. Wyatt, William R. F. Dent, Antonio S. Hales, Jean-François Lestrade, Fernando Cruz-Sáenz de Miera, Virginie C. Faramaz, Torsten Löhne, Miguel Chavez-Dagostino

Abstract: $ε… ▽ More $ε$ Eridani is the closest star to our Sun known to host a debris disc. Prior observations in the (sub-)millimetre regime have potentially detected clumpy structure in the disc and attributed this to interactions with an (as yet) undetected planet. However, the prior observations were unable to distinguish between structure in the disc and background confusion. Here we present the first ALMA image of the entire disc, which has a resolution of 1.6"$\times$1.2". We clearly detect the star, the main belt and two point sources. The resolution and sensitivity of this data allow us to clearly distinguish background galaxies (that show up as point sources) from the disc emission. We show that the two point sources are consistent with background galaxies. After taking account of these, we find that resolved residuals are still present in the main belt, including two clumps with a $>3σ$ significance -- one to the east of the star and the other to the northwest. We perform $n$-body simulations to demonstrate that a migrating planet can form structures similar to those observed by trap** planetesimals in resonances. We find that the observed features can be reproduced by a migrating planet trap** planetesimals in the 2:1 mean motion resonance and the symmetry of the most prominent clumps means that the planet should have a position angle of either ${\sim10^\circ}$ or ${\sim190^\circ}$. Observations over multiple epochs are necessary to test whether the observed features rotate around the star. △ Less

Submitted 23 March, 2023; originally announced March 2023.

Comments: 16 pages, 10 figures, accepted for publication in MNRAS

arXiv:2302.07515 [pdf, other]

TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play

Authors: Fanqi Lin, Shiyu Huang, Tim Pearce, Wenze Chen, Wei-Wei Tu

Abstract: Multi-agent football poses an unsolved challenge in AI research. Existing work has focused on tackling simplified scenarios of the game, or else leveraging expert demonstrations. In this paper, we develop a multi-agent system to play the full 11 vs. 11 game mode, without demonstrations. This game mode contains aspects that present major challenges to modern reinforcement learning algorithms; multi… ▽ More Multi-agent football poses an unsolved challenge in AI research. Existing work has focused on tackling simplified scenarios of the game, or else leveraging expert demonstrations. In this paper, we develop a multi-agent system to play the full 11 vs. 11 game mode, without demonstrations. This game mode contains aspects that present major challenges to modern reinforcement learning algorithms; multi-agent coordination, long-term planning, and non-transitivity. To address these challenges, we present TiZero; a self-evolving, multi-agent system that learns from scratch. TiZero introduces several innovations, including adaptive curriculum learning, a novel self-play strategy, and an objective that optimizes the policies of multiple agents jointly. Experimentally, it outperforms previous systems by a large margin on the Google Research Football environment, increasing win rates by over 30%. To demonstrate the generality of TiZero's innovations, they are assessed on several environments beyond football; Overcooked, Multi-agent Particle-Environment, Tic-Tac-Toe and Connect-Four. △ Less

Submitted 20 February, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

Comments: The 22nd International Conference on Autonomous Agents and Multiagent Systems(AAMAS2023)

arXiv:2302.05420 [pdf, other]

doi 10.3847/2041-8213/acd6f6

Astrometric Accelerations as Dynamical Beacons: A Giant Planet Imaged Inside the Debris Disk of the Young Star AF Lep

Authors: Kyle Franson, Brendan P. Bowler, Yifan Zhou, Tim D. Pearce, Daniella C. Bardalez Gagliuffi, Lauren Biddle, Timothy D. Brandt, Justin R. Crepp, Trent J. Dupuy, Jacqueline Faherty, Rebecca Jensen-Clem, Marvin Morgan, Aniket Sanghi, Christopher A. Theissen, Quang H. Tran, Trevor A. Wolf

Abstract: We present the direct imaging discovery of a giant planet orbiting the young star AF Lep, a 1.2 $M_{\odot}$ member of the 24 $\pm$ 3 Myr $β$ Pic moving group. AF Lep was observed as part of our ongoing high-contrast imaging program targeting stars with astrometric accelerations between Hipparcos and Gaia that indicate the presence of substellar companions. Keck/NIRC2 observations in $L'$ with the… ▽ More We present the direct imaging discovery of a giant planet orbiting the young star AF Lep, a 1.2 $M_{\odot}$ member of the 24 $\pm$ 3 Myr $β$ Pic moving group. AF Lep was observed as part of our ongoing high-contrast imaging program targeting stars with astrometric accelerations between Hipparcos and Gaia that indicate the presence of substellar companions. Keck/NIRC2 observations in $L'$ with the Vector Vortex Coronagraph reveal a point source, AF Lep b, at ${\approx}340$ mas which exhibits orbital motion at the 6-$σ$ level over the course of 13 months. A joint orbit fit yields precise constraints on the planet's dynamical mass of 3.2$^{+0.7}_{-0.6}$ $M_\mathrm{Jup}$, semi-major axis of $8.4^{+1.1}_{-1.3}$ au, and eccentricity of $0.24^{+0.27}_{-0.15}$. AF Lep hosts a debris disk located at $\sim$50 au, but it is unlikely to be sculpted by AF Lep b, implying there may be additional planets in the system at wider separations. The stellar inclination ($i_* = 54^{+11}_{-9} {}^\circ$) and orbital inclination ($i_o = 50^{+9}_{-12} {}^\circ$) are in good agreement, which is consistent with the system having spin-orbit alignment. AF Lep b is the lowest-mass imaged planet with a dynamical mass measurement and highlights the promise of using astrometric accelerations as a tool to find and characterize long-period planets. △ Less

Submitted 25 May, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

Comments: 14 pages, 3 figures, accepted to ApJL

arXiv:2302.03676 [pdf, other]

doi 10.3847/1538-4365/acdc9f

Open data from the third observing run of LIGO, Virgo, KAGRA and GEO

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, A. Allocca , et al. (1719 additional authors not shown)

Abstract: The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti… ▽ More The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages. △ Less

Submitted 7 February, 2023; originally announced February 2023.

Comments: 27 pages, 3 figures

Report number: LIGO-P2200316

arXiv:2301.10677 [pdf, other]

Imitating Human Behaviour with Diffusion Models

Authors: Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

Abstract: Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their ex… ▽ More Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their expressiveness and may introduce bias into the cloned policy. We begin by pointing out the limitations of these choices. We then propose that diffusion models are an excellent fit for imitating human behaviour, since they learn an expressive distribution over the joint action space. We introduce several innovations to make diffusion models suitable for sequential environments; designing suitable architectures, investigating the role of guidance, and develo** reliable sampling strategies. Experimentally, diffusion models closely match human demonstrations in a simulated robotic control task and a modern 3D gaming environment. △ Less

Submitted 3 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

Comments: Published in ICLR 2023

Journal ref: ICLR 2023

arXiv:2211.15434 [pdf, other]

doi 10.1051/0004-6361/202244891

ISPY-NACO Imaging Survey for Planets around Young stars. The demographics of forming planets embedded in protoplanetary disks

Authors: Gabriele Cugno, Timothy D. Pearce, Ralf Launhardt, Markus. J. Bonse, Jie. Ma, Thomas Henning, Andreas Quirrenbach, Damien Ségransan, Elisabeth C. Matthews, Sascha P. Quanz, Grant M. Kennedy, André Müller, Sabine Reffert, Emily L. Rickman

Abstract: We present the statistical analysis of a subsample of 45 young stars surrounded by protoplanetary disks (PPDs). This is the largest imaging survey uniquely focused on PPDs to date. Our goal is to search for young forming companions embedded in the disk material and to constrain their occurrence rate in relation to the formation mechanism. We used principal component analysis based point spread fun… ▽ More We present the statistical analysis of a subsample of 45 young stars surrounded by protoplanetary disks (PPDs). This is the largest imaging survey uniquely focused on PPDs to date. Our goal is to search for young forming companions embedded in the disk material and to constrain their occurrence rate in relation to the formation mechanism. We used principal component analysis based point spread function subtraction techniques to reveal young companions forming in the disks. We calculated detection limits for our datasets and adopted a black-body model to derive temperature upper limits of potential forming planets. We then used Monte Carlo simulations to constrain the population of forming gas giant companions and compare our results to different types of formation scenarios. Our data revealed a new binary system (HD38120) and a recently identified triple system with a brown dwarf companion orbiting a binary system (HD101412), in addition to 12 known companions. Furthermore, we detected signals from 17 disks, two of which (HD72106 and TCrA) were imaged for the first time. We reached median detection limits of L =15.4 mag at 2.0 arcsec, which were used to investigate the temperature of potentially embedded forming companions. We can constrain the occurrence of forming planets with semi-major axis a in [20 - 500] au and Teff in [600 - 3000] K, in line with the statistical results obtained for more evolved systems from other direct imaging surveys. The NaCo-ISPY data confirm that massive bright planets accreting at high rates are rare. More powerful instruments with better sensitivity in the near- to mid-infrared are likely required to unveil the wealth of forming planets sculpting the observed disk substructures. △ Less

Submitted 28 November, 2022; originally announced November 2022.

Comments: 25 pages, 16 figures, 3 tables, accepted for publication in A&A

Journal ref: A&A 669, A145 (2023)

arXiv:2209.11219 [pdf, other]

doi 10.1093/mnras/stac2773

Hot exozodis: cometary supply without trap** is unlikely to be the mechanism

Authors: Tim D. Pearce, Florian Kirchschlager, Gaël Rouillé, Steve Ertel, Alexander Bensberg, Alexander V. Krivov, Mark Booth, Sebastian Wolf, Jean-Charles Augereau

Abstract: Excess near-infrared emission is detected around one fifth of main-sequence stars, but its nature is a mystery. These excesses are interpreted as thermal emission from populations of small, hot dust very close to their stars (`hot exozodis'), but such grains should rapidly sublimate or be blown out of the system. To date, no model has fully explained this phenomenon. One mechanism commonly suggest… ▽ More Excess near-infrared emission is detected around one fifth of main-sequence stars, but its nature is a mystery. These excesses are interpreted as thermal emission from populations of small, hot dust very close to their stars (`hot exozodis'), but such grains should rapidly sublimate or be blown out of the system. To date, no model has fully explained this phenomenon. One mechanism commonly suggested in the literature is cometary supply, where star-grazing comets deposit dust close to the star, replenishing losses from grain sublimation and blowout. However, we show that this mechanism alone is very unlikely to be responsible for hot exozodis. We model the trajectory and size evolution of dust grains released by star-grazing comets, to establish the dust and comet properties required to reproduce hot-exozodi observations. We find that cometary supply alone can only reproduce observations if dust ejecta has an extremely steep size distribution upon release, and the dust-deposition rate is extraordinarily high. These requirements strongly contradict our current understanding of cometary dust and planetary systems. Cometary supply is therefore unlikely to be solely responsible for hot exozodis, so may need to be combined with some dust-trap** mechanism (such as gas or magnetic trap**) if it is to reproduce observations. △ Less

Submitted 22 September, 2022; originally announced September 2022.

Comments: 18 pages, 9 figures, accepted for publication in MNRAS

arXiv:2207.05631 [pdf, other]

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Authors: Wentse Chen, Shiyu Huang, Yuan Chiang, Tim Pearce, Wei-Wei Tu, Ting Chen, Jun Zhu

Abstract: Most reinforcement learning algorithms seek a single optimal strategy that solves a given task. However, it can often be valuable to learn a diverse set of solutions, for instance, to make an agent's interaction with users more engaging, or improve the robustness of a policy to an unexpected perturbance. We propose Diversity-Guided Policy Optimization (DGPO), an on-policy algorithm that discovers… ▽ More Most reinforcement learning algorithms seek a single optimal strategy that solves a given task. However, it can often be valuable to learn a diverse set of solutions, for instance, to make an agent's interaction with users more engaging, or improve the robustness of a policy to an unexpected perturbance. We propose Diversity-Guided Policy Optimization (DGPO), an on-policy algorithm that discovers multiple strategies for solving a given task. Unlike prior work, it achieves this with a shared policy network trained over a single run. Specifically, we design an intrinsic reward based on an information-theoretic diversity objective. Our final objective alternately constraints on the diversity of the strategies and on the extrinsic reward. We solve the constrained optimization problem by casting it as a probabilistic inference task and use policy iteration to maximize the derived lower bound. Experimental results show that our method efficiently discovers diverse strategies in a wide variety of reinforcement learning tasks. Compared to baseline methods, DGPO achieves comparable rewards, while discovering more diverse strategies, and often with better sample efficiency. △ Less

Submitted 5 January, 2024; v1 submitted 12 July, 2022; originally announced July 2022.

arXiv:2205.13496 [pdf, other]

Censored Quantile Regression Neural Networks for Distribution-Free Survival Analysis

Authors: Tim Pearce, Jong-Hyeon Jeong, Yichen Jia, Jun Zhu

Abstract: This paper considers doing quantile regression on censored data using neural networks (NNs). This adds to the survival analysis toolkit by allowing direct prediction of the target variable, along with a distribution-free characterisation of uncertainty, using a flexible function approximator. We begin by showing how an algorithm popular in linear models can be applied to NNs. However, the resultin… ▽ More This paper considers doing quantile regression on censored data using neural networks (NNs). This adds to the survival analysis toolkit by allowing direct prediction of the target variable, along with a distribution-free characterisation of uncertainty, using a flexible function approximator. We begin by showing how an algorithm popular in linear models can be applied to NNs. However, the resulting procedure is inefficient, requiring sequential optimisation of an individual NN at each desired quantile. Our major contribution is a novel algorithm that simultaneously optimises a grid of quantiles output by a single NN. To offer theoretical insight into our algorithm, we show firstly that it can be interpreted as a form of expectation-maximisation, and secondly that it exhibits a desirable `self-correcting' property. Experimentally, the algorithm produces quantiles that are better calibrated than existing methods on 10 out of 12 real datasets. △ Less

Submitted 6 February, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

Comments: Published in NeurIPS 2022

Journal ref: NeurIPS 2022

arXiv:2203.03611 [pdf, other]

doi 10.1093/mnras/stac664

Gap carving by a migrating planet embedded in a massive debris disc

Authors: Marc F. Friebe, Tim D. Pearce, Torsten Löhne

Abstract: When considering gaps in debris discs, a typical approach is to invoke clearing by an unseen planet within the gap, and derive the planet mass using Wisdom overlap or Hill radius arguments. However, this approach can be invalid if the disc is massive, because this clearing would also cause planet migration. This could result in a calculated planet mass that is incompatible with the inferred disc m… ▽ More When considering gaps in debris discs, a typical approach is to invoke clearing by an unseen planet within the gap, and derive the planet mass using Wisdom overlap or Hill radius arguments. However, this approach can be invalid if the disc is massive, because this clearing would also cause planet migration. This could result in a calculated planet mass that is incompatible with the inferred disc mass, because the predicted planet would in reality be too small to carve the gap without significant migration. We investigate the gap that a single embedded planet would carve in a massive debris disc. We show that a degeneracy is introduced, whereby an observed gap could be carved by two different planets: either a high-mass, barely-migrating planet, or a smaller planet that clears debris as it migrates. We find that, depending on disc mass, there is a minimum possible gap width that an embedded planet could carve (because smaller planets, rather than carving a smaller gap, would actually migrate through the disc and clear a wider region). We provide simple formulae for the planet-to-debris disc mass ratio at which planet migration becomes important, the gap width that an embedded planet would carve in a massive debris disc, and the interaction timescale. We also apply our results to various systems, and in particular show that the disc of HD 107146 can be reasonably well-reproduced with a migrating, embedded planet. Finally, we discuss the importance of planet-debris disc interactions as a tool for constraining debris disc masses. △ Less

Submitted 7 March, 2022; originally announced March 2022.

Comments: 15 pages, 10 figures, accepted for publication in MNRAS

arXiv:2201.08369 [pdf, other]

doi 10.1051/0004-6361/202142720

Planet populations inferred from debris discs: insights from 178 debris systems in the ISPY, LEECH and LIStEN planet-hunting surveys

Authors: Tim D. Pearce, Ralf Launhardt, Robert Ostermann, Grant M. Kennedy, Mario Gennaro, Mark Booth, Alexander V. Krivov, Gabriele Cugno, Thomas K. Henning, Andreas Quirrenbach, Arianna Musso Barcucci, Elisabeth C. Matthews, Henrik L. Ruh, Jordan M. Stone

Abstract: We know little about the outermost exoplanets in planetary systems, because our detection methods are insensitive to moderate-mass planets on wide orbits. However, debris discs can probe the outer-planet population, because dynamical modelling of observed discs can reveal properties of perturbing planets. We use four sculpting and stirring arguments to infer planet properties in 178 debris-disc sy… ▽ More We know little about the outermost exoplanets in planetary systems, because our detection methods are insensitive to moderate-mass planets on wide orbits. However, debris discs can probe the outer-planet population, because dynamical modelling of observed discs can reveal properties of perturbing planets. We use four sculpting and stirring arguments to infer planet properties in 178 debris-disc systems from the ISPY, LEECH and LIStEN planet-hunting surveys. Similar analyses are often conducted for individual discs, but we consider a large sample in a consistent manner. We aim to predict the population of wide-separation planets, gain insight into the formation and evolution histories of planetary systems, and determine the feasibility of detecting these planets in the near future. We show that a `typical' cold debris disc likely requires a Neptune- to Saturn-mass planet at 10-100 au, with some needing Jupiter-mass perturbers. Our predicted planets are currently undetectable, but modest detection-limit improvements (e.g. from JWST) should reveal many such perturbers. We find that planets thought to be perturbing debris discs at late times are similar to those inferred to be forming in protoplanetary discs, so these could be the same population if newly formed planets do not migrate as far as currently thought. Alternatively, young planets could rapidly sculpt debris before migrating inwards, meaning that the responsible planets are more massive (and located further inwards) than debris-disc studies assume. We combine self-stirring and size-distribution modelling to show that many debris discs cannot be self-stirred without having unreasonably high masses; planet- or companion-stirring may therefore be the dominant mechanism in many (perhaps all) debris discs. Finally, we provide catalogues of planet predictions, and identify promising targets for future planet searches. △ Less

Submitted 20 January, 2022; originally announced January 2022.

Comments: 41 pages, 16 figures, accepted for publication in A&A

arXiv:2109.03966 [pdf, other]

doi 10.4204/EPTCS.342.4

Sensitive Samples Revisited: Detecting Neural Network Attacks Using Constraint Solvers

Authors: Amel Nestor Docena, Thomas Wahl, Trevor Pearce, Yunsi Fei

Abstract: Neural Networks are used today in numerous security- and safety-relevant domains and are, as such, a popular target of attacks that subvert their classification capabilities, by manipulating the network parameters. Prior work has introduced sensitive samples -- inputs highly sensitive to parameter changes -- to detect such manipulations, and proposed a gradient ascent-based approach to compute the… ▽ More Neural Networks are used today in numerous security- and safety-relevant domains and are, as such, a popular target of attacks that subvert their classification capabilities, by manipulating the network parameters. Prior work has introduced sensitive samples -- inputs highly sensitive to parameter changes -- to detect such manipulations, and proposed a gradient ascent-based approach to compute them. In this paper we offer an alternative, using symbolic constraint solvers. We model the network and a formal specification of a sensitive sample in the language of the solver and ask for a solution. This approach supports a rich class of queries, corresponding, for instance, to the presence of certain types of attacks. Unlike earlier techniques, our approach does not depend on convex search domains, or on the suitability of a starting point for the search. We address the performance limitations of constraint solvers by partitioning the search space for the solver, and exploring the partitions according to a balanced schedule that still retains completeness of the search. We demonstrate the impact of the use of solvers in terms of functionality and search efficiency, using a case study for the detection of Trojan attacks on Neural Networks. △ Less

Submitted 6 September, 2021; originally announced September 2021.

Comments: In Proceedings SCSS 2021, arXiv:2109.02501

Journal ref: EPTCS 342, 2021, pp. 35-48

arXiv:2107.13304 [pdf, other]

Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection

Authors: Bang Xiang Yong, Tim Pearce, Alexandra Brintrup

Abstract: After an autoencoder (AE) has learnt to reconstruct one dataset, it might be expected that the likelihood on an out-of-distribution (OOD) input would be low. This has been studied as an approach to detect OOD inputs. Recent work showed this intuitive approach can fail for the dataset pairs FashionMNIST vs MNIST. This paper suggests this is due to the use of Bernoulli likelihood and analyses why th… ▽ More After an autoencoder (AE) has learnt to reconstruct one dataset, it might be expected that the likelihood on an out-of-distribution (OOD) input would be low. This has been studied as an approach to detect OOD inputs. Recent work showed this intuitive approach can fail for the dataset pairs FashionMNIST vs MNIST. This paper suggests this is due to the use of Bernoulli likelihood and analyses why this is the case, proposing two fixes: 1) Compute the uncertainty of likelihood estimate by using a Bayesian version of the AE. 2) Use alternative distributions to model the likelihood. △ Less

Submitted 28 July, 2021; originally announced July 2021.

Comments: Presented at the ICML 2020 Workshop on Uncertainty and Ro-bustness in Deep Learning

arXiv:2106.05975 [pdf, other]

doi 10.1093/mnras/stab1678

High resolution ALMA and HST images of q$^1$ Eri: an asymmetric debris disc with an eccentric Jupiter

Authors: J. B. Lovell, S. Marino, M. C. Wyatt, G. M. Kennedy, M. A. MacGregor, K. Stapelfeldt, B. Dent, J. Krist, L. Matrà, Q. Kral, O. Panić, T. D. Pearce, D. Wilner

Abstract: We present \textit{ALMA} 1.3 mm and 0.86 mm observations of the nearby (17.34 pc) F9V star q1 Eri (HD 10647, HR 506). This system, with age ${\sim}1.4$ Gyr, hosts a ${\sim}2$ au radial velocity planet and a debris disc with the highest fractional luminosity of the closest 300 FGK type stars. The \textit{ALMA} images, with resolution ${\sim}0.5''$, reveal a broad (34{-}134 au) belt of millimeter em… ▽ More We present \textit{ALMA} 1.3 mm and 0.86 mm observations of the nearby (17.34 pc) F9V star q1 Eri (HD 10647, HR 506). This system, with age ${\sim}1.4$ Gyr, hosts a ${\sim}2$ au radial velocity planet and a debris disc with the highest fractional luminosity of the closest 300 FGK type stars. The \textit{ALMA} images, with resolution ${\sim}0.5''$, reveal a broad (34{-}134 au) belt of millimeter emission inclined by $76.7{\pm}1.0$ degrees with maximum brightness at $81.6{\pm}0.5$ au. The images reveal an asymmetry, with higher flux near the southwest ansa, which is also closer to the star. Scattered light observed with the Hubble Space Telescope is also asymmetric, being more radially extended to the northeast. We fit the millimeter emission with parametric models and place constraints on the disc morphology, radius, width, dust mass, and scale height. We find the southwest ansa asymmetry is best fitted by an extended clump on the inner edge of the disc, consistent with perturbations from a planet with mass $8 M_{\oplus} {-} 11 M_{\rm Jup}$ at ${\sim}60$ au that may have migrated outwards, similar to Neptune in our Solar System. If the measured vertical aspect ratio of $h{=}0.04{\pm}0.01$ is due to dynamical interactions in the disc, then this requires perturbers with sizes ${>}1200$ km. We find tentative evidence for an 0.86 mm excess within 10 au, $70{\pm}22\, μ$Jy, that may be due to an inner planetesimal belt. We find no evidence for CO gas, but set an upper bound on the CO gas mass of $4{\times}10^{-6}$ M$_{\oplus}$ ($3\,σ$), consistent with cometary abundances in the Solar System. △ Less

Submitted 10 June, 2021; originally announced June 2021.

Comments: Accepted for publication in MNRAS. Paper: 21 pages, appendix: 4 pages. 16 figures

arXiv:2106.04972 [pdf, other]

Understanding Softmax Confidence and Uncertainty

Authors: Tim Pearce, Alexandra Brintrup, Jun Zhu

Abstract: It is often remarked that neural networks fail to increase their uncertainty when predicting on data far from the training distribution. Yet naively using softmax confidence as a proxy for uncertainty achieves modest success in tasks exclusively testing for this, e.g., out-of-distribution (OOD) detection. This paper investigates this contradiction, identifying two implicit biases that do encourage… ▽ More It is often remarked that neural networks fail to increase their uncertainty when predicting on data far from the training distribution. Yet naively using softmax confidence as a proxy for uncertainty achieves modest success in tasks exclusively testing for this, e.g., out-of-distribution (OOD) detection. This paper investigates this contradiction, identifying two implicit biases that do encourage softmax confidence to correlate with epistemic uncertainty: 1) Approximately optimal decision boundary structure, and 2) Filtering effects of deep networks. It describes why low-dimensional intuitions about softmax confidence are misleading. Diagnostic experiments quantify reasons softmax confidence can fail, finding that extrapolations are less to blame than overlap between training and OOD data in final-layer representations. Pre-trained/fine-tuned networks reduce this overlap. △ Less

Submitted 9 June, 2021; originally announced June 2021.

arXiv:2104.04258 [pdf, other]

Counter-Strike Deathmatch with Large-Scale Behavioural Cloning

Authors: Tim Pearce, Jun Zhu

Abstract: This paper describes an AI agent that plays the popular first-person-shooter (FPS) video game `Counter-Strike; Global Offensive' (CSGO) from pixel input. The agent, a deep neural network, matches the performance of the medium difficulty built-in AI on the deathmatch game mode, whilst adopting a humanlike play style. Unlike much prior work in games, no API is available for CSGO, so algorithms must… ▽ More This paper describes an AI agent that plays the popular first-person-shooter (FPS) video game `Counter-Strike; Global Offensive' (CSGO) from pixel input. The agent, a deep neural network, matches the performance of the medium difficulty built-in AI on the deathmatch game mode, whilst adopting a humanlike play style. Unlike much prior work in games, no API is available for CSGO, so algorithms must train and run in real-time. This limits the quantity of on-policy data that can be generated, precluding many reinforcement learning algorithms. Our solution uses behavioural cloning - training on a large noisy dataset scraped from human play on online servers (4 million frames, comparable in size to ImageNet), and a smaller dataset of high-quality expert demonstrations. This scale is an order of magnitude larger than prior work on imitation learning in FPS games. △ Less

Submitted 9 December, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

Comments: Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2021

arXiv:2103.04977 [pdf, other]

doi 10.1093/mnras/stab760

Fomalhaut b could be massive and sculpting the narrow, eccentric debris disc, if in mean-motion resonance with it

Authors: Tim D. Pearce, Hervé Beust, Virginie Faramaz, Mark Booth, Alexander V. Krivov, Torsten Löhne, Pedro P. Poblete

Abstract: The star Fomalhaut hosts a narrow, eccentric debris disc, plus a highly eccentric companion Fomalhaut b. It is often argued that Fomalhaut b cannot have significant mass, otherwise it would quickly perturb the disc. We show that material in internal mean-motion resonances with a massive, coplanar Fomalhaut b would actually be long-term stable, and occupy orbits similar to the observed debris. Furt… ▽ More The star Fomalhaut hosts a narrow, eccentric debris disc, plus a highly eccentric companion Fomalhaut b. It is often argued that Fomalhaut b cannot have significant mass, otherwise it would quickly perturb the disc. We show that material in internal mean-motion resonances with a massive, coplanar Fomalhaut b would actually be long-term stable, and occupy orbits similar to the observed debris. Furthermore, millimetre dust released in collisions between resonant bodies could reproduce the width, shape and orientation of the observed disc. We first re-examine the possible orbits of Fomalhaut b, assuming that it moves under gravity alone. If Fomalhaut b orbits close to the disc midplane then its orbit crosses the disc, and the two are apsidally aligned. This alignment may hint at an ongoing dynamical interaction. Using the observationally allowed orbits, we then model the interaction between a massive Fomalhaut b and debris. Whilst most debris is unstable in such an extreme configuration, we identify several resonant populations that remain stable for the stellar lifetime, despite crossing the orbit of Fomalhaut b. This debris occupies low-eccentricity orbits similar to the observed debris ring. These resonant bodies would have a clumpy distribution, but dust released in collisions between them would form a narrow, relatively smooth ring similar to observations. We show that if Fomalhaut b has a mass between those of Earth and Jupiter then, far from removing the observed debris, it could actually be sculpting it through resonant interactions. △ Less

Submitted 8 March, 2021; originally announced March 2021.

Comments: 24 pages, 11 figures, accepted for publication in MNRAS

arXiv:2102.06893 [pdf]

A Bayesian social platform for inclusive and evidence-based decision making

Authors: Susannah Kate Devitt, Tamara Rose Pearce, Alok Kumar Chowdhury, Kerrie Mengersen

Abstract: Against the backdrop of a social media reckoning, this paper seeks to demonstrate the potential of social tools to build virtuous behaviours online. We must assume that human behaviour is flawed, the truth can be elusive, and as communities we must commit to mechanisms to encourage virtuous social digital behaviours. Societies that use social platforms should be inclusive, responsive to evidence,… ▽ More Against the backdrop of a social media reckoning, this paper seeks to demonstrate the potential of social tools to build virtuous behaviours online. We must assume that human behaviour is flawed, the truth can be elusive, and as communities we must commit to mechanisms to encourage virtuous social digital behaviours. Societies that use social platforms should be inclusive, responsive to evidence, limit punitive actions and allow productive discord and respectful disagreement. Social media success, we argue, is in the hypothesis. Documents are valuable to the degree that they are evidence in service of, or to challenge an idea for a purpose. We outline how a Bayesian social platform can facilitate virtuous behaviours to build evidence-based collective rationality. The chapter outlines the epistemic architecture of the platform's algorithms and user interface in conjunction with explicit community management to ensure psychological safety. The BetterBeliefs platform rewards users who demonstrate epistemically virtuous behaviours and exports evidence-based propositions for decision-making. A Bayesian social network can make virtuous ideas powerful. △ Less

Submitted 13 February, 2021; originally announced February 2021.

Comments: 38 pages, 3 tables, 13 figures submitted for peer review for inclusion in M. Alfano, C. Klein and J de Ridder (Eds.) Social Virtue Epistemology. Routledge [forthcoming]

MSC Class: 62C12 ACM Class: H.4.1; H.4.2; H.4.3; H.5.3; J.4

arXiv:2010.14521 [pdf, other]

doi 10.1093/mnras/staa3362

Resolving the outer ring of HD 38206 using ALMA and constraining limits on planets in the system

Authors: Mark Booth, Michael Schulz, Alexander V. Krivov, Sebastián Marino, Tim D. Pearce, Ralf Launhardt

Abstract: HD 38206 is an A0V star in the Columba association, hosting a debris disc first discovered by IRAS. Further observations by Spitzer and Herschel showed that the disc has two components, likely analogous to the asteroid and Kuiper belts of the Solar System. The young age of this star makes it a prime target for direct imaging planet searches. Possible planets in the system can be constrained using… ▽ More HD 38206 is an A0V star in the Columba association, hosting a debris disc first discovered by IRAS. Further observations by Spitzer and Herschel showed that the disc has two components, likely analogous to the asteroid and Kuiper belts of the Solar System. The young age of this star makes it a prime target for direct imaging planet searches. Possible planets in the system can be constrained using the debris disc. Here we present the first ALMA observations of the system's Kuiper belt and fit them using a forward modelling MCMC approach. We detect an extended disc of dust peaking at around 180 au with a width of 140 au. The disc is close to edge on and shows tentative signs of an asymmetry best fit by an eccentricity of $0.25^{+0.10}_{-0.09}$. We use the fitted parameters to determine limits on the masses of planets interior to the cold belt. We determine that a minimum of four planets are required, each with a minimum mass of 0.64 M$_J$, in order to clear the gap between the asteroid and Kuiper belts of the system. If we make the assumption that the outermost planet is responsible for the stirring of the disc, the location of its inner edge and the eccentricity of the disc, then we can more tightly predict its eccentricity, mass and semimajor axis to be $e_{\rm{p}}=0.34^{+0.20}_{-0.13}$, $m_{\rm{p}}=0.7^{+0.5}_{-0.3}\,\rm{M}_{\rm{J}}$ and $a_{\rm{p}}=76^{+12}_{-13}\,\rm{au}$. △ Less

Submitted 27 October, 2020; originally announced October 2020.

Comments: 9 pages, 5 figures, accepted for publication in MNRAS

Journal ref: MNRAS, 500, 2, 1604-1611 (2021)

arXiv:2008.07505 [pdf, other]

doi 10.1093/mnras/staa2514

Gas trap** of hot dust around main-sequence stars

Authors: Tim D. Pearce, Alexander V. Krivov, Mark Booth

Abstract: In 2006 Vega was discovered to display excess near-infrared emission. Surveys now detect this phenomenon for one fifth of main-sequence stars, across various spectral types and ages. The excesses are interpreted as populations of small, hot dust grains very close to their stars, which must originate from comets or asteroids. However, the presence of such grains in copious amounts is mysterious, si… ▽ More In 2006 Vega was discovered to display excess near-infrared emission. Surveys now detect this phenomenon for one fifth of main-sequence stars, across various spectral types and ages. The excesses are interpreted as populations of small, hot dust grains very close to their stars, which must originate from comets or asteroids. However, the presence of such grains in copious amounts is mysterious, since they should rapidly sublimate or be blown out of the system. Here we investigate a potential mechanism to generate excesses: dust migrating inwards under radiation forces sublimates near the star, releasing modest quantities of gas which then traps subsequent grains. This mechanism requires neither specialised system architectures nor high dust supply rates, and could operate across diverse stellar types and ages. The model naturally reproduces many features of inferred dust populations, in particular their location, preference for small grains, steep size distribution, and dust location scaling with stellar luminosity. For Sun-like stars the mechanism can produce 2.2 micron excesses that are an order of magnitude larger than those at 8.5 micron, as required by observations. However, for A-type stars the simulated near-infrared excesses were only twice those in the mid infrared; grains would have to be 5-10 times smaller than those trapped in our model to be able to explain observed near-infrared excesses around A stars. Further progress with any hot dust explanation for A stars requires a means for grains to become very hot without either rapidly sublimating or being blown out of the system. △ Less

Submitted 17 August, 2020; originally announced August 2020.

Comments: 19 pages, 9 figures. Accepted for publication in MNRAS

arXiv:2007.14235 [pdf, other]

Structured Weight Priors for Convolutional Neural Networks

Authors: Tim Pearce, Andrew Y. K. Foong, Alexandra Brintrup

Abstract: Selection of an architectural prior well suited to a task (e.g. convolutions for image data) is crucial to the success of deep neural networks (NNs). Conversely, the weight priors within these architectures are typically left vague, e.g.~independent Gaussian distributions, which has led to debate over the utility of Bayesian deep learning. This paper explores the benefits of adding structure to we… ▽ More Selection of an architectural prior well suited to a task (e.g. convolutions for image data) is crucial to the success of deep neural networks (NNs). Conversely, the weight priors within these architectures are typically left vague, e.g.~independent Gaussian distributions, which has led to debate over the utility of Bayesian deep learning. This paper explores the benefits of adding structure to weight priors. It initially considers first-layer filters of a convolutional NN, designing a prior based on random Gabor filters. Second, it considers adding structure to the prior of final-layer weights by estimating how each hidden feature relates to each class. Empirical results suggest that these structured weight priors lead to more meaningful functional priors for image data. This contributes to the ongoing discussion on the importance of weight priors. △ Less

Submitted 12 July, 2020; originally announced July 2020.

Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

arXiv:2004.07171 [pdf, other]

Musical Features for Automatic Music Transcription Evaluation

Authors: Adrien Ycart, Lele Liu, Emmanouil Benetos, Marcus T. Pearce

Abstract: This technical report gives a detailed, formal description of the features introduced in the paper: Adrien Ycart, Lele Liu, Emmanouil Benetos and Marcus T. Pearce. "Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription", Transactions of the International Society for Music Information Retrieval (TISMIR), Accepted, 2020. This technical report gives a detailed, formal description of the features introduced in the paper: Adrien Ycart, Lele Liu, Emmanouil Benetos and Marcus T. Pearce. "Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription", Transactions of the International Society for Music Information Retrieval (TISMIR), Accepted, 2020. △ Less

Submitted 15 April, 2020; originally announced April 2020.

Comments: Technical report

arXiv:2002.08517 [pdf, other]

Avoiding Kernel Fixed Points: Computing with ELU and GELU Infinite Networks

Authors: Russell Tsuchida, Tim Pearce, Chris van der Heide, Fred Roosta, Marcus Gallagher

Abstract: Analysing and computing with Gaussian processes arising from infinitely wide neural networks has recently seen a resurgence in popularity. Despite this, many explicit covariance functions of networks with activation functions used in modern networks remain unknown. Furthermore, while the kernels of deep networks can be computed iteratively, theoretical understanding of deep kernels is lacking, par… ▽ More Analysing and computing with Gaussian processes arising from infinitely wide neural networks has recently seen a resurgence in popularity. Despite this, many explicit covariance functions of networks with activation functions used in modern networks remain unknown. Furthermore, while the kernels of deep networks can be computed iteratively, theoretical understanding of deep kernels is lacking, particularly with respect to fixed-point dynamics. Firstly, we derive the covariance functions of multi-layer perceptrons (MLPs) with exponential linear units (ELU) and Gaussian error linear units (GELU) and evaluate the performance of the limiting Gaussian processes on some benchmarks. Secondly, and more generally, we analyse the fixed-point dynamics of iterated kernels corresponding to a broad range of activation functions. We find that unlike some previously studied neural network kernels, these new kernels exhibit non-trivial fixed-point dynamics which are mirrored in finite-width neural networks. The fixed point behaviour present in some networks explains a mechanism for implicit regularisation in overparameterised deep models. Our results relate to both the static iid parameter conjugate kernel and the dynamic neural tangent kernel constructions. Software at github.com/RussellTsuchida/ELU_GELU_kernels. △ Less

Submitted 28 February, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

Comments: AAAI camera ready version. 18 pages, 9 figures, 2 tables. Corrected name particle capitalisation and formatting

arXiv:1905.06076 [pdf, other]

Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions

Authors: Tim Pearce, Russell Tsuchida, Mohamed Zaki, Alexandra Brintrup, Andy Neely

Abstract: A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e.g. summing a periodic and linear kernel can capture seasonal variation with a long term trend. Despite a well-studied link between GPs and Bayesian neural networks (BNNs), the BNN analogue of this has not yet been explored. This paper derives BNN archit… ▽ More A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e.g. summing a periodic and linear kernel can capture seasonal variation with a long term trend. Despite a well-studied link between GPs and Bayesian neural networks (BNNs), the BNN analogue of this has not yet been explored. This paper derives BNN architectures mirroring such kernel combinations. Furthermore, it shows how BNNs can produce periodic kernels, which are often useful in this context. These ideas provide a principled approach to designing BNNs that incorporate prior knowledge about a function. We showcase the practical value of these ideas with illustrative experiments in supervised and reinforcement learning settings. △ Less

Submitted 28 June, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

Journal ref: The 35th Conference on Uncertainty in Artificial Intelligence (UAI 2019)

arXiv:1904.02288 [pdf]

Metabolomics in the Cloud: Scaling Computational Tools to Big Data

Authors: Jianliang Gao, Noureddin Sadawi, Ibrahim Karaman, Jake T M Pearce, Pablo Moreno, Anders Larsson, Marco Capuccini, Paul Elliott, Jeremy K Nicholson, Timothy M D Ebbels, Robert Glen

Abstract: Background: Metabolomics datasets are becoming increasingly large and complex, with multiple types of algorithms and workflows needed to process and analyse the data. A cloud infrastructure with portable software tools can provide much needed resources enabling faster processing of much larger datasets than would be possible at any individual lab. The PhenoMeNal project has developed such an infra… ▽ More Background: Metabolomics datasets are becoming increasingly large and complex, with multiple types of algorithms and workflows needed to process and analyse the data. A cloud infrastructure with portable software tools can provide much needed resources enabling faster processing of much larger datasets than would be possible at any individual lab. The PhenoMeNal project has developed such an infrastructure, allowing users to run analyses on local or commercial cloud platforms. We have examined the computational scaling behaviour of the PhenoMeNal platform using four different implementations across 1-1000 virtual CPUs using two common metabolomics tools. Results: Our results show that data which takes up to 4 days to process on a standard desktop computer can be processed in just 10 min on the largest cluster. Improved runtimes come at the cost of decreased efficiency, with all platforms falling below 80% efficiency above approximately 1/3 of the maximum number of vCPUs. An economic analysis revealed that running on large scale cloud platforms is cost effective compared to traditional desktop systems. Conclusions: Overall, cloud implementations of PhenoMeNal show excellent scalability for standard metabolomics computing tasks on a range of platforms, making them a compelling choice for research computing in metabolomics. △ Less

Submitted 9 April, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

Comments: 25 pages, 5 figures

arXiv:1811.12188 [pdf, other]

Bayesian Neural Network Ensembles

Authors: Tim Pearce, Mohamed Zaki, Andy Neely

Abstract: Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset. The variance of the ensemble's predictions is interpreted as its epistemic uncertainty. The appeal of ensembling stems from being a collection of regular NNs - this makes them both scalable an… ▽ More Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset. The variance of the ensemble's predictions is interpreted as its epistemic uncertainty. The appeal of ensembling stems from being a collection of regular NNs - this makes them both scalable and easily implementable. They have achieved strong empirical results in recent years, often presented as a practical alternative to more costly Bayesian NNs (BNNs). The departure from Bayesian methodology is of concern since the Bayesian framework provides a principled, widely-accepted approach to handling uncertainty. In this extended abstract we derive and implement a modified NN ensembling scheme, which provides a consistent estimator of the Bayesian posterior in wide NNs - regularising parameters about values drawn from a prior distribution. △ Less

Submitted 27 November, 2018; originally announced November 2018.

Comments: arXiv admin note: substantial text overlap with arXiv:1810.05546

arXiv:1810.05546 [pdf, other]

Uncertainty in Neural Networks: Approximately Bayesian Ensembling

Authors: Tim Pearce, Felix Leibfried, Alexandra Brintrup, Mohamed Zaki, Andy Neely

Abstract: Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesi… ▽ More Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesian. This work proposes one modification to the usual process that we argue does result in approximate Bayesian inference; regularising parameters about values drawn from a distribution which can be set equal to the prior. A theoretical analysis of the procedure in a simplified setting suggests the recovered posterior is centred correctly but tends to have an underestimated marginal variance, and overestimated correlation. However, two conditions can lead to exact recovery. We argue that these conditions are partially present in NNs. Empirical evaluations demonstrate it has an advantage over standard ensembling, and is competitive with variational methods. △ Less

Submitted 26 February, 2020; v1 submitted 12 October, 2018; originally announced October 2018.

Comments: Please cite as published in AISTATS 2020

Journal ref: The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020

arXiv:1807.00790 [pdf, other]

An energy-based generative sequence model for testing sensory theories of Western harmony

Authors: Peter M. C. Harrison, Marcus T. Pearce

Abstract: The relationship between sensory consonance and Western harmony is an important topic in music theory and psychology. We introduce new methods for analysing this relationship, and apply them to large corpora representing three prominent genres of Western music: classical, popular, and jazz music. These methods centre on a generative sequence model with an exponential-family energy-based form that… ▽ More The relationship between sensory consonance and Western harmony is an important topic in music theory and psychology. We introduce new methods for analysing this relationship, and apply them to large corpora representing three prominent genres of Western music: classical, popular, and jazz music. These methods centre on a generative sequence model with an exponential-family energy-based form that predicts chord sequences from continuous features. We use this model to investigate one aspect of instantaneous consonance (harmonicity) and two aspects of sequential consonance (spectral distance and voice-leading distance). Applied to our three musical genres, the results generally support the relationship between sensory consonance and harmony, but lead us to question the high importance attributed to spectral distance in the psychological literature. We anticipate that our methods will provide a useful platform for future work linking music psychology to music theory. △ Less

Submitted 2 July, 2018; originally announced July 2018.

Comments: 8 pages, 2 figures. To appear in Proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR), Paris, France, 2018

arXiv:1805.11324 [pdf, other]

Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning

Authors: Tim Pearce, Nicolas Anastassacos, Mohamed Zaki, Andy Neely

Abstract: The use of ensembles of neural networks (NNs) for the quantification of predictive uncertainty is widespread. However, the current justification is intuitive rather than analytical. This work proposes one minor modification to the normal ensembling methodology, which we prove allows the ensemble to perform Bayesian inference, hence converging to the corresponding Gaussian Process as both the total… ▽ More The use of ensembles of neural networks (NNs) for the quantification of predictive uncertainty is widespread. However, the current justification is intuitive rather than analytical. This work proposes one minor modification to the normal ensembling methodology, which we prove allows the ensemble to perform Bayesian inference, hence converging to the corresponding Gaussian Process as both the total number of NNs, and the size of each, tend to infinity. This working paper provides early-stage results in a reinforcement learning setting, analysing the practicality of the technique for an ensemble of small, finite number. Using the uncertainty estimates produced by anchored ensembles to govern the exploration-exploitation process results in steadier, more stable learning. △ Less

Submitted 2 July, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

arXiv:1802.07167 [pdf, other]

High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach

Authors: Tim Pearce, Mohamed Zaki, Alexandra Brintrup, Andy Neely

Abstract: This paper considers the generation of prediction intervals (PIs) by neural networks for quantifying uncertainty in regression tasks. It is axiomatic that high-quality PIs should be as narrow as possible, whilst capturing a specified portion of data. We derive a loss function directly from this axiom that requires no distributional assumption. We show how its form derives from a likelihood princip… ▽ More This paper considers the generation of prediction intervals (PIs) by neural networks for quantifying uncertainty in regression tasks. It is axiomatic that high-quality PIs should be as narrow as possible, whilst capturing a specified portion of data. We derive a loss function directly from this axiom that requires no distributional assumption. We show how its form derives from a likelihood principle, that it can be used with gradient descent, and that model uncertainty is accounted for in ensembled form. Benchmark experiments show the method outperforms current state-of-the-art uncertainty quantification methods, reducing average PI width by over 10%. △ Less

Submitted 15 June, 2018; v1 submitted 20 February, 2018; originally announced February 2018.

Report number: PMLR 80:4075-4084, 2018

Journal ref: Proceedings of the 35th International Conference on Machine Learning, 2018

arXiv:1708.03687 [pdf]

Effects of pitch and timing expectancy on musical emotion

Authors: Sarah A. Sauvé, Aminah Sayed, Roger T. Dean, Marcus T. Pearce

Abstract: Pitch and timing information work hand in hand to create a coherent piece of music; but what happens when this information goes against the norm? Relationships between musical expectancy and emotional responses were investigated in a study conducted with 40 participants: 20 musicians and 20 non-musicians. Participants took part in one of two behavioural paradigms measuring continuous expectancy or… ▽ More Pitch and timing information work hand in hand to create a coherent piece of music; but what happens when this information goes against the norm? Relationships between musical expectancy and emotional responses were investigated in a study conducted with 40 participants: 20 musicians and 20 non-musicians. Participants took part in one of two behavioural paradigms measuring continuous expectancy or emotional responses (arousal and valence) while listening to folk melodies that exhibited either high or low pitch predictability and high or low onset predictability. The causal influence of pitch predictability was investigated in an additional condition where pitch was artificially manipulated and a comparison conducted between original and manipulated forms; the dynamic correlative influence of pitch and timing information and its perception on emotional change during listening was evaluated using cross-sectional time series analysis. The results indicate that pitch and onset predictability are consistent predictors of perceived expectancy and emotional response, with onset carrying more weight than pitch. In addition, musicians and non-musicians do not differ in their responses, possibly due to shared cultural background and knowledge. The results demonstrate in a controlled lab-based setting a precise, quantitative relationship between the predictability of musical structure, expectation and emotional response. △ Less

Submitted 11 August, 2017; originally announced August 2017.

Comments: 53 pages, 5 figures; Submitted to Psychomusicology

arXiv:1708.03666 [pdf]

Attention but not musical training affects auditory streaming

Authors: Sarah A. Sauvé, Marcus T. Pearce

Abstract: While musicians generally perform better than non-musicians in various auditory discrimination tasks, effects of specific instrumental training have received little attention. The effects of instrument-specific musical training on auditory grou** in the context of stream segregation are investigated here in three experiments. In Experiment 1a, participants listened to sequences of ABA tones and… ▽ More While musicians generally perform better than non-musicians in various auditory discrimination tasks, effects of specific instrumental training have received little attention. The effects of instrument-specific musical training on auditory grou** in the context of stream segregation are investigated here in three experiments. In Experiment 1a, participants listened to sequences of ABA tones and indicated when they heard a change in rhythm. This change is caused by the manipulation of the B tones' timbre and indexes a change in perception from integration to segregation, or vice versa. While it was expected that musicians would detect a change in rhythm earlier when their own instrument was involved, no such pattern was observed. In Experiment 1b, designed to control for potential expectation effects in Experiment 1a, participants heard sequences of static ABA tones and reported their initial perceptions, whether the sequence was integrated or segregated. Results show that participants tend to initially perceive these static sequences as segregated, and that perception is influenced by similarity between the timbres involved. Finally, in Experiment 2 violinists and flautists located mistuned notes in an interleaved melody paradigm containing a violin and a flute melody. Performance did not depend on the instrument the participant played but rather which melody their attention was directed to. Taken together, results from the three experiments suggest that the specific instrument one practices does not have an influence on auditory grou**, but attentional mechanisms are necessary for processing auditory scenes. △ Less

Submitted 11 August, 2017; originally announced August 2017.

Comments: 36 pages, 6 figures

arXiv:1511.09390 [pdf, other]

doi 10.1051/0004-6361/201527863

An M-dwarf star in the transition disk of Herbig HD 142527; Physical parameters and orbital elements

Authors: S. Lacour, B. Biller, A. Cheetham, A. Greenbaum, T. Pearce, S. Marino, P. Tuthill, L. Pueyo, E. E. Mamajek, J. H. Girard, A. Sivaramakrishnan, M. Bonnefoy, I. Baraffe, G. Chauvin, J. Olofsson, A. Juhasz, M. Benisty, J. -U. Pott, A. Sicilia-Aguilar, T. Henning, A. Cardwell, S. Goodsell, J. R. Graham, P. Hibon, P. Ingraham , et al. (7 additional authors not shown)

Abstract: HD 142527A is one of the most studied Herbig Ae/Be stars with a transitional disk, as it has the largest imaged gap in any protoplanetary disk: the gas is cleared from 30 to 90 AU. The HD142527 system is also unique in that it has a stellar companion with a small mass compared to the mass of the primary star. This factor of ~20 in mass ratio between the two objects makes this binary system differe… ▽ More HD 142527A is one of the most studied Herbig Ae/Be stars with a transitional disk, as it has the largest imaged gap in any protoplanetary disk: the gas is cleared from 30 to 90 AU. The HD142527 system is also unique in that it has a stellar companion with a small mass compared to the mass of the primary star. This factor of ~20 in mass ratio between the two objects makes this binary system different from any other YSO. The HD142527 system could therefore provide a valuable test bed. This low-mass stellar object may be responsible for both the gap and dust trap** observed by ALMA at longer distances. We observed this system with the NACO and GPI instruments using the aperture masking technique. Aperture masking is ideal for providing high dynamic range even at very small angular separations. We present the spectral energy distribution for HD142527A and B. Brightness of the companion is now known from the R band up to the M' band. We also followed the orbital motion of HD 142527B over a period of more than two years. The SED of the companion is compatible with a T=3000+/-100K object in addition to a 1700K blackbody environment (likely a circus-secondary disk). From evolution models, we find that it is compatible with an object of mass 0.13+/-0.03Msun, radius 0.90+/-0.15Rsun, and age $1.0^{+1.0}_{-0.75}$Myr. This age is significantly younger than the age previously estimated for HD142527A. Computations to constrain the orbital parameters found a semi major axis of $140^{+120}_{-70}$mas, an eccentricity of 0.5+/-0.2, an inclination of 125+/-15 degrees, and a position angle of the right ascending node of -5+/-40 degrees. Inclination and position angle of the ascending node are in agreement with an orbit coplanar with the inner disk, not coplanar with the outer disk. Despite its high eccentricity, it is unlikely that HD142527B is responsible for truncating the inner edge of the outer disk. △ Less

Submitted 22 June, 2016; v1 submitted 30 November, 2015; originally announced November 2015.

Comments: published in A&A; 8 pages

Journal ref: A&A 590, A90 (2016)

arXiv:1507.04367 [pdf, other]

doi 10.1093/mnras/stv1847

Double-ringed debris discs could be the work of eccentric planets: explaining the strange morphology of HD 107146

Authors: Tim Pearce, Mark Wyatt

Abstract: We investigate the general interaction between an eccentric planet and a coplanar debris disc of the same mass, using analytical theory and n-body simulations. Such an interaction could result from a planet-planet scattering or merging event. We show that when the planet mass is comparable to that of the disc, the former is often circularised with little change to its semimajor axis. The secular e… ▽ More We investigate the general interaction between an eccentric planet and a coplanar debris disc of the same mass, using analytical theory and n-body simulations. Such an interaction could result from a planet-planet scattering or merging event. We show that when the planet mass is comparable to that of the disc, the former is often circularised with little change to its semimajor axis. The secular effect of such a planet can cause debris to apsidally anti-align with the planet's orbit (the opposite of what may be naively expected), leading to the counter-intuitive result that a low-mass planet may clear a larger region of debris than a higher-mass body would. The interaction generally results in a double-ringed debris disc, which is comparable to those observed in HD 107146 and HD 92945. As an example we apply our results to HD 107146, and show that the disc's morphology and surface brightness profile can be well-reproduced if the disc is interacting with an eccentric planet of comparable mass (~10-100 Earth masses). This hypothetical planet had a pre-interaction semimajor axis of 30 or 40 au (similar to its present-day value) and an eccentricity of 0.4 or 0.5 (which would since have reduced to ~0.1). Thus the planet (if it exists) presently resides near the inner edge of the disc, rather than between the two debris peaks as may otherwise be expected. Finally we show that disc self-gravity can be important in this mass regime and, whilst it would not affect these results significantly, it should be considered when probing the interaction between a debris disc and a planet. △ Less

Submitted 15 July, 2015; originally announced July 2015.

Comments: Submitted to MNRAS, uploaded in revised form

arXiv:1502.01834 [pdf, other]

doi 10.1093/mnras/stv252

Constraining the orbits of sub-stellar companions imaged over short orbital arcs

Authors: Tim D. Pearce, Mark C. Wyatt, Grant M. Kennedy

Abstract: Imaging a star's companion at multiple epochs over a short orbital arc provides only four of the six coordinates required for a unique orbital solution. Probability distributions of possible solutions are commonly generated by Monte Carlo (MCMC) analysis, but these are biased by priors and may not probe the full parameter space. We suggest alternative methods to characterise possible orbits, which… ▽ More Imaging a star's companion at multiple epochs over a short orbital arc provides only four of the six coordinates required for a unique orbital solution. Probability distributions of possible solutions are commonly generated by Monte Carlo (MCMC) analysis, but these are biased by priors and may not probe the full parameter space. We suggest alternative methods to characterise possible orbits, which compliment the MCMC technique. Firstly the allowed ranges of orbital elements are prior-independent, and we provide means to calculate these ranges without numerical analyses. Hence several interesting constraints (including whether a companion even can be bound, its minimum possible semi-major axis and its minimum eccentricity) may be quickly computed using our relations as soon as orbital motion is detected. We also suggest an alternative to posterior probability distributions as a means to present possible orbital elements, namely contour plots of elements as functions of line of sight coordinates. These plots are prior-independent, readily show degeneracies between elements and allow readers to extract orbital solutions themselves. This approach is particularly useful when there are other constraints on the geometry, for example if a companion's orbit is assumed to be aligned with a disc. As examples we apply our methods to several imaged sub-stellar companions including Fomalhaut b, and for the latter object we show how different origin hypotheses affect its possible orbital solutions. We also examine visual companions of A- and G-type main sequence stars in the Washington Double Star Catalogue, and show that $\gtrsim50$ per cent must be unbound. △ Less

Submitted 6 February, 2015; originally announced February 2015.

Comments: Accepted for publication in MNRAS

Showing 1–50 of 52 results for author: Pearce, T