-
Reconciling Kaplan and Chinchilla Scaling Laws
Authors:
Tim Pearce,
**yeop Song
Abstract:
Kaplan et al. [2020] (`Kaplan') and Hoffmann et al. [2022] (`Chinchilla') studied the scaling behavior of transformers trained on next-token language prediction. These studies produced different estimates for how the number of parameters ($N$) and training tokens ($D$) should be set to achieve the lowest possible loss for a given compute budget ($C$). Kaplan: $N_\text{optimal} \propto C^{0.73}$, C…
▽ More
Kaplan et al. [2020] (`Kaplan') and Hoffmann et al. [2022] (`Chinchilla') studied the scaling behavior of transformers trained on next-token language prediction. These studies produced different estimates for how the number of parameters ($N$) and training tokens ($D$) should be set to achieve the lowest possible loss for a given compute budget ($C$). Kaplan: $N_\text{optimal} \propto C^{0.73}$, Chinchilla: $N_\text{optimal} \propto C^{0.50}$. This note finds that much of this discrepancy can be attributed to Kaplan counting non-embedding rather than total parameters, combined with their analysis being performed at small scale. Simulating the Chinchilla study under these conditions produces biased scaling coefficients close to Kaplan's. Hence, this note reaffirms Chinchilla's scaling coefficients, by explaining the cause of Kaplan's original overestimation.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
JWST/NIRCam 4-5 $μ$m Imaging of the Giant Planet AF Lep b
Authors:
Kyle Franson,
William O. Balmer,
Brendan P. Bowler,
Laurent Pueyo,
Yifan Zhou,
Emily Rickman,
Zhoujian Zhang,
Sagnick Mukherjee,
Tim D. Pearce,
Daniella C. Bardalez Gagliuffi,
Lauren I. Biddle,
Timothy D. Brandt,
Rachel Bowens-Rubin,
Justin R. Crepp,
James W. Davidson, Jr.,
Jacqueline Faherty,
Christian Ginski,
Elliott P. Horch,
Marvin Morgan,
Caroline V. Morley,
Marshall D. Perrin,
Aniket Sanghi,
Maissa Salama,
Christopher A. Theissen,
Quang H. Tran
, et al. (1 additional authors not shown)
Abstract:
With a dynamical mass of $3 \, M_\mathrm{Jup}$, the recently discovered giant planet AF Lep b is the lowest-mass imaged planet with a direct mass measurement. Its youth and spectral type near the L/T transition make it a promising target to study the impact of clouds and atmospheric chemistry at low surface gravities. In this work, we present JWST/NIRCam imaging of AF Lep b. Across two epochs, we…
▽ More
With a dynamical mass of $3 \, M_\mathrm{Jup}$, the recently discovered giant planet AF Lep b is the lowest-mass imaged planet with a direct mass measurement. Its youth and spectral type near the L/T transition make it a promising target to study the impact of clouds and atmospheric chemistry at low surface gravities. In this work, we present JWST/NIRCam imaging of AF Lep b. Across two epochs, we detect AF Lep b in F444W ($4.4 \, \mathrm{μm}$) with S/N ratios of 9.6 and 8.7, respectively. At the planet's separation of $320 \, \mathrm{mas}$ during the observations, the coronagraphic throughput is ${\approx}7\%$, demonstrating that NIRCam's excellent sensitivity persists down to small separations. The F444W photometry of AF Lep b affirms the presence of disequilibrium carbon chemistry and enhanced atmospheric metallicity. These observations also place deep limits on wider-separation planets in the system, ruling out $1.1 \, M_\mathrm{Jup}$ planets beyond $15.6 \, \mathrm{au}$ (0.58 arcsec), $1.1 \, M_\mathrm{Sat}$ planets beyond $27 \, \mathrm{au}$ (1 arcsec), and $2.8 \, M_\mathrm{Nep}$ planets beyond $67 \, \mathrm{au}$ (2.5 arcsec). We also present new Keck/NIRC2 $L'$ imaging of AF Lep b; combining this with the two epochs of F444W photometry and previous Keck $L'$ photometry provides limits on the long-term 3-$5 \, \mathrm{μm}$ variability of AF Lep b on months-to-years timescales. AF Lep b is the closest-separation planet imaged with JWST to date, demonstrating that planets can be recovered well inside the nominal (50% throughput) NIRCam coronagraph inner working angle.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
From Tissue Plane to Organ World: A Benchmark Dataset for Multimodal Biomedical Image Registration using Deep Co-Attention Networks
Authors:
Yifeng Wang,
Weipeng Li,
Thomas Pearce,
Haohan Wang
Abstract:
Correlating neuropathology with neuroimaging findings provides a multiscale view of pathologic changes in the human organ spanning the meso- to micro-scales, and is an emerging methodology expected to shed light on numerous disease states. To gain the most information from this multimodal, multiscale approach, it is desirable to identify precisely where a histologic tissue section was taken from w…
▽ More
Correlating neuropathology with neuroimaging findings provides a multiscale view of pathologic changes in the human organ spanning the meso- to micro-scales, and is an emerging methodology expected to shed light on numerous disease states. To gain the most information from this multimodal, multiscale approach, it is desirable to identify precisely where a histologic tissue section was taken from within the organ in order to correlate with the tissue features in exactly the same organ region. Histology-to-organ registration poses an extra challenge, as any given histologic section can capture only a small portion of a human organ. Making use of the capabilities of state-of-the-art deep learning models, we unlock the potential to address and solve such intricate challenges. Therefore, we create the ATOM benchmark dataset, sourced from diverse institutions, with the primary objective of transforming this challenge into a machine learning problem and delivering outstanding outcomes that enlighten the biomedical community. The performance of our RegisMCAN model demonstrates the potential of deep learning to accurately predict where a subregion extracted from an organ image was obtained from within the overall 3D volume. The code and dataset can be found at: https://github.com/haizailache999/Image-Registration/tree/main
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Weight-based Decomposition: A Case for Bilinear MLPs
Authors:
Michael T. Pearce,
Thomas Dooms,
Alice Rigg
Abstract:
Gated Linear Units (GLUs) have become a common building block in modern foundation models. Bilinear layers drop the non-linearity in the "gate" but still have comparable performance to other GLUs. An attractive quality of bilinear layers is that they can be fully expressed in terms of a third-order tensor and linear operations. Leveraging this, we develop a method to decompose the bilinear tensor…
▽ More
Gated Linear Units (GLUs) have become a common building block in modern foundation models. Bilinear layers drop the non-linearity in the "gate" but still have comparable performance to other GLUs. An attractive quality of bilinear layers is that they can be fully expressed in terms of a third-order tensor and linear operations. Leveraging this, we develop a method to decompose the bilinear tensor into a set of sparsely interacting eigenvectors that show promising interpretability properties in preliminary experiments for shallow image classifiers (MNIST) and small language models (Tiny Stories). Since the decomposition is fully equivalent to the model's original computations, bilinear layers may be an interpretability-friendly architecture that helps connect features to the model weights. Application of our method may not be limited to pretrained bilinear models since we find that language models such as TinyLlama-1.1B can be finetuned into bilinear variants.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Diffusion for World Modeling: Visual Details Matter in Atari
Authors:
Eloi Alonso,
Adam Jelley,
Vincent Micheli,
Anssi Kanervisto,
Amos Storkey,
Tim Pearce,
François Fleuret
Abstract:
World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffus…
▽ More
World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffusion models have become a dominant approach for image generation, challenging well-established methods modeling discrete latents. Motivated by this paradigm shift, we introduce DIAMOND (DIffusion As a Model Of eNvironment Dreams), a reinforcement learning agent trained in a diffusion world model. We analyze the key design choices that are required to make diffusion suitable for world modeling, and demonstrate how improved visual details can lead to improved agent performance. DIAMOND achieves a mean human normalized score of 1.46 on the competitive Atari 100k benchmark; a new best for agents trained entirely within a world model. To foster future research on diffusion for world modeling, we release our code, agents and playable world models at https://github.com/eloialonso/diamond.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Debris disks around main-sequence stars
Authors:
Tim D. Pearce
Abstract:
'Debris disks' are collections of small bodies around stars, such as the Asteroid Belt and Kuiper Belt in our Solar System. These disks are composed of objects smaller than planets, including asteroids, comets, dust, and dwarf planets. We detect debris disks around a significant fraction of stars, and these disks appear to be common components of planetary systems. Extrasolar debris disks have a b…
▽ More
'Debris disks' are collections of small bodies around stars, such as the Asteroid Belt and Kuiper Belt in our Solar System. These disks are composed of objects smaller than planets, including asteroids, comets, dust, and dwarf planets. We detect debris disks around a significant fraction of stars, and these disks appear to be common components of planetary systems. Extrasolar debris disks have a broad range of locations, shapes and features. This chapter provides an introduction to debris disks around main-sequence stars. It summarises our understanding of the field, and covers a wide range of concepts from observations and theory. It describes how we detect extrasolar debris disks, what we see, and what these observations tell us. It also describes how debris disks evolve, and how they interact with planets. The chapter concludes by discussing several unsolved questions in debris-disk science.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Ultralight vector dark matter search using data from the KAGRA O3GK run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi
, et al. (1778 additional authors not shown)
Abstract:
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…
▽ More
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory
Authors:
Tianjiao Luo,
Tim Pearce,
Huayu Chen,
Jianfei Chen,
Jun Zhu
Abstract:
Generative Adversarial Imitation Learning (GAIL) trains a generative policy to mimic a demonstrator. It uses on-policy Reinforcement Learning (RL) to optimize a reward signal derived from a GAN-like discriminator. A major drawback of GAIL is its training instability - it inherits the complex training dynamics of GANs, and the distribution shift introduced by RL. This can cause oscillations during…
▽ More
Generative Adversarial Imitation Learning (GAIL) trains a generative policy to mimic a demonstrator. It uses on-policy Reinforcement Learning (RL) to optimize a reward signal derived from a GAN-like discriminator. A major drawback of GAIL is its training instability - it inherits the complex training dynamics of GANs, and the distribution shift introduced by RL. This can cause oscillations during training, harming its sample efficiency and final policy performance. Recent work has shown that control theory can help with the convergence of a GAN's training. This paper extends this line of work, conducting a control-theoretic analysis of GAIL and deriving a novel controller that not only pushes GAIL to the desired equilibrium but also achieves asymptotic stability in a 'one-step' setting. Based on this, we propose a practical algorithm 'Controlled-GAIL' (C-GAIL). On MuJoCo tasks, our controlled variant is able to speed up the rate of convergence, reduce the range of oscillation and match the expert's distribution more closely both for vanilla GAIL and GAIL-DAC.
△ Less
Submitted 26 February, 2024;
originally announced February 2024.
-
Increasing planet-stirring efficiency of debris disks by "projectile stirring" and "resonant stirring"
Authors:
Tyson Costa,
Tim D. Pearce,
Alexander V. Krivov
Abstract:
Extrasolar debris disks are detected by observing dust, which is thought to be released during planetesimal collisions. This implies that planetesimals are dynamically excited ("stirred"), such that collisions are sufficiently common and violent. The most frequently considered stirring mechanisms are self-stirring by disk self-gravity, and planet-stirring via secular interactions. However, these m…
▽ More
Extrasolar debris disks are detected by observing dust, which is thought to be released during planetesimal collisions. This implies that planetesimals are dynamically excited ("stirred"), such that collisions are sufficiently common and violent. The most frequently considered stirring mechanisms are self-stirring by disk self-gravity, and planet-stirring via secular interactions. However, these models face problems when considering disk mass, self-gravity, and planet eccentricity, leading to the possibility that other, unexplored mechanisms instead stir debris. We hypothesize that planet-stirring could be more efficient than the traditional secular model implies, due to two additional mechanisms. First, a planet at the inner edge of a debris disk can scatter massive bodies onto eccentric, disk-crossing orbits, which then excite debris ("projectile stirring"). Second, a planet can stir debris over a wide region via broad mean-motion resonances, both at and between nominal resonance locations ("resonant stirring"). Both mechanisms can be effective even for low-eccentricity planets, unlike secular-planet-stirring. We run N-body simulations across a broad parameter space, to determine the viability of these new stirring mechanisms. We quantify stirring levels using a bespoke program for assessing Rebound debris simulations, which we make publicly available. We find that even low-mass projectiles can stir disks, and verify this with a simple analytic criterion. We also show that resonant stirring is effective for planets above ~0.5 MJup. By proving that these mechanisms can increase planet-stirring efficiency, we demonstrate that planets could still be stirring debris disks even in cases where conventional (secular) planet-stirring is insufficient.
△ Less
Submitted 17 November, 2023;
originally announced November 2023.
-
The effect of sculpting planets on the steepness of debris-disc inner edges
Authors:
Tim D. Pearce,
Alexander V. Krivov,
Antranik A. Sefilian,
Marija R. Jankovic,
Torsten Löhne,
Tobias Morgner,
Mark C. Wyatt,
Mark Booth,
Sebastian Marino
Abstract:
Debris discs are our best means to probe the outer regions of planetary systems. Many studies assume that planets lie at the inner edges of debris discs, akin to Neptune and the Kuiper Belt, and use the disc morphologies to constrain those otherwise-undetectable planets. However, this produces a degeneracy in planet mass and semimajor axis. We investigate the effect of a sculpting planet on the ra…
▽ More
Debris discs are our best means to probe the outer regions of planetary systems. Many studies assume that planets lie at the inner edges of debris discs, akin to Neptune and the Kuiper Belt, and use the disc morphologies to constrain those otherwise-undetectable planets. However, this produces a degeneracy in planet mass and semimajor axis. We investigate the effect of a sculpting planet on the radial surface-density profile at the disc inner edge, and show that this degeneracy can be broken by considering the steepness of the edge profile. Like previous studies, we show that a planet on a circular orbit ejects unstable debris and excites surviving material through mean-motion resonances. For a non-migrating, circular-orbit planet, in the case where collisions are negligible, the steepness of the disc inner edge depends on the planet-to-star mass ratio and the initial-disc excitation level. We provide a simple analytic model to infer planet properties from the steepness of ALMA-resolved disc edges. We also perform a collisional analysis, showing that a purely planet-sculpted disc would be distinguishable from a purely collisional disc and that, whilst collisions flatten planet-sculpted edges, they are unlikely to fully erase a planet's signature. Finally, we apply our results to ALMA-resolved debris discs and show that, whilst many inner edges are too steep to be explained by collisions alone, they are too flat to arise through completed sculpting by non-migrating, circular-orbit planets. We discuss implications of this for the architectures, histories and dynamics in the outer regions of planetary systems.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
Authors:
Stephen Mak,
Liming Xu,
Tim Pearce,
Michael Ostroumov,
Alexandra Brintrup
Abstract:
Collaborative vehicle routing occurs when carriers collaborate through sharing their transportation requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solutio…
▽ More
Collaborative vehicle routing occurs when carriers collaborate through sharing their transportation requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solution concepts are expensive to calculate as the characteristic function scales exponentially with the number of agents. This would require solving the vehicle routing problem (NP-hard) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game solved using deep multi-agent reinforcement learning, where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function; thus, when deployed in production, we only need to evaluate the expensive post-collaboration vehicle routing problem once. Our contribution is that we are the first to consider both the route allocation problem and gain sharing problem simultaneously - without access to the expensive characteristic function. Through decentralised machine learning, our agents bargain with each other and agree to outcomes that correlate well with the Shapley value - a fair profit allocation mechanism. Importantly, we are able to achieve a reduction in run-time of 88%.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing
Authors:
Stephen Mak,
Liming Xu,
Tim Pearce,
Michael Ostroumov,
Alexandra Brintrup
Abstract:
Collaborative Vehicle Routing is where delivery companies cooperate by sharing their delivery information and performing delivery requests on behalf of each other. This achieves economies of scale and thus reduces cost, greenhouse gas emissions, and road congestion. But which company should partner with whom, and how much should each company be compensated? Traditional game theoretic solution conc…
▽ More
Collaborative Vehicle Routing is where delivery companies cooperate by sharing their delivery information and performing delivery requests on behalf of each other. This achieves economies of scale and thus reduces cost, greenhouse gas emissions, and road congestion. But which company should partner with whom, and how much should each company be compensated? Traditional game theoretic solution concepts, such as the Shapley value or nucleolus, are difficult to calculate for the real-world problem of Collaborative Vehicle Routing due to the characteristic function scaling exponentially with the number of agents. This would require solving the Vehicle Routing Problem (an NP-Hard problem) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function, and thus eliminate the need to evaluate the VRP an exponential number of times - we only need to evaluate it once. Our contribution is that our decentralised approach is both scalable and considers the self-interested nature of companies. The agents learn using a modified Independent Proximal Policy Optimisation. Our RL agents outperform a strong heuristic bot. The agents correctly identify the optimal coalitions 79% of the time with an average optimality gap of 4.2% and reduction in run-time of 62%.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Self-gravity of debris discs can strongly change the outcomes of interactions with inclined planets
Authors:
Pedro P. Poblete,
Torsten Löhne,
Tim D. Pearce,
Antranik A. Sefilian
Abstract:
Drastic changes in protoplanets' orbits could occur in the early stages of planetary systems through interactions with other planets and their surrounding protoplanetary or debris discs. The resulting planetary system could exhibit orbits with moderate to high eccentricities and/or inclinations, causing planets to perturb one another as well as the disc significantly. The present work studies the…
▽ More
Drastic changes in protoplanets' orbits could occur in the early stages of planetary systems through interactions with other planets and their surrounding protoplanetary or debris discs. The resulting planetary system could exhibit orbits with moderate to high eccentricities and/or inclinations, causing planets to perturb one another as well as the disc significantly. The present work studies the evolution of systems composed of an initially inclined planet and a debris disc. We perform N-body simulations of a narrow, self-gravitating debris disc and a single interior Neptune-like planet. We simulate systems with various initial planetary inclinations, from coplanar to polar configurations considering different separations between the planet and the disc. We find that except when the planet is initially on a polar orbit, the planet-disc system tends to reach a quasi-coplanar configuration with low vertical dispersion in the disc. When present, the Zeipel--Kozai--Lidov oscillations induced by the disc pump the planet's eccentricity and, in turn, affect the disc structure. We also find that the resulting disc morphology in most of the simulations looks very similar in both radial and vertical directions once the simulations are converged. This contrasts strongly with massless disc simulations, where vertical disc dispersion is set by the initial disc-planet inclination and can be high for initially highly inclined planets. The results suggest caution in interpreting an unseen planet's dynamical history based only on the disc's appearance.
△ Less
Submitted 13 September, 2023;
originally announced September 2023.
-
How much large dust could be present in hot exozodiacal dust systems?
Authors:
T. A. Stuber,
F. Kirchschlager,
T. D. Pearce,
S. Ertel,
A. V. Krivov,
S. Wolf
Abstract:
An infrared excess over the stellar photospheric emission of main-sequence stars has been found in interferometric surveys, commonly attributed to the presence of hot exozodiacal dust (HEZD). While submicrometer-sized grains in close vicinity to their host star have been inferred to be responsible for the found near-infrared excesses, the presence and amount of larger grains as part of the dust di…
▽ More
An infrared excess over the stellar photospheric emission of main-sequence stars has been found in interferometric surveys, commonly attributed to the presence of hot exozodiacal dust (HEZD). While submicrometer-sized grains in close vicinity to their host star have been inferred to be responsible for the found near-infrared excesses, the presence and amount of larger grains as part of the dust distributions are weakly constrained. We quantify how many larger grains (above-micrometer-sized) could be present in addition to submicrometer-sized grains, while being consistent with observational constraints. This is important in order to distinguish between various scenarios for the origin of HEZD and to better estimate its observational appearance when observed with future instruments. We extended a model suitable to reproduce current observations of HEZD to investigate a bimodal size distribution. By deriving the characteristics of dust distributions whose observables are consistent with observational limits from interferometric measurements in the $K$ and $N$ bands we constrained the radii of sub- and above-micrometer-sized grains as well as their mass, number, and flux density ratios. In the most extreme cases of some of the investigated systems, large grains $\gtrsim 10\,μ$m might dominate the mass budget of HEZD while contributing up to 25$\,$% of the total flux density originating from the dust at a wavelength of 2.13$\,μ$m and up to 50$\,$% at a wavelength of 4.1$\,μ$m; at a wavelength of 11.1$\,μ$m their emission might clearly dominate over the emission of small grains. While it is not possible to detect such hot-dust distributions using ALMA, the ngVLA might allow us to detect HEZD at millimeter wavelengths. Large dust grains might have a more important impact on the observational appearance of HEZD than previously assumed, especially at longer wavelengths.
△ Less
Submitted 20 August, 2023;
originally announced August 2023.
-
Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi
, et al. (1750 additional authors not shown)
Abstract:
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect…
▽ More
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
The clumpy structure of $ε$ Eridani's debris disc revisited by ALMA
Authors:
Mark Booth,
Tim D. Pearce,
Alexander V. Krivov,
Mark C. Wyatt,
William R. F. Dent,
Antonio S. Hales,
Jean-François Lestrade,
Fernando Cruz-Sáenz de Miera,
Virginie C. Faramaz,
Torsten Löhne,
Miguel Chavez-Dagostino
Abstract:
$ε…
▽ More
$ε$ Eridani is the closest star to our Sun known to host a debris disc. Prior observations in the (sub-)millimetre regime have potentially detected clumpy structure in the disc and attributed this to interactions with an (as yet) undetected planet. However, the prior observations were unable to distinguish between structure in the disc and background confusion. Here we present the first ALMA image of the entire disc, which has a resolution of 1.6"$\times$1.2". We clearly detect the star, the main belt and two point sources. The resolution and sensitivity of this data allow us to clearly distinguish background galaxies (that show up as point sources) from the disc emission. We show that the two point sources are consistent with background galaxies. After taking account of these, we find that resolved residuals are still present in the main belt, including two clumps with a $>3σ$ significance -- one to the east of the star and the other to the northwest. We perform $n$-body simulations to demonstrate that a migrating planet can form structures similar to those observed by trap** planetesimals in resonances. We find that the observed features can be reproduced by a migrating planet trap** planetesimals in the 2:1 mean motion resonance and the symmetry of the most prominent clumps means that the planet should have a position angle of either ${\sim10^\circ}$ or ${\sim190^\circ}$. Observations over multiple epochs are necessary to test whether the observed features rotate around the star.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Authors:
Fanqi Lin,
Shiyu Huang,
Tim Pearce,
Wenze Chen,
Wei-Wei Tu
Abstract:
Multi-agent football poses an unsolved challenge in AI research. Existing work has focused on tackling simplified scenarios of the game, or else leveraging expert demonstrations. In this paper, we develop a multi-agent system to play the full 11 vs. 11 game mode, without demonstrations. This game mode contains aspects that present major challenges to modern reinforcement learning algorithms; multi…
▽ More
Multi-agent football poses an unsolved challenge in AI research. Existing work has focused on tackling simplified scenarios of the game, or else leveraging expert demonstrations. In this paper, we develop a multi-agent system to play the full 11 vs. 11 game mode, without demonstrations. This game mode contains aspects that present major challenges to modern reinforcement learning algorithms; multi-agent coordination, long-term planning, and non-transitivity. To address these challenges, we present TiZero; a self-evolving, multi-agent system that learns from scratch. TiZero introduces several innovations, including adaptive curriculum learning, a novel self-play strategy, and an objective that optimizes the policies of multiple agents jointly. Experimentally, it outperforms previous systems by a large margin on the Google Research Football environment, increasing win rates by over 30%. To demonstrate the generality of TiZero's innovations, they are assessed on several environments beyond football; Overcooked, Multi-agent Particle-Environment, Tic-Tac-Toe and Connect-Four.
△ Less
Submitted 20 February, 2023; v1 submitted 15 February, 2023;
originally announced February 2023.
-
Astrometric Accelerations as Dynamical Beacons: A Giant Planet Imaged Inside the Debris Disk of the Young Star AF Lep
Authors:
Kyle Franson,
Brendan P. Bowler,
Yifan Zhou,
Tim D. Pearce,
Daniella C. Bardalez Gagliuffi,
Lauren Biddle,
Timothy D. Brandt,
Justin R. Crepp,
Trent J. Dupuy,
Jacqueline Faherty,
Rebecca Jensen-Clem,
Marvin Morgan,
Aniket Sanghi,
Christopher A. Theissen,
Quang H. Tran,
Trevor A. Wolf
Abstract:
We present the direct imaging discovery of a giant planet orbiting the young star AF Lep, a 1.2 $M_{\odot}$ member of the 24 $\pm$ 3 Myr $β$ Pic moving group. AF Lep was observed as part of our ongoing high-contrast imaging program targeting stars with astrometric accelerations between Hipparcos and Gaia that indicate the presence of substellar companions. Keck/NIRC2 observations in $L'$ with the…
▽ More
We present the direct imaging discovery of a giant planet orbiting the young star AF Lep, a 1.2 $M_{\odot}$ member of the 24 $\pm$ 3 Myr $β$ Pic moving group. AF Lep was observed as part of our ongoing high-contrast imaging program targeting stars with astrometric accelerations between Hipparcos and Gaia that indicate the presence of substellar companions. Keck/NIRC2 observations in $L'$ with the Vector Vortex Coronagraph reveal a point source, AF Lep b, at ${\approx}340$ mas which exhibits orbital motion at the 6-$σ$ level over the course of 13 months. A joint orbit fit yields precise constraints on the planet's dynamical mass of 3.2$^{+0.7}_{-0.6}$ $M_\mathrm{Jup}$, semi-major axis of $8.4^{+1.1}_{-1.3}$ au, and eccentricity of $0.24^{+0.27}_{-0.15}$. AF Lep hosts a debris disk located at $\sim$50 au, but it is unlikely to be sculpted by AF Lep b, implying there may be additional planets in the system at wider separations. The stellar inclination ($i_* = 54^{+11}_{-9} {}^\circ$) and orbital inclination ($i_o = 50^{+9}_{-12} {}^\circ$) are in good agreement, which is consistent with the system having spin-orbit alignment. AF Lep b is the lowest-mass imaged planet with a dynamical mass measurement and highlights the promise of using astrometric accelerations as a tool to find and characterize long-period planets.
△ Less
Submitted 25 May, 2023; v1 submitted 10 February, 2023;
originally announced February 2023.
-
Open data from the third observing run of LIGO, Virgo, KAGRA and GEO
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné,
A. Allocca
, et al. (1719 additional authors not shown)
Abstract:
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti…
▽ More
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Imitating Human Behaviour with Diffusion Models
Authors:
Tim Pearce,
Tabish Rashid,
Anssi Kanervisto,
Dave Bignell,
Mingfei Sun,
Raluca Georgescu,
Sergio Valcarcel Macua,
Shan Zheng Tan,
Ida Momennejad,
Katja Hofmann,
Sam Devlin
Abstract:
Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their ex…
▽ More
Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their expressiveness and may introduce bias into the cloned policy. We begin by pointing out the limitations of these choices. We then propose that diffusion models are an excellent fit for imitating human behaviour, since they learn an expressive distribution over the joint action space. We introduce several innovations to make diffusion models suitable for sequential environments; designing suitable architectures, investigating the role of guidance, and develo** reliable sampling strategies. Experimentally, diffusion models closely match human demonstrations in a simulated robotic control task and a modern 3D gaming environment.
△ Less
Submitted 3 March, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
ISPY-NACO Imaging Survey for Planets around Young stars. The demographics of forming planets embedded in protoplanetary disks
Authors:
Gabriele Cugno,
Timothy D. Pearce,
Ralf Launhardt,
Markus. J. Bonse,
Jie. Ma,
Thomas Henning,
Andreas Quirrenbach,
Damien Ségransan,
Elisabeth C. Matthews,
Sascha P. Quanz,
Grant M. Kennedy,
André Müller,
Sabine Reffert,
Emily L. Rickman
Abstract:
We present the statistical analysis of a subsample of 45 young stars surrounded by protoplanetary disks (PPDs). This is the largest imaging survey uniquely focused on PPDs to date. Our goal is to search for young forming companions embedded in the disk material and to constrain their occurrence rate in relation to the formation mechanism. We used principal component analysis based point spread fun…
▽ More
We present the statistical analysis of a subsample of 45 young stars surrounded by protoplanetary disks (PPDs). This is the largest imaging survey uniquely focused on PPDs to date. Our goal is to search for young forming companions embedded in the disk material and to constrain their occurrence rate in relation to the formation mechanism. We used principal component analysis based point spread function subtraction techniques to reveal young companions forming in the disks. We calculated detection limits for our datasets and adopted a black-body model to derive temperature upper limits of potential forming planets. We then used Monte Carlo simulations to constrain the population of forming gas giant companions and compare our results to different types of formation scenarios. Our data revealed a new binary system (HD38120) and a recently identified triple system with a brown dwarf companion orbiting a binary system (HD101412), in addition to 12 known companions. Furthermore, we detected signals from 17 disks, two of which (HD72106 and TCrA) were imaged for the first time. We reached median detection limits of L =15.4 mag at 2.0 arcsec, which were used to investigate the temperature of potentially embedded forming companions. We can constrain the occurrence of forming planets with semi-major axis a in [20 - 500] au and Teff in [600 - 3000] K, in line with the statistical results obtained for more evolved systems from other direct imaging surveys. The NaCo-ISPY data confirm that massive bright planets accreting at high rates are rare. More powerful instruments with better sensitivity in the near- to mid-infrared are likely required to unveil the wealth of forming planets sculpting the observed disk substructures.
△ Less
Submitted 28 November, 2022;
originally announced November 2022.
-
Hot exozodis: cometary supply without trap** is unlikely to be the mechanism
Authors:
Tim D. Pearce,
Florian Kirchschlager,
Gaël Rouillé,
Steve Ertel,
Alexander Bensberg,
Alexander V. Krivov,
Mark Booth,
Sebastian Wolf,
Jean-Charles Augereau
Abstract:
Excess near-infrared emission is detected around one fifth of main-sequence stars, but its nature is a mystery. These excesses are interpreted as thermal emission from populations of small, hot dust very close to their stars (`hot exozodis'), but such grains should rapidly sublimate or be blown out of the system. To date, no model has fully explained this phenomenon. One mechanism commonly suggest…
▽ More
Excess near-infrared emission is detected around one fifth of main-sequence stars, but its nature is a mystery. These excesses are interpreted as thermal emission from populations of small, hot dust very close to their stars (`hot exozodis'), but such grains should rapidly sublimate or be blown out of the system. To date, no model has fully explained this phenomenon. One mechanism commonly suggested in the literature is cometary supply, where star-grazing comets deposit dust close to the star, replenishing losses from grain sublimation and blowout. However, we show that this mechanism alone is very unlikely to be responsible for hot exozodis. We model the trajectory and size evolution of dust grains released by star-grazing comets, to establish the dust and comet properties required to reproduce hot-exozodi observations. We find that cometary supply alone can only reproduce observations if dust ejecta has an extremely steep size distribution upon release, and the dust-deposition rate is extraordinarily high. These requirements strongly contradict our current understanding of cometary dust and planetary systems. Cometary supply is therefore unlikely to be solely responsible for hot exozodis, so may need to be combined with some dust-trap** mechanism (such as gas or magnetic trap**) if it is to reproduce observations.
△ Less
Submitted 22 September, 2022;
originally announced September 2022.
-
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Authors:
Wentse Chen,
Shiyu Huang,
Yuan Chiang,
Tim Pearce,
Wei-Wei Tu,
Ting Chen,
Jun Zhu
Abstract:
Most reinforcement learning algorithms seek a single optimal strategy that solves a given task. However, it can often be valuable to learn a diverse set of solutions, for instance, to make an agent's interaction with users more engaging, or improve the robustness of a policy to an unexpected perturbance. We propose Diversity-Guided Policy Optimization (DGPO), an on-policy algorithm that discovers…
▽ More
Most reinforcement learning algorithms seek a single optimal strategy that solves a given task. However, it can often be valuable to learn a diverse set of solutions, for instance, to make an agent's interaction with users more engaging, or improve the robustness of a policy to an unexpected perturbance. We propose Diversity-Guided Policy Optimization (DGPO), an on-policy algorithm that discovers multiple strategies for solving a given task. Unlike prior work, it achieves this with a shared policy network trained over a single run. Specifically, we design an intrinsic reward based on an information-theoretic diversity objective. Our final objective alternately constraints on the diversity of the strategies and on the extrinsic reward. We solve the constrained optimization problem by casting it as a probabilistic inference task and use policy iteration to maximize the derived lower bound. Experimental results show that our method efficiently discovers diverse strategies in a wide variety of reinforcement learning tasks. Compared to baseline methods, DGPO achieves comparable rewards, while discovering more diverse strategies, and often with better sample efficiency.
△ Less
Submitted 5 January, 2024; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Censored Quantile Regression Neural Networks for Distribution-Free Survival Analysis
Authors:
Tim Pearce,
Jong-Hyeon Jeong,
Yichen Jia,
Jun Zhu
Abstract:
This paper considers doing quantile regression on censored data using neural networks (NNs). This adds to the survival analysis toolkit by allowing direct prediction of the target variable, along with a distribution-free characterisation of uncertainty, using a flexible function approximator. We begin by showing how an algorithm popular in linear models can be applied to NNs. However, the resultin…
▽ More
This paper considers doing quantile regression on censored data using neural networks (NNs). This adds to the survival analysis toolkit by allowing direct prediction of the target variable, along with a distribution-free characterisation of uncertainty, using a flexible function approximator. We begin by showing how an algorithm popular in linear models can be applied to NNs. However, the resulting procedure is inefficient, requiring sequential optimisation of an individual NN at each desired quantile. Our major contribution is a novel algorithm that simultaneously optimises a grid of quantiles output by a single NN. To offer theoretical insight into our algorithm, we show firstly that it can be interpreted as a form of expectation-maximisation, and secondly that it exhibits a desirable `self-correcting' property. Experimentally, the algorithm produces quantiles that are better calibrated than existing methods on 10 out of 12 real datasets.
△ Less
Submitted 6 February, 2023; v1 submitted 26 May, 2022;
originally announced May 2022.
-
Gap carving by a migrating planet embedded in a massive debris disc
Authors:
Marc F. Friebe,
Tim D. Pearce,
Torsten Löhne
Abstract:
When considering gaps in debris discs, a typical approach is to invoke clearing by an unseen planet within the gap, and derive the planet mass using Wisdom overlap or Hill radius arguments. However, this approach can be invalid if the disc is massive, because this clearing would also cause planet migration. This could result in a calculated planet mass that is incompatible with the inferred disc m…
▽ More
When considering gaps in debris discs, a typical approach is to invoke clearing by an unseen planet within the gap, and derive the planet mass using Wisdom overlap or Hill radius arguments. However, this approach can be invalid if the disc is massive, because this clearing would also cause planet migration. This could result in a calculated planet mass that is incompatible with the inferred disc mass, because the predicted planet would in reality be too small to carve the gap without significant migration. We investigate the gap that a single embedded planet would carve in a massive debris disc. We show that a degeneracy is introduced, whereby an observed gap could be carved by two different planets: either a high-mass, barely-migrating planet, or a smaller planet that clears debris as it migrates. We find that, depending on disc mass, there is a minimum possible gap width that an embedded planet could carve (because smaller planets, rather than carving a smaller gap, would actually migrate through the disc and clear a wider region). We provide simple formulae for the planet-to-debris disc mass ratio at which planet migration becomes important, the gap width that an embedded planet would carve in a massive debris disc, and the interaction timescale. We also apply our results to various systems, and in particular show that the disc of HD 107146 can be reasonably well-reproduced with a migrating, embedded planet. Finally, we discuss the importance of planet-debris disc interactions as a tool for constraining debris disc masses.
△ Less
Submitted 7 March, 2022;
originally announced March 2022.
-
Planet populations inferred from debris discs: insights from 178 debris systems in the ISPY, LEECH and LIStEN planet-hunting surveys
Authors:
Tim D. Pearce,
Ralf Launhardt,
Robert Ostermann,
Grant M. Kennedy,
Mario Gennaro,
Mark Booth,
Alexander V. Krivov,
Gabriele Cugno,
Thomas K. Henning,
Andreas Quirrenbach,
Arianna Musso Barcucci,
Elisabeth C. Matthews,
Henrik L. Ruh,
Jordan M. Stone
Abstract:
We know little about the outermost exoplanets in planetary systems, because our detection methods are insensitive to moderate-mass planets on wide orbits. However, debris discs can probe the outer-planet population, because dynamical modelling of observed discs can reveal properties of perturbing planets. We use four sculpting and stirring arguments to infer planet properties in 178 debris-disc sy…
▽ More
We know little about the outermost exoplanets in planetary systems, because our detection methods are insensitive to moderate-mass planets on wide orbits. However, debris discs can probe the outer-planet population, because dynamical modelling of observed discs can reveal properties of perturbing planets. We use four sculpting and stirring arguments to infer planet properties in 178 debris-disc systems from the ISPY, LEECH and LIStEN planet-hunting surveys. Similar analyses are often conducted for individual discs, but we consider a large sample in a consistent manner. We aim to predict the population of wide-separation planets, gain insight into the formation and evolution histories of planetary systems, and determine the feasibility of detecting these planets in the near future. We show that a `typical' cold debris disc likely requires a Neptune- to Saturn-mass planet at 10-100 au, with some needing Jupiter-mass perturbers. Our predicted planets are currently undetectable, but modest detection-limit improvements (e.g. from JWST) should reveal many such perturbers. We find that planets thought to be perturbing debris discs at late times are similar to those inferred to be forming in protoplanetary discs, so these could be the same population if newly formed planets do not migrate as far as currently thought. Alternatively, young planets could rapidly sculpt debris before migrating inwards, meaning that the responsible planets are more massive (and located further inwards) than debris-disc studies assume. We combine self-stirring and size-distribution modelling to show that many debris discs cannot be self-stirred without having unreasonably high masses; planet- or companion-stirring may therefore be the dominant mechanism in many (perhaps all) debris discs. Finally, we provide catalogues of planet predictions, and identify promising targets for future planet searches.
△ Less
Submitted 20 January, 2022;
originally announced January 2022.
-
Sensitive Samples Revisited: Detecting Neural Network Attacks Using Constraint Solvers
Authors:
Amel Nestor Docena,
Thomas Wahl,
Trevor Pearce,
Yunsi Fei
Abstract:
Neural Networks are used today in numerous security- and safety-relevant domains and are, as such, a popular target of attacks that subvert their classification capabilities, by manipulating the network parameters. Prior work has introduced sensitive samples -- inputs highly sensitive to parameter changes -- to detect such manipulations, and proposed a gradient ascent-based approach to compute the…
▽ More
Neural Networks are used today in numerous security- and safety-relevant domains and are, as such, a popular target of attacks that subvert their classification capabilities, by manipulating the network parameters. Prior work has introduced sensitive samples -- inputs highly sensitive to parameter changes -- to detect such manipulations, and proposed a gradient ascent-based approach to compute them. In this paper we offer an alternative, using symbolic constraint solvers. We model the network and a formal specification of a sensitive sample in the language of the solver and ask for a solution. This approach supports a rich class of queries, corresponding, for instance, to the presence of certain types of attacks. Unlike earlier techniques, our approach does not depend on convex search domains, or on the suitability of a starting point for the search. We address the performance limitations of constraint solvers by partitioning the search space for the solver, and exploring the partitions according to a balanced schedule that still retains completeness of the search. We demonstrate the impact of the use of solvers in terms of functionality and search efficiency, using a case study for the detection of Trojan attacks on Neural Networks.
△ Less
Submitted 6 September, 2021;
originally announced September 2021.
-
Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection
Authors:
Bang Xiang Yong,
Tim Pearce,
Alexandra Brintrup
Abstract:
After an autoencoder (AE) has learnt to reconstruct one dataset, it might be expected that the likelihood on an out-of-distribution (OOD) input would be low. This has been studied as an approach to detect OOD inputs. Recent work showed this intuitive approach can fail for the dataset pairs FashionMNIST vs MNIST. This paper suggests this is due to the use of Bernoulli likelihood and analyses why th…
▽ More
After an autoencoder (AE) has learnt to reconstruct one dataset, it might be expected that the likelihood on an out-of-distribution (OOD) input would be low. This has been studied as an approach to detect OOD inputs. Recent work showed this intuitive approach can fail for the dataset pairs FashionMNIST vs MNIST. This paper suggests this is due to the use of Bernoulli likelihood and analyses why this is the case, proposing two fixes: 1) Compute the uncertainty of likelihood estimate by using a Bayesian version of the AE. 2) Use alternative distributions to model the likelihood.
△ Less
Submitted 28 July, 2021;
originally announced July 2021.
-
High resolution ALMA and HST images of q$^1$ Eri: an asymmetric debris disc with an eccentric Jupiter
Authors:
J. B. Lovell,
S. Marino,
M. C. Wyatt,
G. M. Kennedy,
M. A. MacGregor,
K. Stapelfeldt,
B. Dent,
J. Krist,
L. Matrà,
Q. Kral,
O. Panić,
T. D. Pearce,
D. Wilner
Abstract:
We present \textit{ALMA} 1.3 mm and 0.86 mm observations of the nearby (17.34 pc) F9V star q1 Eri (HD 10647, HR 506). This system, with age ${\sim}1.4$ Gyr, hosts a ${\sim}2$ au radial velocity planet and a debris disc with the highest fractional luminosity of the closest 300 FGK type stars. The \textit{ALMA} images, with resolution ${\sim}0.5''$, reveal a broad (34{-}134 au) belt of millimeter em…
▽ More
We present \textit{ALMA} 1.3 mm and 0.86 mm observations of the nearby (17.34 pc) F9V star q1 Eri (HD 10647, HR 506). This system, with age ${\sim}1.4$ Gyr, hosts a ${\sim}2$ au radial velocity planet and a debris disc with the highest fractional luminosity of the closest 300 FGK type stars. The \textit{ALMA} images, with resolution ${\sim}0.5''$, reveal a broad (34{-}134 au) belt of millimeter emission inclined by $76.7{\pm}1.0$ degrees with maximum brightness at $81.6{\pm}0.5$ au. The images reveal an asymmetry, with higher flux near the southwest ansa, which is also closer to the star. Scattered light observed with the Hubble Space Telescope is also asymmetric, being more radially extended to the northeast. We fit the millimeter emission with parametric models and place constraints on the disc morphology, radius, width, dust mass, and scale height. We find the southwest ansa asymmetry is best fitted by an extended clump on the inner edge of the disc, consistent with perturbations from a planet with mass $8 M_{\oplus} {-} 11 M_{\rm Jup}$ at ${\sim}60$ au that may have migrated outwards, similar to Neptune in our Solar System. If the measured vertical aspect ratio of $h{=}0.04{\pm}0.01$ is due to dynamical interactions in the disc, then this requires perturbers with sizes ${>}1200$ km. We find tentative evidence for an 0.86 mm excess within 10 au, $70{\pm}22\, μ$Jy, that may be due to an inner planetesimal belt. We find no evidence for CO gas, but set an upper bound on the CO gas mass of $4{\times}10^{-6}$ M$_{\oplus}$ ($3\,σ$), consistent with cometary abundances in the Solar System.
△ Less
Submitted 10 June, 2021;
originally announced June 2021.
-
Understanding Softmax Confidence and Uncertainty
Authors:
Tim Pearce,
Alexandra Brintrup,
Jun Zhu
Abstract:
It is often remarked that neural networks fail to increase their uncertainty when predicting on data far from the training distribution. Yet naively using softmax confidence as a proxy for uncertainty achieves modest success in tasks exclusively testing for this, e.g., out-of-distribution (OOD) detection. This paper investigates this contradiction, identifying two implicit biases that do encourage…
▽ More
It is often remarked that neural networks fail to increase their uncertainty when predicting on data far from the training distribution. Yet naively using softmax confidence as a proxy for uncertainty achieves modest success in tasks exclusively testing for this, e.g., out-of-distribution (OOD) detection. This paper investigates this contradiction, identifying two implicit biases that do encourage softmax confidence to correlate with epistemic uncertainty: 1) Approximately optimal decision boundary structure, and 2) Filtering effects of deep networks. It describes why low-dimensional intuitions about softmax confidence are misleading. Diagnostic experiments quantify reasons softmax confidence can fail, finding that extrapolations are less to blame than overlap between training and OOD data in final-layer representations. Pre-trained/fine-tuned networks reduce this overlap.
△ Less
Submitted 9 June, 2021;
originally announced June 2021.
-
Counter-Strike Deathmatch with Large-Scale Behavioural Cloning
Authors:
Tim Pearce,
Jun Zhu
Abstract:
This paper describes an AI agent that plays the popular first-person-shooter (FPS) video game `Counter-Strike; Global Offensive' (CSGO) from pixel input. The agent, a deep neural network, matches the performance of the medium difficulty built-in AI on the deathmatch game mode, whilst adopting a humanlike play style. Unlike much prior work in games, no API is available for CSGO, so algorithms must…
▽ More
This paper describes an AI agent that plays the popular first-person-shooter (FPS) video game `Counter-Strike; Global Offensive' (CSGO) from pixel input. The agent, a deep neural network, matches the performance of the medium difficulty built-in AI on the deathmatch game mode, whilst adopting a humanlike play style. Unlike much prior work in games, no API is available for CSGO, so algorithms must train and run in real-time. This limits the quantity of on-policy data that can be generated, precluding many reinforcement learning algorithms. Our solution uses behavioural cloning - training on a large noisy dataset scraped from human play on online servers (4 million frames, comparable in size to ImageNet), and a smaller dataset of high-quality expert demonstrations. This scale is an order of magnitude larger than prior work on imitation learning in FPS games.
△ Less
Submitted 9 December, 2021; v1 submitted 9 April, 2021;
originally announced April 2021.
-
Fomalhaut b could be massive and sculpting the narrow, eccentric debris disc, if in mean-motion resonance with it
Authors:
Tim D. Pearce,
Hervé Beust,
Virginie Faramaz,
Mark Booth,
Alexander V. Krivov,
Torsten Löhne,
Pedro P. Poblete
Abstract:
The star Fomalhaut hosts a narrow, eccentric debris disc, plus a highly eccentric companion Fomalhaut b. It is often argued that Fomalhaut b cannot have significant mass, otherwise it would quickly perturb the disc. We show that material in internal mean-motion resonances with a massive, coplanar Fomalhaut b would actually be long-term stable, and occupy orbits similar to the observed debris. Furt…
▽ More
The star Fomalhaut hosts a narrow, eccentric debris disc, plus a highly eccentric companion Fomalhaut b. It is often argued that Fomalhaut b cannot have significant mass, otherwise it would quickly perturb the disc. We show that material in internal mean-motion resonances with a massive, coplanar Fomalhaut b would actually be long-term stable, and occupy orbits similar to the observed debris. Furthermore, millimetre dust released in collisions between resonant bodies could reproduce the width, shape and orientation of the observed disc. We first re-examine the possible orbits of Fomalhaut b, assuming that it moves under gravity alone. If Fomalhaut b orbits close to the disc midplane then its orbit crosses the disc, and the two are apsidally aligned. This alignment may hint at an ongoing dynamical interaction. Using the observationally allowed orbits, we then model the interaction between a massive Fomalhaut b and debris. Whilst most debris is unstable in such an extreme configuration, we identify several resonant populations that remain stable for the stellar lifetime, despite crossing the orbit of Fomalhaut b. This debris occupies low-eccentricity orbits similar to the observed debris ring. These resonant bodies would have a clumpy distribution, but dust released in collisions between them would form a narrow, relatively smooth ring similar to observations. We show that if Fomalhaut b has a mass between those of Earth and Jupiter then, far from removing the observed debris, it could actually be sculpting it through resonant interactions.
△ Less
Submitted 8 March, 2021;
originally announced March 2021.
-
A Bayesian social platform for inclusive and evidence-based decision making
Authors:
Susannah Kate Devitt,
Tamara Rose Pearce,
Alok Kumar Chowdhury,
Kerrie Mengersen
Abstract:
Against the backdrop of a social media reckoning, this paper seeks to demonstrate the potential of social tools to build virtuous behaviours online. We must assume that human behaviour is flawed, the truth can be elusive, and as communities we must commit to mechanisms to encourage virtuous social digital behaviours. Societies that use social platforms should be inclusive, responsive to evidence,…
▽ More
Against the backdrop of a social media reckoning, this paper seeks to demonstrate the potential of social tools to build virtuous behaviours online. We must assume that human behaviour is flawed, the truth can be elusive, and as communities we must commit to mechanisms to encourage virtuous social digital behaviours. Societies that use social platforms should be inclusive, responsive to evidence, limit punitive actions and allow productive discord and respectful disagreement. Social media success, we argue, is in the hypothesis. Documents are valuable to the degree that they are evidence in service of, or to challenge an idea for a purpose. We outline how a Bayesian social platform can facilitate virtuous behaviours to build evidence-based collective rationality. The chapter outlines the epistemic architecture of the platform's algorithms and user interface in conjunction with explicit community management to ensure psychological safety. The BetterBeliefs platform rewards users who demonstrate epistemically virtuous behaviours and exports evidence-based propositions for decision-making. A Bayesian social network can make virtuous ideas powerful.
△ Less
Submitted 13 February, 2021;
originally announced February 2021.
-
Resolving the outer ring of HD 38206 using ALMA and constraining limits on planets in the system
Authors:
Mark Booth,
Michael Schulz,
Alexander V. Krivov,
Sebastián Marino,
Tim D. Pearce,
Ralf Launhardt
Abstract:
HD 38206 is an A0V star in the Columba association, hosting a debris disc first discovered by IRAS. Further observations by Spitzer and Herschel showed that the disc has two components, likely analogous to the asteroid and Kuiper belts of the Solar System. The young age of this star makes it a prime target for direct imaging planet searches. Possible planets in the system can be constrained using…
▽ More
HD 38206 is an A0V star in the Columba association, hosting a debris disc first discovered by IRAS. Further observations by Spitzer and Herschel showed that the disc has two components, likely analogous to the asteroid and Kuiper belts of the Solar System. The young age of this star makes it a prime target for direct imaging planet searches. Possible planets in the system can be constrained using the debris disc. Here we present the first ALMA observations of the system's Kuiper belt and fit them using a forward modelling MCMC approach. We detect an extended disc of dust peaking at around 180 au with a width of 140 au. The disc is close to edge on and shows tentative signs of an asymmetry best fit by an eccentricity of $0.25^{+0.10}_{-0.09}$. We use the fitted parameters to determine limits on the masses of planets interior to the cold belt. We determine that a minimum of four planets are required, each with a minimum mass of 0.64 M$_J$, in order to clear the gap between the asteroid and Kuiper belts of the system. If we make the assumption that the outermost planet is responsible for the stirring of the disc, the location of its inner edge and the eccentricity of the disc, then we can more tightly predict its eccentricity, mass and semimajor axis to be $e_{\rm{p}}=0.34^{+0.20}_{-0.13}$, $m_{\rm{p}}=0.7^{+0.5}_{-0.3}\,\rm{M}_{\rm{J}}$ and $a_{\rm{p}}=76^{+12}_{-13}\,\rm{au}$.
△ Less
Submitted 27 October, 2020;
originally announced October 2020.
-
Gas trap** of hot dust around main-sequence stars
Authors:
Tim D. Pearce,
Alexander V. Krivov,
Mark Booth
Abstract:
In 2006 Vega was discovered to display excess near-infrared emission. Surveys now detect this phenomenon for one fifth of main-sequence stars, across various spectral types and ages. The excesses are interpreted as populations of small, hot dust grains very close to their stars, which must originate from comets or asteroids. However, the presence of such grains in copious amounts is mysterious, si…
▽ More
In 2006 Vega was discovered to display excess near-infrared emission. Surveys now detect this phenomenon for one fifth of main-sequence stars, across various spectral types and ages. The excesses are interpreted as populations of small, hot dust grains very close to their stars, which must originate from comets or asteroids. However, the presence of such grains in copious amounts is mysterious, since they should rapidly sublimate or be blown out of the system. Here we investigate a potential mechanism to generate excesses: dust migrating inwards under radiation forces sublimates near the star, releasing modest quantities of gas which then traps subsequent grains. This mechanism requires neither specialised system architectures nor high dust supply rates, and could operate across diverse stellar types and ages. The model naturally reproduces many features of inferred dust populations, in particular their location, preference for small grains, steep size distribution, and dust location scaling with stellar luminosity. For Sun-like stars the mechanism can produce 2.2 micron excesses that are an order of magnitude larger than those at 8.5 micron, as required by observations. However, for A-type stars the simulated near-infrared excesses were only twice those in the mid infrared; grains would have to be 5-10 times smaller than those trapped in our model to be able to explain observed near-infrared excesses around A stars. Further progress with any hot dust explanation for A stars requires a means for grains to become very hot without either rapidly sublimating or being blown out of the system.
△ Less
Submitted 17 August, 2020;
originally announced August 2020.
-
Structured Weight Priors for Convolutional Neural Networks
Authors:
Tim Pearce,
Andrew Y. K. Foong,
Alexandra Brintrup
Abstract:
Selection of an architectural prior well suited to a task (e.g. convolutions for image data) is crucial to the success of deep neural networks (NNs). Conversely, the weight priors within these architectures are typically left vague, e.g.~independent Gaussian distributions, which has led to debate over the utility of Bayesian deep learning. This paper explores the benefits of adding structure to we…
▽ More
Selection of an architectural prior well suited to a task (e.g. convolutions for image data) is crucial to the success of deep neural networks (NNs). Conversely, the weight priors within these architectures are typically left vague, e.g.~independent Gaussian distributions, which has led to debate over the utility of Bayesian deep learning. This paper explores the benefits of adding structure to weight priors. It initially considers first-layer filters of a convolutional NN, designing a prior based on random Gabor filters. Second, it considers adding structure to the prior of final-layer weights by estimating how each hidden feature relates to each class. Empirical results suggest that these structured weight priors lead to more meaningful functional priors for image data. This contributes to the ongoing discussion on the importance of weight priors.
△ Less
Submitted 12 July, 2020;
originally announced July 2020.
-
Musical Features for Automatic Music Transcription Evaluation
Authors:
Adrien Ycart,
Lele Liu,
Emmanouil Benetos,
Marcus T. Pearce
Abstract:
This technical report gives a detailed, formal description of the features introduced in the paper: Adrien Ycart, Lele Liu, Emmanouil Benetos and Marcus T. Pearce. "Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription", Transactions of the International Society for Music Information Retrieval (TISMIR), Accepted, 2020.
This technical report gives a detailed, formal description of the features introduced in the paper: Adrien Ycart, Lele Liu, Emmanouil Benetos and Marcus T. Pearce. "Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription", Transactions of the International Society for Music Information Retrieval (TISMIR), Accepted, 2020.
△ Less
Submitted 15 April, 2020;
originally announced April 2020.
-
Avoiding Kernel Fixed Points: Computing with ELU and GELU Infinite Networks
Authors:
Russell Tsuchida,
Tim Pearce,
Chris van der Heide,
Fred Roosta,
Marcus Gallagher
Abstract:
Analysing and computing with Gaussian processes arising from infinitely wide neural networks has recently seen a resurgence in popularity. Despite this, many explicit covariance functions of networks with activation functions used in modern networks remain unknown. Furthermore, while the kernels of deep networks can be computed iteratively, theoretical understanding of deep kernels is lacking, par…
▽ More
Analysing and computing with Gaussian processes arising from infinitely wide neural networks has recently seen a resurgence in popularity. Despite this, many explicit covariance functions of networks with activation functions used in modern networks remain unknown. Furthermore, while the kernels of deep networks can be computed iteratively, theoretical understanding of deep kernels is lacking, particularly with respect to fixed-point dynamics. Firstly, we derive the covariance functions of multi-layer perceptrons (MLPs) with exponential linear units (ELU) and Gaussian error linear units (GELU) and evaluate the performance of the limiting Gaussian processes on some benchmarks. Secondly, and more generally, we analyse the fixed-point dynamics of iterated kernels corresponding to a broad range of activation functions. We find that unlike some previously studied neural network kernels, these new kernels exhibit non-trivial fixed-point dynamics which are mirrored in finite-width neural networks. The fixed point behaviour present in some networks explains a mechanism for implicit regularisation in overparameterised deep models. Our results relate to both the static iid parameter conjugate kernel and the dynamic neural tangent kernel constructions. Software at github.com/RussellTsuchida/ELU_GELU_kernels.
△ Less
Submitted 28 February, 2021; v1 submitted 19 February, 2020;
originally announced February 2020.
-
Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions
Authors:
Tim Pearce,
Russell Tsuchida,
Mohamed Zaki,
Alexandra Brintrup,
Andy Neely
Abstract:
A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e.g. summing a periodic and linear kernel can capture seasonal variation with a long term trend. Despite a well-studied link between GPs and Bayesian neural networks (BNNs), the BNN analogue of this has not yet been explored. This paper derives BNN archit…
▽ More
A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e.g. summing a periodic and linear kernel can capture seasonal variation with a long term trend. Despite a well-studied link between GPs and Bayesian neural networks (BNNs), the BNN analogue of this has not yet been explored. This paper derives BNN architectures mirroring such kernel combinations. Furthermore, it shows how BNNs can produce periodic kernels, which are often useful in this context. These ideas provide a principled approach to designing BNNs that incorporate prior knowledge about a function. We showcase the practical value of these ideas with illustrative experiments in supervised and reinforcement learning settings.
△ Less
Submitted 28 June, 2019; v1 submitted 15 May, 2019;
originally announced May 2019.
-
Metabolomics in the Cloud: Scaling Computational Tools to Big Data
Authors:
Jianliang Gao,
Noureddin Sadawi,
Ibrahim Karaman,
Jake T M Pearce,
Pablo Moreno,
Anders Larsson,
Marco Capuccini,
Paul Elliott,
Jeremy K Nicholson,
Timothy M D Ebbels,
Robert Glen
Abstract:
Background: Metabolomics datasets are becoming increasingly large and complex, with multiple types of algorithms and workflows needed to process and analyse the data. A cloud infrastructure with portable software tools can provide much needed resources enabling faster processing of much larger datasets than would be possible at any individual lab. The PhenoMeNal project has developed such an infra…
▽ More
Background: Metabolomics datasets are becoming increasingly large and complex, with multiple types of algorithms and workflows needed to process and analyse the data. A cloud infrastructure with portable software tools can provide much needed resources enabling faster processing of much larger datasets than would be possible at any individual lab. The PhenoMeNal project has developed such an infrastructure, allowing users to run analyses on local or commercial cloud platforms. We have examined the computational scaling behaviour of the PhenoMeNal platform using four different implementations across 1-1000 virtual CPUs using two common metabolomics tools.
Results: Our results show that data which takes up to 4 days to process on a standard desktop computer can be processed in just 10 min on the largest cluster. Improved runtimes come at the cost of decreased efficiency, with all platforms falling below 80% efficiency above approximately 1/3 of the maximum number of vCPUs. An economic analysis revealed that running on large scale cloud platforms is cost effective compared to traditional desktop systems.
Conclusions: Overall, cloud implementations of PhenoMeNal show excellent scalability for standard metabolomics computing tasks on a range of platforms, making them a compelling choice for research computing in metabolomics.
△ Less
Submitted 9 April, 2019; v1 submitted 3 April, 2019;
originally announced April 2019.
-
Bayesian Neural Network Ensembles
Authors:
Tim Pearce,
Mohamed Zaki,
Andy Neely
Abstract:
Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset. The variance of the ensemble's predictions is interpreted as its epistemic uncertainty. The appeal of ensembling stems from being a collection of regular NNs - this makes them both scalable an…
▽ More
Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset. The variance of the ensemble's predictions is interpreted as its epistemic uncertainty. The appeal of ensembling stems from being a collection of regular NNs - this makes them both scalable and easily implementable. They have achieved strong empirical results in recent years, often presented as a practical alternative to more costly Bayesian NNs (BNNs). The departure from Bayesian methodology is of concern since the Bayesian framework provides a principled, widely-accepted approach to handling uncertainty. In this extended abstract we derive and implement a modified NN ensembling scheme, which provides a consistent estimator of the Bayesian posterior in wide NNs - regularising parameters about values drawn from a prior distribution.
△ Less
Submitted 27 November, 2018;
originally announced November 2018.
-
Uncertainty in Neural Networks: Approximately Bayesian Ensembling
Authors:
Tim Pearce,
Felix Leibfried,
Alexandra Brintrup,
Mohamed Zaki,
Andy Neely
Abstract:
Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesi…
▽ More
Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesian. This work proposes one modification to the usual process that we argue does result in approximate Bayesian inference; regularising parameters about values drawn from a distribution which can be set equal to the prior. A theoretical analysis of the procedure in a simplified setting suggests the recovered posterior is centred correctly but tends to have an underestimated marginal variance, and overestimated correlation. However, two conditions can lead to exact recovery. We argue that these conditions are partially present in NNs. Empirical evaluations demonstrate it has an advantage over standard ensembling, and is competitive with variational methods.
△ Less
Submitted 26 February, 2020; v1 submitted 12 October, 2018;
originally announced October 2018.
-
An energy-based generative sequence model for testing sensory theories of Western harmony
Authors:
Peter M. C. Harrison,
Marcus T. Pearce
Abstract:
The relationship between sensory consonance and Western harmony is an important topic in music theory and psychology. We introduce new methods for analysing this relationship, and apply them to large corpora representing three prominent genres of Western music: classical, popular, and jazz music. These methods centre on a generative sequence model with an exponential-family energy-based form that…
▽ More
The relationship between sensory consonance and Western harmony is an important topic in music theory and psychology. We introduce new methods for analysing this relationship, and apply them to large corpora representing three prominent genres of Western music: classical, popular, and jazz music. These methods centre on a generative sequence model with an exponential-family energy-based form that predicts chord sequences from continuous features. We use this model to investigate one aspect of instantaneous consonance (harmonicity) and two aspects of sequential consonance (spectral distance and voice-leading distance). Applied to our three musical genres, the results generally support the relationship between sensory consonance and harmony, but lead us to question the high importance attributed to spectral distance in the psychological literature. We anticipate that our methods will provide a useful platform for future work linking music psychology to music theory.
△ Less
Submitted 2 July, 2018;
originally announced July 2018.
-
Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning
Authors:
Tim Pearce,
Nicolas Anastassacos,
Mohamed Zaki,
Andy Neely
Abstract:
The use of ensembles of neural networks (NNs) for the quantification of predictive uncertainty is widespread. However, the current justification is intuitive rather than analytical. This work proposes one minor modification to the normal ensembling methodology, which we prove allows the ensemble to perform Bayesian inference, hence converging to the corresponding Gaussian Process as both the total…
▽ More
The use of ensembles of neural networks (NNs) for the quantification of predictive uncertainty is widespread. However, the current justification is intuitive rather than analytical. This work proposes one minor modification to the normal ensembling methodology, which we prove allows the ensemble to perform Bayesian inference, hence converging to the corresponding Gaussian Process as both the total number of NNs, and the size of each, tend to infinity. This working paper provides early-stage results in a reinforcement learning setting, analysing the practicality of the technique for an ensemble of small, finite number. Using the uncertainty estimates produced by anchored ensembles to govern the exploration-exploitation process results in steadier, more stable learning.
△ Less
Submitted 2 July, 2018; v1 submitted 29 May, 2018;
originally announced May 2018.
-
High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach
Authors:
Tim Pearce,
Mohamed Zaki,
Alexandra Brintrup,
Andy Neely
Abstract:
This paper considers the generation of prediction intervals (PIs) by neural networks for quantifying uncertainty in regression tasks. It is axiomatic that high-quality PIs should be as narrow as possible, whilst capturing a specified portion of data. We derive a loss function directly from this axiom that requires no distributional assumption. We show how its form derives from a likelihood princip…
▽ More
This paper considers the generation of prediction intervals (PIs) by neural networks for quantifying uncertainty in regression tasks. It is axiomatic that high-quality PIs should be as narrow as possible, whilst capturing a specified portion of data. We derive a loss function directly from this axiom that requires no distributional assumption. We show how its form derives from a likelihood principle, that it can be used with gradient descent, and that model uncertainty is accounted for in ensembled form. Benchmark experiments show the method outperforms current state-of-the-art uncertainty quantification methods, reducing average PI width by over 10%.
△ Less
Submitted 15 June, 2018; v1 submitted 20 February, 2018;
originally announced February 2018.
-
Effects of pitch and timing expectancy on musical emotion
Authors:
Sarah A. Sauvé,
Aminah Sayed,
Roger T. Dean,
Marcus T. Pearce
Abstract:
Pitch and timing information work hand in hand to create a coherent piece of music; but what happens when this information goes against the norm? Relationships between musical expectancy and emotional responses were investigated in a study conducted with 40 participants: 20 musicians and 20 non-musicians. Participants took part in one of two behavioural paradigms measuring continuous expectancy or…
▽ More
Pitch and timing information work hand in hand to create a coherent piece of music; but what happens when this information goes against the norm? Relationships between musical expectancy and emotional responses were investigated in a study conducted with 40 participants: 20 musicians and 20 non-musicians. Participants took part in one of two behavioural paradigms measuring continuous expectancy or emotional responses (arousal and valence) while listening to folk melodies that exhibited either high or low pitch predictability and high or low onset predictability. The causal influence of pitch predictability was investigated in an additional condition where pitch was artificially manipulated and a comparison conducted between original and manipulated forms; the dynamic correlative influence of pitch and timing information and its perception on emotional change during listening was evaluated using cross-sectional time series analysis. The results indicate that pitch and onset predictability are consistent predictors of perceived expectancy and emotional response, with onset carrying more weight than pitch. In addition, musicians and non-musicians do not differ in their responses, possibly due to shared cultural background and knowledge. The results demonstrate in a controlled lab-based setting a precise, quantitative relationship between the predictability of musical structure, expectation and emotional response.
△ Less
Submitted 11 August, 2017;
originally announced August 2017.
-
Attention but not musical training affects auditory streaming
Authors:
Sarah A. Sauvé,
Marcus T. Pearce
Abstract:
While musicians generally perform better than non-musicians in various auditory discrimination tasks, effects of specific instrumental training have received little attention. The effects of instrument-specific musical training on auditory grou** in the context of stream segregation are investigated here in three experiments. In Experiment 1a, participants listened to sequences of ABA tones and…
▽ More
While musicians generally perform better than non-musicians in various auditory discrimination tasks, effects of specific instrumental training have received little attention. The effects of instrument-specific musical training on auditory grou** in the context of stream segregation are investigated here in three experiments. In Experiment 1a, participants listened to sequences of ABA tones and indicated when they heard a change in rhythm. This change is caused by the manipulation of the B tones' timbre and indexes a change in perception from integration to segregation, or vice versa. While it was expected that musicians would detect a change in rhythm earlier when their own instrument was involved, no such pattern was observed. In Experiment 1b, designed to control for potential expectation effects in Experiment 1a, participants heard sequences of static ABA tones and reported their initial perceptions, whether the sequence was integrated or segregated. Results show that participants tend to initially perceive these static sequences as segregated, and that perception is influenced by similarity between the timbres involved. Finally, in Experiment 2 violinists and flautists located mistuned notes in an interleaved melody paradigm containing a violin and a flute melody. Performance did not depend on the instrument the participant played but rather which melody their attention was directed to. Taken together, results from the three experiments suggest that the specific instrument one practices does not have an influence on auditory grou**, but attentional mechanisms are necessary for processing auditory scenes.
△ Less
Submitted 11 August, 2017;
originally announced August 2017.
-
An M-dwarf star in the transition disk of Herbig HD 142527; Physical parameters and orbital elements
Authors:
S. Lacour,
B. Biller,
A. Cheetham,
A. Greenbaum,
T. Pearce,
S. Marino,
P. Tuthill,
L. Pueyo,
E. E. Mamajek,
J. H. Girard,
A. Sivaramakrishnan,
M. Bonnefoy,
I. Baraffe,
G. Chauvin,
J. Olofsson,
A. Juhasz,
M. Benisty,
J. -U. Pott,
A. Sicilia-Aguilar,
T. Henning,
A. Cardwell,
S. Goodsell,
J. R. Graham,
P. Hibon,
P. Ingraham
, et al. (7 additional authors not shown)
Abstract:
HD 142527A is one of the most studied Herbig Ae/Be stars with a transitional disk, as it has the largest imaged gap in any protoplanetary disk: the gas is cleared from 30 to 90 AU. The HD142527 system is also unique in that it has a stellar companion with a small mass compared to the mass of the primary star. This factor of ~20 in mass ratio between the two objects makes this binary system differe…
▽ More
HD 142527A is one of the most studied Herbig Ae/Be stars with a transitional disk, as it has the largest imaged gap in any protoplanetary disk: the gas is cleared from 30 to 90 AU. The HD142527 system is also unique in that it has a stellar companion with a small mass compared to the mass of the primary star. This factor of ~20 in mass ratio between the two objects makes this binary system different from any other YSO. The HD142527 system could therefore provide a valuable test bed. This low-mass stellar object may be responsible for both the gap and dust trap** observed by ALMA at longer distances. We observed this system with the NACO and GPI instruments using the aperture masking technique. Aperture masking is ideal for providing high dynamic range even at very small angular separations. We present the spectral energy distribution for HD142527A and B. Brightness of the companion is now known from the R band up to the M' band. We also followed the orbital motion of HD 142527B over a period of more than two years. The SED of the companion is compatible with a T=3000+/-100K object in addition to a 1700K blackbody environment (likely a circus-secondary disk). From evolution models, we find that it is compatible with an object of mass 0.13+/-0.03Msun, radius 0.90+/-0.15Rsun, and age $1.0^{+1.0}_{-0.75}$Myr. This age is significantly younger than the age previously estimated for HD142527A. Computations to constrain the orbital parameters found a semi major axis of $140^{+120}_{-70}$mas, an eccentricity of 0.5+/-0.2, an inclination of 125+/-15 degrees, and a position angle of the right ascending node of -5+/-40 degrees. Inclination and position angle of the ascending node are in agreement with an orbit coplanar with the inner disk, not coplanar with the outer disk. Despite its high eccentricity, it is unlikely that HD142527B is responsible for truncating the inner edge of the outer disk.
△ Less
Submitted 22 June, 2016; v1 submitted 30 November, 2015;
originally announced November 2015.
-
Double-ringed debris discs could be the work of eccentric planets: explaining the strange morphology of HD 107146
Authors:
Tim Pearce,
Mark Wyatt
Abstract:
We investigate the general interaction between an eccentric planet and a coplanar debris disc of the same mass, using analytical theory and n-body simulations. Such an interaction could result from a planet-planet scattering or merging event. We show that when the planet mass is comparable to that of the disc, the former is often circularised with little change to its semimajor axis. The secular e…
▽ More
We investigate the general interaction between an eccentric planet and a coplanar debris disc of the same mass, using analytical theory and n-body simulations. Such an interaction could result from a planet-planet scattering or merging event. We show that when the planet mass is comparable to that of the disc, the former is often circularised with little change to its semimajor axis. The secular effect of such a planet can cause debris to apsidally anti-align with the planet's orbit (the opposite of what may be naively expected), leading to the counter-intuitive result that a low-mass planet may clear a larger region of debris than a higher-mass body would. The interaction generally results in a double-ringed debris disc, which is comparable to those observed in HD 107146 and HD 92945. As an example we apply our results to HD 107146, and show that the disc's morphology and surface brightness profile can be well-reproduced if the disc is interacting with an eccentric planet of comparable mass (~10-100 Earth masses). This hypothetical planet had a pre-interaction semimajor axis of 30 or 40 au (similar to its present-day value) and an eccentricity of 0.4 or 0.5 (which would since have reduced to ~0.1). Thus the planet (if it exists) presently resides near the inner edge of the disc, rather than between the two debris peaks as may otherwise be expected. Finally we show that disc self-gravity can be important in this mass regime and, whilst it would not affect these results significantly, it should be considered when probing the interaction between a debris disc and a planet.
△ Less
Submitted 15 July, 2015;
originally announced July 2015.
-
Constraining the orbits of sub-stellar companions imaged over short orbital arcs
Authors:
Tim D. Pearce,
Mark C. Wyatt,
Grant M. Kennedy
Abstract:
Imaging a star's companion at multiple epochs over a short orbital arc provides only four of the six coordinates required for a unique orbital solution. Probability distributions of possible solutions are commonly generated by Monte Carlo (MCMC) analysis, but these are biased by priors and may not probe the full parameter space. We suggest alternative methods to characterise possible orbits, which…
▽ More
Imaging a star's companion at multiple epochs over a short orbital arc provides only four of the six coordinates required for a unique orbital solution. Probability distributions of possible solutions are commonly generated by Monte Carlo (MCMC) analysis, but these are biased by priors and may not probe the full parameter space. We suggest alternative methods to characterise possible orbits, which compliment the MCMC technique. Firstly the allowed ranges of orbital elements are prior-independent, and we provide means to calculate these ranges without numerical analyses. Hence several interesting constraints (including whether a companion even can be bound, its minimum possible semi-major axis and its minimum eccentricity) may be quickly computed using our relations as soon as orbital motion is detected. We also suggest an alternative to posterior probability distributions as a means to present possible orbital elements, namely contour plots of elements as functions of line of sight coordinates. These plots are prior-independent, readily show degeneracies between elements and allow readers to extract orbital solutions themselves. This approach is particularly useful when there are other constraints on the geometry, for example if a companion's orbit is assumed to be aligned with a disc. As examples we apply our methods to several imaged sub-stellar companions including Fomalhaut b, and for the latter object we show how different origin hypotheses affect its possible orbital solutions. We also examine visual companions of A- and G-type main sequence stars in the Washington Double Star Catalogue, and show that $\gtrsim50$ per cent must be unbound.
△ Less
Submitted 6 February, 2015;
originally announced February 2015.