-
Counterfactual Explanations via Locally-guided Sequential Algorithmic Recourse
Authors:
Edward A. Small,
Jeffrey N. Clark,
Christopher J. McWilliams,
Kacper Sokol,
Jeffrey Chan,
Flora D. Salim,
Raul Santos-Rodriguez
Abstract:
Counterfactuals operationalised through algorithmic recourse have become a powerful tool to make artificial intelligence systems explainable. Conceptually, given an individual classified as y -- the factual -- we seek actions such that their prediction becomes the desired class y' -- the counterfactual. This process offers algorithmic recourse that is (1) easy to customise and interpret, and (2) d…
▽ More
Counterfactuals operationalised through algorithmic recourse have become a powerful tool to make artificial intelligence systems explainable. Conceptually, given an individual classified as y -- the factual -- we seek actions such that their prediction becomes the desired class y' -- the counterfactual. This process offers algorithmic recourse that is (1) easy to customise and interpret, and (2) directly aligned with the goals of each individual. However, the properties of a "good" counterfactual are still largely debated; it remains an open challenge to effectively locate a counterfactual along with its corresponding recourse. Some strategies use gradient-driven methods, but these offer no guarantees on the feasibility of the recourse and are open to adversarial attacks on carefully created manifolds. This can lead to unfairness and lack of robustness. Other methods are data-driven, which mostly addresses the feasibility problem at the expense of privacy, security and secrecy as they require access to the entire training data set. Here, we introduce LocalFACE, a model-agnostic technique that composes feasible and actionable counterfactual explanations using locally-acquired information at each step of the algorithmic recourse. Our explainer preserves the privacy of users by only leveraging data that it specifically requires to construct actionable algorithmic recourse, and protects the model by offering transparency solely in the regions deemed necessary for the intervention.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
PulsarX: a new pulsar searching package -I. A high performance folding program for pulsar surveys
Authors:
Yunpeng Men,
Ewan Barr,
C. J. Clark,
Emma Carli,
Gregory Desvignes
Abstract:
Pulsar surveys with modern radio telescopes are becoming increasingly computationally demanding. This is particularly true for wide field-of-view pulsar surveys with radio interferometers, and those conducted in real or quasi-real time. These demands result in data analysis bottlenecks that can limit the parameter space covered by the surveys and diminish their scientific return. In this paper, we…
▽ More
Pulsar surveys with modern radio telescopes are becoming increasingly computationally demanding. This is particularly true for wide field-of-view pulsar surveys with radio interferometers, and those conducted in real or quasi-real time. These demands result in data analysis bottlenecks that can limit the parameter space covered by the surveys and diminish their scientific return. In this paper, we address the computational challenge of `candidate folding' in pulsar searching, presenting a novel, efficient approach designed to optimise the simultaneous folding of large numbers of pulsar candidates. We provide a complete folding pipeline appropriate for large-scale pulsar surveys including radio frequency interference (RFI) mitigation, dedispersion, folding and parameter optimization. By leveraging the Fast Discrete Dispersion Measure Transform (FDMT) algorithm proposed by Zackay et al. (2017), we have developed an optimized, and cache-friendly implementation that we term the pruned FDMT (pFDMT). The pFDMT approach efficiently reuses intermediate processing results and prunes the unused computation paths, resulting in a significant reduction in arithmetic operations. In addition, we propose a novel folding algorithm based on the Tikhonov-regularised least squares method (TLSM) that can improve the time resolution of the pulsar profile. We present the performance of its real-world application as an integral part of two major pulsar search projects conducted with the MeerKAT telescope: the MPIfR-MeerKAT Galactic Plane Survey (MMGPS) and the Transients and Pulsars with MeerKAT (TRAPUM) project. In our processing, for approximately 500 candidates, the theoretical number of dedispersion operations can be reduced by a factor of around 50 when compared to brute-force dedispersion, which scales with the number of candidates.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Concerning the Verity of the MMRD Relation for Novae
Authors:
Allen W. Shafter,
J. Grace Clark,
Kamil Hornoch
Abstract:
It has long been claimed that novae reaching the highest luminosity at the peak of their eruptions appear to fade the fastest from maximum light. The relationship between peak brightness and fade rate is known as the Maximum-Magnitude, Rate-of-Decline (MMRD) relation. Lightcurve parameters for the most recent sample of M31 recurrent novae are presented and used to buttress the case that the observ…
▽ More
It has long been claimed that novae reaching the highest luminosity at the peak of their eruptions appear to fade the fastest from maximum light. The relationship between peak brightness and fade rate is known as the Maximum-Magnitude, Rate-of-Decline (MMRD) relation. Lightcurve parameters for the most recent sample of M31 recurrent novae are presented and used to buttress the case that the observed MMRD relation can be explained as a consequence of observational selection effects coupled with expectations from standard nova models.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run
Authors:
C. Fletcher,
J. Wood,
R. Hamburg,
P. Veres,
C. M. Hui,
E. Bissaldi,
M. S. Briggs,
E. Burns,
W. H. Cleveland,
M. M. Giles,
A. Goldstein,
B. A. Hristov,
D. Kocevski,
S. Lesage,
B. Mailyan,
C. Malacaria,
S. Poolakkil,
A. von Kienlin,
C. A. Wilson-Hodge,
The Fermi Gamma-ray Burst Monitor Team,
M. Crnogorčević,
J. DeLaunay,
A. Tohuvavohu,
R. Caputo,
S. B. Cenko
, et al. (1674 additional authors not shown)
Abstract:
We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses,…
▽ More
We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses, the Targeted Search and the Untargeted Search, we investigate whether there are any coincident GRBs associated with the GWs. We also search the Swift-BAT rate data around the GW times to determine whether a GRB counterpart is present. No counterparts are found. Using both the Fermi-GBM Targeted Search and the Swift-BAT search, we calculate flux upper limits and present joint upper limits on the gamma-ray luminosity of each GW. Given these limits, we constrain theoretical models for the emission of gamma-rays from binary black hole mergers.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Asymmetric phase diagram and dimensional crossover in a system of spin-1/2 dimers under applied hydrostatic pressure
Authors:
M. J. Coak,
S. P. M. Curley,
Z. Hawkhead,
J. P. Tidey,
D. Graf,
S. J. Clark,
P. Sengupta,
Z. E. Manson,
T. Lancaster,
P. A. Goddard,
J. L. Manson
Abstract:
We present the magnetic and structural properties of [Cu(pyrazine)$_{0.5}$(glycine)]ClO$_4$ under applied pressure. As previously reported, at ambient pressure this material consists of quasi-two-dimensional layers of weakly coupled antiferromagnetic dimers which undergo Bose-Einstein condensation of triplet excitations between two magnetic field-induced quantum critical points (QCPs). The molecul…
▽ More
We present the magnetic and structural properties of [Cu(pyrazine)$_{0.5}$(glycine)]ClO$_4$ under applied pressure. As previously reported, at ambient pressure this material consists of quasi-two-dimensional layers of weakly coupled antiferromagnetic dimers which undergo Bose-Einstein condensation of triplet excitations between two magnetic field-induced quantum critical points (QCPs). The molecular building blocks from which the compound is constructed give rise to exchange strengths that are considerably lower than those found in other $S = 1/2$ dimer materials, which allows us to determine the pressure evolution of the entire field-temperature magnetic phase diagram using radio-frequency magnetometry. We find that a distinct phase emerges above the upper field-induced transition at elevated pressures and also show that an additional QCP is induced at zero-field at a critical pressure of $p_{\rm c} = 15.7(5)$ kbar. Pressure-dependent single-crystal X-ray diffraction and density functional theory calculations indicate that this QCP arises primarily from a dimensional crossover driven by an increase in the interdimer interactions between the planes. While the effect of quantum fluctuations on the lower field-induced transition is enhanced with applied pressure, quantum Monte Carlo calculations suggest that this alone cannot explain an unconventional asymmetry that develops in the phase diagram.
△ Less
Submitted 21 November, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
Ultrafast Radiographic Imaging and Tracking: An overview of instruments, methods, data, and applications
Authors:
Zhehui Wang,
Andrew F. T. Leong,
Angelo Dragone,
Arianna E. Gleason,
Rafael Ballabriga,
Christopher Campbell,
Michael Campbell,
Samuel J. Clark,
Cinzia Da Vià,
Dana M. Dattelbaum,
Marcel Demarteau,
Lorenzo Fabris,
Kamel Fezzaa,
Eric R. Fossum,
Sol M. Gruner,
Todd Hufnagel,
Xiaolu Ju,
Ke Li,
Xavier Llopart,
Bratislav Lukić,
Alexander Rack,
Joseph Strehlow,
Audrey C. Therrien,
Julia Thom-Levy,
Feixiang Wang
, et al. (3 additional authors not shown)
Abstract:
Ultrafast radiographic imaging and tracking (U-RadIT) use state-of-the-art ionizing particle and light sources to experimentally study sub-nanosecond dynamic processes in physics, chemistry, biology, geology, materials science and other fields. These processes, fundamental to nuclear fusion energy, advanced manufacturing, green transportation and others, often involve one mole or more atoms, and t…
▽ More
Ultrafast radiographic imaging and tracking (U-RadIT) use state-of-the-art ionizing particle and light sources to experimentally study sub-nanosecond dynamic processes in physics, chemistry, biology, geology, materials science and other fields. These processes, fundamental to nuclear fusion energy, advanced manufacturing, green transportation and others, often involve one mole or more atoms, and thus are challenging to compute by using the first principles of quantum physics or other forward models. One of the central problems in U-RadIT is to optimize information yield through, e.g. high-luminosity X-ray and particle sources, efficient imaging and tracking detectors, novel methods to collect data, and large-bandwidth online and offline data processing, regulated by the underlying physics, statistics, and computing power. We review and highlight recent progress in: a.) Detectors; b.) U-RadIT modalities; c.) Data and algorithms; and d.) Applications. Hardware-centric approaches to U-RadIT optimization are constrained by detector material properties, low signal-to-noise ratio, high cost and long development cycles of critical hardware components such as ASICs. Interpretation of experimental data, including comparisons with forward models, is frequently hindered by sparse measurements, model and measurement uncertainties, and noise. Alternatively, U-RadIT make increasing use of data science and machine learning algorithms, including experimental implementations of compressed sensing. Machine learning and artificial intelligence approaches, refined by physics and materials information, may also contribute significantly to data interpretation, uncertainty quantification, and U-RadIT optimization.
△ Less
Submitted 4 September, 2023; v1 submitted 21 August, 2023;
originally announced August 2023.
-
The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation
Authors:
Patrick Fernandes,
Daniel Deutsch,
Mara Finkelstein,
Parker Riley,
André F. T. Martins,
Graham Neubig,
Ankush Garg,
Jonathan H. Clark,
Markus Freitag,
Orhan Firat
Abstract:
Automatic evaluation of machine translation (MT) is a critical tool driving the rapid iterative development of MT systems. While considerable progress has been made on estimating a single scalar quality score, current metrics lack the informativeness of more detailed schemes that annotate individual errors, such as Multidimensional Quality Metrics (MQM). In this paper, we help fill this gap by pro…
▽ More
Automatic evaluation of machine translation (MT) is a critical tool driving the rapid iterative development of MT systems. While considerable progress has been made on estimating a single scalar quality score, current metrics lack the informativeness of more detailed schemes that annotate individual errors, such as Multidimensional Quality Metrics (MQM). In this paper, we help fill this gap by proposing AutoMQM, a prompting technique which leverages the reasoning and in-context learning capabilities of large language models (LLMs) and asks them to identify and categorize errors in translations. We start by evaluating recent LLMs, such as PaLM and PaLM-2, through simple score prediction prompting, and we study the impact of labeled data through in-context learning and finetuning. We then evaluate AutoMQM with PaLM-2 models, and we find that it improves performance compared to just prompting for scores (with particularly large gains for larger models) while providing interpretability through error spans that align with human annotations.
△ Less
Submitted 14 August, 2023;
originally announced August 2023.
-
Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi
, et al. (1750 additional authors not shown)
Abstract:
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect…
▽ More
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
The Third Fermi Large Area Telescope Catalog of Gamma-ray Pulsars
Authors:
David A. Smith,
Philippe Bruel,
Colin J. Clark,
Lucas Guillemot,
Matthew T. Kerr,
Paul Ray,
Soheila Abdollahi,
Marco Ajello,
Luca Baldini,
Jean Ballet,
Matthew Baring,
Cees Bassa,
Josefa Becerra Gonzalez,
Ronaldo Bellazzini,
Alessandra Berretta,
Bhaswati Bhattacharyya,
Elisabetta Bissaldi,
Raffaella Bonino,
Eugenio Bottacini,
Johan Bregeon,
Marta Burgay,
Toby Burnett,
Rob Cameron,
Fernando Camilo,
Regina Caputo
, et al. (134 additional authors not shown)
Abstract:
We present 294 pulsars found in GeV data from the Large Area Telescope (LAT) on the Fermi Gamma-ray Space Telescope. Another 33 millisecond pulsars (MSPs) discovered in deep radio searches of LAT sources will likely reveal pulsations once phase-connected rotation ephemerides are achieved. A further dozen optical and/or X-ray binary systems co-located with LAT sources also likely harbor gamma-ray M…
▽ More
We present 294 pulsars found in GeV data from the Large Area Telescope (LAT) on the Fermi Gamma-ray Space Telescope. Another 33 millisecond pulsars (MSPs) discovered in deep radio searches of LAT sources will likely reveal pulsations once phase-connected rotation ephemerides are achieved. A further dozen optical and/or X-ray binary systems co-located with LAT sources also likely harbor gamma-ray MSPs. This catalog thus reports roughly 340 gamma-ray pulsars and candidates, 10% of all known pulsars, compared to $\leq 11$ known before Fermi. Half of the gamma-ray pulsars are young. Of these, the half that are undetected in radio have a broader Galactic latitude distribution than the young radio-loud pulsars. The others are MSPs, with 6 undetected in radio. Overall, >235 are bright enough above 50 MeV to fit the pulse profile, the energy spectrum, or both. For the common two-peaked profiles, the gamma-ray peak closest to the magnetic pole crossing generally has a softer spectrum. The spectral energy distributions tend to narrow as the spindown power $\dot E$ decreases to its observed minimum near $10^{33}$ erg s$^{-1}$, approaching the shape for synchrotron radiation from monoenergetic electrons. We calculate gamma-ray luminosities when distances are available. Our all-sky gamma-ray sensitivity map is useful for population syntheses. The electronic catalog version provides gamma-ray pulsar ephemerides, properties and fit results to guide and be compared with modeling results.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
The Effect of Data Visualisation Quality and Task Density on Human-Swarm Interaction
Authors:
Ayodeji O. Abioye,
Mohammad Naiseh,
William Hunt,
Jediah Clark,
Sarvapali D. Ramchurn,
Mohammad D. Soorati
Abstract:
Despite the advantages of having robot swarms, human supervision is required for real-world applications. The performance of the human-swarm system depends on several factors including the data availability for the human operators. In this paper, we study the human factors aspect of the human-swarm interaction and investigate how having access to high-quality data can affect the performance of the…
▽ More
Despite the advantages of having robot swarms, human supervision is required for real-world applications. The performance of the human-swarm system depends on several factors including the data availability for the human operators. In this paper, we study the human factors aspect of the human-swarm interaction and investigate how having access to high-quality data can affect the performance of the human-swarm system - the number of tasks completed and the human trust level in operation. We designed an experiment where a human operator is tasked to operate a swarm to identify casualties in an area within a given time period. One group of operators had the option to request high-quality pictures while the other group had to base their decision on the available low-quality images. We performed a user study with 120 participants and recorded their success rate (directly logged via the simulation platform) as well as their workload and trust level (measured through a questionnaire after completing a human-swarm scenario). The findings from our study indicated that the group granted access to high-quality data exhibited an increased workload and placed greater trust in the swarm, thus confirming our initial hypothesis. However, we also found that the number of accurately identified casualties did not significantly vary between the two groups, suggesting that data quality had no impact on the successful completion of tasks.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Towards Measuring the Representation of Subjective Global Opinions in Language Models
Authors:
Esin Durmus,
Karina Nguyen,
Thomas I. Liao,
Nicholas Schiefer,
Amanda Askell,
Anton Bakhtin,
Carol Chen,
Zac Hatfield-Dodds,
Danny Hernandez,
Nicholas Joseph,
Liane Lovitt,
Sam McCandlish,
Orowa Sikder,
Alex Tamkin,
Janel Thamkul,
Jared Kaplan,
Jack Clark,
Deep Ganguli
Abstract:
Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to evaluate whose opinions model-generated responses are more similar to. We first build a dataset, GlobalOpinionQA, comprised of questions and answers from cross-national surveys designed to capture diverse opinions on global issues across dif…
▽ More
Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to evaluate whose opinions model-generated responses are more similar to. We first build a dataset, GlobalOpinionQA, comprised of questions and answers from cross-national surveys designed to capture diverse opinions on global issues across different countries. Next, we define a metric that quantifies the similarity between LLM-generated survey responses and human responses, conditioned on country. With our framework, we run three experiments on an LLM trained to be helpful, honest, and harmless with Constitutional AI. By default, LLM responses tend to be more similar to the opinions of certain populations, such as those from the USA, and some European and South American countries, highlighting the potential for biases. When we prompt the model to consider a particular country's perspective, responses shift to be more similar to the opinions of the prompted populations, but can reflect harmful cultural stereotypes. When we translate GlobalOpinionQA questions to a target language, the model's responses do not necessarily become the most similar to the opinions of speakers of those languages. We release our dataset for others to use and build on. Our data is at https://huggingface.co/datasets/Anthropic/llm_global_opinions. We also provide an interactive visualization at https://llmglobalvalues.anthropic.com.
△ Less
Submitted 11 April, 2024; v1 submitted 28 June, 2023;
originally announced June 2023.
-
On planar Brownian motion singularly tilted through a point potential
Authors:
Jeremy Clark,
Barkat Mian
Abstract:
We discuss a family of time-inhomogeneous two-dimensional diffusions, defined over a finite time interval $[0,T]$, having transition density functions that are expressible in terms of the integral kernels for negative exponentials of the two-dimensional Schrödinger operator with a point potential at the origin. These diffusions have a singular drift pointing in the direction of the origin that is…
▽ More
We discuss a family of time-inhomogeneous two-dimensional diffusions, defined over a finite time interval $[0,T]$, having transition density functions that are expressible in terms of the integral kernels for negative exponentials of the two-dimensional Schrödinger operator with a point potential at the origin. These diffusions have a singular drift pointing in the direction of the origin that is strong enough to enable the possibly of visiting there, in contrast to a two-dimensional Brownian motion. Our main focus is on characterizing a local time process at the origin analogous to that for a one-dimensional Brownian motion and on studying the law of its process inverse.
△ Less
Submitted 2 July, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Magnetic Dirac semimetal state of (Mn,Ge)Bi$_2$Te$_4$
Authors:
Alexander S. Frolov,
Dmitry Yu. Usachov,
Artem V. Tarasov,
Alexander V. Fedorov,
Kirill A. Bokai,
Ilya Klimovskikh,
Vasily S. Stolyarov,
Anton I. Sergeev,
Alexander N. Lavrov,
Vladimir A. Golyashov,
Oleg E. Tereshchenko,
Giovanni Di Santo,
Luca Petaccia,
Oliver J. Clark,
Jaime Sanchez-Barriga,
Lada V. Yashina
Abstract:
For quantum electronics, the possibility to finely tune the properties of magnetic topological insulators (TIs) is a key issue. We studied solid solutions between two isostructural Z$_2$ TIs, magnetic MnBi$_2$Te$_4$ and nonmagnetic GeBi$_2$Te$_4$, with Z$_2$ invariants of 1;000 and 1;001, respectively. For high-quality, large mixed crystals of Ge$_x$Mn$_{1-x}$Bi$_2$Te$_4$, we observed linear x-dep…
▽ More
For quantum electronics, the possibility to finely tune the properties of magnetic topological insulators (TIs) is a key issue. We studied solid solutions between two isostructural Z$_2$ TIs, magnetic MnBi$_2$Te$_4$ and nonmagnetic GeBi$_2$Te$_4$, with Z$_2$ invariants of 1;000 and 1;001, respectively. For high-quality, large mixed crystals of Ge$_x$Mn$_{1-x}$Bi$_2$Te$_4$, we observed linear x-dependent magnetic properties, composition-independent pairwise exchange interactions along with an easy magnetization axis. The bulk band gap gradually decreases to zero for $x$ from 0 to 0.4, before reopening for $x>0.6$, evidencing topological phase transitions (TPTs) between topologically nontrivial phases and the semimetal state. The TPTs are driven purely by the variation of orbital contributions. By tracing the x-dependent $6p$ contribution to the states near the fundamental gap, the effective spin-orbit coupling variation is extracted. As $x$ varies, the maximum of this contribution switches from the valence to the conduction band, thereby driving two TPTs. The gapless state observed at $x=0.42$ closely resembles a Dirac semimetal above the Neel temperature and shows a magnetic gap below, which is clearly visible in raw photoemission data. The observed behavior of the Ge$_x$Mn$_{1-x}$Bi$_2$Te$_4$ system thereby demonstrates an ability to precisely control topological and magnetic properties of TIs.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
A unified quasiparticle approach to the theory of strongly correlated electron liquids
Authors:
V. A. Khodel,
J. W. Clark,
M. V. Zverev
Abstract:
Landau's quasiparticle formalism is generalized to describe a wide class of strongly correlated Fermi systems, in addition to conventional Fermi liquids. This class includes (i) so-called marginal exemplars and (ii) systems that harbor interaction-driven flat bands, in both of which manifestations of non-Fermi-liquid behavior are well documented. Specifically, the advent of such flat bands is attr…
▽ More
Landau's quasiparticle formalism is generalized to describe a wide class of strongly correlated Fermi systems, in addition to conventional Fermi liquids. This class includes (i) so-called marginal exemplars and (ii) systems that harbor interaction-driven flat bands, in both of which manifestations of non-Fermi-liquid behavior are well documented. Specifically, the advent of such flat bands is attributed to a spontaneous topological rearrangement of the Landau state that supplements the conventional Landau quasiparticle picture with a different set of quasiparticles, the so-called fermion condensate, whose single-particle spectrum is dispersionless. The celebrated Landau-Luttinger theorem is extended to marginal Fermi liquids, in which the density of the augmented quasiparticle system is shown to coincide with the particle density. On the other hand, the total density of a system hosting an interaction-driven flat band turns out to be the sum of the densities of the two quasiparticle subsystems: the Landau-like component and the fermion condensate. We demonstrate that within the framework of the scenario proposed, a long-standing problem faced by theories of $D$-wave superconductivity in cuprates, namely a consistent explanation of the so-called Uemera plot, can be naturally resolved.
△ Less
Submitted 10 March, 2024; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Model evaluation for extreme risks
Authors:
Toby Shevlane,
Sebastian Farquhar,
Ben Garfinkel,
Mary Phuong,
Jess Whittlestone,
Jade Leung,
Daniel Kokotajlo,
Nahema Marchal,
Markus Anderljung,
Noam Kolt,
Lewis Ho,
Divya Siddarth,
Shahar Avin,
Will Hawkins,
Been Kim,
Iason Gabriel,
Vijay Bolina,
Jack Clark,
Yoshua Bengio,
Paul Christiano,
Allan Dafoe
Abstract:
Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further progress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify danger…
▽ More
Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further progress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify dangerous capabilities (through "dangerous capability evaluations") and the propensity of models to apply their capabilities for harm (through "alignment evaluations"). These evaluations will become critical for kee** policymakers and other stakeholders informed, and for making responsible decisions about model training, deployment, and security.
△ Less
Submitted 22 September, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Evaluating and Modeling Attribution for Cross-Lingual Question Answering
Authors:
Benjamin Muller,
John Wieting,
Jonathan H. Clark,
Tom Kwiatkowski,
Sebastian Ruder,
Livio Baldini Soares,
Roee Aharoni,
Jonathan Herzig,
Xinyi Wang
Abstract:
Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve tr…
▽ More
Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve trustworthiness in these systems, a promising direction is to attribute the answer to a retrieved source, possibly in a content-rich language different from the query. Our work is the first to study attribution for cross-lingual question answering. First, we collect data in 5 languages to assess the attribution level of a state-of-the-art cross-lingual QA system. To our surprise, we find that a substantial portion of the answers is not attributable to any retrieved passages (up to 50% of answers exactly matching a gold reference) despite the system being able to attend directly to the retrieved text. Second, to address this poor attribution level, we experiment with a wide range of attribution detection techniques. We find that Natural Language Inference models and PaLM 2 fine-tuned on a very small amount of attribution data can accurately detect attribution. Based on these models, we improve the attribution level of a cross-lingual question-answering system. Overall, we show that current academic generative cross-lingual QA systems have substantial shortcomings in attribution and we build tooling to mitigate these issues.
△ Less
Submitted 15 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Authors:
Sebastian Ruder,
Jonathan H. Clark,
Alexander Gutkin,
Mihir Kale,
Min Ma,
Massimo Nicosia,
Shruti Rijhwani,
Parker Riley,
Jean-Michel A. Sarr,
Xinyi Wang,
John Wieting,
Nitish Gupta,
Anna Katanova,
Christo Kirov,
Dana L. Dickinson,
Brian Roark,
Bidisha Samanta,
Connie Tao,
David I. Adelani,
Vera Axelrod,
Isaac Caswell,
Colin Cherry,
Dan Garrette,
Reeve Ingle,
Melvin Johnson
, et al. (2 additional authors not shown)
Abstract:
Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP re-search is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot;…
▽ More
Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP re-search is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot; its focus on user-centric tasks -- tasks with broad adoption by speakers of high-resource languages; and its focus on under-represented languages where this scarce-data scenario tends to be most realistic. XTREME-UP evaluates the capabilities of language models across 88 under-represented languages over 9 key user-centric technologies including ASR, OCR, MT, and information access tasks that are of general utility. We create new datasets for OCR, autocomplete, semantic parsing, and transliteration, and build on and refine existing datasets for other tasks. XTREME-UP provides methodology for evaluating many modeling scenarios including text-only, multi-modal (vision, audio, and text),supervised parameter tuning, and in-context learning. We evaluate commonly used models on the benchmark. We release all code and scripts to train and evaluate models
△ Less
Submitted 24 May, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
PaLM 2 Technical Report
Authors:
Rohan Anil,
Andrew M. Dai,
Orhan Firat,
Melvin Johnson,
Dmitry Lepikhin,
Alexandre Passos,
Siamak Shakeri,
Emanuel Taropa,
Paige Bailey,
Zhifeng Chen,
Eric Chu,
Jonathan H. Clark,
Laurent El Shafey,
Yan** Huang,
Kathy Meier-Hellstern,
Gaurav Mishra,
Erica Moreira,
Mark Omernick,
Kevin Robinson,
Sebastian Ruder,
Yi Tay,
Kefan Xiao,
Yuanzhong Xu,
Yu**g Zhang,
Gustavo Hernandez Abrego
, et al. (103 additional authors not shown)
Abstract:
We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on…
▽ More
We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on downstream tasks across different model sizes, while simultaneously exhibiting faster and more efficient inference compared to PaLM. This improved efficiency enables broader deployment while also allowing the model to respond faster, for a more natural pace of interaction. PaLM 2 demonstrates robust reasoning capabilities exemplified by large improvements over PaLM on BIG-Bench and other reasoning tasks. PaLM 2 exhibits stable performance on a suite of responsible AI evaluations, and enables inference-time control over toxicity without additional overhead or impact on other capabilities. Overall, PaLM 2 achieves state-of-the-art performance across a diverse set of tasks and capabilities.
When discussing the PaLM 2 family, it is important to distinguish between pre-trained models (of various sizes), fine-tuned variants of these models, and the user-facing products that use these models. In particular, user-facing products typically include additional pre- and post-processing steps. Additionally, the underlying models may evolve over time. Therefore, one should not expect the performance of user-facing products to exactly match the results reported in this report.
△ Less
Submitted 13 September, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Band-filling-controlled magnetism from transition metal intercalation in $N_{1/3}$NbS$_2$ revealed with first-principles calculations
Authors:
Z. Hawkhead,
T. J. Hicken,
N. P. Bentley,
B. M. Huddart,
S. J. Clark,
T. Lancaster
Abstract:
We present a first-principles study of the effect of 3$d$ transition metal intercalation on the magnetic properties of the 2H-NbS$_2$ system, using spin-resolved density functional theory calculations to investigate the electronic structure of $N_{1/3}$NbS$_2$ ($N$ = Ti, V, Cr, Mn, Fe, Co, Ni).
We are able to accurately determine the magnetic moments and crystal field splitting, and find that th…
▽ More
We present a first-principles study of the effect of 3$d$ transition metal intercalation on the magnetic properties of the 2H-NbS$_2$ system, using spin-resolved density functional theory calculations to investigate the electronic structure of $N_{1/3}$NbS$_2$ ($N$ = Ti, V, Cr, Mn, Fe, Co, Ni).
We are able to accurately determine the magnetic moments and crystal field splitting, and find that the magnetic properties of the materials are determined by a mechanism based on filling rigid bands with electrons from the intercalant.
We predict the dominant magnetic interaction of these materials by considering Fermi surface nesting, finding agreement with experiment where data are available.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Authors:
Odunayo Ogundepo,
Tajuddeen R. Gwadabe,
Clara E. Rivera,
Jonathan H. Clark,
Sebastian Ruder,
David Ifeoluwa Adelani,
Bonaventure F. P. Dossou,
Abdou Aziz DIOP,
Claytone Sikasote,
Gilles Hacheme,
Happy Buzaaba,
Ignatius Ezeani,
Rooweither Mabuya,
Salomey Osei,
Chris Emezue,
Albert Njoroge Kahira,
Shamsuddeen H. Muhammad,
Akintunde Oladipo,
Abraham Toluwase Owodunni,
Atnafu Lambebo Tonja,
Iyanuoluwa Shode,
Akari Asai,
Tunde Oluwaseyi Ajayi,
Clemencia Siro,
Steven Arthur
, et al. (27 additional authors not shown)
Abstract:
African languages have far less in-language content available digitally, making it challenging for question answering systems to satisfy the information needs of users. Cross-lingual open-retrieval question answering (XOR QA) systems -- those that retrieve answer content from other languages while serving people in their native language -- offer a means of filling this gap. To this end, we create…
▽ More
African languages have far less in-language content available digitally, making it challenging for question answering systems to satisfy the information needs of users. Cross-lingual open-retrieval question answering (XOR QA) systems -- those that retrieve answer content from other languages while serving people in their native language -- offer a means of filling this gap. To this end, we create AfriQA, the first cross-lingual QA dataset with a focus on African languages. AfriQA includes 12,000+ XOR QA examples across 10 African languages. While previous datasets have focused primarily on languages where cross-lingual QA augments coverage from the target language, AfriQA focuses on languages where cross-lingual answer content is the only high-coverage source of answer content. Because of this, we argue that African languages are one of the most important and realistic use cases for XOR QA. Our experiments demonstrate the poor performance of automatic translation and multilingual retrieval methods. Overall, AfriQA proves challenging for state-of-the-art QA models. We hope that the dataset enables the development of more equitable QA technology.
△ Less
Submitted 11 May, 2023;
originally announced May 2023.
-
Predicting nuclear masses with product-unit networks
Authors:
Babette Dellen,
Uwe Jaekel,
Paulo S. A. Freitas,
John W. Clark
Abstract:
Accurate estimation of nuclear masses and their prediction beyond the experimentally explored domains of the nuclear landscape are crucial to an understanding of the fundamental origin of nuclear properties and to many applications of nuclear science, most notably in quantifying the $r$-process of stellar nucleosynthesis. Neural networks have been applied with some success to the prediction of nuc…
▽ More
Accurate estimation of nuclear masses and their prediction beyond the experimentally explored domains of the nuclear landscape are crucial to an understanding of the fundamental origin of nuclear properties and to many applications of nuclear science, most notably in quantifying the $r$-process of stellar nucleosynthesis. Neural networks have been applied with some success to the prediction of nuclear masses, but they are known to have shortcomings in application to extrapolation tasks. In this work, we propose and explore a novel type of neural network for mass prediction in which the usual neuron-like processing units are replaced by complex-valued product units that permit multiplicative couplings of inputs to be learned from the input data. This generalized network model is tested on both interpolation and extrapolation data sets drawn from the Atomic Mass Evaluation. Its performance is compared with that of several neural-network architectures, substantiating its suitability for nuclear mass prediction. Additionally, a prediction-uncertainty measure for such complex-valued networks is proposed that serves to identify regions of expected low prediction error.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
Alpha matter revisited
Authors:
J. W. Clark,
E. Krotscheck
Abstract:
We examine in detail two alternative descriptions of a system of $α$ particles interacting via local interactions of different character, highlighting the fact that a faithful microscopic description of such systems demands a consistent treatment of both short- and long-range correlations. In preparation, we examine four different versions of modern microscopic many-body theory and conclude by emp…
▽ More
We examine in detail two alternative descriptions of a system of $α$ particles interacting via local interactions of different character, highlighting the fact that a faithful microscopic description of such systems demands a consistent treatment of both short- and long-range correlations. In preparation, we examine four different versions of modern microscopic many-body theory and conclude by emphasizing that these approaches, although {\it a priori} very different, actually lead to the same equations for their efficient application. The only quantity that depends on the formulation of many-body theory chosen is an {\it irreducible} interaction correction. In the language of Green's functions and Feynman diagrams, it is the set of both particle-particle and particle-hole irreducible diagrams, and in variational Jastrow-Feenberg theory it is determined by {\it multipartite correlations} and {\it elementary diagrams}. We apply these theoretical methods to the calculation of the energetics, structure, thermodynamics, and dynamics of $α$ matter, as well as its condensate fraction. In dimensionless units, $α$ matter appears to be remarkably similar to the much-studied $^4$He quantum fluid, its low-temperature properties now basically solved in the Jastrow-Feenberg framework. Accordingly, one can have confidence in the results of application of the same procedure to $α$ matter. Even so, closer examination reveals significant differences between the physics of the two systems. Within an infinite nuclear medium, alpha matter is subject to a spinoidal instability. Extended mixtures of nucleons and alpha particles are yet to be given rigorous consideration in a corresponding theoretical framework.
△ Less
Submitted 12 November, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
C. Alléné,
A. Allocca,
P. A. Altin
, et al. (1670 additional authors not shown)
Abstract:
Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated…
▽ More
Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated signals from strong lensing by 1) performing targeted searches for subthreshold signals, 2) calculating the degree of overlap amongst the intrinsic parameters and sky location of pairs of signals, 3) comparing the similarities of the spectrograms amongst pairs of signals, and 4) performing dual-signal Bayesian analysis that takes into account selection effects and astrophysical knowledge. We also search for distortions to the gravitational waveform caused by 1) frequency-independent phase shifts in strongly lensed images, and 2) frequency-dependent modulation of the amplitude and phase due to point masses. None of these searches yields significant evidence for lensing. Finally, we use the non-detection of gravitational-wave lensing to constrain the lensing rate based on the latest merger-rate estimates and the fraction of dark matter composed of compact objects.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
Regulatory Markets: The Future of AI Governance
Authors:
Gillian K. Hadfield,
Jack Clark
Abstract:
Appropriately regulating artificial intelligence is an increasingly urgent policy challenge. Legislatures and regulators lack the specialized knowledge required to best translate public demands into legal requirements. Overreliance on industry self-regulation fails to hold producers and users of AI systems accountable to democratic demands. Regulatory markets, in which governments require the targ…
▽ More
Appropriately regulating artificial intelligence is an increasingly urgent policy challenge. Legislatures and regulators lack the specialized knowledge required to best translate public demands into legal requirements. Overreliance on industry self-regulation fails to hold producers and users of AI systems accountable to democratic demands. Regulatory markets, in which governments require the targets of regulation to purchase regulatory services from a private regulator, are proposed. This approach to AI regulation could overcome the limitations of both command-and-control regulation and self-regulation. Regulatory market could enable governments to establish policy priorities for the regulation of AI, whilst relying on market forces and industry R&D efforts to pioneer the methods of regulation that best achieve policymakers' stated objectives.
△ Less
Submitted 25 April, 2023; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Fundamental Symmetries, Neutrons, and Neutrinos (FSNN): Whitepaper for the 2023 NSAC Long Range Plan
Authors:
B. Acharya,
C. Adams,
A. A. Aleksandrova,
K. Alfonso,
P. An,
S. Baeßler,
A. B. Balantekin,
P. S. Barbeau,
F. Bellini,
V. Bellini,
R. S. Beminiwattha,
J. C. Bernauer,
T. Bhattacharya,
M. Bishof,
A. E. Bolotnikov,
P. A. Breur,
M. Brodeur,
J. P. Brodsky,
L. J. Broussard,
T. Brunner,
D. P. Burdette,
J. Caylor,
M. Chiu,
V. Cirigliano,
J. A. Clark
, et al. (154 additional authors not shown)
Abstract:
This whitepaper presents the research priorities decided on by attendees of the 2022 Town Meeting for Fundamental Symmetries, Neutrons and Neutrinos, which took place December 13-15, 2022 in Chapel Hill, NC, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 275 scientists registered for the meeting. The whitepaper makes a number of explicit recom…
▽ More
This whitepaper presents the research priorities decided on by attendees of the 2022 Town Meeting for Fundamental Symmetries, Neutrons and Neutrinos, which took place December 13-15, 2022 in Chapel Hill, NC, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 275 scientists registered for the meeting. The whitepaper makes a number of explicit recommendations and justifies them in detail.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
A Privacy-Preserving Energy Theft Detection Model for Effective Demand-Response Management in Smart Grids
Authors:
Arwa Alromih,
John A. Clark,
Prosanta Gope
Abstract:
The detection of energy thefts is vital for the safety of the whole smart grid system. However, the detection alone is not enough since energy thefts can crucially affect the electricity supply leading to some blackouts. Moreover, privacy is one of the major challenges that must be preserved when dealing with clients' energy data. This is often overlooked in energy theft detection research as most…
▽ More
The detection of energy thefts is vital for the safety of the whole smart grid system. However, the detection alone is not enough since energy thefts can crucially affect the electricity supply leading to some blackouts. Moreover, privacy is one of the major challenges that must be preserved when dealing with clients' energy data. This is often overlooked in energy theft detection research as most current detection techniques rely on raw, unencrypted data, which may potentially expose sensitive and personal data. To solve this issue, we present a privacy-preserving energy theft detection technique with effective demand management that employs two layers of privacy protection. We explore a split learning mechanism that trains a detection model in a decentralised fashion without the need to exchange raw data. We also employ a second layer of privacy by the use of a masking scheme to mask clients' outputs in order to prevent inference attacks. A privacy-enhanced version of this mechanism also employs an additional layer of privacy protection by training a randomisation layer at the end of the client-side model. This is done to make the output as random as possible without compromising the detection performance. For the energy theft detection part, we design a multi-output machine learning model to identify energy thefts, estimate their volume, and effectively predict future demand. Finally, we use a comprehensive set of experiments to test our proposed scheme. The experimental results show that our scheme achieves high detection accuracy and greatly improves the privacy preservation degree.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
The MPIfR-MeerKAT Galactic Plane survey I -- System setup and early results
Authors:
P. V. Padmanabh,
E. D. Barr,
S. S. Sridhar,
M. R. Rugel,
A. Damas-Segovia,
A. M. Jacob,
V. Balakrishnan,
M. Berezina,
M. C. i Bernadich,
A. Brunthaler,
D. J. Champion,
P. C. C. Freire,
S. Khan,
H. -R. Klöckner,
M. Kramer,
Y. K. Ma,
S. A. Mao,
Y. P. Men,
K. M. Menten,
S. Sengupta,
V. Venkatraman Krishnan,
O. Wucknitz,
F. Wyrowski,
M. C. Bezuidenhout,
S. Buchner
, et al. (8 additional authors not shown)
Abstract:
Galactic plane radio surveys play a key role in improving our understanding of a wide range of astrophysical phenomena. Performing such a survey using the latest interferometric telescopes produces large data rates necessitating a shift towards fully or quasi-real-time data analysis with data being stored for only the time required to process them. We present here the overview and setup for the 30…
▽ More
Galactic plane radio surveys play a key role in improving our understanding of a wide range of astrophysical phenomena. Performing such a survey using the latest interferometric telescopes produces large data rates necessitating a shift towards fully or quasi-real-time data analysis with data being stored for only the time required to process them. We present here the overview and setup for the 3000 hour Max-Planck-Institut fuer Radioastronomie (MPIfR) MeerKAT Galactic Plane survey (MMGPS). The survey is unique by operating in a commensal mode, addressing key science objectives of the survey including the discovery of new pulsars and transients as well as studies of Galactic magnetism, the interstellar medium and star formation rates. We explain the strategy coupled with the necessary hardware and software infrastructure needed for data reduction in the imaging, spectral and time domains. We have so far discovered 78 new pulsars including 17 confirmed binary systems of which two are potential double neutron star systems. We have also developed an imaging pipeline sensitive to the order of a few tens of micro-Jansky with a spatial resolution of a few arcseconds. Further science operations with an in-house built S-Band receiver operating between 1.7-3.5 GHz are about to commence. Early spectral line commissioning observations conducted at S-Band, targeting transitions of the key molecular gas tracer CH at 3.3 GHz already illustrate the spectroscopic capabilities of this instrument. These results lay a strong foundation for future surveys with telescopes like the Square Kilometre Array (SKA).
△ Less
Submitted 21 June, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
High Resolution 3D Strain and Orientation Map** within a Grain of a Directed Energy Deposition Laser Additively Manufactured Superalloy
Authors:
Y. Chen,
Y. T. Tang,
D. M. Collins,
S. J. Clark,
W. Ludwig,
R. Rodriguez-Lamas,
C. Detlefs,
R. C. Reed,
P. D. Lee,
P. J. Withers,
C. Yildirim
Abstract:
The industrialization of Laser Additive Manufacturing (LAM) is challenged by the undesirable microstructures and high residual stresses originating from the fast and complex solidification process. Non-destructive assessment of the mechanical performance controlling deformation patterning is therefore critical. Here, we use Dark Field X-ray Microscopy (DFXM) to non-destructively map the 3D intragr…
▽ More
The industrialization of Laser Additive Manufacturing (LAM) is challenged by the undesirable microstructures and high residual stresses originating from the fast and complex solidification process. Non-destructive assessment of the mechanical performance controlling deformation patterning is therefore critical. Here, we use Dark Field X-ray Microscopy (DFXM) to non-destructively map the 3D intragranular orientation and strain variations throughout a surface breaking grain within a directed energy deposition nickel superalloy. DFXM results reveal a highly heterogenous 3D microstructure in terms of the local orientation and lattice strain. The grain comprises $\approx$ 5$μ$m-sized cells with alternating strain states, as high as 5 $\times 10^{-3}$, and orientation differences <0.5° . The DFXM results are compared to Electron Backscatter Diffraction measurements of the same grain from its cut-off surface. We discuss the microstructure developments during LAM, rationalising the development of the deformation patterning from the extreme thermal gradients during processing and the susceptibility for solute segregation.
△ Less
Submitted 8 March, 2023;
originally announced March 2023.
-
Search for Gravitational Waves from Scorpius X-1 in LIGO O3 Data With Corrected Orbital Ephemeris
Authors:
John T. Whelan,
Rodrigo Tenorio,
Jared K. Wofford,
James A. Clark,
Edward J. Daw,
Evan Goetz,
David Keitel,
Ansel Neunzert,
Alicia M. Sintes,
Katelyn J. Wagner,
Graham Woan,
Thomas L. Killestein,
Danny Steeghs
Abstract:
Improved observational constraints on the orbital parameters of the low-mass X-ray binary Scorpius~X-1 were recently published in Killestein et al (2023). In the process, errors were corrected in previous orbital ephemerides, which have been used in searches for continuous gravitational waves from Sco~X-1 using data from the Advanced LIGO detectors. We present the results of a re-analysis of LIGO…
▽ More
Improved observational constraints on the orbital parameters of the low-mass X-ray binary Scorpius~X-1 were recently published in Killestein et al (2023). In the process, errors were corrected in previous orbital ephemerides, which have been used in searches for continuous gravitational waves from Sco~X-1 using data from the Advanced LIGO detectors. We present the results of a re-analysis of LIGO detector data from the third observing run of Advanced LIGO and Advanced Virgo using a model-based cross-correlation search. The corrected region of parameter space, which was not covered by previous searches, was about 1/3 as large as the region searched in the original O3 analysis, reducing the required computing time. We have confirmed that no detectable signal is present over a range of gravitational-wave frequencies from $25\textrm{Hz}$ to $1600\textrm{Hz}$, analogous to the null result of Abbott et al (2022). Our search sensitivity is comparable to that of Abbott et al (2022), who set upper limits corresponding, between $100\textrm{Hz}$ and $200\textrm{Hz}$, to an amplitude $h_0$ of about $10^{-25}$ when marginalized isotropically over the unknown inclination angle of the neutron star's rotation axis, or less than $4\times 10^{-26}$ assuming the optimal orientation.
△ Less
Submitted 26 March, 2023; v1 submitted 20 February, 2023;
originally announced February 2023.
-
Tied-Array Beam Localisation of Radio Transients and Pulsars
Authors:
M. C. Bezuidenhout,
C. J. Clark,
R. P. Breton,
B. W. Stappers,
E. D. Barr,
M. Caleb,
W. Chen,
F. Jankowski,
M. Kramer,
K. Rajwade,
M. Surnis
Abstract:
Multi-element interferometers such as MeerKAT, which observe with high time resolution and have a wide field-of-view, provide an ideal opportunity to perform real-time, untargeted transient and pulsar searches. However, because of data storage limitations, it is not always feasible to store the baseband data required to image the field of a discovered transient or pulsar. This limits the ability o…
▽ More
Multi-element interferometers such as MeerKAT, which observe with high time resolution and have a wide field-of-view, provide an ideal opportunity to perform real-time, untargeted transient and pulsar searches. However, because of data storage limitations, it is not always feasible to store the baseband data required to image the field of a discovered transient or pulsar. This limits the ability of surveys to effectively localise their discoveries and may restrict opportunities for follow-up science, especially of one-off events like some Fast Radio Bursts (FRBs). Here we present a novel maximum-likelihood estimation approach to localising transients and pulsars detected in multiple MeerKAT tied-array beams at once, which we call Tied Array Beam Localisation (TABLo), as well as a Python implementation of the method named SeeKAT. We provide real-world examples of SeeKAT's use as well as a Monte Carlo analysis to show that it is capable of localising single pulses detected in beamformed MeerKAT data to (sub-)arcsecond precision.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
First-principles calculations of magnetic states in pyrochlores using a source-corrected exchange and correlation functional
Authors:
Z. Hawkhead,
N. Gidopoulos,
S. J. Blundell,
S. J. Clark,
T. Lancaster
Abstract:
We present a first-principles investigation of the spin-ice state in Dy$_2$Ti$_2$O$_7$ using a magnetic source-free exchange and correlation functional, implemented in the Castep electronic-structure code. By comparing results from the conventional local spin-density approximation, we show that a spin-ice state in Dy$_2$Ti$_2$O$_7$ can be reliably obtained by removing the magnetic sources from the…
▽ More
We present a first-principles investigation of the spin-ice state in Dy$_2$Ti$_2$O$_7$ using a magnetic source-free exchange and correlation functional, implemented in the Castep electronic-structure code. By comparing results from the conventional local spin-density approximation, we show that a spin-ice state in Dy$_2$Ti$_2$O$_7$ can be reliably obtained by removing the magnetic sources from the exchange and correlation contributions to the potential, and we contrast this against the computed ground states of other frustrated pyrochlore magnets.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
The Capacity for Moral Self-Correction in Large Language Models
Authors:
Deep Ganguli,
Amanda Askell,
Nicholas Schiefer,
Thomas I. Liao,
Kamilė Lukošiūtė,
Anna Chen,
Anna Goldie,
Azalia Mirhoseini,
Catherine Olsson,
Danny Hernandez,
Dawn Drain,
Dustin Li,
Eli Tran-Johnson,
Ethan Perez,
Jackson Kernion,
Jamie Kerr,
Jared Mueller,
Joshua Landau,
Kamal Ndousse,
Karina Nguyen,
Liane Lovitt,
Michael Sellitto,
Nelson Elhage,
Noemi Mercado,
Nova DasSarma
, et al. (24 additional authors not shown)
Abstract:
We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so. We find strong evidence in support of this hypothesis across three different experiments, each of which reveal different facets of moral self-correction. We find that the capability…
▽ More
We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so. We find strong evidence in support of this hypothesis across three different experiments, each of which reveal different facets of moral self-correction. We find that the capability for moral self-correction emerges at 22B model parameters, and typically improves with increasing model size and RLHF training. We believe that at this level of scale, language models obtain two capabilities that they can use for moral self-correction: (1) they can follow instructions and (2) they can learn complex normative concepts of harm like stereoty**, bias, and discrimination. As such, they can follow instructions to avoid certain kinds of morally harmful outputs. We believe our results are cause for cautious optimism regarding the ability to train language models to abide by ethical principles.
△ Less
Submitted 18 February, 2023; v1 submitted 14 February, 2023;
originally announced February 2023.
-
The Quest for the Missing Dust: II -- Two Orders of Magnitude of Evolution in the Dust-to-Gas Ratio Resolved Within Local Group Galaxies
Authors:
Christopher J. R. Clark,
Julia C. Roman-Duval,
Karl D. Gordon,
Caroline Bot,
Matthew W. L. Smith,
Lea M. Z. Hagen
Abstract:
We explore evolution in the dust-to-gas ratio with density within four well-resolved Local Group galaxies - the LMC, SMC, M31, and M33. We do this using new ${\it Herschel}$ maps, which restore extended emission that was missed by previous ${\it Herschel}$ reductions. This improved data allows us to probe the dust-to-gas ratio across 2.5 orders of magnitude in ISM surface density. We find signific…
▽ More
We explore evolution in the dust-to-gas ratio with density within four well-resolved Local Group galaxies - the LMC, SMC, M31, and M33. We do this using new ${\it Herschel}$ maps, which restore extended emission that was missed by previous ${\it Herschel}$ reductions. This improved data allows us to probe the dust-to-gas ratio across 2.5 orders of magnitude in ISM surface density. We find significant evolution in the dust-to-gas ratio, with dust-to-gas varying with density within each galaxy by up to a factor 22.4. We explore several possible reasons for this, and our favored explanation is dust grain growth in denser regions of ISM. We find that the evolution of the dust-to-gas ratio with ISM surface density is very similar between M31 and M33, despite their large differences in mass, metallicity, and star formation rate; conversely, we find M33 and the LMC to have very different dust-to-gas evolution profiles, despite their close similarity in those properties. Our dust-to-gas ratios address previous disagreement between UV- and FIR-based dust-to-gas estimates for the Magellanic Clouds, removing the disagreement for the LMC, and considerably reducing it for the SMC - with our new dust-to-gas measurements being factors of 2.4 and 2.0 greater than the previous far-infrared estimates, respectively. We also observe that the dust-to-gas ratio appears to fall at the highest densities for the LMC, M31, and M33; this is unlikely to be an actual physical phenomenon, and we posit that it may be due to a combined effect of dark gas, and changing dust mass opacity.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
The Arches cluster revisited: IV. Observational constraints on the binary properties of very massive stars
Authors:
J. S. Clark,
M. E. Lohr,
F. Najarro,
L. R. Patrick,
B. W. Ritchie
Abstract:
Serving as the progenitors of electromagnetic and gravitational wave transients, massive stars have received renewed interest in recent years. However, many aspects of their birth and evolution remain opaque, particularly in the context of binary interactions. The centre of our galaxy hosts a rich cohort of very massive stars, which appear to play a prominent role in the ecology of the region. In…
▽ More
Serving as the progenitors of electromagnetic and gravitational wave transients, massive stars have received renewed interest in recent years. However, many aspects of their birth and evolution remain opaque, particularly in the context of binary interactions. The centre of our galaxy hosts a rich cohort of very massive stars, which appear to play a prominent role in the ecology of the region. In this paper we investigate the binary properties of the Arches cluster, which is thought to host a large number of very massive stars. A combination of multi-epoch near-IR spectroscopy and photometry was utilised to identify binaries. 13 from 36 cluster members meet our criteria to be classed as RV variable. Combining the spectroscopic data with archival radio and X-ray observations - to detect colliding wind systems - provides a lower limit to the binary fraction of ~43%; increasing to >50% for the O-type hypergiants and WNLha. Dynamical and evolutionary masses reveal the primaries to be uniformly massive (>50M$_{\odot}$). Where available, orbital analysis reveals a number of short period, highly eccentric binaries, which appear to be pre-interaction systems. Such systems are X-ray luminous, with 80% above an empirical bound of $(L_{\rm x}/L_{\rm bol})\sim10^{-7}$ and their orbital configurations suggest formation and evolution via a single star channel; however, we cannot exclude a binary formation channel for a subset. Qualitative comparison to surveys of lower mass OB-type stars confirms that the trend to an extreme binary fraction (>60%) extends to the most massive stars currently forming in the local Universe.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Open data from the third observing run of LIGO, Virgo, KAGRA and GEO
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné,
A. Allocca
, et al. (1719 additional authors not shown)
Abstract:
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti…
▽ More
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Causally-Interpretable Random-Effects Meta-Analysis
Authors:
Justin M. Clark,
Kollin W. Rott,
James S. Hodges,
Jared D. Huling
Abstract:
Recent work has made important contributions in the development of causally-interpretable meta-analysis. These methods transport treatment effects estimated in a collection of randomized trials to a target population of interest. Ideally, estimates targeted toward a specific population are more interpretable and relevant to policy-makers and clinicians. However, between-study heterogeneity not ari…
▽ More
Recent work has made important contributions in the development of causally-interpretable meta-analysis. These methods transport treatment effects estimated in a collection of randomized trials to a target population of interest. Ideally, estimates targeted toward a specific population are more interpretable and relevant to policy-makers and clinicians. However, between-study heterogeneity not arising from differences in the distribution of treatment effect modifiers can raise difficulties in synthesizing estimates across trials. The existence of such heterogeneity, including variations in treatment modality, also complicates the interpretation of transported estimates as a generic effect in the target population. We propose a conceptual framework and estimation procedures that attempt to account for such heterogeneity, and develop inferential techniques that aim to capture the accompanying excess variability in causal estimates. This framework also seeks to clarify the kind of treatment effects that are amenable to the techniques of generalizability and transportability.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Neutron star mass estimates from gamma-ray eclipses in spider millisecond pulsar binaries
Authors:
C. J. Clark,
M. Kerr,
E. D. Barr,
B. Bhattacharyya,
R. P. Breton,
P. Bruel,
F. Camilo,
W. Chen,
I. Cognard,
H. T. Cromartie,
J. Deneva,
V. S. Dhillon,
L. Guillemot,
M. R. Kennedy,
M. Kramer,
A. G. Lyne,
D. Mata Sánchez,
L. Nieder,
C. Phillips,
S. M. Ransom,
P. S. Ray,
M. S. E. Roberts,
J. Roy,
D. A. Smith,
R. Spiewak
, et al. (4 additional authors not shown)
Abstract:
Reliable neutron star mass measurements are key to determining the equation-of-state of cold nuclear matter, but these are rare. "Black Widows" and "Redbacks" are compact binaries consisting of millisecond pulsars and semi-degenerate companion stars. Spectroscopy of the optically bright companions can determine their radial velocities, providing inclination-dependent pulsar mass estimates. While i…
▽ More
Reliable neutron star mass measurements are key to determining the equation-of-state of cold nuclear matter, but these are rare. "Black Widows" and "Redbacks" are compact binaries consisting of millisecond pulsars and semi-degenerate companion stars. Spectroscopy of the optically bright companions can determine their radial velocities, providing inclination-dependent pulsar mass estimates. While inclinations can be inferred from subtle features in optical light curves, such estimates may be systematically biased due to incomplete heating models and poorly-understood variability. Using data from the Fermi Large Area Telescope, we have searched for gamma-ray eclipses from 49 spider systems, discovering significant eclipses in 7 systems, including the prototypical black widow PSR B1957$+$20. Gamma-ray eclipses require direct occultation of the pulsar by the companion, and so the detection, or significant exclusion, of a gamma-ray eclipse strictly limits the binary inclination angle, providing new robust, model-independent pulsar mass constraints. For PSR B1957$+$20, the eclipse implies a much lighter pulsar ($M_{\rm psr} = 1.81 \pm 0.07\,M_{\odot}$) than inferred from optical light curve modelling.
△ Less
Submitted 26 January, 2023;
originally announced January 2023.
-
Identification, explanation and clinical evaluation of hospital patient subtypes
Authors:
Enrico Werner,
Jeffrey N. Clark,
Ranjeet S. Bhamber,
Michael Ambler,
Christopher P. Bourdeaux,
Alexander Hepburn,
Christopher J. McWilliams,
Raul Santos-Rodriguez
Abstract:
We present a pipeline in which unsupervised machine learning techniques are used to automatically identify subtypes of hospital patients admitted between 2017 and 2021 in a large UK teaching hospital. With the use of state-of-the-art explainability techniques, the identified subtypes are interpreted and assigned clinical meaning. In parallel, clinicians assessed intra-cluster similarities and inte…
▽ More
We present a pipeline in which unsupervised machine learning techniques are used to automatically identify subtypes of hospital patients admitted between 2017 and 2021 in a large UK teaching hospital. With the use of state-of-the-art explainability techniques, the identified subtypes are interpreted and assigned clinical meaning. In parallel, clinicians assessed intra-cluster similarities and inter-cluster differences of the identified patient subtypes within the context of their clinical knowledge. By confronting the outputs of both automatic and clinician-based explanations, we aim to highlight the mutual benefit of combining machine learning techniques with clinical expertise.
△ Less
Submitted 19 January, 2023;
originally announced January 2023.
-
A black widow population dissection through HiPERCAM multi-band light curve modelling
Authors:
D. Mata Sánchez,
M. R. Kennedy,
C. J. Clark,
R. P. Breton,
V. S. Dhillon,
G. Voisin,
F. Camilo,
S. Littlefair,
T. R. Marsh,
J. Stringer
Abstract:
Black widows are extreme millisecond pulsar binaries where the pulsar wind ablates their low-mass companion stars. Their optical light curves vary periodically due to the high irradiation and tidal distortion of the companion, which allows us to infer the binary parameters. We present simultaneous multi-band observations obtained with the HIPERCAM instrument at the 10.4-m GTC telescope for six of…
▽ More
Black widows are extreme millisecond pulsar binaries where the pulsar wind ablates their low-mass companion stars. Their optical light curves vary periodically due to the high irradiation and tidal distortion of the companion, which allows us to infer the binary parameters. We present simultaneous multi-band observations obtained with the HIPERCAM instrument at the 10.4-m GTC telescope for six of these systems. The combination of this five-band fast photometer with the world's largest optical telescope enables us to inspect the light curve range near minima. We present the first light curve for PSR J1641+8049, as well as attain a significant increase in signal-to-noise and cadence compared with previous publications for the remaining 5 targets: PSR J0023+0923, PSR J0251+2606, PSR J0636+5129, PSR J0952-0607 and PSR J1544+4937. We report on the results of the light curve modelling with the Icarus code for all six systems, which reveals some of the hottest and densest companion stars known. We compare the parameters derived with the limited but steadily growing black widow population for which optical modelling is available. We find some expected correlations, such as that between the companion star mean density and the orbital period of the system, but also a puzzling positive correlation between the orbital inclination and the irradiation temperature of the companion. We propose such a correlation would arise if pulsars with magnetic axis orthogonal to their spin axis are capable of irradiating their companions to a higher degree.
△ Less
Submitted 17 January, 2023;
originally announced January 2023.
-
A rapid optical and X-ray timing study of the neutron star X-ray binary Swift J1858.6-0814
Authors:
T. Shahbaz,
J. A. Paice,
K. M. Rajwade,
A. Veledina,
P. Gandhi.,
V. S. Dhillon,
T. R. Marsh,
S. Littlefair,
M. R. Kennedy,
R. P. Breton,
C. J. Clark
Abstract:
We present a rapid timing analysis of optical (HiPERCAM and ULTRACAM) and X-ray (NICER) observations of the X-ray transient Swift J1858.6-0814 during 2018 and 2019. The optical light curves show relatively slow, large amplitude (~1 mags in g$_s$) `blue' flares (i.e. stronger at shorter wavelengths) on time-scales of ~minutes as well as fast, small amplitude (~0.1 mag in g$_s$) `red' flares (i.e. s…
▽ More
We present a rapid timing analysis of optical (HiPERCAM and ULTRACAM) and X-ray (NICER) observations of the X-ray transient Swift J1858.6-0814 during 2018 and 2019. The optical light curves show relatively slow, large amplitude (~1 mags in g$_s$) `blue' flares (i.e. stronger at shorter wavelengths) on time-scales of ~minutes as well as fast, small amplitude (~0.1 mag in g$_s$) `red' flares (i.e. stronger at longer wavelengths) on time-scales of ~seconds. The `blue' and `red' flares are consistent with X-ray reprocessing and optically thin synchrotron emission, respectively, similar to what is observed in other X-ray binaries. The simultaneous optical versus soft- and hard-band X-ray light curves show time- and energy dependent correlations.
The 2019 March 4 and parts of the June data show a nearly symmetric positive cross correlations (CCFs) at positive lags consistent with simple X-ray disc reprocessing. The soft- and hard-band CCFs are similar and can be reproduced if disc reprocessing dominates in the optical and one component (disc or synchrotron Comptonization) dominates both the soft and hard X-rays. A part of the 2019 June data shows a very different CCFs. The observed positive correlation at negative lag in the soft-band can be reproduced if the optical synchrotron emission is correlated with the hot flow X-ray emission.
The observed timing properties are in qualitative agreement with the hybrid inner hot accretion flow model, where the relative role of the different X-ray and optical components that vary during the course of the outburst, as well as on shorter time-scales, govern the shape of the optical/X-ray CCFs.
△ Less
Submitted 16 January, 2023;
originally announced January 2023.
-
Nuclear $β$ decay as a probe for physics beyond the Standard Model
Authors:
M. Brodeur,
N. Buzinsky,
M. A. Caprio,
V. Cirigliano,
J. A. Clark,
P. J. Fasano,
J. A. Formaggio,
A. T. Gallant,
A. Garcia,
S. Gandolfi,
S. Gardner,
A. Glick-Magid,
L. Hayen,
H. Hergert,
J. D. Holt,
M. Horoi,
M. Y. Huang,
K. D. Launey,
K. G. Leach,
B. Longfellow,
A. Lovato,
A. E. McCoy,
D. Melconian,
P. Mohanmurthy,
D. C. Moore
, et al. (21 additional authors not shown)
Abstract:
This white paper was submitted to the 2022 Fundamental Symmetries, Neutrons, and Neutrinos (FSNN) Town Hall Meeting in preparation for the next NSAC Long Range Plan. We advocate to support current and future theoretical and experimental searches for physics beyond the Standard Model using nuclear $β$ decay.
This white paper was submitted to the 2022 Fundamental Symmetries, Neutrons, and Neutrinos (FSNN) Town Hall Meeting in preparation for the next NSAC Long Range Plan. We advocate to support current and future theoretical and experimental searches for physics beyond the Standard Model using nuclear $β$ decay.
△ Less
Submitted 10 January, 2023;
originally announced January 2023.
-
Coexistence of bulk-nodal and surface-nodeless Cooper pairings in a superconducting Dirac semimetal
Authors:
Xian P. Yang,
Yigui Zhong,
Sougata Mardanya,
Tyler A. Cochran,
Ramakanta Chapai,
Akifumi Mine,
Junyi Zhang,
Jaime Sánchez-Barriga,
Zi-Jia Cheng,
Oliver J. Clark,
Jia- Xin Yin,
Joanna Blawat,
Guangming Cheng,
Ilya Belopolski,
Tsubaki Nagashima,
Najafzadeh Sahand,
Shiyuan Gao,
Nan Yao,
Arun Bansil,
Rongying **,
Tay-Rong Chang,
Shik Shin,
Kozo Okazaki,
M. Zahid Hasan
Abstract:
The interplay of nontrivial topology and superconductivity in condensed matter physics gives rise to exotic phenomena. However, materials are extremely rare where it is possible to explore the full details of the superconducting pairing. Here, we investigate the momentum dependence of the superconducting gap distribution in a novel Dirac material PdTe. Using high resolution, low temperature photoe…
▽ More
The interplay of nontrivial topology and superconductivity in condensed matter physics gives rise to exotic phenomena. However, materials are extremely rare where it is possible to explore the full details of the superconducting pairing. Here, we investigate the momentum dependence of the superconducting gap distribution in a novel Dirac material PdTe. Using high resolution, low temperature photoemission spectroscopy, we establish it as a spin-orbit coupled Dirac semimetal with the topological Fermi arc crossing the Fermi level on the (010) surface. This spin-textured surface state exhibits a fully gapped superconducting Cooper pairing structure below Tc~4.5K. Moreover, we find a node in the bulk near the Brillouin zone boundary, away from the topological Fermi arc.These observations not only demonstrate the band resolved electronic correlation between topological Fermi arc states and the way it induces Cooper pairing in PdTe, but also provide a rare case where surface and bulk states host a coexistence of nodeless and nodal gap structures enforced by spin-orbit coupling.
△ Less
Submitted 3 January, 2023;
originally announced January 2023.
-
Weak-disorder limit for directed polymers on critical hierarchical graphs with vertex disorder
Authors:
Jeremy Clark,
Casey Lochridge
Abstract:
We study models for a directed polymer in a random environment (DPRE) in which the polymer traverses a hierarchical diamond graph and the random environment is defined through random variables attached to the vertices. For these models, we prove a distributional limit theorem for the partition function in a limiting regime wherein the system grows as the coupling of the polymer to the random envir…
▽ More
We study models for a directed polymer in a random environment (DPRE) in which the polymer traverses a hierarchical diamond graph and the random environment is defined through random variables attached to the vertices. For these models, we prove a distributional limit theorem for the partition function in a limiting regime wherein the system grows as the coupling of the polymer to the random environment is appropriately attenuated. The sequence of diamond graphs is determined by a choice of a branching number $b\in \{2,3,\ldots\}$ and segmenting number $s\in \{2,3,\ldots\}$, and our focus is on the critical case of the model where $b=s$. This extends recent work in the critical case of analogous models with disorder variables placed at the edges of the graphs rather than the vertices.
△ Less
Submitted 29 December, 2022;
originally announced December 2022.
-
BD-KD: Balancing the Divergences for Online Knowledge Distillation
Authors:
Ibtihel Amara,
Nazanin Sepahvand,
Brett H. Meyer,
Warren J. Gross,
James J. Clark
Abstract:
Knowledge distillation (KD) has gained a lot of attention in the field of model compression for edge devices thanks to its effectiveness in compressing large powerful networks into smaller lower-capacity models. Online distillation, in which both the teacher and the student are learning collaboratively, has also gained much interest due to its ability to improve on the performance of the networks…
▽ More
Knowledge distillation (KD) has gained a lot of attention in the field of model compression for edge devices thanks to its effectiveness in compressing large powerful networks into smaller lower-capacity models. Online distillation, in which both the teacher and the student are learning collaboratively, has also gained much interest due to its ability to improve on the performance of the networks involved. The Kullback-Leibler (KL) divergence ensures the proper knowledge transfer between the teacher and student. However, most online KD techniques present some bottlenecks under the network capacity gap. By cooperatively and simultaneously training, the models the KL distance becomes incapable of properly minimizing the teacher's and student's distributions. Alongside accuracy, critical edge device applications are in need of well-calibrated compact networks. Confidence calibration provides a sensible way of getting trustworthy predictions. We propose BD-KD: Balancing of Divergences for online Knowledge Distillation. We show that adaptively balancing between the reverse and forward divergences shifts the focus of the training strategy to the compact student network without limiting the teacher network's learning process. We demonstrate that, by performing this balancing design at the level of the student distillation loss, we improve upon both performance accuracy and calibration of the compact student network. We conducted extensive experiments using a variety of network architectures and show improvements on multiple datasets including CIFAR-10, CIFAR-100, Tiny-ImageNet, and ImageNet. We illustrate the effectiveness of our approach through comprehensive comparisons and ablations with current state-of-the-art online and offline KD techniques.
△ Less
Submitted 25 December, 2022;
originally announced December 2022.
-
Detecting Axion-Like Particles with Primordial Black Holes
Authors:
Kaustubh Agashe,
Jae Hyeok Chang,
Steven J. Clark,
Bhaskar Dutta,
Yuhsin Tsai,
Tao Xu
Abstract:
Future gamma-ray experiments, such as the e-ASTROGAM and AMEGO telescopes, can detect the Hawking radiation of photons from primordial black holes (PBHs) if they make up a fraction or all of dark matter. PBHs can analogously also Hawking radiate new particles, which is especially interesting if these particles are mostly secluded from the Standard Model (SM) sector, since they might therefore be l…
▽ More
Future gamma-ray experiments, such as the e-ASTROGAM and AMEGO telescopes, can detect the Hawking radiation of photons from primordial black holes (PBHs) if they make up a fraction or all of dark matter. PBHs can analogously also Hawking radiate new particles, which is especially interesting if these particles are mostly secluded from the Standard Model (SM) sector, since they might therefore be less accessible otherwise. A well-motivated example of this type is axion-like particles (ALPs) with a tiny coupling to photons. We assume that the ALPs produced by PBHs decay into photons well before reaching the earth, so these will augment the photons directly radiated by the PBHs. Remarkably, we find that the peaks in the energy distributions of ALPs produced from PBHs are different than the corresponding ones for Hawking radiated photons due to the spin-dependent greybody factor. Therefore, we demonstrate that this process will in fact distinctively modify the PBHs' gamma-ray spectrum relative to the SM prediction. We use monochromatic asteroid-mass PBHs as an example to show that e-ASTROGAM can observe the PBH-produced ALP gamma-ray signal (for masses up to ~60 MeV) and further distinguish it from Hawking radiation without ALPs. By measuring the gamma-ray signals, e-ASTROGAM can thereby probe yet unexplored parameters in the ALP mass and photon coupling.
△ Less
Submitted 22 December, 2022;
originally announced December 2022.
-
Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval
Authors:
John Wieting,
Jonathan H. Clark,
William W. Cohen,
Graham Neubig,
Taylor Berg-Kirkpatrick
Abstract:
Contrastive learning has been successfully used for retrieval of semantically aligned sentences, but it often requires large batch sizes or careful engineering to work well. In this paper, we instead propose a generative model for learning multilingual text embeddings which can be used to retrieve or score sentence pairs. Our model operates on parallel data in $N$ languages and, through an approxi…
▽ More
Contrastive learning has been successfully used for retrieval of semantically aligned sentences, but it often requires large batch sizes or careful engineering to work well. In this paper, we instead propose a generative model for learning multilingual text embeddings which can be used to retrieve or score sentence pairs. Our model operates on parallel data in $N$ languages and, through an approximation we introduce, efficiently encourages source separation in this multilingual setting, separating semantic information that is shared between translations from stylistic or language-specific variation. We show careful large-scale comparisons between contrastive and generation-based approaches for learning multilingual text embeddings, a comparison that has not been done to the best of our knowledge despite the popularity of these approaches. We evaluate this method on a suite of tasks including semantic similarity, bitext mining, and cross-lingual question retrieval -- the last of which we introduce in this paper. Overall, our Variational Multilingual Source-Separation Transformer (VMSST) model outperforms both a strong contrastive and generative baseline on these tasks.
△ Less
Submitted 4 June, 2023; v1 submitted 20 December, 2022;
originally announced December 2022.
-
KronA: Parameter Efficient Tuning with Kronecker Adapter
Authors:
Ali Edalati,
Marzieh Tahaei,
Ivan Kobyzev,
Vahid Partovi Nia,
James J. Clark,
Mehdi Rezagholizadeh
Abstract:
Fine-tuning a Pre-trained Language Model (PLM) on a specific downstream task has been a well-known paradigm in Natural Language Processing. However, with the ever-growing size of PLMs, training the entire model on several downstream tasks becomes very expensive and resource-hungry. Recently, different Parameter Efficient Tuning (PET) techniques are proposed to improve the efficiency of fine-tuning…
▽ More
Fine-tuning a Pre-trained Language Model (PLM) on a specific downstream task has been a well-known paradigm in Natural Language Processing. However, with the ever-growing size of PLMs, training the entire model on several downstream tasks becomes very expensive and resource-hungry. Recently, different Parameter Efficient Tuning (PET) techniques are proposed to improve the efficiency of fine-tuning PLMs. One popular category of PET methods is the low-rank adaptation methods which insert learnable truncated SVD modules into the original model either sequentially or in parallel. However, low-rank decomposition suffers from limited representation power. In this work, we address this problem using the Kronecker product instead of the low-rank representation. We introduce KronA, a Kronecker product-based adapter module for efficient fine-tuning of Transformer-based PLMs. We apply the proposed methods for fine-tuning T5 on the GLUE benchmark to show that incorporating the Kronecker-based modules can outperform state-of-the-art PET methods.
△ Less
Submitted 20 December, 2022;
originally announced December 2022.
-
Discovering Language Model Behaviors with Model-Written Evaluations
Authors:
Ethan Perez,
Sam Ringer,
Kamilė Lukošiūtė,
Karina Nguyen,
Edwin Chen,
Scott Heiner,
Craig Pettit,
Catherine Olsson,
Sandipan Kundu,
Saurav Kadavath,
Andy Jones,
Anna Chen,
Ben Mann,
Brian Israel,
Bryan Seethor,
Cameron McKinnon,
Christopher Olah,
Da Yan,
Daniela Amodei,
Dario Amodei,
Dawn Drain,
Dustin Li,
Eli Tran-Johnson,
Guro Khundadze,
Jackson Kernion
, et al. (38 additional authors not shown)
Abstract:
As language models (LMs) scale, they develop many novel behaviors, good and bad, exacerbating the need to evaluate how they behave. Prior work creates evaluations with crowdwork (which is time-consuming and expensive) or existing data sources (which are not always available). Here, we automatically generate evaluations with LMs. We explore approaches with varying amounts of human effort, from inst…
▽ More
As language models (LMs) scale, they develop many novel behaviors, good and bad, exacerbating the need to evaluate how they behave. Prior work creates evaluations with crowdwork (which is time-consuming and expensive) or existing data sources (which are not always available). Here, we automatically generate evaluations with LMs. We explore approaches with varying amounts of human effort, from instructing LMs to write yes/no questions to making complex Winogender schemas with multiple stages of LM-based generation and filtering. Crowdworkers rate the examples as highly relevant and agree with 90-100% of labels, sometimes more so than corresponding human-written datasets. We generate 154 datasets and discover new cases of inverse scaling where LMs get worse with size. Larger LMs repeat back a dialog user's preferred answer ("sycophancy") and express greater desire to pursue concerning goals like resource acquisition and goal preservation. We also find some of the first examples of inverse scaling in RL from Human Feedback (RLHF), where more RLHF makes LMs worse. For example, RLHF makes LMs express stronger political views (on gun rights and immigration) and a greater desire to avoid shut down. Overall, LM-written evaluations are high-quality and let us quickly discover many novel LM behaviors.
△ Less
Submitted 19 December, 2022;
originally announced December 2022.
-
The TRAPUM L-band survey for pulsars in Fermi-LAT gamma-ray sources
Authors:
C. J. Clark,
R. P. Breton,
E. D. Barr,
M. Burgay,
T. Thongmeearkom,
L. Nieder,
S. Buchner,
B. Stappers,
M. Kramer,
W. Becker,
M. Mayer,
A. Phosrisom,
A. Ashok,
M. C. Bezuidenhout,
F. Calore,
I. Cognard,
P. C. C. Freire,
M. Geyer,
J. -M. Grießmeier,
R. Karuppusamy,
L. Levin,
P. V. Padmanabh,
A. Possenti,
S. Ransom,
M. Serylak
, et al. (13 additional authors not shown)
Abstract:
More than 100 millisecond pulsars (MSPs) have been discovered in radio observations of gamma-ray sources detected by the Fermi Large Area Telescope (LAT), but hundreds of pulsar-like sources remain unidentified. Here we present the first results from the targeted survey of Fermi-LAT sources being performed by the Transients and Pulsars with MeerKAT (TRAPUM) Large Survey Project. We observed 79 sou…
▽ More
More than 100 millisecond pulsars (MSPs) have been discovered in radio observations of gamma-ray sources detected by the Fermi Large Area Telescope (LAT), but hundreds of pulsar-like sources remain unidentified. Here we present the first results from the targeted survey of Fermi-LAT sources being performed by the Transients and Pulsars with MeerKAT (TRAPUM) Large Survey Project. We observed 79 sources identified as possible gamma-ray pulsar candidates by a Random Forest classification of unassociated sources from the 4FGL catalogue. Each source was observed for 10 minutes on two separate epochs using MeerKAT's L-band receiver (856-1712 MHz), with typical pulsed flux density sensitivities of $\sim$100$\,μ$Jy. Nine new MSPs were discovered, eight of which are in binary systems, including two eclipsing redbacks and one system, PSR J1526$-$2744, that appears to have a white dwarf companion in an unusually compact 5 hr orbit. We obtained phase-connected timing solutions for two of these MSPs, enabling the detection of gamma-ray pulsations in the Fermi-LAT data. A follow-up search for continuous gravitational waves from PSR J1526$-$2744 in Advanced LIGO data using the resulting Fermi-LAT timing ephemeris yielded no detection, but sets an upper limit on the neutron star ellipticity of $2.45\times10^{-8}$. We also detected X-ray emission from the redback PSR J1803$-$6707 in data from the first eROSITA all-sky survey, likely due to emission from an intra-binary shock.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Spinning up a Daze: TESS Uncovers a Hot Jupiter orbiting the Rapid-Rotator TOI-778
Authors:
Jake Clark,
Brett Addison,
Jack Okumura,
Sydney Vach,
Alexis Heitzmann,
Joseph Rodriguez,
Duncan Wright,
Mathieu Clerte,
Carolyn Brown,
Tara Fetherolf,
Robert Wittenmyer,
Peter Plavchan,
Stephen Kane,
Jonathan Horner,
John Kielkopf,
Avi Shporer,
C. Tinney,
Liu Hui-Gen,
Sarah Ballard,
Brendan Bowler,
Matthew Mengel,
George Zhou,
Annette Lee,
Avelyn David,
Jessica Heim
, et al. (46 additional authors not shown)
Abstract:
NASA's Transiting Exoplanet Survey Satellite (TESS) mission, has been uncovering a growing number of exoplanets orbiting nearby, bright stars. Most exoplanets that have been discovered by TESS orbit narrow-line, slow-rotating stars, facilitating the confirmation and mass determination of these worlds. We present the discovery of a hot Jupiter orbiting a rapidly rotating ($v\sin{(i)}= 35.1\pm1.0$km…
▽ More
NASA's Transiting Exoplanet Survey Satellite (TESS) mission, has been uncovering a growing number of exoplanets orbiting nearby, bright stars. Most exoplanets that have been discovered by TESS orbit narrow-line, slow-rotating stars, facilitating the confirmation and mass determination of these worlds. We present the discovery of a hot Jupiter orbiting a rapidly rotating ($v\sin{(i)}= 35.1\pm1.0$km/s) early F3V-dwarf, HD115447 (TOI-778). The transit signal taken from Sectors 10 and 37 of TESS's initial detection of the exoplanet is combined with follow-up ground-based photometry and velocity measurements taken from Minerva-Australis, TRES, CORALIE and CHIRON to confirm and characterise TOI-778b. A joint analysis of the light curves and the radial velocity measurements yield a mass, radius, and orbital period for TOI-778b of $2.76^{+0.24}_{-0.23}$Mjup, $1.370\pm0.043$Rjup and $\sim4.63$ days, respectively. The planet orbits a bright ($V = 9.1$mag) F3-dwarf with $M=1.40\pm0.05$Msun, $R=1.70\pm0.05$Rsun, and $\log g=4.05\pm0.17$. We observed a spectroscopic transit of TOI-778b, which allowed us to derive a sky-projected spin-orbit angle of $18^{\circ}\pm11^{\circ}$, consistent with an aligned planetary system. This discovery demonstrates the capability of smaller aperture telescopes such as Minerva-Australis to detect the radial velocity signals produced by planets orbiting broad-line, rapidly rotating stars.
△ Less
Submitted 30 April, 2023; v1 submitted 15 December, 2022;
originally announced December 2022.