-
Hybrid approach predicts a lower binding energy for benzene on water ice
Authors:
Victoria H. J. Clark,
David M. Benoit,
Marie Van de Sande,
Catherine Walsh
Abstract:
In this paper we provide a highly accurate value for the binding energy of benzene to proton-ordered crystalline water ice (XIh), as a model for interstellar ices. We compare our computed value to the latest experimental data available from temperature programmed desorption (TPD) experiments and find that our binding energy value agrees well with data obtained from binding to either crystalline or…
▽ More
In this paper we provide a highly accurate value for the binding energy of benzene to proton-ordered crystalline water ice (XIh), as a model for interstellar ices. We compare our computed value to the latest experimental data available from temperature programmed desorption (TPD) experiments and find that our binding energy value agrees well with data obtained from binding to either crystalline or amorphous ice. Importantly, our new value is lower than that used in most astrochemical networks by about nearly half its value. We explore the impact of this revised binding energy value for both an AGB outflow and a protoplanetary disk. We find that the lower value of the binding energy predicted here compared with values used in the literature (4050 K versus 7587 K) leads to less depletion of gas-phase benzene in an AGB outflow, and leads to a shift outwards in the benzene snowline in the midplane of a protoplanetary disk. Using this new value, the AGB model predicts lower abundances of benzene in the solid phase throughout the outflow. The disk model also predicts a larger reservoir of gas-phase benzene in the inner disk, which is consistent with the recent detections of benzene for the first time in protoplanetary disks with JWST.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Microwave-optical spectroscopy of Rydberg excitons in the ultrastrong driving regime
Authors:
Alistair Brewin,
Liam A P Gallagher,
Jon D Pritchett,
Horatio Q X Wong,
Robert M Potvliege,
Stewart J Clark,
Matthew P A Jones
Abstract:
We study the ultrastrong driving of Rydberg excitons in Cu$_2$O by a microwave field. The effect of the microwaves was studied using optical absorption specstroscopy, and via the observation of sidebands on the transmitted laser light. A model based on Floquet theory was constructed to explore the system beyond the rotating wave approximation. We obtain near quantitative agreement between theory a…
▽ More
We study the ultrastrong driving of Rydberg excitons in Cu$_2$O by a microwave field. The effect of the microwaves was studied using optical absorption specstroscopy, and via the observation of sidebands on the transmitted laser light. A model based on Floquet theory was constructed to explore the system beyond the rotating wave approximation. We obtain near quantitative agreement between theory and experiment across a 16-fold range of microwave field strength, from $43.5\pm 0.3$ V m$^{-1}$ up to $688 \pm 5$ V m$^{-1}$, crossing from the perturbative to the ultrastrong driving regime. Compared to Rydberg atoms, the large non-radiative width of Rydberg exctions leads to new behaviour such as the emergence of an absoprtion continuum without ionization.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Mildly boosted dark matter annihilation and reconciling indirect galactic signals
Authors:
Steven J. Clark
Abstract:
The galactic center excess is a possible non-gravitational observation of dark matter; however, the canonical dark matter model (thermal freeze-out) is in conflict with other gamma-ray observations, in particular those made of the Milky Way's satellite dwarf galaxies. Here we consider the effects of a two-component dark matter model which results in minimally boosted particles that must remain bou…
▽ More
The galactic center excess is a possible non-gravitational observation of dark matter; however, the canonical dark matter model (thermal freeze-out) is in conflict with other gamma-ray observations, in particular those made of the Milky Way's satellite dwarf galaxies. Here we consider the effects of a two-component dark matter model which results in minimally boosted particles that must remain bound to their host galaxy in order to produce an observational signal. This leads to a signal that is heavily dependent on galactic scale and can help reconcile the differences in the galactic center and dwarf galaxy measurements under the dark matter paradigm.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Artificial Intelligence Index Report 2024
Authors:
Nestor Maslej,
Loredana Fattorini,
Raymond Perrault,
Vanessa Parli,
Anka Reuel,
Erik Brynjolfsson,
John Etchemendy,
Katrina Ligett,
Terah Lyons,
James Manyika,
Juan Carlos Niebles,
Yoav Shoham,
Russell Wald,
Jack Clark
Abstract:
The 2024 Index is our most comprehensive to date and arrives at an important moment when AI's influence on society has never been more pronounced. This year, we have broadened our scope to more extensively cover essential trends such as technical advancements in AI, public perceptions of the technology, and the geopolitical dynamics surrounding its development. Featuring more original data than ev…
▽ More
The 2024 Index is our most comprehensive to date and arrives at an important moment when AI's influence on society has never been more pronounced. This year, we have broadened our scope to more extensively cover essential trends such as technical advancements in AI, public perceptions of the technology, and the geopolitical dynamics surrounding its development. Featuring more original data than ever before, this edition introduces new estimates on AI training costs, detailed analyses of the responsible AI landscape, and an entirely new chapter dedicated to AI's impact on science and medicine. The AI Index report tracks, collates, distills, and visualizes data related to artificial intelligence (AI). Our mission is to provide unbiased, rigorously vetted, broadly sourced data in order for policymakers, researchers, executives, journalists, and the general public to develop a more thorough and nuanced understanding of the complex field of AI. The AI Index is recognized globally as one of the most credible and authoritative sources for data and insights on artificial intelligence. Previous editions have been cited in major newspapers, including the The New York Times, Bloomberg, and The Guardian, have amassed hundreds of academic citations, and been referenced by high-level policymakers in the United States, the United Kingdom, and the European Union, among other places. This year's edition surpasses all previous ones in size, scale, and scope, reflecting the growing significance that AI is coming to hold in all of our lives.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
The Impacts of Data, Ordering, and Intrinsic Dimensionality on Recall in Hierarchical Navigable Small Worlds
Authors:
Owen Pendrigh Elliott,
Jesse Clark
Abstract:
Vector search systems, pivotal in AI applications, often rely on the Hierarchical Navigable Small Worlds (HNSW) algorithm. However, the behaviour of HNSW under real-world scenarios using vectors generated with deep learning models remains under-explored. Existing Approximate Nearest Neighbours (ANN) benchmarks and research typically has an over-reliance on simplistic datasets like MNIST or SIFT1M…
▽ More
Vector search systems, pivotal in AI applications, often rely on the Hierarchical Navigable Small Worlds (HNSW) algorithm. However, the behaviour of HNSW under real-world scenarios using vectors generated with deep learning models remains under-explored. Existing Approximate Nearest Neighbours (ANN) benchmarks and research typically has an over-reliance on simplistic datasets like MNIST or SIFT1M and fail to reflect the complexity of current use-cases. Our investigation focuses on HNSW's efficacy across a spectrum of datasets, including synthetic vectors tailored to mimic specific intrinsic dimensionalities, widely-used retrieval benchmarks with popular embedding models, and proprietary e-commerce image data with CLIP models. We survey the most popular HNSW vector databases and collate their default parameters to provide a realistic fixed parameterisation for the duration of the paper.
We discover that the recall of approximate HNSW search, in comparison to exact K Nearest Neighbours (KNN) search, is linked to the vector space's intrinsic dimensionality and significantly influenced by the data insertion sequence. Our methodology highlights how insertion order, informed by measurable properties such as the pointwise Local Intrinsic Dimensionality (LID) or known categories, can shift recall by up to 12 percentage points. We also observe that running popular benchmark datasets with HNSW instead of KNN can shift rankings by up to three positions for some models. This work underscores the need for more nuanced benchmarks and design considerations in develo** robust vector search systems using approximate vector search algorithms. This study presents a number of scenarios with varying real world applicability which aim to better increase understanding and future development of ANN algorithms and embedding
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Charge transfer and Spin-Valley locking in 4Hb-TaS$_{2}$
Authors:
Avior Almoalem,
Roni Gofman,
Yuval Nitzav,
Ilay Mangel,
Irena Feldman,
Jahyun Koo,
Federico Mazzola,
Jun Fujii,
Ivana Vobornik,
J. Sanchez-Barriga,
Oliver J. Clark,
Nicholas Clark Plumb,
Ming Shi,
Binghai Yan,
Amit Kanigel
Abstract:
4Hb-TaS$_2$ is a superconductor that exhibits unique characteristics such as time-reversal symmetry breaking, hidden magnetic memory, and topological edge modes. It is a naturally occurring heterostructure comprising of alternating layers of 1H-TaS$_2$ and 1T-TaS$_2$. The former is a well-known superconductor, while the latter is a correlated insulator with a possible non-trivial magnetic ground s…
▽ More
4Hb-TaS$_2$ is a superconductor that exhibits unique characteristics such as time-reversal symmetry breaking, hidden magnetic memory, and topological edge modes. It is a naturally occurring heterostructure comprising of alternating layers of 1H-TaS$_2$ and 1T-TaS$_2$. The former is a well-known superconductor, while the latter is a correlated insulator with a possible non-trivial magnetic ground state. In this study, we use angle resolved photoemission spectroscopy to investigate the normal state electronic structure of this unconventional superconductor. Our findings reveal that the band structure of 4H-TaS$_2$ fundamentally differs from that of its constituent materials. Specifically, we observe a significant charge transfer from the 1T layers to the 1H layers that drives the 1T layers away from half-filling. In addition, we find a substantial reduction in inter-layer coupling in 4Hb-TaS$_2$ compared to the coupling in 2H-TaS$_2$ that results in a pronounced spin-valley locking within 4Hb-TaS$_2$
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Assessment of the Role and Origin of S* in Orange Carotenoid Protein Photoconversion
Authors:
James P. Pidgeon,
George A. Sutherland,
Matthew S. Proctor,
Shuangqing Wang,
Dimitri Chekulaev,
Sayantan Bhattacharya,
Rahul Jayaprakash,
Andrew Hitchcock,
Ravi Kumar Venkatraman,
Matthew P. Johnson,
C. Neil Hunter,
Jenny Clark
Abstract:
The orange carotenoid protein (OCP) is the water-soluble mediator of non-photochemical quenching in cyanobacteria, a crucial photoprotective mechanism in response to excess illumination. OCP converts from a globular, inactive state (OCPo) to an extended, active conformation (OCPr) under high-light conditions, resulting in a concomitant redshift in the absorption of the bound carotenoid. Here, OCP…
▽ More
The orange carotenoid protein (OCP) is the water-soluble mediator of non-photochemical quenching in cyanobacteria, a crucial photoprotective mechanism in response to excess illumination. OCP converts from a globular, inactive state (OCPo) to an extended, active conformation (OCPr) under high-light conditions, resulting in a concomitant redshift in the absorption of the bound carotenoid. Here, OCP was trapped in either the active or inactive state by fixing each protein conformation in trehalose-sucrose glass. Glass-encapsulated OCPo did not convert under intense illumination and OCPr did not convert in darkness, allowing the optical properties of each conformation to be determined at room temperature. We measured pump wavelength-dependent transient absorption of OCPo in glass films and found that initial OCP photoproducts are still formed, despite the glass preventing completion of the photocycle. By comparison to the pump wavelength dependence of the OCPo to OCPr photoconversion yield in buffer, we show that the long-lived carotenoid singlet-like feature (S*) is associated with ground-state heterogeneity within OCPo, rather than triggering OCP photoconversion.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Design Editing for Offline Model-based Optimization
Authors:
Ye Yuan,
Youyuan Zhang,
Can Chen,
Haolun Wu,
Zixuan Li,
Jianmo Li,
James J. Clark,
Xue Liu
Abstract:
Offline model-based optimization (MBO) aims to maximize a black-box objective function using only an offline dataset of designs and scores. A prevalent approach involves training a conditional generative model on existing designs and their associated scores, followed by the generation of new designs conditioned on higher target scores. However, these newly generated designs often underperform due…
▽ More
Offline model-based optimization (MBO) aims to maximize a black-box objective function using only an offline dataset of designs and scores. A prevalent approach involves training a conditional generative model on existing designs and their associated scores, followed by the generation of new designs conditioned on higher target scores. However, these newly generated designs often underperform due to the lack of high-scoring training data. To address this challenge, we introduce a novel method, Design Editing for Offline Model-based Optimization (DEMO), which consists of two phases. In the first phase, termed pseudo-target distribution generation, we apply gradient ascent on the offline dataset using a trained surrogate model, producing a synthetic dataset where the predicted scores serve as new labels. A conditional diffusion model is subsequently trained on this synthetic dataset to capture a pseudo-target distribution, which enhances the accuracy of the conditional diffusion model in generating higher-scoring designs. Nevertheless, the pseudo-target distribution is susceptible to noise stemming from inaccuracies in the surrogate model, consequently predisposing the conditional diffusion model to generate suboptimal designs. We hence propose the second phase, existing design editing, to directly incorporate the high-scoring features from the offline dataset into design generation. In this phase, top designs from the offline dataset are edited by introducing noise, which are subsequently refined using the conditional diffusion model to produce high-scoring designs. Overall, high-scoring designs begin with inheriting high-scoring features from the second phase and are further refined with a more accurate conditional diffusion model in the first phase. Empirical evaluations on 7 offline MBO tasks show that DEMO outperforms various baseline methods.
△ Less
Submitted 26 May, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
TRAPUM search for pulsars in supernova remnants and pulsar wind nebulae -- I. Survey description and initial discoveries
Authors:
J. D. Turner,
B. W. Stappers,
E. Carli,
E. D. Barr,
W. Becker,
J. Behrend,
R. P. Breton,
S. Buchner,
M. Burgay,
D. J. Champion,
W. Chen,
C. J. Clark,
D. M. Horn,
E. F. Keane,
M. Kramer,
L. K ünkel,
L. Levin,
Y. P. Men,
P. V. Padmanabh,
A. Ridolfi,
V. Venkatraman Krishnan
Abstract:
We present the description and initial results of the TRAPUM (TRAnsients And PUlsars with MeerKAT) search for pulsars associated with supernova remnants (SNRs), pulsar wind nebulae and unidentified TeV emission. The list of sources to be targeted includes a large number of well-known candidate pulsar locations but also new candidate SNRs identified using a range of criteria. Using the 64-dish Meer…
▽ More
We present the description and initial results of the TRAPUM (TRAnsients And PUlsars with MeerKAT) search for pulsars associated with supernova remnants (SNRs), pulsar wind nebulae and unidentified TeV emission. The list of sources to be targeted includes a large number of well-known candidate pulsar locations but also new candidate SNRs identified using a range of criteria. Using the 64-dish MeerKAT radio telescope, we use an interferometric beamforming technique to tile the potential pulsar locations with coherent beams which we search for radio pulsations, above a signal-to-noise of 9, down to an average flux density upper limit of 30 $μ$Jy. This limit is target-dependent due to the contribution of the sky and nebula to the system temperature. Coherent beams are arranged to overlap at their 50 per cent power radius, so the sensitivity to pulsars is not degraded by more than this amount, though realistically averages around 65 per cent if every location in the beam is considered. We report the discovery of two new pulsars; PSR J1831$-$0941 is an adolescent pulsar likely to be the plerionic engine of the candidate PWN G20.0+0.0, and PSR J1818$-$1502 appears to be an old and faint pulsar that we serendipitously discovered near the centre of a SNR already hosting a compact central object. The survey holds importance for better understanding of neutron star birth rates and the energetics of young pulsars.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Transportability of Principal Causal Effects
Authors:
Justin M. Clark,
Kollin W. Rott,
James S. Hodges,
Jared D. Huling
Abstract:
Recent research in causal inference has made important progress in addressing challenges to the external validity of trial findings. Such methods weight trial participant data to more closely resemble the distribution of effect-modifying covariates in a well-defined target population. In the presence of participant non-adherence to study medication, these methods effectively transport an intention…
▽ More
Recent research in causal inference has made important progress in addressing challenges to the external validity of trial findings. Such methods weight trial participant data to more closely resemble the distribution of effect-modifying covariates in a well-defined target population. In the presence of participant non-adherence to study medication, these methods effectively transport an intention-to-treat effect that averages over heterogeneous compliance behaviors. In this paper, we develop a principal stratification framework to identify causal effects conditioning on both on compliance behavior and membership in the target population. We also develop non-parametric efficiency theory for and construct efficient estimators of such "transported" principal causal effects and characterize their finite-sample performance in simulation experiments. While this work focuses on treatment non-adherence, the framework is applicable to a broad class of estimands that target effects in clinically-relevant, possibly latent subsets of a target population.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays
Authors:
Andrei Chubarau,
Hyun** Yoo,
Tara Akhavan,
James Clark
Abstract:
Conventional image quality metrics (IQMs), such as PSNR and SSIM, are designed for perceptually uniform gamma-encoded pixel values and cannot be directly applied to perceptually non-uniform linear high-dynamic-range (HDR) colors. Similarly, most of the available datasets consist of standard-dynamic-range (SDR) images collected in standard and possibly uncontrolled viewing conditions. Popular pre-t…
▽ More
Conventional image quality metrics (IQMs), such as PSNR and SSIM, are designed for perceptually uniform gamma-encoded pixel values and cannot be directly applied to perceptually non-uniform linear high-dynamic-range (HDR) colors. Similarly, most of the available datasets consist of standard-dynamic-range (SDR) images collected in standard and possibly uncontrolled viewing conditions. Popular pre-trained neural networks are likewise intended for SDR inputs, restricting their direct application to HDR content. On the other hand, training HDR models from scratch is challenging due to limited available HDR data. In this work, we explore more effective approaches for training deep learning-based models for image quality assessment (IQA) on HDR data. We leverage networks pre-trained on SDR data (source domain) and re-target these models to HDR (target domain) with additional fine-tuning and domain adaptation. We validate our methods on the available HDR IQA datasets, demonstrating that models trained with our combined recipe outperform previous baselines, converge much quicker, and reliably generalize to HDR inputs.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking
Authors:
Tianyu Zhu,
Myong Chol Jung,
Jesse Clark
Abstract:
Contrastive learning has gained widespread adoption for retrieval tasks due to its minimal requirement for manual annotations. However, popular contrastive frameworks typically learn from binary relevance, making them ineffective at incorporating direct fine-grained rankings. In this paper, we curate a large-scale dataset featuring detailed relevance scores for each query-document pair to facilita…
▽ More
Contrastive learning has gained widespread adoption for retrieval tasks due to its minimal requirement for manual annotations. However, popular contrastive frameworks typically learn from binary relevance, making them ineffective at incorporating direct fine-grained rankings. In this paper, we curate a large-scale dataset featuring detailed relevance scores for each query-document pair to facilitate future research and evaluation. Subsequently, we propose Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking (GCL), which is designed to learn from fine-grained rankings beyond binary relevance scores. Our results show that GCL achieves a 94.5% increase in NDCG@10 for in-domain and 26.3 to 48.8% increases for cold-start evaluations, all relative to the CLIP baseline and involving ground truth rankings.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Exploring the MMRD Relation for Novae in M31
Authors:
J. Grace Clark,
Kamil Hornoch,
Allen W. Shafter,
Hana Kučáková,
Jan Vraštil,
Peter Kušnirák,
Marek Wolf
Abstract:
The results of a two decade long $R$-band photometric survey of novae in M31 are presented. From these data, $R$-band light curves have been determined for 180 novae with data sufficient for estimating peak brightness and subsequent rate of decline. The data show a weak correlation of peak brightness with fade rate consistent with the well-known Maximum Magnitude versus Rate of Decline (MMRD) rela…
▽ More
The results of a two decade long $R$-band photometric survey of novae in M31 are presented. From these data, $R$-band light curves have been determined for 180 novae with data sufficient for estimating peak brightness and subsequent rate of decline. The data show a weak correlation of peak brightness with fade rate consistent with the well-known Maximum Magnitude versus Rate of Decline (MMRD) relation. As generally appreciated for Galactic novae, the large scatter in the MMRD relation precludes its use in determining distances to individual novae. The novae at maximum light are distributed with standard deviation $σ=0.89$ mag about a mean $R$-band absolute magnitude given by $\langle M_R \rangle=-7.57\pm0.07$. The overall M31 luminosity distribution is in excellent agreement with that found for Galactic novae suggesting that the nova populations in M31 and the Galaxy are quite similar. The notion that all novae can be characterized by a standard luminosity 15 d after maximum light ($M_{15}$) is also explored. Surprisingly, the distribution of $M_{15}$ values is characterized by a standard deviation only slightly smaller than that for novae at maximum light and thus offers little promise for precise extragalactic distance determinations. A dozen faint and fast novae that are likely to be previously unidentified recurrent novae have been identified from their position in the MMRD plot and in the $M_{15}$ distribution.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Discovery and timing of ten new millisecond pulsars in the globular cluster Terzan 5
Authors:
P. V. Padmanabh,
S. M. Ransom,
P. C. C. Freire,
A. Ridolfi,
J. D. Taylor,
C. Choza,
C. J. Clark,
F. Abbate,
M. Bailes,
E. D. Barr,
S. Buchner,
M. Burgay,
M. E. DeCesar,
W. Chen,
A. Corongiu,
D. J. Champion,
A. Dutta,
M. Geyer,
J. W. T. Hessels,
M. Kramer,
A. Possenti,
I. H. Stairs,
B. W. Stappers,
V. Venkatraman Krishnan,
L. Vleeschower
, et al. (1 additional authors not shown)
Abstract:
We report the discovery of ten new pulsars in the globular cluster Terzan 5 as part of the Transients and Pulsars with MeerKAT (TRAPUM) Large Survey Project. We observed Terzan 5 at L-band (856--1712 MHz) with the MeerKAT radio telescope for four hours on two epochs, and performed acceleration searches of 45 out of 288 tied-array beams covering the core of the cluster. We obtained phase-connected…
▽ More
We report the discovery of ten new pulsars in the globular cluster Terzan 5 as part of the Transients and Pulsars with MeerKAT (TRAPUM) Large Survey Project. We observed Terzan 5 at L-band (856--1712 MHz) with the MeerKAT radio telescope for four hours on two epochs, and performed acceleration searches of 45 out of 288 tied-array beams covering the core of the cluster. We obtained phase-connected timing solutions for nine discoveries, covering nearly two decades of archival observations from the Green Bank Telescope for all but one. Highlights include PSR J1748$-$2446ao which is an eccentric ($e = 0.32$) wide-orbit (orbital period $P_{\rm b} = 57.55$ d) system. We were able to measure the rate of advance of periastron ($\dotω$) for this system allowing us to determine a total mass of $3.17 \pm \, 0.02\, \rm M_{\odot}$. With a minimum companion mass ($M_{\rm c}$) of $\sim 0.8\, \rm M_{\odot}$, PSR J1748$-$2446ao is a candidate double neutron star (DNS) system. If confirmed to be a DNS, it would be the fastest spinning pulsar ($P = 2.27$ ms) and the longest orbital period measured for any known DNS system. PSR J1748$-$2446ap has the second highest eccentricity for any recycled pulsar ($e \sim 0.905$) and for this system we can measure the total mass ($1.997 \pm 0.006\, \rm M_{\odot}$) and also estimate the individual pulsar and companion masses. PSR J1748$-$2446ar is an eclipsing redback (minimum $M_{\rm c} \sim 0.34\, \rm M_{\odot}$) system whose properties confirm it to be the counterpart to a previously published source identified in radio and X-ray imaging. With these discoveries, the total number of confirmed pulsars in Terzan 5 is 49, the highest for any globular cluster so far. These discoveries further enhance the rich set of pulsars known in Terzan 5 and provide scope for a deeper understanding of binary stellar evolution, cluster dynamics and ensemble population studies.
△ Less
Submitted 19 June, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Explore until Confident: Efficient Exploration for Embodied Question Answering
Authors:
Allen Z. Ren,
Jaden Clark,
Anushri Dixit,
Masha Itkina,
Anirudha Majumdar,
Dorsa Sadigh
Abstract:
We consider the problem of Embodied Question Answering (EQA), which refers to settings where an embodied agent such as a robot needs to actively explore an environment to gather information until it is confident about the answer to a question. In this work, we leverage the strong semantic reasoning capabilities of large vision-language models (VLMs) to efficiently explore and answer such questions…
▽ More
We consider the problem of Embodied Question Answering (EQA), which refers to settings where an embodied agent such as a robot needs to actively explore an environment to gather information until it is confident about the answer to a question. In this work, we leverage the strong semantic reasoning capabilities of large vision-language models (VLMs) to efficiently explore and answer such questions. However, there are two main challenges when using VLMs in EQA: they do not have an internal memory for map** the scene to be able to plan how to explore over time, and their confidence can be miscalibrated and can cause the robot to prematurely stop exploration or over-explore. We propose a method that first builds a semantic map of the scene based on depth information and via visual prompting of a VLM - leveraging its vast knowledge of relevant regions of the scene for exploration. Next, we use conformal prediction to calibrate the VLM's question answering confidence, allowing the robot to know when to stop exploration - leading to a more calibrated and efficient exploration strategy. To test our framework in simulation, we also contribute a new EQA dataset with diverse, realistic human-robot scenarios and scenes built upon the Habitat-Matterport 3D Research Dataset (HM3D). Both simulated and real robot experiments show our proposed approach improves the performance and efficiency over baselines that do no leverage VLM for exploration or do not calibrate its confidence. Webpage with experiment videos and code: https://explore-eqa.github.io/
△ Less
Submitted 26 May, 2024; v1 submitted 23 March, 2024;
originally announced March 2024.
-
Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots
Authors:
Connor Lee,
Saraswati Soedarmadji,
Matthew Anderson,
Anthony J. Clark,
Soon-Jo Chung
Abstract:
We present a new method to automatically generate semantic segmentation annotations for thermal imagery captured from an aerial vehicle by utilizing satellite-derived data products alongside onboard global positioning and attitude estimates. This new capability overcomes the challenge of develo** thermal semantic perception algorithms for field robots due to the lack of annotated thermal field d…
▽ More
We present a new method to automatically generate semantic segmentation annotations for thermal imagery captured from an aerial vehicle by utilizing satellite-derived data products alongside onboard global positioning and attitude estimates. This new capability overcomes the challenge of develo** thermal semantic perception algorithms for field robots due to the lack of annotated thermal field datasets and the time and costs of manual annotation, enabling precise and rapid annotation of thermal data from field collection efforts at a massively-parallelizable scale. By incorporating a thermal-conditioned refinement step with visual foundation models, our approach can produce highly-precise semantic segmentation labels using low-resolution satellite land cover data for little-to-no cost. It achieves 98.5% of the performance from using costly high-resolution options and demonstrates between 70-160% improvement over popular zero-shot semantic segmentation methods based on large vision-language models currently used for generating annotations for RGB imagery. Code will be available at: https://github.com/connorlee77/aerial-auto-segment.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Deep Few-view High-resolution Photon-counting Extremity CT at Halved Dose for a Clinical Trial
Authors:
Mengzhou Li,
Chuang Niu,
Ge Wang,
Maya R Amma,
Krishna M Chapagain,
Stefan Gabrielson,
Andrew Li,
Kevin Jonker,
Niels de Ruiter,
Jennifer A Clark,
Phil Butler,
Anthony Butler,
Hengyong Yu
Abstract:
The latest X-ray photon-counting computed tomography (PCCT) for extremity allows multi-energy high-resolution (HR) imaging for tissue characterization and material decomposition. However, both radiation dose and imaging speed need improvement for contrast-enhanced and other studies. Despite the success of deep learning methods for 2D few-view reconstruction, applying them to HR volumetric reconstr…
▽ More
The latest X-ray photon-counting computed tomography (PCCT) for extremity allows multi-energy high-resolution (HR) imaging for tissue characterization and material decomposition. However, both radiation dose and imaging speed need improvement for contrast-enhanced and other studies. Despite the success of deep learning methods for 2D few-view reconstruction, applying them to HR volumetric reconstruction of extremity scans for clinical diagnosis has been limited due to GPU memory constraints, training data scarcity, and domain gap issues. In this paper, we propose a deep learning-based approach for PCCT image reconstruction at halved dose and doubled speed in a New Zealand clinical trial. Particularly, we present a patch-based volumetric refinement network to alleviate the GPU memory limitation, train network with synthetic data, and use model-based iterative refinement to bridge the gap between synthetic and real-world data. The simulation and phantom experiments demonstrate consistently improved results under different acquisition conditions on both in- and off-domain structures using a fixed network. The image quality of 8 patients from the clinical trial are evaluated by three radiologists in comparison with the standard image reconstruction with a full-view dataset. It is shown that our proposed approach is essentially identical to or better than the clinical benchmark in terms of diagnostic image quality scores. Our approach has a great potential to improve the safety and efficiency of PCCT without compromising image quality.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Bayesian analysis of verbal autopsy data using factor models with age- and sex-dependent associations between symptoms
Authors:
Tsuyoshi Kunihama,
Zehang Richard Li,
Samuel J. Clark,
Tyler H. McCormick
Abstract:
Verbal autopsies (VAs) are extensively used to investigate the population-level distributions of deaths by cause in low-resource settings without well-organized vital statistics systems. Computer-based methods are often adopted to assign causes of death to deceased individuals based on the interview responses of their family members or caregivers. In this article, we develop a new Bayesian approac…
▽ More
Verbal autopsies (VAs) are extensively used to investigate the population-level distributions of deaths by cause in low-resource settings without well-organized vital statistics systems. Computer-based methods are often adopted to assign causes of death to deceased individuals based on the interview responses of their family members or caregivers. In this article, we develop a new Bayesian approach that extracts information about cause-of-death distributions from VA data considering the age- and sex-related variation in the associations between symptoms. Its performance is compared with that of existing approaches using gold-standard data from the Population Health Metrics Research Consortium. In addition, we compute the relevance of predictors to causes of death based on information-theoretic measures.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
A targeted radio pulsar survey of redback candidates with MeerKAT
Authors:
T. Thongmeearkom,
C. J. Clark,
R. P. Breton,
M. Burgay,
L. Nieder,
P. C. C. Freire,
E. D. Barr,
B. W. Stappers,
S. M. Ransom,
S. Buchner,
F. Calore,
D. J. Champion,
I. Cognard,
J. -M. Grießmeier,
M. Kramer,
L. Levin,
P. V. Padmanabh,
A. Possenti,
A. Ridolfi,
V. Venkatraman Krishnan,
L. Vleeschower
Abstract:
Redbacks are millisecond pulsar binaries with low mass, irradiated companions. These systems have a rich phenomenology that can be used to probe binary evolution models, pulsar wind physics, and the neutron star mass distribution. A number of high-confidence redback candidates have been identified through searches for variable optical and X-ray sources within the localisation regions of unidentifi…
▽ More
Redbacks are millisecond pulsar binaries with low mass, irradiated companions. These systems have a rich phenomenology that can be used to probe binary evolution models, pulsar wind physics, and the neutron star mass distribution. A number of high-confidence redback candidates have been identified through searches for variable optical and X-ray sources within the localisation regions of unidentified but pulsar-like Fermi-LAT gamma-ray sources. However, these candidates remain unconfirmed until pulsations are detected. As part of the TRAPUM project, we searched for radio pulsations from six of these redback candidates with MeerKAT. We discovered three new radio millisecond pulsars, PSRs J0838$-$2527, J0955$-$3947 and J2333$-$5526, confirming their redback nature. PSR J0838$-$2827 remained undetected for two years after our discovery despite repeated observations, likely due to evaporated material absorbing the radio emission for long periods of time. While, to our knowledge, this system has not undergone a transition to an accreting state, the disappearance, likely caused by extreme eclipses, illustrates the transient nature of spider pulsars and the heavy selection bias in uncovering their radio population. Radio timing enabled the detection of gamma-ray pulsations from all three pulsars, from which we obtained 15-year timing solutions. All of these sources exhibit complex orbital period variations consistent with gravitational quadrupole moment variations in the companion stars. These timing solutions also constrain the binary mass ratios, allowing us to narrow down the pulsar masses. We find that PSR J2333$-$5526 may have a neutron star mass in excess of 2 M$_{\odot}$.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
An optically defined phononic crystal defect
Authors:
Thomas J. Clark,
Simon Bernard,
Jiaxing Ma,
Vincent Dumont,
Jack C. Sankey
Abstract:
We demonstrate a mechanical crystal with an optically programmable defect mode. By applying an optical spring to a single unit cell of a phononic crystal membrane, we smoothly transfer a single mechanical mode into the bandgap, thereby localizing its spatial profile from one spanning the entire crystal to one confined within a few unit cells. This localization is evidenced by an enhanced mechanica…
▽ More
We demonstrate a mechanical crystal with an optically programmable defect mode. By applying an optical spring to a single unit cell of a phononic crystal membrane, we smoothly transfer a single mechanical mode into the bandgap, thereby localizing its spatial profile from one spanning the entire crystal to one confined within a few unit cells. This localization is evidenced by an enhanced mechanical frequency shift commensurate with a 37-fold reduction in the mode's participating mass. Our results lay groundwork for a new class of optomechanical systems that control mechanical mode profile and participating mass.
△ Less
Submitted 28 June, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing
Authors:
Youyuan Zhang,
Xuan Ju,
James J. Clark
Abstract:
Diffusion models have demonstrated remarkable capabilities in text-to-image and text-to-video generation, opening up possibilities for video editing based on textual input. However, the computational cost associated with sequential sampling in diffusion models poses challenges for efficient video editing. Existing approaches relying on image generation models for video editing suffer from time-con…
▽ More
Diffusion models have demonstrated remarkable capabilities in text-to-image and text-to-video generation, opening up possibilities for video editing based on textual input. However, the computational cost associated with sequential sampling in diffusion models poses challenges for efficient video editing. Existing approaches relying on image generation models for video editing suffer from time-consuming one-shot fine-tuning, additional condition extraction, or DDIM inversion, making real-time applications impractical. In this work, we propose FastVideoEdit, an efficient zero-shot video editing approach inspired by Consistency Models (CMs). By leveraging the self-consistency property of CMs, we eliminate the need for time-consuming inversion or additional condition extraction, reducing editing time. Our method enables direct map** from source video to target video with strong preservation ability utilizing a special variance schedule. This results in improved speed advantages, as fewer sampling steps can be used while maintaining comparable generation quality. Experimental results validate the state-of-the-art performance and speed advantages of FastVideoEdit across evaluation metrics encompassing editing speed, temporal consistency, and text-video alignment.
△ Less
Submitted 10 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1092 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 14 June, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Ultralight vector dark matter search using data from the KAGRA O3GK run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi
, et al. (1778 additional authors not shown)
Abstract:
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…
▽ More
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Multitask Multilingual Model Adaptation with Featurized Low-Rank Mixtures
Authors:
Chu-Cheng Lin,
Xinyi Wang,
Jonathan H. Clark,
Han Lu,
Yun Zhu,
Chenxi Whitehouse,
Hongkun Yu
Abstract:
Adapting pretrained large language models (LLMs) to various downstream tasks in tens or hundreds of human languages is computationally expensive. Parameter-efficient fine-tuning (PEFT) significantly reduces the adaptation cost, by tuning only a small amount of parameters. However, directly applying PEFT methods such as LoRA (Hu et al., 2022) on diverse dataset mixtures could lead to suboptimal per…
▽ More
Adapting pretrained large language models (LLMs) to various downstream tasks in tens or hundreds of human languages is computationally expensive. Parameter-efficient fine-tuning (PEFT) significantly reduces the adaptation cost, by tuning only a small amount of parameters. However, directly applying PEFT methods such as LoRA (Hu et al., 2022) on diverse dataset mixtures could lead to suboptimal performance due to limited parameter capacity and negative interference among different datasets. In this work, we propose Featurized Low-rank Mixtures (FLix), a novel PEFT method designed for effective multitask multilingual tuning. FLix associates each unique dataset feature, such as the dataset's language or task, with its own low-rank weight update parameters. By composing feature-specific parameters for each dataset, FLix can accommodate diverse dataset mixtures and generalize better to unseen datasets. Our experiments show that FLix leads to significant improvements over a variety of tasks for both supervised learning and zero-shot settings using different training data mixtures.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Towards Improved Uncertainty Quantification of Stochastic Epidemic Models Using Sequential Monte Carlo
Authors:
Arindam Fadikar,
Abby Stevens,
Nicholson Collier,
Kok Ben Toh,
Olga Morozova,
Anna Hotton,
Jared Clark,
David Higdon,
Jonathan Ozik
Abstract:
Sequential Monte Carlo (SMC) algorithms represent a suite of robust computational methodologies utilized for state estimation and parameter inference within dynamical systems, particularly in real-time or online environments where data arrives sequentially over time. In this research endeavor, we propose an integrated framework that combines a stochastic epidemic simulator with a sequential import…
▽ More
Sequential Monte Carlo (SMC) algorithms represent a suite of robust computational methodologies utilized for state estimation and parameter inference within dynamical systems, particularly in real-time or online environments where data arrives sequentially over time. In this research endeavor, we propose an integrated framework that combines a stochastic epidemic simulator with a sequential importance sampling (SIS) scheme to dynamically infer model parameters, which evolve due to social as well as biological processes throughout the progression of an epidemic outbreak and are also influenced by evolving data measurement bias. Through iterative updates of a set of weighted simulated trajectories based on observed data, this framework enables the estimation of posterior distributions for these parameters, thereby capturing their temporal variability and associated uncertainties. Through simulation studies, we showcase the efficacy of SMC in accurately tracking the evolving dynamics of epidemics while appropriately accounting for uncertainties. Moreover, we delve into practical considerations and challenges inherent in implementing SMC for parameter estimation within dynamic epidemiological settings, areas where the substantial computational capabilities of high-performance computing resources can be usefully brought to bear.
△ Less
Submitted 6 March, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning
Authors:
Joseph D. Clark,
Xuenan Mi,
Douglas A. Mitchell,
Diwakar Shukla
Abstract:
Ribosomally synthesized and post-translationally modified peptide (RiPP) biosynthetic enzymes often exhibit promiscuous substrate preferences that cannot be reduced to simple rules. Large language models are promising tools for predicting such peptide fitness landscapes. However, state-of-the-art protein language models are trained on relatively few peptide sequences. A previous study comprehensiv…
▽ More
Ribosomally synthesized and post-translationally modified peptide (RiPP) biosynthetic enzymes often exhibit promiscuous substrate preferences that cannot be reduced to simple rules. Large language models are promising tools for predicting such peptide fitness landscapes. However, state-of-the-art protein language models are trained on relatively few peptide sequences. A previous study comprehensively profiled the peptide substrate preferences of LazBF (a two-component serine dehydratase) and LazDEF (a three-component azole synthetase) from the lactazole biosynthetic pathway. We demonstrated that masked language modeling of LazBF substrate preferences produced language model embeddings that improved downstream classification models of both LazBF and LazDEF substrates. Similarly, masked language modeling of LazDEF substrate preferences produced embeddings that improved the performance of classification models of both LazBF and LazDEF substrates. Our results suggest that the models learned functional forms that are transferable between distinct enzymatic transformations that act within the same biosynthetic pathway. Our transfer learning method improved performance and data efficiency in data-scarce scenarios. We then fine-tuned models on each data set and showed that the fine-tuned models provided interpretable insight that we anticipate will facilitate the design of substrate libraries that are compatible with desired RiPP biosynthetic pathways.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
A 350-MHz Green Bank Telescope Survey of Unassociated Fermi LAT Sources: Discovery and Timing of Ten Millisecond Pulsars
Authors:
P. Bangale,
B. Bhattacharyya,
F. Camilo,
C. J. Clark,
I. Cognard,
M. E. DeCesar,
E. C. Ferrara,
P. Gentile,
L. Guillemot,
J. W. T. Hessels,
T. J. Johnson,
M. Kerr,
M. A. McLaughlin,
L. Nieder,
S. M. Ransom,
P. S. Ray,
M. S. E. Roberts,
J. Roy,
S. Sanpa-Arsa,
G. Theureau,
M. T. Wolff
Abstract:
We have searched for radio pulsations towards 49 Fermi Large Area Telescope (LAT) 1FGL Catalog $γ$-ray sources using the Green Bank Telescope at 350 MHz. We detected 18 millisecond pulsars (MSPs) in blind searches of the data; 10 of these were discoveries unique to our survey. Sixteen are binaries, with eight having short orbital periods $P_B < 1$ day. No radio pulsations from young pulsars were d…
▽ More
We have searched for radio pulsations towards 49 Fermi Large Area Telescope (LAT) 1FGL Catalog $γ$-ray sources using the Green Bank Telescope at 350 MHz. We detected 18 millisecond pulsars (MSPs) in blind searches of the data; 10 of these were discoveries unique to our survey. Sixteen are binaries, with eight having short orbital periods $P_B < 1$ day. No radio pulsations from young pulsars were detected, although three targets are coincident with apparently radio-quiet $γ$-ray pulsars discovered in LAT data. Here, we give an overview of the survey and present radio and $γ$-ray timing results for the 10 MSPs discovered. These include the only isolated MSP discovered in our survey and six short-$P_B$ binary MSPs. Of these, three have very low-mass companions ($M_c$ $\ll$ 0.1M$_{\odot}$) and hence belong to the class of black widow pulsars. Two have more massive, non-degenerate companions with extensive radio eclipses and orbitally modulated X-ray emission consistent with the redback class. Significant $γ$-ray pulsations have been detected from nine of the discoveries. This survey and similar efforts suggest that the majority of Galactic $γ$-ray sources at high Galactic latitudes are either MSPs or relatively nearby non-recycled pulsars, with the latter having on average a much smaller radio/$γ$-ray beaming ratio as compared to MSPs. It also confirms that past surveys suffered from an observational bias against finding short-$P_B$ MSP systems.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Properties of the cone of polynomials of fixed degree that preserve nonnegative matrices
Authors:
Jared J. L. Brannan,
Benjamin J. Clark
Abstract:
As was detailed by Loewy and London in [Linear and Multilinear Algebra 6 (1978/79), no.~1, 83--90], the cone of polynomials that preserve the nonnegativity of matrices may play an important role in the solution to the nonnegative inverse eigenvalue problem. In this paper, we start by showing the cone generated by polynomials of degree greater than or equal to $2n$ that preserve nonnegative matrice…
▽ More
As was detailed by Loewy and London in [Linear and Multilinear Algebra 6 (1978/79), no.~1, 83--90], the cone of polynomials that preserve the nonnegativity of matrices may play an important role in the solution to the nonnegative inverse eigenvalue problem. In this paper, we start by showing the cone generated by polynomials of degree greater than or equal to $2n$ that preserve nonnegative matrices of order $n$ is non-polyhedral. Then it is shown, that a polynomial that preserves nonnegative matrices of order $n$ can have all the center terms be negative. These polynomials can also have the largest term, in absolute value, be arbitrarily negative with the remaining coefficients being one. We explore properties of the measure of the cone when restricted to the unit sphere and prove some initial bounds of that volume.
△ Less
Submitted 31 March, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Determination of the spins and parities for the 0$_{4}^{+}$ and 0$_{5}^{+}$ states in $^{100}$Zr
Authors:
J. Wu,
M. P. Carpenter,
F. G. Kondev,
R. V. F. Janssens,
S. Zhu,
E. A. McCutchan,
A. D. Ayangeakaa,
J. Chen,
J. Clark,
D. J. Hartley,
T. Lauritsen,
N. Pietralla,
G. Savard,
D. Seweryniak,
V. Werner
Abstract:
Two 0$^{+}$ states at 1294.5 and 1774.0 keV, together with three 2$^{+}$ and one 4$^{+}$ levels, were identified or unambiguously spin-parity assigned for the first time in $^{100}$Zr utilizing $γ$-ray spectroscopy and $γ$-$γ$ angular correlation techniques with the Gammasphere spectrometer, following the $β^{-}$ decay of neutron-rich, mass separated $^{100,100m}$Y isotopes. Comparisons with recen…
▽ More
Two 0$^{+}$ states at 1294.5 and 1774.0 keV, together with three 2$^{+}$ and one 4$^{+}$ levels, were identified or unambiguously spin-parity assigned for the first time in $^{100}$Zr utilizing $γ$-ray spectroscopy and $γ$-$γ$ angular correlation techniques with the Gammasphere spectrometer, following the $β^{-}$ decay of neutron-rich, mass separated $^{100,100m}$Y isotopes. Comparisons with recent Monte Carlo Shell-Model (MCSM) calculations indicate that these two states are candidates for the bandhead of a sequence in a shape-coexisting spherical minimum predicted to be located around $\approx$1500 keV. According to the measured relative B(E2)$_{relative}$ transition probabilities, the 0$_{5}^{+}$ state exhibits decay properties which more closely align with those predicted for a spherical shape, while the 0$_{4}^{+}$ level is suggested to be associated with a weakly-deformed shape similar to one related to the 0$_{2}^{+}$ state.
△ Less
Submitted 5 February, 2024; v1 submitted 4 February, 2024;
originally announced February 2024.
-
Faster Inference of Integer SWIN Transformer by Removing the GELU Activation
Authors:
Mohammadreza Tayaranian,
Seyyed Hasan Mozafari,
James J. Clark,
Brett Meyer,
Warren Gross
Abstract:
SWIN transformer is a prominent vision transformer model that has state-of-the-art accuracy in image classification tasks. Despite this success, its unique architecture causes slower inference compared with similar deep neural networks. Integer quantization of the model is one of the methods used to improve its inference latency. However, state-of-the-art has not been able to fully quantize the mo…
▽ More
SWIN transformer is a prominent vision transformer model that has state-of-the-art accuracy in image classification tasks. Despite this success, its unique architecture causes slower inference compared with similar deep neural networks. Integer quantization of the model is one of the methods used to improve its inference latency. However, state-of-the-art has not been able to fully quantize the model. In this work, we improve upon the inference latency of the state-of-the-art methods by removing the floating-point operations, which are associated with the GELU activation in Swin Transformer. While previous work proposed to replace the non-integer operations with linear approximation functions, we propose to replace GELU with ReLU activation. The advantage of ReLU over previous methods is its low memory and computation complexity. We use iterative knowledge distillation to compensate for the lost accuracy due to replacing GELU with ReLU. We quantize our GELU-less SWIN transformer and show that on an RTX 4090 NVIDIA GPU we can improve the inference latency of the quantized SWIN transformer by at least $11\%$ while maintaining an accuracy drop of under $0.5\%$ on the ImageNet evaluation dataset.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
AdCorDA: Classifier Refinement via Adversarial Correction and Domain Adaptation
Authors:
Lulan Shen,
Ali Edalati,
Brett Meyer,
Warren Gross,
James J. Clark
Abstract:
This paper describes a simple yet effective technique for refining a pretrained classifier network. The proposed AdCorDA method is based on modification of the training set and making use of the duality between network weights and layer inputs. We call this input space training. The method consists of two stages - adversarial correction followed by domain adaptation. Adversarial correction uses ad…
▽ More
This paper describes a simple yet effective technique for refining a pretrained classifier network. The proposed AdCorDA method is based on modification of the training set and making use of the duality between network weights and layer inputs. We call this input space training. The method consists of two stages - adversarial correction followed by domain adaptation. Adversarial correction uses adversarial attacks to correct incorrect training-set classifications. The incorrectly classified samples of the training set are removed and replaced with the adversarially corrected samples to form a new training set, and then, in the second stage, domain adaptation is performed back to the original training set. Extensive experimental validations show significant accuracy boosts of over 5% on the CIFAR-100 dataset. The technique can be straightforwardly applied to refinement of weight-quantized neural networks, where experiments show substantial enhancement in performance over the baseline. The adversarial correction technique also results in enhanced robustness to adversarial attacks.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
Robustness to distribution shifts of compressed networks for edge devices
Authors:
Lulan Shen,
Ali Edalati,
Brett Meyer,
Warren Gross,
James J. Clark
Abstract:
It is necessary to develop efficient DNNs deployed on edge devices with limited computation resources. However, the compressed networks often execute new tasks in the target domain, which is different from the source domain where the original network is trained. It is important to investigate the robustness of compressed networks in two types of data distribution shifts: domain shifts and adversar…
▽ More
It is necessary to develop efficient DNNs deployed on edge devices with limited computation resources. However, the compressed networks often execute new tasks in the target domain, which is different from the source domain where the original network is trained. It is important to investigate the robustness of compressed networks in two types of data distribution shifts: domain shifts and adversarial perturbations. In this study, we discover that compressed models are less robust to distribution shifts than their original networks. Interestingly, larger networks are more vulnerable to losing robustness than smaller ones, even when they are compressed to a similar size as the smaller networks. Furthermore, compact networks obtained by knowledge distillation are much more robust to distribution shifts than pruned networks. Finally, post-training quantization is a reliable method for achieving significant robustness to distribution shifts, and it outperforms both pruned and distilled models in terms of robustness.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Mass estimates from optical modelling of the new TRAPUM redback PSR J1910-5320
Authors:
O. G. Dodge,
R. P. Breton,
C. J. Clark,
M. Burgay,
J. Strader,
K. -Y. Au,
E. D. Barr,
S. Buchner,
V. S. Dhillon,
E. C. Ferrara,
P. C. C. Freire,
J. -M. Griessmeier,
M. R. Kennedy,
M. Kramer,
K. -L. Li,
P. V. Padmanabh,
A. Phosrisom,
B. W. Stappers,
S. J. Swihart,
T. Thongmeearkom
Abstract:
Spider pulsars continue to provide promising candidates for neutron star mass measurements. Here we present the discovery of PSR~J1910$-$5320, a new millisecond pulsar discovered in a MeerKAT observation of an unidentified \textit{Fermi}-LAT gamma-ray source. This pulsar is coincident with a recently identified candidate redback binary, independently discovered through its periodic optical flux an…
▽ More
Spider pulsars continue to provide promising candidates for neutron star mass measurements. Here we present the discovery of PSR~J1910$-$5320, a new millisecond pulsar discovered in a MeerKAT observation of an unidentified \textit{Fermi}-LAT gamma-ray source. This pulsar is coincident with a recently identified candidate redback binary, independently discovered through its periodic optical flux and radial velocity. New multi-color optical light curves obtained with ULTRACAM/NTT in combination with MeerKAT timing and updated SOAR/Goodman spectroscopic radial velocity measurements allow a mass constraint for PSR~J1910$-$5320. \texttt{Icarus} optical light curve modelling, with streamlined radial velocity fitting, constrains the orbital inclination and companion velocity, unlocking the binary mass function given the precise radio ephemeris. Our modelling aims to unite the photometric and spectroscopic measurements available by fitting each simultaneously to the same underlying physical model, ensuring self-consistency. This targets centre-of-light radial velocity corrections necessitated by the irradiation endemic to spider systems. Depending on the gravity darkening prescription used, we find a moderate neutron star mass of either $1.6\pm0.2$ or $1.4\pm0.2$ $M_\odot$. The companion mass of either $0.45\pm0.04$ or $0.43^{+0.04}_{-0.03}$ $M_\odot$ also further confirms PSR~J1910$-$5320 as an irradiated redback spider pulsar.radiated redback spider pulsar.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
Authors:
Evan Hubinger,
Carson Denison,
Jesse Mu,
Mike Lambert,
Meg Tong,
Monte MacDiarmid,
Tamera Lanham,
Daniel M. Ziegler,
Tim Maxwell,
Newton Cheng,
Adam Jermyn,
Amanda Askell,
Ansh Radhakrishnan,
Cem Anil,
David Duvenaud,
Deep Ganguli,
Fazl Barez,
Jack Clark,
Kamal Ndousse,
Kshitij Sachan,
Michael Sellitto,
Mrinank Sharma,
Nova DasSarma,
Roger Grosse,
Shauna Kravec
, et al. (14 additional authors not shown)
Abstract:
Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy, could we detect it and remove it using current state-of-the-art safety training techniques? To study this question, we construct proof-of-concept exa…
▽ More
Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy, could we detect it and remove it using current state-of-the-art safety training techniques? To study this question, we construct proof-of-concept examples of deceptive behavior in large language models (LLMs). For example, we train models that write secure code when the prompt states that the year is 2023, but insert exploitable code when the stated year is 2024. We find that such backdoor behavior can be made persistent, so that it is not removed by standard safety training techniques, including supervised fine-tuning, reinforcement learning, and adversarial training (eliciting unsafe behavior and then training to remove it). The backdoor behavior is most persistent in the largest models and in models trained to produce chain-of-thought reasoning about deceiving the training process, with the persistence remaining even when the chain-of-thought is distilled away. Furthermore, rather than removing backdoors, we find that adversarial training can teach models to better recognize their backdoor triggers, effectively hiding the unsafe behavior. Our results suggest that, once a model exhibits deceptive behavior, standard techniques could fail to remove such deception and create a false impression of safety.
△ Less
Submitted 17 January, 2024; v1 submitted 10 January, 2024;
originally announced January 2024.
-
4XMM~J182531.5$-$144036: A new persistent Be/X-ray binary found within the \emph{XMM-Newton} serendipitous survey
Authors:
A. B. Mason,
A. J. Norton,
J. S. Clark,
S. A. Farrell,
A. J. Gosling
Abstract:
We aim to investigate the nature of time-variable X-ray sources detected in the {\it XMM-Newton} serendipitous survey. The X-ray light curves of objects in the {\it XMM-Newton} serendipitous survey were searched for variability and coincident serendipitous sources observed by {\it Chandra} were also investigated. Subsequent infrared spectroscopy of the counterparts to the X-ray objects that were i…
▽ More
We aim to investigate the nature of time-variable X-ray sources detected in the {\it XMM-Newton} serendipitous survey. The X-ray light curves of objects in the {\it XMM-Newton} serendipitous survey were searched for variability and coincident serendipitous sources observed by {\it Chandra} were also investigated. Subsequent infrared spectroscopy of the counterparts to the X-ray objects that were identified using UKIDSS was carried out using {\it ISAAC} on the VLT. We found that the object 4XMM~J182531.5--144036 detected in the {\it XMM-Newton} serendipitous survey in April 2008 was also detected by {\it Chandra} as CXOU~J182531.4--144036 in July 2004. Both observations reveal a hard X-ray source displaying a coherent X-ray pulsation at a period of 781~s. The source position is coincident with a $K=14$ mag infrared object whose spectrum exhibits strong HeI and Br$γ$ emission lines and an infrared excess above that of early B-type dwarf or giant stars. We conclude that 4XMM~J182531.5--144036 is a Be/X-ray binary pulsar exhibiting persistent X-ray emission and is likely in a long period, low eccentricity orbit, similar to X Per.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
Polynomials that preserve nonnegative monomial matrices
Authors:
Benjamin J. Clark,
Pietro Paparella
Abstract:
A recently-established necessary condition for polynomials that preserve the class of entrywise nonnegative matrices of a fixed order is shown to be necessary and sufficient for the class of nonnegative monomial matrices. Along the way, we provide a formula for computing an arbitrary power of a monomial matrix and a formula for computing the polynomial of a nonnegative monomial matrix.
A recently-established necessary condition for polynomials that preserve the class of entrywise nonnegative matrices of a fixed order is shown to be necessary and sufficient for the class of nonnegative monomial matrices. Along the way, we provide a formula for computing an arbitrary power of a monomial matrix and a formula for computing the polynomial of a nonnegative monomial matrix.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Anisotropic skyrmion and multi-$q$ spin dynamics in centrosymmetric Gd$_2$PdSi$_3$
Authors:
M. Gomilšek,
T. J. Hicken,
M. N. Wilson,
K. J. A. Franke,
B. M. Huddart,
A. Štefančič,
S. J. R. Holt,
G. Balakrishnan,
D. A. Mayoh,
M. T. Birch,
S. H. Moody,
H. Luetkens,
Z. Guguchia,
M. T. F. Telling,
P. J. Baker,
S. J. Clark,
T. Lancaster
Abstract:
Skyrmions are particle-like vortices of magnetization with non-trivial topology, which are usually stabilized by Dzyaloshinskii-Moriya interactions (DMI) in noncentrosymmetric bulk materials. Exceptions are centrosymmetric Gd- and Eu-based skyrmion-lattice (SkL) hosts with net-zero DMI, where both the SkL stabilization mechanisms and magnetic ground states remain controversial. We address these by…
▽ More
Skyrmions are particle-like vortices of magnetization with non-trivial topology, which are usually stabilized by Dzyaloshinskii-Moriya interactions (DMI) in noncentrosymmetric bulk materials. Exceptions are centrosymmetric Gd- and Eu-based skyrmion-lattice (SkL) hosts with net-zero DMI, where both the SkL stabilization mechanisms and magnetic ground states remain controversial. We address these by investigating both static and dynamic spin properties of the centrosymmetric SkL host Gd$_2$PdSi$_3$ using muon spectroscopy ($μ$SR). We find that spin fluctuations in its non-coplanar SkL phase are highly anisotropic, implying that spin anisotropy plays a prominent role in stabilizing this phase. We also observe strongly-anisotropic spin dynamics in the ground-state (IC-1) incommensurate magnetic phase of the material, indicating that it is a meron-like multi-$q$ structure. In contrast, the higher-field, coplanar IC-2 phase is found to be single-$q$ with nearly-isotropic spin dynamics.
△ Less
Submitted 13 March, 2024; v1 submitted 28 December, 2023;
originally announced December 2023.
-
Precise Mass Measurements of $A=133$ Isobars with the Canadian Penning Trap: Resolving the $Q_{β^-}$ anomaly at $^{133}$Te
Authors:
A. A. Valverde,
F. G. Kondev,
B. Liu,
D. Ray,
M. Brodeur,
D. P. Burdette,
N. Callahan,
A. Cannon,
J. A. Clark,
D. E. M. Hoff,
R. Orford,
W. S. Porter,
K. S. Sharma,
L. Varriano
Abstract:
We report precision mass measurements of $^{133}$Sb, $^{133g,m}$Te, and $^{133g,m}$I, produced at CARIBU at Argonne National Laboratory's ATLAS facility and measured using the Canadian Penning Trap mass spectrometer. These masses clarify an anomaly in the $^{133}$Te $β$-decay. The masses reported in the 2020 Atomic Mass Evaluation (M. Wang et al., 2021) produce $Q_{β^-}(^{133}$Te)=2920(6) keV; how…
▽ More
We report precision mass measurements of $^{133}$Sb, $^{133g,m}$Te, and $^{133g,m}$I, produced at CARIBU at Argonne National Laboratory's ATLAS facility and measured using the Canadian Penning Trap mass spectrometer. These masses clarify an anomaly in the $^{133}$Te $β$-decay. The masses reported in the 2020 Atomic Mass Evaluation (M. Wang et al., 2021) produce $Q_{β^-}(^{133}$Te)=2920(6) keV; however, the highest-lying $^{133}$I level populated in this decay is observed at $E_i=2935.83(15)$ keV, resulting in an anomalous $Q_{β^{-}}^{i}=-16(6)$~keV. Our new measurements give $Q_{β^-}(^{133}\text{Te})=2934.8(11)$ keV, a factor of five more precise, yielding $Q{_β^i}=-1.0(12)$~keV, a 3$σ$ shift from the previous results. This resolves the anomaly and indicates the possibility of an ultralow $Q$-value $β$ decay in this system.
△ Less
Submitted 14 May, 2024; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Monitoring Sustainable Global Development Along Shared Socioeconomic Pathways
Authors:
Michelle W. L. Wan,
Jeffrey N. Clark,
Edward A. Small,
Elena Fillola Mayoral,
Raúl Santos-Rodríguez
Abstract:
Sustainable global development is one of the most prevalent challenges facing the world today, hinging on the equilibrium between socioeconomic growth and environmental sustainability. We propose approaches to monitor and quantify sustainable development along the Shared Socioeconomic Pathways (SSPs), including mathematically derived scoring algorithms, and machine learning methods. These integrat…
▽ More
Sustainable global development is one of the most prevalent challenges facing the world today, hinging on the equilibrium between socioeconomic growth and environmental sustainability. We propose approaches to monitor and quantify sustainable development along the Shared Socioeconomic Pathways (SSPs), including mathematically derived scoring algorithms, and machine learning methods. These integrate socioeconomic and environmental datasets, to produce an interpretable metric for SSP alignment. An initial study demonstrates promising results, laying the groundwork for the application of different methods to the monitoring of sustainable global development.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Characterizing the efficacy of methods to subtract terrestrial transient noise near gravitational wave events and the effects on parameter estimation
Authors:
Sudarshan Ghonge,
Joshua Brandt,
J. M. Sullivan,
Margaret Millhouse,
Katerina Chatziioannou,
James A. Clark,
Tyson Littenberg,
Neil Cornish,
Sophie Hourihane,
Laura Cadonati
Abstract:
We investigate the impact of transient noise artifacts, or {\it glitches}, on gravitational wave inference, and the efficacy of data cleaning procedures in recovering unbiased source properties. Due to their time-frequency morphology, broadband glitches demonstrate moderate to significant biasing of posterior distributions away from true values. In contrast, narrowband glitches have negligible bia…
▽ More
We investigate the impact of transient noise artifacts, or {\it glitches}, on gravitational wave inference, and the efficacy of data cleaning procedures in recovering unbiased source properties. Due to their time-frequency morphology, broadband glitches demonstrate moderate to significant biasing of posterior distributions away from true values. In contrast, narrowband glitches have negligible biasing effects owing to distinct signal and glitch morphologies. We inject simulated binary black hole signals into data containing three common glitch types from past LIGO-Virgo observing runs, and reconstruct both signal and glitch waveforms using {\tt BayesWave}, a wavelet-based Bayesian analysis. We apply the standard LIGO-Virgo-KAGRA deglitching procedure to the detector data - we subtract the glitch waveform estimated by the joint {\tt BayesWave} inference before performing parameter estimation with detailed compact binary waveform models. We find that this deglitching effectively mitigates bias from broadband glitches, with posterior peaks aligning with true values post deglitching. This provides a baseline validation of existing techniques, while demonstrating waveform reconstruction improvements to the Bayesian algorithm for robust astrophysical characterization in glitch-prone detector data.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Anisotropic positive linear and sub-linear magnetoresistivity in the cubic type-II Dirac metal Pd$_3$In$_7$
Authors:
Aikaterini Flessa Savvidou,
Andrzej Ptok,
G. Sharma,
Brian Casas,
Judith K. Clark,
Victoria M. Li,
Michael Shatruk,
Sumanta Tewari,
Luis Balicas
Abstract:
We report a transport study on Pd$_3$In$_7$ which displays multiple Dirac type-II nodes in its electronic dispersion. Pd$_3$In$_7$ is characterized by low residual resistivities and high mobilities, which are consistent with Dirac-like quasiparticles. For an applied magnetic field $(μ_{\text{0}} H)$ having a non-zero component along the electrical current, we find a large, positive, and linear in…
▽ More
We report a transport study on Pd$_3$In$_7$ which displays multiple Dirac type-II nodes in its electronic dispersion. Pd$_3$In$_7$ is characterized by low residual resistivities and high mobilities, which are consistent with Dirac-like quasiparticles. For an applied magnetic field $(μ_{\text{0}} H)$ having a non-zero component along the electrical current, we find a large, positive, and linear in $μ_{\text{0}} H$ longitudinal magnetoresistivity (LMR). The sign of the LMR and its linear dependence deviate from the behavior reported for the chiral-anomaly-driven LMR in Weyl semimetals. Interestingly, such anomalous LMR is consistent with predictions for the role of the anomaly in type-II Weyl semimetals. In contrast, the transverse or conventional magnetoresistivity (CMR for electric fields $\textbf{E} \bot μ_{\text{0}} \textbf{H}$) is large and positive, increasing by $10^3-10^4$ \% as a function of $μ_{\text{0}}H$ while following an anomalous, angle-dependent power law $ρ_{\text{xx}}\propto (μ_{\text{0}}H)^n$ with $n(θ) \leq 1$. The order of magnitude of the CMR, and its anomalous power-law, is explained in terms of uncompensated electron and hole-like Fermi surfaces characterized by anisotropic carrier scattering likely due to the lack of Lorentz invariance.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
The Beta-decay Paul Trap Mk IV: Design and commissioning
Authors:
L. Varriano,
G. Savard,
J. A. Clark,
D. P. Burdette,
M. T. Burkey,
A. T. Gallant,
T. Y. Hirsh,
B. Longfellow,
N. D. Scielzo,
R. Segel,
E. J. Boron III,
M. Brodeur,
N. Callahan,
A. Cannon,
K. Kolos,
B. Liu,
S. Lopez-Caceres,
M. Gott,
B. Maaß,
S. T. Marley,
C. Mohs,
G. E. Morgan,
P. Mueller,
M. Oberling,
P. D. O'Malley
, et al. (7 additional authors not shown)
Abstract:
The Beta-decay Paul Trap is an open-geometry, linear trap used to measure the decays of $^8$Li and $^8$B to search for a tensor contribution to the weak interaction. In the latest $^8$Li measurement of Burkey et al. (2022), $β$ scattering was the dominant experimental systematic uncertainty. The Beta-decay Paul Trap Mk IV reduces the prevalence of $β$ scattering by a factor of 4 through a redesign…
▽ More
The Beta-decay Paul Trap is an open-geometry, linear trap used to measure the decays of $^8$Li and $^8$B to search for a tensor contribution to the weak interaction. In the latest $^8$Li measurement of Burkey et al. (2022), $β$ scattering was the dominant experimental systematic uncertainty. The Beta-decay Paul Trap Mk IV reduces the prevalence of $β$ scattering by a factor of 4 through a redesigned electrode geometry and the use of glassy carbon and graphite as electrode materials. The trap has been constructed and successfully commissioned with $^8$Li in a new data campaign that collected 2.6 million triple coincidence events, an increase in statistics by 30% with 4 times less $β$ scattering compared to the previous $^8$Li data set.
△ Less
Submitted 30 October, 2023;
originally announced November 2023.
-
Exploring Behavior Discovery Methods for Heterogeneous Swarms of Limited-Capability Robots
Authors:
Connor Mattson,
Jeremy C. Clark,
Daniel S. Brown
Abstract:
We study the problem of determining the emergent behaviors that are possible given a functionally heterogeneous swarm of robots with limited capabilities. Prior work has considered behavior search for homogeneous swarms and proposed the use of novelty search over either a hand-specified or learned behavior space followed by clustering to return a taxonomy of emergent behaviors to the user. In this…
▽ More
We study the problem of determining the emergent behaviors that are possible given a functionally heterogeneous swarm of robots with limited capabilities. Prior work has considered behavior search for homogeneous swarms and proposed the use of novelty search over either a hand-specified or learned behavior space followed by clustering to return a taxonomy of emergent behaviors to the user. In this paper, we seek to better understand the role of novelty search and the efficacy of using clustering to discover novel emergent behaviors. Through a large set of experiments and ablations, we analyze the effect of representations, evolutionary search, and various clustering methods in the search for novel behaviors in a heterogeneous swarm. Our results indicate that prior methods fail to discover many interesting behaviors and that an iterative human-in-the-loop discovery process discovers more behaviors than random search, swarm chemistry, and automated behavior discovery. The combined discoveries of our experiments uncover 23 emergent behaviors, 18 of which are novel discoveries. To the best of our knowledge, these are the first known emergent behaviors for heterogeneous swarms of computation-free agents. Videos, code, and appendix are available at the project website: https://sites.google.com/view/heterogeneous-bd-methods
△ Less
Submitted 25 October, 2023;
originally announced October 2023.
-
Target Variable Engineering
Authors:
Jessica Clark
Abstract:
How does the formulation of a target variable affect performance within the ML pipeline? The experiments in this study examine numeric targets that have been binarized by comparing against a threshold. We compare the predictive performance of regression models trained to predict the numeric targets vs. classifiers trained to predict their binarized counterparts. Specifically, we make this comparis…
▽ More
How does the formulation of a target variable affect performance within the ML pipeline? The experiments in this study examine numeric targets that have been binarized by comparing against a threshold. We compare the predictive performance of regression models trained to predict the numeric targets vs. classifiers trained to predict their binarized counterparts. Specifically, we make this comparison at every point of a randomized hyperparameter optimization search to understand the effect of computational resource budget on the tradeoff between the two. We find that regression requires significantly more computational effort to converge upon the optimal performance, and is more sensitive to both randomness and heuristic choices in the training process. Although classification can and does benefit from systematic hyperparameter tuning and model selection, the improvements are much less than for regression. This work comprises the first systematic comparison of regression and classification within the framework of computational resource requirements. Our findings contribute to calls for greater replicability and efficiency within the ML pipeline for the sake of building more sustainable and robust AI systems.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Exploring the interstellar medium of NGC 891 at millimeter wavelengths using the NIKA2 camera
Authors:
S. Katsioli,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
E. Artis,
H. Aussel,
M. Baes,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
C. J. R. Clark,
I. De Looze,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
G. Ejlali,
M. Galametz,
F. Galliano,
A. Gomez
, et al. (39 additional authors not shown)
Abstract:
In the framework of the IMEGIN Large Program, we used the NIKA2 camera on the IRAM 30-m telescope to observe the edge-on galaxy NGC 891 at 1.15 mm and 2 mm and at a FWHM of 11.1" and 17.6", respectively. Multiwavelength data enriched with the new NIKA2 observations fitted by the HerBIE SED code (coupled with the THEMIS dust model) were used to constrain the physical properties of the ISM. Emission…
▽ More
In the framework of the IMEGIN Large Program, we used the NIKA2 camera on the IRAM 30-m telescope to observe the edge-on galaxy NGC 891 at 1.15 mm and 2 mm and at a FWHM of 11.1" and 17.6", respectively. Multiwavelength data enriched with the new NIKA2 observations fitted by the HerBIE SED code (coupled with the THEMIS dust model) were used to constrain the physical properties of the ISM. Emission originating from the diffuse dust disk is detected at all wavelengths from mid-IR to mm, while mid-IR observations reveal warm dust emission from compact HII regions. Indications of mm excess emission have also been found in the outer parts of the galactic disk. Furthermore, our SED fitting analysis constrained the mass fraction of the small (< 15 Angstrom) dust grains. We found that small grains constitute 9.5% of the total dust mass in the galactic plane, but this fraction increases up to ~ 20% at large distances (|z| > 3 kpc) from the galactic plane.
△ Less
Submitted 6 October, 2023;
originally announced October 2023.
-
Artificial Intelligence Index Report 2023
Authors:
Nestor Maslej,
Loredana Fattorini,
Erik Brynjolfsson,
John Etchemendy,
Katrina Ligett,
Terah Lyons,
James Manyika,
Helen Ngo,
Juan Carlos Niebles,
Vanessa Parli,
Yoav Shoham,
Russell Wald,
Jack Clark,
Raymond Perrault
Abstract:
Welcome to the sixth edition of the AI Index Report. This year, the report introduces more original data than any previous edition, including a new chapter on AI public opinion, a more thorough technical performance chapter, original analysis about large language and multimodal models, detailed trends in global AI legislation records, a study of the environmental impact of AI systems, and more. Th…
▽ More
Welcome to the sixth edition of the AI Index Report. This year, the report introduces more original data than any previous edition, including a new chapter on AI public opinion, a more thorough technical performance chapter, original analysis about large language and multimodal models, detailed trends in global AI legislation records, a study of the environmental impact of AI systems, and more. The AI Index Report tracks, collates, distills, and visualizes data related to artificial intelligence. Our mission is to provide unbiased, rigorously vetted, broadly sourced data in order for policymakers, researchers, executives, journalists, and the general public to develop a more thorough and nuanced understanding of the complex field of AI. The report aims to be the world's most credible and authoritative source for data and insights about AI.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
TraCE: Trajectory Counterfactual Explanation Scores
Authors:
Jeffrey N. Clark,
Edward A. Small,
Nawid Keshtmand,
Michelle W. L. Wan,
Elena Fillola Mayoral,
Enrico Werner,
Christopher P. Bourdeaux,
Raul Santos-Rodriguez
Abstract:
Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterf…
▽ More
Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterfactual Explanation) scores, which is able to distill and condense progress in highly complex scenarios into a single value. We demonstrate TraCE's utility across domains by showcasing its main properties in two case studies spanning healthcare and climate change.
△ Less
Submitted 26 January, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
The stratification of ISM properties in the edge-on galaxy NGC 891 revealed by NIKA2
Authors:
S. Katsioli,
E. M. Xilouris,
C. Kramer,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
E. Artis,
H. Aussel,
M. Baes,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
C. J. R. Clark,
I. De Looze,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
G. Ejlali,
M. Galametz
, et al. (38 additional authors not shown)
Abstract:
As the millimeter wavelength range remains a largely unexplored spectral region for galaxies, the IMEGIN large program aims to map the millimeter continuum emission of 22 nearby galaxies at 1.15 and 2 mm. Using the high-resolution maps produced by the NIKA2 camera, we explore the existence of very cold dust and take possible contamination by free-free and synchrotron emission into account. We stud…
▽ More
As the millimeter wavelength range remains a largely unexplored spectral region for galaxies, the IMEGIN large program aims to map the millimeter continuum emission of 22 nearby galaxies at 1.15 and 2 mm. Using the high-resolution maps produced by the NIKA2 camera, we explore the existence of very cold dust and take possible contamination by free-free and synchrotron emission into account. We study the IR-to-radio emission coming from different regions along the galactic plane and at large vertical distances. New observations of NGC 891, using the NIKA2 camera on the IRAM 30m telescope, along with a suite of observations at other wavelengths were used to perform a multiwavelength study of the spectral energy distribution in the interstellar medium in this galaxy. This analysis was performed globally and locally, using the advanced hierarchical Bayesian fitting code, HerBIE, coupled with the THEMIS dust model. Our dust modeling is able to reproduce the near-IR to millimeter emission of NGC 891, with the exception of an excess at a level of 25% obtained by the NIKA2 observations in the outermost parts of the disk. The radio continuum and thermal dust emission are distributed differently in the disk and galaxy halo. Different dusty environments are also revealed by a multiwavelength investigation of the emission features. Our detailed decomposition at millimeter and centimeter wavelengths shows that emission at 1 mm is purely originated by dust. Radio components become progressively important with increasing wavelengths. Finally, we find that emission arising from small dust grains accounts for ~ 9.5% of the total dust mass, reaching up to 20% at large galactic latitudes. Shock waves in the outflows that shatter the dust grains might explain this higher fraction of small grains in the halo.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
AIDPS:Adaptive Intrusion Detection and Prevention System for Underwater Acoustic Sensor Networks
Authors:
Soumadeep Das,
Aryan Mohammadi Pasikhani,
Prosanta Gope,
John A. Clark,
Chintan Patel,
Biplab Sikdar
Abstract:
Underwater Acoustic Sensor Networks (UW-ASNs) are predominantly used for underwater environments and find applications in many areas. However, a lack of security considerations, the unstable and challenging nature of the underwater environment, and the resource-constrained nature of the sensor nodes used for UW-ASNs (which makes them incapable of adopting security primitives) make the UW-ASN prone…
▽ More
Underwater Acoustic Sensor Networks (UW-ASNs) are predominantly used for underwater environments and find applications in many areas. However, a lack of security considerations, the unstable and challenging nature of the underwater environment, and the resource-constrained nature of the sensor nodes used for UW-ASNs (which makes them incapable of adopting security primitives) make the UW-ASN prone to vulnerabilities. This paper proposes an Adaptive decentralised Intrusion Detection and Prevention System called AIDPS for UW-ASNs. The proposed AIDPS can improve the security of the UW-ASNs so that they can efficiently detect underwater-related attacks (e.g., blackhole, grayhole and flooding attacks). To determine the most effective configuration of the proposed construction, we conduct a number of experiments using several state-of-the-art machine learning algorithms (e.g., Adaptive Random Forest (ARF), light gradient-boosting machine, and K-nearest neighbours) and concept drift detection algorithms (e.g., ADWIN, kdqTree, and Page-Hinkley). Our experimental results show that incremental ARF using ADWIN provides optimal performance when implemented with One-class support vector machine (SVM) anomaly-based detectors. Furthermore, our extensive evaluation results also show that the proposed scheme outperforms state-of-the-art bench-marking methods while providing a wider range of desirable features such as scalability and complexity.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Authors:
Xinyi Wang,
John Wieting,
Jonathan H. Clark
Abstract:
Learning paradigms for large language models (LLMs) currently tend to fall within either in-context learning (ICL) or full fine-tuning. Each of these comes with their own trade-offs based on available data, model size, compute cost, ease-of-use, and final quality with neither solution performing well across-the-board. In this article, we first describe ICL and fine-tuning paradigms in a way that h…
▽ More
Learning paradigms for large language models (LLMs) currently tend to fall within either in-context learning (ICL) or full fine-tuning. Each of these comes with their own trade-offs based on available data, model size, compute cost, ease-of-use, and final quality with neither solution performing well across-the-board. In this article, we first describe ICL and fine-tuning paradigms in a way that highlights their natural connections. Based on these connections, we propose a new learning paradigm called FIAT that fuses the best of these paradigms together, enabling prompt-engineered instructions and chain-of-thought reasoning with the very largest models while also using similar methods to perform parameter updates on a modestly-sized LLM with parameter-efficient tuning. We evaluate FIAT's effectiveness on a variety of multilingual tasks and observe that FIAT performs better than both ICL and fine-tuning at scales ranging from 100-10,000 training examples. We hope that FIAT provides a practical way of harnessing the full potential of LLMs without needing to make a hard choice between learning paradigms.
△ Less
Submitted 12 September, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.