-
Frameworking for a Community-led Feminist Ethics
Authors:
Ana O Henriques,
Hugo Nicolau,
Kyle Montague
Abstract:
This paper introduces a relational perspective on ethics within the context of Feminist Digital Civics and community-led design. Ethics work in HCI has primarily focused on prescriptive machine ethics and bioethics principles rather than people. In response, we advocate for a community-led, processual approach to ethics, acknowledging power dynamics and local contexts. We thus propose a multidimen…
▽ More
This paper introduces a relational perspective on ethics within the context of Feminist Digital Civics and community-led design. Ethics work in HCI has primarily focused on prescriptive machine ethics and bioethics principles rather than people. In response, we advocate for a community-led, processual approach to ethics, acknowledging power dynamics and local contexts. We thus propose a multidimensional adaptive model for ethics in HCI design, integrating an intersectional feminist ethical lens. This framework embraces feminist epistemologies, methods, and methodologies, fostering a reflexive practice. By weaving together situated knowledges, standpoint theory, intersectionality, participatory methods, and care ethics, our approach offers a holistic foundation for ethics in HCI, aiming to advance community-led practices and enrich the discourse surrounding ethics within this field.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
VLST: Virtual Lung Screening Trial for Lung Cancer Detection Using Virtual Imaging Trial
Authors:
Fakrul Islam Tushar,
Liesbeth Vancoillie,
Cindy McCabe,
Amareswararao Kavuri,
Lavsen Dahal,
Brian Harrawood,
Milo Fryling,
Mojtaba Zarei,
Saman Sotoudeh-Paima,
Fong Chi Ho,
Dhrubajyoti Ghosh,
Sheng Luo,
W. Paul Segars,
Ehsan Abadi,
Kyle J. Lafata,
Ehsan Samei,
Joseph Y. Lo
Abstract:
Importance: The efficacy of lung cancer screening can be significantly impacted by the imaging modality used. This Virtual Lung Screening Trial (VLST) addresses the critical need for precision in lung cancer diagnostics and the potential for reducing unnecessary radiation exposure in clinical settings.
Objectives: To establish a virtual imaging trial (VIT) platform that accurately simulates real…
▽ More
Importance: The efficacy of lung cancer screening can be significantly impacted by the imaging modality used. This Virtual Lung Screening Trial (VLST) addresses the critical need for precision in lung cancer diagnostics and the potential for reducing unnecessary radiation exposure in clinical settings.
Objectives: To establish a virtual imaging trial (VIT) platform that accurately simulates real-world lung screening trials (LSTs) to assess the diagnostic accuracy of CT and CXR modalities.
Design, Setting, and Participants: Utilizing computational models and machine learning algorithms, we created a diverse virtual patient population. The cohort, designed to mirror real-world demographics, was assessed using virtual imaging techniques that reflect historical imaging technologies.
Main Outcomes and Measures: The primary outcome was the difference in the Area Under the Curve (AUC) for CT and CXR modalities across lesion types and sizes.
Results: The study analyzed 298 CT and 313 CXR simulated images from 313 virtual patients, with a lesion-level AUC of 0.81 (95% CI: 0.78-0.84) for CT and 0.55 (95% CI: 0.53-0.56) for CXR. At the patient level, CT demonstrated an AUC of 0.85 (95% CI: 0.80-0.89), compared to 0.53 (95% CI: 0.47-0.60) for CXR. Subgroup analyses indicated CT's superior performance in detecting homogeneous lesions (AUC of 0.97 for lesion-level) and heterogeneous lesions (AUC of 0.71 for lesion-level) as well as in identifying larger nodules (AUC of 0.98 for nodules > 8 mm).
Conclusion and Relevance: The VIT platform validated the superior diagnostic accuracy of CT over CXR, especially for smaller nodules, underscoring its potential to replicate real clinical imaging trials. These findings advocate for the integration of virtual trials in the evaluation and improvement of imaging-based diagnostic tools.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Complexity and algorithms for Arc-Kayles and Non-Disconnecting Arc-Kayles
Authors:
Kyle Burke,
Antoine Dailly,
Nacim Oijid
Abstract:
Arc-Kayles is a game where two players alternate removing two adjacent vertices until no move is left. Introduced in 1978, its computational complexity is still open. More recently, subtraction games, where the players cannot disconnect the graph while removing vertices, were introduced. In particular, Arc-Kayles admits a non-disconnecting variant that is a subtraction game. We study the computati…
▽ More
Arc-Kayles is a game where two players alternate removing two adjacent vertices until no move is left. Introduced in 1978, its computational complexity is still open. More recently, subtraction games, where the players cannot disconnect the graph while removing vertices, were introduced. In particular, Arc-Kayles admits a non-disconnecting variant that is a subtraction game. We study the computational complexity of subtraction games on graphs, proving that they are PSPACE-complete even on very structured graph classes (split, bipartite of any even girth). We prove that Non-Disconnecting Arc-Kayles can be solved in polynomial-time on unicyclic graphs, clique trees, and subclasses of threshold graphs. We also show that a sufficient condition for a second player-win on Arc-Kayles is equivalent to the graph isomorphism problem.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
Authors:
Kyle Hsu,
Jubayer Ibn Hamid,
Kaylee Burns,
Chelsea Finn,
Jiajun Wu
Abstract:
Inductive biases are crucial in disentangled representation learning for narrowing down an underspecified solution set. In this work, we consider endowing a neural network autoencoder with three select inductive biases from the literature: data compression into a grid-like latent space via quantization, collective independence amongst latents, and minimal functional influence of any latent on how…
▽ More
Inductive biases are crucial in disentangled representation learning for narrowing down an underspecified solution set. In this work, we consider endowing a neural network autoencoder with three select inductive biases from the literature: data compression into a grid-like latent space via quantization, collective independence amongst latents, and minimal functional influence of any latent on how other latents determine data generation. In principle, these inductive biases are deeply complementary: they most directly specify properties of the latent space, encoder, and decoder, respectively. In practice, however, naively combining existing techniques instantiating these inductive biases fails to yield significant benefits. To address this, we propose adaptations to the three techniques that simplify the learning problem, equip key regularization terms with stabilizing invariances, and quash degenerate incentives. The resulting model, Tripod, achieves state-of-the-art results on a suite of four image disentanglement benchmarks. We also verify that Tripod significantly improves upon its naive incarnation and that all three of its "legs" are necessary for best performance.
△ Less
Submitted 24 May, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion
Authors:
Kyle Shih-Huang Lo,
Jörg Peters,
Eric Spellman
Abstract:
Accurate completion and denoising of roof height maps are crucial to reconstructing high-quality 3D buildings. Repairing sparse points can enhance low-cost sensor use and reduce UAV flight overlap. RoofDiffusion is a new end-to-end self-supervised diffusion technique for robustly completing, in particular difficult, roof height maps. RoofDiffusion leverages widely-available curated footprints and…
▽ More
Accurate completion and denoising of roof height maps are crucial to reconstructing high-quality 3D buildings. Repairing sparse points can enhance low-cost sensor use and reduce UAV flight overlap. RoofDiffusion is a new end-to-end self-supervised diffusion technique for robustly completing, in particular difficult, roof height maps. RoofDiffusion leverages widely-available curated footprints and can so handle up to 99\% point sparsity and 80\% roof area occlusion (regional incompleteness). A variant, No-FP RoofDiffusion, simultaneously predicts building footprints and heights. Both quantitatively outperform state-of-the-art unguided depth completion and representative inpainting methods for Digital Elevation Models (DEM), on both a roof-specific benchmark and the BuildingNet dataset. Qualitative assessments show the effectiveness of RoofDiffusion for datasets with real-world scans including AHN3, Dales3D, and USGS 3DEP LiDAR. Tested with the leading City3D algorithm, preprocessing height maps with RoofDiffusion noticeably improves 3D building reconstruction. RoofDiffusion is complemented by a new dataset of 13k complex roof geometries, focusing on long-tail issues in remote sensing; a novel simulation of tree occlusion; and a wide variety of large-area roof cut-outs for data augmentation and benchmarking.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
Do Large Language Models Learn Human-Like Strategic Preferences?
Authors:
Jesse Roberts,
Kyle Moore,
Doug Fisher
Abstract:
We evaluate whether LLMs learn to make human-like preference judgements in strategic scenarios as compared with known empirical results. We show that Solar and Mistral exhibit stable value-based preference consistent with human in the prisoner's dilemma, including stake-size effect, and traveler's dilemma, including penalty-size effect. We establish a relationship between model size, value based p…
▽ More
We evaluate whether LLMs learn to make human-like preference judgements in strategic scenarios as compared with known empirical results. We show that Solar and Mistral exhibit stable value-based preference consistent with human in the prisoner's dilemma, including stake-size effect, and traveler's dilemma, including penalty-size effect. We establish a relationship between model size, value based preference, and superficiality. Finally, we find that models that tend to be less brittle were trained with sliding window attention. Additionally, we contribute a novel method for constructing preference relations from arbitrary LLMs and support for a hypothesis regarding human behavior in the traveler's dilemma.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
FORGE'd in FIRE III: The IMF in Quasar Accretion Disks from STARFORGE
Authors:
Philip F. Hopkins,
Michael Y. Grudic,
Kyle Kremer,
Stella S. R. Offner,
David Guszejnov,
Anna L. Rosen
Abstract:
Recently, we demonstrated self-consistent formation of strongly-magnetized quasar accretion disks (QADs) from cosmological radiation-magnetohydrodynamic-thermochemical galaxy-star formation simulations, including the full STARFORGE physics shown previously to produce a reasonable IMF under typical ISM conditions. Here we study star formation and the stellar IMF in QADs, on scales from 100 au to 10…
▽ More
Recently, we demonstrated self-consistent formation of strongly-magnetized quasar accretion disks (QADs) from cosmological radiation-magnetohydrodynamic-thermochemical galaxy-star formation simulations, including the full STARFORGE physics shown previously to produce a reasonable IMF under typical ISM conditions. Here we study star formation and the stellar IMF in QADs, on scales from 100 au to 10 pc from the SMBH. We show it is critical to include physics often previously neglected, including magnetic fields, radiation, and (proto)stellar feedback. Closer to the SMBH, star formation is suppressed, but the (rare) stars that do form exhibit top-heavy IMFs. Stars can form only in special locations (e.g. magnetic field switches) in the outer QAD. Protostars accrete their natal cores rapidly but then dynamically decouple from the gas and 'wander,' ceasing accretion on timescales ~100 yr. Their jets control initial core accretion, but the ejecta are 'swept up' into the larger-scale QAD flow without much dynamical effect. The strong tidal environment strongly suppresses common-core multiplicity. The IMF shape depends sensitively on un-resolved dynamics of protostellar disks (PSDs), as the global dynamical times can become incredibly short (< yr) and tidal fields are incredibly strong, so whether PSDs can efficiently transport angular momentum or fragment catastrophically at <10 au scales requires novel PSD simulations to properly address. Most analytic IMF models and analogies with planet formation in PSDs fail qualitatively to explain the simulation IMFs, though we discuss a couple of viable models.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
The solar dynamo begins near the surface
Authors:
Geoffrey M Vasil,
Daniel Lecoanet,
Kyle Augustson,
Keaton J Burns,
Jeffrey S Oishi,
Benjamin P Brown,
Nicholas Brummell,
Keith Julien
Abstract:
The Sun's magnetic dynamo cycle features a distinct pattern: a propagating region of sunspot emergence appears around 30 degrees latitude and vanishes near the equator every 11 years. Moreover, longitudinal flows called "torsional oscillations" closely shadow sunspot migration, undoubtedly sharing a common cause. Contrary to theories suggesting deep origins for these phenomena, helioseismology pin…
▽ More
The Sun's magnetic dynamo cycle features a distinct pattern: a propagating region of sunspot emergence appears around 30 degrees latitude and vanishes near the equator every 11 years. Moreover, longitudinal flows called "torsional oscillations" closely shadow sunspot migration, undoubtedly sharing a common cause. Contrary to theories suggesting deep origins for these phenomena, helioseismology pinpoints low-latitude torsional oscillations to the Sun's outer 5-10%, the "Near-Surface Shear Layer". Within this zone, inwardly increasing differential rotation coupled with a poloidal magnetic field strongly implicates the Magneto-Rotational Instability prominent in accretion-disk theory and observed in laboratory experiments. Together, these two facts prompt the general question: Is it possible that the solar dynamo is a near-surface instability? Here, we report strong affirmative evidence in stark contrast to traditional paradigms focusing on the deeper tachocline. Simple analytic estimates show that the near-surface magneto-rotational instability better explains the spatiotemporal scales of the torsional oscillations and inferred subsurface magnetic field amplitudes. State-of-the-art numerical simulations corroborate these estimates and, strikingly, reproduce hemispherical magnetic current helicity laws. The dynamo resulting from a well-understood near-surface phenomenon improves prospects for accurate predictions of full magnetic cycles and space weather, impacting Earth's electromagnetic infrastructure.
△ Less
Submitted 13 April, 2024; v1 submitted 11 April, 2024;
originally announced April 2024.
-
Modification of Jet Velocities in an Explosively Loaded Copper Target with a Conical Defect
Authors:
Michael P. Hennessey,
Finnegan Wilson,
Grace I. Rabinowitz,
Max J. Sevcik,
Kadyn J. Tucker,
Dylan J. Kline,
David K. Amondson,
H. Keo Springer,
Kyle T. Sullivan,
Veronica Eliasson,
Jonathan L. Belof
Abstract:
In this work, the design and execution of an experiment with the goal of demonstrating control over the evolution of a copper jet is described. Simulations show that when using simple multi-material buffers placed between a copper target with a conical defect and a cylinder of high-explosive, a variety of jetting behaviors occur based on material placement, including both jet velocity augmentation…
▽ More
In this work, the design and execution of an experiment with the goal of demonstrating control over the evolution of a copper jet is described. Simulations show that when using simple multi-material buffers placed between a copper target with a conical defect and a cylinder of high-explosive, a variety of jetting behaviors occur based on material placement, including both jet velocity augmentation and mitigation. A parameter sweep was performed to determine optimal buffer designs in two configurations. Experiments using the optimal buffer designs verified the effectiveness of the buffer and validated the modeling.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Multi Digit Ising Map** for Low Precision Ising Solvers
Authors:
Abhishek Kumar Singh,
Kyle Jamieson
Abstract:
The last couple of years have seen an ever-increasing interest in using different Ising solvers, like Quantum annealers, Coherent Ising machines, and Oscillator-based Ising machines, for solving tough computational problems in various domains. Although the simulations predict massive performance improvements for several tough computational problems, the real implementations of the Ising solvers te…
▽ More
The last couple of years have seen an ever-increasing interest in using different Ising solvers, like Quantum annealers, Coherent Ising machines, and Oscillator-based Ising machines, for solving tough computational problems in various domains. Although the simulations predict massive performance improvements for several tough computational problems, the real implementations of the Ising solvers tend to have limited precision, which can cause significant performance deterioration. This paper presents a novel methodology for map** the problem on the Ising solvers to artificially increase the effective precision. We further evaluate our method for the Multiple-Input-Multiple-Output signal detection problem.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Exact and Approximate Solutions for Magnetohydrodynamic Flow Control in Hele-Shaw Cells
Authors:
Kyle McKee
Abstract:
Consider the motion of a thin layer of electrically conducting fluid, between two closely spaced parallel plates, in a classical Hele-Shaw geometry. Furthermore, let the system be immersed in a uniform external magnetic field (normal to the plates) and let electrical current be driven between conducting probes immersed in the fluid layer. In the present paper, we analyse the ensuing fluid flow at…
▽ More
Consider the motion of a thin layer of electrically conducting fluid, between two closely spaced parallel plates, in a classical Hele-Shaw geometry. Furthermore, let the system be immersed in a uniform external magnetic field (normal to the plates) and let electrical current be driven between conducting probes immersed in the fluid layer. In the present paper, we analyse the ensuing fluid flow at low Hartmann numbers. We first elucidate the mechanism of flow generation both physically and mathematically. We proceed by presenting mathematical solutions for a class of canonical multiply-connected geometries, in terms of the prime function developed by Crowdy (2020). Notably, those solutions can be written explicitly as series, and are thus exact, in doubly-connected geometries. Note that in higher connectivities, the prime function must be evaluated numerically. We then demonstrate how recently developed fast numerical methods may be applied to accurately determine the flow-field in arbitrary geometries when exact solutions are inaccessible.
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenoty**
Authors:
Javier Rodriguez-Sanchez,
Kyle Johnsen,
Changying Li
Abstract:
Traditional field phenoty** methods are often manual, time-consuming, and destructive, posing a challenge for breeding progress. To address this bottleneck, robotics and automation technologies offer efficient sensing tools to monitor field evolution and crop development throughout the season. This study aimed to develop an autonomous ground robotic system for LiDAR-based field phenoty** in pl…
▽ More
Traditional field phenoty** methods are often manual, time-consuming, and destructive, posing a challenge for breeding progress. To address this bottleneck, robotics and automation technologies offer efficient sensing tools to monitor field evolution and crop development throughout the season. This study aimed to develop an autonomous ground robotic system for LiDAR-based field phenoty** in plant breeding trials. A Husky platform was equipped with a high-resolution three-dimensional (3D) laser scanner to collect in-field terrestrial laser scanning (TLS) data without human intervention. To automate the TLS process, a 3D ray casting analysis was implemented for optimal TLS site planning, and a route optimization algorithm was utilized to minimize travel distance during data collection. The platform was deployed in two cotton breeding fields for evaluation, where it autonomously collected TLS data. The system provided accurate pose information through RTK-GNSS positioning and sensor fusion techniques, with average errors of less than 0.6 cm for location and 0.38$^{\circ}$ for heading. The achieved localization accuracy allowed point cloud registration with mean point errors of approximately 2 cm, comparable to traditional TLS methods that rely on artificial targets and manual sensor deployment. This work presents an autonomous phenoty** platform that facilitates the quantitative assessment of plant traits under field conditions of both large agricultural fields and small breeding trials to contribute to the advancement of plant phenomics and breeding programs.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering
Authors:
Nirmalie Wiratunga,
Ramitha Abeyratne,
Lasal Jayawardena,
Kyle Martin,
Stewart Massie,
Ikechukwu Nkisi-Orji,
Ruvan Weerasinghe,
Anne Liret,
Bruno Fleisch
Abstract:
Retrieval-Augmented Generation (RAG) enhances Large Language Model (LLM) output by providing prior knowledge as context to input. This is beneficial for knowledge-intensive and expert reliant tasks, including legal question-answering, which require evidence to validate generated text outputs. We highlight that Case-Based Reasoning (CBR) presents key opportunities to structure retrieval as part of…
▽ More
Retrieval-Augmented Generation (RAG) enhances Large Language Model (LLM) output by providing prior knowledge as context to input. This is beneficial for knowledge-intensive and expert reliant tasks, including legal question-answering, which require evidence to validate generated text outputs. We highlight that Case-Based Reasoning (CBR) presents key opportunities to structure retrieval as part of the RAG process in an LLM. We introduce CBR-RAG, where CBR cycle's initial retrieval stage, its indexing vocabulary, and similarity knowledge containers are used to enhance LLM queries with contextually relevant cases. This integration augments the original LLM query, providing a richer prompt. We present an evaluation of CBR-RAG, and examine different representations (i.e. general and domain-specific embeddings) and methods of comparison (i.e. inter, intra and hybrid similarity) on the task of legal question-answering. Our results indicate that the context provided by CBR's case reuse enforces similarity between relevant components of the questions and the evidence base leading to significant improvements in the quality of generated answers.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
An overlooked source of uncertainty in the mass of the Milky Way
Authors:
Kyle A. Oman,
Alexander H. Riley
Abstract:
In the conventional approach to decomposing a rotation curve into a set of contributions from mass model components, the measurements of the rotation curve at different radii are taken to be independent. It is clear, however, that radial correlations are present in such data, for instance (but not only) because the orbital speed depends on the mass distribution at all (or, minimally, inner) radii.…
▽ More
In the conventional approach to decomposing a rotation curve into a set of contributions from mass model components, the measurements of the rotation curve at different radii are taken to be independent. It is clear, however, that radial correlations are present in such data, for instance (but not only) because the orbital speed depends on the mass distribution at all (or, minimally, inner) radii. We adopt a very simple parametric form for a covariance matrix and constrain its parameters using Gaussian process regression. Applied to the rotation curve of the Milky Way, this suggests the presence of correlations between neighbouring rotation curve points with amplitudes $<10\,\mathrm{km}\,\mathrm{s}^{-1}$ over length scales of $1.5$-$2.5\,\mathrm{kpc}$ regardless of the assumed dark halo component. We show that accounting for such covariance can result in a $\sim 50$ per cent lower total mass estimate for the Milky Way than when it is neglected, and that the statistical uncertainty associated with the covariance is comparable to or exceeds the total systematic uncertainty budget. Our findings motivate including more detailed treatment of rotation curve covariance in future analyses.
△ Less
Submitted 20 May, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
High redshift LBGs from deep broadband imaging for future spectroscopic surveys
Authors:
Vanina Ruhlmann-Kleider,
Christophe Yèche,
Christophe Magneville,
Henri Coquinot,
Eric Armengaud,
Nathalie Palanque-Delabrouille,
Anand Raichoor,
Jessica Nicole Aguilar,
Steven Ahlen,
Stéphane Arnouts,
David Brooks,
Edmond Chaussidon,
Todd Claybaugh,
Kyle Dawson,
Axel de la Macorra,
Arjun Dey,
Biprateep Dey,
Peter Doel,
Kevin Fanning,
Simone Ferraro,
Jaime E. Forero-Romero,
Satya Gontcho A Gontcho,
Gaston Gutierrez,
Stephen Gwyn,
Klaus Honscheid
, et al. (38 additional authors not shown)
Abstract:
Lyman break galaxies (LBGs) are promising probes for clustering measurements at high redshift, $z>2$, a region only covered so far by Lyman-$α$ forest measurements. In this paper, we investigate the feasibility of selecting LBGs by exploiting the existence of a strong deficit of flux shortward of the Lyman limit, due to various absorption processes along the line of sight. The target selection rel…
▽ More
Lyman break galaxies (LBGs) are promising probes for clustering measurements at high redshift, $z>2$, a region only covered so far by Lyman-$α$ forest measurements. In this paper, we investigate the feasibility of selecting LBGs by exploiting the existence of a strong deficit of flux shortward of the Lyman limit, due to various absorption processes along the line of sight. The target selection relies on deep imaging data from the HSC and CLAUDS surveys in the $g,r,z$ and $u$ bands, respectively, with median depths reaching 27 AB in all bands. The selections were validated by several dedicated spectroscopic observation campaigns with DESI. Visual inspection of spectra has enabled us to develop an automated spectroscopic ty** and redshift estimation algorithm specific to LBGs. Based on these data and tools, we assess the efficiency and purity of target selections optimised for different purposes. Selections providing a wide redshift coverage retain $57\%$ of the observed targets after spectroscopic confirmation with DESI, and provide an efficiency for LBGs of $83\pm3\%$, for a purity of the selected LBG sample of $90\pm2\%$. This would deliver a confirmed LBG density of $\sim 620$ deg$^{-2}$ in the range $2.3<z<3.5$ for a $r$-band limiting magnitude $r<24.2$. Selections optimised for high redshift efficiency retain $73\%$ of the observed targets after spectroscopic confirmation, with $89\pm4\%$ efficiency for $97\pm2\%$ purity. This would provide a confirmed LBG density of $\sim 470$ deg$^{-2}$ in the range $2.8<z<3.5$ for a $r$-band limiting magnitude $r<24.5$. A preliminary study of the LBG sample 3d-clustering properties is also presented and used to estimate the LBG linear bias. A value of $b_{LBG} = 3.3 \pm 0.2 (stat.)$ is obtained for a mean redshift of 2.9 and a limiting magnitude in $r$ of 24.2, in agreement with results reported in the literature.
△ Less
Submitted 26 June, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
How Much Data are Enough? Investigating Dataset Requirements for Patch-Based Brain MRI Segmentation Tasks
Authors:
Dongang Wang,
Peilin Liu,
Hengrui Wang,
Heidi Beadnall,
Kain Kyle,
Linda Ly,
Mariano Cabezas,
Geng Zhan,
Ryan Sullivan,
Weidong Cai,
Wanli Ouyang,
Fernando Calamante,
Michael Barnett,
Chenyu Wang
Abstract:
Training deep neural networks reliably requires access to large-scale datasets. However, obtaining such datasets can be challenging, especially in the context of neuroimaging analysis tasks, where the cost associated with image acquisition and annotation can be prohibitive. To mitigate both the time and financial costs associated with model development, a clear understanding of the amount of data…
▽ More
Training deep neural networks reliably requires access to large-scale datasets. However, obtaining such datasets can be challenging, especially in the context of neuroimaging analysis tasks, where the cost associated with image acquisition and annotation can be prohibitive. To mitigate both the time and financial costs associated with model development, a clear understanding of the amount of data required to train a satisfactory model is crucial. This paper focuses on an early stage phase of deep learning research, prior to model development, and proposes a strategic framework for estimating the amount of annotated data required to train patch-based segmentation networks. This framework includes the establishment of performance expectations using a novel Minor Boundary Adjustment for Threshold (MinBAT) method, and standardizing patch selection through the ROI-based Expanded Patch Selection (REPS) method. Our experiments demonstrate that tasks involving regions of interest (ROIs) with different sizes or shapes may yield variably acceptable Dice Similarity Coefficient (DSC) scores. By setting an acceptable DSC as the target, the required amount of training data can be estimated and even predicted as data accumulates. This approach could assist researchers and engineers in estimating the cost associated with data collection and annotation when defining a new segmentation task based on deep neural networks, ultimately contributing to their efficient translation to real-world applications.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
Direct Experimental Constraints on the Spatial Extent of a Neutrino Wavepacket
Authors:
Joseph Smolsky,
Kyle G Leach,
Ryan Abells,
Pedro Amaro,
Adrien Andoche,
Keith Borbridge,
Connor Bray,
Robin Cantor,
David Diercks,
Spencer Fretwell,
Stephan Friedrich,
Abigail Gillespie,
Mauro Guerra,
Ad Hall,
Cameron N Harris,
Jackson T Harris,
Calvin Hinkle,
Amii Lamm,
Leendert M Hayen,
Paul-Antoine Hervieux,
Geon-Bo Kim,
Inwook Kim,
Annika Lennarz,
Vincenzo Lordi,
Jorge Machado
, et al. (13 additional authors not shown)
Abstract:
Despite their high relative abundance in our Universe, neutrinos are the least understood fundamental particles of nature. They also provide a unique system to study quantum coherence and the wavelike nature of particles in fundamental systems due to their extremely weak interaction probabilities. In fact, the quantum properties of neutrinos emitted in experimentally relevant sources are virtually…
▽ More
Despite their high relative abundance in our Universe, neutrinos are the least understood fundamental particles of nature. They also provide a unique system to study quantum coherence and the wavelike nature of particles in fundamental systems due to their extremely weak interaction probabilities. In fact, the quantum properties of neutrinos emitted in experimentally relevant sources are virtually unknown and the spatial extent of the neutrino wavepacket is only loosely constrained by reactor neutrino oscillation data with a spread of 13 orders of magnitude. Here, we present the first direct limits of this quantity through a new experimental concept to extract the energy width, $σ_{\textrm{N},E}$, of the recoil daughter nucleus emitted in the nuclear electron capture (EC) decay of $^7$Be. The final state in the EC decay process contains a recoiling $^7$Li nucleus and an electron neutrino ($ν_e$) which are entangled at their creation. The $^7$Li energy spectrum is measured to high precision by directly embedding $^7$Be radioisotopes into a high resolution superconducting tunnel junction that is operated as a cryogenic sensor. The lower limit on the spatial uncertainty of the recoil daughter was found to be $σ_{\textrm{N}, x} \geq 6.2$\,pm, which implies the final-state system is localized at a scale more than a thousand times larger than the nucleus itself. From this measurement, the first direct lower limits on the spatial extent of the neutrino wavepacket were extracted using two different theoretical methods. These results have wide-reaching implications in several areas including the nature of spatial localization at sub-atomic scales, interpretation of neutrino physics data, and the potential reach of future large-scale experiments.
△ Less
Submitted 30 April, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
Dephasing in Fluxonium Qubits from Coherent Quantum Phase Slips
Authors:
Mallika T. Randeria,
Thomas M. Hazard,
Agustin Di Paolo,
Kate Azar,
Max Hays,
Leon Ding,
Junyoung An,
Michael Gingras,
Bethany M. Niedzielski,
Hannah Stickler,
Jeffrey A. Grover,
Jonilyn L. Yoder,
Mollie E. Schwartz,
William D. Oliver,
Kyle Serniak
Abstract:
Phase slips occur across all Josephson junctions (JJs) at a rate that increases with the impedance of the junction. In superconducting qubits composed of JJ-array superinductors -- such as fluxonium -- phase slips in the array can lead to decoherence. In particular, phase-slip processes at the individual array junctions can coherently interfere, each with an Aharonov--Casher phase that depends on…
▽ More
Phase slips occur across all Josephson junctions (JJs) at a rate that increases with the impedance of the junction. In superconducting qubits composed of JJ-array superinductors -- such as fluxonium -- phase slips in the array can lead to decoherence. In particular, phase-slip processes at the individual array junctions can coherently interfere, each with an Aharonov--Casher phase that depends on the offset charges of the array islands. These coherent quantum phase slips (CQPS) perturbatively modify the qubit frequency, and therefore charge noise on the array islands will lead to dephasing. By varying the impedance of the array junctions, we design a set of fluxonium qubits in which the expected phase-slip rate within the JJ-array changes by several orders of magnitude. We characterize the coherence times of these qubits and demonstrate that the scaling of CQPS-induced dephasing rates agrees with our theoretical model. Furthermore, we perform noise spectroscopy of two qubits in regimes dominated by either CQPS or flux noise. We find the noise power spectrum associated with CQPS dephasing appears to be featureless at low frequencies and not $1/f$. Numerical simulations indicate this behavior is consistent with charge noise generated by charge-parity fluctuations within the array. Our findings broadly inform JJ-array-design tradeoffs, relevant for the numerous superconducting qubit designs employing JJ-array superinductors.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Unmasking Correlations in Nuclear Cross Sections with Graph Neural Networks
Authors:
Sin**i Mitra,
Hongjun Choi,
Shusen Liu,
Ruben Glatt,
Kyle Wendt,
Nicolas Schunck
Abstract:
In this work, we explore the use of deep learning techniques to learn the relationships between nuclear cross-sections across the chart of isotopes. As a proof of principle, we focus on the neutron-induced reactions in the fast energy regime that are the most important in nuclear science and engineering. We use variational autoencoders (VAEs) and implicit neural representations (INRs) to build a l…
▽ More
In this work, we explore the use of deep learning techniques to learn the relationships between nuclear cross-sections across the chart of isotopes. As a proof of principle, we focus on the neutron-induced reactions in the fast energy regime that are the most important in nuclear science and engineering. We use variational autoencoders (VAEs) and implicit neural representations (INRs) to build a learned feature representation space of nuclear cross sections and reduce the dimensionality of the problem. We then train graph neural networks (GNNs) on the resulting latent space to leverage the topological information encoded in the chart of isotopes and to capture the relationships between cross sections in different nuclei. We find that hypernetworks based on INRs significantly outperforms VAEs in encoding nuclear cross-sections. This superiority is attributed to INR's ability to model complex, varying frequency details, which enables lower prediction errors when combined with GNNs. We also observe that GNN optimization is much more successful when performed in the latent space, whether using INRs or VAEs. However VAEs' continuous representation also allows for direct GNN training in the original input space. We leverage these representational learning techniques and successfully predict cross sections for a 17x17 block of nuclei with high accuracy and precision. These findings suggest that both representation encoding of cross-sections and the prediction task hold significant potential in augmenting nuclear theory models, e.g., providing reliable estimates of covariances of cross sections, including cross-material covariances.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
FastqZip: An Improved Reference-Based Genome Sequence Lossy Compression Framework
Authors:
Yuanjian Liu,
Huihao Luo,
Zhijun Han,
Yao Hu,
Yehui Yang,
Kyle Chard,
Sheng Di,
Ian Foster,
Jiesheng Wu
Abstract:
Storing and archiving data produced by next-generation sequencing (NGS) is a huge burden for research institutions. Reference-based compression algorithms are effective in dealing with these data. Our work focuses on compressing FASTQ format files with an improved reference-based compression algorithm to achieve a higher compression ratio than other state-of-the-art algorithms. We propose FastqZip…
▽ More
Storing and archiving data produced by next-generation sequencing (NGS) is a huge burden for research institutions. Reference-based compression algorithms are effective in dealing with these data. Our work focuses on compressing FASTQ format files with an improved reference-based compression algorithm to achieve a higher compression ratio than other state-of-the-art algorithms. We propose FastqZip, which uses a new method map** the sequence to reference for compression, allows reads-reordering and lossy quality scores, and the BSC or ZPAQ algorithm to perform final lossless compression for a higher compression ratio and relatively fast speed. Our method ensures the sequence can be losslessly reconstructed while allowing lossless or lossy compression for the quality scores. We reordered the reads to get a higher compression ratio. We evaluate our algorithms on five datasets and show that FastqZip can outperform the SOTA algorithm Genozip by around 10% in terms of compression ratio while having an acceptable slowdown.
△ Less
Submitted 22 February, 2024;
originally announced April 2024.
-
Mass calibration of DES Year-3 clusters via SPT-3G CMB cluster lensing
Authors:
B. Ansarinejad,
S. Raghunathan,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
O. Alves,
A. J. Anderson,
F. Andrade-Oliveira,
M. Archipley,
L. Balkenhol,
K. Benabed,
A. N. Bender,
B. A. Benson,
E. Bertin,
F. Bianchini,
L. E. Bleem,
S. Bocquet,
F. R. Bouchet,
D. Brooks,
L. Bryant,
D. L. Burke,
E. Camphuis,
J. E. Carlstrom,
A. Carnero Rosell,
J. Carretero
, et al. (120 additional authors not shown)
Abstract:
We measure the stacked lensing signal in the direction of galaxy clusters in the Dark Energy Survey Year 3 (DES Y3) redMaPPer sample, using cosmic microwave background (CMB) temperature data from SPT-3G, the third-generation CMB camera on the South Pole Telescope (SPT). We estimate the lensing signal using temperature maps constructed from the initial 2 years of data from the SPT-3G 'Main' survey,…
▽ More
We measure the stacked lensing signal in the direction of galaxy clusters in the Dark Energy Survey Year 3 (DES Y3) redMaPPer sample, using cosmic microwave background (CMB) temperature data from SPT-3G, the third-generation CMB camera on the South Pole Telescope (SPT). We estimate the lensing signal using temperature maps constructed from the initial 2 years of data from the SPT-3G 'Main' survey, covering 1500 deg$^2$ of the Southern sky. We then use this signal as a proxy for the mean cluster mass of the DES sample. In this work, we employ three versions of the redMaPPer catalogue: a Flux-Limited sample containing 8865 clusters, a Volume-Limited sample with 5391 clusters, and a Volume&Redshift-Limited sample with 4450 clusters. For the three samples, we find the mean cluster masses to be ${M}_{200{\rm{m}}}=1.66\pm0.13$ [stat.]$\pm0.03$ [sys.], $1.97\pm0.18$ [stat.]$\pm0.05$ [sys.], and $2.11\pm0.20$ [stat.]$\pm0.05$ [sys.]$\times{10}^{14}\ {\rm{M}}_{\odot }$, respectively. This is a factor of $\sim2$ improvement relative to the precision of measurements with previous generations of SPT surveys and the most constraining cluster mass measurements using CMB cluster lensing to date. Overall, we find no significant tensions between our results and masses given by redMaPPer mass-richness scaling relations of previous works, which were calibrated using CMB cluster lensing, optical weak lensing, and velocity dispersion measurements from various combinations of DES, SDSS and Planck data. We then divide our sample into 3 redshift and 3 richness bins, finding no significant tensions with optical weak-lensing calibrated masses in these bins. We forecast a $5.7\%$ constraint on the mean cluster mass of the DES Y3 sample with the complete SPT-3G surveys when using both temperature and polarization data and including an additional $\sim1400$ deg$^2$ of observations from the 'Extended' SPT-3G survey.
△ Less
Submitted 12 June, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
The onset of bar formation in a massive galaxy at $z \sim 3.8$
Authors:
Aristeidis Amvrosiadis,
Samuel Lange,
James Nightingale,
Qiuhan He,
Carlos S. Frenk,
Kyle A. Oman,
Ian Smail,
Mark A. Swinbank,
Francesca Fragkoudi,
Dimitri A. Gadotti,
Shaun Cole,
Edoardo Borsato,
Andrew Robertson,
Richard Massey,
Xiaoyue Cao,
Ran Li
Abstract:
We examine the morphological and kinematical properties of SPT-2147, a strongly lensed, massive, dusty, star-forming galaxy at $z = 3.762$. Combining data from JWST, HST, and ALMA, we study the galaxy's stellar emission, dust continuum and gas properties. The imaging reveals a central bar structure in the stars and gas embedded within an extended disc with a spiral arm-like feature. The kinematics…
▽ More
We examine the morphological and kinematical properties of SPT-2147, a strongly lensed, massive, dusty, star-forming galaxy at $z = 3.762$. Combining data from JWST, HST, and ALMA, we study the galaxy's stellar emission, dust continuum and gas properties. The imaging reveals a central bar structure in the stars and gas embedded within an extended disc with a spiral arm-like feature. The kinematics confirm the presence of the bar and of the regularly rotating disc. Dynamical modeling yields a dynamical mass, ${M}_{\rm dyn} = (9.7 \pm 2.0) \times 10^{10}$ ${\rm M}_{\odot}$, and a maximum rotational velocity to velocity dispersion ratio, $V / σ= 9.8 \pm 1.2$. From multi-band imaging we infer, via SED fitting, a stellar mass, ${M}_{\star} = (6.3 \pm 0.9) \times 10^{10}$ $\rm{M}_{\odot}$, and a star formation rate, ${\rm SFR} = 781 \pm 99$ ${\rm M_{\odot} yr^{-1}}$, after correcting for magnification. Combining these measurements with the molecular gas mass, we derive a baryonic-to-total mass ratio of ${M}_{\rm bar} / {M}_{\rm dyn} = 0.9 \pm 0.2$ within 4.0 kpc. This finding suggests that the formation of bars in galaxies begins earlier in the history of the Universe than previously thought and can also occur in galaxies with elevated gas fractions.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
MHONGOOSE -- A MeerKAT Nearby Galaxy HI Survey
Authors:
W. J. G. de Blok,
J. Healy,
F. M. Maccagni,
D. J. Pisano,
A. Bosma,
J. English,
T. Jarrett,
A. Marasco,
G. R. Meurer,
S. Veronese,
F. Bigiel,
L. Chemin,
F. Fraternali,
B. W. Holwerda,
P. Kamphuis,
H. R. Klöckner,
D. Kleiner,
A. K. Leroy,
M. Mogotsi,
K. A. Oman,
E. Schinnerer,
L. Verdes-Montenegro,
T. Westmeier,
O. I. Wong,
N. Zabel
, et al. (35 additional authors not shown)
Abstract:
The MHONGOOSE (MeerKAT HI Observations of Nearby Galactic Objects: Observing Southern Emitters) survey maps the distribution and kinematics of the neutral atomic hydrogen (HI) gas in and around 30 nearby star-forming spiral and dwarf galaxies to extremely low HI column densities. The HI column density sensitivity (3 sigma over 16 km/s) ranges from ~ 5 x 10^{17} cm^{-2} at 90'' resolution to ~4 x 1…
▽ More
The MHONGOOSE (MeerKAT HI Observations of Nearby Galactic Objects: Observing Southern Emitters) survey maps the distribution and kinematics of the neutral atomic hydrogen (HI) gas in and around 30 nearby star-forming spiral and dwarf galaxies to extremely low HI column densities. The HI column density sensitivity (3 sigma over 16 km/s) ranges from ~ 5 x 10^{17} cm^{-2} at 90'' resolution to ~4 x 10^{19} cm^{-2} at the highest resolution of 7''. The HI mass sensitivity (3 sigma over 50 km/s) is ~5.5 X 10^5 M_sun at a distance of 10 Mpc (the median distance of the sample galaxies). The velocity resolution of the data is 1.4 km/s. One of the main science goals of the survey is the detection of cold, accreting gas in the outskirts of the sample galaxies. The sample was selected to cover a range in HI masses, from 10^7 M_sun to almost 10^{11} M_sun, to optimally sample possible accretion scenarios and environments. The distance to the sample galaxies ranges from 3 to 23 Mpc. In this paper, we present the sample selection, survey design, and observation and reduction procedures. We compare the integrated HI fluxes based on the MeerKAT data with those derived from single-dish measurement and find good agreement, indicating that our MeerKAT observations are recovering all flux. We present HI moment maps of the entire sample based on the first ten percent of the survey data, and find that a comparison of the zeroth- and second-moment values shows a clear separation between the physical properties of the HI in areas with star formation and areas without, related to the formation of a cold neutral medium. Finally, we give an overview of the HI-detected companion and satellite galaxies in the 30 fields, five of which have not previously been catalogued. We find a clear relation between the number of companion galaxies and the mass of the main target galaxy.
△ Less
Submitted 6 June, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
FABLES: Evaluating faithfulness and content selection in book-length summarization
Authors:
Yekyung Kim,
Yapei Chang,
Marzena Karpinska,
Aparna Garimella,
Varun Manjunatha,
Kyle Lo,
Tanya Goyal,
Mohit Iyyer
Abstract:
While long-context large language models (LLMs) can technically summarize book-length documents (>100K tokens), the length and complexity of the documents have so far prohibited evaluations of input-dependent aspects like faithfulness. In this paper, we conduct the first large-scale human evaluation of faithfulness and content selection on LLM-generated summaries of fictional books. Our study miti…
▽ More
While long-context large language models (LLMs) can technically summarize book-length documents (>100K tokens), the length and complexity of the documents have so far prohibited evaluations of input-dependent aspects like faithfulness. In this paper, we conduct the first large-scale human evaluation of faithfulness and content selection on LLM-generated summaries of fictional books. Our study mitigates the issue of data contamination by focusing on summaries of books published in 2023 or 2024, and we hire annotators who have fully read each book prior to the annotation task to minimize cost and cognitive burden. We collect FABLES, a dataset of annotations on 3,158 claims made in LLM-generated summaries of 26 books, at a cost of $5.2K USD, which allows us to rank LLM summarizers based on faithfulness: Claude-3-Opus significantly outperforms all closed-source LLMs, while the open-source Mixtral is on par with GPT-3.5-Turbo. An analysis of the annotations reveals that most unfaithful claims relate to events and character states, and they generally require indirect reasoning over the narrative to invalidate. While LLM-based auto-raters have proven reliable for factuality and coherence in other settings, we implement several LLM raters of faithfulness and find that none correlates strongly with human annotations, especially with regard to detecting unfaithful claims. Our experiments suggest that detecting unfaithful claims is an important future direction not only for summarization evaluation but also as a testbed for long-context understanding. Finally, we move beyond faithfulness by exploring content selection errors in book-length summarization: we develop a typology of omission errors related to crucial narrative elements and also identify a systematic over-emphasis on events occurring towards the end of the book.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Continuously tunable uniaxial strain control of van der Waals heterostructure devices
Authors:
Zhaoyu Liu,
Xuetao Ma,
John Cenker,
Jiaqi Cai,
Zaiyao Fei,
Paul Malinowski,
Joshua Mutch,
Yuzhou Zhao,
Kyle Hwangbo,
Zhong Lin,
Arnab Manna,
Jihui Yang,
David Cobden,
Xiaodong Xu,
Matthew Yankowitz,
Jiun-Haw Chu
Abstract:
Uniaxial strain has been widely used as a powerful tool for investigating and controlling the properties of quantum materials. However, existing strain techniques have so far mostly been limited to use with bulk crystals. Although recent progress has been made in extending the application of strain to two-dimensional van der Waals (vdW) heterostructures, these techniques have been limited to optic…
▽ More
Uniaxial strain has been widely used as a powerful tool for investigating and controlling the properties of quantum materials. However, existing strain techniques have so far mostly been limited to use with bulk crystals. Although recent progress has been made in extending the application of strain to two-dimensional van der Waals (vdW) heterostructures, these techniques have been limited to optical characterization and extremely simple electrical device geometries. Here, we report a piezoelectric-based \textit{in situ} uniaxial strain technique enabling simultaneous electrical transport and optical spectroscopy characterization of dual-gated vdW heterostructure devices. Critically, our technique remains compatible with vdW heterostructure devices of arbitrary complexity fabricated on conventional silicon/silicon dioxide wafer substrates. We demonstrate a large and continuously tunable strain of up to $-0.15\%$ at millikelvin temperatures, with larger strain values also likely achievable. We quantify the strain transmission from the silicon wafer to the vdW heterostructure, and further demonstrate the ability of strain to modify the electronic properties of twisted bilayer graphene. Our technique provides a highly versatile new method for exploring the effect of uniaxial strain on both the electrical and optical properties of vdW heterostructures, and can be easily extended to include additional characterization techniques.
△ Less
Submitted 23 May, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Enhancing interferometry using weak value amplification with real weak values
Authors:
**g-Hui Huang,
Kyle M. Jordan,
Adetunmise C. Dada,
Xiang-Yun Hu,
Jeff. S. Lundeen
Abstract:
We introduce an ultra-sensitive interferometric protocol that combines weak value amplification (WVA) with traditional interferometry. This WVA+interferometry protocol uses weak value amplification of the relative delay between two paths to enhance the interferometric sensitivity, approaching the quantum limit for classical light. As an example, we demonstrate a proof-of-principle experiment that…
▽ More
We introduce an ultra-sensitive interferometric protocol that combines weak value amplification (WVA) with traditional interferometry. This WVA+interferometry protocol uses weak value amplification of the relative delay between two paths to enhance the interferometric sensitivity, approaching the quantum limit for classical light. As an example, we demonstrate a proof-of-principle experiment that achieves few-attosecond timing resolution (few-nanometer path length resolution) with a double-slit interferometer using only common optical components. Since our example uses only the spatial shift of double-slit interference fringes, its precision is not limited by the timing resolution of the detectors, but is instead limited solely by the fundamental shot noise associated with classical light. We experimentally demonstrate that the signal-to-noise ratio can be improved by one to three orders of magnitude and approaches the shot-noise limit in the large amplification regime. Previously, quantum-limited WVA delay measurements were thought to require imaginary weak values, which necessitate light with a broad spectral bandwidth and high-resolution spectrometers. In contrast, our protocol highlights the feasibility of using real weak values and narrowband light. Thus, our protocol is a compelling and cost-effective approach to enhance interferometry.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
Interactive Multi-Robot Flocking with Gesture Responsiveness and Musical Accompaniment
Authors:
Catie Cuan,
Kyle Jeffrey,
Kim Kleiven,
Adrian Li,
Emre Fisher,
Matt Harrison,
Benjie Holson,
Allison Okamura,
Matt Bennice
Abstract:
For decades, robotics researchers have pursued various tasks for multi-robot systems, from cooperative manipulation to search and rescue. These tasks are multi-robot extensions of classical robotic tasks and often optimized on dimensions such as speed or efficiency. As robots transition from commercial and research settings into everyday environments, social task aims such as engagement or enterta…
▽ More
For decades, robotics researchers have pursued various tasks for multi-robot systems, from cooperative manipulation to search and rescue. These tasks are multi-robot extensions of classical robotic tasks and often optimized on dimensions such as speed or efficiency. As robots transition from commercial and research settings into everyday environments, social task aims such as engagement or entertainment become increasingly relevant. This work presents a compelling multi-robot task, in which the main aim is to enthrall and interest. In this task, the goal is for a human to be drawn to move alongside and participate in a dynamic, expressive robot flock. Towards this aim, the research team created algorithms for robot movements and engaging interaction modes such as gestures and sound. The contributions are as follows: (1) a novel group navigation algorithm involving human and robot agents, (2) a gesture responsive algorithm for real-time, human-robot flocking interaction, (3) a weight mode characterization system for modifying flocking behavior, and (4) a method of encoding a choreographer's preferences inside a dynamic, adaptive, learned system. An experiment was performed to understand individual human behavior while interacting with the flock under three conditions: weight modes selected by a human choreographer, a learned model, or subset list. Results from the experiment showed that the perception of the experience was not influenced by the weight mode selection. This work elucidates how differing task aims such as engagement manifest in multi-robot system design and execution, and broadens the domain of multi-robot tasks.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
A PPO-based DRL Auto-Tuning Nonlinear PID Drone Controller for Robust Autonomous Flights
Authors:
Junyang Zhang,
Cristian Emanuel Ocampo Rivera,
Kyle Tyni,
Steven Nguyen
Abstract:
This project aims to revolutionize drone flight control by implementing a nonlinear Deep Reinforcement Learning (DRL) agent as a replacement for traditional linear Proportional Integral Derivative (PID) controllers. The primary objective is to seamlessly transition drones between manual and autonomous modes, enhancing responsiveness and stability. We utilize the Proximal Policy Optimization (PPO)…
▽ More
This project aims to revolutionize drone flight control by implementing a nonlinear Deep Reinforcement Learning (DRL) agent as a replacement for traditional linear Proportional Integral Derivative (PID) controllers. The primary objective is to seamlessly transition drones between manual and autonomous modes, enhancing responsiveness and stability. We utilize the Proximal Policy Optimization (PPO) reinforcement learning strategy within the Gazebo simulator to train the DRL agent. Adding a $20,000 indoor Vicon tracking system offers <1mm positioning accuracy, which significantly improves autonomous flight precision. To navigate the drone in the shortest collision-free trajectory, we also build a 3 dimensional A* path planner and implement it into the real flight successfully.
△ Less
Submitted 29 March, 2024;
originally announced April 2024.
-
Invertibility of Discrete-Time Linear Systems with Sparse Inputs
Authors:
Kyle Poe,
Enrique Mallada,
Rene Vidal
Abstract:
One of the fundamental problems of interest for discrete-time linear systems is whether its input sequence may be recovered given its output sequence, a.k.a. the left inversion problem. Many conditions on the state space geometry, dynamics, and spectral structure of a system have been used to characterize the well-posedness of this problem, without assumptions on the inputs. However, certain struc…
▽ More
One of the fundamental problems of interest for discrete-time linear systems is whether its input sequence may be recovered given its output sequence, a.k.a. the left inversion problem. Many conditions on the state space geometry, dynamics, and spectral structure of a system have been used to characterize the well-posedness of this problem, without assumptions on the inputs. However, certain structural assumptions, such as input sparsity, have been shown to translate to practical gains in the performance of inversion algorithms, surpassing classical guarantees. Establishing necessary and sufficient conditions for left invertibility of systems with sparse inputs is therefore a crucial step toward understanding the performance limits of system inversion under structured input assumptions. In this work, we provide the first necessary and sufficient characterizations of left invertibility for linear systems with sparse inputs, echoing classic characterizations for standard linear systems. The key insight in deriving these results is in establishing the existence of two novel geometric invariants unique to the sparse-input setting, the weakly unobservable and strongly reachable subspace arrangements. By means of a concrete example, we demonstrate the utility of these characterizations. We conclude by discussing extensions and applications of this framework to several related problems in sparse control.
△ Less
Submitted 29 March, 2024;
originally announced March 2024.
-
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs
Authors:
Kanishka Misra,
Kyle Mahowald
Abstract:
Language models learn rare syntactic phenomena, but it has been argued that they rely on rote memorization, as opposed to grammatical generalization. Training on a corpus of human-scale in size (100M words), we iteratively trained transformer language models on systematically manipulated corpora and then evaluated their learning of a particular rare grammatical phenomenon: the English Article+Adje…
▽ More
Language models learn rare syntactic phenomena, but it has been argued that they rely on rote memorization, as opposed to grammatical generalization. Training on a corpus of human-scale in size (100M words), we iteratively trained transformer language models on systematically manipulated corpora and then evaluated their learning of a particular rare grammatical phenomenon: the English Article+Adjective+Numeral+Noun (AANN) construction (``a beautiful five days''). We first compared how well this construction was learned on the default corpus relative to a counterfactual corpus in which the AANN sentences were removed. AANNs were still learned better than systematically perturbed variants of the construction. Using additional counterfactual corpora, we suggest that this learning occurs through generalization from related constructions (e.g., ``a few days''). An additional experiment showed that this learning is enhanced when there is more variability in the input. Taken together, our results provide an existence proof that models learn rare grammatical phenomena by generalization from less rare phenomena. Code available at https://github.com/kanishkamisra/aannalysis
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
On the Exact Fourier Dimension of Sets of Well-Approximable Matrices
Authors:
Thomas Cai,
Kyle Hambrook
Abstract:
We compute the exact Fourier dimension of the set of $Ψ$-well-approximable $m \times n$ matrices (and the set of $Ψ$-well-approximable numbers) in the homogeneous and inhomogeneous cases for any approximation function $Ψ$ satisfying $\sum_{q \in \mathbb{Z}^n} Ψ(q)^m < \infty$.
We compute the exact Fourier dimension of the set of $Ψ$-well-approximable $m \times n$ matrices (and the set of $Ψ$-well-approximable numbers) in the homogeneous and inhomogeneous cases for any approximation function $Ψ$ satisfying $\sum_{q \in \mathbb{Z}^n} Ψ(q)^m < \infty$.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
UniFaaS: Programming across Distributed Cyberinfrastructure with Federated Function Serving
Authors:
Yifei Li,
Ryan Chard,
Yadu Babuji,
Kyle Chard,
Ian Foster,
Zhuozhao Li
Abstract:
Modern scientific applications are increasingly decomposable into individual functions that may be deployed across distributed and diverse cyberinfrastructure such as supercomputers, clouds, and accelerators. Such applications call for new approaches to programming, distributed execution, and function-level management. We present UniFaaS, a parallel programming framework that relies on a federated…
▽ More
Modern scientific applications are increasingly decomposable into individual functions that may be deployed across distributed and diverse cyberinfrastructure such as supercomputers, clouds, and accelerators. Such applications call for new approaches to programming, distributed execution, and function-level management. We present UniFaaS, a parallel programming framework that relies on a federated function-as-a-service (FaaS) model to enable composition of distributed, scalable, and high-performance scientific workflows, and to support fine-grained function-level management. UniFaaS provides a unified programming interface to compose dynamic task graphs with transparent wide-area data management. UniFaaS exploits an observe-predict-decide approach to efficiently map workflow tasks to target heterogeneous and dynamic resources. We propose a dynamic heterogeneity-aware scheduling algorithm that employs a delay mechanism and a re-scheduling mechanism to accommodate dynamic resource capacity. Our experiments show that UniFaaS can efficiently execute workflows across computing resources with minimal scheduling overhead. We show that UniFaaS can improve the performance of a real-world drug screening workflow by as much as 22.99% when employing an additional 19.48% of resources and a montage workflow by 54.41% when employing an additional 47.83% of resources across multiple distributed clusters, in contrast to using a single cluster
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Symmetry criteria for the equality of interior and exterior shape factors
Authors:
Kyle McKee,
John H. Lienhard
Abstract:
Lienhard (2019) reported that the shape factor of the interior of a simply-connected region ($Ω$) is equal to that of its exterior ($\mathbb{R}^2\backslashΩ$) under the same boundary conditions. In that study, numerical examples supported the claim in particular cases; for example, it was shown that for certain boundary conditions on circles and squares, the conjecture holds. In the present paper,…
▽ More
Lienhard (2019) reported that the shape factor of the interior of a simply-connected region ($Ω$) is equal to that of its exterior ($\mathbb{R}^2\backslashΩ$) under the same boundary conditions. In that study, numerical examples supported the claim in particular cases; for example, it was shown that for certain boundary conditions on circles and squares, the conjecture holds. In the present paper, we show that the conjecture is not generally true, unless some additional condition is met. We proceed by elucidating why the conjecture does in fact hold in all of the examples analysed by Lienhard. We thus deduce a simple criterion which, when satisfied, ensures the equality of interior and exterior shape factors in general. Our criterion notably relies on a beautiful and little-known symmetry method due to Hersch (1982) which we introduce in a tutorial manner.
△ Less
Submitted 7 April, 2024; v1 submitted 27 March, 2024;
originally announced March 2024.
-
A Transformer-Based Framework for Payload Malware Detection and Classification
Authors:
Kyle Stein,
Arash Mahyari,
Guillermo Francia III,
Eman El-Sheikh
Abstract:
As malicious cyber threats become more sophisticated in breaching computer networks, the need for effective intrusion detection systems (IDSs) becomes crucial. Techniques such as Deep Packet Inspection (DPI) have been introduced to allow IDSs analyze the content of network packets, providing more context for identifying potential threats. IDSs traditionally rely on using anomaly-based and signatur…
▽ More
As malicious cyber threats become more sophisticated in breaching computer networks, the need for effective intrusion detection systems (IDSs) becomes crucial. Techniques such as Deep Packet Inspection (DPI) have been introduced to allow IDSs analyze the content of network packets, providing more context for identifying potential threats. IDSs traditionally rely on using anomaly-based and signature-based detection techniques to detect unrecognized and suspicious activity. Deep learning techniques have shown great potential in DPI for IDSs due to their efficiency in learning intricate patterns from the packet content being transmitted through the network. In this paper, we propose a revolutionary DPI algorithm based on transformers adapted for the purpose of detecting malicious traffic with a classifier head. Transformers learn the complex content of sequence data and generalize them well to similar scenarios thanks to their self-attention mechanism. Our proposed method uses the raw payload bytes that represent the packet contents and is deployed as man-in-the-middle. The payload bytes are used to detect malicious packets and classify their types. Experimental results on the UNSW-NB15 and CIC-IOT23 datasets demonstrate that our transformer-based model is effective in distinguishing malicious from benign traffic in the test dataset, attaining an average accuracy of 79\% using binary classification and 72\% on the multi-classification experiment, both using solely payload bytes.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Vision-Based Force Estimation for Minimally Invasive Telesurgery Through Contact Detection and Local Stiffness Models
Authors:
Shuyuan Yang,
My H. Le,
Kyle R. Golobish,
Juan C. Beaver,
Zonghe Chua
Abstract:
In minimally invasive telesurgery, obtaining accurate force information is difficult due to the complexities of in-vivo end effector force sensing. This constrains development and implementation of haptic feedback and force-based automated performance metrics, respectively. Vision-based force sensing approaches using deep learning are a promising alternative to intrinsic end effector force sensing…
▽ More
In minimally invasive telesurgery, obtaining accurate force information is difficult due to the complexities of in-vivo end effector force sensing. This constrains development and implementation of haptic feedback and force-based automated performance metrics, respectively. Vision-based force sensing approaches using deep learning are a promising alternative to intrinsic end effector force sensing. However, they have limited ability to generalize to novel scenarios, and require learning on high-quality force sensor training data that can be difficult to obtain. To address these challenges, this paper presents a novel vision-based contact-conditional approach for force estimation in telesurgical environments. Our method leverages supervised learning with human labels and end effector position data to train deep neural networks. Predictions from these trained models are optionally combined with robot joint torque information to estimate forces indirectly from visual data. We benchmark our method against ground truth force sensor data and demonstrate generality by fine-tuning to novel surgical scenarios in a data-efficient manner. Our methods demonstrated greater than 90% accuracy on contact detection and less than 10% force prediction error. These results suggest potential usefulness of contact-conditional force estimation for sensory substitution haptic feedback and tissue handling skill evaluation in clinical settings.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Testing the $\mathbfΛ$CDM Cosmological Model with Forthcoming Measurements of the Cosmic Microwave Background with SPT-3G
Authors:
K. Prabhu,
S. Raghunathan,
M. Millea,
G. Lynch,
P. A. R. Ade,
E. Anderes,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
L. Balkenhol,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
P. M. Chichura,
T. -L. Chou,
A. Coerver
, et al. (76 additional authors not shown)
Abstract:
We forecast constraints on cosmological parameters enabled by three surveys conducted with SPT-3G, the third-generation camera on the South Pole Telescope. The surveys cover separate regions of 1500, 2650, and 6000 ${\rm deg}^{2}$ to different depths, in total observing 25% of the sky. These regions will be measured to white noise levels of roughly 2.5, 9, and 12 $μ{\rm K-arcmin}$, respectively, i…
▽ More
We forecast constraints on cosmological parameters enabled by three surveys conducted with SPT-3G, the third-generation camera on the South Pole Telescope. The surveys cover separate regions of 1500, 2650, and 6000 ${\rm deg}^{2}$ to different depths, in total observing 25% of the sky. These regions will be measured to white noise levels of roughly 2.5, 9, and 12 $μ{\rm K-arcmin}$, respectively, in CMB temperature units at 150 GHz by the end of 2024. The survey also includes measurements at 95 and 220 GHz, which have noise levels a factor of ~1.2 and 3.5 times higher than 150 GHz, respectively, with each band having a polarization noise level ~$\sqrt{\text{2}}$ times higher than the temperature noise. We use a novel approach to obtain the covariance matrices for jointly and optimally estimated gravitational lensing potential bandpowers and unlensed CMB temperature and polarization bandpowers. We demonstrate the ability to test the $Λ{\rm CDM}$ model via the consistency of cosmological parameters constrained independently from SPT-3G and Planck data, and consider the improvement in constraints on $Λ{\rm CDM}$ extension parameters from a joint analysis of SPT-3G and Planck data. The $Λ{\rm CDM}$ cosmological parameters are typically constrained with uncertainties up to ~2 times smaller with SPT-3G data, compared to Planck, with the two data sets measuring significantly different angular scales and polarization levels, providing additional tests of the standard cosmological model.
△ Less
Submitted 5 July, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Local magnetic response of superconducting Sr$\mathrm{_2}$RuO$\mathrm{_4}$ thin films and rings
Authors:
G. M. Ferguson,
Hari P. Nair,
Nathaniel J. Schreiber,
Ludi Miao,
Kyle M. Shen,
Darrell G. Schlom,
Katja C. Nowack
Abstract:
We conduct local magnetic measurements on superconducting thin-film samples of Sr$\mathrm{_2}$RuO$\mathrm{_4}$ using scanning Superconducting Quantum Interference Device (SQUID) susceptometry. From the diamagnetic response, we extract the magnetic penetration depth, $λ$, which exhibits a quadratic temperature dependence at low temperatures. Although a quadratic dependence in high-purity bulk sampl…
▽ More
We conduct local magnetic measurements on superconducting thin-film samples of Sr$\mathrm{_2}$RuO$\mathrm{_4}$ using scanning Superconducting Quantum Interference Device (SQUID) susceptometry. From the diamagnetic response, we extract the magnetic penetration depth, $λ$, which exhibits a quadratic temperature dependence at low temperatures. Although a quadratic dependence in high-purity bulk samples has been attributed to non-local electrodynamics, our analysis suggests that in our thin-film samples the presence of scattering is the origin of the quadratic dependence. While we observe micron-scale variations in the diamagnetic response and superconducting transition temperature, the form of the temperature dependence of $λ$ is independent of position. Finally, we characterize flux trap** in superconducting rings lithographically fabricated from the thin films, paving the way to systematic device-based tests of the superconducting order parameter in Sr$\mathrm{_2}$RuO$\mathrm{_4}$.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Addressing the Band Gap Problem with a Machine-Learned Exchange Functional
Authors:
Kyle Bystrom,
Stefano Falletta,
Boris Kozinsky
Abstract:
The systematic underestimation of band gaps is one of the most fundamental challenges in semilocal density functional theory (DFT). In addition to hindering the application of DFT to predicting electronic properties, the band gap problem is intimately related to self-interaction and delocalization errors, which make the study of charge transfer mechanisms with DFT difficult. In this work, we prese…
▽ More
The systematic underestimation of band gaps is one of the most fundamental challenges in semilocal density functional theory (DFT). In addition to hindering the application of DFT to predicting electronic properties, the band gap problem is intimately related to self-interaction and delocalization errors, which make the study of charge transfer mechanisms with DFT difficult. In this work, we present two key innovations to address the band gap problem. First, we design an approach for machine learning density functionals based on Gaussian processes to explicitly fit single-particle energy levels. Second, we introduce novel nonlocal features of the density matrix that are expressive enough to fit these single-particle levels. Combining these developments, we train a machine-learned functional for the exact exchange energy that predicts molecular energy gaps and reaction energies of a wide range of molecules in excellent agreement with reference hybrid DFT calculations. In addition, while being trained solely on molecular data, our model predicts reasonable formation energies of polarons in solids, showcasing its transferability and robustness. Our approach generalizes straightforwardly to full exchange-correlation functionals, thus paving the way to the design of novel state-of-the-art functionals for the prediction of electronic properties of molecules and materials.
△ Less
Submitted 10 April, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Multimodal operando microscopy reveals that interfacial chemistry and nanoscale performance disorder dictate perovskite solar cell stability
Authors:
Kyle Frohna,
Cullen Chosy,
Amran Al-Ashouri,
Florian Scheler,
Yu-Hsien Chiang,
Milos Dubajic,
Julia E. Parker,
Jessica M. Walker,
Lea Zimmermann,
Thomas A. Selby,
Yang Lu,
Bart Roose,
Steve Albrecht,
Miguel Anaya,
Samuel D. Stranks
Abstract:
Next-generation low-cost semiconductors such as halide perovskites exhibit optoelectronic properties dominated by nanoscale variations in their structure, composition and photophysics. While microscopy provides a proxy for ultimate device function, past works have focused on neat thin-films on insulating substrates, missing crucial information about charge extraction losses and recombination losse…
▽ More
Next-generation low-cost semiconductors such as halide perovskites exhibit optoelectronic properties dominated by nanoscale variations in their structure, composition and photophysics. While microscopy provides a proxy for ultimate device function, past works have focused on neat thin-films on insulating substrates, missing crucial information about charge extraction losses and recombination losses introduced by transport layers. Here we use a multimodal operando microscopy toolkit to measure nanoscale current-voltage curves, recombination losses and chemical composition in an array of state-of-the-art perovskite solar cells before and after extended operational stress. We apply this toolkit to the same scan areas before and after extended operation to reveal that devices with the highest performance have the lowest initial performance spatial heterogeneity - a crucial link that is missed in conventional microscopy. We find that subtle compositional engineering of the perovskite has surprising effects on local disorder and resilience to operational stress. Minimising variations in local efficiency, rather than compositional disorder, is predictive of improved performance and stability. Modulating the interfaces with different contact layers or passivation treatments can increase initial performance but can also lead to dramatic nanoscale, interface-dominated degradation even in the presence of local performance homogeneity, inducing spatially varying transport, recombination, and electrical losses. These operando measurements of full devices act as screenable diagnostic tools, uniquely unveiling the microscopic mechanistic origins of device performance losses and degradation in an array of halide perovskite devices and treatments. This information in turn reveals guidelines for future improvements to both performance and stability.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Illuminating Systematic Trends in Nuclear Data with Generative Machine Learning Models
Authors:
Jordan M. R. Fox,
Kyle A. Wendt
Abstract:
We introduce a novel method for studying systematic trends in nuclear reaction data using generative adversarial networks. Libraries of nuclear cross section evaluations exhibit intricate systematic trends across the nuclear landscape, and predictive models capable of reproducing and analyzing these trends are valuable for many applications. We have developed a predictive model using deep generati…
▽ More
We introduce a novel method for studying systematic trends in nuclear reaction data using generative adversarial networks. Libraries of nuclear cross section evaluations exhibit intricate systematic trends across the nuclear landscape, and predictive models capable of reproducing and analyzing these trends are valuable for many applications. We have developed a predictive model using deep generative adversarial networks to learn trends from the inelastic neutron scattering channel of TENDL for even-even nuclei. The system predicts cross sections based on adding/subtracting particles to/from the target nucleus. It can thus help identify cross sections that break from expected trends and predict beyond the limit of current experiments. Our model can produce good predictions for cross section curves for many nuclides, and it is most robust near the line of stability. We also create an ensemble of predictions to leverage different correlations and estimate model uncertainty. This research marks an important first step in computer generation of nuclear cross-section libraries.
△ Less
Submitted 29 April, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
Minimal Cellular Resolutions of Path Ideals
Authors:
Trung Chau,
Selvi Kara,
Kyle Wang
Abstract:
In this paper, we prove that the path ideals of both paths and cycles have minimal cellular resolutions. Specifically, these minimal free resolutions coincide with the Barile-Macchia resolutions for paths, and their generalized counterparts for cycles. Furthermore, we identify edge ideals of cycles as a class of ideals that lack a minimal Barile-Macchia resolution, yet have a minimal generalized B…
▽ More
In this paper, we prove that the path ideals of both paths and cycles have minimal cellular resolutions. Specifically, these minimal free resolutions coincide with the Barile-Macchia resolutions for paths, and their generalized counterparts for cycles. Furthermore, we identify edge ideals of cycles as a class of ideals that lack a minimal Barile-Macchia resolution, yet have a minimal generalized Barile-Macchia resolution.
△ Less
Submitted 13 April, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
Elemental Patterns from the Erdős Straus Conjecture
Authors:
Kyle Bradford
Abstract:
This paper makes the following conjecture: For every prime $p$ there exists a positive integer $x$ with $\left\lceil \frac{p}{4} \right\rceil \leq x \leq \left\lceil \frac{p}{2} \right\rceil$ and a positive divisor $d|x^2$ so that either: (1) $ d \bmod \left( 4x - p \right) \equiv -px$; or (2) $d \leq x$ and $ d \bmod \left( 4x - p \right) \equiv -x$. Furthermore this paper proves that the solutio…
▽ More
This paper makes the following conjecture: For every prime $p$ there exists a positive integer $x$ with $\left\lceil \frac{p}{4} \right\rceil \leq x \leq \left\lceil \frac{p}{2} \right\rceil$ and a positive divisor $d|x^2$ so that either: (1) $ d \bmod \left( 4x - p \right) \equiv -px$; or (2) $d \leq x$ and $ d \bmod \left( 4x - p \right) \equiv -x$. Furthermore this paper proves that the solutions to these modular equations are in one-to-one correspondence with the solutions of the diophantine equation used in the Erdős Straus conjecture.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
A2DMN: Anatomy-Aware Dilated Multiscale Network for Breast Ultrasound Semantic Segmentation
Authors:
Kyle Lucke,
Aleksandar Vakanski,
Min Xian
Abstract:
In recent years, convolutional neural networks for semantic segmentation of breast ultrasound (BUS) images have shown great success; however, two major challenges still exist. 1) Most current approaches inherently lack the ability to utilize tissue anatomy, resulting in misclassified image regions. 2) They struggle to produce accurate boundaries due to the repeated down-sampling operations. To add…
▽ More
In recent years, convolutional neural networks for semantic segmentation of breast ultrasound (BUS) images have shown great success; however, two major challenges still exist. 1) Most current approaches inherently lack the ability to utilize tissue anatomy, resulting in misclassified image regions. 2) They struggle to produce accurate boundaries due to the repeated down-sampling operations. To address these issues, we propose a novel breast anatomy-aware network for capturing fine image details and a new smoothness term that encodes breast anatomy. It incorporates context information across multiple spatial scales to generate more accurate semantic boundaries. Extensive experiments are conducted to compare the proposed method and eight state-of-the-art approaches using a BUS dataset with 325 images. The results demonstrate the proposed method significantly improves the segmentation of the muscle, mammary, and tumor classes and produces more accurate fine details of tissue boundaries.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Authors:
Orion Weller,
Benjamin Chang,
Sean MacAvaney,
Kyle Lo,
Arman Cohan,
Benjamin Van Durme,
Dawn Lawrie,
Luca Soldaini
Abstract:
Modern Language Models (LMs) are capable of following long and complex instructions that enable a large and diverse set of user requests. While Information Retrieval (IR) models use these LMs as the backbone of their architectures, virtually none of them allow users to provide detailed instructions alongside queries, thus limiting their ability to satisfy complex information needs. In this work, w…
▽ More
Modern Language Models (LMs) are capable of following long and complex instructions that enable a large and diverse set of user requests. While Information Retrieval (IR) models use these LMs as the backbone of their architectures, virtually none of them allow users to provide detailed instructions alongside queries, thus limiting their ability to satisfy complex information needs. In this work, we study the use of instructions in IR systems. First, we introduce our dataset FollowIR, which contains a rigorous instruction evaluation benchmark as well as a training set for hel** IR models learn to better follow real-world instructions. FollowIR repurposes detailed instructions -- also known as narratives -- developed for professional assessors to evaluate retrieval systems. In particular, we build our benchmark from three collections curated for shared tasks at the Text REtrieval Conference (TREC). These collections contains hundreds to thousands of labeled documents per query, making them suitable for our exploration. Through this process, we can measure how well IR models follow instructions, through a new pairwise evaluation framework. Our results indicate that existing retrieval models fail to correctly use instructions, using them for basic keywords and struggling to understand long-form information. However, we show that it is possible for IR models to learn to follow complex instructions: our new FollowIR-7B model has significant improvements after fine-tuning on our training set.
△ Less
Submitted 7 May, 2024; v1 submitted 22 March, 2024;
originally announced March 2024.
-
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Authors:
Alexander Khazatsky,
Karl Pertsch,
Suraj Nair,
Ashwin Balakrishna,
Sudeep Dasari,
Siddharth Karamcheti,
Soroush Nasiriany,
Mohan Kumar Srirama,
Lawrence Yunliang Chen,
Kirsty Ellis,
Peter David Fagan,
Joey Hejna,
Masha Itkina,
Marion Lepert,
Yecheng Jason Ma,
Patrick Tree Miller,
Jimmy Wu,
Suneel Belkhale,
Shivin Dass,
Huy Ha,
Arhan Jain,
Abraham Lee,
Youngwoon Lee,
Marius Memmel,
Sungjae Park
, et al. (74 additional authors not shown)
Abstract:
The creation of large, diverse, high-quality robot manipulation datasets is an important step** stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…
▽ More
The creation of large, diverse, high-quality robot manipulation datasets is an important step** stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a result, even the most general robot manipulation policies today are mostly trained on data collected in a small number of environments with limited scene and task diversity. In this work, we introduce DROID (Distributed Robot Interaction Dataset), a diverse robot manipulation dataset with 76k demonstration trajectories or 350 hours of interaction data, collected across 564 scenes and 84 tasks by 50 data collectors in North America, Asia, and Europe over the course of 12 months. We demonstrate that training with DROID leads to policies with higher performance and improved generalization ability. We open source the full dataset, policy learning code, and a detailed guide for reproducing our robot hardware setup.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
A Unified Framework for Rerandomization using Quadratic Forms
Authors:
Kyle Schindl,
Zach Branson
Abstract:
In the design stage of a randomized experiment, one way to ensure treatment and control groups exhibit similar covariate distributions is to randomize treatment until some prespecified level of covariate balance is satisfied. This experimental design strategy is known as rerandomization. Most rerandomization methods utilize balance metrics based on a quadratic form $v^TAv$ , where $v$ is a vector…
▽ More
In the design stage of a randomized experiment, one way to ensure treatment and control groups exhibit similar covariate distributions is to randomize treatment until some prespecified level of covariate balance is satisfied. This experimental design strategy is known as rerandomization. Most rerandomization methods utilize balance metrics based on a quadratic form $v^TAv$ , where $v$ is a vector of covariate mean differences and $A$ is a positive semi-definite matrix. In this work, we derive general results for treatment-versus-control rerandomization schemes that employ quadratic forms for covariate balance. In addition to allowing researchers to quickly derive properties of rerandomization schemes not previously considered, our theoretical results provide guidance on how to choose the matrix $A$ in practice. We find the Mahalanobis and Euclidean distances optimize different measures of covariate balance. Furthermore, we establish how the covariates' eigenstructure and their relationship to the outcomes dictates which matrix $A$ yields the most precise mean-difference estimator for the average treatment effect. We find that the Euclidean distance is minimax optimal, in the sense that the mean-difference estimator's precision is never too far from the optimal choice, regardless of the relationship between covariates and outcomes. Our theoretical results are verified via simulation, where we find that rerandomization using the Euclidean distance has better performance in high-dimensional settings and typically achieves greater variance reduction to the mean-difference estimator than other quadratic forms.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Optical Atomic Clock Interrogation Via an Integrated Spiral Cavity Laser
Authors:
William Loh,
David Reens,
Dave Kharas,
Alkesh Sumant,
Connor Belanger,
Ryan T. Maxson,
Alexander Medeiros,
William Setzer,
Dodd Gray,
Kyle DeBry,
Colin D. Bruzewicz,
Jason Plant,
John Liddell,
Gavin N. West,
Sagar Doshi,
Matthew Roychowdhury,
May Kim,
Danielle Braje,
Paul W. Juodawlkis,
John Chiaverini,
Robert McConnell
Abstract:
Optical atomic clocks have demonstrated revolutionary advances in precision timekee**, but their applicability to the real world is critically dependent on whether such clocks can operate outside a laboratory setting. The challenge to clock portability stems from the many obstacles not only in miniaturizing the underlying components of the clock $-$ namely the ultrastable laser, the frequency co…
▽ More
Optical atomic clocks have demonstrated revolutionary advances in precision timekee**, but their applicability to the real world is critically dependent on whether such clocks can operate outside a laboratory setting. The challenge to clock portability stems from the many obstacles not only in miniaturizing the underlying components of the clock $-$ namely the ultrastable laser, the frequency comb, and the atomic reference itself $-$ but also in making the clock resilient to environmental fluctuations. Photonic integration offers one compelling solution to simultaneously address the problems of miniaturization and ruggedization, but brings with it a new set of challenges in recreating the functionality of an optical clock using chip-scale building blocks. The clock laser used for atom interrogation is one particular point of uncertainty, as the performance of the meticulously-engineered bulk-cavity stabilized lasers would be exceptionally difficult to transfer to chip. Here we demonstrate that a chip-integrated ultrahigh quality factor (Q) spiral cavity, when interfaced with a 1348 nm seed laser, reaches a fractional frequency instability of $7.5 \times 10^{-14}$, meeting the stability requirements for interrogating the narrow-linewidth transition of $^{88}$Sr$^+$ upon frequency doubling to 674 nm. In addition to achieving the record for laser stability on chip, we use this laser to showcase the operation of a Sr-ion clock with short-term instability averaging down as $3.9 \times 10^{-14} / \sqrtτ$, where $τ$ is the averaging time. Our demonstration of an optical atomic clock interrogated by an integrated spiral cavity laser opens the door for future advanced clock systems to be entirely constructed using lightweight, portable, and mass-manufacturable integrated optics and electronics.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Optimizing Reconfigurable Antenna MIMO Systems with Coherent Ising Machines
Authors:
Ioannis Krikidis,
Abhishek Kumar Singh,
Kyle Jamieson
Abstract:
Reconfigurable antenna multiple-input multiple-output (MIMO) is a promising technology for upcoming 6G communication systems. In this paper, we deal with the problem of configuration selection for reconfigurable antenna MIMO by leveraging Coherent Ising Machines (CIMs). By adopting the CIM as a heuristic solver for the Ising problem, the optimal antenna configuration that maximizes the received si…
▽ More
Reconfigurable antenna multiple-input multiple-output (MIMO) is a promising technology for upcoming 6G communication systems. In this paper, we deal with the problem of configuration selection for reconfigurable antenna MIMO by leveraging Coherent Ising Machines (CIMs). By adopting the CIM as a heuristic solver for the Ising problem, the optimal antenna configuration that maximizes the received signal-to-noise ratio is investigated. A mathematical framework that converts the selection problem into a CIM-compatible unconstrained quadratic formulation is presented. Numerical studies show that the proposed CIM-based design outperforms classical counterparts and achieves near-optimal performance (similar to exponentially complex exhaustive searching) while ensuring polynomial complexity.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
ARTEMIS emulator: exploring the effect of cosmology and galaxy formation physics on Milky Way-mass haloes and their satellites
Authors:
Shaun T. Brown,
Azadeh Fattahi,
Ian G. McCarthy,
Andreea S. Font,
Kyle A. Oman,
Alexander H. Riley
Abstract:
We present the new ARTEMIS emulator suite of high resolution (baryon mass of $2.23 \times 10^{4}$ $h^{-1}$M$_{\odot}$) zoom-in simulations of Milky Way mass systems. Here, three haloes from the original ARTEMIS sample have been rerun multiple times, systematically varying parameters for the stellar feedback model, the density threshold for star formation, the reionisation redshift and the assumed…
▽ More
We present the new ARTEMIS emulator suite of high resolution (baryon mass of $2.23 \times 10^{4}$ $h^{-1}$M$_{\odot}$) zoom-in simulations of Milky Way mass systems. Here, three haloes from the original ARTEMIS sample have been rerun multiple times, systematically varying parameters for the stellar feedback model, the density threshold for star formation, the reionisation redshift and the assumed warm dark matter (WDM) particle mass (assuming a thermal relic). From these simulations emulators are trained for a wide range of statistics that allow for fast predictions at combinations of parameters not originally sampled, running in $\sim 1$ms (a factor of $\sim 10^{11}$ faster than the simulations). In this paper we explore the dependence of the central haloes' stellar mass on the varied parameters, finding the stellar feedback parameters to be the most important. When constraining the parameters to match the present-day stellar mass halo mass relation inferred from abundance matching we find that there is a strong degeneracy in the stellar feedback parameters, corresponding to a freedom in formation time of the stellar component for a fixed halo assembly history. We additionally explore the dependence of the satellite stellar mass function, where it is found that variations in stellar feedback, the reionisation redshift and the WDM mass all have a significant effect. The presented emulators are a powerful tool which allows for fundamentally new ways of analysing and interpreting cosmological hydrodynamic simulations. Crucially, allowing their free (subgrid) parameters to be varied and marginalised, leading to more robust constraints and predictions.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Carrier confinement and alloy disorder exacerbate Auger-Meitner recombination in AlGaN ultraviolet light-emitting diodes
Authors:
Nick Pant,
Kyle Bushick,
Andrew McAllister,
Woncheol Lee,
Chris G. Van de Walle,
Emmanouil Kioupakis
Abstract:
The quantum efficiency of AlGaN ultraviolet light-emitting diodes (LEDs) declines (droops) at increasing operating powers due to Auger-Meitner recombination (AMR). Using first-principles density-functional theory, we show that indirect AMR mediated by electron-phonon coupling and alloy disorder can induce bulk $C$ coefficients as large as $\sim10^{-31}$ cm$^6$/s. Furthermore, we find that the conf…
▽ More
The quantum efficiency of AlGaN ultraviolet light-emitting diodes (LEDs) declines (droops) at increasing operating powers due to Auger-Meitner recombination (AMR). Using first-principles density-functional theory, we show that indirect AMR mediated by electron-phonon coupling and alloy disorder can induce bulk $C$ coefficients as large as $\sim10^{-31}$ cm$^6$/s. Furthermore, we find that the confinement of carriers by polarization fields within quantum wells severely relaxes crystal-momentum conservation, which exacerbates the rate of AMR over radiative recombination by an order of magnitude relative to the bulk. This results in a striking decrease in quantum efficiency at high power. Suppressing polarization fields and jointly increasing the well width would greatly mitigate AMR and efficiency droop.
△ Less
Submitted 16 March, 2024;
originally announced March 2024.