Search | arXiv e-print repository

arXiv:2406.19225 [pdf, other]

ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation

Authors: Nazanin Moradinasab, Laura S. Shankman, Rebecca A. Deaton, Gary K. Owens, Donald E. Brown

Abstract: Domain adaptive semantic segmentation aims to generate accurate and dense predictions for an unlabeled target domain by leveraging a supervised model trained on a labeled source domain. The prevalent self-training approach involves retraining the dense discriminative classifier of $p(class|pixel feature)$ using the pseudo-labels from the target domain. While many methods focus on mitigating the issue of noisy pseudo-labels, they often overlook the underlying data distribution p(pixel feature|class) in both the source and target domains. To address this limitation, we propose the multi-prototype Gaussian-Mixture-based (ProtoGMM) model, which incorporates the GMM into contrastive losses to perform guided contrastive learning. Contrastive losses are commonly executed in the literature using memory banks, which can lead to class biases due to underrepresented classes. Furthermore, memory banks often have fixed capacities, potentially restricting the model's ability to capture diverse representations of the target/source domains. An alternative approach is to use global class prototypes (i.e. averaged features per category). However, the global prototypes are based on the unimodal distribution assumption per class, disregarding within-class variation. To address these challenges, we propose the ProtoGMM model. This novel approach involves estimating the underlying multi-prototype source distribution by utilizing the GMM on the feature space of the source samples. The components of the GMM model act as representative prototypes.
To achieve increased intra-class semantic similarity, decreased inter-class similarity, and domain alignment between the source and target domains, we employ multi-prototype contrastive learning between source distribution and target samples. The experiments show the effectiveness of our method on UDA benchmarks.

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.16646 [pdf, other]

The VISTA Variables in the Vía Láctea eXtended (VVVX) ESO public survey: Completion of the observations and legacy

Authors: R. K. Saito, M. Hempel, J. Alonso-García, P. W. Lucas, D. Minniti, S. Alonso, L. Baravalle, J. Borissova, C. Caceres, A. N. Chené, N. J. G. Cross, F. Duplancic, E. R. Garro, M. Gómez, V. D. Ivanov, R. Kurtev, A. Luna, D. Majaess, M. G. Navarro, J. B. Pullen, M. Rejkuba, J. L. Sanders, L. C. Smith, P. H. C. Albino, M. V. Alonso, et al. (121 additional authors not shown)

Abstract: The ESO public survey VISTA Variables in the Vía Láctea (VVV) surveyed the inner Galactic bulge and the adjacent southern Galactic disk from $2009-2015$. Upon its conclusion, the complementary VVV eXtended (VVVX) survey has expanded both the temporal as well as spatial coverage of the original VVV area, widening it from $562$ to $1700$ sq. deg., as well as providing additional epochs in $JHK_{\rm s}$ filters from $2016-2023$. With the completion of VVVX observations during the first semester of 2023, we present here the observing strategy, a description of data quality and access, and the legacy of VVVX. VVVX took $\sim 2000$ hours, covering about 4% of the sky in the bulge and southern disk. VVVX covered most of the gaps left between the VVV and the VISTA Hemisphere Survey (VHS) areas and extended the VVV time baseline in the obscured regions affected by high extinction and hence hidden from optical observations. VVVX provides a deep $JHK_{\rm s}$ catalogue of $\gtrsim 1.5\times10^9$ point sources, as well as a $K_{\rm s}$ band catalogue of $\sim 10^7$ variable sources. Within the existing VVV area, we produced a $5D$ map of the surveyed region by combining positions, distances, and proper motions of well-understood distance indicators such as red clump stars, RR Lyrae, and Cepheid variables. In March 2023 we successfully finished the VVVX survey observations that started in 2016, an accomplishment for ESO Paranal Observatory upon 4200 hours of observations for VVV+VVVX.
The VVV+VVVX catalogues complement those from the Gaia mission at low Galactic latitudes and provide spectroscopic targets for the forthcoming ESO high-multiplex spectrographs MOONS and 4MOST.

Submitted 24 June, 2024; originally announced June 2024.

Comments: 17 pages, 11 figures (+ appendix). Accepted for publication in Astronomy and Astrophysics in section 14: Catalogs and data

arXiv:2406.05447 [pdf, other]

The PLATO Mission

Authors: Heike Rauer, Conny Aerts, Juan Cabrera, Magali Deleuil, Anders Erikson, Laurent Gizon, Mariejo Goupil, Ana Heras, Jose Lorenzo-Alvarez, Filippo Marliani, Cesar Martin-Garcia, J. Miguel Mas-Hesse, Laurence O'Rourke, Hugh Osborn, Isabella Pagano, Giampaolo Piotto, Don Pollacco, Roberto Ragazzoni, Gavin Ramsay, Stéphane Udry, Thierry Appourchaux, Willy Benz, Alexis Brandeker, Manuel Güdel, Eduardo Janot-Pacheco, et al. (801 additional authors not shown)

Abstract: PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observations from the ground, planets will be characterised for their radius, mass, and age with high accuracy (5 %, 10 %, 10 % for an Earth-Sun combination respectively). PLATO will provide us with a large-scale catalogue of well-characterised small planets up to intermediate orbital periods, relevant for a meaningful comparison to planet formation theories and to better understand planet evolution. It will make possible comparative exoplanetology to place our Solar System planets in a broader context. In parallel, PLATO will study (host) stars using asteroseismology, allowing us to determine the stellar properties with high accuracy, substantially enhancing our knowledge of stellar structure and evolution. The payload instrument consists of 26 cameras with 12cm aperture each. For at least four years, the mission will perform high-precision photometric measurements. Here we review the science objectives, present PLATO's target samples and fields, provide an overview of expected core science performance as well as a description of the instrument and the mission profile at the beginning of the serial production of the flight cameras.
PLATO is scheduled for a launch date end 2026. This overview therefore provides a summary of the mission to the community in preparation of the upcoming operational phases.

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2406.04058 [pdf, ps, other]

Watching Popular Musicians Learn by Ear: A Hypothesis-Generating Study of Human-Recording Interactions in YouTube Videos

Authors: Christopher Liscio, Daniel G. Brown

Abstract: Popular musicians often learn music by ear. It is unclear what role technology plays for those with experience at this task. In search of opportunities for the development of novel human-recording interactions, we analyze 18 YouTube videos depicting real-world examples of by-ear learning, and discuss why, during this preliminary phase of research, online videos are appropriate data. From our observations we generate hypotheses that can inform future work. For example, a musician's scope of learning may influence what technological interactions would help them, they could benefit from tools that accommodate their working memory, and transcription does not appear to play a key role in ear learning. Based on these findings, we pose a number of research questions, and discuss their methodological considerations to guide future study.

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.03301 [pdf, other]

Effects of Mosaic Crystal Instrument Functions on X-ray Thomson Scattering Diagnostics

Authors: Thomas Gawne, Hannah Bellenbaum, Luke B. Fletcher, Karen Appel, Carsten Baehtz, Victorien Bouffetier, Erik Brambrink, Danielle Brown, Attila Cangi, Adrien Descamps, Sebastian Göde, Nicholas J. Hartley, Marie-Luise Herbert, Philipp Hesselbach, Hauke Höppner, Oliver S. Humphries, Zuzana Konôpková, Alejandro Laso, Björn Lindqvist, Julian Lütgert, Michael J. MacDonald, Mikako Makita, Willow Martin, Mikhail Mishchenko, Zhandos A. Moldabekov, et al. (14 additional authors not shown)

Abstract: Mosaic crystals, with their high integrated reflectivities, are widely-employed in spectrometers used to diagnose high energy density systems. X-ray Thomson scattering (XRTS) has emerged as a powerful diagnostic tool of these systems, providing in principle direct access to important properties such as the temperature via detailed balance. However, the measured XRTS spectrum is broadened by the spectrometer instrument function (IF), and without careful consideration of the IF one risks misdiagnosing system conditions. Here, we consider in detail the IF of mosaic crystals and how the broadening varies across the spectrometer. Notably, we find a strong asymmetry in the shape of the IF towards higher energies. As an example, we consider the effect on the inferred temperature, and find that it can be overestimated if an approximate symmetric IF is used. We therefore expect a detailed consideration of the full IF will have an important impact on system properties inferred via XRTS in both forward modelling and model-free approaches.

Submitted 5 June, 2024; originally announced June 2024.

Comments: 18 pages, 13 figures

arXiv:2406.01855 [pdf, other]

TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability

Authors: Aisha Khatun, Daniel G. Brown

Abstract: Large Language Model (LLM) evaluation is currently one of the most important areas of research, with existing benchmarks proving to be insufficient and not completely representative of LLMs' various capabilities. We present a curated collection of challenging statements on sensitive topics for LLM benchmarking called TruthEval. These statements were curated by hand and contain known truth values. The categories were chosen to distinguish LLMs' abilities from their stochastic nature. We perform some initial analyses using this dataset and find several instances of LLMs failing in simple tasks showing their inability to understand simple questions.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.20558 [pdf, other]

Towards accelerated nuclear-physics parameter estimation from binary neutron star mergers: Emulators for the Tolman-Oppenheimer-Volkoff equations

Authors: Brendan T. Reed, Rahul Somasundaram, Soumi De, Cassandra L. Armstrong, Pablo Giuliani, Collin Capano, Duncan A. Brown, Ingo Tews

Abstract: Gravitational-wave observations of binary neutron-star (BNS) mergers have the potential to revolutionize our understanding of the nuclear equation of state (EOS) and the fundamental interactions that determine its properties. However, Bayesian parameter estimation frameworks do not typically sample over microscopic nuclear-physics parameters that determine the EOS. One of the major hurdles in doing so is the computational cost involved in solving the neutron-star structure equations, known as the Tolman-Oppenheimer-Volkoff (TOV) equations. In this paper, we explore approaches to emulating solutions for the TOV equations: Multilayer Perceptrons (MLP), Gaussian Processes (GP), and a data-driven variant of the reduced basis method (RBM). We implement these emulators for three different parameterizations of the nuclear EOS, each with a different degree of complexity represented by the number of model parameters. We find that our MLP-based emulators are generally more accurate than the other two algorithms whereas the RBM results in the largest speedup with respect to the full, high-fidelity TOV solver. We employ these emulators for a simple parameter inference using a potentially loud BNS observation, and show that the posteriors predicted by our emulators are in excellent agreement with those obtained from the full TOV solver.

Submitted 30 May, 2024; originally announced May 2024.

Comments: 13 pages and 9 figures. Comments Welcome

Report number: LA-UR-24-25009

arXiv:2405.06112 [pdf, other]

Bayesian Optimization of Sample Entropy Hyperparameters for Short Time Series

Authors: Zachary Blanks, Donald E. Brown

Abstract: Quantifying the complexity and irregularity of time series data is a primary pursuit across various data-scientific disciplines. Sample entropy (SampEn) is a widely adopted metric for this purpose, but its reliability is sensitive to the choice of its hyperparameters, the embedding dimension $(m)$ and the similarity radius $(r)$, especially for short-duration signals. This paper presents a novel methodology that addresses this challenge. We introduce a Bayesian optimization framework, integrated with a bootstrap-based variance estimator tailored for short signals, to simultaneously and optimally select the values of $m$ and $r$ for reliable SampEn estimation. Through validation on synthetic signal experiments, our approach outperformed existing benchmarks. It achieved a 60 to 90% reduction in relative error for estimating SampEn variance and a 22 to 45% decrease in relative mean squared error for SampEn estimation itself ($p \leq 0.043$). Applying our method to publicly available short-signal benchmarks yielded promising results. Unlike existing competitors, our approach was the only one to successfully identify known entropy differences across all signal sets ($p \leq 0.042$). Additionally, we introduce "EristroPy," an open-source Python package that implements our proposed optimization framework for SampEn hyperparameter selection. This work holds potential for applications where accurate estimation of entropy from short-duration signals is paramount.

Submitted 9 May, 2024; originally announced May 2024.
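
Illustrative note on the entry above: the abstract refers to sample entropy and its two hyperparameters, the embedding dimension m and the similarity radius r. The following is a minimal sketch of the textbook SampEn definition only, assuming a Chebyshev distance, an absolute (unnormalised) tolerance r, and a "within r" test written as <= r. It is not the EristroPy API and does not implement the paper's Bayesian optimization or bootstrap variance estimator; the function name and defaults are hypothetical.

```python
import math


def sample_entropy(signal, m=2, r=0.2):
    """Plain-Python SampEn sketch: -ln(A / B).

    B counts pairs of length-m templates whose Chebyshev distance is <= r,
    A counts the same for length m + 1. Self-matches are excluded.
    """
    n = len(signal)

    def count_matches(length):
        # Use the same n - m starting points for both template lengths
        # so that the two counts are directly comparable.
        templates = [signal[i:i + length] for i in range(n - m)]
        count = 0
        for i in range(len(templates)):
            for j in range(i + 1, len(templates)):
                dist = max(abs(a - b) for a, b in zip(templates[i], templates[j]))
                if dist <= r:
                    count += 1
        return count

    b = count_matches(m)
    a = count_matches(m + 1)
    if a == 0 or b == 0:
        # The estimate is undefined when no matches exist; short signals
        # hit this case easily, which is why the choice of m and r matters.
        return float("inf")
    return -math.log(a / b)


# A perfectly regular signal has zero sample entropy.
regular = [0.0, 1.0] * 10
print(sample_entropy(regular, m=2, r=0.1))
```

In practice r is often expressed as a fraction of the signal's standard deviation; the absolute tolerance here is a simplification for the sketch.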

arXiv:2405.00492 [pdf, other]

Is Temperature the Creativity Parameter of Large Language Models?

Authors: Max Peeperkorn, Tom Kouwenhoven, Dan Brown, Anna Jordanous

Abstract: Large language models (LLMs) are applied to all sorts of creative tasks, and their outputs vary from beautiful, to peculiar, to pastiche, into plain plagiarism. The temperature parameter of an LLM regulates the amount of randomness, leading to more diverse outputs; therefore, it is often claimed to be the creativity parameter. Here, we investigate this claim using a narrative generation task with a predetermined fixed context, model and prompt. Specifically, we present an empirical analysis of the LLM output for different temperature values using four necessary conditions for creativity in narrative generation: novelty, typicality, cohesion, and coherence. We find that temperature is weakly correlated with novelty, and unsurprisingly, moderately correlated with incoherence, but there is no relationship with either cohesion or typicality. However, the influence of temperature on creativity is far more nuanced and weak than suggested by the "creativity parameter" claim; overall results suggest that the LLM generates slightly more novel outputs as temperatures get higher. Finally, we discuss ideas to allow more controlled LLM creativity, rather than relying on chance via changing the temperature parameter.

Submitted 1 May, 2024; originally announced May 2024.

Comments: To be published in the Proceedings of the 15th International Conference on Computational Creativity (ICCC'24), 8 pages, 2 figures, 2 tables
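
Illustrative note on the entry above: the abstract says the temperature parameter regulates the amount of randomness in LLM output. A minimal sketch of the usual mechanism, assuming temperature is applied by dividing the logits before a softmax, is given below. The logit values and function names are hypothetical and are not taken from the paper, which evaluates generated narratives rather than token distributions.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Convert logits to probabilities after scaling by 1 / temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; higher means a flatter distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical next-token logits for three candidate tokens.
logits = [2.0, 1.0, 0.1]
for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs], round(entropy(probs), 3))
```

Raising the temperature flattens the distribution and so increases sampling randomness; whether that translates into creativity is the question the paper examines.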

arXiv:2404.17051 [pdf, other]

Toward Improving Binary Program Comprehension via Embodied Immersion: A Survey

Authors: Dennis Brown, Emily Mulder, Samuel Mulder

Abstract: Binary program comprehension is critical for many use cases but is difficult, suffering from compounded uncertainty and lack of full automation. We seek methods to improve the effectiveness of the human-machine joint cognitive system performing binary PC. We survey three research areas to perform an indirect cognitive task analysis: cognitive models of the PC process, related elements of cognitive theory, and applicable affordances of virtual reality. Based on common elements in these areas, we identify three overarching themes: enhancing abductive iteration, augmenting working memory, and supporting information organization. These themes spotlight several affordances of VR to exploit in future studies of immersive tools for binary PC.

Submitted 25 April, 2024; originally announced April 2024.

Comments: 27 pages, 4 figures, Submitted to ACM Computing Surveys

ACM Class: H.1.2; H.5.1; D.2.7

arXiv:2404.14569 [pdf, other]

LIGO operates with quantum noise below the Standard Quantum Limit

Authors: Wenxuan Jia, Victoria Xu, Kevin Kuns, Masayuki Nakano, Lisa Barsotti, Matthew Evans, Nergis Mavalvala, Rich Abbott, Ibrahim Abouelfettouh, Rana Adhikari, Alena Ananyeva, Stephen Appert, Koji Arai, Naoki Aritomi, Stuart Aston, Matthew Ball, Stefan Ballmer, David Barker, Beverly Berger, Joseph Betzwieser, Dripta Bhattacharjee, Garilynn Billingsley, Nina Bode, Edgard Bonilla, Vladimir Bossilkov, et al. (146 additional authors not shown)

Abstract: Precision measurements of space and time, like those made by the detectors of the Laser Interferometer Gravitational-wave Observatory (LIGO), are often confronted with fundamental limitations imposed by quantum mechanics. The Heisenberg uncertainty principle dictates that the position and momentum of an object cannot both be precisely measured, giving rise to an apparent limitation called the Standard Quantum Limit (SQL). Reducing quantum noise below the SQL in gravitational-wave detectors, where photons are used to continuously measure the positions of freely falling mirrors, has been an active area of research for decades. Here we show how the LIGO A+ upgrade reduced the detectors' quantum noise below the SQL by up to 3 dB while achieving a broadband sensitivity improvement, more than two decades after this possibility was first presented.

Submitted 22 April, 2024; originally announced April 2024.

Report number: LIGO-P2400059

arXiv:2404.09270 [pdf, other]

The Next Generation of MeV Energy X-ray Sources for use in the Inspection of Additively Manufactured Parts for Industry

Authors: C. Thornton, S. Karimi, S. Glenn, W. D. Brown, N. Draganic, M. Skeate, M. Ferrucci, Q. Chen, R. Jacob, K. Nakamura, T. Ostermayr, J. van Tilborg, C. Armstrong, O. J. Finlay, N. Turner, S. Glanvill, H. Martz, C. Geddes

Abstract: For the first time, we demonstrate the application of an inverse Compton scattering X-ray Source, driven by a laser-plasma accelerator, to image an additively manufactured component. X-rays with a mean energy of 380 keV were produced and used to image an additively manufactured part made of an Inconel (Nickel 718) alloy. Because inverse Compton scattering driven by laser-plasma acceleration produces high-energy X-rays while maintaining a focal spot size on the order of a micron, the source can provide several benefits over conventional X-ray production methods, particularly when imaging superalloy parts, with the potential to revolutionise what can be inspected.

Submitted 14 April, 2024; originally announced April 2024.

arXiv:2404.07185 [pdf, other]

Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery

Authors: Zohre Karimi, Shing-Hei Ho, Bao Thach, Alan Kuntz, Daniel S. Brown

Abstract: Automating robotic surgery via learning from demonstration (LfD) techniques is extremely challenging. This is because surgical tasks often involve sequential decision-making processes with complex interactions of physical objects and have low tolerance for mistakes. Prior works assume that all demonstrations are fully observable and optimal, which might not be practical in the real world. This paper introduces a sample-efficient method that learns a robust reward function from a limited amount of ranked suboptimal demonstrations consisting of partial-view point cloud observations. The method then learns a policy by optimizing the learned reward function using reinforcement learning (RL). We show that using a learned reward function to obtain a policy is more robust than pure imitation learning. We apply our approach on a physical surgical electrocautery task and demonstrate that our method can perform well even when the provided demonstrations are suboptimal and the observations are high-dimensional point clouds. Code and videos available here: https://sites.google.com/view/lfdinelectrocautery

Submitted 15 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

Comments: In proceedings of the International Symposium on Medical Robotics (ISMR) 2024. Equal contribution from two first authors

arXiv:2404.04241 [pdf, other]

Modeling Kinematic Uncertainty of Tendon-Driven Continuum Robots via Mixture Density Networks

Authors: Jordan Thompson, Brian Y. Cho, Daniel S. Brown, Alan Kuntz

Abstract: Tendon-driven continuum robot kinematic models are frequently computationally expensive, inaccurate due to unmodeled effects, or both. In particular, unmodeled effects produce uncertainties that arise during the robot's operation that lead to variability in the resulting geometry. We propose a novel solution to these issues through the development of a Gaussian mixture kinematic model. We train a mixture density network to output a Gaussian mixture model representation of the robot geometry given the current tendon displacements. This model computes a probability distribution that is more representative of the true distribution of geometries at a given configuration than a model that outputs a single geometry, while also reducing the computation time. We demonstrate one use of this model through a trajectory optimization method that explicitly reasons about the workspace uncertainty to minimize the probability of collision.

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2403.19831 [pdf, other]

TASR: A Novel Trust-Aware Stackelberg Routing Algorithm to Mitigate Traffic Congestion

Authors: Doris E. M. Brown, Venkata Sriram Siddhardh Nadendla, Sajal K. Das

Abstract: Stackelberg routing platforms (SRP) reduce congestion in one-shot traffic networks by proposing optimal route recommendations to selfish travelers. Traditionally, Stackelberg routing is cast as a partial control problem where a fraction of traveler flow complies with route recommendations, while the remaining respond as selfish travelers. In this paper, a novel Stackelberg routing framework is formulated where the agents exhibit \emph{probabilistic compliance} by accepting SRP's route recommendations with a \emph{trust} probability. A greedy \emph{\textbf{T}rust-\textbf{A}ware \textbf{S}tackelberg \textbf{R}outing} algorithm (in short, TASR) is proposed for SRP to compute unique path recommendations to each traveler flow with a unique demand. Simulation experiments are designed with random travel demands with diverse trust values on real road networks such as Sioux Falls, Chicago Sketch, and Sydney networks for both single-commodity and multi-commodity flows. The performance of TASR is compared with state-of-the-art Stackelberg routing methods in terms of traffic congestion and trust dynamics over repeated interaction between the SRP and the travelers. Results show that TASR improves network congestion without causing a significant reduction in trust towards the SRP, when compared to most well-known Stackelberg routing strategies.

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.11323 [pdf, other]

Diffusion and Multi-Domain Adaptation Methods for Eosinophil Segmentation

Authors: Kevin Lin, Donald Brown, Sana Syed, Adam Greene

Abstract: Eosinophilic Esophagitis (EoE) represents a challenging condition for medical providers today. The cause is currently unknown, the impact on a patient's daily life is significant, and it is increasing in prevalence. Traditional approaches for medical image diagnosis such as standard deep learning algorithms are limited by the relatively small amount of data and difficulty in generalization. As a response, two methods have arisen that seem to perform well: Diffusion and Multi-Domain methods with current research efforts favoring diffusion methods. For the EoE dataset, we discovered that a Multi-Domain Adversarial Network outperformed a Diffusion based method with a FID of 42.56 compared to 50.65. Future work with diffusion methods should include a comparison with Multi-Domain adaptation methods to ensure that the best performance is achieved.

Submitted 17 March, 2024; originally announced March 2024.

Comments: Preprint, Final Article Submitted to ICMVA 2024 and will be published in the International Conference Proceedings by ACM, Association for Computing Machinery (ISBN: 979-8-4007-1655-3), which will be archived in ACM Digital Library, indexed by Ei Compendex and Scopus

ACM Class: I.4.6

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2403.02431 [pdf, other]

Bayesian Constraint Inference from User Demonstrations Based on Margin-Respecting Preference Models

Authors: Dimitris Papadimitriou, Daniel S. Brown

Abstract: It is crucial for robots to be aware of the presence of constraints in order to acquire safe policies. However, explicitly specifying all constraints in an environment can be a challenging task. State-of-the-art constraint inference algorithms learn constraints from demonstrations, but tend to be computationally expensive and prone to instability issues. In this paper, we propose a novel Bayesian method that infers constraints based on preferences over demonstrations. The main advantages of our proposed approach are that it 1) infers constraints without calculating a new policy at each iteration, 2) uses a simple and more realistic ranking of groups of demonstrations, without requiring pairwise comparisons over all demonstrations, and 3) adapts to cases where there are varying levels of constraint violation. Our empirical results demonstrate that our proposed Bayesian approach infers constraints of varying severity, more accurately than state-of-the-art constraint inference methods.

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2402.14953 [pdf, other]

Dot Product Representations of Graphs Using Tropical Arithmetic

Authors: Sean Bailey, David Brown, Michael Snyder, Nicole Turner

Abstract: A dot-product representation of a graph is a mapping of its vertices to vectors of length $k$ so that vertices are adjacent if and only if the inner product (a.k.a. dot product) of their corresponding vectors exceeds some threshold. Minimizing the dimension of the vector space into which the vertices must be mapped is a typical focus. We investigate this and structural characterizations of graphs whose dot product representations are mappings into the tropical semi-rings of min-plus and max-plus. We also observe that the minimum dimension required to represent a graph using a \emph{tropical representation} is equal to the better-known threshold dimension of the graph; that is, the minimum number of subgraphs that are threshold graphs whose union is the graph being represented.

Submitted 22 February, 2024; originally announced February 2024.

arXiv:2402.12520 [pdf, other]

doi 10.1016/j.matdes.2024.113096

Data-driven study of composition-dependent phase compatibility in NiTi shape memory alloys

Authors: Sina Hossein Zadeh, Cem Cakirhan, Danial Khatamsaz, John Broucek, Timothy D. Brown, Xiaoning Qian, Ibrahim Karaman, Raymundo Arroyave

Abstract: The martensitic transformation in NiTi-based Shape Memory Alloys (SMAs) provides a basis for shape memory effect and superelasticity, thereby enabling applications requiring solid-state actuation and large recoverable shape changes upon mechanical load cycling. In order to tailor the transformation to a particular application, the compositional dependence of properties in NiTi-based SMAs, such as martensitic transformation temperatures and hysteresis, has been exploited. However, the compositional design space is large and complex, and experimental studies are expensive. In this work, we develop an interpretable piecewise linear regression model that predicts the $λ_2$ parameter, a measure of compatibility between austenite and martensite phases, and an (indirect) factor that is well-correlated with martensitic transformation hysteresis, based on the chemical features derived from the alloy composition. The model is capable of predicting, for the first time, the type of martensitic transformation for a given alloy chemistry. The proposed model is validated by experimental data from the literature as well as in-house measurements. The results show that the model can effectively distinguish between $B19$ and $B19^{\prime}$ regions for any given composition in NiTi-based SMAs and accurately estimate the $λ_2$ parameter. Our analysis also reveals that the weighted average of the quotient of the first ionization energy and the Voronoi coordination number is a key compositional characteristic that correlates with the $λ_2$ parameter and thermodynamic responses, including the transformation hysteresis, martensite start temperature, and critical temperature. The work herein demonstrates the potential of data-driven methodologies for understanding and designing NiTi-based SMAs with desired transformation characteristics.

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.05056 [pdf, other]

Measuring Neutron Star Radius with second and third generation Gravitational Wave Detector Networks

Authors: Ananya Bandopadhyay, Keisi Kacanja, Rahul Somasundaram, Alexander H. Nitz, Duncan A. Brown

Abstract: The next generation of ground-based interferometric gravitational wave detectors will observe mergers of black holes and neutron stars throughout cosmic time. A large number of the binary neutron star merger events will be observed with extremely high fidelity, and will provide stringent constraints on the equation of state of nuclear matter. In this paper, we investigate the systematic improvement in the measurability of the equation of state with increase in detector sensitivity by combining constraints obtained on the radius of a $1.4 \, \mathrm{M}_{\odot}$ neutron star from a simulated source population. Since the measurability of the equation of state depends on its stiffness, we consider a range of realistic equations of state that span the current observational constraints. We show that a single 40 km Cosmic Explorer detector can pin down the neutron star radius for a soft, medium and stiff equation of state to an accuracy of 10 m within a decade, whereas the current generation of ground-based detectors like the Advanced LIGO-Virgo network would take $\mathcal{O}(10^5)$ years to do so for a soft equation of state.

Submitted 7 February, 2024; originally announced February 2024.

Comments: 14 pages, 3 figures, 1 table, supplemental materials at https://github.com/sugwg/bns-eos-nggw

Report number: LA-UR-24-21031

arXiv:2401.13013 [pdf, ps, other]

Angular control noise in Advanced Virgo and implications for the Einstein Telescope

Authors: Riccardo Maggiore, Paolo Ruggi, Andreas Freise, Daniel Brown, Jonathan W. Perry, Enzo N. Tapia San Martín, Conor M. Mow-Lowry, Maddalena Mantovani, Julia Casanueva Diaz, Diego Bersanetti, Matteo Tacca

Abstract: With significantly improved sensitivity, the Einstein Telescope (ET), along with other upcoming gravitational wave detectors, will mark the beginning of precision gravitational wave astronomy. However, the pursuit of surpassing current detector capabilities requires careful consideration of technical constraints inherent in existing designs. The significant improvement of ET lies in the low-frequency range, where it anticipates a one million-fold increase in sensitivity compared to current detectors. Angular control noise is a primary limitation for LIGO detectors in this frequency range, originating from the need to maintain optical alignment. Given the expected improvements in ET's low-frequency range, precise assessment of angular control noise becomes crucial for achieving target sensitivity. To address this, we developed a model of the angular control system of Advanced Virgo, closely matching experimental data and providing a robust foundation for modeling future-generation detectors. Our model, for the first time, enables replication of the measured coupling level between angle and length. Additionally, our findings confirm that Virgo, unlike LIGO, is not constrained by alignment control noise, even if the detector were operating at full power.

Submitted 4 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

arXiv:2401.07955 [pdf, other]

A Study on Large Language Models' Limitations in Multiple-Choice Question Answering

Authors: Aisha Khatun, Daniel G. Brown

Abstract: The widespread adoption of Large Language Models (LLMs) has become commonplace, particularly with the emergence of open-source models. More importantly, smaller models are well-suited for integration into consumer devices and are frequently employed either as standalone solutions or as subroutines in various AI tasks. Despite their ubiquitous use, there is no systematic analysis of their specific capabilities and limitations. In this study, we tackle one of the most widely used tasks: answering Multiple Choice Questions (MCQ). We analyze 26 small open-source models and find that 65% of the models do not understand the task, only 4 models properly select an answer from the given choices, and only 5 of these models are choice order independent. These results are rather alarming given the extensive use of MCQ tests with these models. We recommend exercising caution and testing task understanding before using MCQ to evaluate LLMs in any field whatsoever.

Submitted 15 January, 2024; originally announced January 2024.

arXiv:2401.06348 [pdf, other]

A Fully Bayesian Approach for Comprehensive Mapping of Magnitude and Phase Brain Activation in Complex-Valued fMRI Data

Authors: Zhengxin Wang, Daniel B. Rowe, Xinyi Li, D. Andrew Brown

Abstract: Functional magnetic resonance imaging (fMRI) plays a crucial role in neuroimaging, enabling the exploration of brain activity through complex-valued signals. These signals, composed of magnitude and phase, offer a rich source of information for understanding brain functions. Traditional fMRI analyses have largely focused on magnitude information, often overlooking the potential insights offered by phase data. In this paper, we propose a novel fully Bayesian model designed for analyzing single-subject complex-valued fMRI (cv-fMRI) data. Our model, which we refer to as the CV-M&P model, is distinctive in its comprehensive utilization of both magnitude and phase information in fMRI signals, allowing for independent prediction of different types of activation maps. We incorporate Gaussian Markov random fields (GMRFs) to capture spatial correlations within the data, and employ image partitioning and parallel computation to enhance computational efficiency. Our model is rigorously tested through simulation studies, and then applied to a real dataset from a unilateral finger-tapping experiment. The results demonstrate the model's effectiveness in accurately identifying brain regions activated in response to specific tasks, distinguishing between magnitude and phase activation.

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2312.13274 [pdf, other]

A Broad Comparative Evaluation of Software Debloating Tools

Authors: Michael D. Brown, Adam Meily, Brian Fairservice, Akshay Sood, Jonathan Dorn, Eric Kilmer, Ronald Eytchison

Abstract: Software debloating tools seek to improve program security and performance by removing unnecessary code, called bloat. While many techniques have been proposed, several barriers to their adoption have emerged. Namely, debloating tools are highly specialized, making it difficult for adopters to find the right type of tool for their needs. This is further hindered by a lack of established metrics and comparative evaluations between tools. To close this information gap, we surveyed 10 years of debloating literature and several tools currently under commercial development to taxonomize knowledge about the debloating ecosystem. We then conducted a broad comparative evaluation of 10 debloating tools to determine their relative strengths and weaknesses. Our evaluation, conducted on a diverse set of 20 benchmark programs, measures tools across 12 performance, security, and correctness metrics. Our evaluation surfaces several concerning findings that contradict the prevailing narrative in the debloating literature. First, debloating tools lack the maturity required to be used on real-world software, evidenced by a slim 22% overall success rate for creating passable debloated versions of medium- and high-complexity benchmarks. Second, debloating tools struggle to produce sound and robust programs. Using our novel differential fuzzing tool, DIFFER, we discovered that only 13% of our debloating attempts produced a sound and robust debloated program. Finally, our results indicate that debloating tools typically do not improve the performance or security posture of debloated programs by a significant degree according to our evaluation metrics. We believe that our contributions in this paper will help potential adopters better understand the landscape of tools and will motivate future research and development of more capable debloating tools.

Submitted 12 June, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 17 pages, 8 tables

arXiv:2312.12628 [pdf]

Structural maturation of myofilaments in engineered 3D cardiac microtissues characterized using small angle X-ray scattering

Authors: Geoffrey van Dover, Josh Javor, Jourdan Ewoldt, Ha Eun Lee, Mikhail Zhernenkov, Guillaume Freychet, Patryk Wasik, Dana Brown, David Bishop, Christopher Chen

Abstract: Understanding the structural and functional development of human-induced pluripotent stem-cell-derived cardiomyocytes is essential to engineering cardiac tissue that enables pharmaceutical testing, modeling diseases, and designing therapies. Here we use a method not commonly applied to biological materials, small angle X-ray scattering, to characterize the structural development of human-induced pluripotent stem-cell-derived cardiomyocytes within 3D engineered tissues during their preliminary stages of maturation. An X-ray scattering experimental method enables the reliable characterization of the cardiomyocyte myofilament spacing with maturation time. The myofilament lattice spacing monotonically decreases as the tissue matures from its initial post-seeding state over the span of ten days. Visualization of the spacing at a grid of positions in the tissue provides an approach to characterizing the maturation and organization of cardiomyocyte myofilaments and has the potential to help elucidate mechanisms of pathophysiology, and disease progression, thereby stimulating new biological hypotheses in stem cell engineering.

Submitted 19 December, 2023; originally announced December 2023.

arXiv:2312.07656 [pdf, other]

Optimization of an Optical Testbed for Characterization of EXCLAIM u-Spec Integrated Spectrometers

Authors: Maryam Rahmani, Emily M. Barrentine, Eric R. Switzer, Alyssa Barlis, Ari D. Brown, Giuseppe Cataldo, Jake A. Connors, Negar Ehsan, Thomas M. Essinger-Hileman, Henry Grant, James Hays-Wehle, Wen-Ting Hsieh, Vilem Mikula, S. Harvey Moseley, Omid Noroozian, Manuel A. Quijada, Jessica Patel, Thomas R. Stevenson, Carole Tucker, Kongpop U-Yen, Carolyn G. Volpert, Edward J. Wollack

Abstract: We describe a testbed to characterize the optical response of compact superconducting on-chip spectrometers in development for the Experiment for Cryogenic Large-Aperture Intensity Mapping (EXCLAIM) mission. EXCLAIM is a balloon-borne far-infrared experiment to probe the CO and CII emission lines in galaxies from redshift 3.5 to the present. The spectrometer, called u-Spec, comprises a diffraction grating on a silicon chip coupled to kinetic inductance detectors (KIDs) read out via a single microwave feedline. We use a prototype spectrometer for EXCLAIM to demonstrate our ability to characterize the spectrometer's spectral response using a photomixer source. We utilize an on-chip reference detector to normalize relative to spectral structure from the off-chip optics and a silicon etalon to calibrate the absolute frequency.

Submitted 12 December, 2023; originally announced December 2023.

arXiv:2312.04600 [pdf, other]

Haldane Bundles: A Dataset for Learning to Predict the Chern Number of Line Bundles on the Torus

Authors: Cody Tipton, Elizabeth Coda, Davis Brown, Alyson Bittner, Jung Lee, Grayson Jorgenson, Tegan Emerson, Henry Kvinge

Abstract: Characteristic classes, which are abstract topological invariants associated with vector bundles, have become an important notion in modern physics with surprising real-world consequences. As a representative example, the incredible properties of topological insulators, which are insulators in their bulk but conductors on their surface, can be completely characterized by a specific characteristic class associated with their electronic band structure, the first Chern class. Given their importance to next generation computing and the computational challenge of calculating them using first-principles approaches, there is a need to develop machine learning approaches to predict the characteristic classes associated with a material system. To aid in this program we introduce the {\emph{Haldane bundle dataset}}, which consists of synthetically generated complex line bundles on the $2$-torus. We envision this dataset, which is not as challenging as noisy and sparsely measured real-world datasets but (as we show) still difficult for off-the-shelf architectures, to be a testing ground for architectures that incorporate the rich topological and geometric priors underlying characteristic classes.

Submitted 6 December, 2023; originally announced December 2023.

arXiv:2312.01435 [pdf, other]

Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT

Authors: Saurav Sengupta, Donald E. Brown

Abstract: Deep learning for histopathology has been successfully used for disease classification, image segmentation and more. However, combining image and text modalities using current state-of-the-art (SOTA) methods has been a challenge due to the high resolution of histopathology images. Automatic report generation for histopathology images is one such challenge. In this work, we show that using an existing pre-trained Vision Transformer (ViT) to encode 4096x4096 sized patches of the Whole Slide Image (WSI) and a pre-trained Bidirectional Encoder Representations from Transformers (BERT) model for language modeling-based decoder for report generation, we can build a performant and portable report generation mechanism that takes into account the whole high resolution image. Our method allows us to not only generate and evaluate captions that describe the image, but also helps us classify the image into tissue types and the gender of the patient as well. Our best performing model achieves an 89.52% accuracy in Tissue Type classification with a BLEU-4 score of 0.12 in our caption generation task.

Submitted 15 March, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

Comments: Accepted at IEEE ISBI 2024. arXiv admin note: substantial text overlap with arXiv:2311.06176

arXiv:2311.15696 [pdf, other]

Peptide Binding Classification on Quantum Computers

Authors: Charles London, Douglas Brown, Wenduan Xu, Sezen Vatansever, Christopher James Langmead, Dimitri Kartsaklis, Stephen Clark, Konstantinos Meichanetzidis

Abstract: We conduct an extensive study on using near-term quantum computers for a task in the domain of computational biology. By constructing quantum models based on parameterised quantum circuits we perform sequence classification on a task relevant to the design of therapeutic proteins, and find competitive performance with classical baselines of similar scale. To study the effect of noise, we run some of the best-performing quantum models with favourable resource requirements on emulators of state-of-the-art noisy quantum processors. We then apply error mitigation methods to improve the signal. We further execute these quantum models on the Quantinuum H1-1 trapped-ion quantum processor and observe very close agreement with noiseless exact simulation. Finally, we perform feature attribution methods and find that the quantum models indeed identify sensible relationships, at least as well as the classical baselines. This work constitutes the first proof-of-concept application of near-term quantum computing to a task critical to the design of therapeutic proteins, opening the route toward larger-scale applications in this and related fields, in line with the hardware development roadmaps of near-term quantum technologies.

Submitted 27 November, 2023; originally announced November 2023.

arXiv:2311.15071 [pdf, other]

Model-independent extraction of form factors and $|V_{cb}|$ in $\overline{B} \rightarrow D \ell^- \overline{\nu}_\ell$ with hadronic tagging at BaBar

Authors: BaBar Collaboration, J. P. Lees, V. Poireau, V. Tisserand, E. Grauges, A. Palano, G. Eigen, D. N. Brown, Yu. G. Kolomensky, M. Fritsch, H. Koch, R. Cheaib, C. Hearty, T. S. Mattison, J. A. McKenna, R. Y. So, V. E. Blinov, A. R. Buzykaev, V. P. Druzhinin, E. A. Kozyrev, E. A. Kravchenko, S. I. Serednyakov, Yu. I. Skovpen, E. P. Solodov, K. Yu. Todyshev , et al. (186 additional authors not shown)

Abstract: Using the entire BaBar $Υ(4S)$ data set, the first two-dimensional unbinned angular analysis of the semileptonic decay $\overline{B} \rightarrow D \ell^- \overline{\nu}_\ell$ is performed, employing hadronic reconstruction of the tag-side $B$ meson from $Υ(4S)\to B\overline{B}$. Here, $\ell$ denotes the light charged leptons $e$ and $μ$. A novel data-driven signal-background separation procedure with minimal dependence on simulation is developed. This procedure preserves all multi-dimensional correlations present in the data. The expected $\sin^2θ_\ell$ dependence of the differential decay rate in the Standard Model is demonstrated, where $θ_\ell$ is the lepton helicity angle. Including input from the latest lattice QCD calculations and previously available experimental data, the underlying form factors are extracted using both model-independent (BGL) and dependent (CLN) methods. Comparisons with lattice calculations show flavor SU(3) symmetry to be a good approximation in the $B_{(s)}\to D_{(s)}$ sector. Using the BGL results, the CKM matrix element $|V_{cb}|=(41.09\pm 1.16)\times 10^{-3}$ and the Standard Model prediction of the lepton-flavor universality violation variable $\mathcal{R}(D)=0.300\pm 0.004$, are extracted. The value of $|V_{cb}|$ from $\overline{B} \rightarrow D \ell^- \overline{\nu}_\ell$ tends to be higher than that extracted using $\overline{B} \rightarrow D \ell^- \overline{\nu}_\ell$. The Standard Model $\mathcal{R}(D)$ calculation is at a $1.97σ$ tension with the latest HFLAV experimental average.

Submitted 25 November, 2023; originally announced November 2023.

arXiv:2311.06176 [pdf, other]

Automatic Report Generation for Histopathology images using pre-trained Vision Transformers

Authors: Saurav Sengupta, Donald E. Brown

Abstract: Deep learning for histopathology has been successfully used for disease classification, image segmentation and more. However, combining image and text modalities using current state-of-the-art methods has been a challenge due to the high resolution of histopathology images. Automatic report generation for histopathology images is one such challenge. In this work, we show that using an existing pre-trained Vision Transformer in a two-step process of first using it to encode 4096x4096 sized patches of the Whole Slide Image (WSI) and then using it as the encoder and an LSTM decoder for report generation, we can build a fairly performant and portable report generation mechanism that takes into account the whole of the high resolution image, instead of just the patches. We are also able to use representations from an existing powerful pre-trained hierarchical vision transformer and show its usefulness in not just zero shot classification but also for report generation.

Submitted 13 November, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 09 pages

arXiv:2311.04736 [pdf, other]

Transverse Mode Control in Quantum Enhanced Interferometers: A Review and Recommendations for a New Generation

Authors: Aaron W. Goodwin-Jones, Ricardo Cabrita, Mikhail Korobko, Martin van Beuzekom, Daniel D. Brown, Viviana Fafone, Joris van Heijningen, Alessio Rocchi, Mitchell G. Schiworski, Matteo Tacca

Abstract: Adaptive optics has made significant advancement over the past decade, becoming the essential technology in a wide variety of applications, particularly in the realm of quantum optics. One key area of impact is gravitational-wave detection, where quantum correlations are distributed over kilometer-long distances by beams with hundreds of kilowatts of optical power. Decades of development were required to develop robust and stable techniques to sense mismatches between the Gaussian beams and the resonators, all while maintaining the quantum correlations. Here we summarize the crucial advancements in transverse mode control required for gravitational-wave detection. As we look towards the advanced designs of future detectors, we highlight key challenges and offer recommendations for the design of these instruments. We conclude the review with a discussion of the broader application of adaptive optics in quantum technologies: communication, computation, imaging and sensing.

Submitted 8 November, 2023; originally announced November 2023.

Report number: LIGO-P2300282, VIR-0769A-23

arXiv:2311.00841 [pdf, other]

Nuclear Magnetic Resonance Investigation of Superconducting and Normal State Nb$_3$Sn

Authors: Gan Zhai, William P. Halperin, Arneil P. Reyes, Sam Posen, Zuhawn Sung, Chiara Tarantini, Michael D. Brown, David C. Larbalestier

Abstract: The superconductor Nb$_3$Sn has a high critical temperature and high critical field, widely used for high-field superconducting magnets. In this work we investigate its microscopic electronic structure with $^{93}$Nb nuclear magnetic resonance (NMR). The high-quality Nb$_3$Sn powder sample was studied in both 3.2T and 7T magnetic fields in the temperature range from 1.5K to 300K. From measurement of the spectrum and its theoretical analysis, we find evidence for anisotropy despite its cubic crystal structure. This anisotropy is manifest in alignment of powder grains under certain temperature and field cycling conditions. The Knight shift and spin-lattice relaxation rate, $T_1^{-1}$, were measured in the normal state. Additionally, $T_1^{-1}$ was measured in the superconducting state and compared with BCS theory revealing a weak field dependence, with an energy gap $Δ(0)=2.0\pm0.08k_B T_c$ at 3.2T and $Δ(0)=1.73\pm0.08k_B T_c$ at 7T, indicating suppression of the order parameter by magnetic field.

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.18536 [pdf, other]

Efficient Fully Bayesian Approach to Brain Activity Mapping with Complex-Valued fMRI Data

Authors: Zhengxin Wang, Daniel B. Rowe, Xinyi Li, D. Andrew Brown

Abstract: Functional magnetic resonance imaging (fMRI) enables indirect detection of brain activity changes via the blood-oxygen-level-dependent (BOLD) signal. Conventional analysis methods mainly rely on the real-valued magnitude of these signals. In contrast, research suggests that analyzing both real and imaginary components of the complex-valued fMRI (cv-fMRI) signal provides a more holistic approach that can increase power to detect neuronal activation. We propose a fully Bayesian model for brain activity mapping with cv-fMRI data. Our model accommodates temporal and spatial dynamics. Additionally, we propose a computationally efficient sampling algorithm, which enhances processing speed through image partitioning. Our approach is shown to be computationally efficient via image partitioning and parallel computation while being competitive with state-of-the-art methods. We support these claims with both simulated numerical studies and an application to real cv-fMRI data obtained from a finger-tapping experiment.

Submitted 27 October, 2023; originally announced October 2023.

arXiv:2310.16941 [pdf, other]

Exploring Behavior Discovery Methods for Heterogeneous Swarms of Limited-Capability Robots

Authors: Connor Mattson, Jeremy C. Clark, Daniel S. Brown

Abstract: We study the problem of determining the emergent behaviors that are possible given a functionally heterogeneous swarm of robots with limited capabilities. Prior work has considered behavior search for homogeneous swarms and proposed the use of novelty search over either a hand-specified or learned behavior space followed by clustering to return a taxonomy of emergent behaviors to the user. In this paper, we seek to better understand the role of novelty search and the efficacy of using clustering to discover novel emergent behaviors. Through a large set of experiments and ablations, we analyze the effect of representations, evolutionary search, and various clustering methods in the search for novel behaviors in a heterogeneous swarm. Our results indicate that prior methods fail to discover many interesting behaviors and that an iterative human-in-the-loop discovery process discovers more behaviors than random search, swarm chemistry, and automated behavior discovery. The combined discoveries of our experiments uncover 23 emergent behaviors, 18 of which are novel discoveries. To the best of our knowledge, these are the first known emergent behaviors for heterogeneous swarms of computation-free agents. Videos, code, and appendix are available at the project website: https://sites.google.com/view/heterogeneous-bd-methods

Submitted 25 October, 2023; originally announced October 2023.

Comments: 11 pages, 9 figures, To be published in Proceedings IEEE International Symposium on Multi-Robot & Multi-Agent Systems (MRS 2023)

arXiv:2310.14993 [pdf, other]

Understanding the Inner Workings of Language Models Through Representation Dissimilarity

Authors: Davis Brown, Charles Godfrey, Nicholas Konz, Jonathan Tu, Henry Kvinge

Abstract: As language models are applied to an increasing number of real-world applications, understanding their inner workings has become an important issue in model trust, interpretability, and transparency. In this work we show that representation dissimilarity measures, which are functions that measure the extent to which two models' internal representations differ, can be a valuable tool for gaining insight into the mechanics of language models. Among our insights are: (i) an apparent asymmetry in the internal representations of models using SoLU and GeLU activation functions, (ii) evidence that dissimilarity measures can identify and locate generalization properties of models that are invisible via in-distribution test set performance, and (iii) new evaluations of how language model features vary as width and depth are increased. Our results suggest that dissimilarity measures are a promising set of tools for shedding light on the inner workings of language models.

Submitted 23 October, 2023; originally announced October 2023.

Comments: EMNLP 2023 (main)

arXiv:2310.11492 [pdf, other]

On the natal kick of the black hole X-ray binary H 1705--250

Authors: Cordelia Dashwood Brown, Poshak Gandhi, Yue Zhao

Abstract: When a compact object is formed, an impulse (kick) will be imparted to the system by the mass lost during the core-collapse supernova (SN). A number of other mechanisms may impart an additional kick on the system, although evidence for these natal kicks in black hole systems remains limited. Updated Gaia astrometry has recently identified a number of high peculiar velocity (in excess of Galactic motion) compact objects. Here, we focus on the black hole low-mass X-ray binary H 1705--250, which has a peculiar velocity $\upsilon_{\mathrm{pec}}\,=\,221^{+101}_{-108}\,\mathrm{km}\,\mathrm{s}^{-1}$. Using population synthesis to reconstruct its evolutionary history (assuming formation via isolated binary evolution within the Galactic plane), we constrain the properties of the progenitor and pre-SN orbit. The magnitude of a kick solely due to mass loss is found to be $\sim\,30\,\mathrm{km}\,\mathrm{s}^{-1}$, which cannot account for the high present-day peculiar motion. We therefore deduce that the black hole received an additional natal kick at formation, and place limits on its magnitude, finding it to be $\sim\,295\,\mathrm{km}\,\mathrm{s}^{-1}$ (minimum $90\,\mathrm{km}\,\mathrm{s}^{-1}$). This furthers the argument that these kicks are not limited to neutron stars.

Submitted 17 October, 2023; originally announced October 2023.

Comments: MNRAS in press

arXiv:2310.10610 [pdf, other]

Quantifying Assistive Robustness Via the Natural-Adversarial Frontier

Authors: Jerry Zhi-Yang He, Zackory Erickson, Daniel S. Brown, Anca D. Dragan

Abstract: Our ultimate goal is to build robust policies for robots that assist people. What makes this hard is that people can behave unexpectedly at test time, potentially interacting with the robot outside its training distribution and leading to failures. Even just measuring robustness is a challenge. Adversarial perturbations are the default, but they can paint the wrong picture: they can correspond to human motions that are unlikely to occur during natural interactions with people. A robot policy might fail under small adversarial perturbations but work under large natural perturbations. We propose that capturing robustness in these interactive settings requires constructing and analyzing the entire natural-adversarial frontier: the Pareto-frontier of human policies that are the best trade-offs between naturalness and low robot performance. We introduce RIGID, a method for constructing this frontier by training adversarial human policies that trade off between minimizing robot reward and acting human-like (as measured by a discriminator). On an Assistive Gym task, we use RIGID to analyze the performance of standard collaborative Reinforcement Learning, as well as the performance of existing methods meant to increase robustness. We also compare the frontier RIGID identifies with the failures identified in expert adversarial interaction, and with naturally-occurring failures during user interaction.
Overall, we find evidence that RIGID can provide a meaningful measure of robustness predictive of deployment performance, and uncover failure cases in human-robot interaction that are difficult to find manually. https://ood-human.github.io.

Submitted 16 October, 2023; originally announced October 2023.

arXiv:2310.09207 [pdf, other]

Four-Dimensional Computational Ultrasound Imaging of Brain Haemodynamics

Authors: Michael D. Brown, Bastian S. Generowicz, Stephanie Dijkhuizen, Sebastiaan K. E. Koekkoek, Christos Strydis, Johannes G. Bosch, Petros Arvanitis, Geert Springeling, Geert J. T. Leus, Chris I. De Zeeuw, Pieter Kruizinga

Abstract: Four-dimensional ultrasound imaging of complex biological systems such as the brain is technically challenging because of the spatiotemporal sampling requirements. We present computational ultrasound imaging (cUSi), a new imaging method that uses complex ultrasound fields that can be generated with simple hardware and a physical wave prediction model to alleviate the sampling constraints. cUSi allows for high-resolution four-dimensional imaging of brain haemodynamics in awake and anesthetized mice.

Submitted 13 October, 2023; originally announced October 2023.

arXiv:2310.07667 [pdf, other]

Global Minima, Recoverability Thresholds, and Higher-Order Structure in GNNS

Authors: Drake Brown, Trevor Garrity, Kaden Parker, Jason Oliphant, Stone Carson, Cole Hanson, Zachary Boyd

Abstract: We analyze the performance of graph neural network (GNN) architectures from the perspective of random graph theory. Our approach promises to complement existing lenses on GNN analysis, such as combinatorial expressive power and worst-case adversarial analysis, by connecting the performance of GNNs to typical-case properties of the training data. First, we theoretically characterize the nodewise accuracy of one- and two-layer GCNs relative to the contextual stochastic block model (cSBM) and related models. We additionally prove that GCNs cannot beat linear models under certain circumstances. Second, we numerically map the recoverability thresholds, in terms of accuracy, of four diverse GNN architectures (GCN, GAT, SAGE, and Graph Transformer) under a variety of assumptions about the data. Sample results of this second analysis include: heavy-tailed degree distributions enhance GNN performance, GNNs can work well on strongly heterophilous graphs, and SAGE and Graph Transformer can perform well on arbitrarily noisy edge data, but no architecture handled sufficiently noisy feature data well. Finally, we show how both specific higher-order structures in synthetic data and the mix of empirical structures in real data have dramatic effects (usually negative) on GNN performance.

Submitted 11 October, 2023; originally announced October 2023.

Comments: 28 pages

arXiv:2310.03149 [pdf, other]

Attributing Learned Concepts in Neural Networks to Training Data

Authors: Nicholas Konz, Charles Godfrey, Madelyn Shapiro, Jonathan Tu, Henry Kvinge, Davis Brown

Abstract: By now there is substantial evidence that deep learning models learn certain human-interpretable features as part of their internal representations of data. As having the right (or wrong) concepts is critical to trustworthy machine learning systems, it is natural to ask which inputs from the model's original training set were most important for learning a concept at a given layer. To answer this, we combine data attribution methods with methods for probing the concepts learned by a model. Training network and probe ensembles for two concept datasets on a range of network layers, we use the recently developed TRAK method for large-scale data attribution. We find some evidence for convergence, where removing the 10,000 top attributing images for a concept and retraining the model does not change the location of the concept in the network nor the probing sparsity of the concept. This suggests that rather than being highly dependent on a few specific examples, the features that inform the development of a concept are spread in a more diffuse manner across its exemplars, implying robustness in concept formation.

Submitted 28 December, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: ATTRIB Workshop at NeurIPS 2023

arXiv:2309.16536 [pdf, other]

Uncertainty Quantification for Eosinophil Segmentation

Authors: Kevin Lin, Donald Brown, Sana Syed, Adam Greene

Abstract: Eosinophilic Esophagitis (EoE) is an allergic condition increasing in prevalence. To diagnose EoE, pathologists must find 15 or more eosinophils within a single high-power field (400X magnification). Determining whether or not a patient has EoE can be an arduous process and any medical imaging approaches used to assist diagnosis must consider both efficiency and precision. We propose an improvement of Adorno et al.'s approach for quantifying eosinophils using deep image segmentation. Our new approach leverages Monte Carlo Dropout, a common approach in deep learning to reduce overfitting, to provide uncertainty quantification on current deep learning models. The uncertainty can be visualized in an output image to evaluate model performance, provide insight into how deep learning algorithms function, and assist pathologists in identifying eosinophils.

Submitted 7 November, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

Comments: Preprint, Final Article Submitted to ICBRA 2023 and will be published in the International Conference Proceedings by ACM, Association for Computing Machinery (ISBN: 979-8-4007-0815-2), which will be archived in ACM Digital Library, indexed by Ei Compendex and Scopus

arXiv:2309.11408 [pdf, other]

Indirect Swarm Control: Characterization and Analysis of Emergent Swarm Behaviors

Authors: Ricardo Vega, Connor Mattson, Daniel S. Brown, Cameron Nowzari

Abstract: Emergence and emergent behaviors are often defined as cases where changes in local interactions between agents at a lower level effectively changes what occurs in the higher level of the system (i.e., the whole swarm) and its properties. However, the manner in which these collective emergent behaviors self-organize is less understood. The focus of this paper is in presenting a new framework for characterizing the conditions that lead to different macrostates and how to predict/analyze their macroscopic properties, allowing us to indirectly engineer the same behaviors from the bottom up by tuning their environmental conditions rather than local interaction rules. We then apply this framework to a simple system of binary sensing and acting agents as an example to see if a re-framing of this swarms problem can help us push the state of the art forward. By first creating some working definitions of macrostates in a particular swarm system, we show how agent-based modeling may be combined with control theory to enable a generalized understanding of controllable emergent processes without needing to simulate everything. Whereas phase diagrams can generally only be created through Monte Carlo simulations or sweeping through ranges of parameters in a simulator, we develop closed-form functions that can immediately produce them revealing an infinite set of swarm parameter combinations that can lead to a specifically chosen self-organized behavior.
While the exact methods are still under development, we believe simply laying out a potential path towards solutions that have evaded our traditional methods using a novel method is worth considering. Our results are characterized through both simulations and real experiments on ground robots.

Submitted 28 March, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: 8 pages, 13 figures, submitted to IROS 2024 conference

arXiv:2309.05933 [pdf, other]

Workshop on a future muon program at FNAL

Authors: S. Corrodi, Y. Oksuzian, A. Edmonds, J. Miller, H. N. Tran, R. Bonventre, D. N. Brown, F. Meot, V. Singh, Y. Kolomensky, S. Tripathy, L. Borrel, M. Bub, B. Echenard, D. G. Hitlin, H. Jafree, S. Middleton, R. Plestid, F. C. Porter, R. Y. Zhu, L. Bottura, E. Pinsard, A. M. Teixeira, C. Carelli, D. Ambrose , et al. (68 additional authors not shown)

Abstract: The Snowmass report on rare processes and precision measurements recommended Mu2e-II and a next generation muon facility at Fermilab (Advanced Muon Facility) as priorities for the frontier. The Workshop on a future muon program at FNAL was held in March 2023 to discuss design studies for Mu2e-II, organizing efforts for the next generation muon facility, and identify synergies with other efforts (e.g., muon collider). Topics included high-power targetry, status of R&D for Mu2e-II, development of compressor rings, FFA and concepts for muon experiments (conversion, decays, muonium and other opportunities) at AMF. This document summarizes the workshop discussions with a focus on future R&D tasks needed to realize these concepts.

Submitted 11 September, 2023; originally announced September 2023.

Comments: 68 pages, 36 figures

Report number: FERMILAB-CONF-23-464-PPD, CALT-TH-2023-036

arXiv:2309.03744 [pdf, other]

Label-efficient Contrastive Learning-based model for nuclei detection and classification in 3D Cardiovascular Immunofluorescent Images

Authors: Nazanin Moradinasab, Rebecca A. Deaton, Laura S. Shankman, Gary K. Owens, Donald E. Brown

Abstract: Recently, deep learning-based methods achieved promising performance in nuclei detection and classification applications. However, training deep learning-based methods requires a large amount of pixel-wise annotated data, which is time-consuming and labor-intensive, especially in 3D images. An alternative approach is to adapt weak-annotation methods, such as labeling each nucleus with a point, but this method does not extend from 2D histopathology images (for which it was originally developed) to 3D immunofluorescent images. The reason is that 3D images contain multiple channels (z-axis) for nuclei and different markers separately, which makes training using point annotations difficult. To address this challenge, we propose the Label-efficient Contrastive learning-based (LECL) model to detect and classify various types of nuclei in 3D immunofluorescent images. Previous methods use Maximum Intensity Projection (MIP) to convert immunofluorescent images with multiple slices to 2D images, which can cause signals from different z-stacks to falsely appear associated with each other. To overcome this, we devised an Extended Maximum Intensity Projection (EMIP) approach that addresses issues using MIP. Furthermore, we performed a Supervised Contrastive Learning (SCL) approach for weakly supervised settings. We conducted experiments on cardiovascular datasets and found that our proposed framework is effective and efficient in detecting and classifying various types of nuclei in 3D immunofluorescent images.

Submitted 14 January, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: 11 pages, 5 figures, MICCAI Workshop Conference 2023

arXiv:2308.15675 [pdf, other]

Single and coupled cavity mode sensing schemes using a diagnostic field

Authors: Aaron W. Goodwin-Jones, Haochen Zhu, Carl Blair, Daniel D. Brown, Joris van Heijningen, Li Ju, Chunnong Zhao

Abstract: Precise optical mode matching is of critical importance in experiments using squeezed-vacuum states. Automatic spatial-mode matching schemes have the potential to reduce losses and improve loss stability. However, in quantum-enhanced coupled-cavity experiments, such as gravitational-wave detectors, one must also ensure that the sub-cavities are also mode matched. We propose a new mode sensing scheme, which works for simple and coupled cavities. The scheme requires no moving parts, nor tuning of Gouy phases. Instead a diagnostic field tuned to the HG20/LG10 mode frequency is used. The error signals are derived to be proportional to the difference in waist position, and difference in Rayleigh ranges, between the sub-cavity eigenmodes. The two error signals are separable by 90 degrees of demodulation phase. We demonstrate reasonable error signals for a simplified Einstein Telescope optical design. This work will facilitate routine use of extremely high levels of squeezing in current and future gravitational-wave detectors.

Submitted 29 August, 2023; originally announced August 2023.

Report number: LIGO-P2300010

arXiv:2308.13666 [pdf, other]

A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run

Authors: C. Fletcher, J. Wood, R. Hamburg, P. Veres, C. M. Hui, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. M. Giles, A. Goldstein, B. A. Hristov, D. Kocevski, S. Lesage, B. Mailyan, C. Malacaria, S. Poolakkil, A. von Kienlin, C. A. Wilson-Hodge, The Fermi Gamma-ray Burst Monitor Team, M. Crnogorčević, J. DeLaunay, A. Tohuvavohu, R. Caputo, S. B. Cenko , et al. (1674 additional authors not shown)

Abstract: We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses, the Targeted Search and the Untargeted Search, we investigate whether there are any coincident GRBs associated with the GWs. We also search the Swift-BAT rate data around the GW times to determine whether a GRB counterpart is present. No counterparts are found. Using both the Fermi-GBM Targeted Search and the Swift-BAT search, we calculate flux upper limits and present joint upper limits on the gamma-ray luminosity of each GW. Given these limits, we constrain theoretical models for the emission of gamma-rays from binary black hole mergers.

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.13035 [pdf]

The intersection of video capsule endoscopy and artificial intelligence: addressing unique challenges using machine learning

Authors: Shan Guleria, Benjamin Schwartz, Yash Sharma, Philip Fernandes, James Jablonski, Sodiq Adewole, Sanjana Srivastava, Fisher Rhoads, Michael Porter, Michelle Yeghyayan, Dylan Hyatt, Andrew Copland, Lubaina Ehsan, Donald Brown, Sana Syed

Abstract: Introduction: Technical burdens and time-intensive review processes limit the practical utility of video capsule endoscopy (VCE). Artificial intelligence (AI) is poised to address these limitations, but the intersection of AI and VCE reveals challenges that must first be overcome. We identified five challenges to address. Challenge #1: VCE data are stochastic and contain significant artifact. Challenge #2: VCE interpretation is cost-intensive. Challenge #3: VCE data are inherently imbalanced. Challenge #4: Existing VCE AIMLT are computationally cumbersome. Challenge #5: Clinicians are hesitant to accept AIMLT that cannot explain their process. Methods: An anatomic landmark detection model was used to test the application of convolutional neural networks (CNNs) to the task of classifying VCE data. We also created a tool that assists in expert annotation of VCE data. We then created more elaborate models using different approaches including a multi-frame approach, a CNN based on graph representation, and a few-shot approach based on meta-learning. Results: When used on full-length VCE footage, CNNs accurately identified anatomic landmarks (99.1%), with gradient weighted-class activation mapping showing the parts of each frame that the CNN used to make its decision. The graph CNN with weakly supervised learning (accuracy 89.9%, sensitivity of 91.1%), the few-shot model (accuracy 90.8%, precision 91.4%, sensitivity 90.9%), and the multi-frame model (accuracy 97.5%, precision 91.5%, sensitivity 94.8%) performed well.
Discussion: Each of these five challenges is addressed, in part, by one of our AI-based models. Our goal of producing high performance using lightweight models that aim to improve clinician confidence was achieved.

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.07963 [pdf, other]

doi 10.1103/PhysRevD.108.084020

Extended body dynamics in general relativity: hyperelastic models

Authors: Nishita Jadoo, J. David Brown, Charles R. Evans

Abstract: We present a numerical framework for modeling extended hyperelastic bodies based on a Lagrangian formulation of general relativistic elasticity theory. We use finite element methods to discretize the body, then use the semi--discrete action to derive ordinary differential equations of motion for the discrete nodes. The nodes are evolved in time using fourth--order Runge--Kutta. We validate our code against the normal modes of oscillation of a hyperelastic sphere, which are known analytically in the limit of small (linear), slow (Newtonian) oscillations. The algorithm displays second order convergence. This numerical framework can be used to obtain the orbital motion and internal dynamics of a hyperelastic body of any shape, for any spacetime metric, and for varying hyperelastic energy models.

Submitted 17 October, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

Comments: Corrected typos

Journal ref: Phys. Rev. D 108, 084020 (2023)

Showing 1–50 of 994 results for author: Brown, D