Skip to main content

Showing 1–50 of 456 results for author: Roberts, M

.
  1. arXiv:2406.19314  [pdf, other

    cs.CL cs.AI cs.LG

    LiveBench: A Challenging, Contamination-Free LLM Benchmark

    Authors: Colin White, Samuel Dooley, Manley Roberts, Arka Pal, Ben Feuer, Siddhartha Jain, Ravid Shwartz-Ziv, Neel Jain, Khalid Saifullah, Siddartha Naidu, Chinmay Hegde, Yann LeCun, Tom Goldstein, Willie Neiswanger, Micah Goldblum

    Abstract: Test set contamination, wherein test data from a benchmark ends up in a newer model's training set, is a well-documented obstacle for fair LLM evaluation and can quickly render benchmarks obsolete. To mitigate this, many recent benchmarks crowdsource new prompts and evaluations from human or LLM judges; however, these can introduce significant biases, and break down when scoring hard questions. In… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.10086  [pdf

    cs.CL cs.LG stat.ME

    Discovering influential text using convolutional neural networks

    Authors: Megan Ayers, Luke Sanford, Margaret Roberts, Eddie Yang

    Abstract: Experimental methods for estimating the impacts of text on human evaluation have been widely used in the social sciences. However, researchers in experimental settings are usually limited to testing a small number of pre-specified text treatments. While efforts to mine unstructured texts for features that causally affect outcomes have been ongoing in recent years, these models have primarily focus… ▽ More

    Submitted 21 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: To be published in ACL 2024 Findings

  3. arXiv:2406.08391  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Large Language Models Must Be Taught to Know What They Don't Know

    Authors: Sanyam Kapoor, Nate Gruver, Manley Roberts, Katherine Collins, Arka Pal, Umang Bhatt, Adrian Weller, Samuel Dooley, Micah Goldblum, Andrew Gordon Wilson

    Abstract: When using large language models (LLMs) in high-stakes applications, we need to know when we can trust their predictions. Some works argue that prompting high-performance LLMs is sufficient to produce calibrated uncertainties, while others introduce sampling methods that can be prohibitively expensive. In this work, we first argue that prompting on its own is insufficient to achieve good calibrati… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Code available at: https://github.com/activatedgeek/calibration-tuning

  4. arXiv:2405.19224  [pdf, other

    eess.IV cs.CV

    A study on the adequacy of common IQA measures for medical images

    Authors: Anna Breger, Clemens Karner, Ian Selby, Janek Gröhl, Sören Dittmer, Edward Lilley, Judith Babar, Jake Beckford, Timothy J Sadler, Shahab Shahipasand, Arthikkaa Thavakumar, Michael Roberts, Carola-Bibiane Schönlieb

    Abstract: Image quality assessment (IQA) is standard practice in the development stage of novel machine learning algorithms that operate on images. The most commonly used IQA measures have been developed and tested for natural images, but not in the medical setting. Reported inconsistencies arising in medical images are not surprising, as they have different properties than natural images. In this study, we… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.19097  [pdf, other

    eess.IV cs.CV

    A study of why we need to reassess full reference image quality assessment with medical images

    Authors: Anna Breger, Ander Biguri, Malena Sabaté Landman, Ian Selby, Nicole Amberg, Elisabeth Brunner, Janek Gröhl, Sepideh Hatamikia, Clemens Karner, Lipeng Ning, Sören Dittmer, Michael Roberts, AIX-COVNET Collaboration, Carola-Bibiane Schönlieb

    Abstract: Image quality assessment (IQA) is not just indispensable in clinical practice to ensure high standards, but also in the development stage of novel algorithms that operate on medical images with reference data. This paper provides a structured and comprehensive collection of examples where the two most common full reference (FR) image quality measures prove to be unsuitable for the assessment of no… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.19000  [pdf, other

    cs.LG

    FedMAP: Unlocking Potential in Personalized Federated Learning through Bi-Level MAP Optimization

    Authors: Fan Zhang, Carlos Esteve-Yagüe, Sören Dittmer, Carola-Bibiane Schönlieb, Michael Roberts

    Abstract: Federated Learning (FL) enables collaborative training of machine learning models on decentralized data while preserving data privacy. However, data across clients often differs significantly due to class imbalance, feature distribution skew, sample size imbalance, and other phenomena. Leveraging information from these not identically distributed (non-IID) datasets poses substantial challenges. FL… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  7. arXiv:2405.09597  [pdf

    cs.LG cs.AI

    When AI Eats Itself: On the Caveats of Data Pollution in the Era of Generative AI

    Authors: Xiaodan Xing, Fadong Shi, Jiahao Huang, Yinzhe Wu, Yang Nan, Sheng Zhang, Yingying Fang, Mike Roberts, Carola-Bibiane Schönlieb, Javier Del Ser, Guang Yang

    Abstract: Generative artificial intelligence (AI) technologies and large models are producing realistic outputs across various domains, such as images, text, speech, and music. Creating these advanced generative models requires significant resources, particularly large and high-quality datasets. To minimize training expenses, many algorithm developers use data created by the models themselves as a cost-effe… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  8. arXiv:2405.06014  [pdf, other

    hep-th

    Superconformal Monodromy Defects in $\mathcal{N}$=4 SYM and LS theory

    Authors: Igal Arav, Jerome P. Gauntlett, Yusheng Jiao, Matthew M. Roberts, Christopher Rosen

    Abstract: We study type IIB supergravity solutions that are dual to two-dimensional superconformal defects in $d=4$ SCFTs which preserve $\mathcal{N}=(0,2)$ supersymmetry. We consider solutions dual to defects in $\mathcal{N}=4$ SYM theory that have non-trivial monodromy for $U(1)^3\subset SO(6)$ global symmetry and we also allow for the possibility of conical singularities. In addition, we consider the add… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 86 pages, 7 figures

    Report number: APCTP Pre2024-005

  9. arXiv:2405.02755  [pdf, other

    astro-ph.HE

    On the Impact of Dark Matter Scattering on the Trajectory of High-Energy Cosmic Rays

    Authors: Stefano Profumo, M. Grant Roberts, Shashank Dharanibalan

    Abstract: We study the impact on the trajectory of high-energy cosmic-ray protons of scattering off the cosmic dark matter. We compute the scattering angle as a function of the cosmic-ray energy, of the dark matter mass, and of the interaction strength for a few representative choices for the relevant interaction cross section. We find that the typical deflection angle over the cosmic ray path is largely in… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 26 pages, 9 Figures

  10. arXiv:2404.06325  [pdf, other

    cs.AI

    Automatically Learning HTN Methods from Landmarks

    Authors: Ruoxi Li, Dana Nau, Mark Roberts, Morgan Fine-Morris

    Abstract: Hierarchical Task Network (HTN) planning usually requires a domain engineer to provide manual input about how to decompose a planning problem. Even HTN-MAKER, a well-known method-learning algorithm, requires a domain engineer to annotate the tasks with information about what to learn. We introduce CURRICULAMA, an HTN method learning algorithm that completely automates the learning process. It uses… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to FLAIRS-24

  11. arXiv:2403.15755  [pdf, other

    stat.ME cs.MA cs.SI stat.AP

    Optimized Model Selection for Estimating Treatment Effects from Costly Simulations of the US Opioid Epidemic

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Agent-based simulation with a synthetic population can help us compare different treatment conditions while kee** everything else constant within the same population (i.e., as digital twins). Such population-scale simulations require large computational power (i.e., CPU resources) to get accurate estimates for treatment effects. We can use meta models of the simulation results to circumvent the… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: To be presented in 2024 Annual Simulation Conference (ANNSIM'24)

  12. Goal-Oriented End-User Programming of Robots

    Authors: David Porfirio, Mark Roberts, Laura M. Hiatt

    Abstract: End-user programming (EUP) tools must balance user control with the robot's ability to plan and act autonomously. Many existing task-oriented EUP tools enforce a specific level of control, e.g., by requiring that users hand-craft detailed sequences of actions, rather than offering users the flexibility to choose the level of task detail they wish to express. We thereby created a novel EUP system,… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Published in the proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction

  13. arXiv:2402.17836  [pdf, other

    cs.RO

    Considerations for End-User Development in the Caregiving Domain

    Authors: Laura Stegner, David Porfirio, Mark Roberts, Laura M. Hiatt

    Abstract: As service robots become more capable of autonomous behaviors, it becomes increasingly important to consider how people communicate with a robot what task it should perform and how to do the task. Accordingly, there has been a rise in attention to end-user development (EUD) interfaces, which enable non-roboticist end users to specify tasks for autonomous robots to perform. However, state-of-the-ar… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Presented at AAAI Fall Symposium Series 2023 UR-RAD

  14. arXiv:2402.14189  [pdf, other

    econ.GN eess.SY

    Optimal transmission expansion minimally reduces decarbonization costs of U.S. electricity

    Authors: Rangrang Zheng, Greg Schivley, Patricia Hidalgo-Gonzalez, Matthias Fripp, Michael J. Roberts

    Abstract: Solar and wind power are cost-competitive with fossil fuels, yet their intermittent nature presents challenges. Significant temporal and geographic differences in land, wind, and solar resources suggest that long-distance transmission could be particularly beneficial. Using a detailed, open-source model, we analyze optimal transmission expansion jointly with storage, generation, and hourly operati… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 23 pages, 7 figures in main paper. Additional 11 pages including 7 additional figures and one table in the appendix

  15. arXiv:2402.13228  [pdf, other

    cs.CL cs.AI cs.LG

    Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

    Authors: Arka Pal, Deep Karkhanis, Samuel Dooley, Manley Roberts, Siddartha Naidu, Colin White

    Abstract: Direct Preference Optimisation (DPO) is effective at significantly improving the performance of large language models (LLMs) on downstream tasks such as reasoning, summarisation, and alignment. Using pairs of preferred and dispreferred data, DPO models the relative probability of picking one response over another. In this work, first we show theoretically that the standard DPO loss can lead to a r… ▽ More

    Submitted 3 July, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  16. arXiv:2402.10224  [pdf, other

    cs.RO cs.AI cs.MA

    Human-Centric Goal Reasoning with Ripple-Down Rules

    Authors: Kenji Brameld, Germán Castro, Claude Sammut, Mark Roberts, David W. Aha

    Abstract: ActorSim is a goal reasoning framework developed at the Naval Research Laboratory. Originally, all goal reasoning rules were hand-crafted. This work extends ActorSim with the capability of learning by demonstration, that is, when a human trainer disagrees with a decision made by the system, the trainer can take over and show the system the correct decision. The learning component uses Ripple-Down… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: Proceedings of the Ninth Goal Reasoning Workshop (Advances in Cognitive Systems, 2021)

  17. A 350-MHz Green Bank Telescope Survey of Unassociated Fermi LAT Sources: Discovery and Timing of Ten Millisecond Pulsars

    Authors: P. Bangale, B. Bhattacharyya, F. Camilo, C. J. Clark, I. Cognard, M. E. DeCesar, E. C. Ferrara, P. Gentile, L. Guillemot, J. W. T. Hessels, T. J. Johnson, M. Kerr, M. A. McLaughlin, L. Nieder, S. M. Ransom, P. S. Ray, M. S. E. Roberts, J. Roy, S. Sanpa-Arsa, G. Theureau, M. T. Wolff

    Abstract: We have searched for radio pulsations towards 49 Fermi Large Area Telescope (LAT) 1FGL Catalog $γ$-ray sources using the Green Bank Telescope at 350 MHz. We detected 18 millisecond pulsars (MSPs) in blind searches of the data; 10 of these were discoveries unique to our survey. Sixteen are binaries, with eight having short orbital periods $P_B < 1$ day. No radio pulsations from young pulsars were d… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in ApJ (25 pages, 15 figues, 4 tables)

    Journal ref: ApJ, Vol 966, 20 pp. (2024)

  18. arXiv:2402.00601  [pdf, other

    math.PR

    Asymptotics for the growth of the infinite-parent Spatial Lambda-Fleming-Viot model

    Authors: Apolline Louvet, Matthew I. Roberts

    Abstract: The infinite-parent spatial Lambda-Fleming-Viot (SLFV) process is a model of random growth, in which a set evolves by the addition of balls according to points of an underlying Poisson point process, and which was recently introduced to study genetic diversity in spatially expanding populations. In this article, we give asymptotics for the location and depth of the moving interface, and identify t… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 57 pages, 2 figures

    MSC Class: 92D25 (Primary) 60G55; 60J25; 60D05; 60K35; 82C43 (Secondary)

  19. arXiv:2312.16188  [pdf, other

    cs.LG stat.ME

    The curious case of the test set AUROC

    Authors: Michael Roberts, Alon Hazan, Sören Dittmer, James H. F. Rudd, Carola-Bibiane Schönlieb

    Abstract: Whilst the size and complexity of ML models have rapidly and significantly increased over the past decade, the methods for assessing their performance have not kept pace. In particular, among the many potential performance metrics, the ML community stubbornly continues to use (a) the area under the receiver operating characteristic curve (AUROC) for a validation and test cohort (distinct from trai… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 3 pages, 4 figures

  20. arXiv:2312.13723  [pdf, other

    hep-ph astro-ph.CO physics.atom-ph

    Ultralight Dark Matter Search with Space-Time Separated Atomic Clocks and Cavities

    Authors: Melina Filzinger, Ashlee R. Caddell, Dhruv Jani, Martin Steinel, Leonardo Giani, Nils Huntemann, Benjamin M. Roberts

    Abstract: We devise and demonstrate a method to search for non-gravitational couplings of ultralight dark matter to standard model particles using space-time separated atomic clocks and cavity-stabilized lasers. By making use of space-time separated sensors, which probe different values of an oscillating dark matter field, we can search for couplings that cancel in typical local experiments. We demonstrate… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 7 pages, 2 figures

  21. arXiv:2312.12482  [pdf, other

    q-bio.QM cs.LG

    New Horizons: Pioneering Pharmaceutical R&D with Generative AI from lab to the clinic -- an industry perspective

    Authors: Guy Doron, Sam Genway, Mark Roberts, Sai Jasti

    Abstract: The rapid advance of generative AI is resha** the strategic vision for R&D across industries. The unique challenges of pharmaceutical R&D will see applications of generative AI deliver value along the entire value chain from early discovery to regulatory approval. This perspective reviews these challenges and takes a three-horizon approach to explore the generative AI applications already delive… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 21 pages, 4 figures

    MSC Class: 92C50 ACM Class: I.2.0; J.3

  22. arXiv:2311.03913  [pdf, other

    math.DG

    Classifying bi-invariant 2-forms on infinite-dimensional Lie groups

    Authors: David Michael Roberts

    Abstract: A bi-invariant differential 2-form on a Lie group G is a highly constrained object, being determined by purely linear data: an Ad-invariant alternating bilinear form on the Lie algebra of G. On a compact connected Lie group these have an known classification, in terms of de Rham cohomology, which is here generalised to arbitrary finite-dimensional Lie groups, at the cost of losing the connection t… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 16 pages

  23. arXiv:2310.10628  [pdf, other

    cs.CL

    Data Contamination Through the Lens of Time

    Authors: Manley Roberts, Himanshu Thakur, Christine Herlihy, Colin White, Samuel Dooley

    Abstract: Recent claims about the impressive abilities of large language models (LLMs) are often supported by evaluating publicly available benchmarks. Since LLMs train on wide swaths of the internet, this practice raises concerns of data contamination, i.e., evaluating on examples that are explicitly or implicitly included in the training data. Data contamination remains notoriously challenging to measure… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  24. arXiv:2310.02874  [pdf, other

    cs.LG cs.AI

    Recent Methodological Advances in Federated Learning for Healthcare

    Authors: Fan Zhang, Daniel Kreuter, Yichen Chen, Sören Dittmer, Samuel Tull, Tolou Shadbahr, BloodCounts! Collaboration, Jacobus Preller, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb, Nicholas Gleadall, Michael Roberts

    Abstract: For healthcare datasets, it is often not possible to combine data samples from multiple sites due to ethical, privacy or logistical concerns. Federated learning allows for the utilisation of powerful machine learning algorithms without requiring the pooling of data. Healthcare data has many simultaneous challenges which require new methodologies to address, such as highly-siloed data, class imbala… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Supplementary table of extracted data at the end of the document

  25. arXiv:2309.11050  [pdf

    astro-ph.IM astro-ph.EP astro-ph.SR

    The development of HISPEC for Keck and MODHIS for TMT: science cases and predicted sensitivities

    Authors: Quinn M. Konopacky, Ashley D. Baker, Dimitri Mawet, Michael P. Fitzgerald, Nemanja Jovanovic, Charles Beichman, Garreth Ruane, Rob Bertz, Hiroshi Terada, Richard Dekany, Larry Lingvay, Marc Kassis, David Anderson, Motohide Tamura, Bjorn Benneke, Thomas Beatty, Tuan Do, Shogo Nishiyama, Peter Plavchan, Jason Wang, Ji Wang, Adam Burgasser, Jean-Baptiste Ruffio, Huihao Zhang, Aaron Brown , et al. (50 additional authors not shown)

    Abstract: HISPEC is a new, high-resolution near-infrared spectrograph being designed for the W.M. Keck II telescope. By offering single-shot, R=100,000 between 0.98 - 2.5 um, HISPEC will enable spectroscopy of transiting and non-transiting exoplanets in close orbits, direct high-contrast detection and spectroscopy of spatially separated substellar companions, and exoplanet dynamical mass and orbit measureme… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 25 pages, 9 figures. To appear in the Proceedings of SPIE: Techniques and Instrumentation for Detection of Exoplanets XI, vol. 12680 (2023)

  26. arXiv:2308.13040  [pdf, other

    cs.MA cs.SI stat.AP

    Estimating Treatment Effects Using Costly Simulation Samples from a Population-Scale Model of Opioid Use Disorder

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Large-scale models require substantial computational resources for analysis and studying treatment conditions. Specifically, estimating treatment effects using simulations may require a lot of infeasible resources to allocate at every treatment condition. Therefore, it is essential to develop efficient methods to allocate computational resources for estimating treatment effects. Agent-based simula… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: To be presented in IEEE International Conference on Biomedical and Health Informatics 2023, repository link: https://github.com/abdulrahmanfci/intervention-estimation

  27. arXiv:2308.10882  [pdf, other

    cs.AI cs.CL

    Giraffe: Adventures in Expanding Context Lengths in LLMs

    Authors: Arka Pal, Deep Karkhanis, Manley Roberts, Samuel Dooley, Arvind Sundararajan, Siddartha Naidu

    Abstract: Modern large language models (LLMs) that rely on attention mechanisms are typically trained with fixed context lengths which enforce upper limits on the length of input sequences that they can handle at evaluation time. To use these models on sequences longer than the train-time context length, one might employ techniques from the growing family of context length extrapolation methods -- most of w… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  28. arXiv:2308.07832  [pdf, ps, other

    cs.LG cs.AI stat.ME

    REFORMS: Reporting Standards for Machine Learning Based Science

    Authors: Sayash Kapoor, Emily Cantrell, Kenny Peng, Thanh Hien Pham, Christopher A. Bail, Odd Erik Gundersen, Jake M. Hofman, Jessica Hullman, Michael A. Lones, Momin M. Malik, Priyanka Nanayakkara, Russell A. Poldrack, Inioluwa Deborah Raji, Michael Roberts, Matthew J. Salganik, Marta Serra-Garcia, Brandon M. Stewart, Gilles Vandewiele, Arvind Narayanan

    Abstract: Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways acros… ▽ More

    Submitted 19 September, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  29. arXiv:2307.13579  [pdf, other

    cs.LG cs.AI math.ST

    Reinterpreting survival analysis in the universal approximator age

    Authors: Sören Dittmer, Michael Roberts, Jacobus Preller, AIX COVNET, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb

    Abstract: Survival analysis is an integral part of the statistical toolbox. However, while most domains of classical statistics have embraced deep learning, survival analysis only recently gained some minor attention from the deep learning community. This recent development is likely in part motivated by the COVID-19 pandemic. We aim to provide the tools needed to fully harness the potential of survival ana… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  30. arXiv:2307.12186  [pdf, other

    cs.MA cs.SI stat.AP

    Inferring epidemic dynamics using Gaussian process emulation of agent-based simulations

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Computational models help decision makers understand epidemic dynamics to optimize public health interventions. Agent-based simulation of disease spread in synthetic populations allows us to compare and contrast different effects across identical populations or to investigate the effect of interventions kee** every other factor constant between ``digital twins''. FRED (A Framework for Reconstruc… ▽ More

    Submitted 11 September, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: To be presented in Winter Simulation Conference 2023, repository link: https://github.com/abdulrahmanfci/gpr-abm

  31. arXiv:2306.09177  [pdf, other

    cs.LG

    Dis-AE: Multi-domain & Multi-task Generalisation on Real-World Clinical Data

    Authors: Daniel Kreuter, Samuel Tull, Julian Gilbey, Jacobus Preller, BloodCounts! Consortium, John A. D. Aston, James H. F. Rudd, Suthesh Sivapalaratnam, Carola-Bibiane Schönlieb, Nicholas Gleadall, Michael Roberts

    Abstract: Clinical data is often affected by clinically irrelevant factors such as discrepancies between measurement devices or differing processing methods between sites. In the field of machine learning (ML), these factors are known as domains and the distribution differences they cause in the data are known as domain shifts. ML models trained using data from one domain often perform poorly when applied t… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 17 pages main body, 5 figures, 18 pages of appendix

  32. arXiv:2305.09035  [pdf, other

    cs.LG

    Algorithmic Censoring in Dynamic Learning Systems

    Authors: Jennifer Chien, Margaret Roberts, Berk Ustun

    Abstract: Dynamic learning systems subject to selective labeling exhibit censoring, i.e. persistent negative predictions assigned to one or more subgroups of points. In applications like consumer finance, this results in groups of applicants that are persistently denied and thus never enter into the training data. In this work, we formalize censoring, demonstrate how it can arise, and highlight difficulties… ▽ More

    Submitted 29 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 28 pages, 9 figures

  33. arXiv:2305.08897  [pdf, other

    hep-th cond-mat.mes-hall cond-mat.str-el gr-qc

    Analog gravity and the continuum effective theory of the graphene tight binding lattice model

    Authors: Matthew M. Roberts, Toby Wiseman

    Abstract: We consider the tight-binding model of graphene with slowly spatially varying hop** functions. We develop a low energy approximation as a derivative expansion in a Dirac spinor that is perturbative in the hop** function deformation. The leading description is the Dirac equation in flat 2+1-d spacetime with (strain-)gauge field. Prior work considered subleading corrections written as non-trivia… ▽ More

    Submitted 15 August, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 39 pages, 7 figures, downloadable Mathematica notebook, abridged abstract; v2. typos corrected, references added

    Report number: Imperial/TP/2023/TW/01, APCTP Pre2023 - 006

  34. arXiv:2305.05434  [pdf, other

    physics.space-ph astro-ph.IM

    RAAD: LIGHT-1 CubeSat's Payload for the Detection of Terrestrial Gamma-Ray Flashes

    Authors: A. Di Giovanni, F. Arneodo, A. Al Qasim, H. Alblooshi, F. AlKhouri, L. Alkindi, A. AlMannei, M. L. Benabderrahmane, G. Bruno, V. Conicella, O. Fawwaz, G. Franchi, S. Kalos, P. Oikonomou, L. Perillo, C. Pittori, M. S. Roberts, R. Torres

    Abstract: The Rapid Acquisition Atmospheric Detector (RAAD), onboard the LIGHT-1 3U CubeSat, detects photons between hard X-rays and soft gamma-rays, in order to identify and characterize Terrestrial Gamma Ray Flashes (TGFs). Three detector configurations are tested, making use of Cerium Bromide and Lanthanum BromoChloride scintillating crystals coupled to photomultiplier tubes or Multi-Pixel Photon Counter… ▽ More

    Submitted 16 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 19 pages, 15 figures

  35. arXiv:2305.05125  [pdf, other

    hep-ph astro-ph.CO physics.atom-ph

    Accurate electron-recoil ionization factors for dark matter direct detection in xenon, krypton and argon

    Authors: A. R. Caddell, V. V. Flambaum, B. M. Roberts

    Abstract: While most scintillation-based dark matter experiments search for Weakly Interacting Massive Particles (WIMPs), a sub-GeV WIMP-like particle may also be detectable in these experiments. While dark matter of this type and scale would not leave appreciable nuclear recoil signals, it may instead induce ionization of atomic electrons. Accurate modelling of the atomic wavefunctions is key to investigat… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Journal ref: Physical Review D 108, 083030 (2023)

  36. arXiv:2302.10114  [pdf, other

    physics.med-ph

    open-UST: An Open-Source Ultrasound Tomography Transducer Array System

    Authors: Morgan Roberts, Eleanor Martin, Michael D. Brown, Ben T. Cox, Bradley E. Treeby

    Abstract: Fast imaging methods are needed to promote widespread clinical adoption of Ultrasound Tomography (UST), and more widely available UST hardware could support the experimental validation of new measurement configurations. In this work, an open-source 256-element transducer ring array was developed (morganjroberts.github.io/open-UST) and manufactured using rapid prototy**, for only £2k. Novel manuf… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 16 pages, 12 figures

  37. Neutron star mass estimates from gamma-ray eclipses in spider millisecond pulsar binaries

    Authors: C. J. Clark, M. Kerr, E. D. Barr, B. Bhattacharyya, R. P. Breton, P. Bruel, F. Camilo, W. Chen, I. Cognard, H. T. Cromartie, J. Deneva, V. S. Dhillon, L. Guillemot, M. R. Kennedy, M. Kramer, A. G. Lyne, D. Mata Sánchez, L. Nieder, C. Phillips, S. M. Ransom, P. S. Ray, M. S. E. Roberts, J. Roy, D. A. Smith, R. Spiewak , et al. (4 additional authors not shown)

    Abstract: Reliable neutron star mass measurements are key to determining the equation-of-state of cold nuclear matter, but these are rare. "Black Widows" and "Redbacks" are compact binaries consisting of millisecond pulsars and semi-degenerate companion stars. Spectroscopy of the optically bright companions can determine their radial velocities, providing inclination-dependent pulsar mass estimates. While i… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 31 pages, 4 figures, includes supplementary tables; published in Nature Astronomy

  38. QED radiative corrections to electric dipole amplitudes in heavy atoms

    Authors: C. J. Fairhall, B. M. Roberts, J. S. M. Ginges

    Abstract: We use the radiative potential method to perform a detailed study of quantum electrodynamics (QED) radiative corrections to electric dipole (E1) transition amplitudes in heavy alkali-metal atoms Rb, Cs, Fr, and alkali-metal-like ions Sr+, Ba+, and Ra+. The validity of the method is checked by comparing with the results of rigorous QED in simple atomic potentials. We study the effects of core relax… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Journal ref: Phys. Rev. A 107, 022813 (2023)

  39. Experimental and theoretical study of dynamic polarizabilities in the $5S_{1/2}$-$5D_{5/2}$ clock transition in rubidium-87 and determination of E1 matrix elements

    Authors: Rhona Hamilton, Benjamin M. Roberts, Sarah K. Scholten, Clayton Locke, Andre N. Luiten, Jacinda S. M. Ginges, Christopher Perrella

    Abstract: The interaction between light and an atom causes perturbations in the atom's energy levels, known as the light-shift. These light-shifts are a key source of inaccuracy in atomic clocks, and can also deteriorate their precision. We present a study of light-shifts and associated dynamic polarizabilities for a two-photon atomic clock based on the $5S_{1/2}$-$5D_{5/2}$ transition in rubidium-87 over t… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Journal ref: Physical Review Applied 19, 054059 (2023)

  40. Electric dipole transition amplitudes for atoms and ions with one valence electron

    Authors: B. M. Roberts, C. J. Fairhall, J. S. M. Ginges

    Abstract: Motivated by recent measurements for several alkali-metal atoms and alkali-metal-like ions, we perform a detailed study of electric dipole (E1) transition amplitudes in K, Ca+, Rb, Sr+, Cs, Ba+, Fr, and Ra+, which are of interest for studies of atomic parity violation, electric dipole moments, and polarizabilities. Using the all-orders correlation potential method, we perform high-precision calcul… ▽ More

    Submitted 6 March, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Journal ref: Physical Review A 107, 052812 (2023)

  41. High-Resolution Radio Study of the Dragonfly Nebula

    Authors: Ruolan **, C. -Y. Ng, Mallory S. E. Roberts, Kwan-Lok Li

    Abstract: The Dragonfly Nebula (G75.2$+$0.1) powered by the young pulsar J2021$+$3651 is a rare pulsar wind nebula (PWN) that shows double tori and polar jets enclosed by a bow-shock structure in X-rays. We present new radio observations of this source taken with the Very Large Array (VLA) at 6 GHz. The radio PWN has an overall size about two times as large as the X-ray counterpart, consisting of a bright m… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 12 pages, 11 figures; submitted to ApJ

  42. arXiv:2211.09970  [pdf, other

    cs.LG cs.AI cs.CE physics.data-an

    Estimating defection in subscription-type markets: empirical analysis from the scholarly publishing industry

    Authors: Michael Roberts, J. Ignacio Deza, Hisham Ihshaish, Yanhui Zhu

    Abstract: We present the first empirical study on customer churn prediction in the scholarly publishing industry. The study examines our proposed method for prediction on a customer subscription data over a period of 6.5 years, which was provided by a major academic publisher. We explore the subscription-type market within the context of customer defection and modelling, and provide analysis of the business… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  43. arXiv:2210.15915  [pdf, other

    astro-ph.EP astro-ph.IM

    Phase II of the Keck Planet Imager and Characterizer: system-level laboratory characterization and preliminary on-sky commissioning

    Authors: Daniel Echeverri, Nemanja Jovanovic, Jacques-Robert Delorme, Yinzi Xin, Tobias Schofield, Luke Finnerty, Jason J. Wang, Jerry Xuan, Dimitri Mawet, Ashley Baker, Randall Bartos, Charlotte Z. Bond, Marta L. Bryan, Benjamin Calvin, Sylvain Cetre, Greg Doppmann, Michael P. Fitzgerald, Jason Fucik, Katelyn Horstman, Ronald Lopez, Emily C. Martin, Stefan Martin, Bertrand Mennesson, Evan Morris, Reston Nash , et al. (13 additional authors not shown)

    Abstract: The Keck Planet Imager and Characterizer (KPIC) is a series of upgrades for the Keck II Adaptive Optics (AO) system and the NIRSPEC spectrograph to enable diffraction-limited, high-resolution ($R>30,000$) spectroscopy of exoplanets and low-mass companions in the K and L bands. Phase I consisted of single-mode fiber injection/extraction units (FIU/FEU) used in conjunction with an H-band pyramid wav… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 13 pages; 6 figures; to appear in Proceedings of the SPIE, Ground-based and Airborne Instrumentation for Astronomy IX, Vol. 12184

    Journal ref: Proc. SPIE 12184, 121841W (2022)

  44. Navigating the challenges in creating complex data systems: a development philosophy

    Authors: Sören Dittmer, Michael Roberts, Julian Gilbey, Ander Biguri, AIX-COVNET Collaboration, Jacobus Preller, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb

    Abstract: In this perspective, we argue that despite the democratization of powerful tools for data science and machine learning over the last decade, develo** the code for a trustworthy and effective data science system (DSS) is getting harder. Perverse incentives and a lack of widespread software engineering (SE) skills are among many root causes we identify that naturally give rise to the current syste… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  45. arXiv:2210.09465  [pdf, other

    cs.CV cs.LG

    Understanding CNN Fragility When Learning With Imbalanced Data

    Authors: Damien Dablain, Kristen N. Jacobson, Colin Bellinger, Mark Roberts, Nitesh Chawla

    Abstract: Convolutional neural networks (CNNs) have achieved impressive results on imbalanced image data, but they still have difficulty generalizing to minority classes and their decisions are difficult to interpret. These problems are related because the method by which CNNs generalize to minority classes, which requires improvement, is wrapped in a blackbox. To demystify CNN decisions on imbalanced data,… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  46. arXiv:2210.06849  [pdf, other

    cs.CV

    Retrospectives on the Embodied AI Workshop

    Authors: Matt Deitke, Dhruv Batra, Yonatan Bisk, Tommaso Campari, Angel X. Chang, Devendra Singh Chaplot, Changan Chen, Claudia Pérez D'Arpino, Kiana Ehsani, Ali Farhadi, Li Fei-Fei, Anthony Francis, Chuang Gan, Kristen Grauman, David Hall, Winson Han, Unnat Jain, Aniruddha Kembhavi, Jacob Krantz, Stefan Lee, Chengshu Li, Sagnik Majumder, Oleksandr Maksymets, Roberto Martín-Martín, Roozbeh Mottaghi , et al. (14 additional authors not shown)

    Abstract: We present a retrospective on the state of Embodied AI research. Our analysis focuses on 13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are grouped into three themes: (1) visual navigation, (2) rearrangement, and (3) embodied vision-and-language. We discuss the dominant datasets within each theme, evaluation metrics for the challenges, and the performance of state-of… ▽ More

    Submitted 4 December, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  47. arXiv:2209.05521  [pdf, other

    math.DG math-ph math.CT

    Rigid models for 2-gerbes I: Chern-Simons geometry

    Authors: David Michael Roberts, Raymond F. Vozzo

    Abstract: Motivated by the problem of constructing explicit geometric string structures, we give a rigid model for bundle 2-gerbes, and define connective structures thereon. This model is designed to make explicit calculations easier, for instance in applications to physics. To compare to the existing definition, we give a functorial construction of a bundle 2-gerbe as in the literature from our rigid model… ▽ More

    Submitted 12 April, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: v2 Streamlined introduction, now 62 pages+19 pages appendices+3 pages refs

    MSC Class: 53C08 (Primary); 57R19; 57R20; 81T99

  48. arXiv:2209.05099  [pdf, other

    physics.atom-ph hep-ph nucl-ex nucl-th

    Empirical determination of the Bohr-Weisskopf effect in cesium and improved tests of precision atomic theory in searches for new physics

    Authors: G. Sanamyan, B. M. Roberts, J. S. M. Ginges

    Abstract: The finite distribution of the nuclear magnetic moment across the nucleus gives a contribution to the hyperfine structure known as the Bohr-Weisskopf (BW) effect. We have obtained an empirical value of -0.24(18)% for this effect in the ground and excited s states of atomic Cs-133. This value is found from historical muonic-atom measurements in combination with our muonic-atom and atomic many-body… ▽ More

    Submitted 13 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    Journal ref: Phys. Rev. Lett. 130, 053001 (2023)

  49. arXiv:2208.11769  [pdf

    cond-mat.mtrl-sci

    Designing low-cost TaC virtual substrates for $Al_xGa_{1-x}N$ epitaxy

    Authors: Dennice M. Roberts, Andrew Norman, Moira K. Miller, M. Brooks Tellekamp

    Abstract: $Al_xGa_{1-x}N$ is a critical ultra-wide bandgap material for optoelectronics, but the deposition of thick, high quality epitaxial layers has been hindered by a lack of lattice-matched substrates. Here we identify the (111) face of transition metal carbides as a suitable class of materials for substrates lattice matched to (0001) $Al_xGa_{1-x}N… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 6 figures, supporting information available

  50. arXiv:2207.13179  [pdf, other

    cs.LG stat.ML

    Unsupervised Learning under Latent Label Shift

    Authors: Manley Roberts, Pranav Mani, Saurabh Garg, Zachary C. Lipton

    Abstract: What sorts of structure might enable a learner to discover classes from unlabeled data? Traditional approaches rely on feature-space similarity and heroic assumptions on the data. In this paper, we introduce unsupervised learning under Latent Label Shift (LLS), where we have access to unlabeled data from multiple domains such that the label marginals $p_d(y)$ can shift across domains but the class… ▽ More

    Submitted 1 December, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2022. Manley Roberts and Pranav Mani contributed equally to this work