Showing 1–14 of 14 results for author: Booth, S

  1. arXiv:2402.04922  [pdf, other]

    stat.ML cs.LG

    Voronoi Candidates for Bayesian Optimization

    Authors: Nathan Wycoff, John W. Smith, Annie S. Booth, Robert B. Gramacy

    Abstract: Bayesian optimization (BO) offers an elegant approach for efficiently optimizing black-box functions. However, acquisition criteria demand their own challenging inner-optimization, which can induce significant overhead. Many practical BO methods, particularly in high dimension, eschew a formal, continuous optimization of the acquisition function and instead search discretely over a finite set of s…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: comments very welcome

  2. arXiv:2312.14369  [pdf, other]

    cs.CY cs.LG

    Quality-Diversity Generative Sampling for Learning with Synthetic Data

    Authors: Allen Chang, Matthew C. Fontaine, Serena Booth, Maja J. Matarić, Stefanos Nikolaidis

    Abstract: Generative models can serve as surrogates for some real data sources by creating synthetic training datasets, but in doing so they may transfer biases to downstream tasks. We focus on protecting quality and diversity when generating synthetic training datasets. We propose quality-diversity generative sampling (QDGS), a framework for sampling data uniformly across a user-defined measure space, desp…

    Submitted 27 February, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024; 7 pages main, 12 pages total, 9 figures

  3. arXiv:2310.02456  [pdf, other]

    cs.LG cs.AI

    Learning Optimal Advantage from Preferences and Mistaking it for Reward

    Authors: W. Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson, Serena Booth, Anca Dragan, Peter Stone, Scott Niekum

    Abstract: We consider algorithms for learning reward functions from human preferences over pairs of trajectory segments, as used in reinforcement learning from human feedback (RLHF). Most recent work assumes that human preferences are generated based only upon the reward accrued within those segments, or their partial return. Recent work casts doubt on the validity of this assumption, proposing an alternati…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 8 pages (16 pages with references and appendix), 11 figures

    ACM Class: I.2.6; I.2.8

  4. arXiv:2309.10069  [pdf]

    q-bio.NC cs.AI

    Sex-based Disparities in Brain Aging: A Focus on Parkinson's Disease

    Authors: Iman Beheshti, Samuel Booth, Ji Hyun Ko

    Abstract: Parkinson's disease (PD) is linked to faster brain aging. Sex is recognized as an important factor in PD, such that males are twice as likely as females to have the disease and have more severe symptoms and a faster progression rate. Despite previous research, there remains a significant gap in understanding the function of sex in the process of brain aging in PD patients. The T1-weighted MRI-driven brain-predicted age…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 35 pages, 5 figures

  5. arXiv:2303.16320  [pdf]

    physics.med-ph cs.CV

    SynthRAD2023 Grand Challenge dataset: generating synthetic CT for radiotherapy

    Authors: Adrian Thummerer, Erik van der Bijl, Arthur Jr Galapon, Joost JC Verhoeff, Johannes A Langendijk, Stefan Both, Cornelis AT van den Berg, Matteo Maspero

    Abstract: Purpose: Medical imaging has become increasingly important in diagnosing and treating oncological patients, particularly in radiotherapy. Recent advances in synthetic computed tomography (sCT) generation have increased interest in public challenges to provide data and evaluation metrics for comparing different approaches openly. This paper describes a dataset of brain and pelvis computed tomograph…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 15 pages, 4 figures, 9 tables, pre-print submitted to Medical Physics - dataset. The training dataset is available on Zenodo at https://doi.org/10.5281/zenodo.7260705 from April 1st, 2023

  6. arXiv:2206.02231  [pdf, other]

    cs.LG cs.AI eess.SY

    Models of human preference for learning reward functions

    Authors: W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum, Peter Stone, Alessandro Allievi

    Abstract: The utility of reinforcement learning is limited by the alignment of reward functions with the interests of human stakeholders. One promising method for alignment is to learn the reward function from human-generated preferences between pairs of trajectory segments, a type of reinforcement learning from human feedback (RLHF). These human preferences are typically assumed to be informed solely by pa…

    Submitted 6 September, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: 16 pages (40 pages with references and appendix), 23 figures

    ACM Class: I.2.6; I.2.8

  7. arXiv:2112.07457  [pdf, other]

    stat.CO cs.LG stat.ML

    Triangulation candidates for Bayesian optimization

    Authors: Robert B. Gramacy, Annie Sauer, Nathan Wycoff

    Abstract: Bayesian optimization involves "inner optimization" over a new-data acquisition criterion which is non-convex/highly multi-modal, may be non-differentiable, or may otherwise thwart local numerical optimizers. In such cases it is common to replace continuous search with a discrete one over random candidates. Here we propose using candidates based on a Delaunay triangulation of the existing input de…

    Submitted 20 May, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: 10 pages, 5 figures

  8. arXiv:2110.07550  [pdf, other]

    cs.CL

    The Irrationality of Neural Rationale Models

    Authors: Yiming Zheng, Serena Booth, Julie Shah, Yilun Zhou

    Abstract: Neural rationale models are popular for interpretable predictions of NLP tasks. In these, a selector extracts segments of the input text, called rationales, and passes these segments to a classifier for prediction. Since the rationale is the only information accessible to the classifier, it is plausibly defined as the explanation. Is such a characterization unconditionally correct? In this paper,…

    Submitted 23 July, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: NAACL Workshop on Trustworthy Natural Language Processing (TrustNLP) 2022

  9. Machine Learning Practices Outside Big Tech: How Resource Constraints Challenge Responsible Development

    Authors: Aspen Hopkins, Serena Booth

    Abstract: Practitioners from diverse occupations and backgrounds are increasingly using machine learning (ML) methods. Nonetheless, studies on ML Practitioners typically draw populations from Big Tech and academia, as researchers have easier access to these communities. Through this selection bias, past research often excludes the broader, lesser-resourced ML community -- for example, practitioners working…

    Submitted 6 October, 2021; originally announced October 2021.

    Journal ref: AAAI/ACM Conference on AI, Ethics, and Society 2021

  10. arXiv:2104.14403  [pdf, other]

    cs.LG cs.CV

    Do Feature Attribution Methods Correctly Attribute Features?

    Authors: Yilun Zhou, Serena Booth, Marco Tulio Ribeiro, Julie Shah

    Abstract: Feature attribution methods are popular in interpretable machine learning. These methods compute the attribution of each input feature to represent its importance, but there is no consensus on the definition of "attribution", leading to many competing methods with little systematic evaluation, complicated in particular by the lack of ground truth attribution. To address this, we propose a dataset…

    Submitted 15 December, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: AAAI 2022. Video summary at https://www.youtube.com/watch?v=kAodFw6jvvo

  11. arXiv:2012.13615  [pdf, other]

    cs.RO

    RoCUS: Robot Controller Understanding via Sampling

    Authors: Yilun Zhou, Serena Booth, Nadia Figueroa, Julie Shah

    Abstract: As robots are deployed in complex situations, engineers and end users must develop a holistic understanding of their behaviors, capabilities, and limitations. Some behaviors are directly optimized by the objective function. They often include success rate, completion time or energy consumption. Other behaviors -- e.g., collision avoidance, trajectory smoothness or motion legibility -- are typicall…

    Submitted 14 October, 2021; v1 submitted 25 December, 2020; originally announced December 2020.

    Comments: CoRL 2021. The project website is at https://yilunzhou.github.io/RoCUS/

  12. arXiv:2002.10248  [pdf, other]

    cs.LG stat.ML

    Bayes-TrEx: a Bayesian Sampling Approach to Model Transparency by Example

    Authors: Serena Booth, Yilun Zhou, Ankit Shah, Julie Shah

    Abstract: Post-hoc explanation methods are gaining popularity for interpreting, understanding, and debugging neural networks. Most analyses using such methods explain decisions in response to inputs drawn from the test set. However, the test set may have few examples that trigger some model behaviors, such as high-confidence failures or ambiguous classifications. To address these challenges, we introduce a…

    Submitted 16 December, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted at AAAI 2021

  13. arXiv:2001.03076  [pdf, other]

    cs.LG stat.ML

    Sampling Prediction-Matching Examples in Neural Networks: A Probabilistic Programming Approach

    Authors: Serena Booth, Ankit Shah, Yilun Zhou, Julie Shah

    Abstract: Though neural network models demonstrate impressive performance, we do not understand exactly how these black-box models make individual predictions. This drawback has led to substantial research devoted to understanding these models in areas such as robustness, interpretability, and generalization ability. In this paper, we consider the problem of exploring the prediction level sets of a classifier…

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: AAAI 2020 Workshop on Statistical Relational AI (StarAI 2020)

  14. arXiv:1306.5771  [pdf, other]

    astro-ph.IM astro-ph.CO cs.MS

    Panphasia: a user guide

    Authors: Adrian Jenkins, Stephen Booth

    Abstract: We make a very large realisation of a Gaussian white noise field, called PANPHASIA, public by releasing software that computes this field. Panphasia is designed specifically for setting up Gaussian initial conditions for cosmological simulations and resimulations of structure formation. We make available both software to compute the field itself and codes to illustrate applications including a mod…

    Submitted 24 June, 2013; originally announced June 2013.

    Comments: 11 pages, 2 figures. Software to calculate Panphasia is available from: http://icc.dur.ac.uk/Panphasia.php