Showing 1–6 of 6 results for author: Becker, E

  1. arXiv:2406.03441  [pdf, other]

    cs.CL cs.LG

    Cycles of Thought: Measuring LLM Confidence through Stable Explanations

    Authors: Evan Becker, Stefano Soatto

    Abstract: In many high-risk machine learning applications it is essential for a model to indicate when it is uncertain about a prediction. While large language models (LLMs) can reach and even surpass human-level accuracy on a variety of benchmarks, their overconfidence in incorrect responses is still a well-documented failure mode. Traditional methods for ML uncertainty quantification can be difficult to d…

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2305.08277  [pdf, other]

    cs.LG stat.ML

    Local Convergence of Gradient Descent-Ascent for Training Generative Adversarial Networks

    Authors: Evan Becker, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Generative Adversarial Networks (GANs) are a popular formulation to train generative models for complex high dimensional data. The standard method for training GANs involves a gradient descent-ascent (GDA) procedure on a minimax optimization problem. This procedure is hard to analyze in general due to the nonlinear nature of the dynamics. We study the local dynamics of GDA for training a GAN with…

    Submitted 29 May, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

  3. arXiv:2303.11937  [pdf, other]

    cs.DS cs.LG math.OC

    High Probability Bounds for Stochastic Continuous Submodular Maximization

    Authors: Evan Becker, Jingdong Gao, Ted Zadouri, Baharan Mirzasoleiman

    Abstract: We consider maximization of stochastic monotone continuous submodular functions (CSF) with a diminishing return property. Existing algorithms only guarantee the performance \textit{in expectation}, and do not bound the probability of getting a bad solution. This implies that for a particular run of the algorithms, the solution may be much worse than the provided guarantee in expectation. In this p…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023

  4. K-D Bonsai: ISA-Extensions to Compress K-D Trees for Autonomous Driving Tasks

    Authors: Pedro H. E. Becker, José María Arnau, Antonio González

    Abstract: Autonomous Driving (AD) systems extensively manipulate 3D point clouds for object detection and vehicle localization. Thereby, efficient processing of 3D point clouds is crucial in these systems. In this work we propose K-D Bonsai, a technique to cut down memory usage during radius search, a critical building block of point cloud processing. K-D Bonsai exploits value similarity in the data structu…

    Submitted 30 August, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    MSC Class: Article No. 18; 2018 Related DOI: https://doi.org/10.1145/3243176.3243184

    Journal ref: ISCA'23 Proceedings of the 50th Annual International Symposium on Computer Architecture, Article No. 20, 2023

  5. arXiv:2208.09938  [pdf, other]

    cs.LG

    Instability and Local Minima in GAN Training with Kernel Discriminators

    Authors: Evan Becker, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Generative Adversarial Networks (GANs) are a widely-used tool for generative modeling of complex data. Despite their empirical success, the training of GANs is not fully understood due to the min-max optimization of the generator and discriminator. This paper analyzes these joint dynamics when the true samples, as well as the generated samples, are discrete, finite sets, and the discriminator is k…

    Submitted 21 August, 2022; originally announced August 2022.

  6. Software Sustainability & High Energy Physics

    Authors: Daniel S. Katz, Sudhir Malik, Mark S. Neubauer, Graeme A. Stewart, Kétévi A. Assamagan, Erin A. Becker, Neil P. Chue Hong, Ian A. Cosden, Samuel Meehan, Edward J. W. Moyse, Adrian M. Price-Whelan, Elizabeth Sexton-Kennedy, Meirin Oan Evans, Matthew Feickert, Clemens Lange, Kilian Lieret, Rob Quick, Arturo Sánchez Pineda, Christopher Tunnell

    Abstract: New facilities of the 2020s, such as the High Luminosity Large Hadron Collider (HL-LHC), will be relevant through at least the 2030s. This means that their software efforts and those that are used to analyze their data need to consider sustainability to enable their adaptability to new challenges, longevity, and efficiency, over at least this period. This will help ensure that this software will b…

    Submitted 16 October, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: A report from the "Sustainable Software in HEP" IRIS-HEP blueprint workshop: https://indico.cern.ch/event/930127/