Skip to main content

Showing 1–5 of 5 results for author: Hendryx, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00332  [pdf, other

    cs.CL cs.AI cs.LG

    A Careful Examination of Large Language Model Performance on Grade School Arithmetic

    Authors: Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue

    Abstract: Large language models (LLMs) have achieved impressive success on many benchmarks for mathematical reasoning. However, there is growing concern that some of this performance actually reflects dataset contamination, where data closely resembling benchmark questions leaks into the training data, instead of true reasoning ability. To investigate this claim rigorously, we commission Grade School Math 1… ▽ More

    Submitted 3 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2401.12129  [pdf, other

    cs.CV cs.LG

    Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy

    Authors: Will LeVine, Benjamin Pikus, Jacob Phillips, Berk Norman, Fernando Amat Gil, Sean Hendryx

    Abstract: As deep neural networks become adopted in high-stakes domains, it is crucial to be able to identify when inference inputs are Out-of-Distribution (OOD) so that users can be alerted of likely drops in performance and calibration despite high confidence. Among many others, existing methods use the following two scores to do so without training on any apriori OOD examples: a learned temperature and a… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  3. arXiv:2311.14743  [pdf, other

    cs.CL cs.LG

    A Baseline Analysis of Reward Models' Ability To Accurately Analyze Foundation Models Under Distribution Shift

    Authors: Will LeVine, Benjamin Pikus, Anthony Chen, Sean Hendryx

    Abstract: Foundation models, specifically Large Language Models (LLMs), have lately gained wide-spread attention and adoption. Reinforcement Learning with Human Feedback (RLHF) involves training a reward model to capture desired behaviors, which is then used to align LLM's. These reward models are additionally used at inference-time to estimate LLM responses' adherence to those desired behaviors. However, t… ▽ More

    Submitted 24 January, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  4. arXiv:2109.00150  [pdf, other

    cs.LG

    Federated Reconnaissance: Efficient, Distributed, Class-Incremental Learning

    Authors: Sean M. Hendryx, Dharma Raj KC, Bradley Walls, Clayton T. Morrison

    Abstract: We describe federated reconnaissance, a class of learning problems in which distributed clients learn new concepts independently and communicate that knowledge efficiently. In particular, we propose an evaluation framework and methodological baseline for a system in which each client is expected to learn a growing set of classes and communicate knowledge of those classes efficiently with other cli… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

  5. arXiv:1912.06290  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Meta-Learning Initializations for Image Segmentation

    Authors: Sean M. Hendryx, Andrew B. Leach, Paul D. Hein, Clayton T. Morrison

    Abstract: We extend first-order model agnostic meta-learning algorithms (including FOMAML and Reptile) to image segmentation, present a novel neural network architecture built for fast learning which we call EfficientLab, and leverage a formal definition of the test error of meta-learning algorithms to decrease error on out of distribution tasks. We show state of the art results on the FSS-1000 dataset by m… ▽ More

    Submitted 7 May, 2020; v1 submitted 12 December, 2019; originally announced December 2019.