Skip to main content

Showing 1–2 of 2 results for author: Bentham, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04965  [pdf, other

    cs.CL

    Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression

    Authors: Zhichao Xu, Ashim Gupta, Tao Li, Oliver Bentham, Vivek Srikumar

    Abstract: Large language models (LLMs) are increasingly deployed in real-world scenarios with the help of recent model compression techniques. Such momentum towards local deployment means the use of compressed LLMs will widely impact a large population. However, prior analysis works often prioritize on preserving perplexity which is a direct analogy to training loss. The impact of compression method on othe… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2402.14897  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Chain-of-Thought Unfaithfulness as Disguised Accuracy

    Authors: Oliver Bentham, Nathan Stringham, Ana Marasović

    Abstract: Understanding the extent to which Chain-of-Thought (CoT) generations align with a large language model's (LLM) internal computations is critical for deciding whether to trust an LLM's output. As a proxy for CoT faithfulness, Lanham et al. (2023) propose a metric that measures a model's dependence on its CoT for producing an answer. Within a single family of proprietary models, they find that LLMs… ▽ More

    Submitted 21 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: TMLR accepted paper camera-ready version. First two authors contributed equally. 8 pages main, 13 pages appendix