
Showing 1–3 of 3 results for author: Beigi, M

Searching in archive cs.
  1. arXiv:2406.12053  [pdf, other]

    cs.CL

    InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States

    Authors: Mohammad Beigi, Ying Shen, Runing Yang, Zihao Lin, Qifan Wang, Ankith Mohan, Jianfeng He, Ming **, Chang-Tien Lu, Lifu Huang

    Abstract: Despite their vast capabilities, Large Language Models (LLMs) often struggle with generating reliable outputs, frequently producing high-confidence inaccuracies known as hallucinations. Addressing this challenge, our research introduces InternalInspector, a novel framework designed to enhance confidence estimation in LLMs by leveraging contrastive learning on internal states including attention st…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 8 pages

  2. arXiv:2402.11122  [pdf, other]

    cs.CL cs.AI

    Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models

    Authors: Zihao Lin, Mohammad Beigi, Hongxuan Li, Yufan Zhou, Yuxiang Zhang, Qifan Wang, Wenpeng Yin, Lifu Huang

    Abstract: Memory Editing (ME) has emerged as an efficient method to modify erroneous facts or inject new facts into Large Language Models (LLMs). Two mainstream ME methods exist: parameter-modifying ME and parameter-preserving ME (integrating extra modules while preserving original parameters). Regrettably, previous studies on ME evaluation have two critical limitations: (i) evaluating LLMs with single edit…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: preprint, 15 pages

  3. arXiv:1105.5675  [pdf]

    cs.MM cs.CV

    Scale-Invariant Local Descriptor for Event Recognition in 1D Sensor Signals

    Authors: Jierui Xie, Mandis S. Beigi

    Abstract: In this paper, we introduce a shape-based, time-scale invariant feature descriptor for 1-D sensor signals. The time-scale invariance of the feature allows us to use feature from one training event to describe events of the same semantic class which may take place over varying time scales such as walking slow and walking fast. Therefore it requires less training set. The descriptor takes advantage…

    Submitted 27 May, 2011; originally announced May 2011.

    Journal ref: IEEE International Conference on Multimedia & Expo (ICME), pp. 1226-1229, 2009