Showing 1–1 of 1 results for author: Michon, L

  1. arXiv:2403.18680 [pdf, other]

    cs.CL cs.LG

    Non-Linear Inference Time Intervention: Improving LLM Truthfulness

    Authors: Jakub Hoscilowicz, Adam Wiacek, Jan Chojnacki, Adam Cieslak, Leszek Michon, Vitalii Urbanevych, Artur Janicki

    Abstract: In this work, we explore the internal representation space of LLMs to identify attention heads that contain the most truthful and accurate information. We further develop the Inference Time Intervention (ITI) framework, which makes it possible to bias an LLM without fine-tuning. The improvement consists of introducing non-linear multi-token probing and multi-token intervention: Non-Linear ITI (NL-ITI), w…

    Submitted 6 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at the Interspeech 2024 conference. Code is available at https://github.com/Samsung/NL-ITI