-
Non-Linear Inference Time Intervention: Improving LLM Truthfulness
Authors:
Jakub Hoscilowicz,
Adam Wiacek,
Jan Chojnacki,
Adam Cieslak,
Leszek Michon,
Vitalii Urbanevych,
Artur Janicki
Abstract:
In this work, we explore LLM's internal representation space to identify attention heads that contain the most truthful and accurate information. We further developed the Inference Time Intervention (ITI) framework, which lets bias LLM without the need for fine-tuning. The improvement manifests in introducing a non-linear multi-token probing and multi-token intervention: Non-Linear ITI (NL-ITI), w…
▽ More
In this work, we explore LLM's internal representation space to identify attention heads that contain the most truthful and accurate information. We further developed the Inference Time Intervention (ITI) framework, which lets bias LLM without the need for fine-tuning. The improvement manifests in introducing a non-linear multi-token probing and multi-token intervention: Non-Linear ITI (NL-ITI), which significantly enhances performance on evaluation benchmarks. NL-ITI is tested on diverse multiple-choice datasets, including TruthfulQA, on which we report over 16% relative MC1 (accuracy of model pointing to the correct answer) improvement with respect to the baseline ITI results. Moreover, we achieved a 10% relative improvement over the recently released Truth Forest (TrFf) method that also focused on ITI improvement.
△ Less
Submitted 6 June, 2024; v1 submitted 27 March, 2024;
originally announced March 2024.
-
On ideals related to Laver and Miller trees
Authors:
Aleksander Cieślak,
Arturo Martínez-Celis
Abstract:
In this work we consider the ideals $m^0(\mathcal{I})$ and $\ell^0(\mathcal{I})$, ideals generated by the $\mathcal{I}$-positive Miller trees and $\mathcal{I}$-positive Laver trees, respectively. We investigate in which cases these ideals have cofinality larger than $\mathfrak{c}$ and we calculate some cardinal invariants closely related to these ideals.
In this work we consider the ideals $m^0(\mathcal{I})$ and $\ell^0(\mathcal{I})$, ideals generated by the $\mathcal{I}$-positive Miller trees and $\mathcal{I}$-positive Laver trees, respectively. We investigate in which cases these ideals have cofinality larger than $\mathfrak{c}$ and we calculate some cardinal invariants closely related to these ideals.
△ Less
Submitted 22 May, 2024; v1 submitted 3 December, 2023;
originally announced December 2023.
-
Nonmeasurable images
Authors:
Aleksander Cieślak,
Robert Rałowski
Abstract:
In this article we will investigate nonmeasurability with respect to some $σ$-ideals in Polish space $X,$ of images of subsets of $X$ by selected map**s defined on the space $X$. Among of them we answer the following question: "It is true that there exists a subset of the unit disc in the real plane such that the continuum many projections onto lines are Lebesgue measurable and continuum many pr…
▽ More
In this article we will investigate nonmeasurability with respect to some $σ$-ideals in Polish space $X,$ of images of subsets of $X$ by selected map**s defined on the space $X$. Among of them we answer the following question: "It is true that there exists a subset of the unit disc in the real plane such that the continuum many projections onto lines are Lebesgue measurable and continuum many projections are not?". It is known that there exists continuous function $f:[0,1]\to [0,1]$ such that for every Bernstein set $B\subseteq [0,1]$ we have $f[B]=[0,1].$ We show relative consistency with $ZFC$ of fact that the above result is not true for some $\cn$ or $\cm$-completely nonmeasurable sets, even if we take less than $\c$ many continuous functions.
△ Less
Submitted 29 December, 2021;
originally announced December 2021.
-
Optimal Transport of Information
Authors:
Semyon Malamud,
Anna Cieslak,
Andreas Schrimpf
Abstract:
We study the general problem of Bayesian persuasion (optimal information design) with continuous actions and continuous state space in arbitrary dimensions. First, we show that with a finite signal space, the optimal information design is always given by a partition. Second, we take the limit of an infinite signal space and characterize the solution in terms of a Monge-Kantorovich optimal transpor…
▽ More
We study the general problem of Bayesian persuasion (optimal information design) with continuous actions and continuous state space in arbitrary dimensions. First, we show that with a finite signal space, the optimal information design is always given by a partition. Second, we take the limit of an infinite signal space and characterize the solution in terms of a Monge-Kantorovich optimal transport problem with an endogenous information transport cost. We use our novel approach to: 1. Derive necessary and sufficient conditions for optimality based on Bregman divergences for non-convex functions. 2. Compute exact bounds for the Hausdorff dimension of the support of an optimal policy. 3. Derive a non-linear, second-order partial differential equation whose solutions correspond to regular optimal policies. We illustrate the power of our approach by providing explicit solutions to several non-linear, multidimensional Bayesian persuasion problems.
△ Less
Submitted 9 March, 2021; v1 submitted 22 February, 2021;
originally announced February 2021.
-
Universal sets for ideals
Authors:
Aleksander Cieślak,
Marcin Michalski
Abstract:
In this paper we consider a notion of universal sets for ideals. We show that there exist universal sets of minimal Borel complexity for classic ideals like null subsets of $2^ω$ and meager subsets of any Polish space, and demonstrate that the existence of such sets is helpful in establishing some facts about the real line in generic extensions. We also construct universal sets for $\mathcal{E}$ -…
▽ More
In this paper we consider a notion of universal sets for ideals. We show that there exist universal sets of minimal Borel complexity for classic ideals like null subsets of $2^ω$ and meager subsets of any Polish space, and demonstrate that the existence of such sets is helpful in establishing some facts about the real line in generic extensions. We also construct universal sets for $\mathcal{E}$ - the $σ$-ideal generated by closed null subsets of $2^ω$, and for some ideals connected with forcing notions: $\mathcal{K}_σ$ subsets of $ω^ω$ and the Laver ideal. We also consider Fubini products of ideals and show that there are $Σ^0_3$ universal sets for $\mathcal{N}\otimes\mathcal{M}$ and $\mathcal{M}\otimes\mathcal{N}$.
△ Less
Submitted 18 July, 2019;
originally announced July 2019.