Understanding Jailbreak Success: A Study of Latent Space Dynamics in Large Language Models

Authors: Sarah Ball, Frauke Kreuter, Nina Rimsky

Abstract: Conversational Large Language Models are trained to refuse to answer harmful questions. However, emergent jailbreaking techniques can still elicit unsafe outputs, presenting an ongoing challenge for model alignment. To better understand how different jailbreak types circumvent safeguards, this paper analyses model activations on different jailbreak inputs. We find that it is possible to extract a jailbreak vector from a single class of jailbreaks that works to mitigate jailbreak effectiveness from other classes. This may indicate that different kinds of effective jailbreaks operate via similar internal mechanisms. We investigate a potential common mechanism of harmfulness feature suppression, and provide evidence for its existence by looking at the harmfulness vector component. These findings offer actionable insights for developing more robust jailbreak countermeasures and lay the groundwork for a deeper, mechanistic understanding of jailbreak dynamics in language models.

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.08916 [pdf, ps, other]

On additive codes over finite fields

Authors: Simeon Ball, Michel Lavrauw, Tabriz Popatia

Abstract: In this article we prove a Griesmer type bound for additive codes over finite fields. This new bound gives an upper bound on the length of maximum distance separable (MDS) codes, codes which attain the Singleton bound. We will also consider codes to be MDS if they obtain the fractional Singleton bound, due to Huffman. We prove that this bound in the fractional case can be obtained by codes whose length surpasses the length of the longest known codes in the non-fractional case. We also provide some exhaustive computational results over small fields and dimensions.

Submitted 13 June, 2024; originally announced June 2024.

MSC Class: 94B65

arXiv:2405.20524 [pdf, other]

Practical implementation of geometric quasi-cyclic LDPC codes

Authors: Simeon Ball, Tomàs Ortega

Abstract: We detail for the first time a complete explicit description of the quasi-cyclic structure of all classical finite generalized quadrangles. Using these descriptions we construct families of quasi-cyclic LDPC codes derived from the point-line incidence matrix of the quadrangles by explicitly calculating quasi-cyclic generator and parity check matrices for these codes. This allows us to construct parity check and generator matrices of all such codes of length up to 400000. These codes cover a wide range of transmission rates, are easy and fast to implement and perform close to Shannon's limit with no visible error floors. We also include some performance data for these codes. Furthermore, we include a complete explicit description of the quasi-cyclic structure of the point-line and point-hyperplane incidences of the finite projective and affine spaces.

Submitted 30 May, 2024; originally announced May 2024.

MSC Class: 68P30; 51E12 ACM Class: H.1.1; E.4; G.2.2

arXiv:2401.06618 [pdf, ps, other]

Stabiliser codes over fields of even order

Authors: Simeon Ball, Edgar Moreno, Robin Simoens

Abstract: We prove that the natural isomorphism between GF(2^h) and GF(2)^h induces a bijection between stabiliser codes on n quqits with local dimension q=2^h and binary stabiliser codes on hn qubits. This allows us to describe these codes geometrically: a stabiliser code over a field of even order corresponds to a so-called quantum set of symplectic polar spaces. Moreover, equivalent stabiliser codes have a similar geometry, which can be used to prove the uniqueness of a [[4,0,3]]_4 stabiliser code and the nonexistence of both a [[7,1,4]]_4 and an [[8,0,5]]_4 stabiliser code.

Submitted 12 January, 2024; originally announced January 2024.

MSC Class: 94B27

arXiv:2305.14883 [pdf, other]

Classical and quantum cyclic redundancy check codes

Authors: Simeon Ball, Ricard Vilar

Abstract: We prove that certain classical cyclic redundancy check codes can be used for classical error correction and not just classical error detection. We extend the idea of classical cyclic redundancy check codes to quantum cyclic redundancy check codes. This allows us to construct quantum stabiliser codes which can correct burst errors where the burst length attains the quantum Reiger bound. We then consider a certain family of quantum cyclic redundancy check codes for which we present a fast linear time decoding algorithm.

Submitted 12 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

MSC Class: 94B20; 94B15; 94B35

arXiv:2303.05349 [pdf, other]

Seeing ChatGPT Through Students' Eyes: An Analysis of TikTok Data

Authors: Anna-Carolina Haensch, Sarah Ball, Markus Herklotz, Frauke Kreuter

Abstract: Advanced large language models like ChatGPT have gained considerable attention recently, including among students. However, while the debate on ChatGPT in academia is making waves, more understanding is needed among lecturers and teachers on how students use and perceive ChatGPT. To address this gap, we analyzed the content on ChatGPT available on TikTok in February 2023. TikTok is a rapidly growing social media platform popular among individuals under 30. Specifically, we analyzed the content of the 100 most popular videos in English tagged with #chatgpt, which collectively garnered over 250 million views. Most of the videos we studied promoted the use of ChatGPT for tasks like writing essays or code. In addition, many videos discussed AI detectors, with a focus on how other tools can help to transform ChatGPT output to fool these detectors. This also mirrors the discussion among educators on how to treat ChatGPT as lecturers and teachers in teaching and grading. What is, however, missing from the analyzed clips on TikTok are videos that discuss ChatGPT producing content that is nonsensical or unfaithful to the training data.

Submitted 9 March, 2023; originally announced March 2023.

arXiv:2301.00876 [pdf, other]

MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding

Authors: Steven H. Wang, Antoine Scardigli, Leonard Tang, Wei Chen, Dimitry Levkin, Anya Chen, Spencer Ball, Thomas Woodside, Oliver Zhang, Dan Hendrycks

Abstract: Reading comprehension of legal text can be a particularly challenging task due to the length and complexity of legal clauses and a shortage of expert-annotated datasets. To address this challenge, we introduce the Merger Agreement Understanding Dataset (MAUD), an expert-annotated reading comprehension dataset based on the American Bar Association's 2021 Public Target Deal Points Study, with over 39,000 examples and over 47,000 total annotations. Our fine-tuned Transformer baselines show promising results, with models performing well above random on most questions. However, on a large subset of questions, there is still room for significant improvement. As the only expert-annotated merger agreement dataset, MAUD is valuable as a benchmark for both the legal profession and the NLP community.

Submitted 24 November, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

Comments: EMNLP 2023. 5 pages + appendix. Code and dataset are available at https://github.com/TheAtticusProject/maud

arXiv:2209.09767 [pdf, ps, other]

On additive MDS codes with linear projections

Authors: Sam Adriaensen, Simeon Ball

Abstract: We support some evidence that a long additive MDS code over a finite field must be equivalent to a linear code. More precisely, let $C$ be an $\mathbb F_q$-linear $(n,q^{hk},n-k+1)_{q^h}$ MDS code over $\mathbb F_{q^h}$. If $k=3$, $h \in \{2,3\}$, $n > \max \{q^{h-1},h q -1\} + 3$, and $C$ has three coordinates from which its projections are equivalent to linear codes, we prove that $C$ itself is equivalent to a linear code. If $k>3$, $n > q+k$, and there are two disjoint subsets of coordinates whose combined size is at most $k-2$ from which the projections of $C$ are equivalent to linear codes, we prove that $C$ is equivalent to a code which is linear over a larger field than $\mathbb F_q$.

Submitted 20 September, 2022; originally announced September 2022.

Comments: 15 pages

MSC Class: 94B05; 51E22

arXiv:2112.11896 [pdf, ps, other]

The Grassl-Rötteler cyclic and consta-cyclic MDS codes are generalised Reed-Solomon codes

Authors: Simeon Ball

Abstract: We prove that the cyclic and constacyclic codes constructed by Grassl and Rötteler in arXiv:1502.05267 are generalised Reed-Solomon codes. This note can be considered as an addendum to that article. It can also be considered as an appendix to arXiv:2106.10180, where Conjecture 11 of arXiv:1502.0526, which was stated for Grassl-Rötteler codes, is proven for generalised Reed-Solomon codes. The content of this note, together with arXiv:2106.10180, therefore implies that Conjecture 11 from arXiv:1502.0526 is true.

Submitted 15 June, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

MSC Class: 94B05

arXiv:2108.08088 [pdf, ps, other]

The geometry of Hermitian self-orthogonal codes

Authors: Simeon Ball, Ricard Vilar

Abstract: We prove that if $n >k^2$ then a $k$-dimensional linear code of length $n$ over ${\mathbb F}_{q^2}$ has a truncation which is linearly equivalent to a Hermitian self-orthogonal linear code. In the contrary case we prove that truncations of linear codes to codes equivalent to Hermitian self-orthogonal linear codes occur when the columns of a generator matrix of the code do not impose independent conditions on the space of Hermitian forms. In the case that there are more than $n$ common zeros to the set of Hermitian forms which are zero on the columns of a generator matrix of the code, the additional zeros give the extension of the code to a code that has a truncation which is equivalent to a Hermitian self-orthogonal code.

Submitted 18 August, 2021; originally announced August 2021.

MSC Class: 94Bxx; 11E39; 51Exx

arXiv:2107.11281 [pdf, ps, other]

The geometry of non-additive stabiliser codes

Authors: Simeon Ball, Pablo Puig

Abstract: We present a geometric framework for constructing additive and non-additive stabiliser codes which encompasses stabiliser codes and graphical non-additive stabiliser codes.

Submitted 23 July, 2021; originally announced July 2021.

arXiv:2107.07912 [pdf, ps, other]

The equivalence of linear codes implies semi-linear equivalence

Authors: Simeon Ball, James Dixon

Abstract: We prove that if two linear codes are equivalent then they are semi-linearly equivalent. We also prove that if two additive MDS codes over a field are equivalent then they are additively equivalent.

Submitted 16 July, 2021; originally announced July 2021.

arXiv:2106.10180 [pdf, ps, other]

Determining when a truncated generalised Reed-Solomon code is Hermitian self-orthogonal

Authors: Simeon Ball, Ricard Vilar

Abstract: We prove that there is a Hermitian self-orthogonal $k$-dimensional truncated generalised Reed-Solomon code of length $n \leqslant q^2$ over ${\mathbb F}_{q^2}$ if and only if there is a polynomial $g \in {\mathbb F}_{q^2}$ of degree at most $(q-k)q-1$ such that $g+g^q$ has $q^2-n$ distinct zeros. This allows us to determine the smallest $n$ for which there is a Hermitian self-orthogonal $k$-dimensional truncated generalised Reed-Solomon code of length $n$ over ${\mathbb F}_{q^2}$, verifying a conjecture of Grassl and Rötteler. We also provide examples of Hermitian self-orthogonal $k$-dimensional generalised Reed-Solomon codes of length $q^2+1$ over ${\mathbb F}_{q^2}$, for $k=q-1$ and $q$ an odd power of two.

Submitted 23 December, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

MSC Class: 94B05 ACM Class: E.4

arXiv:2103.06268 [pdf, other]

CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review

Authors: Dan Hendrycks, Collin Burns, Anya Chen, Spencer Ball

Abstract: Many specialized domains remain untouched by deep learning, as large labeled datasets require expensive expert annotators. We address this bottleneck within the legal domain by introducing the Contract Understanding Atticus Dataset (CUAD), a new dataset for legal contract review. CUAD was created with dozens of legal experts from The Atticus Project and consists of over 13,000 annotations. The task is to highlight salient portions of a contract that are important for a human to review. We find that Transformer models have nascent performance, but that this performance is strongly influenced by model design and training dataset size. Despite these promising results, there is still substantial room for improvement. As one of the only large, specialized NLP benchmarks annotated by experts, CUAD can serve as a challenging research benchmark for the broader NLP community.

Submitted 8 November, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

Comments: NeurIPS 2021. Code and the CUAD dataset are available at https://github.com/TheAtticusProject/cuad/

arXiv:2012.06183 [pdf, ps, other]

On additive MDS codes over small fields

Authors: Simeon Ball, Guillermo Gamboa, Michel Lavrauw

Abstract: Let $C$ be a $(n,q^{2k},n-k+1)_{q^2}$ additive MDS code which is linear over ${\mathbb F}_q$. We prove that if $n \geqslant q+k$ and $k+1$ of the projections of $C$ are linear over ${\mathbb F}_{q^2}$ then $C$ is linear over ${\mathbb F}_{q^2}$. We use this geometrical theorem, other geometric arguments and some computations to classify all additive MDS codes over ${\mathbb F}_q$ for $q \in \{4,8,9\}$. We also classify the longest additive MDS codes over ${\mathbb F}_{16}$ which are linear over ${\mathbb F}_4$. In these cases, the classifications not only verify the MDS conjecture for additive codes, but also confirm there are no additive non-linear MDS codes which perform as well as their linear counterparts. These results imply that the quantum MDS conjecture holds for $q \in \{ 2,3\}$.

Submitted 11 December, 2020; originally announced December 2020.

MSC Class: 94B27; 51E22

arXiv:1709.04558 [pdf]

Using NLU in Context for Question Answering: Improving on Facebook's bAbI Tasks

Authors: John S. Ball

Abstract: For the next step in human to machine interaction, Artificial Intelligence (AI) should interact predominantly using natural language because, if it worked, it would be the fastest way to communicate. Facebook's toy tasks (bAbI) provide a useful benchmark to compare implementations for conversational AI. While the published experiments so far have been based on exploiting the distributional hypothesis with machine learning, our model exploits natural language understanding (NLU) with the decomposition of language based on Role and Reference Grammar (RRG) and the brain-based Patom theory. Our combinatorial system for conversational AI based on linguistics has many advantages: passing bAbI task tests without parsing or statistics while increasing scalability. Our model validates both the training and test data to find 'garbage' input and output (GIGO). It is not rules-based, nor does it use parts of speech, but instead relies on meaning. While Deep Learning is difficult to debug and fix, every step in our model can be understood and changed like any non-statistical computer program. Deep Learning's lack of explicable reasoning has raised opposition to AI, partly due to fear of the unknown. To support the goals of AI, we propose extended tasks to use human-level statements with tense, aspect and voice, and embedded clauses with junctures: and answers to be natural language generation (NLG) instead of keywords.
While machine learning permits invalid training data to produce incorrect test responses, our system cannot because the context tracking would need to be intentionally broken. We believe no existing learning systems can currently solve these extended natural language tests. There appears to be a knowledge gap between NLP researchers and linguists, but ongoing competitive results such as these promise to narrow that gap.

Submitted 20 September, 2017; v1 submitted 13 September, 2017; originally announced September 2017.

Comments: 38 Pages, 10 Tables

Showing 1–16 of 16 results for author: Ball, S