Skip to main content

Showing 1–5 of 5 results for author: Sutton, O J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12670  [pdf, other

    cs.AI cs.LG

    Stealth edits for provably fixing or attacking large language models

    Authors: Oliver J. Sutton, Qinghua Zhou, Wei Wang, Desmond J. Higham, Alexander N. Gorban, Alexander Bastounis, Ivan Y. Tyukin

    Abstract: We reveal new methods and the theoretical foundations of techniques for editing large language models. We also show how the new theory can be used to assess the editability of models and to expose their susceptibility to previously unknown malicious attacks. Our theoretical approach shows that a single metric (a specific measure of the intrinsic dimensionality of the model's features) is fundament… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 24 pages, 9 figures. Open source implementation: https://github.com/qinghua-zhou/stealth-edits

    MSC Class: 68T07; 68T50; 68W40 ACM Class: I.2.7; F.2.0

  2. arXiv:2402.00899  [pdf, other

    cs.LG cs.AI stat.ML

    Weakly Supervised Learners for Correction of AI Errors with Provable Performance Guarantees

    Authors: Ivan Y. Tyukin, Tatiana Tyukina, Daniel van Helden, Zedong Zheng, Evgeny M. Mirkes, Oliver J. Sutton, Qinghua Zhou, Alexander N. Gorban, Penelope Allison

    Abstract: We present a new methodology for handling AI errors by introducing weakly supervised AI error correctors with a priori performance guarantees. These AI correctors are auxiliary maps whose role is to moderate the decisions of some previously constructed underlying classifier by either approving or rejecting its decisions. The rejection of a decision can be used as a signal to suggest abstaining fro… ▽ More

    Submitted 13 February, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    MSC Class: 68T05; 68T37

  3. Relative intrinsic dimensionality is intrinsic to learning

    Authors: Oliver J. Sutton, Qinghua Zhou, Alexander N. Gorban, Ivan Y. Tyukin

    Abstract: High dimensional data can have a surprising property: pairs of data points may be easily separated from each other, or even from arbitrary subsets, with high probability using just simple linear classifiers. However, this is more of a rule of thumb than a reliable property as high dimensionality alone is neither necessary nor sufficient for successful learning. Here, we introduce a new notion of t… ▽ More

    Submitted 10 October, 2023; originally announced November 2023.

    Comments: 12 pages, 5 figures

    MSC Class: 68T09; 68T10

    Journal ref: Artificial Neural Networks and Machine Learning ICANN 2023. Lecture Notes in Computer Science, vol 14254, pp 516-529. Springer, Cham

  4. arXiv:2309.03665  [pdf, other

    cs.LG cs.AI

    How adversarial attacks can disrupt seemingly stable accurate classifiers

    Authors: Oliver J. Sutton, Qinghua Zhou, Ivan Y. Tyukin, Alexander N. Gorban, Alexander Bastounis, Desmond J. Higham

    Abstract: Adversarial attacks dramatically change the output of an otherwise accurate learning system using a seemingly inconsequential modification to a piece of input data. Paradoxically, empirical evidence indicates that even systems which are robust to large random perturbations of the input data remain susceptible to small, easily constructed, adversarial perturbations of their inputs. Here, we show th… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 11 pages, 8 figures, additional supplementary materials

  5. arXiv:2211.03607  [pdf, other

    cs.LG cs.AI cs.CV

    Towards a mathematical understanding of learning from few examples with nonlinear feature maps

    Authors: Oliver J. Sutton, Alexander N. Gorban, Ivan Y. Tyukin

    Abstract: We consider the problem of data classification where the training set consists of just a few data points. We explore this phenomenon mathematically and reveal key relationships between the geometry of an AI model's feature space, the structure of the underlying data distributions, and the model's generalisation capabilities. The main thrust of our analysis is to reveal the influence on the model's… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 18 pages, 8 figures

    MSC Class: 68Q32; 68T05