Skip to main content

Showing 1–3 of 3 results for author: Quirke, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.02619  [pdf, other

    cs.LG cs.CL

    Increasing Trust in Language Models through the Reuse of Verified Circuits

    Authors: Philip Quirke, Clement Neo, Fazl Barez

    Abstract: Language Models (LMs) are increasingly used for a wide range of prediction tasks, but their training can often neglect rare edge cases, reducing their reliability. Here, we define a stringent standard of trustworthiness whereby the task algorithm and circuit implementation must be verified, accounting for edge cases, with no known failure modes. We show that a model can be trained to meet this sta… ▽ More

    Submitted 16 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: 8 pages, 4 figures

  2. arXiv:2310.13121  [pdf, other

    cs.LG cs.AI

    Understanding Addition in Transformers

    Authors: Philip Quirke, Fazl Barez

    Abstract: Understanding the inner workings of machine learning models like Transformers is vital for their safe and ethical use. This paper provides a comprehensive analysis of a one-layer Transformer model trained to perform n-digit integer addition. Our findings suggest that the model dissects the task into parallel streams dedicated to individual digits, employing varied algorithms tailored to different… ▽ More

    Submitted 23 April, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 9 pages, 8 figures, accepted by ICLR 2024

  3. arXiv:2301.09617  [pdf, other

    cs.CV

    Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study

    Authors: Sophia J. Wagner, Daniel Reisenbüchler, Nicholas P. West, Jan Moritz Niehues, Gregory Patrick Veldhuizen, Philip Quirke, Heike I. Grabsch, Piet A. van den Brandt, Gordon G. A. Hutchins, Susan D. Richman, Tanwei Yuan, Rupert Langer, Josien Christina Anna Jenniskens, Kelly Offermans, Wolfram Mueller, Richard Gray, Stephen B. Gruber, Joel K. Greenson, Gad Rennert, Joseph D. Bonner, Daniel Schmolze, Jacqueline A. James, Maurice B. Loughrey, Manuel Salto-Tellez, Hermann Brenner , et al. (6 additional authors not shown)

    Abstract: Background: Deep learning (DL) can extract predictive and prognostic biomarkers from routine pathology slides in colorectal cancer. For example, a DL test for the diagnosis of microsatellite instability (MSI) in CRC has been approved in 2022. Current approaches rely on convolutional neural networks (CNNs). Transformer networks are outperforming CNNs and are replacing them in many applications, but… ▽ More

    Submitted 1 March, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Updated Figure 2 and Table A.5