Skip to main content

Showing 1–7 of 7 results for author: Chan, R S

.
  1. arXiv:2406.02329  [pdf, other

    cs.CL cs.LG

    On Affine Homotopy between Language Encoders

    Authors: Robin SM Chan, Reda Boumasmoud, Anej Svete, Yuxin Ren, Qipeng Guo, Zhi**g **, Shauli Ravfogel, Mrinmaya Sachan, Bernhard Schölkopf, Mennatallah El-Assady, Ryan Cotterell

    Abstract: Pre-trained language encoders -- functions that represent text as vectors -- are an integral component of many NLP tasks. We tackle a natural question in language encoder analysis: What does it mean for two encoders to be similar? We contend that a faithful measure of similarity needs to be \emph{intrinsic}, that is, task-independent, yet still be informative of \emph{extrinsic} similarity -- the… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages

  2. arXiv:2405.00708  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Interactive Analysis of LLMs using Meaningful Counterfactuals

    Authors: Furui Cheng, Vilém Zouhar, Robin Shing Moon Chan, Daniel Fürst, Hendrik Strobelt, Mennatallah El-Assady

    Abstract: Counterfactual examples are useful for exploring the decision boundaries of machine learning models and determining feature attributions. How can we apply counterfactual-based methods to analyze and explain LLMs? We identify the following key challenges. First, the generated textual counterfactuals should be meaningful and readable to users and thus can be mentally compared to draw conclusions. Se… ▽ More

    Submitted 23 April, 2024; originally announced May 2024.

    ACM Class: I.2.7; H.5.2

  3. arXiv:2402.15814  [pdf, other

    cs.CL cs.CC cs.LG

    On Efficiently Representing Regular Languages as RNNs

    Authors: Anej Svete, Robin Shing Moon Chan, Ryan Cotterell

    Abstract: Recent work by Hewitt et al. (2020) provides an interpretation of the empirical success of recurrent neural networks (RNNs) as language models (LMs). It shows that RNNs can efficiently represent bounded hierarchical structures that are prevalent in human language. This suggests that RNNs' success might be linked to their ability to model hierarchy. However, a closer inspection of Hewitt et al.'s (… ▽ More

    Submitted 18 June, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  4. arXiv:2312.03523  [pdf, other

    cs.CL

    Sig-Networks Toolkit: Signature Networks for Longitudinal Language Modelling

    Authors: Talia Tseriotou, Ryan Sze-Yin Chan, Adam Tsakalidis, Iman Munire Bilal, Elena Kochkina, Terry Lyons, Maria Liakata

    Abstract: We present an open-source, pip installable toolkit, Sig-Networks, the first of its kind for longitudinal language modelling. A central focus is the incorporation of Signature-based Neural Network models, which have recently shown success in temporal tasks. We apply and extend published research providing a full suite of signature-based models. Their components can be used as PyTorch building block… ▽ More

    Submitted 6 February, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: To appear in EACL 2024: System Demonstrations

  5. arXiv:2309.09770  [pdf, other

    cs.AI

    How to Data in Datathons

    Authors: Carlos Mougan, Richard Plant, Clare Teng, Marya Bazzi, Alvaro Cabrejas-Egea, Ryan Sze-Yin Chan, David Salvador Jasin, Martin Stoffel, Kirstie Jane Whitaker, Jules Manser

    Abstract: The rise of datathons, also known as data or data science hackathons, has provided a platform to collaborate, learn, and innovate in a short timeframe. Despite their significant potential benefits, organizations often struggle to effectively work with data due to a lack of clear guidelines and best practices for potential issues that might arise. Drawing on our own experiences and insights from or… ▽ More

    Submitted 25 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmark

  6. arXiv:2110.07265  [pdf, ps, other

    stat.ME stat.CO stat.ML

    Divide-and-Conquer Fusion

    Authors: Ryan S. Y. Chan, Murray Pollock, Adam M. Johansen, Gareth O. Roberts

    Abstract: Combining several (sample approximations of) distributions, which we term sub-posteriors, into a single distribution proportional to their product, is a common challenge. Occurring, for instance, in distributed 'big data' problems, or when working under multi-party privacy constraints. Many existing approaches resort to approximating the individual sub-posteriors for practical necessity, then find… ▽ More

    Submitted 12 July, 2023; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 73 pages, 14 figures

  7. arXiv:2006.03487  [pdf, other

    cs.LG stat.ML

    Dimensionless Anomaly Detection on Multivariate Streams with Variance Norm and Path Signature

    Authors: Zhen Shao, Ryan Sze-Yin Chan, Thomas Cochrane, Peter Foster, Terry Lyons

    Abstract: In this paper, we propose a dimensionless anomaly detection method for multivariate streams. Our method is independent of the unit of measurement for the different stream channels, therefore dimensionless. We first propose the variance norm, a generalisation of Mahalanobis distance to handle infinite-dimensional feature space and singular empirical covariance matrix rigorously. We then combine the… ▽ More

    Submitted 6 December, 2023; v1 submitted 5 June, 2020; originally announced June 2020.