Showing 1–50 of 70 results for author: Henderson, J

  1. arXiv:2406.06397

    q-bio.BM cs.AI cs.LG

    Contrastive learning of T cell receptor representations

    Authors: Yuta Nagano, Andrew Pyo, Martina Milighetti, James Henderson, John Shawe-Taylor, Benny Chain, Andreas Tiffeau-Mayer

    Abstract: Computational prediction of the interaction of T cell receptors (TCRs) and their ligands is a grand challenge in immunology. Despite advances in high-throughput assays, specificity-labelled TCR data remains sparse. In other domains, the pre-training of language models on unlabelled data has been successfully used to address data bottlenecks. However, it is unclear how to best pre-train protein lan… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 19 pages, 17 figures

    ACM Class: J.3; I.2.7

  2. arXiv:2404.17142

    quant-ph cs.CR

    Automated Quantum Circuit Generation for Computing Inverse Hash Functions

    Authors: Elena R. Henderson, Jessie M. Henderson, William V. Oxford, Mitchell A. Thornton

    Abstract: Several cryptographic systems depend upon the computational difficulty of reversing cryptographic hash functions. Robust hash functions transform inputs to outputs in such a way that the inputs cannot be later retrieved in a reasonable amount of time even if the outputs and the function that created them are known. Consequently, hash functions can be cryptographically secure, and they are employed… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 12 pages, 9 figures, 1 table

  3. arXiv:2404.12565

    q-bio.BM cond-mat.stat-mech cs.IT

    Limits on Inferring T-cell Specificity from Partial Information

    Authors: James Henderson, Yuta Nagano, Martina Milighetti, Andreas Tiffeau-Mayer

    Abstract: A key challenge in molecular biology is to decipher the mapping of protein sequence to function. To perform this mapping requires the identification of sequence features most informative about function. Here, we quantify the amount of information (in bits) that T-cell receptor (TCR) sequence features provide about antigen specificity. We identify informative features by their degree of conservatio… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 24 pages, 15 figures

  4. arXiv:2404.02440

    cs.CR physics.optics

    Designing a Photonic Physically Unclonable Function Having Resilience to Machine Learning Attacks

    Authors: Elena R. Henderson, Jessie M. Henderson, Hiva Shahoei, William V. Oxford, Eric C. Larson, Duncan L. MacFarlane, Mitchell A. Thornton

    Abstract: Physically unclonable functions (PUFs) are designed to act as device 'fingerprints.' Given an input challenge, the PUF circuit should produce an unpredictable response for use in situations such as root-of-trust applications and other hardware-level cybersecurity applications. PUFs are typically subcircuits present within integrated circuits (ICs), and while conventional IC PUFs are well-understoo… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 14 pages, 8 figures

  5. arXiv:2403.01299

    cs.CR cs.LG

    A Photonic Physically Unclonable Function's Resilience to Multiple-Valued Machine Learning Attacks

    Authors: Jessie M. Henderson, Elena R. Henderson, Clayton A. Harper, Hiva Shahoei, William V. Oxford, Eric C. Larson, Duncan L. MacFarlane, Mitchell A. Thornton

    Abstract: Physically unclonable functions (PUFs) identify integrated circuits using nonlinearly-related challenge-response pairs (CRPs). Ideally, the relationship between challenges and corresponding responses is unpredictable, even if a subset of CRPs is known. Previous work developed a photonic PUF offering improved security compared to non-optical counterparts. Here, we investigate this PUF's susceptibil… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures

  6. arXiv:2312.00662

    cs.LG cs.CL

    Nonparametric Variational Regularisation of Pretrained Transformers

    Authors: Fabio Fehr, James Henderson

    Abstract: The current paradigm of large-scale pre-training and fine-tuning Transformer large language models has led to significant improvements across the board in natural language processing. However, such large models are susceptible to overfitting to their training data, and as a result the models perform poorly when the domain changes. Also, due to the model's scale, the cost of fine-tuning the model… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  7. arXiv:2311.03611

    cs.HC cs.LG q-bio.NC

    Plug-and-Play Stability for Intracortical Brain-Computer Interfaces: A One-Year Demonstration of Seamless Brain-to-Text Communication

    Authors: Chaofei Fan, Nick Hahn, Foram Kamdar, Donald Avansino, Guy H. Wilson, Leigh Hochberg, Krishna V. Shenoy, Jaimie M. Henderson, Francis R. Willett

    Abstract: Intracortical brain-computer interfaces (iBCIs) have shown promise for restoring rapid communication to people with neurological disorders such as amyotrophic lateral sclerosis (ALS). However, to maintain high performance over time, iBCIs typically need frequent recalibration to combat changes in the neural recordings that accrue over days. This requires iBCI users to stop using the iBCI and engag… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  8. arXiv:2311.02339

    cs.DC

    Progress Metrics in DAG-based Consensus

    Authors: Quan Nguyen, James Henderson, Egor Lysenko

    Abstract: Lachesis protocol~\cite{lachesis2021} leverages a DAG of events to allow nodes to reach fast consensus of events. This work introduces DAG progress metrics to drive the nodes to emit new events more effectively. With these metrics, nodes can select event timing and can choose previous events as parents for their own new events. Our results show that our event timing and parent selection methods ca… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  9. arXiv:2311.02096

    physics.acc-ph cs.LG

    Variational Autoencoders for Noise Reduction in Industrial LLRF Systems

    Authors: J. P. Edelen, M. J. Henderson, J. Einstein-Curtis, C. C. Hall, J. A. Diaz Cruz, A. L. Edelen

    Abstract: Industrial particle accelerators inherently operate in much dirtier environments than typical research accelerators. This leads to an increase in noise both in the RF system and in other electronic systems. Combined with the fact that industrial accelerators are mass produced, there is less attention given to optimizing the performance of an individual system. As a result, industrial systems tend… ▽ More

    Submitted 7 November, 2023; v1 submitted 29 October, 2023; originally announced November 2023.

    Comments: Talk presented at LLRF Workshop 2023 (LLRF2023, arXiv: 2310.03199)

    Report number: LLRF2023/97

  10. arXiv:2310.17936

    cs.CL cs.AI cs.LG

    Transformers as Graph-to-Graph Models

    Authors: James Henderson, Alireza Mohammadshahi, Andrei C. Coman, Lesly Miculicich

    Abstract: We argue that Transformers are essentially graph-to-graph models, with sequences just being a special case. Attention weights are functionally equivalent to graph edges. Our Graph-to-Graph Transformer architecture makes this ability explicit, by inputting graph edges into the attention weight computations and predicting graph edges with attention-like functions, thereby integrating explicit graphs… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to Big Picture workshop at EMNLP 2023

  11. arXiv:2310.17284

    cs.CL

    Learning to Abstract with Nonparametric Variational Information Bottleneck

    Authors: Melika Behjati, Fabio Fehr, James Henderson

    Abstract: Learned representations at the level of characters, sub-words, words and sentences, have each contributed to advances in understanding different NLP tasks and linguistic phenomena. However, learning textual embeddings is costly as they are tokenization specific and require different models to be trained for each level of abstraction. We introduce a novel language representation model which can lea… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of EMNLP 2023

  12. arXiv:2308.14423

    cs.CL

    GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction

    Authors: Andrei C. Coman, Christos Theodoropoulos, Marie-Francine Moens, James Henderson

    Abstract: Document-level relation extraction typically relies on text-based encoders and hand-coded pooling heuristics to aggregate information learned by the encoder. In this paper, we leverage the intrinsic graph processing capabilities of the Transformer model and propose replacing hand-coded pooling methods with new tokens in the input, which are designed to aggregate information via explicit graph rela… ▽ More

    Submitted 18 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted to KnowledgeNLP workshop at ACL 2024

  13. arXiv:2305.08379

    cs.CL cs.LG

    TESS: Text-to-Text Self-Conditioned Simplex Diffusion

    Authors: Rabeeh Karimi Mahabadi, Hamish Ivison, Jaesung Tae, James Henderson, Iz Beltagy, Matthew E. Peters, Arman Cohan

    Abstract: Diffusion models have emerged as a powerful paradigm for generation, obtaining strong performance in various continuous domains. However, applying continuous diffusion models to natural language remains challenging due to its discrete nature and the need for a large number of diffusion steps to generate text, making diffusion-based generation expensive. In this work, we propose Text-to-text Self-c… ▽ More

    Submitted 20 February, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: EACL 2024

  14. arXiv:2211.09860

    quant-ph cs.ET

    Automated Quantum Memory Compilation with Improved Dynamic Range

    Authors: Aviraj Sinha, Elena R. Henderson, Jessie M. Henderson, Mitchell A. Thornton

    Abstract: Emerging quantum algorithms that process data require that classical input data be represented as a quantum state. These data-processing algorithms often follow the gate model of quantum computing--which requires qubits to be initialized to a basis state, typically $\lvert 0 \rangle$--and thus often employ state generation circuits to transform the initialized basis state to a data-representation… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 14 pages, 9 figures, and 13 tables

  15. arXiv:2211.01482

    cs.CL cs.AI cs.LG

    RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question

    Authors: Alireza Mohammadshahi, Thomas Scialom, Majid Yazdani, Pouya Yanki, Angela Fan, James Henderson, Marzieh Saeidi

    Abstract: Existing metrics for evaluating the quality of automatically generated questions such as BLEU, ROUGE, BERTScore, and BLEURT compare the reference and predicted questions, providing a high score when there is a considerable lexical overlap or semantic similarity between the candidate and the reference questions. This approach has two major shortcomings. First, we need expensive human-provided refer… ▽ More

    Submitted 26 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted to Findings of ACL 2023

  16. arXiv:2210.14702

    cs.CR

    Privacy Analysis of Samsung's Crowd-Sourced Bluetooth Location Tracking System

    Authors: Tingfeng Yu, James Henderson, Alwen Tiu, Thomas Haines

    Abstract: We present a detailed privacy analysis of Samsung's Offline Finding (OF) protocol, which is part of Samsung's Find My Mobile (FMM) location tracking system for locating Samsung mobile devices, such as Samsung smartphones and Bluetooth trackers (Galaxy SmartTags). The OF protocol uses Bluetooth Low Energy (BLE) to broadcast a unique beacon for a lost device. This beacon is then picked up by nearby… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  17. arXiv:2210.11685

    quant-ph cs.CE physics.comp-ph

    Quantum Algorithms for Geologic Fracture Networks

    Authors: Jessie M. Henderson, Marianna Podzorova, M. Cerezo, John K. Golden, Leonard Gleyzer, Hari S. Viswanathan, Daniel O'Malley

    Abstract: Solving large systems of equations is a challenge for modeling natural phenomena, such as simulating subsurface flow. To avoid systems that are intractable on current computers, it is often necessary to neglect information at small scales, an approach known as coarse-graining. For many practical applications, such as flow in porous, homogenous materials, coarse-graining offers a sufficiently-accur… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 20 pages, 12 figures

    Report number: LA-UR-22-29135

    Journal ref: Sci Rep 13, 2906 (2023)

  18. arXiv:2210.11621

    cs.CL cs.AI cs.LG

    SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages

    Authors: Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson, Laurent Besacier

    Abstract: In recent years, multilingual machine translation models have achieved promising performance on low-resource language pairs by sharing information between similar languages, thus enabling zero-shot translation. To overcome the "curse of multilinguality", these models often opt for scaling up the number of parameters, which makes their use in resource-constrained environments challenging. We introd… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

    Journal ref: https://aclanthology.org/2022.emnlp-main.571

  19. arXiv:2210.06578

    cs.LG

    FASTER-CE: Fast, Sparse, Transparent, and Robust Counterfactual Explanations

    Authors: Shubham Sharma, Alan H. Gee, Jette Henderson, Joydeep Ghosh

    Abstract: Counterfactual explanations have substantially increased in popularity in the past few years as a useful human-centric way of understanding individual black-box model predictions. While several properties desired of high-quality counterfactuals have been identified in the literature, three crucial concerns: the speed of explanation generation, robustness/sensitivity and succinctness of explanation… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  20. arXiv:2210.04995

    cs.LG cs.AI cs.CY

    FEAMOE: Fair, Explainable and Adaptive Mixture of Experts

    Authors: Shubham Sharma, Jette Henderson, Joydeep Ghosh

    Abstract: Three key properties that are desired of trustworthy machine learning models deployed in high-stakes environments are fairness, explainability, and an ability to account for various kinds of "drift". While drifts in model accuracy, for example due to covariate shift, have been widely investigated, drifts in fairness metrics over time remain largely unexplored. In this paper, we propose FEAMOE, a n… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

  21. arXiv:2208.01710

    cs.RO

    Smart Visual Beacons with Asynchronous Optical Communications using Event Cameras

    Authors: Ziwei Wang, Yonhon Ng, Jack Henderson, Robert Mahony

    Abstract: Event cameras are bio-inspired dynamic vision sensors that respond to changes in image intensity with a high temporal resolution, high dynamic range and low latency. These sensor characteristics are ideally suited to enable visual target tracking in concert with a broadcast visual communication channel for smart visual beacons with applications in distributed robotics. Visual beacons can be constr… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 7 pages, 8 figures, accepted by IEEE International Conference on Intelligent Robots and Systems (IROS) 2022

  22. arXiv:2207.13529

    cs.LG cs.CL

    A Variational AutoEncoder for Transformers with Nonparametric Variational Information Bottleneck

    Authors: James Henderson, Fabio Fehr

    Abstract: We propose a VAE for Transformers by developing a variational information bottleneck regulariser for Transformer embeddings. We formalise the embedding space of Transformer encoders as mixture probability distributions, and use Bayesian nonparametrics to derive a nonparametric variational information bottleneck (NVIB) for such attention-based embeddings. The variable number of mixture components s… ▽ More

    Submitted 12 August, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: 33 pages, 10 figures, 3 tables. First time this work has been made public

  23. arXiv:2205.11456

    cs.CL

    Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers

    Authors: Luis Espinosa-Anke, Alexander Shvets, Alireza Mohammadshahi, James Henderson, Leo Wanner

    Abstract: Recognizing and categorizing lexical collocations in context is useful for language learning, dictionary compilation and downstream NLP. However, it is a challenging task due to the varying degrees of frozenness lexical collocations exhibit. In this paper, we put forward a sequence tagging BERT-based model enhanced with a graph-aware transformer architecture, which we evaluate on the task of collo… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted to *SEM2022

  24. arXiv:2205.10828

    cs.CL cs.AI cs.LG

    What Do Compressed Multilingual Machine Translation Models Forget?

    Authors: Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson, Laurent Besacier

    Abstract: Recently, very large pre-trained models achieve state-of-the-art results in various natural language processing (NLP) tasks, but their size makes it more challenging to apply them in resource-constrained environments. Compression techniques allow to drastically reduce the size of the models and therefore their inference time with negligible impact on top-tier metrics. However, the general performa… ▽ More

    Submitted 27 June, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Accepted to Findings of EMNLP 2022, presented at WMT 2022

    Journal ref: https://aclanthology.org/2022.findings-emnlp.317/

  25. arXiv:2204.01172

    cs.CL

    PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models

    Authors: Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, Marzieh Saeidi, Lambert Mathias, Veselin Stoyanov, Majid Yazdani

    Abstract: Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score. In this work, we propose PERFECT, a simple and efficient method for few-shot fine-tuning of PLMs without relying on any such handcrafting, which is highly effective given as few as… ▽ More

    Submitted 25 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: ACL, 2022

  26. arXiv:2203.16574

    cs.CL cs.LG

    Graph Refinement for Coreference Resolution

    Authors: Lesly Miculicich, James Henderson

    Abstract: The state-of-the-art models for coreference resolution are based on independent mention pair-wise decisions. We propose a modelling approach that learns coreference at the document-level and takes global decisions. For this purpose, we model coreference links in a graph structure where the nodes are tokens in the text, and the edges represent the relationship between them. Our model predicts the g… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

  27. arXiv:2203.03691

    cs.CL cs.AI cs.LG

    HyperMixer: An MLP-based Low Cost Alternative to Transformers

    Authors: Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, Francois Marelli, Francois Fleuret, James Henderson

    Abstract: Transformer-based architectures are the model of choice for natural language understanding, but they come at a significant cost, as they have quadratic complexity in the input length, require a lot of training data, and can be difficult to tune. In the pursuit of lower costs, we investigate simple MLP-based architectures. We find that existing architectures such as MLPMixer, which achieves token m… ▽ More

    Submitted 13 November, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Published at ACL 2023

    Journal ref: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

  28. arXiv:2110.07002

    cs.CL cs.AI cs.LG

    Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation

    Authors: Florian Mai, James Henderson

    Abstract: Text autoencoders are often used for unsupervised conditional text generation by applying mappings in the latent space to change attributes to the desired values. Recently, Mai et al. (2020) proposed Emb2Emb, a method to learn these mappings in the embedding space of an autoencoder. However, their method is restricted to autoencoders with a single-vector embedding, which limits how much informatio… ▽ More

    Submitted 4 February, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Published at AACL 2022

    Journal ref: In Proceedings of AACL/IJCNLP 2022, pages 468-488. Association of Computational Linguistics (2022)

  29. Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning

    Authors: Christos Theodoropoulos, James Henderson, Andrei C. Coman, Marie-Francine Moens

    Abstract: Though language model text embeddings have revolutionized NLP research, their ability to capture high-level semantic information, such as relations between entities in text, is limited. In this paper, we propose a novel contrastive learning framework that trains sentence embeddings to encode the relations in a graph structure. Given a sentence (unstructured text) and its graph, we use contrastive… ▽ More

    Submitted 4 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: To be presented at CoNLL 2021

    Journal ref: Conference: 2021 Proceedings of the 25th Conference on Computational Natural Language Learning

  30. arXiv:2107.01982

    cs.CL

    The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task

    Authors: James Barry, Alireza Mohammadshahi, Joachim Wagner, Jennifer Foster, James Henderson

    Abstract: We describe the DCU-EPFL submission to the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies. The task involves parsing Enhanced UD graphs, which are an extension of the basic dependency trees designed to be more facilitative towards representing semantic structure. Evaluation is carried out on 29 treebanks in 17 languages and participants are required to parse the data from ea… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: Submitted to the IWPT 2021 Shared Task: From Raw Text to Enhanced Universal Dependencies: the Parsing Shared Task at IWPT 2021

  31. arXiv:2106.05469

    cs.CL

    Variational Information Bottleneck for Effective Low-Resource Fine-Tuning

    Authors: Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

    Abstract: While large-scale pretrained language models have obtained impressive results when fine-tuned on a wide variety of tasks, they still often suffer from overfitting in low-resource scenarios. Since such models are general-purpose feature extractors, many of these features are inevitably irrelevant for a given target task. We propose to use Variational Information Bottleneck (VIB) to suppress irrelev… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: ICLR, 2021

  32. arXiv:2106.04647

    cs.CL

    Compacter: Efficient Low-Rank Hypercomplex Adapter Layers

    Authors: Rabeeh Karimi Mahabadi, James Henderson, Sebastian Ruder

    Abstract: Adapting large-scale pretrained language models to downstream tasks via fine-tuning is the standard method for achieving state-of-the-art performance on NLP benchmarks. However, fine-tuning all weights of models with millions or billions of parameters is sample-inefficient, unstable in low-resource settings, and wasteful as it requires storing a separate copy of the model for each task. Recent wor… ▽ More

    Submitted 27 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: accepted in NeurIPS, 2021

  33. arXiv:2106.04489

    cs.CL

    Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks

    Authors: Rabeeh Karimi Mahabadi, Sebastian Ruder, Mostafa Dehghani, James Henderson

    Abstract: State-of-the-art parameter-efficient fine-tuning methods rely on introducing adapter modules between the layers of a pretrained language model. However, such modules are trained separately for each task and thus do not enable sharing information across tasks. In this paper, we show that we can learn adapter parameters for all layers and tasks by generating them using shared hypernetworks, which co… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: accepted in ACL, 2021

  34. arXiv:2106.02623

    cs.CR

    The Closer You Look, The More You Learn: A Grey-box Approach to Protocol State Machine Learning

    Authors: Chris McMahon Stone, Sam L. Thomas, Mathy Vanhoef, James Henderson, Nicolas Bailluet, Tom Chothia

    Abstract: In this paper, we propose a new approach to infer state machine models from protocol implementations. Our method, STATEINSPECTOR, learns protocol states by using novel program analyses to combine observations of run-time memory and I/O. It requires no access to source code and only lightweight execution monitoring of the implementation under test. We demonstrate and evaluate STATEINSPECTOR's effec… ▽ More

    Submitted 7 June, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

  35. arXiv:2104.07704

    cs.CL

    Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling

    Authors: Alireza Mohammadshahi, James Henderson

    Abstract: Recent models have shown that incorporating syntactic knowledge into the semantic role labelling (SRL) task leads to a significant improvement. In this paper, we propose Syntax-aware Graph-to-Graph Transformer (SynG2G-Tr) model, which encodes the syntactic structure using a novel way to input graph relations as embeddings, directly into the self-attention mechanism of Transformer. This approach ad… ▽ More

    Submitted 2 June, 2023; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to Rep4NLP at ACL 2023

  36. arXiv:2104.05897

    cs.RO eess.SY

    Inertial Collaborative Localisation for Autonomous Vehicles using a Minimum Energy Filter

    Authors: Jack Henderson, Mohammad Zamani, Robert Mahony, Jochen Trumpf

    Abstract: Collaborative Localisation has been studied extensively in recent years as a way to improve pose estimation of unmanned aerial vehicles in challenging environments. However little attention has been paid toward advancing the underlying filter design beyond standard Extended Kalman Filter-based approaches. In this paper, we detail a discrete-time collaborative localisation filter using the determin… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Submitted to IEEE 2021 Conference on Decision and Control (CDC2021)

  37. arXiv:2102.01223

    cs.CL cs.LG

    Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention

    Authors: Melika Behjati, James Henderson

    Abstract: Characters do not convey meaning, but sequences of characters do. We propose an unsupervised distributional method to learn the abstract meaningful units in a sequence of characters. Rather than segmenting the sequence, our Dynamic Capacity Slot Attention model discovers continuous representations of the objects in the sequence, extending an architecture for object discovery in images. We train ou… ▽ More

    Submitted 16 January, 2024; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted to TMLR 2023

  38. arXiv:2010.08432

    cs.CL

    Multi-Adversarial Learning for Cross-Lingual Word Embeddings

    Authors: Haozhou Wang, James Henderson, Paola Merlo

    Abstract: Generative adversarial networks (GANs) have succeeded in inducing cross-lingual word embeddings -- maps of matching words across languages -- without supervision. Despite these successes, GANs' performance for the difficult case of distant languages is still not satisfactory. These limitations have been explained by GANs' incorrect assumption that source and target embedding spaces are related by… ▽ More

    Submitted 25 August, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

  39. arXiv:2010.02983

    cs.CL cs.AI

    Plug and Play Autoencoders for Conditional Text Generation

    Authors: Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith, James Henderson

    Abstract: Text autoencoders are commonly used for conditional generation tasks such as style transfer. We propose methods which are plug and play, where any pretrained autoencoder can be used, and only require learning a mapping within the autoencoder's embedding space, training embedding-to-embedding (Emb2Emb). This reduces the need for labeled training data for the task and makes the training procedure mo… ▽ More

    Submitted 12 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: To be published in EMNLP 2020

  40. A Minimum Energy Filter for Localisation of an Unmanned Aerial Vehicle

    Authors: Jack Henderson, Mohammad Zamani, Robert Mahony, Jochen Trumpf

    Abstract: Accurate localisation of unmanned aerial vehicles is vital for the next generation of automation tasks. This paper proposes a minimum energy filter for velocity-aided pose estimation on the extended special Euclidean group. The approach taken exploits the Lie-group symmetry of the problem to combine Inertial Measurement Unit (IMU) sensor output with landmark measurements into a robust and high per… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: To be presented at the 59th IEEE Conference on Decision and Control (CDC), 14-18 December 2020

  41. arXiv:2006.14621

    stat.ME cs.LG stat.ML

    Understanding collections of related datasets using dependent MMD coresets

    Authors: Sinead A. Williamson, Jette Henderson

    Abstract: Understanding how two datasets differ can help us determine whether one dataset under-represents certain sub-populations, and provides insights into how well models will generalize across datasets. Representative points selected by a maximum mean discrepancy (MMD) coreset can provide interpretable summaries of a single dataset, but are not easily compared across datasets. In this paper we introduc… ▽ More

    Submitted 4 August, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  42. A Minimum Energy Filter for Distributed Multirobot Localisation

    Authors: Jack Henderson, Jochen Trumpf, Mohammad Zamani

    Abstract: We present a new approach to the cooperative localisation problem by applying the theory of minimum energy filtering. We consider the problem of estimating the pose of a group of mobile robots in an environment where robots can perceive fixed landmarks and neighbouring robots as well as share information with others over a communication channel. Whereas the vast majority of the existing literature…

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: To be published at 21st IFAC World Congress, Berlin, Germany, July 12-17, 2020

  43. arXiv:2005.06420  [pdf, other]

    cs.CL cs.LG

    The Unstoppable Rise of Computational Linguistics in Deep Learning

    Authors: James Henderson

    Abstract: In this paper, we trace the history of neural networks applied to natural language understanding tasks, and identify key contributions which the nature of language has made to the development of neural network architectures. We focus on the importance of variable binding and its instantiation in attention-based models, and argue that Transformer is not a sequence model but an induced-structure mod…

    Submitted 11 June, 2020; v1 submitted 13 May, 2020; originally announced May 2020.

    Comments: 13 pages. Accepted for publication at ACL 2020, in the theme track

  44. Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement

    Authors: Alireza Mohammadshahi, James Henderson

    Abstract: We propose the Recursive Non-autoregressive Graph-to-Graph Transformer architecture (RNGTr) for the iterative refinement of arbitrary graphs through the recursive application of a non-autoregressive Graph-to-Graph Transformer and apply it to syntactic dependency parsing. We demonstrate the power and effectiveness of RNGTr on several dependency corpora, using a refinement model pre-trained with BER…

    Submitted 10 November, 2020; v1 submitted 29 March, 2020; originally announced March 2020.

    Comments: Accepted to Transactions of the Association for Computational Linguistics (TACL) journal

  45. arXiv:2003.00602  [pdf, other]

    cs.IR cs.CR cs.LG stat.ML

    Federating Recommendations Using Differentially Private Prototypes

    Authors: Mónica Ribero, Jette Henderson, Sinead Williamson, Haris Vikalo

    Abstract: Machine learning methods allow us to make recommendations to users in applications across fields including entertainment, dating, and commerce, by exploiting similarities in users' interaction patterns. However, in domains that demand protection of personally sensitive data, such as medicine or banking, how can we learn such a model without accessing the sensitive data, and without inadvertently l…

    Submitted 1 March, 2020; originally announced March 2020.

  46. Graph-to-Graph Transformer for Transition-based Dependency Parsing

    Authors: Alireza Mohammadshahi, James Henderson

    Abstract: We propose the Graph2Graph Transformer architecture for conditioning on and predicting arbitrary graphs, and apply it to the challenging task of transition-based dependency parsing. After proposing two novel Transformer models of transition-based dependency parsing as strong baselines, we show that adding the proposed mechanisms for conditioning on and predicting graphs of Graph2Graph Transformer…

    Submitted 30 October, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: Accepted to Findings of EMNLP 2020

  47. arXiv:1909.06321  [pdf, other]

    cs.CL

    End-to-End Bias Mitigation by Modelling Biases in Corpora

    Authors: Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

    Abstract: Several recent studies have shown that strong natural language understanding (NLU) models are prone to relying on unwanted dataset biases without learning the underlying task, resulting in models that fail to generalize to out-of-domain datasets and are likely to perform poorly in real-world scenarios. We propose two learning strategies to train neural models, which are more robust to such biases…

    Submitted 23 April, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: Accepted in ACL 2020 as a long paper

  48. arXiv:1908.09507  [pdf, other]

    cs.CL cs.LG

    Partially-supervised Mention Detection

    Authors: Lesly Miculicich, James Henderson

    Abstract: Learning to detect entity mentions without using syntactic information can be useful for integration and joint optimization with other tasks. However, it is common to have partially annotated data for this problem. Here, we investigate two approaches to deal with partial annotation of mentions: weighted loss and soft-target classification. We also propose two neural mention detection approaches: a…

    Submitted 26 August, 2019; originally announced August 2019.

  49. arXiv:1906.01496  [pdf, other]

    cs.CL cs.LG stat.ML

    Regularization Advantages of Multilingual Neural Language Models for Low Resource Domains

    Authors: Navid Rekabsaz, Nikolaos Pappas, James Henderson, Banriskhem K. Khonglah, Srikanth Madikeri

    Abstract: Neural language modeling (LM) has led to significant improvements in several applications, including Automatic Speech Recognition. However, they typically require large amounts of training data, which is not available for many domains and languages. In this study, we propose a multilingual neural language model architecture, trained jointly on the domain-specific data of several low-resource langu…

    Submitted 29 May, 2019; originally announced June 2019.

  50. CERTIFAI: Counterfactual Explanations for Robustness, Transparency, Interpretability, and Fairness of Artificial Intelligence models

    Authors: Shubham Sharma, Jette Henderson, Joydeep Ghosh

    Abstract: As artificial intelligence plays an increasingly important role in our society, there are ethical and moral obligations for both businesses and researchers to ensure that their machine learning models are designed, deployed, and maintained responsibly. These models need to be rigorously audited for fairness, robustness, transparency, and interpretability. A variety of methods have been developed t…

    Submitted 19 May, 2019; originally announced May 2019.