Skip to main content

Showing 1–9 of 9 results for author: Sherborne, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.03646  [pdf, other

    cs.LG cs.CL

    TRAM: Bridging Trust Regions and Sharpness Aware Minimization

    Authors: Tom Sherborne, Naomi Saphra, Pradeep Dasigi, Hao Peng

    Abstract: Sharpness-aware minimization (SAM) reports improving domain generalization by reducing the loss surface curvature in the parameter space. However, generalization during fine-tuning is often more dependent on the transferability of representations in the function space. Trust-region methods (TR) target this goal by regularizing representation curvature to reduce catastrophic forgetting of pre-train… ▽ More

    Submitted 12 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Camera Ready for ICLR 2024 (Accepted as Spotlight). 21 pages, 14 tables, 2 figures

  2. arXiv:2307.09701  [pdf, other

    cs.CL

    Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation

    Authors: Hao Peng, Qingqing Cao, Jesse Dodge, Matthew E. Peters, Jared Fernandez, Tom Sherborne, Kyle Lo, Sam Skjonsberg, Emma Strubell, Darrell Plessas, Iz Beltagy, Evan Pete Walsh, Noah A. Smith, Hannaneh Hajishirzi

    Abstract: Rising computational demands of modern natural language processing (NLP) systems have increased the barrier to entry for cutting-edge research while posing serious environmental concerns. Yet, progress on model efficiency has been impeded by practical challenges in model evaluation and comparison. For example, hardware is challenging to control due to disparate levels of accessibility across diffe… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  3. arXiv:2307.04096  [pdf, other

    cs.CL

    Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing

    Authors: Tom Sherborne, Tom Hosking, Mirella Lapata

    Abstract: Cross-lingual semantic parsing transfers parsing capability from a high-resource language (e.g., English) to low-resource languages with scarce training data. Previous work has primarily considered silver-standard data augmentation or zero-shot methods, however, exploiting few-shot gold data is comparatively unexplored. We propose a new approach to cross-lingual semantic parsing by explicitly mini… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: Accepted to TACL 2023. Pre-MIT Press publication. 17 pages, 3 figures, 6 tables

  4. arXiv:2305.14864  [pdf, other

    cs.CL

    How To Train Your (Compressed) Large Language Model

    Authors: Ananya Harsh Jha, Tom Sherborne, Evan Pete Walsh, Dirk Groeneveld, Emma Strubell, Iz Beltagy

    Abstract: With the increase in the size of large language models (LLMs), we need compression methods that can reduce the model size while preserving the generality and zero-shot promptability of the model. This goal is more ambitious than the typical compression setup, which reduces the model's size at the expense of specializing it to a specific end-task. To study this, we develop a task-agnostic compressi… ▽ More

    Submitted 18 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 13 pages, 6 figures, 5 tables

  5. arXiv:2212.10297  [pdf, other

    cs.CL cs.AI

    Extrinsic Evaluation of Machine Translation Metrics

    Authors: Nikita Moghe, Tom Sherborne, Mark Steedman, Alexandra Birch

    Abstract: Automatic machine translation (MT) metrics are widely used to distinguish the translation qualities of machine translation systems across relatively large test sets (system-level evaluation). However, it is unclear if automatic metrics are reliable at distinguishing good translations from bad translations at the sentence level (segment-level evaluation). In this paper, we investigate how useful MT… ▽ More

    Submitted 18 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 Camera Ready

  6. arXiv:2209.12577  [pdf, other

    cs.CL

    Meta-Learning a Cross-lingual Manifold for Semantic Parsing

    Authors: Tom Sherborne, Mirella Lapata

    Abstract: Localizing a semantic parser to support new languages requires effective cross-lingual generalization. Recent work has found success with machine-translation or zero-shot methods although these approaches can struggle to model how native speakers ask questions. We consider how to effectively leverage minimal annotated examples in new languages for few-shot cross-lingual semantic parsing. We introd… ▽ More

    Submitted 27 September, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted to TACL 2022. Pre-MIT Press publication

  7. arXiv:2104.07554  [pdf, other

    cs.CL

    Zero-Shot Cross-lingual Semantic Parsing

    Authors: Tom Sherborne, Mirella Lapata

    Abstract: Recent work in cross-lingual semantic parsing has successfully applied machine translation to localize parsers to new languages. However, these advances assume access to high-quality machine translation systems and word alignment tools. We remove these assumptions and study cross-lingual semantic parsing as a zero-shot problem, without parallel data (i.e., utterance-logical form pairs) for new lan… ▽ More

    Submitted 7 March, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to ACL2022 Main Conference. 19 pages, 3 figures, 12 tables

  8. arXiv:2004.02585  [pdf, other

    cs.CL

    Bootstrap** a Crosslingual Semantic Parser

    Authors: Tom Sherborne, Yumo Xu, Mirella Lapata

    Abstract: Recent progress in semantic parsing scarcely considers languages other than English but professional translation can be prohibitively expensive. We adapt a semantic parser trained on a single language, such as English, to new languages and multiple domains with minimal annotation. We query if machine translation is an adequate substitute for training data, and extend this to investigate bootstrapp… ▽ More

    Submitted 23 September, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Camera Ready for EMNLP2020 Findings

  9. arXiv:1912.08960  [pdf, other

    cs.CL

    Going Beneath the Surface: Evaluating Image Captioning for Grammaticality, Truthfulness and Diversity

    Authors: Huiyuan Xie, Tom Sherborne, Alexander Kuhnle, Ann Copestake

    Abstract: Image captioning as a multimodal task has drawn much interest in recent years. However, evaluation for this task remains a challenging problem. Existing evaluation metrics focus on surface similarity between a candidate caption and a set of reference captions, and do not check the actual relation between a caption and the underlying visual content. We introduce a new diagnostic evaluation framewor… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.