Skip to main content

Showing 1–5 of 5 results for author: Gee, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12502  [pdf, other

    cs.CL

    Code-Optimise: Self-Generated Preference Data for Correctness and Efficiency

    Authors: Leonidas Gee, Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci

    Abstract: Code Language Models have been trained to generate accurate solutions, typically with no regard for runtime. On the other hand, previous works that explored execution optimisation have observed corresponding drops in functional correctness. To that end, we introduce Code-Optimise, a framework that incorporates both correctness (passed, failed) and runtime (quick, slow) as learning signals via self… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Under review at ARR (for EMNLP 2024)

  2. Are Compressed Language Models Less Subgroup Robust?

    Authors: Leonidas Gee, Andrea Zugarini, Novi Quadrianto

    Abstract: To reduce the inference cost of large language models, model compression is increasingly used to create smaller scalable models. However, little is known about their robustness to minority subgroups defined by the labels and attributes of a dataset. In this paper, we investigate the effects of 18 different compression methods and settings on the subgroup robustness of BERT language models. We show… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

    Journal ref: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Main Track

  3. Fast Vocabulary Transfer for Language Model Compression

    Authors: Leonidas Gee, Andrea Zugarini, Leonardo Rigutini, Paolo Torroni

    Abstract: Real-world business applications require a trade-off between language model performance and size. We propose a new method for model compression that relies on vocabulary transfer. We evaluate the method on various vertical domains and downstream tasks. Our results indicate that vocabulary transfer can be effectively used in combination with other compression techniques, yielding a significant redu… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

    Journal ref: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022): Industry Track

  4. Multi-word Tokenization for Sequence Compression

    Authors: Leonidas Gee, Leonardo Rigutini, Marco Ernandes, Andrea Zugarini

    Abstract: Large Language Models have proven highly successful at modelling a variety of tasks. However, this comes at a steep computational cost that hinders wider industrial uptake. In this paper, we present MWT: a Multi-Word Tokenizer that goes beyond word boundaries by representing frequent multi-word expressions as single tokens. MWTs produce a more compact and efficient tokenization that yields two ben… ▽ More

    Submitted 4 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

    Journal ref: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track

  5. arXiv:2007.00824  [pdf, ps, other

    cs.CL cs.LG

    Lightme: Analysing Language in Internet Support Groups for Mental Health

    Authors: Gabriela Ferraro, Brendan Loo Gee, Shenjia Ji, Luis Salvador-Carulla

    Abstract: Background: Assisting moderators to triage harmful posts in Internet Support Groups is relevant to ensure its safe use. Automated text classification methods analysing the language expressed in posts of online forums is a promising solution. Methods: Natural Language Processing and Machine Learning technologies were used to build a triage post classifier using a dataset from Reachout mental health… ▽ More

    Submitted 2 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.