Skip to main content

Showing 1–4 of 4 results for author: Thillainathan, S

.
  1. arXiv:2406.11338  [pdf, other

    cs.CL

    Fine-grained Controllable Text Generation through In-context Learning with Feedback

    Authors: Sarubi Thillainathan, Alexander Koller

    Abstract: We present a method for rewriting an input sentence to match specific values of nontrivial linguistic features, such as dependency depth. In contrast to earlier work, our method uses in-context learning rather than finetuning, making it applicable in use cases where data is sparse. We show that our model performs accurate rewrites and matches the state of the art on rewriting sentences to a specif… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2404.04212  [pdf, other

    cs.CL

    Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation

    Authors: Tong Su, Xin Peng, Sarubi Thillainathan, David Guzmán, Surangika Ranathunga, En-Shiun Annie Lee

    Abstract: Parameter-efficient fine-tuning (PEFT) methods are increasingly vital in adapting large-scale pre-trained language models for diverse tasks, offering a balance between adaptability and computational efficiency. They are important in Low-Resource Language (LRL) Neural Machine Translation (NMT) to enhance translation accuracy with minimal resources. However, their practical effectiveness varies sign… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted to the Findings of NAACL 2024

  3. arXiv:2306.01382  [pdf, other

    cs.CL

    Leveraging Auxiliary Domain Parallel Data in Intermediate Task Fine-tuning for Low-resource Translation

    Authors: Shravan Nayak, Surangika Ranathunga, Sarubi Thillainathan, Rikki Hung, Anthony Rinaldi, Yining Wang, Jonah Mackey, Andrew Ho, En-Shiun Annie Lee

    Abstract: NMT systems trained on Pre-trained Multilingual Sequence-Sequence (PMSS) models flounder when sufficient amounts of parallel data is not available for fine-tuning. This specifically holds for languages missing/under-represented in these models. The problem gets aggravated when the data comes from different domains. In this paper, we show that intermediate-task fine-tuning (ITFT) of PMSS models is… ▽ More

    Submitted 23 September, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted for poster presentation at the Practical Machine Learning for Develo** Countries (PML4DC) workshop, ICLR 2023

  4. arXiv:2203.08850  [pdf, other

    cs.CL

    Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?

    Authors: En-Shiun Annie Lee, Sarubi Thillainathan, Shravan Nayak, Surangika Ranathunga, David Ifeoluwa Adelani, Ruisi Su, Arya D. McCarthy

    Abstract: What can pre-trained multilingual sequence-to-sequence models like mBART contribute to translating low-resource languages? We conduct a thorough empirical experiment in 10 languages to ascertain this, considering five factors: (1) the amount of fine-tuning data, (2) the noise in the fine-tuning data, (3) the amount of pre-training data in the model, (4) the impact of domain mismatch, and (5) langu… ▽ More

    Submitted 30 April, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of ACL 2022