Skip to main content

Showing 1–3 of 3 results for author: Meepegama, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2107.08251  [pdf, other

    cs.CL cs.LG

    Generative Pretraining for Paraphrase Evaluation

    Authors: Jack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed

    Abstract: We introduce ParaBLEU, a paraphrase representation learning model and evaluation metric for text generation. Unlike previous approaches, ParaBLEU learns to understand paraphrasis using generative conditioning as a pretraining objective. ParaBLEU correlates more strongly with human judgements than existing metrics, obtaining new state-of-the-art results on the 2017 WMT Metrics Shared Task. We show… ▽ More

    Submitted 24 July, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: Under review

  2. arXiv:2107.08248  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Learning De-identified Representations of Prosody from Raw Audio

    Authors: Jack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed

    Abstract: We propose a method for learning de-identified prosody representations from raw audio using a contrastive self-supervised signal. Whereas prior work has relied on conditioning models on bottlenecks, we introduce a set of inductive biases that exploit the natural structure of prosody to minimize timbral information and decouple prosody from speaker representations. Despite aggressive downsampling o… ▽ More

    Submitted 17 July, 2021; originally announced July 2021.

    Comments: ICML 2021

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Proceedings of Machine Learning Research 139, PMLR 2021

  3. arXiv:1906.00254  [pdf, other

    cs.LG cs.CV cs.SD eess.AS stat.ML

    Super-resolution of Time-series Labels for Bootstrapped Event Detection

    Authors: Ivan Kiskin, Udeepa Meepegama, Steven Roberts

    Abstract: Solving real-world problems, particularly with deep learning, relies on the availability of abundant, quality data. In this paper we develop a novel framework that maximises the utility of time-series datasets that contain only small quantities of expertly-labelled data, larger quantities of weakly (or coarsely) labelled data and a large volume of unlabelled data. This represents scenarios commonl… ▽ More

    Submitted 1 June, 2019; originally announced June 2019.

    Comments: Accepted at the Time-series workshop at ICML 2019, Long Beach