Skip to main content

Showing 1–2 of 2 results for author: Jayasena, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2208.07864  [pdf, ps, other

    cs.CL

    BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification

    Authors: Vinura Dhananjaya, Piyumal Demotte, Surangika Ranathunga, Sanath Jayasena

    Abstract: This research provides the first comprehensive analysis of the performance of pre-trained language models for Sinhala text classification. We test on a set of different Sinhala text classification tasks and our analysis shows that out of the pre-trained multilingual models that include Sinhala (XLM-R, LaBSE, and LASER), XLM-R is the best model by far for Sinhala text classification. We also pre-tr… ▽ More

    Submitted 17 August, 2022; v1 submitted 16 August, 2022; originally announced August 2022.

  2. arXiv:2107.02453  [pdf, other

    cs.LG cs.AI cs.CV

    Neural Mixture Models with Expectation-Maximization for End-to-end Deep Clustering

    Authors: Dumindu Tissera, Kasun Vithanage, Rukshan Wijesinghe, Alex Xavier, Sanath Jayasena, Subha Fernando, Ranga Rodrigo

    Abstract: Any clustering algorithm must synchronously learn to model the clusters and allocate data to those clusters in the absence of labels. Mixture model-based methods model clusters with pre-defined statistical distributions and allocate data to those clusters based on the cluster likelihoods. They iteratively refine those distribution parameters and member assignments following the Expectation-Maximiz… ▽ More

    Submitted 2 October, 2022; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: Accepted and published at Neurocomputing 2022

    MSC Class: 68T10; 62H30 ACM Class: I.2; I.4; I.5