
Showing 1–2 of 2 results for author: Padi, B

Searching in archive eess.
  1. arXiv:2205.06655   

    cs.CL cs.SD eess.AS

    Unified Modeling of Multi-Domain Multi-Device ASR Systems

    Authors: Soumyajit Mitra, Swayambhu Nath Ray, Bharat Padi, Arunasish Sen, Raghavendra Bilgi, Harish Arsikere, Shalini Ghosh, Ajay Srinivasamurthy, Sri Garimella

    Abstract: Modern Automatic Speech Recognition (ASR) systems often use a portfolio of domain-specific models in order to get high accuracy for distinct user utterance types across different devices. In this paper, we propose an innovative approach that integrates the different per-domain per-device models into a unified model, using a combination of domain embedding, domain experts, mixture of experts and ad…

    Submitted 13 October, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: We will update the paper completely with our latest experiments and analysis

  2. arXiv:2004.01221  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Towards Relevance and Sequence Modeling in Language Recognition

    Authors: Bharat Padi, Anand Mohan, Sriram Ganapathy

    Abstract: The task of automatic language identification (LID) involving multiple dialects of the same language family in the presence of noise is a challenging problem. In these scenarios, the identity of the language/dialect may be reliably present only in parts of the temporal sequence of the speech signal. The conventional approaches to LID (and for speaker recognition) ignore the sequence information by…

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: Accepted to IEEE Transactions on Audio, Speech and Language Processing. https://github.com/iiscleap/lre-relevance-weighting