Skip to main content

Showing 1–6 of 6 results for author: Ray, S N

.
  1. arXiv:2205.06655   

    cs.CL cs.SD eess.AS

    Unified Modeling of Multi-Domain Multi-Device ASR Systems

    Authors: Soumyajit Mitra, Swayambhu Nath Ray, Bharat Padi, Arunasish Sen, Raghavendra Bilgi, Harish Arsikere, Shalini Ghosh, Ajay Srinivasamurthy, Sri Garimella

    Abstract: Modern Automatic Speech Recognition (ASR) systems often use a portfolio of domain-specific models in order to get high accuracy for distinct user utterance types across different devices. In this paper, we propose an innovative approach that integrates the different per-domain per-device models into a unified model, using a combination of domain embedding, domain experts, mixture of experts and ad… ▽ More

    Submitted 13 October, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: We will update the paper completely with our latest experiments and analysis

  2. arXiv:2106.14622  [pdf, other

    cs.CL cs.LG

    Timestam** Documents and Beliefs

    Authors: Swayambhu Nath Ray

    Abstract: Most of the textual information available to us are temporally variable. In a world where information is dynamic, time-stam** them is a very important task. Documents are a good source of information and are used for many tasks like, sentiment analysis, classification of reviews etc. The knowledge of creation date of documents facilitates several tasks like summarization, event extraction, tempo… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: Master's Report

    ACM Class: I.2.7

  3. arXiv:2106.06183  [pdf, other

    eess.AS cs.CL

    Improving RNN-T ASR Performance with Date-Time and Location Awareness

    Authors: Swayambhu Nath Ray, Soumyajit Mitra, Raghavendra Bilgi, Sri Garimella

    Abstract: In this paper, we explore the benefits of incorporating context into a Recurrent Neural Network (RNN-T) based Automatic Speech Recognition (ASR) model to improve the speech recognition for virtual assistants. Specifically, we use meta information extracted from the time at which the utterance is spoken and the approximate location information to make ASR context aware. We show that these contextua… ▽ More

    Submitted 16 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: To appear in TSD 2021

  4. Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End

    Authors: Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo

    Abstract: Comprehending the overall intent of an utterance helps a listener recognize the individual words spoken. Inspired by this fact, we perform a novel study of the impact of explicitly incorporating intent representations as additional information to improve a recurrent neural network-transducer (RNN-T) based automatic speech recognition (ASR) system. An audio-to-intent (A2I) model encodes the intent… ▽ More

    Submitted 16 June, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: To appear in Interspeech 2021

    Journal ref: Proc. Interspeech, Sept. 2021, pp. 3455-3459

  5. arXiv:1902.02161  [pdf, other

    cs.CL

    AD3: Attentive Deep Document Dater

    Authors: Swayambhu Nath Ray, Shib Sankar Dasgupta, Partha Talukdar

    Abstract: Knowledge of the creation date of documents facilitates several tasks such as summarization, event extraction, temporally focused information extraction etc. Unfortunately, for most of the documents on the Web, the time-stamp metadata is either missing or can't be trusted. Thus, predicting creation time from document content itself is an important task. In this paper, we propose Attentive Deep Doc… ▽ More

    Submitted 21 January, 2019; originally announced February 2019.

    Journal ref: DBLP:conf/emnlp/RayDT18 (2018)

  6. arXiv:1902.00175  [pdf, other

    cs.CL cs.AI cs.LG

    Dating Documents using Graph Convolution Networks

    Authors: Shikhar Vashishth, Shib Sankar Dasgupta, Swayambhu Nath Ray, Partha Talukdar

    Abstract: Document date is essential for many important tasks, such as document retrieval, summarization, event detection, etc. While existing approaches for these tasks assume accurate knowledge of the document date, this is not always available, especially for arbitrary documents from the Web. Document Dating is a challenging problem which requires inference over the temporal structure of the document. Pr… ▽ More

    Submitted 31 January, 2019; originally announced February 2019.

    Comments: Accepted at ACL 2018

    Journal ref: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics 2018