Skip to main content

Showing 1–9 of 9 results for author: Purushothaman, A

.
  1. arXiv:2311.05757  [pdf, other

    cond-mat.soft physics.flu-dyn

    Confinement induced three-dimensional trajectories of microswimmers in rectangular channels

    Authors: Byjesh N. Radhakrishnan, Ahana Purushothaman, Ranabir Dey, Sumesh P Thampi

    Abstract: We study the trajectories of a model microorganism inside three-dimensional channels with square and rectangular cross-sections. Using (i) numerical simulations based on lattice-Boltzmann method, and (ii) analytical expressions using far-field hydrodynamic approximations and method of images we systematically investigate the role of the strength and finite-size of the squirmer, confinement dimensi… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  2. arXiv:2309.13537  [pdf, other

    eess.AS cs.AI cs.SD

    Speech enhancement with frequency domain auto-regressive modeling

    Authors: Anurenjan Purushothaman, Debottam Dutta, Rohit Kumar, Sriram Ganapathy

    Abstract: Speech applications in far-field real world settings often deal with signals that are corrupted by reverberation. The task of dereverberation constitutes an important step to improve the audible quality and to reduce the error rates in applications like automatic speech recognition (ASR). We propose a unified framework of speech dereverberation for improving the speech quality and the ASR performa… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: 10 pages

    Journal ref: IEEE/ACM Transactions on Audio, Speech and Language Processing 2023

  3. arXiv:2108.05520  [pdf, other

    eess.AS cs.SD eess.SP

    Dereverberation of Autoregressive Envelopes for Far-field Speech Recognition

    Authors: Anurenjan Purushothaman, Anirudh Sreeram, Rohit Kumar, Sriram Ganapathy

    Abstract: The task of speech recognition in far-field environments is adversely affected by the reverberant artifacts that elicit as the temporal smearing of the sub-band envelopes. In this paper, we develop a neural model for speech dereverberation using the long-term sub-band envelopes of speech. The sub-band envelopes are derived using frequency domain linear prediction (FDLP) which performs an autoregre… ▽ More

    Submitted 13 August, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2008.03339

  4. arXiv:2108.03975  [pdf, other

    eess.AS

    End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive Envelopes

    Authors: Rohit Kumar, Anurenjan Purushothaman, Anirudh Sreeram, Sriram Ganapathy

    Abstract: The end-to-end (E2E) automatic speech recognition (ASR) systems are often required to operate in reverberant conditions, where the long-term sub-band envelopes of the speech are temporally smeared. In this paper, we develop a feature enhancement approach using a neural model operating on sub-band temporal envelopes. The temporal envelopes are modeled using the framework of frequency domain linear… ▽ More

    Submitted 17 February, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: 5 pages with refrences, e2e asr

  5. arXiv:2106.12763  [pdf, other

    eess.AS cs.SD eess.IV eess.SP

    SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing

    Authors: R G Prithvi Raj, Rohit Kumar, M K Jayesh, Anurenjan Purushothaman, Sriram Ganapathy, M A Basha Shaik

    Abstract: This paper presents the details of the SRIB-LEAP submission to the ConferencingSpeech challenge 2021. The challenge involved the task of multi-channel speech enhancement to improve the quality of far field speech from microphone arrays in a video conferencing room. We propose a two stage method involving a beamformer followed by single channel enhancement. For the beamformer, we incorporated self-… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  6. arXiv:2008.03339  [pdf, other

    eess.AS cs.SD eess.SP

    Deep Learning Based Dereverberation of Temporal Envelopesfor Robust Speech Recognition

    Authors: Anurenjan Purushothaman, Anirudh Sreeram, Rohit Kumar, Sriram Ganapathy

    Abstract: Automatic speech recognition in reverberant conditions is a challenging task as the long-term envelopes of the reverberant speech are temporally smeared. In this paper, we propose a neural model for enhancement of sub-band temporal envelopes for dereverberation of speech. The temporal envelopes are derived using the autoregressive modeling framework of frequency domain linear prediction (FDLP). Th… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

  7. arXiv:2005.11258  [pdf, other

    eess.AS

    LEAP Submission to CHiME-6 ASR Challenge}

    Authors: Anirudh Sreeram, Anurenjan Purushothaman, Rohit Kumar, Sriram Ganapathy

    Abstract: This paper reports the LEAP submission to the CHiME-6 challenge. The CHiME-6 Automatic Speech Recognition (ASR) challenge Track 1 involved the recognition of speech in noisy and reverberant acoustic conditions in home environments with multiple-party interactions. For the challenge submission, the LEAP system used extensive data augmentation and a factorized time-delay neural network (TDNN) archit… ▽ More

    Submitted 22 May, 2020; originally announced May 2020.

  8. arXiv:1911.12617  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Unsupervised Neural Mask Estimator For Generalized Eigen-Value Beamforming Based ASR

    Authors: Rohit Kumar, Anirudh Sreeram, Anurenjan Purushothaman, Sriram Ganapathy

    Abstract: The state-of-art methods for acoustic beamforming in multi-channel ASR are based on a neural mask estimator that predicts the presence of speech and noise. These models are trained using a paired corpus of clean and noisy recordings (teacher model). In this paper, we attempt to move away from the requirements of having supervised clean recordings for training the mask estimator. The models based o… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

  9. arXiv:1911.05504  [pdf, other

    eess.AS cs.LG cs.SD

    3-D Feature and Acoustic Modeling for Far-Field Speech Recognition

    Authors: Anurenjan Purushothaman, Anirudh Sreeram, Sriram Ganapathy

    Abstract: Automatic speech recognition in multi-channel reverberant conditions is a challenging task. The conventional way of suppressing the reverberation artifacts involves a beamforming based enhancement of the multi-channel speech signal, which is used to extract spectrogram based features for a neural network acoustic model. In this paper, we propose to extract features directly from the multi-channel… ▽ More

    Submitted 26 January, 2020; v1 submitted 13 November, 2019; originally announced November 2019.