Showing 1–5 of 5 results for author: Sreenivas, T V

  1. arXiv:2109.04544  [pdf, other]

    eess.AS cs.SD eess.SP

    Directional MCLP Analysis and Reconstruction for Spatial Speech Communication

    Authors: Srikanth Raj Chetupalli, Thippur V. Sreenivas

    Abstract: Spatial speech communication, i.e., the reconstruction of spoken signal along with the relative speaker position in the enclosure (reverberation information) is considered in this paper. Directional, diffuse components and the source position information are estimated at the transmitter, and perceptually effective reproduction is considered at the receiver. We consider spatially distributed microp…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: The manuscript is submitted as a full paper to IEEE/ACM Transactions on Audio, Speech and Language Processing

  2. arXiv:1910.09782  [pdf, ps, other]

    eess.AS eess.SP

    Joint spatial filter and time-varying MCLP for dereverberation and interference suppression of a dynamic/static speech source

    Authors: Srikanth Raj Chetupalli, Thippur V. Sreenivas

    Abstract: Dereverberation of a moving speech source in the presence of other directional interferers, is a harder problem than that of stationary source and interference cancellation. We explore joint multi channel linear prediction (MCLP) and relative transfer function (RTF) formulation in a stochastic framework and maximum likelihood estimation. We found that the combination of spatial filtering with dist…

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: Manuscript submitted for review to IEEE/ACM Transactions on Audio, Speech, and Language Processing on 18 Jul 2019

  3. arXiv:1812.01346  [pdf, ps, other]

    eess.AS cs.SD

    LSTM based AE-DNN constraint for better late reverb suppression in multi-channel LP formulation

    Authors: Srikanth Raj Chetupalli, Thippur V. Sreenivas

    Abstract: Prediction of late reverberation component using multi-channel linear prediction (MCLP) in short-time Fourier transform (STFT) domain is an effective means to enhance reverberant speech. Traditionally, a speech power spectral density (PSD) weighted prediction error (WPE) minimization approach is used to estimate the prediction filters. The method is sensitive to the estimate of the desired signal…

    Submitted 4 December, 2018; originally announced December 2018.

  4. arXiv:1810.13109  [pdf, ps, other]

    eess.AS cs.SD

    Latent variable approach to diarization of audio recordings using ad-hoc randomly placed mobile devices

    Authors: Srikanth Raj Chetupalli, Anirban Bhowmick, Thippur V. Sreenivas

    Abstract: Diarization of audio recordings from ad-hoc mobile devices using spatial information is considered in this paper. A two-channel synchronous recording is assumed for each mobile device, which is used to compute directional statistics separately at each device in a frame-wise manner. The recordings across the mobile devices are asynchronous, but a coarse synchronization is performed by aligning the…

    Submitted 31 October, 2018; originally announced October 2018.

    Comments: Paper Submitted to the International Conference on Acoustics Speech and Signal Processing (ICASSP) 2019 to be held in Brighton, UK between May 12-17, 2019

  5. arXiv:1711.11357  [pdf, other]

    eess.AS

    Raga Identification using Repetitive Note Patterns from prescriptive notations of Carnatic Music

    Authors: Ranjani H. G., T. V. Sreenivas

    Abstract: Carnatic music, a form of Indian Art Music, has relied on an oral tradition for transferring knowledge across several generations. Over the last two hundred years, the use of prescriptive notations has been adopted for learning, sight-playing and sight-singing. Prescriptive notations offer generic guidelines for a raga rendition and do not include information about the ornamentations or the gamaka…

    Submitted 30 November, 2017; originally announced November 2017.