Showing 1–11 of 11 results for author: Dutta, D

Searching in archive eess.
  1. arXiv:2311.13028

    cs.LG cs.AI cs.DC eess.SP

    DMLR: Data-centric Machine Learning Research -- Past, Present and Future

    Authors: Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, et al. (13 additional authors not shown)

    Abstract: Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods tow…

    Submitted 1 June, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Published in the Journal of Data-centric Machine Learning Research (DMLR) at https://data.mlr.press/assets/pdf/v01-5.pdf

  2. arXiv:2309.13537

    eess.AS cs.AI cs.SD

    Speech enhancement with frequency domain auto-regressive modeling

    Authors: Anurenjan Purushothaman, Debottam Dutta, Rohit Kumar, Sriram Ganapathy

    Abstract: Speech applications in far-field real world settings often deal with signals that are corrupted by reverberation. The task of dereverberation constitutes an important step to improve the audible quality and to reduce the error rates in applications like automatic speech recognition (ASR). We propose a unified framework of speech dereverberation for improving the speech quality and the ASR performa…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: 10 pages

    Journal ref: IEEE/ACM Transactions on Audio, Speech and Language Processing 2023

  3. arXiv:2307.07717

    cs.HC eess.SP

    Deep ANN-based Touch-less 3D Pad for Digit Recognition

    Authors: Pramit Kumar Pal, Debarshi Dutta, Attreyee Mandal, Dipshika Das

    Abstract: The Covid-19 pandemic has changed the way humans interact with their environment. Common touch surfaces such as elevator switches and ATM switches are hazardous to touch as they are used by countless people every day, increasing the chance of getting infected. So, a need for touch-less interaction with machines arises. In this paper, we propose a method of recognizing the ten decimal digits (0-9)…

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 8 pages, 21 figures, International Conference on Artificial Intelligence: Theory and Applications (AITA-2021)

    ACM Class: I.2.6; I.2.3

    Journal ref: Journal of Biological Engineering Research and Review 2021 https://biologicalengineering.in/

  4. arXiv:2305.12741

    eess.AS cs.LG cs.SD q-bio.QM

    Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection

    Authors: Debarpan Bhattacharya, Neeraj Kumar Sharma, Debottam Dutta, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K K, Sadhana Gonuguntla, Murali Alagesan

    Abstract: This paper presents the Coswara dataset, a dataset containing a diverse set of respiratory sounds and rich metadata, recorded between April 2020 and February 2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects). The respiratory sounds contained nine sound categories associated with variants of breathing, cough and speech. The rich metadata contained demogr…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in Nature Scientific Data

  5. arXiv:2206.13365

    eess.AS cs.LG cs.SD

    Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection

    Authors: Debottam Dutta, Debarpan Bhattacharya, Sriram Ganapathy, Amir H. Poorjam, Deepak Mittal, Maneesh Singh

    Abstract: In this paper, we describe an approach for representation learning of audio signals for the task of COVID-19 detection. The raw audio samples are processed with a bank of 1-D convolutional filters that are parameterized as cosine modulated Gaussian functions. The choice of these kernels allows the interpretation of the filterbanks as smooth band-pass filters. The filtered outputs are pooled, log-c…

    Submitted 27 June, 2022; originally announced June 2022.

  6. arXiv:2206.12309

    eess.AS cs.LG eess.SP

    Analyzing the impact of SARS-CoV-2 variants on respiratory sound signals

    Authors: Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K K, Sadhana Gonuguntla, Murali Alagesan

    Abstract: The COVID-19 outbreak resulted in multiple waves of infections that have been associated with different SARS-CoV-2 variants. Studies have reported differential impact of the variants on respiratory health of patients. We explore whether acoustic signals, collected from COVID-19 subjects, show computationally distinguishable acoustic patterns suggesting a possibility to predict the underlying virus…

    Submitted 24 June, 2022; originally announced June 2022.

    Journal ref: Interspeech, 2022

  7. arXiv:2206.05462

    eess.AS cs.LG cs.SD

    Svadhyaya system for the Second Diagnosing COVID-19 using Acoustics Challenge 2021

    Authors: Deepak Mittal, Amir H. Poorjam, Debottam Dutta, Debarpan Bhattacharya, Zemin Yu, Sriram Ganapathy, Maneesh Singh

    Abstract: This report describes the system used for detecting COVID-19 positives using three different acoustic modalities, namely speech, breathing, and cough in the second DiCOVA challenge. The proposed system is based on the combination of 4 different approaches, each focusing more on one aspect of the problem, and reaches the blind test AUCs of 86.41, 77.60, and 84.55, in the breathing, cough, and speec…

    Submitted 11 June, 2022; originally announced June 2022.

  8. arXiv:2206.05053

    cs.HC cs.LG cs.SD eess.AS eess.SP

    Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms

    Authors: Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K K, Sadhana Gonuguntla, Murali Alagesan

    Abstract: The COVID-19 pandemic has accelerated research on design of alternative, quick and effective COVID-19 diagnosis approaches. In this paper, we describe the Coswara tool, a website application designed to enable COVID-19 detection by analysing respiratory sound samples and health symptoms. A user using this service can log into a website using any device connected to the internet, provide their curr…

    Submitted 9 June, 2022; originally announced June 2022.

    Journal ref: Interspeech, 2022

  9. arXiv:2110.01177

    eess.AS cs.SD q-bio.QM

    The Second DiCOVA Challenge: Dataset and performance analysis for COVID-19 diagnosis using acoustics

    Authors: Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Debarpan Bhattacharya, Debottam Dutta, Pravin Mote, Sriram Ganapathy

    Abstract: The Second Diagnosis of COVID-19 using Acoustics (DiCOVA) Challenge aimed at accelerating the research in acoustics based detection of COVID-19, a topic at the intersection of acoustics, signal processing, machine learning, and healthcare. This paper presents the details of the challenge, which was an open call for researchers to analyze a dataset of audio recordings consisting of breathing, cough…

    Submitted 11 October, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

  10. arXiv:2107.14793

    eess.AS cs.SD eess.SP

    A Multi-Head Relevance Weighting Framework For Learning Raw Waveform Audio Representations

    Authors: Debottam Dutta, Purvi Agrawal, Sriram Ganapathy

    Abstract: In this work, we propose a multi-head relevance weighting framework to learn audio representations from raw waveforms. The audio waveform, split into windows of short duration, is processed with a 1-D convolutional layer of cosine modulated Gaussian filters acting as a learnable filterbank. The key novelty of the proposed framework is the introduction of multi-head relevance on the learnt filterb…

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: Submitted to 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2021)

  11. arXiv:1807.03625

    cs.SD cs.LG eess.AS stat.ML

    Foreign English Accent Adjustment by Learning Phonetic Patterns

    Authors: Fedor Kitashov, Elizaveta Svitanko, Debojyoti Dutta

    Abstract: State-of-the-art automatic speech recognition (ASR) systems struggle with the lack of data for rare accents. For sufficiently large datasets, neural engines tend to outshine statistical models in most natural language processing problems. However, a speech accent remains a challenge for both approaches. Phonologists manually create general rules describing a speaker's accent, but their results rem…

    Submitted 9 July, 2018; originally announced July 2018.