Skip to main content

Showing 1–6 of 6 results for author: Sastry, C

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.05071  [pdf, other

    cs.LG cs.SD eess.AS

    Test-Time Training for Depression Detection

    Authors: Sri Harsha Dumpala, Chandramouli Shama Sastry, Rudolf Uher, Sageev Oore

    Abstract: Previous works on depression detection use datasets collected in similar environments to train and test the models. In practice, however, the train and test distributions cannot be guaranteed to be identical. Distribution shifts can be introduced due to variations such as recording environment (e.g., background noise) and demographics (e.g., gender, age, etc). Such distributional shifts can surpri… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  2. arXiv:2402.14285  [pdf, other

    cs.SD cs.LG eess.AS

    Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion

    Authors: Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli S Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue

    Abstract: We study the problem of symbolic music generation (e.g., generating piano rolls), with a technical focus on non-differentiable rule guidance. Musical rules are often expressed in symbolic form on note characteristics, such as note density or chord progression, many of which are non-differentiable which pose a challenge when using them for guided diffusion. We propose \oursfull (\ours), a novel gui… ▽ More

    Submitted 2 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: ICML 2024 (Oral)

  3. arXiv:2309.10930  [pdf, other

    cs.SD cs.LG eess.AS

    Test-Time Training for Speech

    Authors: Sri Harsha Dumpala, Chandramouli Sastry, Sageev Oore

    Abstract: In this paper, we study the application of Test-Time Training (TTT) as a solution to handling distribution shifts in speech applications. In particular, we introduce distribution-shifts to the test datasets of standard speech-classification tasks -- for example, speaker-identification and emotion-detection -- and explore how Test-Time Training (TTT) can help adjust to the distribution-shift. In ou… ▽ More

    Submitted 28 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

  4. arXiv:2108.01043  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Musical Speech: A Transformer-based Composition Tool

    Authors: Jason d'Eon, Sri Harsha Dumpala, Chandramouli Shama Sastry, Dani Oore, Sageev Oore

    Abstract: In this paper, we propose a new compositional tool that will generate a musical outline of speech recorded/provided by the user for use as a musical building block in their compositions. The tool allows any user to use their own speech to generate musical material, while still being able to hear the direct connection between their recorded speech and the resulting music. The tool is built on our p… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: NeurIPS 2020 Demonstration Track; extended for PMLR

  5. arXiv:1807.03232  [pdf, ps, other

    eess.SP cs.CV physics.med-ph

    Robust Heartbeat Detection from Multimodal Data via CNN-based Generalizable Information Fusion

    Authors: B S Chandra, C S Sastry, S Jana

    Abstract: Objective: Heartbeat detection remains central to cardiac disease diagnosis and management, and is traditionally performed based on electrocardiogram (ECG). To improve robustness and accuracy of detection, especially, in certain critical-care scenarios, the use of additional physiological signals such as arterial blood pressure (BP) has recently been suggested. There, estimation of heartbeat locat… ▽ More

    Submitted 29 June, 2018; originally announced July 2018.

  6. arXiv:1806.04874  [pdf, other

    eess.SP

    Novel Light Weight Compressed Data Aggregation Using Sparse Measurements for IoT Networks

    Authors: Amarlingam M, Pradeep Kumar Mishra, P Rajalakshmi, Sumohana S. Channappayya, C. S. Sastry

    Abstract: Optimal data aggregation aimed at maximizing IoT network lifetime by minimizing constrained on-board resource utilization continues to be a challenging task. The existing data aggregation methods have proven that compressed sensing is promising for data aggregation. However, they compromise either on energy efficiency or recovery fidelity and require complex on-node computations. In this paper, we… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.