
Showing 1–26 of 26 results for author: Murthy, A

Searching in archive cs.
  1. arXiv:2402.01817

    cs.AI cs.LG

    LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

    Authors: Subbarao Kambhampati, Karthik Valmeekam, Lin Guan, Mudit Verma, Kaya Stechly, Siddhant Bhambri, Lucas Saldyt, Anil Murthy

    Abstract: There is considerable confusion about the role of Large Language Models (LLMs) in planning and reasoning tasks. On one side are over-optimistic claims that LLMs can indeed do these tasks with just the right prompting or self-verification strategies. On the other side are perhaps over-pessimistic claims that all that LLMs are good for in planning/reasoning tasks are as mere translators of the probl…

    Submitted 11 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  2. arXiv:2312.14292

    cs.AI cs.LG cs.MA

    Benchmarking Multi-Agent Preference-based Reinforcement Learning for Human-AI Teaming

    Authors: Siddhant Bhambri, Mudit Verma, Anil Murthy, Subbarao Kambhampati

    Abstract: Preference-based Reinforcement Learning (PbRL) is an active area of research, and has made significant strides in single-agent actor and in observer human-in-the-loop scenarios. However, its application within the co-operative multi-agent RL frameworks, where humans actively participate and express preferences for agent behavior, remains largely uncharted. We consider a two-agent (Human-AI) cooper…

    Submitted 21 December, 2023; originally announced December 2023.

  3. arXiv:2303.07130

    eess.IV cs.CV cs.LG

    Enhancing COVID-19 Severity Analysis through Ensemble Methods

    Authors: Anand Thyagachandran, Hema A Murthy

    Abstract: Computed Tomography (CT) scans provide a detailed image of the lungs, allowing clinicians to observe the extent of damage caused by COVID-19. The CT severity score (CTSS) based scoring method is used to identify the extent of lung involvement observed on a CT scan. This paper presents a domain knowledge-based pipeline for extracting regions of infection in COVID-19 patients using a combination of…

    Submitted 17 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  4. arXiv:2302.06227

    eess.AS cs.SD

    Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

    Authors: Sudhanshu Srivastava, Ishika Gupta, Anusha Prakash, Jom Kuriakose, Hema A. Murthy

    Abstract: Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles along with fast training and synthesis while being computationally less intense. HTS performs well even in low-resource scenarios. The primary drawback is that the voice quality is poor compared to that of E2E systems. A hybrid approach combining HMM-based feature generation and neural-network-based HiFi-GAN…

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 5 pages, 5 figures

  5. arXiv:2211.08790

    eess.AS cs.LG

    Structural Segmentation and Labeling of Tabla Solo Performances

    Authors: Gowriprasad R, R Aravind, Hema A Murthy

    Abstract: Tabla is a North Indian percussion instrument used as an accompaniment and an exclusive instrument for solo performances. Tabla solo is intricate and elaborate, exhibiting rhythmic evolution through a sequence of homogeneous sections marked by shared rhythmic characteristics. Each section has a specific structure and name associated with it. Tabla learning and performance in the Indian subcontinen…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 35 pages, 11 figures

  6. arXiv:2211.01603

    q-bio.GN cs.LG eess.SP

    Using Signal Processing in Tandem With Adapted Mixture Models for Classifying Genomic Signals

    Authors: Saish Jaiswal, Shreya Nema, Hema A Murthy, Manikandan Narayanan

    Abstract: Genomic signal processing has been used successfully in bioinformatics to analyze biomolecular sequences and gain varied insights into DNA structure, gene organization, protein binding, sequence evolution, etc. But challenges remain in finding the appropriate spectral representation of a biomolecular sequence, especially when multiple variable-length sequences need to be handled consistently. In t…

    Submitted 3 November, 2022; originally announced November 2022.

  7. arXiv:2210.17153

    eess.AS cs.SD

    The Importance of Accurate Alignments in End-to-End Speech Synthesis

    Authors: Anusha Prakash, Hema A Murthy

    Abstract: Unit selection synthesis systems required accurate segmentation and labeling of the speech signal owing to the concatenative nature. Hidden Markov model-based speech synthesis accommodates some transcription errors, but it was later shown that accurate transcriptions yield highly intelligible speech with smaller amounts of training data. With the arrival of end-to-end (E2E) systems, it was observe…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Version 1 uploaded

  8. arXiv:2108.02517

    cs.IT cs.AI cs.LG

    Multi-task Federated Edge Learning (MtFEEL) in Wireless Networks

    Authors: Sawan Singh Mahara, Shruti M., B. N. Bharath, Akash Murthy

    Abstract: Federated Learning (FL) has evolved as a promising technique to handle distributed machine learning across edge devices. A single neural network (NN) that optimises a global objective is generally learned in most work in FL, which could be suboptimal for edge devices. Although works finding a NN personalised for edge device specific tasks exist, they lack generalisation and/or convergence guarante…

    Submitted 9 March, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

  9. arXiv:2103.03215

    eess.AS cs.SD

    Front-end Diarization for Percussion Separation in Taniavartanam of Carnatic Music Concerts

    Authors: Nauman Dawalatabad, Jilt Sebastian, Jom Kuriakose, C. Chandra Sekhar, Shrikanth Narayanan, Hema A. Murthy

    Abstract: Instrument separation in an ensemble is a challenging task. In this work, we address the problem of separating the percussive voices in the taniavartanam segments of Carnatic music. In taniavartanam, a number of percussive instruments play together or in tandem. Separation of instruments in regions where only one percussion is present leads to interference and artifacts at the output, as source se…

    Submitted 4 March, 2021; originally announced March 2021.

  10. arXiv:2011.07279

    cs.CV

    Towards Zero-Shot Learning with Fewer Seen Class Examples

    Authors: Vinay Kumar Verma, Ashish Mishra, Anubha Pandey, Hema A. Murthy, Piyush Rai

    Abstract: We present a meta-learning based generative model for zero-shot learning (ZSL) towards a challenging setting when the number of training examples from each seen class is very few. This setup contrasts with the conventional ZSL approaches, where training typically assumes the availability of a sufficiently large number of training examples from each of the seen classes. The proposed approach…

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Accepted in WACV 2021

  11. arXiv:2011.02195

    eess.SP cs.LG cs.SD eess.AS

    Correlation based Multi-phasal models for improved imagined speech EEG recognition

    Authors: Rini A Sharon, Hema A Murthy

    Abstract: Translation of imagined speech electroencephalogram (EEG) into human understandable commands greatly facilitates the design of naturalistic brain computer interfaces. To achieve improved imagined speech unit classification, this work aims to profit from the parallel information contained in multi-phasal EEG data recorded while speaking, imagining and performing articulatory movements corresponding…

    Submitted 4 November, 2020; originally announced November 2020.

    Journal ref: Interspeech SMM 2020

  12. arXiv:2010.05497

    cs.LG eess.SP q-bio.NC

    The "Sound of Silence" in EEG -- Cognitive voice activity detection

    Authors: Rini A Sharon, Hema A Murthy

    Abstract: Speech cognition bears potential application as a brain computer interface that can improve the quality of life for the otherwise communication impaired people. While speech and resting state EEG are popularly studied, here we attempt to explore a "non-speech" (NS) state of brain activity corresponding to the silence regions of speech audio. Firstly, speech perception is studied to inspect the exis…

    Submitted 12 October, 2020; originally announced October 2020.

  13. arXiv:2009.04983

    eess.AS cs.SD

    Exploration of End-to-end Synthesisers for Zero Resource Speech Challenge 2020

    Authors: Karthik Pandia D S, Anusha Prakash, Mano Ranjith Kumar, Hema A Murthy

    Abstract: A Spoken dialogue system for an unseen language is referred to as Zero resource speech. It is especially beneficial for developing applications for languages that have low digital resources. Zero resource speech synthesis is the task of building text-to-speech (TTS) models in the absence of transcriptions. In this work, speech is modelled as a sequence of transient and steady-state acoustic units,…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: Accepted for publication in Interspeech 2020

  14. Evidence of Task-Independent Person-Specific Signatures in EEG using Subspace Techniques

    Authors: Mari Ganesh Kumar, Shrikanth Narayanan, Mriganka Sur, Hema A Murthy

    Abstract: Electroencephalography (EEG) signals are promising as alternatives to other biometrics owing to their protection against spoofing. Previous studies have focused on capturing individual variability by analyzing task/condition-specific EEG. This work attempts to model biometric signatures independent of task/condition by normalizing the associated variance. Toward this goal, the paper extends ideas…

    Submitted 25 March, 2021; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE Transactions on Information Forensics and Security, 2021

  15. Zero resource speech synthesis using transcripts derived from perceptual acoustic units

    Authors: Karthik Pandia D S, Hema A Murthy

    Abstract: Zerospeech synthesis is the task of building vocabulary independent speech synthesis systems, where transcriptions are not available for training data. It is, therefore, necessary to convert training data into a sequence of fundamental acoustic units that can be used for synthesis during the test. This paper attempts to discover, and model perceptual acoustic units consisting of steady-state, and…

    Submitted 8 June, 2020; originally announced June 2020.

  16. arXiv:2001.06657

    cs.CV cs.IR cs.LG stat.ML

    Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval

    Authors: Anubha Pandey, Ashish Mishra, Vinay Kumar Verma, Anurag Mittal, Hema A. Murthy

    Abstract: Conventional approaches to Sketch-Based Image Retrieval (SBIR) assume that the data of all the classes are available during training. The assumption may not always be practical since the data of a few classes may be unavailable, or the classes may not appear at the time of training. Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) relaxes this constraint and allows the algorithm to handle previous…

    Submitted 18 January, 2020; originally announced January 2020.

    Comments: Accepted in WACV'2020

  17. arXiv:1904.07453

    eess.AS cs.CR cs.LG cs.SD

    Spoof detection using time-delay shallow neural network and feature switching

    Authors: Mari Ganesh Kumar, Suvidha Rupesh Kumar, Saranya M, B. Bharathi, Hema A. Murthy

    Abstract: Detecting spoofed utterances is a fundamental problem in voice-based biometrics. Spoofing can be performed either by logical accesses like speech synthesis, voice conversion or by physical accesses such as replaying the pre-recorded utterance. Inspired by the state-of-the-art x-vector based speaker verification approach, this paper proposes a time-delay shallow neural network (TD-SNN) for s…

    Submitted 23 January, 2020; v1 submitted 16 April, 2019; originally announced April 2019.

    Journal ref: 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1011--1017

  18. arXiv:1903.10870

    cs.LG stat.ML

    Algorithms and Improved bounds for online learning under finite hypothesis class

    Authors: Ankit Sharma, Late C. A. Murthy

    Abstract: Online learning is the process of answering a sequence of questions based on the correct answers to the previous questions. It is studied in many research areas such as game theory, information theory and machine learning. There are two main components of online learning framework. First, the learning algorithm also known as the learner and second, the hypothesis class which is essentially a set o…

    Submitted 24 March, 2019; originally announced March 2019.

    Comments: 17 pages, 2 figures, 9 tables

  19. arXiv:1903.10672

    cs.LG stat.ML

    Robustness of Neural Networks to Parameter Quantization

    Authors: Abhishek Murthy, Himel Das, Md Ariful Islam

    Abstract: Quantization, a commonly used technique to reduce the memory footprint of a neural network for edge computing, entails reducing the precision of the floating-point representation used for the parameters of the network. The impact of such rounding-off errors on the overall performance of the neural network is estimated using testing, which is not exhaustive and thus cannot be used to guarantee the…

    Submitted 26 March, 2019; originally announced March 2019.

  20. Incremental Transfer Learning in Two-pass Information Bottleneck based Speaker Diarization System for Meetings

    Authors: Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar, Hema A Murthy

    Abstract: The two-pass information bottleneck (TPIB) based speaker diarization system operates independently on different conversational recordings. TPIB system does not consider previously learned speaker discriminative information while diarizing new conversations. Hence, the real time factor (RTF) of TPIB system is high owing to the training time required for the artificial neural network (ANN). This pap…

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: 5 pages, 2 figures, To appear in Proc. ICASSP 2019, May 12-17, 2019, Brighton, UK

  21. arXiv:1811.04661

    cs.CV cs.LG

    RelDenClu: A Relative Density based Biclustering Method for identifying non-linear feature relations

    Authors: Namita Jain, Susmita Ghosh, C. A. Murthy

    Abstract: The existing biclustering algorithms for finding feature relation based biclusters often depend on assumptions like monotonicity or linearity. Though a few algorithms overcome this problem by using density-based methods, they tend to miss out many biclusters because they use global criteria for identifying dense regions. The proposed method, RelDenClu uses the local variations in marginal and join…

    Submitted 11 May, 2021; v1 submitted 12 November, 2018; originally announced November 2018.

  22. arXiv:1709.00663

    cs.CV

    A Generative Model For Zero Shot Learning Using Conditional Variational Autoencoders

    Authors: Ashish Mishra, M Shiva Krishna Reddy, Anurag Mittal, Hema A Murthy

    Abstract: Zero shot learning in Image Classification refers to the setting where images from some novel classes are absent in the training data but other information such as natural language descriptions or attribute vectors of the classes are available. This setting is important in the real world since one may not be able to obtain images of all the possible classes at training. While previous approaches h…

    Submitted 27 January, 2018; v1 submitted 3 September, 2017; originally announced September 2017.

  23. arXiv:1702.00787

    cs.DS cs.DC cs.DM

    Distributed Approximation Algorithms for the Multiple Knapsack Problem

    Authors: Ananth Murthy, Chandan Yeshwanth, Shrisha Rao

    Abstract: We consider the distributed version of the Multiple Knapsack Problem (MKP), where $m$ items are to be distributed amongst $n$ processors, each with a knapsack. We propose different distributed approximation algorithms with a tradeoff between time and message complexities. The algorithms are based on the greedy approach of assigning the best item to the knapsack with the largest capacity. These alg…

    Submitted 2 February, 2017; originally announced February 2017.

    Comments: 18 pages

    MSC Class: 68W15; 68W15

    ACM Class: C.2.4; I.1.2

  24. arXiv:1612.07074

    cs.DM cs.SI physics.soc-ph

    Sparsity Measure of a Network Graph: Gini Index

    Authors: Swati Goswami, C. A. Murthy, Asit K. Das

    Abstract: This article examines the application of a popular measure of sparsity, Gini Index, on network graphs. A wide variety of network graphs happen to be sparse. But the index with which sparsity is commonly measured in network graphs is edge density, reflecting the proportion of the sum of the degrees of all nodes in the graph compared to the total possible degrees in the corresponding fully connected…

    Submitted 21 December, 2016; originally announced December 2016.

    Comments: 15 pages with 6 figures

    MSC Class: 68R10; 05C82

    ACM Class: G.2.2; G.2.3

  25. arXiv:1603.05435

    cs.SD

    Modified Group Delay Based MultiPitch Estimation in Co-Channel Speech

    Authors: Rajeev Rajan, Hema A. Murthy

    Abstract: Phase processing has been replaced by group delay processing for the extraction of source and system parameters from speech. Group delay functions are ill-behaved when the transfer function has zeros that are close to unit circle in the z-domain. The modified group delay function addresses this problem and has been successfully used for formant and monopitch estimation. In this paper, modified gro…

    Submitted 17 March, 2016; originally announced March 2016.

  26. A new estimate of mutual information based measure of dependence between two variables: properties and fast implementation

    Authors: Namita Jain, C. A. Murthy

    Abstract: This article proposes a new method to estimate an existing mutual information based dependence measure using histogram density estimates. Finding a suitable bin length for histogram is an open problem. We propose a new way of computing the bin length for histogram using a function of maximum separation between points. The chosen bin length leads to consistent density estimates for histogram method…

    Submitted 13 September, 2015; v1 submitted 28 October, 2014; originally announced November 2014.

    Comments: International Journal of Machine Learning and Cybernetics, Springer Berlin Heidelberg, 10-Sep-2015