Skip to main content

Showing 1–42 of 42 results for author: Deshmukh, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17713  [pdf, other

    cs.NE

    Multi-objective Binary Differential Approach with Parameter Tuning for Discovering Business Process Models: MoD-ProM

    Authors: Sonia Deshmukh, Shikha Gupta, Naveen Kumar

    Abstract: Process discovery approaches analyze the business data to automatically uncover structured information, known as a process model. The quality of a process model is measured using quality dimensions -- completeness (replay fitness), preciseness, simplicity, and generalization. Traditional process discovery algorithms usually output a single process model. A single model may not accurately capture t… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.05398  [pdf, other

    cs.AR

    Evaluation of Posits for Spectral Analysis Using a Software-Defined Dataflow Architecture

    Authors: Sameer Deshmukh, Daniel Khankin, William Killian, John Gustafson, Elad Raz

    Abstract: Spectral analysis plays an important role in detection of damage in structures and deep learning. The choice of a floating-point format plays a crucial role in determining the accuracy and performance of spectral analysis. The IEEE Std 754\textsuperscript{TM} floating-point format (IEEE~754 for short) is supported by most major hardware vendors for ``normal'' floats. However, it has several limita… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  3. arXiv:2402.09585  [pdf, other

    cs.SD eess.AS

    Domain Adaptation for Contrastive Audio-Language Models

    Authors: Soham Deshmukh, Rita Singh, Bhiksha Raj

    Abstract: Audio-Language Models (ALM) aim to be general-purpose audio models by providing zero-shot capabilities at test time. The zero-shot performance of ALM improves by using suitable text prompts for each domain. The text prompts are usually hand-crafted through an ad-hoc process and lead to a drop in ALM generalization and out-of-distribution performance. Existing approaches to improve domain performan… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2402.00282  [pdf, other

    eess.AS cs.SD

    PAM: Prompting Audio-Language Models for Audio Quality Assessment

    Authors: Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang

    Abstract: While audio quality is a key performance metric for various audio processing tasks, including generative modeling, its objective measurement remains a challenge. Audio-Language Models (ALMs) are pre-trained on audio-text pairs that may contain information about audio quality, the presence of artifacts, or noise. Given an audio input and a text prompt related to quality, an ALM can be used to calcu… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  5. arXiv:2401.08264  [pdf, ps, other

    cs.SE cs.PL

    Towards a Transpiler for C/C++ to Safer Rust

    Authors: Dhiren Tripuramallu, Swapnil Singh, Shrirang Deshmukh, Srinivas Pinisetty, Shinde Arjun Shivaji, Raja Balusamy, Ajaganna Bandeppa

    Abstract: Rust is a multi-paradigm programming language developed by Mozilla that focuses on performance and safety. Rust code is arguably known best for its speed and memory safety, a property essential while develo** embedded systems. Thus, it becomes one of the alternatives when develo** operating systems for embedded devices. How to convert an existing C++ code base to Rust is also gaining greater a… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  6. arXiv:2311.07602  [pdf, other

    cs.PF cs.MS

    Cache Optimization and Performance Modeling of Batched, Small, and Rectangular Matrix Multiplication on Intel, AMD, and Fujitsu Processors

    Authors: Sameer Deshmukh, Rio Yokota, George Bosilca

    Abstract: Factorization and multiplication of dense matrices and tensors are critical, yet extremely expensive pieces of the scientific toolbox. Careful use of low rank approximation can drastically reduce the computation and memory requirements of these operations. In addition to a lower arithmetic complexity, such methods can, by their structure, be designed to efficiently exploit modern hardware architec… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  7. arXiv:2311.00921  [pdf, other

    math.NA cs.MS

    $O(N)$ distributed direct factorization of structured dense matrices using runtime systems

    Authors: Sameer Deshmukh, Qinxiang Ma, Rio Yokota, George Bosilca

    Abstract: Structured dense matrices result from boundary integral problems in electrostatics and geostatistics, and also Schur complements in sparse preconditioners such as multi-frontal methods. Exploiting the structure of such matrices can reduce the time for dense direct factorization from $O(N^3)$ to $O(N)$. The Hierarchically Semi-Separable (HSS) matrix is one such low rank matrix format that can be fa… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  8. arXiv:2310.04445  [pdf, other

    cs.CL cs.AI cs.LG

    LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model

    Authors: Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphael Olivier, Ankit Shah, Joseph Konan, Dareen Alharthi, Hazim T Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh

    Abstract: It has been shown that Large Language Model (LLM) alignments can be circumvented by appending specially crafted attack suffixes with harmful queries to elicit harmful responses. To conduct attacks against private target models whose characterization is unknown, public models can be used as proxies to fashion the attack, with successful attacks being transferred from public proxies to private targe… ▽ More

    Submitted 21 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  9. arXiv:2310.02298  [pdf, other

    cs.SD cs.AI eess.AS

    Prompting Audios Using Acoustic Properties For Emotion Representation

    Authors: Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

    Abstract: Emotions lie on a continuum, but current models treat emotions as a finite valued discrete variable. This representation does not capture the diversity in the expression of emotion. To better represent emotions we propose the use of natural language descriptions (or prompts). In this work, we address the challenge of automatically generating these prompts and training a model to better learn emoti… ▽ More

    Submitted 6 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.07737

  10. arXiv:2310.01995  [pdf, other

    cs.CV

    Development of Machine Vision Approach for Mechanical Component Identification based on its Dimension and Pitch

    Authors: Toshit Jain, Faisel Mushtaq, K Ramesh, Sandip Deshmukh, Tathagata Ray, Chandu Parimi, Praveen Tandon, Pramod Kumar Jha

    Abstract: In this work, a highly customizable and scalable vision based system for automation of mechanical assembly lines is described. The proposed system calculates the features that are required to classify and identify the different kinds of bolts that are used in the assembly line. The system describes a novel method of calculating the pitch of the bolt in addition to bolt identification and calculati… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 8 pages

    ACM Class: I.4.7

  11. arXiv:2309.07372  [pdf, other

    eess.AS cs.SD

    Training Audio Captioning Models without Audio

    Authors: Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang

    Abstract: Automated Audio Captioning (AAC) is the task of generating natural language descriptions given an audio stream. A typical AAC system requires manually curated training data of audio segments and corresponding text caption annotations. The creation of these audio-caption pairs is costly, resulting in general data scarcity for the task. In this work, we address this major limitation and propose an a… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  12. arXiv:2309.05767  [pdf, other

    cs.SD eess.AS

    Natural Language Supervision for General-Purpose Audio Representations

    Authors: Benjamin Elizalde, Soham Deshmukh, Huaming Wang

    Abstract: Audio-Language models jointly learn multimodal text and audio representations that enable Zero-Shot inference. Models rely on the encoders to create powerful representations of the input and generalize to multiple tasks ranging from sounds, music, and speech. Although models have achieved remarkable performance, there is still a performance gap with task-specific models. In this paper, we propose… ▽ More

    Submitted 6 February, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  13. arXiv:2308.11239  [pdf, other

    cs.CV

    LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training

    Authors: Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: Learning object segmentation in image and video datasets without human supervision is a challenging problem. Humans easily identify moving salient objects in videos using the gestalt principle of common fate, which suggests that what moves together belongs together. Building upon this idea, we propose a self-supervised object discovery approach that leverages motion and appearance information to p… ▽ More

    Submitted 2 December, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted to British Machine Vision Conference (BMVC) 2023

  14. BEAVIS: Balloon Enabled Aerial Vehicle for IoT and Sensing

    Authors: Suryansh Sharma, Ashutosh Simha, R. Venkatesha Prasad, Shubham Deshmukh, Kavin B. Saravanan, Ravi Ramesh, Luca Mottola

    Abstract: UAVs are becoming versatile and valuable platforms for various applications. However, the main limitation is their flying time. We present BEAVIS, a novel aerial robotic platform striking an unparalleled trade-off between the manoeuvrability of drones and the long lasting capacity of blimps. BEAVIS scores highly in applications where drones enjoy unconstrained mobility yet suffer from limited life… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: To be published in the 29th Annual International Conference on Mobile Computing and Networking (ACM MobiCom 23), October 2-6, 2023, Madrid, Spain. ACM, New York, NY, USA, 15 pages

  15. arXiv:2307.13192  [pdf, other

    cs.AI cs.LG

    Counterfactual Explanation Policies in RL

    Authors: Shripad V. Deshmukh, Srivatsan R, Supriti Vijay, Jayakumar Subramanian, Chirag Agarwal

    Abstract: As Reinforcement Learning (RL) agents are increasingly employed in diverse decision-making problems using reward preferences, it becomes important to ensure that policies learned by these frameworks in map** observations to a probability distribution of the possible actions are explainable. However, there is little to no work in the systematic understanding of these complex policies in a contras… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: ICML Workshop on Counterfactuals in Minds and Machines, 2023

  16. arXiv:2307.04392  [pdf, other

    cs.CV

    FODVid: Flow-guided Object Discovery in Videos

    Authors: Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy

    Abstract: Segmentation of objects in a video is challenging due to the nuances such as motion blurring, parallax, occlusions, changes in illumination, etc. Instead of addressing these nuances separately, we focus on building a generalizable solution that avoids overfitting to the individual intricacies. Such a solution would also help us save enormous resources involved in human annotation of video corpora.… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: CVPR 2023 (L3D-IVU workshop)

  17. arXiv:2305.11834  [pdf, other

    eess.AS cs.SD

    Pengi: An Audio Language Model for Audio Tasks

    Authors: Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang

    Abstract: In the domain of audio processing, Transfer Learning has facilitated the rise of Self-Supervised Learning and Zero-Shot Learning techniques. These approaches have led to the development of versatile models capable of tackling a wide array of tasks, while delivering state-of-the-art performance. However, current models inherently lack the capacity to produce the requisite language for open-ended ta… ▽ More

    Submitted 18 January, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023. The manuscript is updated with additional experiments suggested by reviewers

  18. arXiv:2305.04073  [pdf, other

    cs.AI cs.LG

    Explaining RL Decisions with Trajectories

    Authors: Shripad Vilasrao Deshmukh, Arpan Dasgupta, Balaji Krishnamurthy, Nan Jiang, Chirag Agarwal, Georgios Theocharous, Jayakumar Subramanian

    Abstract: Explanation is a key component for the adoption of reinforcement learning (RL) in many real-world decision-making problems. In the literature, the explanation is often provided by saliency attribution to the features of the RL agent's state. In this work, we propose a complementary approach to these explanations, particularly for offline RL, where we attribute the policy decisions of a trained RL… ▽ More

    Submitted 22 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Published at International Conference on Learning Representations (ICLR), 2023

  19. arXiv:2302.09719  [pdf, ps, other

    eess.AS cs.SD

    Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session

    Authors: Laurie M. Heller, Benjamin Elizalde, Bhiksha Raj, Soham Deshmukh

    Abstract: Machine Listening, as usually formalized, attempts to perform a task that is, from our perspective, fundamentally human-performable, and performed by humans. Current automated models of Machine Listening vary from purely data-driven approaches to approaches imitating human systems. In recent years, the most promising approaches have been hybrid in that they have used data-driven approaches informe… ▽ More

    Submitted 23 February, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: 4 pages. Summary of Special Session planned for 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://2023.ieeeicassp.org/ Second version has corrected spelling of an author's name

  20. arXiv:2211.07737  [pdf, other

    cs.SD cs.LG eess.AS

    Describing emotions with acoustic property prompts for speech emotion recognition

    Authors: Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

    Abstract: Emotions lie on a broad continuum and treating emotions as a discrete number of classes limits the ability of a model to capture the nuances in the continuum. The challenge is how to describe the nuances of emotions and how to enable a model to learn the descriptions. In this work, we devise a method to automatically create a description (or prompt) for a given audio by computing acoustic properti… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  21. arXiv:2211.05100  [pdf, other

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major , et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access… ▽ More

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  22. arXiv:2209.14275  [pdf, other

    eess.AS cs.AI

    Audio Retrieval with WavText5K and CLAP Training

    Authors: Soham Deshmukh, Benjamin Elizalde, Huaming Wang

    Abstract: Audio-Text retrieval takes a natural language query to retrieve relevant audio files in a database. Conversely, Text-Audio retrieval takes an audio file as a query to retrieve relevant natural language descriptions. Most of the literature train retrieval systems with one audio captioning dataset, but evaluating the benefit of training with multiple datasets is underexplored. Moreover, retrieval sy… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  23. arXiv:2209.06584  [pdf, other

    cs.CV

    One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text

    Authors: Abhinav Java, Shripad Deshmukh, Milan Aggarwal, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: Active consumption of digital documents has yielded scope for research in various applications, including search. Traditionally, searching within a document has been cast as a text matching problem ignoring the rich layout and visual cues commonly present in structured documents, forms, etc. To that end, we ask a mostly unexplored question: "Can we search for other similar snippets present in a ta… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  24. arXiv:2209.03578  [pdf

    cs.CV

    Sign Language Detection

    Authors: Shubham Deshmukh, Favin Fernandes, Amey Chavan

    Abstract: With the advancements in Computer vision techniques the need to classify images based on its features have become a huge task and necessity. In this project we proposed 2 models i.e. feature extraction and classification using ORB and SVM and the second is using CNN architecture. The end result of the project is to understand the concept behind feature extraction and image classification. The trai… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 8 pages, 10 figures

  25. arXiv:2209.03576  [pdf

    cs.CV

    Suspicious and Anomaly Detection

    Authors: Shubham Deshmukh, Favin Fernandes, Monali Ahire, Devarshi Borse, Amey Chavan

    Abstract: In this project we propose a CNN architecture to detect anomaly and suspicious activities; the activities chosen for the project are running, jum** and kicking in public places and carrying gun, bat and knife in public places. With the trained model we compare it with the pre-existing models like Yolo, vgg16, vgg19. The trained Model is then implemented for real time detection and also used the.… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 7 pages, 10 figures

  26. arXiv:2209.03570  [pdf

    cs.CV

    SANIP: Shop** Assistant and Navigation for the visually impaired

    Authors: Shubham Deshmukh, Favin Fernandes, Amey Chavan, Monali Ahire, Devashri Borse, Jyoti Madake

    Abstract: The proposed shop** assistant model SANIP is going to help blind persons to detect hand held objects and also to get a video feedback of the information retrieved from the detected and recognized objects. The proposed model consists of three python models i.e. Custom Object Detection, Text Detection and Barcode detection. For object detection of the hand held object, we have created our own cust… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 6 pages, 8 figures. arXiv admin note: text overlap with arXiv:2011.04244 by other authors

  27. arXiv:2208.09439  [pdf, other

    cs.CL

    Adapting Task-Oriented Dialogue Models for Email Conversations

    Authors: Soham Deshmukh, Charles Lee

    Abstract: Intent detection is a key part of any Natural Language Understanding (NLU) system of a conversational assistant. Detecting the correct intent is essential yet difficult for email conversations where multiple directives and intents are present. In such settings, conversation context can become a key disambiguating factor for detecting the user's request from the assistant. One prominent way of inco… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

  28. arXiv:2206.15076  [pdf, other

    cs.CL

    BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

    Authors: Jason Alan Fries, Leon Weber, Natasha Seelam, Gabriel Altay, Debajyoti Datta, Samuele Garda, Myungsun Kang, Ruisi Su, Wojciech Kusa, Samuel Cahyawijaya, Fabio Barth, Simon Ott, Matthias Samwald, Stephen Bach, Stella Biderman, Mario Sänger, Bo Wang, Alison Callahan, Daniel León Periñán, Théo Gigant, Patrick Haller, Jenny Chim, Jose David Posada, John Michael Giorgi, Karthik Rangasai Sivaraman , et al. (18 additional authors not shown)

    Abstract: Training and evaluating language models increasingly requires the construction of meta-datasets --diverse collections of curated data with clear provenance. Natural language prompting has recently lead to improved zero-shot generalization by transforming existing, supervised datasets into a diversity of novel pretraining tasks, highlighting the benefits of meta-dataset curation. While successful i… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Submitted to NeurIPS 2022 Datasets and Benchmarks Track

  29. arXiv:2206.04769  [pdf, other

    cs.SD eess.AS

    CLAP: Learning Audio Concepts From Natural Language Supervision

    Authors: Benjamin Elizalde, Soham Deshmukh, Mahmoud Al Ismail, Huaming Wang

    Abstract: Mainstream Audio Analytics models are trained to learn under the paradigm of one class label to many recordings focusing on one task. Learning under such restricted supervision limits the flexibility of models because they require labeled audio for training and can only predict the predefined categories. Instead, we propose to learn audio concepts from natural language supervision. We call our app… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  30. arXiv:2205.03513  [pdf, other

    eess.SP cs.AI cs.LG

    Digital Twin Framework for Time to Failure Forecasting of Wind Turbine Gearbox: A Concept

    Authors: Mili Wadhwani, Sakshi Deshmukh, Harsh S. Dhiman

    Abstract: Wind turbine is a complex machine with its rotating and non-rotating equipment being sensitive to faults. Due to increased wear and tear, the maintenance aspect of a wind turbine is of critical importance. Unexpected failure of wind turbine components can lead to increased O\&M costs which ultimately reduces effective power capture of a wind farm. Fault detection in wind turbines is often suppleme… ▽ More

    Submitted 28 April, 2022; originally announced May 2022.

  31. arXiv:2110.02148  [pdf, other

    cs.CL cs.LG

    NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback

    Authors: Ruijie Zhou, Soham Deshmukh, Jeremiah Greer, Charles Lee

    Abstract: Current research in dialogue systems is focused on conversational assistants working on short conversations in either task-oriented or open domain settings. In this paper, we focus on improving task-based conversational assistants online, primarily those working on document-type conversations (e.g., emails) whose contents may or may not be completely related to the assistant's task. We propose "NA… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  32. arXiv:2108.13307   

    cs.CR

    Security For System-On-Chip (SoC) Using Neural Networks

    Authors: Vedant Ghodke, Shubham Deshmukh, Atharva Deshpande, Ninad Ekbote, Swati Shilaskar

    Abstract: With the growth of embedded systems, VLSI design phases complexity and cost factors across the globe and has become outsourced. Modern computing ICs are now using system-on-chip for better on-chip processing and communication. In the era of Internet-of-Things (IoT), security has become one of the most crucial parts of a System-on-Chip (SoC). Malicious activities generate abnormal traffic patterns… ▽ More

    Submitted 28 September, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: Challenges with content validity

  33. arXiv:2106.06858  [pdf, other

    eess.AS cs.LG

    Improving weakly supervised sound event detection with self-supervised auxiliary tasks

    Authors: Soham Deshmukh, Bhiksha Raj, Rita Singh

    Abstract: While multitask and transfer learning has shown to improve the performance of neural networks in limited data settings, they require pretraining of the model on large datasets beforehand. In this paper, we focus on improving the performance of weakly supervised sound event detection in low data and noisy settings simultaneously without requiring any pretraining task. To that extent, we propose a s… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: Accepted at INTERSPEECH 21

  34. Interpreting glottal flow dynamics for detecting COVID-19 from voice

    Authors: Soham Deshmukh, Mahmoud Al Ismail, Rita Singh

    Abstract: In the pathogenesis of COVID-19, impairment of respiratory functions is often one of the key symptoms. Studies show that in these cases, voice production is also adversely affected -- vocal fold oscillations are asynchronous, asymmetrical and more restricted during phonation. This paper proposes a method that analyzes the differential dynamics of the glottal flow waveform (GFW) during voice produc… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

  35. arXiv:2010.10707  [pdf, other

    eess.AS cs.LG cs.SD

    Detection of COVID-19 through the analysis of vocal fold oscillations

    Authors: Mahmoud Al Ismail, Soham Deshmukh, Rita Singh

    Abstract: Phonation, or the vibration of the vocal folds, is the primary source of vocalization in the production of voiced sounds by humans. It is a complex bio-mechanical process that is highly sensitive to changes in the speaker's respiratory parameters. Since most symptomatic cases of COVID-19 present with moderate to severe impairment of respiratory functions, we hypothesize that signatures of COVID-19… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: 5 pages, 6 figures

  36. arXiv:2008.07085  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection

    Authors: Soham Deshmukh, Bhiksha Raj, Rita Singh

    Abstract: Weakly Labelled learning has garnered lot of attention in recent years due to its potential to scale Sound Event Detection (SED) and is formulated as Multiple Instance Learning (MIL) problem. This paper proposes a Multi-Task Learning (MTL) framework for learning from Weakly Labelled Audio data which encompasses the traditional MIL setup. To show the utility of proposed framework, we use the input… ▽ More

    Submitted 29 October, 2020; v1 submitted 17 August, 2020; originally announced August 2020.

  37. arXiv:2002.11500  [pdf, other

    eess.SP cs.IT

    Robust Underlay Device-to-Device Communications on Multiple Channels

    Authors: Mohamed Elnourani, Siddharth Deshmukh, Baltasar Beferull-Lozano, Daniel Romero

    Abstract: Most recent works in device-to-device (D2D) underlay communications focus on the optimization of either power or channel allocation to improve the spectral efficiency, and typically consider uplink and downlink separately. Further, several of them also assume perfect knowledge of channel-stateinformation (CSI). In this paper, we formulate a joint uplink and downlink resource allocation scheme, whi… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: 30 pages, 7 figures, 2 table. Submitted to IEEE Transactions on Wireless Communications

  38. arXiv:1912.12191  [pdf, other

    cs.CV cs.AI

    Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution

    Authors: Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

    Abstract: As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned agents. Saliency maps explain agent behavior by highlighting the features of the input state that are most relevant for the agent in taking an action. Existing perturbation-based approaches to compute saliency often highlight regions of the input that are not relevant t… ▽ More

    Submitted 3 April, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: Accepted at the International Conference on Learning Representations (ICLR) 2020

  39. arXiv:1912.03718  [pdf, other

    stat.ME cs.LG eess.SP

    Improved Covariance Matrix Estimator using Shrinkage Transformation and Random Matrix Theory

    Authors: Samruddhi Deshmukh, Amartansh Dubey

    Abstract: One of the major challenges in multivariate analysis is the estimation of population covariance matrix from sample covariance matrix (SCM). Most recent covariance matrix estimators use either shrinkage transformations or asymptotic results from Random Matrix Theory (RMT). Shrinkage techniques help in pulling extreme correlation values towards certain target values whereas tools from RMT help in re… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

  40. arXiv:1905.11824  [pdf, other

    cs.LG cs.CR stat.AP stat.ML

    Attacker Behaviour Profiling using Stochastic Ensemble of Hidden Markov Models

    Authors: Soham Deshmukh, Rahul Rade, Dr. Faruk Kazi

    Abstract: Cyber threat intelligence is one of the emerging areas of focus in information security. Much of the recent work has focused on rule-based methods and detection of network attacks using Intrusion Detection algorithms. In this paper we propose a framework for inspecting and modelling the behavioural aspect of an attacker to obtain better insight predictive power on his future actions. For modelling… ▽ More

    Submitted 6 June, 2021; v1 submitted 28 May, 2019; originally announced May 2019.

  41. arXiv:1608.05513   

    cs.AI cs.NE

    Data Centroid Based Multi-Level Fuzzy Min-Max Neural Network

    Authors: Shraddha Deshmukh, Sagar Gandhi, Pratap Sanap, Vivek Kulkarni

    Abstract: Recently, a multi-level fuzzy min max neural network (MLF) was proposed, which improves the classification accuracy by handling an overlapped region (area of confusion) with the help of a tree structure. In this brief, an extension of MLF is proposed which defines a new boundary region, where the previously proposed methods mark decisions with less confidence and hence misclassification is more fr… ▽ More

    Submitted 20 December, 2016; v1 submitted 19 August, 2016; originally announced August 2016.

    Comments: This paper has been withdrawn by the author due to crucial evidence that the similar work has already been published

  42. arXiv:1103.0633  [pdf

    cs.DB

    RDBNorma: - A semi-automated tool for relational database schema normalization up to third normal form

    Authors: Y. V. Dongare, P. S. Dhabe, S. V. Deshmukh

    Abstract: In this paper a tool called RDBNorma is proposed, that uses a novel approach to represent a relational database schema and its functional dependencies in computer memory using only one linked list and used for semi-automating the process of relational database schema normalization up to third normal form. This paper addresses all the issues of representing a relational schema along with its functi… ▽ More

    Submitted 3 March, 2011; originally announced March 2011.

    Comments: 22 pages and international journal

    Journal ref: International Journal of Database Management Systems ( IJDMS ), Vol.3, No.1, February 2011