Skip to main content

Showing 1–50 of 174 results for author: Gupta, A

Searching in archive eess. Search in all archives.
.
  1. Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet

    Authors: Manish Dhakal, Arman Chhetri, Aman Kumar Gupta, Prabin Lamichhane, Suraj Pandey, Subarna Shakya

    Abstract: This paper presents an end-to-end deep learning model for Automatic Speech Recognition (ASR) that transcribes Nepali speech to text. The model was trained and tested on the OpenSLR (audio, text) dataset. The majority of the audio dataset have silent gaps at both ends which are clipped during dataset preprocessing for a more uniform map** of audio frames and their corresponding texts. Mel Frequen… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted at 2022 International Conference on Inventive Computation Technologies (ICICT), IEEE

    Journal ref: 2022 International Conference on Inventive Computation Technologies (ICICT), pp. 515-521

  2. arXiv:2406.08931  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging Co-Attention Cues in Multitask Learning

    Authors: Arnav Goel, Medha Hira, Anubha Gupta

    Abstract: Advent of modern deep learning techniques has given rise to advancements in the field of Speech Emotion Recognition (SER). However, most systems prevalent in the field fail to generalize to speakers not seen during training. This study focuses on handling challenges of multilingual SER, specifically on unseen speakers. We introduce CAMuLeNet, a novel architecture leveraging co-attention based fusi… ▽ More

    Submitted 19 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 5 pages, Accepted to INTERSPEECH 2024. The first two authors contributed equally

  3. arXiv:2406.07662  [pdf, other

    eess.IV cs.AI cs.CV cs.LG q-bio.NC

    Progress Towards Decoding Visual Imagery via fNIRS

    Authors: Michel Adamic, Wellington Avelino, Anna Brandenberger, Bryan Chiang, Hunter Davis, Stephen Fay, Andrew Gregory, Aayush Gupta, Raphael Hotter, Grace Jiang, Fiona Leng, Stephen Polcyn, Thomas Ribeiro, Paul Scotti, Michelle Wang, Marley Xiong, Jonathan Xu

    Abstract: We demonstrate the possibility of reconstructing images from fNIRS brain activity and start building a prototype to match the required specs. By training an image reconstruction model on downsampled fMRI data, we discovered that cm-scale spatial resolution is sufficient for image generation. We obtained 71% retrieval accuracy with 1-cm resolution, compared to 93% on the full-resolution fMRI, and 2… ▽ More

    Submitted 22 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.00022  [pdf, other

    cs.CL cs.SD eess.AS

    Multilingual Prosody Transfer: Comparing Supervised & Transfer Learning

    Authors: Arnav Goel, Medha Hira, Anubha Gupta

    Abstract: The field of prosody transfer in speech synthesis systems is rapidly advancing. This research is focused on evaluating learning methods for adapting pre-trained monolingual text-to-speech (TTS) models to multilingual conditions, i.e., Supervised Fine-Tuning (SFT) and Transfer Learning (TL). This comparison utilizes three distinct metrics: Mean Opinion Score (MOS), Recognition Accuracy (RA), and Me… ▽ More

    Submitted 18 June, 2024; v1 submitted 23 May, 2024; originally announced June 2024.

    Comments: 7 pages, Accepted to ICLR 2024 - Tiny Track

  5. arXiv:2406.00021  [pdf, other

    cs.CL cs.SD eess.AS

    CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning

    Authors: Medha Hira, Arnav Goel, Anubha Gupta

    Abstract: This paper presents CrossVoice, a novel cascade-based Speech-to-Speech Translation (S2ST) system employing advanced ASR, MT, and TTS technologies with cross-lingual prosody preservation through transfer learning. We conducted comprehensive experiments comparing CrossVoice with direct-S2ST systems, showing improved BLEU scores on tasks such as Fisher Es-En, VoxPopuli Fr-En and prosody preservation… ▽ More

    Submitted 18 June, 2024; v1 submitted 23 May, 2024; originally announced June 2024.

    Comments: 8 pages, Accepted at ICLR 2024 - Tiny Track

  6. arXiv:2405.13762  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation

    Authors: Gwanghyun Kim, Alonso Martinez, Yu-Chuan Su, Brendan Jou, José Lezama, Agrim Gupta, Lijun Yu, Lu Jiang, Aren Jansen, Jacob Walker, Krishna Somandepalli

    Abstract: Training diffusion models for audiovisual sequences allows for a range of generation tasks by learning conditional distributions of various input-output combinations of the two modalities. Nevertheless, this strategy often requires training a separate model for each task which is expensive. Here, we propose a novel training approach to effectively learn arbitrary conditional distributions in the a… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  7. arXiv:2405.10750  [pdf, other

    eess.SY cs.LG

    Parameter Identification for Electrochemical Models of Lithium-Ion Batteries Using Bayesian Optimization

    Authors: Jianzong Pi, Samuel Filgueira da Silva, Mehmet Fatih Ozkan, Abhishek Gupta, Marcello Canova

    Abstract: Efficient parameter identification of electrochemical models is crucial for accurate monitoring and control of lithium-ion cells. This process becomes challenging when applied to complex models that rely on a considerable number of interdependent parameters that affect the output response. Gradient-based and metaheuristic optimization techniques, although previously employed for this task, are lim… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 6 pages

  8. arXiv:2405.04023  [pdf, other

    eess.IV cs.CV

    Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI

    Authors: Rikathi Pal, Sudeshna Mondal, Aditi Gupta, Priya Saha, Somoballi Ghoshal, Amlan Chakrabarti, Susmita Sur-Kolay

    Abstract: In medical imaging, segmentation and localization of spinal tumors in three-dimensional (3D) space pose significant computational challenges, primarily stemming from limited data availability. In response, this study introduces a novel data augmentation technique, aimed at automating spine tumor segmentation and localization through AI approaches. Leveraging a fusion of fuzzy c-means clustering an… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 9 pages, 12 figures

  9. arXiv:2404.12308  [pdf, other

    cs.RO cs.LG eess.SY

    ASID: Active Exploration for System Identification in Robotic Manipulation

    Authors: Marius Memmel, Andrew Wagenmaker, Chuning Zhu, Patrick Yin, Dieter Fox, Abhishek Gupta

    Abstract: Model-free control strategies such as reinforcement learning have shown the ability to learn control strategies without requiring an accurate model or simulator of the world. While this is appealing due to the lack of modeling requirements, such methods can be sample inefficient, making them impractical in many real-world domains. On the other hand, model-based control techniques leveraging accura… ▽ More

    Submitted 26 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Project website at https://weirdlabuw.github.io/asid

  10. arXiv:2403.16175  [pdf, other

    eess.IV cs.CV

    Enhancing MRI-Based Classification of Alzheimer's Disease with Explainable 3D Hybrid Compact Convolutional Transformers

    Authors: Arindam Majee, Avisek Gupta, Sourav Raha, Swagatam Das

    Abstract: Alzheimer's disease (AD), characterized by progressive cognitive decline and memory loss, presents a formidable global health challenge, underscoring the critical importance of early and precise diagnosis for timely interventions and enhanced patient outcomes. While MRI scans provide valuable insights into brain structures, traditional analysis methods often struggle to discern intricate 3D patter… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  11. arXiv:2403.13611  [pdf, other

    cs.NI eess.SP

    Densify & Conquer: Densified, smaller base-stations can conquer the increasing carbon footprint problem in nextG wireless

    Authors: Agrim Gupta, Adel Heidari, Jiaming **, Dinesh Bharadia

    Abstract: Connectivity on-the-go has been one of the most impressive technological achievements in the 2010s decade. However, multiple studies show that this has come at an expense of increased carbon footprint, that also rivals the entire aviation sector's carbon footprint. The two major contributors of this increased footprint are (a) smartphone batteries which affect the embodied footprint and (b) base-s… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 12 pages, 14 figures

  12. arXiv:2402.10211  [pdf, other

    cs.LG cs.RO eess.SP

    Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

    Authors: Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman, Carmel Majidi, Abhinav Gupta, Tess Hellebrekers, Lerrel Pinto

    Abstract: Reasoning from sequences of raw sensory data is a ubiquitous problem across fields ranging from medical devices to robotics. These problems often involve using long sequences of raw sensor data (e.g. magnetometers, piezoresistors) to predict sequences of desirable physical quantities (e.g. force, inertial measurements). While classical approaches are powerful for locally-linear prediction problems… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  13. arXiv:2402.09658  [pdf

    eess.IV cs.CV

    Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm

    Authors: Amir Mohammad Naderi, Jennifer G. Casey, Mao-Hsiang Huang, Rachelle Victorio, David Y. Chiang, Calum MacRae, Hung Cao, Vandana A. Gupta

    Abstract: Quantifying cardiovascular parameters like ejection fraction in zebrafish as a host of biological investigations has been extensively studied. Since current manual monitoring techniques are time-consuming and fallible, several image processing frameworks have been proposed to automate the process. Most of these works rely on supervised deep-learning architectures. However, supervised methods tend… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  14. arXiv:2402.00235  [pdf, other

    cs.CL cs.SD eess.AS

    Exploring the limits of decoder-only models trained on public speech recognition corpora

    Authors: Ankit Gupta, George Saon, Brian Kingsbury

    Abstract: The emergence of industrial-scale speech recognition (ASR) models such as Whisper and USM, trained on 1M hours of weakly labelled and 12M hours of audio only proprietary data respectively, has led to a stronger need for large scale public ASR corpora and competitive open source pipelines. Unlike the said models, large language models are typically based on Transformer decoders, and it remains uncl… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  15. arXiv:2401.04636  [pdf, other

    cs.IT eess.SP

    On the Target Detection Performance of a Molecular Communication Network with Multiple Mobile Nanomachines

    Authors: Nithin V. Sabu, Abhishek K. Gupta

    Abstract: A network of nanomachines (NMs) can be used to build a target detection system for a variety of promising applications. They have the potential to detect toxic chemicals, infectious bacteria, and biomarkers of dangerous diseases such as cancer within the human body. Many diseases and health disorders can be detected early and efficiently treated in the future by utilizing these systems. To fully g… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  16. arXiv:2312.14364  [pdf, other

    eess.SY

    GreenScan: Towards large-scale terrestrial monitoring the health of urban trees using mobile sensing

    Authors: Akshit Gupta, Simone Mora, Fan Zhang, Martine Rutten, R. Venkatesha Prasad, Carlo Ratti

    Abstract: Healthy urban greenery is a fundamental asset to mitigate climate change phenomena such as extreme heat and air pollution. However, urban trees are often affected by abiotic and biotic stressors that hamper their functionality, and whenever not timely managed, even their survival. While the current greenery inspection techniques can help in taking effective measures, they often require a high amou… ▽ More

    Submitted 6 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 13 pages, submitted to IEEE Sensors

  17. Model-Free Change Point Detection for Mixing Processes

    Authors: Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

    Abstract: This paper considers the change point detection problem under dependent samples. In particular, we provide performance guarantees for the MMD-CUSUM test under exponentially $α$, $β$, and fast $φ$-mixing processes, which significantly expands its utility beyond the i.i.d. and Markovian cases used in previous studies. We obtain lower bounds for average-run-length (ARL) and upper bounds for average-d… ▽ More

    Submitted 1 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 20 pages, 4 figures. Accepted by IEEE OJ-CSYS

  18. arXiv:2311.17475  [pdf, other

    cs.CV eess.IV

    CLiSA: A Hierarchical Hybrid Transformer Model using Orthogonal Cross Attention for Satellite Image Cloud Segmentation

    Authors: Subhajit Paul, Ashutosh Gupta

    Abstract: Clouds in optical satellite images are a major concern since their presence hinders the ability to carry accurate analysis as well as processing. Presence of clouds also affects the image tasking schedule and results in wastage of valuable storage space on ground as well as space-based systems. Due to these reasons, deriving accurate cloud masks from optical remote-sensing images is an important t… ▽ More

    Submitted 1 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 14 pages, 11 figures, 7 tables

  19. arXiv:2311.16490  [pdf, other

    eess.IV cs.CV cs.LG

    SIRAN: Sinkhorn Distance Regularized Adversarial Network for DEM Super-resolution using Discriminative Spatial Self-attention

    Authors: Subhajit Paul, Ashutosh Gupta

    Abstract: Digital Elevation Model (DEM) is an essential aspect in the remote sensing domain to analyze and explore different applications related to surface elevation information. In this study, we intend to address the generation of high-resolution DEMs using high-resolution multi-spectral (MX) satellite imagery by incorporating adversarial learning. To promptly regulate this process, we utilize the notion… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 15 pages, 14 figures

  20. arXiv:2311.00836  [pdf, ps, other

    math.OC eess.SP math.PR stat.CO

    Effective filtering approach for joint parameter-state estimation in SDEs via Rao-Blackwellization and modularization

    Authors: Zhou Fang, Ankit Gupta, Mustafa Khammash

    Abstract: Stochastic filtering is a vibrant area of research in both control theory and statistics, with broad applications in many scientific fields. Despite its extensive historical development, there still lacks an effective method for joint parameter-state estimation in SDEs. The state-of-the-art particle filtering methods suffer from either sample degeneracy or information loss, with both issues stemmi… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures

    MSC Class: 62M20; 62F15; 65C05; 92-08; 93E11

  21. arXiv:2310.12174  [pdf, other

    physics.soc-ph cs.CE eess.SY

    A Traffic Control Framework for Uncrewed Aircraft Systems

    Authors: Ananay Vikram Gupta, Aaditya Prakash Kattekola, Ansh Vikram Gupta, Dacharla Venkata Abhiram, Kamesh Namuduri, Ravichandran Subramanian

    Abstract: The exponential growth of Advanced Air Mobility (AAM) services demands assurances of safety in the airspace. This research a Traffic Control Framework (TCF) for develo** digital flight rules for Uncrewed Aircraft System (UAS) flying in designated air corridors. The proposed TCF helps model, deploy, and test UAS control, agents, regardless of their hardware configurations. This paper investigates… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 6 pages, 7 figures

  22. arXiv:2309.13475  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    Detecting and Mitigating System-Level Anomalies of Vision-Based Controllers

    Authors: Aryaman Gupta, Kaustav Chakraborty, Somil Bansal

    Abstract: Autonomous systems, such as self-driving cars and drones, have made significant strides in recent years by leveraging visual inputs and machine learning for decision-making and control. Despite their impressive performance, these vision-based controllers can make erroneous predictions when faced with novel or out-of-distribution inputs. Such errors can cascade to catastrophic system failures and c… ▽ More

    Submitted 8 April, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

  23. Data-Driven Computation of Robust Invariant Sets and Gain-Scheduled Controllers for Linear Parameter-Varying Systems

    Authors: Manas Mejari, Ankit Gupta, Dario Piga

    Abstract: We present a direct data-driven approach to synthesize robust control invariant (RCI) sets and their associated gain-scheduled feedback control laws for linear parameter-varying (LPV) systems subjected to bounded disturbances. A data-set consisting of a single state-input-scheduling trajectory is gathered from the system, which is directly utilized to compute polytopic RCI set and controllers by s… ▽ More

    Submitted 3 November, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures. Accepted for publication, IEEE Control System Letters (LCSS) 2023

  24. arXiv:2308.10840  [pdf

    q-bio.NC eess.SP

    Deep Learning Architecture for Motor Imaged Words

    Authors: Vimal W, Akshansh Gupta

    Abstract: The notion of a Brain-Computer Interface system is the acquisition of signals from the brain, processing them, and translating them into commands. The study concentrated on a specific sort of brain signal known as Motor Imagery EEG signals, which are activated in the brain without any external stimulus of the needed motor activities in relation to the signal. The signals are further processed usin… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  25. arXiv:2308.08713  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Decoding Emotions: A comprehensive Multilingual Study of Speech Models for Speech Emotion Recognition

    Authors: Anant Singh, Akshat Gupta

    Abstract: Recent advancements in transformer-based speech representation models have greatly transformed speech processing. However, there has been limited research conducted on evaluating these models for speech emotion recognition (SER) across multiple languages and examining their internal representations. This article addresses these gaps by presenting a comprehensive benchmark for SER with eight speech… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  26. arXiv:2308.05864  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    The Multi-modality Cell Segmentation Challenge: Towards Universal Solutions

    Authors: Jun Ma, Ronald Xie, Shamini Ayyadhury, Cheng Ge, Anubha Gupta, Ritu Gupta, Song Gu, Yao Zhang, Gihun Lee, Joonkee Kim, Wei Lou, Haofeng Li, Eric Upschulte, Timo Dickscheid, José Guilherme de Almeida, Yixin Wang, Lin Han, Xin Yang, Marco Labagnara, Vojislav Gligorovski, Maxime Scheder, Sahand Jamal Rahi, Carly Kempster, Alice Pollitt, Leon Espinosa , et al. (15 additional authors not shown)

    Abstract: Cell segmentation is a critical step for quantitative single-cell analysis in microscopy images. Existing cell segmentation methods are often tailored to specific modalities or require manual interventions to specify hyper-parameters in different experimental settings. Here, we present a multi-modality cell segmentation benchmark, comprising over 1500 labeled images derived from more than 50 diver… ▽ More

    Submitted 1 April, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: NeurIPS22 Cell Segmentation Challenge: https://neurips22-cellseg.grand-challenge.org/ . Nature Methods (2024)

  27. arXiv:2308.05122  [pdf, other

    q-bio.QM cs.CV cs.LG eess.IV

    Copy Number Variation Informs fMRI-based Prediction of Autism Spectrum Disorder

    Authors: Nicha C. Dvornek, Catherine Sullivan, James S. Duncan, Abha R. Gupta

    Abstract: The multifactorial etiology of autism spectrum disorder (ASD) suggests that its study would benefit greatly from multimodal approaches that combine data from widely varying platforms, e.g., neuroimaging, genetics, and clinical characterization. Prior neuroimaging-genetic analyses often apply naive feature concatenation approaches in data-driven work or use the findings from one modality to guide p… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by Machine Learning in Clinical Neuroimaging 2023 (MICCAI workshop), preprint version

  28. arXiv:2307.16462  [pdf, other

    eess.IV cs.CV

    A hybrid approach for improving U-Net variants in medical image segmentation

    Authors: Aitik Gupta, Dr. Joydip Dhar

    Abstract: Medical image segmentation is vital to the area of medical imaging because it enables professionals to more accurately examine and understand the information offered by different imaging modalities. The technique of splitting a medical image into various segments or regions of interest is known as medical image segmentation. The segmented images that are produced can be used for many different thi… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 9 pages, 11 figures

  29. arXiv:2306.14079  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

    Authors: H. J. Terry Suh, Glen Chou, Hongkai Dai, Lujie Yang, Abhishek Gupta, Russ Tedrake

    Abstract: Gradient-based methods enable efficient search capabilities in high dimensions. However, in order to apply them effectively in offline optimization paradigms such as offline Reinforcement Learning (RL) or Imitation Learning (IL), we require a more careful consideration of how uncertainty estimation interplays with first-order methods that attempt to minimize them. We study smoothed distance to dat… ▽ More

    Submitted 16 October, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: Glen Chou, Hongkai Dai, and Lujie Yang contributed equally to this work. Accepted to CoRL 2023

  30. arXiv:2306.01652  [pdf, other

    cs.IT eess.SP

    On the Coverage of Cognitive mmWave Networks with Directional Sensing and Communication

    Authors: Shuchi Tripathi, Abhishek K. Gupta, SaiDhiraj Amuru

    Abstract: Millimeter-waves' propagation characteristics create prospects for spatial and temporal spectrum sharing in a variety of contexts, including cognitive spectrum sharing (CSS). However, CSS along with omnidirectional sensing, is not efficient at mmWave frequencies due to their directional nature of transmission, as this limits secondary networks' ability to access the spectrum. This inspired us to c… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 30 pages, 12 figures

  31. Inter Subject Emotion Recognition Using Spatio-Temporal Features From EEG Signal

    Authors: Mohammad Asif, Diya Srivastava, Aditya Gupta, Uma Shanker Tiwary

    Abstract: Inter-subject or subject-independent emotion recognition has been a challenging task in affective computing. This work is about an easy-to-implement emotion recognition model that classifies emotions from EEG signals subject independently. It is based on the famous EEGNet architecture, which is used in EEG-related BCIs. We used the Dataset on Emotion using Naturalistic Stimuli (DENS) dataset. The… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

    Report number: 2023 27th International Computer Science and Engineering Conference (ICSEC)

  32. arXiv:2305.05479  [pdf, other

    cs.CR cs.DC eess.SP eess.SY

    Multiple-stop** time Sequential Detection for Energy Efficient Mining in Blockchain-Enabled IoT

    Authors: Anurag Gupta, Vikram Krishnamurthy

    Abstract: What are the optimal times for an Internet of Things (IoT) device to act as a blockchain miner? The aim is to minimize the energy consumed by low-power IoT devices that log their data into a secure (tamper-proof) distributed ledger. We formulate a multiple stop** time Bayesian sequential detection problem to address energy-efficient blockchain mining for IoT devices. The objective is to identify… ▽ More

    Submitted 17 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  33. arXiv:2305.03312  [pdf, other

    cs.RO eess.SY

    Experimental Validation of Safe MPC for Autonomous Driving in Uncertain Environments

    Authors: Ivo Batkovic, Ankit Gupta, Mario Zanon, Paolo Falcone

    Abstract: The full deployment of autonomous driving systems on a worldwide scale requires that the self-driving vehicle be operated in a provably safe manner, i.e., the vehicle must be able to avoid collisions in any possible traffic situation. In this paper, we propose a framework based on Model Predictive Control (MPC) that endows the self-driving vehicle with the necessary safety guarantees. In particula… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 16 pages, 20 figures, accepted to IEEE Transactions on Control Systems Technology

  34. arXiv:2305.00257  [pdf

    eess.IV cs.CV cs.LG

    Brain Tumor Segmentation from MRI Images using Deep Learning Techniques

    Authors: Ayan Gupta, Mayank Dixit, Vipul Kumar Mishra, Attulya Singh, Atul Dayal

    Abstract: A brain tumor, whether benign or malignant, can potentially be life threatening and requires painstaking efforts in order to identify the type, origin and location, let alone cure one. Manual segmentation by medical specialists can be time-consuming, which calls out for the involvement of technology to hasten the process with high accuracy. For the purpose of medical image segmentation, we inspect… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: 15 pages, 8 figures, 3 tables, 12th International Advanced Computing Conference

    Report number: 2155

  35. arXiv:2303.18154  [pdf, ps, other

    eess.SY

    Direct Data-Driven Computation of Polytopic Robust Control Invariant Sets and State-Feedback Controllers

    Authors: Manas Mejari, Ankit Gupta

    Abstract: This paper presents a direct data-driven approach for computing robust control invariant (RCI) sets and their associated state-feedback control laws for linear time-invariant systems affected by bounded disturbances. The proposed method utilizes a single state-input trajectory generated from the system, to compute a polytopic RCI set with a desired complexity and an invariance-inducing feedback co… ▽ More

    Submitted 2 October, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: 9 pages, 4 figures, accepted for publication, to appear at the 62nd IEEE Conference on Decision and Control (CDC 2023), Singapore

  36. arXiv:2303.11551  [pdf, other

    cs.CV cs.LG cs.SD eess.AS eess.IV

    ModEFormer: Modality-Preserving Embedding for Audio-Video Synchronization using Transformers

    Authors: Akash Gupta, Rohun Tripathi, Wondong Jang

    Abstract: Lack of audio-video synchronization is a common problem during television broadcasts and video conferencing, leading to an unsatisfactory viewing experience. A widely accepted paradigm is to create an error detection mechanism that identifies the cases when audio is leading or lagging. We propose ModEFormer, which independently extracts audio and video embeddings using modality-specific transforme… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Paper accepted at ICASSP 2023

  37. arXiv:2303.00727  [pdf, other

    eess.SY

    Challenges and Opportunities for Beyond-5G Wireless Security

    Authors: Eric Ruzomberka, David J. Love, Christopher G. Brinton, Arpit Gupta, Chih-Chun Wang, H. Vincent Poor

    Abstract: The demand for broadband wireless access is driving research and standardization of 5G and beyond-5G wireless systems. In this paper, we aim to identify emerging security challenges for these wireless systems and pose multiple research areas to address these challenges.

    Submitted 1 March, 2023; originally announced March 2023.

  38. arXiv:2302.14120  [pdf, other

    eess.AS cs.SD

    Diagonal State Space Augmented Transformers for Speech Recognition

    Authors: George Saon, Ankit Gupta, Xiaodong Cui

    Abstract: We improve on the popular conformer architecture by replacing the depthwise temporal convolutions with diagonal state space (DSS) models. DSS is a recently introduced variant of linear RNNs obtained by discretizing a linear dynamical system with a diagonal state transition matrix. DSS layers project the input sequence onto a space of orthogonal polynomials where the choice of basis functions, metr… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: to be presented at ICASSP 2023

  39. arXiv:2301.05859  [pdf, ps, other

    cs.RO eess.SY physics.class-ph

    Pendulum Actuated Spherical Robot: Dynamic Modeling & Analysis for Wobble & Precession

    Authors: Animesh Singhal, Sahil Modi, Abhishek Gupta, Leena Vachhani, Omkar A. Ghag

    Abstract: A spherical robot has many practical advantages as the entire electronics are protected within a hull and can be carried easily by any Unmanned Aerial Vehicle (UAV). However, its use is limited due to finding mounts for sensors. Pendulum actuated spherical robot provides space for mounting sensors at the yoke. We study the non-linear dynamics of a pendulum-actuated spherical robot to analyze the d… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: The paper has been accepted to the 22nd IFAC International Symposium on Automatic Control in Aerospace (ACA) 2022. It consists of 6 pages and 15 figures

  40. arXiv:2211.16373  [pdf, other

    eess.SP

    GreenMO: Virtualized User-proportionate MIMO

    Authors: Agrim Gupta, Sajjad Nassirpour, Manideep Dunna, Eamon Patamasing, Alireza Vahid, Dinesh Bharadia

    Abstract: With the turn of new decade, wireless communications face a major challenge on connecting many more new users and devices, at the same time being energy efficient and minimizing its carbon footprint. However, the current approaches to address the growing number of users and spectrum demands, like traditional fully digital architectures for Massive MIMO, demand exorbitant energy consumption. The re… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  41. arXiv:2211.08237  [pdf, other

    cs.SD cs.CL eess.AS

    Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search

    Authors: Zihan Wang, Qi Meng, HaiFeng Lan, XinRui Zhang, KeHao Guo, Akshat Gupta

    Abstract: Speech emotion recognition (SER) classifies audio into emotion categories such as Happy, Angry, Fear, Disgust and Neutral. While Speech Emotion Recognition (SER) is a common application for popular languages, it continues to be a problem for low-resourced languages, i.e., languages with no pretrained speech-to-text recognition models. This paper firstly proposes a language-specific model that extr… ▽ More

    Submitted 15 November, 2022; v1 submitted 31 October, 2022; originally announced November 2022.

  42. arXiv:2211.01731  [pdf

    eess.SP

    Data Converter Design Space Exploration for IoT Applications: An Overview of Challenges and Future Directions

    Authors: Buddhi Prakash Sharma, Anu Gupta, Chandra Shekhar

    Abstract: Human lives are improving with the widespread use of cutting-edge digital technology like the Internet of Things (IoT). Recently, the pandemic has shown the demand for more digitally advanced IoT-based devices. International Data Corporation (IDC) forecasts that by 2025, there will be approximately 42 billion of these devices in use, capable of producing around 80 ZB (zettabytes) of data. So data… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  43. arXiv:2210.13127  [pdf, other

    eess.AS cs.SD eess.SP

    A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids

    Authors: Abhijeet Bishnu, Ankit Gupta, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Amir Hussain, Mathini Sellathurai, Tharmalingam Ratnarajah

    Abstract: In this paper, we design a first of its kind transceiver (PHY layer) prototype for cloud-based audio-visual (AV) speech enhancement (SE) complying with high data rate and low latency requirements of future multimodal hearing assistive technology. The innovative design needs to meet multiple challenging constraints including up/down link communications, delay of transmission and signal processing,… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  44. arXiv:2210.06257  [pdf, other

    cs.CV cs.LG eess.IV

    What can we learn about a generated image corrupting its latent representation?

    Authors: Agnieszka Tomczak, Aarushi Gupta, Slobodan Ilic, Nassir Navab, Shadi Albarqouni

    Abstract: Generative adversarial networks (GANs) offer an effective solution to the image-to-image translation problem, thereby allowing for new possibilities in medical imaging. They can translate images from one imaging modality to another at a low cost. For unpaired datasets, they rely mostly on cycle loss. Despite its effectiveness in learning the underlying data distribution, it can lead to a discrepan… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  45. arXiv:2209.13553  [pdf, other

    eess.SP

    Source detection via multi-label classification

    Authors: Jayakrishnan Vijayamohanan, Arjun Gupta, Oameed Noakoasteen, Sotirios Goudos, Christos Christodoulou

    Abstract: Radio source detection through conventional algorithms has been unreliable when trying to solve for large number of sources in the presence of low SINR and less number of snapshots. We address this by reformulating source detection as a multi-class classification problem solved using deep learning frameworks. Incoming waveforms are sampled using a centrosymmetric linear array with omni-directional… ▽ More

    Submitted 1 February, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: 8 pages and 7 figures. This work has been submitted to the IEEE OJSP for possible publication

  46. arXiv:2209.12937  [pdf, ps, other

    math.OC eess.SY

    Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems with Markov Risk Measures

    Authors: Shi** Shao, Abhishek Gupta, William B. Haskell

    Abstract: We consider risk-sensitive Markov decision processes (MDPs), where the MDP model is influenced by a parameter which takes values in a compact metric space. We identify sufficient conditions under which small perturbations in the model parameters lead to small changes in the optimal value function and optimal policy. We further establish the robustness of the risk-sensitive optimal policies to mode… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 24 pages, submitted to SIAM Journal on Control and Optimization

  47. arXiv:2209.09217  [pdf, other

    cs.RO eess.SY

    WiForceSticker: Batteryless, Thin Sticker-like Flexible Force Sensor

    Authors: Agrim Gupta, Daegue Park, Shayaun Bashar, Cedric Girerd, Tania Morimoto, Dinesh Bharadia

    Abstract: Any two objects in contact with each other exert a force that could be simply due to gravity or mechanical contact, such as a robotic arm grip** an object or even the contact between two bones at our knee joints. The ability to naturally measure and monitor these contact forces allows a plethora of applications from warehouse management (detect faulty packages based on weights) to robotics (maki… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  48. Preemptive Scheduling of EV Charging for Providing Demand Response Services

    Authors: Shi** Shao, Farshad Harirchi, Devang Dave, Abhishek Gupta

    Abstract: We develop a new algorithm for scheduling the charging process of a large number of electric vehicles (EVs) over a finite horizon. We assume that EVs arrive at the charging stations with different charge levels and different flexibility windows. The arrival process is assumed to have a known distribution and that the charging process of EVs can be preemptive. We pose the scheduling problem as a dy… ▽ More

    Submitted 30 November, 2022; v1 submitted 20 August, 2022; originally announced August 2022.

    Comments: 21 pages, submitted to SEGAN

    Journal ref: Sustainable Energy, Grids and Networks, Volume 33, 2023

  49. arXiv:2207.09450  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Human-to-Robot Imitation in the Wild

    Authors: Shikhar Bahl, Abhinav Gupta, Deepak Pathak

    Abstract: We approach the problem of learning by watching humans in the wild. While traditional approaches in Imitation and Reinforcement Learning are promising for learning in the real world, they are either sample inefficient or are constrained to lab settings. Meanwhile, there has been a lot of success in processing passive, unstructured human data. We propose tackling this problem via an efficient one-s… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Published at RSS 2022. Demos at https://human2robot.github.io

  50. arXiv:2207.00908  [pdf, other

    cs.NI eess.SY

    Interference Constrained Beam Alignment for Time-Varying Channels via Kernelized Bandits

    Authors: Yuntian Deng, Xingyu Zhou, Arnob Ghosh, Abhishek Gupta, Ness B. Shroff

    Abstract: To fully utilize the abundant spectrum resources in millimeter wave (mmWave), Beam Alignment (BA) is necessary for large antenna arrays to achieve large array gains. In practical dynamic wireless environments, channel modeling is challenging due to time-varying and multipath effects. In this paper, we formulate the beam alignment problem as a non-stationary online learning problem with the objecti… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.