Showing 1–50 of 103 results for author: Venkatesh

Searching in archive eess. Search in all archives.
  1. arXiv:2406.14861  [pdf, other]

    eess.SY cs.ET

    Resilience of the Electric Grid through Trustable IoT-Coordinated Assets

    Authors: Vineet J. Nair, Venkatesh Venkataramanan, Priyank Srivastava, Partha S. Sarker, Anurag Srivastava, Laurentiu D. Marinovici, Jun Zha, Christopher Irwin, Prateek Mittal, John Williams, H. Vincent Poor, Anuradha M. Annaswamy

    Abstract: The electricity grid has evolved from a physical system to a cyber-physical system with digital devices that perform measurement, control, communication, computation, and actuation. The increased penetration of distributed energy resources (DERs) that include renewable generation, flexible loads, and storage provides extraordinary opportunities for improvements in efficiency and sustainability. Ho…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Submitted to the Proceedings of the National Academy of Sciences (PNAS), under review

  2. arXiv:2406.14799  [pdf, other]

    cs.RO eess.SY

    Capture Point Control in Thruster-Assisted Bipedal Locomotion

    Authors: Shreyansh Pitroda, Aditya Bondada, Kaushik Venkatesh Krishnamurthy, Adarsh Salagame, Chenghao Wang, Taoran Liu, Bibek Gupta, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

    Abstract: Despite major advancements in control design that are robust to unplanned disturbances, bipedal robots are still susceptible to falling over and struggle to negotiate rough terrains. By utilizing thrusters in our bipedal robot, we can perform additional posture manipulation and expand the modes of locomotion to enhance the robot's stability and ability to negotiate rough and difficult-to-navigate…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Submitted and to be presented at IEEE AIM 2024. arXiv admin note: substantial text overlap with arXiv:2103.15952

  3. arXiv:2406.13118  [pdf, other]

    cs.RO eess.SY

    Thruster-Assisted Incline Walking

    Authors: Kaushik Venkatesh Krishnamurthy, Chenghao Wang, Shreyansh Pitroda, Adarsh Salagame, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

    Abstract: In this study, our aim is to evaluate the effectiveness of thruster-assisted steep slope walking for the Husky Carbon, a quadrupedal robot equipped with custom-designed actuators and plural electric ducted fans, through simulation prior to conducting experimental trials. Thruster-assisted steep slope walking draws inspiration from wing-assisted incline running (WAIR) observed in birds, and intrigu…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 7 pages, 7 figures, submitted to CDC 2024 conference. arXiv admin note: text overlap with arXiv:2405.06070

  4. arXiv:2405.14405  [pdf, other]

    cs.CV eess.IV quant-ph

    Qubit-efficient Variational Quantum Algorithms for Image Segmentation

    Authors: Supreeth Mysore Venkatesh, Antonio Macaluso, Marlon Nuske, Matthias Klusch, Andreas Dengel

    Abstract: Quantum computing is expected to transform a range of computational tasks beyond the reach of classical algorithms. In this work, we examine the application of variational quantum algorithms (VQAs) for unsupervised image segmentation to partition images into separate semantic regions. Specifically, we formulate the task as a graph cut optimization problem and employ two established qubit-efficient…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 7 pages, 4 figures, 2 tables

  5. arXiv:2405.06070  [pdf, other]

    cs.RO eess.SY

    Narrow-Path, Dynamic Walking Using Integrated Posture Manipulation and Thrust Vectoring

    Authors: Kaushik Venkatesh Krishnamurthy, Chenghao Wang, Shreyansh Pitroda, Adarsh Salagame, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

    Abstract: This research concentrates on enhancing the navigational capabilities of Northeastern University's Husky, a multi-modal quadrupedal robot that can integrate posture manipulation and thrust vectoring, to traverse narrow pathways such as walking over pipes and slacklining. The Husky is outfitted with thrusters designed to stabilize its body during dynamic walking over these narrow paths. The…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.12586

  6. arXiv:2403.17730  [pdf, ps, other]

    math.OC eess.SY

    On Structural Non-commutativity in Affine Feedback of SISO Nonlinear Systems

    Authors: Venkatesh G. S.

    Abstract: The affine feedback connection of SISO nonlinear systems modeled by Chen--Fliess series is shown to be a group action on the plant which is isomorphic to the semi-direct product of shuffle and additive group of non-commutative formal power series. The additive and multiplicative feedback loops in an affine feedback connection are thus proven to be structurally non-commutative. A flip in the order…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: submitted to $26^{th}$ International Symposium on Mathematical Theory of Networks and Systems, 2024

  7. arXiv:2403.10495  [pdf, other]

    eess.SP

    PnP Restoration with Domain Adaptation for SANS

    Authors: Shirin Shoushtari, Edward P. Chandler, Jialiang Zhang, Manjula Senanayake, Sai Venkatesh Pingali, Marcus Foston, Ulugbek S. Kamilov

    Abstract: Small Angle Neutron Scattering (SANS) is a non-destructive technique utilized to probe the nano- to mesoscale structure of materials by analyzing the scattering pattern of neutrons. Accelerating SANS acquisition for in-situ analysis is essential, but it often reduces the signal-to-noise ratio (SNR), highlighting the need for methods to enhance SNR even with short acquisition times. While deep lear…

    Submitted 15 March, 2024; originally announced March 2024.

  8. arXiv:2403.02863  [pdf, other]

    cs.ET eess.IV physics.app-ph

    Spintronic Implementation of UNet for Image Segmentation

    Authors: Venkatesh Vadde, Bhaskaran Muralidharan, Abhishek Sharma

    Abstract: Image segmentation plays a crucial role in computer vision applications like self-driving cars, satellite imagery analysis, and medical diagnosis. Implementing these complex deep neural networks on conventional hardware is highly inefficient. In this work, we propose hardware implementation of UNet for segmentation tasks, using spintronic devices. Our approach involves designing hardware for convo…

    Submitted 5 March, 2024; originally announced March 2024.

  9. arXiv:2402.17701  [pdf, other]

    eess.AS cs.LG cs.SD

    Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet

    Authors: Satvik Venkatesh, Arthur Benilov, Philip Coleman, Frederic Roskam

    Abstract: There have been significant advances in deep learning for music demixing in recent years. However, there has been little attention given to how these neural networks can be adapted for real-time low-latency applications, which could be helpful for hearing aids, remixing audio streams and live shows. In this paper, we investigate the various challenges involved in adapting current demixing models i…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP 2024

    ACM Class: I.5.1; I.5.4

  10. arXiv:2402.10211  [pdf, other]

    cs.LG cs.RO eess.SP

    Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

    Authors: Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman, Carmel Majidi, Abhinav Gupta, Tess Hellebrekers, Lerrel Pinto

    Abstract: Reasoning from sequences of raw sensory data is a ubiquitous problem across fields ranging from medical devices to robotics. These problems often involve using long sequences of raw sensor data (e.g. magnetometers, piezoresistors) to predict sequences of desirable physical quantities (e.g. force, inertial measurements). While classical approaches are powerful for locally-linear prediction problems…

    Submitted 15 February, 2024; originally announced February 2024.

  11. arXiv:2402.09551  [pdf, ps, other]

    cs.IT eess.SP

    Zak-OTFS and LDPC Codes

    Authors: Beyza Dabak, Venkatesh Khammammetti, Saif Khan Mohammed, Robert Calderbank

    Abstract: Orthogonal Time Frequency Space (OTFS) is a framework for communications and active sensing that processes signals in the delay-Doppler (DD) domain. It is informed by 6G propagation environments, where Doppler spreads measured in kHz make it more and more difficult to estimate channels, and the standard model-dependent approach to wireless communication is starting to break down. We consider Zak-O…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 7 pages (double column), 6 figures, accepted at 2024 IEEE International Conference on Communications (ICC)

  12. arXiv:2402.08088  [pdf, other]

    cs.AI cs.LG eess.IV

    Out-of-Distribution Detection and Data Drift Monitoring using Statistical Process Control

    Authors: Ghada Zamzmi, Kesavan Venkatesh, Brandon Nelson, Smriti Prathapan, Paul H. Yi, Berkman Sahiner, Jana G. Delfino

    Abstract: Background: Machine learning (ML) methods often fail with data that deviates from their training distribution. This is a significant concern for ML-enabled devices in clinical settings, where data drift may cause unexpected performance that jeopardizes patient safety. Method: We propose an ML-enabled Statistical Process Control (SPC) framework for out-of-distribution (OOD) detection and drift mon…

    Submitted 12 February, 2024; originally announced February 2024.

  13. arXiv:2401.14717  [pdf, other]

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

    Authors: Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

    Abstract: We propose an approach for continuous prediction of turn-taking and backchanneling locations in spoken dialogue by fusing a neural acoustic model with a large language model (LLM). Experiments on the Switchboard human-human conversation dataset demonstrate that our approach consistently outperforms the baseline models with single modality. We also develop a novel multi-task instruction fine-tuning…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: To appear in IEEE ICASSP 2024

  14. Two-pass Endpoint Detection for Speech Recognition

    Authors: Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

    Abstract: Endpoint (EP) detection is a key component of far-field speech recognition systems that assist the user through voice commands. The endpoint detector has to trade-off between accuracy and latency, since waiting longer reduces the cases of users being cut-off early. We propose a novel two-pass solution for endpointing, where the utterance endpoint detected from a first pass endpointer is verified b…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: ASRU 2023

  15. arXiv:2401.05425  [pdf, other]

    eess.SP cs.LG

    An Unobtrusive and Lightweight Ear-worn System for Continuous Epileptic Seizure Detection

    Authors: Abdul Aziz, Nhat Pham, Neel Vora, Cody Reynolds, Jaime Lehnen, Pooja Venkatesh, Zhuoran Yao, Jay Harvey, Tam Vu, Kan Ding, Phuc Nguyen

    Abstract: Epilepsy is one of the most common neurological diseases globally, affecting around 50 million people worldwide. Fortunately, up to 70 percent of people with epilepsy could live seizure-free if properly diagnosed and treated, and a reliable technique to monitor the onset of seizures could improve the quality of life of patients who are constantly facing the fear of random seizure attacks. The scal…

    Submitted 1 January, 2024; originally announced January 2024.

  16. arXiv:2312.14378  [pdf, other]

    cs.LG cs.SD eess.AS

    Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

    Authors: Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu

    Abstract: Training large foundation models using self-supervised objectives on unlabeled data, followed by fine-tuning on downstream tasks, has emerged as a standard procedure. Unfortunately, the efficacy of this approach is often constrained by both limited fine-tuning compute and scarcity in labeled downstream data. We introduce Multimodal Attention Merging (MAM), an attempt that facilitates direct knowle…

    Submitted 9 February, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages, 1 figure, ICASSP 2024 Workshop on Self-supervision in Audio, Speech and Beyond

  17. arXiv:2312.08621  [pdf, other]

    cs.RO eess.SY

    Quadrupedal Locomotion Control On Inclined Surfaces Using Collocation Method

    Authors: Adarsh Salagame, Maria Gianello, Chenghao Wang, Kaushik Venkatesh, Shreyansh Pitroda, Rohit Rajput, Eric Sihite, Miriam Leeser, Alireza Ramezani

    Abstract: Inspired by Chukars' wing-assisted incline running (WAIR), in this work, we employ a high-fidelity model of our Husky Carbon quadrupedal-legged robot to walk over steep slopes of up to 45 degrees. Chukars use the aerodynamic forces generated by their flapping wings to manipulate ground contact forces and traverse steep slopes and even overhangs. By exploiting the thrusters on Husky, we employed a c…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.00179

  18. arXiv:2312.05352  [pdf, other]

    cs.CV cs.LG eess.IV

    A Review of Machine Learning Methods Applied to Video Analysis Systems

    Authors: Marios S. Pattichis, Venkatesh Jatla, Alvaro E. Ullao Cerna

    Abstract: The paper provides a survey of the development of machine-learning techniques for video analysis. The survey provides a summary of the most popular deep learning methods used for human activity recognition. We discuss how popular architectures perform on standard datasets and highlight the differences from real-life datasets dominated by multiple activities performed by multiple participants over…

    Submitted 8 December, 2023; originally announced December 2023.

  19. arXiv:2311.14878  [pdf, other]

    cs.RO eess.SY

    How Strong a Kick Should be to Topple Northeastern's Tumbling Robot?

    Authors: Adarsh Salagame, Neha Bhattachan, Andre Caetano, Ian McCarthy, Henry Noyes, Brandon Petersen, Alexander Qiu, Matthew Schroeter, Nolan Smithwick, Konrad Sroka, Jason Widjaja, Yash Bohra, Kaushik Venkatesh, Kruthika Gangaraju, Paul Ghanem, Ioannis Mandralis, Eric Sihite, Arash Kalantari, Alireza Ramezani

    Abstract: Rough terrain locomotion has remained one of the most challenging mobility questions. In 2022, NASA's Innovative Advanced Concepts (NIAC) Program invited US academic institutions to participate in NASA's Breakthrough, Innovative & Game-changing (BIG) Idea competition by proposing novel mobility systems that can negotiate extremely rough terrain, such as bumpy lunar craters. In this competition, Northeastern…

    Submitted 24 November, 2023; originally announced November 2023.

  20. arXiv:2311.10149  [pdf, other]

    eess.AS

    Improving fairness for spoken language understanding in atypical speech with Text-to-Speech

    Authors: Helin Wang, Venkatesh Ravichandran, Milind Rao, Becky Lammers, Myra Sydnor, Nicholas Maragakis, Ankur A. Butala, Jayne Zhang, Lora Clawson, Victoria Chovaz, Laureano Moro-Velazquez

    Abstract: Spoken language understanding (SLU) systems often exhibit suboptimal performance in processing atypical speech, typically caused by neurological conditions and motor impairments. Recent advancements in Text-to-Speech (TTS) synthesis-based augmentation for more fair SLU have struggled to accurately capture the unique vocal characteristics of atypical speakers, largely due to insufficient data. To a…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted at SyntheticData4ML 2023 Oral

  21. arXiv:2311.03666  [pdf, other]

    math.OC eess.SY

    Stochastic Control with Distributionally Robust Constraints for Cyber-Physical Systems Vulnerable to Attacks

    Authors: Nishanth Venkatesh, Aditya Dave, Ioannis Faros, Andreas A. Malikopoulos

    Abstract: In this paper, we investigate the control of a cyber-physical system (CPS) while accounting for its vulnerability to external attacks. We formulate a constrained stochastic problem with a robust constraint to ensure robust operation against potential attacks. We seek to minimize the expected cost subject to a constraint limiting the worst-case expected damage an attacker can impose on the CPS. We…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 Figures with 3 sub-figures each, submitted to the ECC 2024 conference for review

  22. arXiv:2309.11891  [pdf, other]

    eess.IV cs.CV

    Heart Rate Detection Using an Event Camera

    Authors: Aniket Jagtap, RamaKrishna Venkatesh Saripalli, Joe Lemley, Waseem Shariff, Alan F. Smeaton

    Abstract: Event cameras, also known as neuromorphic cameras, are an emerging technology that offers advantages over traditional shutter and frame-based cameras, including high temporal resolution, low power consumption, and selective data acquisition. In this study, we propose to harness the capabilities of event-based cameras to capture subtle changes in the surface of the skin caused by the pulsatile flo…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Dataset available at https://doi.org/10.6084/m9.figshare.24039501.v1

  23. arXiv:2308.06069  [pdf, other]

    cs.SE cs.LG cs.LO eess.SY

    Safeguarding Learning-based Control for Smart Energy Systems with Sampling Specifications

    Authors: Chih-Hong Cheng, Venkatesh Prasad Venkataramanan, Pragya Kirti Gupta, Yun-Fei Hsu, Simon Burton

    Abstract: We study challenges using reinforcement learning in controlling energy systems, where apart from performance requirements, one has additional safety requirements such as avoiding blackouts. We detail how these safety requirements in real-time temporal logic can be strengthened via discretization into linear temporal logic (LTL), such that the satisfaction of the LTL formulae implies the satisfacti…

    Submitted 11 August, 2023; originally announced August 2023.

  24. arXiv:2308.00183  [pdf, other]

    cs.RO eess.SY

    Hovering Control of Flapping Wings in Tandem with Multi-Rotors

    Authors: Aniket Dhole, Bibek Gupta, Adarsh Salagame, Xuejian Niu, Yizhe Xu, Kaushik Venkatesh, Paul Ghanem, Ioannis Mandralis, Eric Sihite, Alireza Ramezani

    Abstract: This work briefly covers our efforts to stabilize the flight dynamics of Northeastern's tailless bat-inspired micro aerial vehicle, Aerobat. Flapping robots are not new. A plethora of examples is mainly dominated by insect-style design paradigms that are passively stable. However, Aerobat, in addition to being tailless, possesses morphing wings that add to the inherent complexity of flight contro…

    Submitted 31 July, 2023; originally announced August 2023.

  25. arXiv:2306.01570  [pdf]

    cs.LG eess.SY math.OC

    Spatio-Temporal Deep Learning-Assisted Reduced Security-Constrained Unit Commitment

    Authors: Arun Venkatesh Ramesh, Xingpeng Li

    Abstract: Security-constrained unit commitment (SCUC) is a computationally complex process utilized in power system day-ahead scheduling and market clearing. SCUC is run daily and requires state-of-the-art algorithms to speed up the process. The constraints and data associated with SCUC are both geographically and temporally correlated to ensure the reliability of the solution, which further increases the c…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 8 Figures, 5 Tables, 1 Algorithm

  26. arXiv:2305.08113  [pdf, ps, other]

    eess.IV

    Modelling Quasi-Orthographic Captures for Surface Imaging

    Authors: Maniratnam Mandal, Venkatesh K. Subramanian

    Abstract: Surveillance and surveying are two important applications of empirical research. A major part of terrain modelling is supported by photographic surveys which are used for capturing expansive natural surfaces using a wide range of sensors -- visual, infrared, ultrasonic, radio, etc. A natural surface is non-smooth, unpredictable and fast-varying, and it is difficult to capture all features and reco…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.08121

  27. arXiv:2305.00334  [pdf, other]

    eess.IV eess.SP

    Maximum Likelihood based Phase-Retrieval using Fresnel Propagation Forward Models with Optional Constraints

    Authors: K. Aditya Mohan, Jean-Baptiste Forien, Venkatesh Sridhar, Jefferson A. Cuadra, Dilworth Parkinson

    Abstract: X-ray phase-contrast tomography (XPCT) is widely used for high contrast 3D imaging using either synchrotron or laboratory microfocus X-ray sources. XPCT enables an order of magnitude improvement in image contrast of the reconstructed material interfaces with low X-ray absorption contrast. The dominant approaches to 3D reconstruction using XPCT rely on the use of phase-retrieval algorithms that m…

    Submitted 2 October, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  28. Hardware-Impaired Rician-Faded Cell-Free Massive MIMO Systems With Channel Aging

    Authors: Venkatesh Tentu, Dheeraj N Amudala, Anish Chattopadhyay, Rohit Budhiraja

    Abstract: We study the impact of channel aging on the uplink of a cell-free (CF) massive multiple-input multiple-output (mMIMO) system by considering i) spatially-correlated Rician-faded channels; ii) hardware impairments at the access points and user equipments (UEs); and iii) two-layer large-scale fading decoding (LSFD). We first derive a closed-form spectral efficiency (SE) expression for this system, an…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: This work has been submitted to the IEEE Transactions on Communications for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible, 32 pages, 14 figures

  29. arXiv:2304.00397  [pdf, other]

    cs.LG cs.AI eess.SY

    Connected and Automated Vehicles in Mixed-Traffic: Learning Human Driver Behavior for Effective On-Ramp Merging

    Authors: Nishanth Venkatesh, Viet-Anh Le, Aditya Dave, Andreas A. Malikopoulos

    Abstract: Highway merging scenarios featuring mixed traffic conditions pose significant modeling and control challenges for connected and automated vehicles (CAVs) interacting with incoming on-ramp human-driven vehicles (HDVs). In this paper, we present an approach to learn an approximate information state model of CAV-HDV interactions for a CAV to maneuver safely during highway merging. In our approach, th…

    Submitted 1 April, 2023; originally announced April 2023.

  30. arXiv:2303.16321  [pdf, other]

    math.OC cs.AI eess.SY

    Worst-Case Control and Learning Using Partial Observations Over an Infinite Time-Horizon

    Authors: Aditya Dave, Ioannis Faros, Nishanth Venkatesh, Andreas A. Malikopoulos

    Abstract: Safety-critical cyber-physical systems require control strategies whose worst-case performance is robust against adversarial disturbances and modeling uncertainties. In this paper, we present a framework for approximate control and learning in partially observed systems to minimize the worst-case discounted cost over an infinite time horizon. We model disturbances to the system as finite-valued un…

    Submitted 31 March, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  31. arXiv:2303.15528  [pdf, other]

    cs.CV eess.IV

    Few-Shot Domain Adaptation for Low Light RAW Image Enhancement

    Authors: K. Ram Prabhakar, Vishal Vinod, Nihar Ranjan Sahoo, R. Venkatesh Babu

    Abstract: Enhancing practical low light raw images is a difficult task due to severe noise and color distortions from short exposure time and limited illumination. Despite the success of existing Convolutional Neural Network (CNN) based methods, their performance is not adaptable to different camera domains. In addition, such methods also require large datasets with short-exposure and corresponding long-exp…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: BMVC 2021 Best Student Paper Award (Runner-Up). Project Page: https://val.cds.iisc.ac.in/HDR/BMVC21/index.html

    Journal ref: 32nd British Machine Vision Conference 2021, BMVC 2021, 327

  32. arXiv:2303.15132  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Cross-utterance ASR Rescoring with Graph-based Label Propagation

    Authors: Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran

    Abstract: We propose a novel approach for ASR N-best hypothesis rescoring with graph-based label propagation by leveraging cross-utterance acoustic similarity. In contrast to conventional neural language model (LM) based ASR rescoring/reranking models, our approach focuses on acoustic information and conducts the rescoring collaboratively among utterances, instead of individually. Experiments on the VCTK da…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: To appear in IEEE ICASSP 2023

    Journal ref: Proc. IEEE ICASSP, June 2023

  33. Adaptive Endpointing with Deep Contextual Multi-armed Bandits

    Authors: Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh

    Abstract: Current endpointing (EP) solutions learn in a supervised framework, which does not allow the model to incorporate feedback and improve in an online setting. Also, it is a common practice to utilize costly grid-search to find the best configuration for an endpointing model. In this paper, we aim to provide a solution for adaptive endpointing by proposing an efficient method for choosing an optimal…

    Submitted 23 March, 2023; originally announced March 2023.

    Journal ref: Proc. IEEE ICASSP, June 2023

  34. arXiv:2302.13494  [pdf, other]

    eess.IV

    X-ray Spectral Estimation using Dictionary Learning

    Authors: Wenrui Li, Venkatesh Sridhar, K. Aditya Mohan, Saransh Singh, Jean-Baptiste Forien, Xin Liu, Gregery T. Buzzard, Charles A. Bouman

    Abstract: As computational tools for X-ray computed tomography (CT) become more quantitatively accurate, knowledge of the source-detector spectral response is critical for quantitative system-independent reconstruction and material characterization capabilities. Directly measuring the spectral response of a CT system is hard, which motivates spectral estimation using transmission data obtained from a collec…

    Submitted 26 February, 2023; originally announced February 2023.

    Comments: Document Release Number: LLNL-CONF-845171 Submitted to 2023 ICIP conference

  35. LSFD for Rician-Faded Cell-Free mMIMO Systems With Channel Aging and Hardware Impairments

    Authors: Anish Chattopadhyay, Venkatesh Tentu, Dheeraj Naidu Amudala, Rohit Budhiraja

    Abstract: We study the impact of channel aging on the uplink of a cell-free massive multiple-input multiple-output system with hardware impairments. We consider a dynamic analog-to-digital converter architecture at the access points (APs), and low-resolution digital-to-analog converters at the user equipments (UEs). We derive a closed-form spectral efficiency expression by considering i) practical spatially…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: This paper is accepted for presentation in 2023 IEEE International Conference on Communications (ICC): Wireless Communications Symposium (IEEE ICC'23 - WC Symposium), 6 pages and 4 figures

    Journal ref: ICC 2023 - IEEE International Conference on Communications, 28 May 2023 - 01 June 2023

  36. arXiv:2301.05089  [pdf, other]

    eess.SY cs.AI math.OC

    Approximate Information States for Worst-Case Control and Learning in Uncertain Systems

    Authors: Aditya Dave, Nishanth Venkatesh, Andreas A. Malikopoulos

    Abstract: In this paper, we investigate discrete-time decision-making problems in uncertain systems with partially observed states. We consider a non-stochastic model, where uncontrolled disturbances acting on the system take values in bounded sets with unknown distributions. We present a general framework for decision-making in such problems by using the notion of the information state and approximate info…

    Submitted 5 April, 2024; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Preliminary results related to this article were reported in arXiv:2203.15271

  37. arXiv:2211.16276  [pdf, ps, other]

    cs.IT eess.SP

    Hardware-Aware Pilot Decontamination Precoding for Multi-cell mMIMO Systems With Rician Fading

    Authors: Harshit Kesarwani, Dheeraj Naidu Amudala, Venkatesh Tentu, Rohit Budhiraja

    Abstract: We consider a hardware-impaired multi-cell Rician faded massive multi-input multi-output (mMIMO) system with two-layer pilot decontamination precoding, also known as large-scale fading precoding (LSFP). Each BS is equipped with a flexible dynamic analog-to-digital converter (ADC)/digital-to-analog converter (DAC) architecture and the user equipments (UEs) have low-resolution ADCs. Further, both BS…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: This paper is accepted for presentation in 2022 IEEE Global Communications Conference: Wireless Communications (Globecom 2022 WC), 7 pages and 4 figures

  38. arXiv:2211.09731  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

    Authors: Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran

    Abstract: Stuttering is a speech disorder where the natural flow of speech is interrupted by blocks, repetitions or prolongations of syllables, words and phrases. The majority of existing automatic speech recognition (ASR) interfaces perform poorly on utterances with stutter, mainly due to lack of matched training data. Synthesis of speech with stutter thus presents an opportunity to improve ASR for this ty…

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 8 pages, 3 figures, 2 tables

    Journal ref: NeurIPS Workshop on SyntheticData4ML, December 2022

  39. arXiv:2210.07654  [pdf, other]

    cs.CV cs.LG eess.IV

    Towards Transformer-based Homogenization of Satellite Imagery for Landsat-8 and Sentinel-2

    Authors: Venkatesh Thirugnana Sambandham, Konstantin Kirchheim, Sayan Mukhopadhaya, Frank Ortmeier

    Abstract: Landsat-8 (NASA) and Sentinel-2 (ESA) are two prominent multi-spectral imaging satellite projects that provide publicly available data. The multi-spectral imaging sensors of the satellites capture images of the earth's surface in the visible and infrared region of the electromagnetic spectrum. Since the majority of the earth's surface is constantly covered with clouds, which are not transparent at…

    Submitted 14 October, 2022; originally announced October 2022.

    Journal ref: ESST2022: Transformers Workshop for Environmental Science

  40. arXiv:2209.04445  [pdf]

    eess.IV cs.CR cs.CV

    Privacy-Preserving Deep Learning Model for Covid-19 Disease Detection

    Authors: Vijay Srinivas Tida, Sai Venkatesh Chilukoti, Sonya Hsu, Xiali Hei

    Abstract: Recent studies demonstrated that X-ray radiography showed higher accuracy than Polymerase Chain Reaction (PCR) testing for COVID-19 detection. Therefore, applying deep learning models to X-rays and radiography images increases the speed and accuracy of determining COVID-19 cases. However, due to Health Insurance Portability and Accountability Act (HIPAA) compliance, the hospitals were unwilling to sha…

    Submitted 9 October, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

  41. arXiv:2208.09932  [pdf, other]

    cs.CV cs.LG eess.IV

    Improving GANs for Long-Tailed Data through Group Spectral Regularization

    Authors: Harsh Rangwani, Naman Jaswani, Tejan Karmali, Varun Jampani, R. Venkatesh Babu

    Abstract: Deep long-tailed learning aims to train useful deep networks on practical, real-world imbalanced distributions, wherein most labels of the tail classes are associated with a few samples. There has been a large body of work to train discriminative models for visual recognition on long-tailed distribution. In contrast, we aim to train conditional Generative Adversarial Networks, a class of image gen…

    Submitted 21 August, 2022; originally announced August 2022.

    Comments: ECCV 2022. Project Page: https://sites.google.com/view/gsr-eccv22

  42. arXiv:2208.06742  [pdf]

    eess.SY cs.LG

    Feasibility Layer Aided Machine Learning Approach for Day-Ahead Operations

    Authors: Arun Venkatesh Ramesh, Xingpeng Li

    Abstract: Day-ahead operations involve a complex and computationally intensive optimization process to determine the generator commitment schedule and dispatch. The optimization process is a mixed-integer linear program (MILP) also known as security-constrained unit commitment (SCUC). Independent system operators (ISOs) run SCUC daily and require state-of-the-art algorithms to speed up the process. Existin…

    Submitted 13 August, 2022; originally announced August 2022.

    Comments: 10 pages, 9 figures, 8 tables

  43. arXiv:2207.12254  [pdf, other]

    cs.RO eess.SY

    A Letter on Progress Made on Husky Carbon: A Legged-Aerial, Multi-modal Platform

    Authors: Adarsh Salagame, Shoghair Manjikian, Chenghao Wang, Kaushik Venkatesh Krishnamurthy, Shreyansh Pitroda, Bibek Gupta, Tobias Jacob, Benjamin Mottis, Eric Sihite, Milad Ramezani, Alireza Ramezani

    Abstract: Animals, such as birds, widely use multi-modal locomotion by combining legged and aerial mobility with dominant inertial effects. The robotic biomimicry of this multi-modal locomotion feat can yield ultra-flexible systems in terms of their ability to negotiate their task spaces. The main objective of this paper is to discuss the challenges in achieving multi-modal locomotion, and to report our pro…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2104.05834, arXiv:2205.06392

  44. arXiv:2207.10070  [pdf, other]

    astro-ph.SR astro-ph.IM eess.IV

    Automatic Segmentation of Coronal Holes in Solar Images and Solar Prediction Map Classification

    Authors: Venkatesh Jatla

    Abstract: Solar image analysis relies on the detection of coronal holes for predicting disruptions to earth's magnetic field. The coronal holes act as sources of solar wind that can reach the earth. Thus, coronal holes are used in physical models for predicting the evolution of solar wind and its potential for interfering with the earth's magnetic field. Due to inherent uncertainties in the physical models,…

    Submitted 20 July, 2022; originally announced July 2022.

  45. arXiv:2207.04081  [pdf]

    eess.AS cs.CL cs.LG cs.SD eess.IV

    Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification

    Authors: Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke

    Abstract: Speaker identification (SID) in the household scenario (e.g., for smart speakers) is an important but challenging problem due to the limited number of labeled (enrollment) utterances, confusable voices, and demographic imbalances. Conventional speaker recognition systems generalize from a large random sample of speakers, causing the recognition to underperform for households drawn from specific cohort…

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: To appear in Interspeech 2022. arXiv admin note: text overlap with arXiv:2106.08207

    Journal ref: Proc. Interspeech, Sept. 2022, pp. 4805-4809

  46. arXiv:2206.03543  [pdf, other]

    eess.SY

    CPES-QSM: A Quantitative Method Towards the Secure Operation of Cyber-Physical Energy Systems

    Authors: Juan Ospina, Venkatesh Venkataramanan, Charalambos Konstantinou

    Abstract: Power systems are evolving into cyber-physical energy systems (CPES) due to the integration of modern communication and Internet-of-Things (IoT) devices. CPES security evaluation is challenging since the physical and cyber layers are often not considered holistically. Existing literature focuses on only optimizing the operation of either the physical or cyber layer while ignoring the interactions…

    Submitted 26 September, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

  47. An Improved Adaptive SMO for Speed Estimation of Sensorless DSFOC Induction Motor Drives and Stability Analysis using Lyapunov Theorem at Low Frequencies

    Authors: Appalabathula Venkatesh

    Abstract: In this paper, an Improved Adaptive Sliding Mode Observer (ASMO) is proposed for Sensorless DSFOC Induction Motor Drives and their stability is analyzed. ASMO is used to estimate the Rotor Speed, Rotor Resistance, Flux, Stator and Rotor currents and the developed electromagnetic Torques. To improve the robustness and accuracy of an adaptive SMO during very low frequency operation, the sliding mode…

    Submitted 18 May, 2022; originally announced May 2022.

  48. arXiv:2204.04604  [pdf, other]

    cs.IT cs.NI eess.SP

    A High Capacity Preamble Sequence for Random Access in Beyond 5G Networks: Design and Analysis

    Authors: Sagar Pawar, Lokesh Bommisetty, T. G. Venkatesh

    Abstract: The widely used Zadoff-Chu sequence (ZC sequence) for random access preamble in 5G has limitations in terms of the total number of preambles generated, forcing the reuse of preambles. Hence, the probability of collision of preambles of UEs increases, resulting in the failure of the random access procedure. To truly qualify beyond 5G networks as green technology, the preamble capacity should be increase…

    Submitted 10 April, 2022; originally announced April 2022.

  49. arXiv:2204.02090  [pdf, other]

    cs.CV cs.IR cs.SD eess.AS

    VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices

    Authors: Venkatesh S. Kadandale, Juan F. Montesinos, Gloria Haro

    Abstract: In this paper, we address the problem of lip-voice synchronisation in videos containing human face and voice. Our approach is based on determining if the lips motion and the voice in a video are synchronised or not, depending on their audio-visual correspondence score. We propose an audio-visual cross-modal transformer-based model that outperforms several baseline models in the audio-visual synchr…

    Submitted 30 June, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Paper accepted to Interspeech 2022; Project Page: https://ipcv.github.io/VocaLiST/

  50. arXiv:2203.04099  [pdf, other]

    cs.SD cs.CV cs.LG eess.AS

    VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer

    Authors: Juan F. Montesinos, Venkatesh S. Kadandale, Gloria Haro

    Abstract: This paper presents an audio-visual approach for voice separation which produces state-of-the-art results at a low latency in two scenarios: speech and singing voice. The model is based on a two-stage network. Motion cues are obtained with a lightweight graph convolutional network that processes face landmarks. Then, both audio and motion features are fed to an audio-visual transformer which produ…

    Submitted 19 July, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted to ECCV 2022