Skip to main content

Showing 1–22 of 22 results for author: Narayanan, S S

.
  1. arXiv:2406.10318  [pdf, other

    cs.CV cs.AI

    Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

    Authors: Tuo Zhang, Tiantian Feng, Yibin Ni, Mengqin Cao, Ruying Liu, Katharine Butler, Yanjun Weng, Mi Zhang, Shrikanth S. Narayanan, Salman Avestimehr

    Abstract: Large vision-language models (VLMs) have demonstrated remarkable abilities in understanding everyday content. However, their performance in the domain of art, particularly culturally rich art forms, remains less explored. As a pearl of human wisdom and creativity, art encapsulates complex cultural narratives and symbolism. In this paper, we offer the Pun Rebus Art Dataset, a multimodal dataset for… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2404.17983  [pdf, other

    cs.SD cs.CL eess.AS

    TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality

    Authors: Tiantian Feng, Xuan Shi, Rahul Gupta, Shrikanth S. Narayanan

    Abstract: Automatic Speech Understanding (ASU) aims at human-like speech interpretation, providing nuanced intent, emotion, sentiment, and content understanding from speech and language (text) content conveyed in speech. Typically, training a robust ASU model relies heavily on acquiring large-scale, high-quality speech and associated transcriptions. However, it is often challenging to collect or use speech… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  3. arXiv:2404.09215  [pdf, other

    eess.SP math.OC

    Optimum Beamforming and Grating Lobe Mitigation for Intelligent Reflecting Surfaces

    Authors: Sai Sanjay Narayanan, Uday K Khankhoje, Radha Krishna Ganti

    Abstract: Ensuring adequate wireless coverage in upcoming communication technologies such as 6G is expected to be challenging. This is because user demands of higher datarate require an increase in carrier frequencies, which in turn reduce the diffraction effects (and hence coverage) in complex multipath environments. Intelligent reflecting surfaces have been proposed as a way of restoring coverage by adapt… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 12 pages, 16 figures

  4. arXiv:2403.14464  [pdf, other

    eess.SY

    Synthesizing Controller for Safe Navigation using Control Density Function

    Authors: Joseph Moyalan, Sriram S. K. S Narayanan, Andrew Zheng, Umesh Vaidya

    Abstract: We consider the problem of navigating a nonlinear dynamical system from some initial set to some target set while avoiding collision with an unsafe set. We extend the concept of density function to control density function (CDF) for solving navigation problems with safety constraints. The occupancy-based interpretation of the measure associated with the density function is instrumental in imposing… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  5. arXiv:2403.14048  [pdf, ps, other

    cs.SD cs.CL eess.AS

    The NeurIPS 2023 Machine Learning for Audio Workshop: Affective Audio Benchmarks and Novel Data

    Authors: Alice Baird, Rachel Manzelli, Panagiotis Tzirakis, Chris Gagne, Haoqi Li, Sadie Allen, Sander Dieleman, Brian Kulis, Shrikanth S. Narayanan, Alan Cowen

    Abstract: The NeurIPS 2023 Machine Learning for Audio Workshop brings together machine learning (ML) experts from various audio domains. There are several valuable audio-driven ML tasks, from speech emotion recognition to audio event detection, but the community is sparse compared to other ML areas, e.g., computer vision or natural language processing. A major limitation with audio is the available data; wi… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  6. arXiv:2312.09173  [pdf, other

    cs.RO

    Safe Motion Planning for Quadruped Robots Using Density Functions

    Authors: Sriram S. K. S Narayanan, Andrew Zheng, Umesh Vaidya

    Abstract: This paper presents a motion planning algorithm for quadruped locomotion based on density functions. We decompose the locomotion problem into a high-level density planner and a model predictive controller (MPC). Due to density functions having a physical interpretation through the notion of occupancy, it is intuitive to represent the environment with safety constraints. Hence, there is an ease of… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.15830

  7. arXiv:2310.00109  [pdf, other

    cs.LG cs.DC cs.DL

    FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things

    Authors: Samiul Alam, Tuo Zhang, Tiantian Feng, Hui Shen, Zhichao Cao, Dong Zhao, JeongGil Ko, Kiran Somasundaram, Shrikanth S. Narayanan, Salman Avestimehr, Mi Zhang

    Abstract: There is a significant relevance of federated learning (FL) in the realm of Artificial Intelligence of Things (AIoT). However, most existing FL works do not use datasets collected from authentic IoT devices and thus do not capture unique modalities and inherent challenges of IoT data. To fill this critical gap, in this work, we introduce FedAIoT, an FL benchmark for AIoT. FedAIoT includes eight da… ▽ More

    Submitted 19 June, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

  8. arXiv:2307.06805  [pdf, other

    math.DS

    Path-Integral Formula for Computing Koopman Eigenfunctions

    Authors: Shankar A. Deka, Sriram S. K. S. Narayanan, Umesh Vaidya

    Abstract: The paper is about the computation of the principal spectrum of the Koopman operator (i.e., eigenvalues and eigenfunctions). The principal eigenfunctions of the Koopman operator are the ones with the corresponding eigenvalues equal to the eigenvalues of the linearization of the nonlinear system at an equilibrium point. The main contribution of this paper is to provide a novel approach for computin… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  9. Safe Navigation using Density Functions

    Authors: Andrew Zheng, Sriram S. K. S. Narayanan, Umesh Vaidya

    Abstract: This paper presents a novel approach for safe control synthesis using the dual formulation of the navigation problem. The main contribution of this paper is in the analytical construction of density functions for almost everywhere navigation with safety constraints. In contrast to the existing approaches, where density functions are used for the analysis of navigation problems, we use density func… ▽ More

    Submitted 16 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  10. arXiv:2306.02210  [pdf, other

    cs.LG cs.DC

    GPT-FL: Generative Pre-trained Model-Assisted Federated Learning

    Authors: Tuo Zhang, Tiantian Feng, Samiul Alam, Dimitrios Dimitriadis, Sunwoo Lee, Mi Zhang, Shrikanth S. Narayanan, Salman Avestimehr

    Abstract: In this work, we propose GPT-FL, a generative pre-trained model-assisted federated learning (FL) framework. At its core, GPT-FL leverages generative pre-trained models to generate diversified synthetic data. These generated data are used to train a downstream model on the server, which is then fine-tuned with private client data under the standard FL framework. We show that GPT-FL consistently out… ▽ More

    Submitted 17 June, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

  11. SE(3) Koopman-MPC: Data-driven Learning and Control of Quadrotor UAVs

    Authors: Sriram S. K. S. Narayanan, Duvan Tellez-Castro, Sarang Sutavani, Umesh Vaidya

    Abstract: In this paper, we propose a novel data-driven approach for learning and control of quadrotor UAVs based on the Koopman operator and extended dynamic mode decomposition (EDMD). Building observables for EDMD based on conventional methods like Euler angles (to represent orientation) is known to involve singularities. To address this issue, we employ a set of physics-informed observables based on the… ▽ More

    Submitted 16 October, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

  12. arXiv:2305.02938  [pdf, other

    cs.RO eess.SY

    Off-Road Navigation of Legged Robots Using Linear Transfer Operators

    Authors: Joseph Moyalan, Andrew Zheng, Sriram S. K. S Narayanan, Umesh Vaidya

    Abstract: This paper presents the implementation of off-road navigation on legged robots using convex optimization through linear transfer operators. Given a traversability measure that captures the off-road environment, we lift the navigation problem into the density space using the Perron-Frobenius (P-F) operator. This allows the problem formulation to be represented as a convex optimization. Due to the o… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  13. arXiv:2212.05154  [pdf, other

    cs.RO

    Optimal Control for Quadruped Locomotion using LTV MPC

    Authors: Andrew Zheng, Sriram S. K. S Narayanan

    Abstract: This paper presents a state-of-the-art optimal controller for quadruped locomotion. The robot dynamics is represented using a single rigid body (SRB) model. A linear time-varying model predictive controller (LTV MPC) is proposed by using linearization schemes. Simulation results show that the LTV MPC can execute various gaits, such as trot and crawl, and is capable of tracking desired reference tr… ▽ More

    Submitted 16 October, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  14. arXiv:2210.15707  [pdf, other

    cs.SD cs.DC eess.AS

    FedAudio: A Federated Learning Benchmark for Audio Tasks

    Authors: Tuo Zhang, Tiantian Feng, Samiul Alam, Sunwoo Lee, Mi Zhang, Shrikanth S. Narayanan, Salman Avestimehr

    Abstract: Federated learning (FL) has gained substantial attention in recent years due to the data privacy concerns related to the pervasiveness of consumer devices that continuously collect data from users. While a number of FL benchmarks have been developed to facilitate FL research, none of them include audio data and audio-related tasks. In this paper, we fill this critical gap by introducing a new FL b… ▽ More

    Submitted 8 February, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

  15. arXiv:2112.13416  [pdf, other

    cs.CR cs.LG cs.MM

    Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings

    Authors: Tiantian Feng, Hanieh Hashemi, Rajat Hebbar, Murali Annavaram, Shrikanth S. Narayanan

    Abstract: Speech emotion recognition (SER) processes speech signals to detect and characterize expressed perceived emotions. Many SER application systems often acquire and transmit speech data collected at the client-side to remote cloud platforms for inference and decision making. However, speech data carry rich information not only about emotions conveyed in vocal expressions, but also other sensitive dem… ▽ More

    Submitted 22 December, 2022; v1 submitted 26 December, 2021; originally announced December 2021.

  16. arXiv:2102.07896  [pdf, other

    eess.SP cs.SD eess.AS eess.IV

    A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images

    Authors: Yongwan Lim, Asterios Toutios, Yannick Bliesener, Ye Tian, Sajan Goud Lingala, Colin Vaz, Tanner Sorensen, Miran Oh, Sarah Harper, Weiyi Chen, Yoonjeong Lee, Johannes Töger, Mairym Lloréns Montesserin, Caitlin Smith, Bianca Godinez, Louis Goldstein, Dani Byrd, Krishna S. Nayak, Shrikanth S. Narayanan

    Abstract: Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 27 pages, 6 figures, 5 tables, submitted to Nature Scientific Data

  17. arXiv:2102.07271  [pdf, other

    eess.IV cs.CV

    Attention-gated convolutional neural networks for off-resonance correction of spiral real-time MRI

    Authors: Yongwan Lim, Shrikanth S. Narayanan, Krishna S. Nayak

    Abstract: Spiral acquisitions are preferred in real-time MRI because of their efficiency, which has made it possible to capture vocal tract dynamics during natural speech. A fundamental limitation of spirals is blurring and signal loss due to off-resonance, which degrades image quality at air-tissue boundaries. Here, we present a new CNN-based off-resonance correction method that incorporates an attention-g… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: 8 pages, 4 figures, 1 table

    Journal ref: 28th Int. Soc. Magn. Reson. Med. (ISMRM) Scientific Sessions, 2020, p.1005

  18. arXiv:2001.09427  [pdf, other

    eess.IV

    Deblurring for Spiral Real-Time MRI Using Convolutional Neural Networks

    Authors: Yongwan Lim, Shrikanth S Narayanan, Krishna S Nayak

    Abstract: Spiral acquisitions are preferred in real-time MRI because of their time efficiency. A fundamental limitation of spirals is image blurring due to off-resonance, which degrades image quality significantly at air-tissue boundaries. Here, we demonstrate a simple CNN-based deblurring method for spiral real-time MRI of human speech production. We show the CNN-based deblurring is capable of restoring bl… ▽ More

    Submitted 29 May, 2020; v1 submitted 26 January, 2020; originally announced January 2020.

    Comments: Presented at International Conference on Medical Imaging with Deep Learning (MIDL 2020) (A short conference paper of the full journal paper in the earlier submission version)

    Report number: MIDL/2020/ExtendedAbstract/zYareJYs8Z

  19. arXiv:1808.00876  [pdf, ps, other

    cs.LG cs.HC cs.MM cs.SD eess.AS

    Normalization Before Shaking Toward Learning Symmetrically Distributed Representation Without Margin in Speech Emotion Recognition

    Authors: Che-Wei Huang, Shrikanth S. Narayanan

    Abstract: Regularization is crucial to the success of many practical deep learning models, in particular in a more often than not scenario where there are only a few to a moderate number of accessible training samples. In addition to weight decay, data augmentation and dropout, regularization based on multi-branch architectures, such as Shake-Shake regularization, has been proven successful in many applicat… ▽ More

    Submitted 5 August, 2018; v1 submitted 2 August, 2018; originally announced August 2018.

    Comments: Submission to The IEEE Transactions

  20. arXiv:1706.02901  [pdf, ps, other

    cs.LG cs.CL cs.MM cs.SD

    Characterizing Types of Convolution in Deep Convolutional Recurrent Neural Networks for Robust Speech Emotion Recognition

    Authors: Che-Wei Huang, Shrikanth. S. Narayanan

    Abstract: Deep convolutional neural networks are being actively investigated in a wide range of speech and audio processing applications including speech recognition, audio event detection and computational paralinguistics, owing to their ability to reduce factors of variations, for learning from speech. However, studies have suggested to favor a certain type of convolutional operations when building a deep… ▽ More

    Submitted 13 January, 2018; v1 submitted 7 June, 2017; originally announced June 2017.

    Comments: Revised Submission to IEEE Transactions

  21. arXiv:1405.3122  [pdf

    physics.bio-ph cond-mat.mtrl-sci q-bio.BM

    Selection of Arginine-Rich Anti-Gold Antibodies Engineered for Plasmonic Colloid Self-Assembly

    Authors: Purvi Jain, Anandakumar Soshee, S Shankara Narayanan, Jadab Sharma, Christian Girard, Erik Dujardin, Clément Nizak

    Abstract: Antibodies are affinity proteins with a wide spectrum of applications in analytical and therapeutic biology. Proteins showing specific recognition for a chosen molecular target can be isolated and their encoding sequence identified in vitro from a large and diverse library by phage display selection. In this work, we show that this standard biochemical technique rapidly yields a collection of anti… ▽ More

    Submitted 13 May, 2014; originally announced May 2014.

    Comments: 34 pages, 6 figures, 1 table

  22. arXiv:1312.7463  [pdf, ps, other

    stat.ML cs.CV cs.LG

    Generalized Ambiguity Decomposition for Understanding Ensemble Diversity

    Authors: Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Shrikanth S. Narayanan

    Abstract: Diversity or complementarity of experts in ensemble pattern recognition and information processing systems is widely-observed by researchers to be crucial for achieving performance improvement upon fusion. Understanding this link between ensemble diversity and fusion performance is thus an important research question. However, prior works have theoretically characterized ensemble diversity and hav… ▽ More

    Submitted 28 December, 2013; originally announced December 2013.

    Comments: 32 pages, 10 figures

    ACM Class: I.5