
Showing 1–22 of 22 results for author: Shankar, R

Searching in archive cs.
  1. arXiv:2403.01369  [pdf, other]

    eess.AS cs.AI cs.LG

    A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

    Authors: Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar

    Abstract: Self-supervised learned models have been found to be very effective for certain speech tasks such as automatic speech recognition, speaker identification, keyword spotting and others. While the features are undeniably useful in speech recognition and associated tasks, their utility in speech enhancement systems is yet to be firmly established, and perhaps not properly understood. In this paper, we…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 8 pages; Shorter form accepted in ICASSP 2024

  2. arXiv:2211.07798  [pdf, other]

    math.CO cs.CG math.GT

    A Uniform Sampling Procedure for Abstract Triangulations of Surfaces

    Authors: Rajan Shankar, Jonathan Spreer

    Abstract: We present a procedure to sample uniformly from the set of combinatorial isomorphism types of balanced triangulations of surfaces - also known as graph-encoded surfaces. For a given number $n$, the sample is a weighted set of graph-encoded surfaces with $2n$ triangles. The sampling procedure relies on connections between graph-encoded surfaces and permutations, and basic properties of the symmet…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 12 pages, 17 figures

    MSC Class: 57Q15; 57N05; 20B30; 05C15; 05C80

    Journal ref: This paper will be published in the proceedings of the SIAM Symposium on Algorithm Engineering and Experiments (ALENEX) 2023

  3. arXiv:2211.05071  [pdf, other]

    eess.AS cs.SD

    A Diffeomorphic Flow-based Variational Framework for Multi-speaker Emotion Conversion

    Authors: Ravi Shankar, Hsi-Wei Hsieh, Nicolas Charon, Archana Venkataraman

    Abstract: This paper introduces a new framework for non-parallel emotion conversion in speech. Our framework is based on two key contributions. First, we propose a stochastic version of the popular CycleGAN model. Our modified loss function introduces a Kullback Leibler (KL) divergence term that aligns the source and target data distributions learned by the generators, thus overcoming the limitations of sam…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted in IEEE Transactions on Audio, Speech and Language Processing

  4. arXiv:2211.05047  [pdf, other]

    eess.AS cs.AI cs.SD

    A Comparative Study of Data Augmentation Techniques for Deep Learning Based Emotion Recognition

    Authors: Ravi Shankar, Abdouh Harouna Kenfack, Arjun Somayazulu, Archana Venkataraman

    Abstract: Automated emotion recognition in speech is a long-standing problem. While early work on emotion recognition relied on hand-crafted features and simple classifiers, the field has now embraced end-to-end feature learning and classification using deep neural networks. In parallel to these models, researchers have proposed several data augmentation techniques to increase the size and variability of ex…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Under Submission

  5. arXiv:2207.02157  [pdf, other]

    cs.IT eess.SP

    Multi-IRS-Aided Doppler-Tolerant Wideband DFRC System

    Authors: Tong Wei, Linlong Wu, Kumar Vijay Mishra, M. R. Bhavani Shankar

    Abstract: Intelligent reflecting surface (IRS) is recognized as an enabler of future dual-function radar-communications (DFRC) by improving spectral efficiency, coverage, parameter estimation, and interference suppression. Prior studies on IRS-aided DFRC focus either on narrowband processing, single-IRS deployment, static targets, non-clutter scenario, or on the under-utilized line-of-sight (LoS) and non-li…

    Submitted 10 August, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: 16 pages, 8 figures, 2 tables

  6. arXiv:2205.15952  [pdf, other]

    cs.CL cs.AI cs.LG

    Knowledge Graph - Deep Learning: A Case Study in Question Answering in Aviation Safety Domain

    Authors: Ankush Agarwal, Raj Gite, Shreya Laddha, Pushpak Bhattacharyya, Satyanarayan Kar, Asif Ekbal, Prabhjit Thind, Rajesh Zele, Ravi Shankar

    Abstract: In the commercial aviation domain, there are a large number of documents, like, accident reports (NTSB, ASRS) and regulatory directives (ADs). There is a need for a system to access these diverse repositories efficiently in order to service needs in the aviation industry, like maintenance, compliance, and safety. In this paper, we propose a Knowledge Graph (KG) guided Deep Learning (DL) based Ques…

    Submitted 9 June, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: LREC 2022 Main Conference Accepted Paper

  7. The Rise of Intelligent Reflecting Surfaces in Integrated Sensing and Communications Paradigms

    Authors: Ahmet M. Elbir, Kumar Vijay Mishra, M. R. Bhavani Shankar, Symeon Chatzinotas

    Abstract: The intelligent reflecting surface (IRS) alters the behavior of wireless media and, consequently, has potential to improve the performance and reliability of wireless systems such as communications and radar remote sensing. Recently, integrated sensing and communications (ISAC) has been widely studied as a means to efficiently utilize spectrum and thereby save cost and power. This article investig…

    Submitted 20 December, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted paper in IEEE Network Magazine

    Journal ref: IEEE Network, 2023

  8. arXiv:2202.12014  [pdf, other]

    cs.CY

    TriggerCit: Early Flood Alerting using Twitter and Geolocation -- a comparison with alternative sources

    Authors: Carlo Bono, Barbara Pernici, Jose Luis Fernandez-Marquez, Amudha Ravi Shankar, Mehmet Oğuz Mülâyim, Edoardo Nemni

    Abstract: Rapid impact assessment in the immediate aftermath of a natural disaster is essential to provide adequate information to international organisations, local authorities, and first responders. Social media can support emergency response with evidence-based content posted by citizens and organisations during ongoing events. In the paper, we propose TriggerCit: an early flood alerting tool with a mult…

    Submitted 5 March, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 12 pages. Keywords: Social Media, Disaster management, Early Alerting

  9. arXiv:2107.04973  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    A Deep-Bayesian Framework for Adaptive Speech Duration Modification

    Authors: Ravi Shankar, Archana Venkataraman

    Abstract: We propose the first method to adaptively modify the duration of a given speech signal. Our approach uses a Bayesian framework to define a latent attention map that links frames of the input and target utterances. We train a masked convolutional encoder-decoder network to produce this attention map via a stochastic version of the mean absolute error loss function; our model also predicts the lengt…

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: 6 pages, 7 figures

  10. arXiv:2106.15764  [pdf, other]

    cs.AI cs.CR cs.CY cs.LG

    The Threat of Offensive AI to Organizations

    Authors: Yisroel Mirsky, Ambra Demontis, Jaidip Kotak, Ram Shankar, Deng Gelei, Liu Yang, Xiangyu Zhang, Wenke Lee, Yuval Elovici, Battista Biggio

    Abstract: AI has provided us with the ability to automate tasks, extract information from vast amounts of data, and synthesize media that is nearly indistinguishable from the real thing. However, positive tools can also be used for negative purposes. In particular, cyber adversaries can use AI (such as machine learning) to enhance their attacks and expand their campaigns. Although offensive AI has been di…

    Submitted 29 June, 2021; originally announced June 2021.

  11. arXiv:2010.03021  [pdf, other]

    cs.CY cs.SI

    Image-based Social Sensing: Combining AI and the Crowd to Mine Policy-Adherence Indicators from Twitter

    Authors: Virginia Negri, Dario Scuratti, Stefano Agresti, Donya Rooein, Gabriele Scalia, Amudha Ravi Shankar, Jose Luis Fernandez Marquez, Mark James Carman, Barbara Pernici

    Abstract: Social Media provides a trove of information that, if aggregated and analysed appropriately can provide important statistical indicators to policy makers. In some situations these indicators are not available through other mechanisms. For example, given the ongoing COVID-19 outbreak, it is essential for governments to have access to reliable data on policy-adherence with regards to mask wearing, s…

    Submitted 5 March, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 10 pages, 9 figures, to be published in Proceedings of ICSE Software Engineering in Society, May 2021

  12. arXiv:2007.15108  [pdf, other]

    eess.SP cs.IR math.OC

    Localization with One-Bit Passive Radars in Narrowband Internet-of-Things using Multivariate Polynomial Optimization

    Authors: Saeid Sedighi, Kumar Vijay Mishra, M. R. Bhavani Shankar, Björn Ottersten

    Abstract: Several Internet-of-Things (IoT) applications provide location-based services, wherein it is critical to obtain accurate position estimates by aggregating information from individual sensors. In the recently proposed narrowband IoT (NB-IoT) standard, which trades off bandwidth to gain wide coverage, the location estimation is compounded by the low sampling rate receivers and limited-capacity links…

    Submitted 9 April, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 16 pages, 11 figures

  13. arXiv:2007.12937  [pdf, other]

    eess.AS cs.LG cs.SD

    Multi-speaker Emotion Conversion via Latent Variable Regularization and a Chained Encoder-Decoder-Predictor Network

    Authors: Ravi Shankar, Hsi-Wei Hsieh, Nicolas Charon, Archana Venkataraman

    Abstract: We propose a novel method for emotion conversion in speech based on a chained encoder-decoder-predictor neural network architecture. The encoder constructs a latent embedding of the fundamental frequency (F0) contour and the spectrum, which we regularize using the Large Diffeomorphic Metric Mapping (LDDMM) registration framework. The decoder uses this embedding to predict the modified F0 contour i…

    Submitted 10 August, 2020; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: Paper Accepted in Interspeech 2020

  14. arXiv:2007.12932  [pdf, other]

    eess.AS cs.LG cs.SD

    Non-parallel Emotion Conversion using a Deep-Generative Hybrid Network and an Adversarial Pair Discriminator

    Authors: Ravi Shankar, Jacob Sager, Archana Venkataraman

    Abstract: We introduce a novel method for emotion conversion in speech that does not require parallel training data. Our approach loosely relies on a cycle-GAN schema to minimize the reconstruction error from converting back and forth between emotion pairs. However, unlike the conventional cycle-GAN, our discriminator classifies whether a pair of input real and generated samples corresponds to the desired e…

    Submitted 10 August, 2020; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: Paper accepted in Interspeech 2020

  15. arXiv:2001.01406  [pdf]

    cs.NI eess.SP

    Analysis of Selective-Decode and Forward Relaying Protocol Over kappa-mu Fading Channel Distribution

    Authors: Ravi Shankar, Lokesh Bhardwaj, Ritesh Kumar Mishra

    Abstract: In this work, we examine the performance of selective-decode and forward (S-DF) relay systems over kappa-mu fading channel condition. We discuss about the probability density function (PDF), system model, and cumulative distribution function (CDF) of kappa-mu distributed envelope and signal to noise ratio (SNR) and the techniques to generate samples that follow kappa-mu distribution. Specifically,…

    Submitted 6 January, 2020; originally announced January 2020.

  16. arXiv:1912.10036  [pdf, other]

    eess.SP cs.IT cs.LG

    A Family of Deep Learning Architectures for Channel Estimation and Hybrid Beamforming in Multi-Carrier mm-Wave Massive MIMO

    Authors: Ahmet M. Elbir, Kumar Vijay Mishra, M. R. Bhavani Shankar, Björn Ottersten

    Abstract: Hybrid analog and digital beamforming transceivers are instrumental in addressing the challenge of expensive hardware and high training overheads in the next generation millimeter-wave (mm-Wave) massive MIMO (multiple-input multiple-output) systems. However, lack of fully digital beamforming in hybrid architectures and short coherence times at mm-Wave impose additional constraints on the channel e…

    Submitted 3 January, 2022; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: Accepted Paper in IEEE Transactions on Cognitive Communications and Networking. arXiv admin note: text overlap with arXiv:1910.14240

  17. arXiv:1811.09850  [pdf]

    cs.NI

    Outage Probability Analysis of Selective-Decode and Forward Cooperative Wireless Network over Time Varying Fading Channels with Node Mobility and Imperfect CSI Condition

    Authors: Ravi Shankar, Ritesh Kumar Mishra

    Abstract: In this work, we explore the outage probability (OP) analysis of selective decode and forward (SDF) cooperation protocol employing multiple-input multiple-output (MIMO) orthogonal space-time block-code (OSTBC) over time varying Rayleigh fading channel conditions with imperfect channel state information (CSI) and mobile nodes. The closed-form expressions of the per-block average OP, probability dist…

    Submitted 24 November, 2018; originally announced November 2018.

  18. arXiv:1809.00654  [pdf]

    cs.NI

    PEP Analysis of Selective Decode and Forward Protocol over Keyhole Fading

    Authors: Ravi Shankar, Yamini Chandrakar, Radhika Sinha, Ritesh Kumar Mishra

    Abstract: We provide a closed form upper bound formulation for the average pairwise-error probability (PEP) of selective decode and forward (SDF) cooperation protocol for a keyhole (pinhole) channel condition. We have employed orthogonal space-time block-code scheme (OSTBC) in conjunction with multi-antenna (MIMO) technology. We have used moment generating function (MGF) based approach for deriving the uppe…

    Submitted 10 September, 2018; v1 submitted 3 September, 2018; originally announced September 2018.

    Comments: MICRO 2017

  19. arXiv:1802.06270  [pdf, other]

    cs.DC

    MAVIS: Managing Datacenters using Smartphones

    Authors: Raghav Shankar, Benjamin Kobin, Saurabh Bagchi, Michael Kistler, Jan Rellermeyer

    Abstract: Distributed monitoring plays a crucial role in managing the activities of cloud-based datacenters. System administrators have long relied on monitoring systems such as Nagios and Ganglia to obtain status alerts on their desktop-class machines. However, the popularity of mobile devices is pushing the community to develop datacenter monitoring solutions for smartphone-class devices. Here we lay out…

    Submitted 17 February, 2018; originally announced February 2018.

    Comments: ACM Classification (2012): Data center networks; System management; Ubiquitous and mobile computing systems and tools

  20. Signal Processing for High Throughput Satellite Systems: Challenges in New Interference-Limited Scenarios

    Authors: Ana I. Perez-Neira, Miguel Angel Vazquez, Sina Maleki, M. R. Bhavani Shankar, Symeon Chatzinotas

    Abstract: The field of satellite communications is enjoying a renewed interest in the global telecom market, and very high throughput satellites (V/HTS), with their multiple spot-beams, are key for delivering the future rate demands. In this article, the state-of-the-art and open research challenges of signal processing techniques for V/HTS systems are presented for the first time, with focus on novel appro…

    Submitted 12 February, 2018; originally announced February 2018.

  21. Security Enhancement With Optimal QOS Using EAP-AKA In Hybrid Coupled 3G-WLAN Convergence Network

    Authors: R. Shankar, Timothy Rajkumar. K, P. Dananjayan

    Abstract: The third generation partnership project (3GPP) has addressed the feasibility of interworking and specified the interworking architecture and security architecture for third generation (3G)-wireless local area network (WLAN), it is developing, system architecture evolution (SAE)/ long term evolution (LTE) architecture, for the next generation mobile communication system. To provide a secure 3G-WLA…

    Submitted 29 July, 2010; originally announced July 2010.

    Comments: 12 pages, 5 figures

    Journal ref: International Journal Of UbiComp 1.3 (2010) 31-42

  22. arXiv:cs/0506032  [pdf]

    cs.NE

    Framework for Hopfield Network based Adaptive routing - A design level approach for adaptive routing phenomena with Artificial Neural Network

    Authors: R. Shankar

    Abstract: Routing, as a basic phenomena, by itself, has got umpteen scopes to analyse, discuss and arrive at an optimal solution for the technocrats over years. Routing is analysed based on many factors; few key constraints that decide the factors are communication medium, time dependency, information source nature. Parametric routing has become the requirement of the day, with some kind of adaptation to…

    Submitted 10 June, 2005; originally announced June 2005.

    Comments: (13 pages, 7 figures, code)