Skip to main content

Showing 1–31 of 31 results for author: Rao, K S

.
  1. arXiv:2406.13384  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Straight Through Gumbel Softmax Estimator based Bimodal Neural Architecture Search for Audio-Visual Deepfake Detection

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra, Vinod Rathod

    Abstract: Deepfakes are a major security risk for biometric authentication. This technology creates realistic fake videos that can impersonate real people, fooling systems that rely on facial features and voice patterns for identification. Existing multimodal deepfake detectors rely on conventional fusion methods, such as majority rule and ensemble voting, which often struggle to adapt to changing data char… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2404.12679  [pdf, other

    cs.CV cs.CR

    MLSD-GAN -- Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra

    Abstract: Face-morphing attacks are a growing concern for biometric researchers, as they can be used to fool face recognition systems (FRS). These attacks can be generated at the image level (supervised) or representation level (unsupervised). Previous unsupervised morphing attacks have relied on generative adversarial networks (GANs). More recently, researchers have used linear interpolation of StyleGAN-en… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  3. BOXREC: Recommending a Box of Preferred Outfits in Online Shop**

    Authors: Debopriyo Banerjee, Krothapalli Sreenivasa Rao, Shamik Sural, Niloy Ganguly

    Abstract: Over the past few years, automation of outfit composition has gained much attention from the research community. Most of the existing outfit recommendation systems focus on pairwise item compatibility prediction (using visual and text features) to score an outfit combination having several items, followed by recommendation of top-n outfits or a capsule wardrobe having a collection of outfits based… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Journal ref: ACM Trans. Intell. Syst. Technol. 11, 6, Article 69 (December 2020), pages 69:1-69:28

  4. arXiv:2401.01356  [pdf, other

    cs.IR

    Efficient Indexing of Meta-Data (Extracted from Educational Videos)

    Authors: Shalika Kumbham, Abhijit Debnath, Krothapalli Sreenivasa Rao

    Abstract: Video lectures are becoming more popular and in demand as online classroom teaching is becoming more prevalent. Massive Open Online Courses (MOOCs), such as NPTEL, have been creating high-quality educational content that is freely accessible to students online. A large number of colleges across the country are now using NPTEL videos in their classrooms. So more video lectures are being recorded, m… ▽ More

    Submitted 11 December, 2023; originally announced January 2024.

  5. SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement

    Authors: Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler

    Abstract: This paper proposes SEFGAN, a Deep Neural Network (DNN) combining maximum likelihood training and Generative Adversarial Networks (GANs) for efficient speech enhancement (SE). For this, a DNN is trained to synthesize the enhanced speech conditioned on noisy speech using a Normalizing Flow (NF) as generator in a GAN framework. While the combination of likelihood models and GANs is not trivial, SEFG… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Preprint. Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2023

  6. arXiv:2310.15071  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci quant-ph

    Experimental signatures of quantum and topological states in frustrated magnetism

    Authors: J. Khatua, B. Sana, A. Zorko, M. Gomilšek, K. Sethupathi M. S. Ramachandra Rao, M. Baenitz, B. Schmidt, P. Khuntia

    Abstract: Frustration in magnetic materials arising from competing exchange interactions can prevent the system from adopting long-range magnetic order and can instead lead to a diverse range of novel quantum and topological states with exotic quasiparticle excitations. Here, we review prominent examples of such emergent phenomena, including magnetically-disordered and extensively degenerate spin ices, whic… ▽ More

    Submitted 15 November, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Journal ref: Physics Reports 1041, 1 (2023)

  7. arXiv:2310.12736  [pdf, other

    cs.CV

    ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face Swap**

    Authors: Aravinda Reddy PN, K. Sreenivasa Rao, Raghavendra Ramachandra, Pabitra mitra

    Abstract: We present a novel face swap** method using the progressively growing structure of a pre-trained StyleGAN. Previous methods use different encoder decoder structures, embedding integration networks to produce high-quality results, but their quality suffers from entangled representation. We disentangle semantics by deriving identity and attribute features separately. By learning to map the concate… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  8. arXiv:2202.01078  [pdf, other

    cs.SD eess.AS

    Melody Extraction from Polyphonic Music by Deep Learning Approaches: A Review

    Authors: Gurunath Reddy M, K. Sreenivasa Rao, Partha Pratim Das

    Abstract: Melody extraction is a vital music information retrieval task among music researchers for its potential applications in education pedagogy and the music industry. Melody extraction is a notoriously challenging task due to the presence of background instruments. Also, often melodic source exhibits similar characteristics to that of the other instruments. The interfering background accompaniment wit… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: 72 pages

  9. arXiv:2112.04841  [pdf, other

    eess.AS cs.MM cs.SD eess.SP

    On The Effect Of Coding Artifacts On Acoustic Scene Classification

    Authors: Nagashree K. S. Rao, Nils Peters

    Abstract: Previous DCASE challenges contributed to an increase in the performance of acoustic scene classification systems. State-of-the-art classifiers demand significant processing capabilities and memory which is challenging for resource-constrained mobile or IoT edge devices. Thus, it is more likely to deploy these models on more powerful hardware and classify audio recordings previously uploaded (or st… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: paper presented at the 2021 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)

  10. arXiv:2109.04138  [pdf, other

    cs.CR cs.CV

    Multilingual Audio-Visual Smartphone Dataset And Evaluation

    Authors: Hareesh Mandalapu, Aravinda Reddy P N, Raghavendra Ramachandra, K Sreenivasa Rao, Pabitra Mitra, S R Mahadeva Prasanna, Christoph Busch

    Abstract: Smartphones have been employed with biometric-based verification systems to provide security in highly sensitive applications. Audio-visual biometrics are getting popular due to their usability, and also it will be challenging to spoof because of their multimodal nature. In this work, we present an audio-visual smartphone dataset captured in five different recent smartphones. This new dataset cont… ▽ More

    Submitted 15 November, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

  11. Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey

    Authors: Hareesh Mandalapu, P N Aravinda Reddy, Raghavendra Ramachandra, K Sreenivasa Rao, Pabitra Mitra, S R Mahadeva Prasanna, Christoph Busch

    Abstract: Biometric recognition is a trending technology that uses unique characteristics data to identify or verify/authenticate security applications. Amidst the classically used biometrics, voice and face attributes are the most propitious for prevalent applications in day-to-day life because they are easy to obtain through restrained and user-friendly procedures. The pervasiveness of low-cost audio and… ▽ More

    Submitted 12 March, 2021; v1 submitted 24 January, 2021; originally announced January 2021.

    Journal ref: in IEEE Access, vol. 9, pp. 37431-37455, 2021

  12. arXiv:2011.06455  [pdf

    cs.GT physics.soc-ph q-bio.PE

    Optimal governance and implementation of vaccination programmes to contain the COVID-19 pandemic

    Authors: Mahendra Piraveenan, Shailendra Sawleshwarkar, Michael Walsh, Iryna Zablotska, Samit Bhattacharyya, Habib Hassan Farooqui, Tarun Bhatnagar, Anup Karan, Manoj Murhekar, Sanjay Zodpey, K. S. Mallikarjuna Rao, Philippa Pattison, Albert Zomaya, Matjaz Perc

    Abstract: Since the recent introduction of several viable vaccines for SARS-CoV-2, vaccination uptake has become the key factor that will determine our success in containing the COVID-19 pandemic. We argue that game theory and social network models should be used to guide decisions pertaining to vaccination programmes for the best possible results. In the months following the introduction of vaccines, their… ▽ More

    Submitted 9 June, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 15 pages, 1 figure; published in Royal Society Open Science

    Journal ref: R. Soc. Open Sci. 8, 210429 (2021)

  13. arXiv:2011.04297  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Knowledge Distillation for Singing Voice Detection

    Authors: Soumava Paul, Gurunath Reddy M, K Sreenivasa Rao, Partha Pratim Das

    Abstract: Singing Voice Detection (SVD) has been an active area of research in music information retrieval (MIR). Currently, two deep neural network-based methods, one based on CNN and the other on RNN, exist in literature that learn optimized features for the voice detection (VD) task and achieve state-of-the-art performance on common datasets. Both these models have a huge number of parameters (1.4M for C… ▽ More

    Submitted 19 August, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted at INTERSPEECH 2021. 5 pages, 3 figures

  14. arXiv:1909.03974  [pdf, other

    eess.AS cs.LG cs.SD

    DNN-based cross-lingual voice conversion using Bottleneck Features

    Authors: M Kiran Reddy, K Sreenivasa Rao

    Abstract: Cross-lingual voice conversion (CLVC) is a quite challenging task since the source and target speakers speak different languages. This paper proposes a CLVC framework based on bottleneck features and deep neural network (DNN). In the proposed method, the bottleneck features extracted from a deep auto-encoder (DAE) are used to represent speaker-independent features of speech signals from different… ▽ More

    Submitted 10 September, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

  15. arXiv:1908.09634  [pdf, ps, other

    eess.AS cs.SD eess.SP

    Multilingual and Multimode Phone Recognition System for Indian Languages

    Authors: Kumud Tripathi, M. Kiran Reddy, K. Sreenivasa Rao

    Abstract: The aim of this paper is to develop a flexible framework capable of automatically recognizing phonetic units present in a speech utterance of any language spoken in any mode. In this study, we considered two modes of speech: conversation, and read modes in four Indian languages, namely, Telugu, Kannada, Odia, and Bengali. The proposed approach consists of two stages: (1) Automatic speech mode clas… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: 33 pages, 5 figures, 6 tables, article

  16. arXiv:1908.08668  [pdf, ps, other

    eess.AS cs.SD

    VOP Detection for Read and Conversation Speech using CWT Coefficients and Phone Boundaries

    Authors: Kumud Tripathi, K. Sreenivasa Rao

    Abstract: In this paper, we propose a novel approach for accurate detection of the vowel onset points (VOPs). VOP is the instant at which the vowel begins in the speech signal. Precise identification of VOPs is important for various speech applications such as speech segmentation and speech rate modification. The existing methods detect the majority of VOPs within 40 ms deviation, and it may not be appropri… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: 21 pages, 8 figures, 4 tables, article

  17. arXiv:1904.09765  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    hf0: A hybrid pitch extraction method for multimodal voice

    Authors: Pradeep Rengaswamy, Gurunath Reddy M, Krothapalli Sreenivasa Rao

    Abstract: Pitch or fundamental frequency (f0) extraction is a fundamental problem studied extensively for its potential applications in speech and clinical applications. In literature, explicit mode specific (modal speech or singing voice or emotional/ expressive speech or noisy speech) signal processing and deep learning f0 extraction methods that exploit the quasi periodic nature of the signal in time, ha… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

    Comments: Pitch Extraction, F0 extraction, harmonic signals, speech, monophonic songs, Convolutional Neural Network, 5 pages, 5 figures

  18. arXiv:1811.09956  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Glottal Closure Instants Detection From Pathological Acoustic Speech Signal Using Deep Learning

    Authors: Gurunath Reddy M, Tanumay Mandal, Krothapalli Sreenivasa Rao

    Abstract: In this paper, we propose a classification based glottal closure instants (GCI) detection from pathological acoustic speech signal, which finds many applications in vocal disorder analysis. Till date, GCI for pathological disorder is extracted from laryngeal (glottal source) signal recorded from Electroglottograph, a dedicated device designed to measure the vocal folds vibration around the larynx.… ▽ More

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/39

  19. arXiv:1807.07710  [pdf, ps, other

    cs.CR

    Multivariate Public Key Cryptography and Digital Signature

    Authors: Pulugurtha Krishna Subba Rao, Duggirala Meher Krishna, Duggirala Ravi

    Abstract: In this paper, algorithms for multivariate public key cryptography and digital signature are described. Plain messages and encrypted messages are arrays, consisting of elements from a fixed finite ring or field. The encryption and decryption algorithms are based on multivariate map**s. The security of the private key depends on the difficulty of solving a system of parametric simultaneous multiv… ▽ More

    Submitted 23 July, 2018; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1608.06472

    MSC Class: 03C10; 11C08; 11T71; 12E20; 12Y05; 13A15; 13P10; 81P94; 94A60

  20. arXiv:1706.05879  [pdf, other

    cond-mat.mtrl-sci

    Competing Ferromagnetic and Anti-Ferromagnetic interactions in Iron Nitride $ζ$-Fe$_2$N

    Authors: K. Sandeep Rao, H. G. Salunke

    Abstract: The paper discusses the magnetic state of zeta phase of iron nitride viz. $ζ$-Fe$_2$N on the basis of spin polarized first principles electronic structure calculations together with a review of already published data. Results of our first principles study suggest that the ground state of $ζ$-Fe$_2$N is ferromagnetic (FM) with a magnetic moment of 1.528 $μ_\text{B}$ on the Fe site. The FM ground st… ▽ More

    Submitted 19 June, 2017; originally announced June 2017.

    Comments: 10 pages, 7 figures, 3 tables

  21. arXiv:1605.07544  [pdf, ps, other

    math.OC

    Evolutionary Stability of Polymorphic Population States in Continuous Games

    Authors: Dharini Hingu, K. S. Mallikarjuna Rao, A. J. Shaiju

    Abstract: In games with continuous strategy spaces, if a rest point of the replicator dynamics is asymptotically stable then the rest point must be finitely supported (van Veelen, M., Spreij, P., 2009. Evolution in games with a continuous action space. Econom. Theory 39 (3), 355-376). In this article, we address the converse question that is, we prove that a finitely supported population state is asymptotic… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: 19 Pages

  22. Characterization of maximum hands-off control

    Authors: Debasish Chatterjee, Masaaki Nagahara, Daniel Quevedo, K. S. Mallikarjuna Rao

    Abstract: Maximum hands-off control aims to maximize the length of time over which zero actuator values are applied to a system when executing specified control tasks. To tackle such problems, recent literature has investigated optimal control problems which penalize the size of the support of the control function and thereby lead to desired sparsity properties. This article gives the exact set of necessary… ▽ More

    Submitted 29 February, 2016; originally announced February 2016.

    Comments: 6 pages

    Journal ref: Systems & Control Letters, Vol. 94, pp. 31-36, 2016

  23. arXiv:1409.4962  [pdf

    physics.chem-ph

    Facile preparation of agarose-chitosan hybrid materials and nanocomposite ionogels using an ionic liquid via dissolution, regeneration and sol-gel transition

    Authors: Tushar J. Trivedi, K. Srinivasa Rao, Arvind Kumar

    Abstract: We report simultaneous dissolution of agarose (AG) and chitosan (CH) in varying proportions in an ionic liquid (IL), 1-butyl-3-methylimidazolium chloride [C4mim][Cl]. Composite materials were constructed from AG-CH-IL solutions using the antisolvent methanol, and IL was recovered from the solutions. Composite materials could be uniformly decorated with silver oxide (Ag2O) nanoparticles (Ag NPs) to… ▽ More

    Submitted 17 September, 2014; originally announced September 2014.

  24. arXiv:1405.2049  [pdf, other

    cs.IT cs.CR

    A New Upperbound for the Oblivious Transfer Capacity of Discrete Memoryless Channels

    Authors: K. Sankeerth Rao, Vinod M. Prabhakaran

    Abstract: We derive a new upper bound on the string oblivious transfer capacity of discrete memoryless channels. The main tool we use is the tension region of a pair of random variables introduced in Prabhakaran and Prabhakaran (2014) where it was used to derive upper bounds on rates of secure sampling in the source model. In this paper, we consider secure computation of string oblivious transfer in the cha… ▽ More

    Submitted 8 May, 2014; originally announced May 2014.

    Comments: 7 pages, 3 figures, extended version of submission to IEEE Information Theory Workshop, 2014

  25. arXiv:1209.4157  [pdf, ps, other

    cs.OH

    AutoAmp : An Open-Source Analog Amplifier Design Tool - For Classroom and Lab Purposes

    Authors: Om Prasad Patri, K. Sanmukh Rao

    Abstract: This correspondence presents an open-source tool AutoAmp developed at the Indian Institute of Technology, Guwahati. It is available at http://sourceforge.net/projects/autoamp-iitg/ This tool helps the user to design different types of electronic amplifiers, using solid state devices, for a given specification. It can handle several types of designs namely common-emitter BJT amplifier (single and t… ▽ More

    Submitted 19 September, 2012; originally announced September 2012.

    Comments: presented at the Indian Conference for Academic Research by Undergraduate Students (ICARUS), 2010, IIT Kanpur; AutoAmp : An Open-Source Analog Amplifier Design Tool - For Classroom and Lab Purposes, Proceedings of the Indian Conference for Academic Research by Undergraduate Students (ICARUS), 2010

  26. Evolutionary Stability Against Multiple Mutations

    Authors: Anirban Ghatak, K. S. Mallikarjuna Rao, A. J. Shaiju

    Abstract: It is known (see e.g. Weibull (1995)) that ESS is not robust against multiple mutations. In this article, we introduce robustness against multiple mutations and study some equivalent formulations and consequences.

    Submitted 11 January, 2012; originally announced January 2012.

    Comments: Submitted article

    MSC Class: 91A22

  27. arXiv:1001.4190  [pdf

    cs.SD

    Speech Recognition of the letter 'zha' in Tamil Language using HMM

    Authors: A. Srinivasan, K. Srinivasa Rao, K. Kannan, D. Narasimhan

    Abstract: Speech signals of the letter 'zha' in Tamil language of 3 males and 3 females were coded using an improved version of Linear Predictive Coding (LPC). The sampling frequency was at 16 kHz and the bit rate was at 15450 bits per second, where the original bit rate was at 128000 bits per second with the help of wave surfer audio tool. The output LPC cepstrum is implemented in first order three state… ▽ More

    Submitted 23 January, 2010; originally announced January 2010.

    Comments: 6 Pages

    Report number: IJEST09-01-02-05

    Journal ref: IJEST Volume 1 Issue 2 2009 67-72

  28. arXiv:math/0602613  [pdf, ps, other

    math.NT math-ph math.QA

    Two-parameter quantum algebras, twin-basic numbers, and associated generalized hypergeometric series

    Authors: R. Jagannathan, K. Srinivasa Rao

    Abstract: We give a method to embed the q-series in a (p,q)-series and derive the corresponding (p,q)-extensions of the known q-identities. The (p,q)-hypergeometric series, or twin-basic hypergeometric series (diferent from the usual bibasic hypergeometric series), is based on the concept of twin-basic number [n]_{p,q} = (p^n - q^n)/(p-q). This twin-basic number occurs in the theory of two-parameter quant… ▽ More

    Submitted 27 February, 2006; originally announced February 2006.

    Comments: 16 pages, To appear in the Proceedings of the International Conference on Number Theory and Mathematical Physics, 20-21 December 2005, Srinivasa Ramanujan Centre, Kumbakonam, India

  29. arXiv:math/0406076  [pdf, ps, other

    math.AP math.PR

    A probabilistic approach to second order variational inequalities with bilateral constraints

    Authors: Mrinal K Ghosh, K S Mallikarjuna Rao

    Abstract: We study a class of second order variational inequalities with bilateral constraints. Under certain conditions we show the existence of a unique viscosity solution of these variational inequalities and give a stochastic representation to this solution. As an application, we study a stochastic game with stop** times and show the existence of a saddle point equilibrium.

    Submitted 4 June, 2004; originally announced June 2004.

    Comments: 12 pages, no figures, no tables

    Journal ref: Proc. Indian Acad. Sci. (Math. Sci.), Vol. 113, No. 4, November 2003, pp. 431-442

  30. arXiv:math/0304317  [pdf, ps, other

    math.CA

    An Entry of Ramanujan on Hypergeometric Series in his Notebooks

    Authors: K. Srinivasa Rao, G. Vanden Berghe, Christian Krattenthaler

    Abstract: Example 7, after Entry 43, in Chapter XII of the first Notebook of Srinivasa Ramanujan is proved and, more generally, a summation theorem for $_3F_2(a,a,x;1+a,1+a+N;1)$, where $N$ is a non-negative integer, is derived.

    Submitted 22 April, 2003; originally announced April 2003.

    Comments: 8 pages, AmS-LaTeX

    MSC Class: 33C20 (Primary) 33C05; 33B15 (Secondary)

    Journal ref: J. Comput. Math. Appl. 173 (2004), 239-246.

  31. arXiv:math/0003184  [pdf, ps, other

    math.HO math.NT

    Life and work of the mathemagician Srinivasa Ramanujan

    Authors: K. Srinivasa Rao

    Abstract: The Life of Srinivasa Ramanujan (1887 - 1920), the renowned Indian Mathematician, is presented, in this the first of a series of lectures, delivered at the Indian Institute for Advanced Study, Shimla.

    Submitted 28 March, 2000; originally announced March 2000.

    Comments: 30 pages, LaTeX file