
Showing 1–7 of 7 results for author: Kumar, M G

  1. arXiv:2309.04504  [pdf, other]

    cs.LG cs.AI

    Compositional Learning of Visually-Grounded Concepts Using Reinforcement

    Authors: Zijun Lin, Haidi Azaman, M Ganesh Kumar, Cheston Tan

    Abstract: Children can rapidly generalize compositionally-constructed rules to unseen test sets. On the other hand, deep reinforcement learning (RL) agents need to be trained over millions of episodes, and their ability to generalize to unseen combinations remains unclear. Hence, we investigate the compositional abilities of RL agents, using the task of navigating to specified color-shape targets in synthet…

    Submitted 3 May, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

  2. arXiv:2309.03483  [pdf, other]

    cs.CV

    DetermiNet: A Large-Scale Diagnostic Dataset for Complex Visually-Grounded Referencing using Determiners

    Authors: Clarence Lee, M Ganesh Kumar, Cheston Tan

    Abstract: State-of-the-art visual grounding models can achieve high detection accuracy, but they are not designed to distinguish between all objects versus only certain objects of interest. In natural language, in order to specify a particular object or set of objects of interest, humans use determiners such as "my", "either" and "those". Determiners, as an important word class, are a type of schema in natu…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 10 pages, 6 figures

  3. arXiv:2106.13541  [pdf]

    cs.NE q-bio.NC

    A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation

    Authors: M Ganesh Kumar, Cheston Tan, Camilo Libedinsky, Shih-Cheng Yen, Andrew Yong-Yi Tan

    Abstract: Navigation to multiple cued reward locations has been increasingly used to study rodent learning. Though deep reinforcement learning agents have been shown to be able to learn the task, they are not biologically plausible. Biologically plausible classic actor-critic agents have been shown to learn to navigate to single reward locations, but which biologically plausible agents are able to learn mul…

    Submitted 15 July, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: 31 pages, 8 figures. Acknowledgements revised

    Journal ref: Cerebral Cortex, 2022, bhab456

  4. arXiv:2106.03580  [pdf]

    cs.NE q-bio.NC

    One-shot learning of paired association navigation with biologically plausible schemas

    Authors: M Ganesh Kumar, Cheston Tan, Camilo Libedinsky, Shih-Cheng Yen, Andrew Yong-Yi Tan

    Abstract: Schemas are knowledge structures that can enable rapid learning. Rodent one-shot learning in a multiple paired association navigation task has been postulated to be schema-dependent. But how schemas, conceptualized at Marr's computational level, correspond with neural implementations remains poorly understood, and a biologically plausible computational model of the rodent learning has not been dem…

    Submitted 27 August, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Minor revisions from version 2 preprint

  5. arXiv:2106.01400  [pdf, other]

    eess.AS cs.LG cs.SD

    Dual Script E2E framework for Multilingual and Code-Switching ASR

    Authors: Mari Ganesh Kumar, Jom Kuriakose, Anand Thyagachandran, Arun Kumar A, Ashish Seth, Lodagala Durga Prasad, Saish Jaiswal, Anusha Prakash, Hema Murthy

    Abstract: India is home to multiple languages, and training automatic speech recognition (ASR) systems for languages is challenging. Over time, each language has adopted words from other languages, such as English, leading to code-mixing. Most Indian languages also have their own unique scripts, which poses a major limitation in training multilingual and code-switching ASR systems. Inspired by results in…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at Interspeech 2021

  6. Evidence of Task-Independent Person-Specific Signatures in EEG using Subspace Techniques

    Authors: Mari Ganesh Kumar, Shrikanth Narayanan, Mriganka Sur, Hema A Murthy

    Abstract: Electroencephalography (EEG) signals are promising as alternatives to other biometrics owing to their protection against spoofing. Previous studies have focused on capturing individual variability by analyzing task/condition-specific EEG. This work attempts to model biometric signatures independent of task/condition by normalizing the associated variance. Toward this goal, the paper extends ideas…

    Submitted 25 March, 2021; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE Transactions on Information Forensics and Security, 2021

  7. arXiv:1904.07453  [pdf, other]

    eess.AS cs.CR cs.LG cs.SD

    Spoof detection using time-delay shallow neural network and feature switching

    Authors: Mari Ganesh Kumar, Suvidha Rupesh Kumar, Saranya M, B. Bharathi, Hema A. Murthy

    Abstract: Detecting spoofed utterances is a fundamental problem in voice-based biometrics. Spoofing can be performed either by logical accesses like speech synthesis, voice conversion or by physical accesses such as replaying the pre-recorded utterance. Inspired by the state-of-the-art x-vector based speaker verification approach, this paper proposes a time-delay shallow neural network (TD-SNN) for s…

    Submitted 23 January, 2020; v1 submitted 16 April, 2019; originally announced April 2019.

    Journal ref: 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1011–1017