Search | arXiv e-print repository

doi 10.1093/cercor/bhab456

A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation

Authors: M Ganesh Kumar, Cheston Tan, Camilo Libedinsky, Shih-Cheng Yen, Andrew Yong-Yi Tan

Abstract: Navigation to multiple cued reward locations has been increasingly used to study rodent learning. Though deep reinforcement learning agents have been shown to be able to learn the task, they are not biologically plausible. Biologically plausible classic actor-critic agents have been shown to learn to navigate to single reward locations, but which biologically plausible agents are able to learn mul… ▽ More Navigation to multiple cued reward locations has been increasingly used to study rodent learning. Though deep reinforcement learning agents have been shown to be able to learn the task, they are not biologically plausible. Biologically plausible classic actor-critic agents have been shown to learn to navigate to single reward locations, but which biologically plausible agents are able to learn multiple cue-reward location tasks has remained unclear. In this computational study, we show versions of classic agents that learn to navigate to a single reward location, and adapt to reward location displacement, but are not able to learn multiple paired association navigation. The limitation is overcome by an agent in which place cell and cue information are first processed by a feedforward nonlinear hidden layer with synapses to the actor and critic subject to temporal difference error-modulated plasticity. Faster learning is obtained when the feedforward layer is replaced by a recurrent reservoir network. △ Less

Submitted 15 July, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

Comments: 31 pages, 8 figures. Acknowledgements revised

Journal ref: Cerebral Cortex, 2022;, bhab456

arXiv:2106.03580 [pdf]

One-shot learning of paired association navigation with biologically plausible schemas

Authors: M Ganesh Kumar, Cheston Tan, Camilo Libedinsky, Shih-Cheng Yen, Andrew Yong-Yi Tan

Abstract: Schemas are knowledge structures that can enable rapid learning. Rodent one-shot learning in a multiple paired association navigation task has been postulated to be schema-dependent. But how schemas, conceptualized at Marr's computational level, correspond with neural implementations remains poorly understood, and a biologically plausible computational model of the rodent learning has not been dem… ▽ More Schemas are knowledge structures that can enable rapid learning. Rodent one-shot learning in a multiple paired association navigation task has been postulated to be schema-dependent. But how schemas, conceptualized at Marr's computational level, correspond with neural implementations remains poorly understood, and a biologically plausible computational model of the rodent learning has not been demonstrated. Here, we compose such an agent from schemas with biologically plausible neural implementations. The agent contains an associative memory that can form one-shot associations between sensory cues and goal coordinates, implemented with a feedforward layer or a reservoir of recurrently connected neurons whose plastic output weights are governed by a novel 4-factor reward-modulated Exploratory Hebbian (EH) rule. Adding an actor-critic allows the agent to succeed even if an obstacle prevents direct heading. With the addition of working memory, the rodent behavior is replicated. Temporal-difference learning of a working memory gating mechanism enables one-shot learning despite distractors. △ Less

Submitted 27 August, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

Comments: Minor revisions from version 2 preprint

arXiv:1812.04786 [pdf, ps, other]

Experimental Comparison of Hardware-Amenable Spike Detection Algorithms for iBMIs

Authors: Shoeb Shaikh, Rosa So, Camilo Libedinsky, Arindam Basu

Abstract: This paper presents an experiment based comparison of absolute threshold (AT) and non-linear energy operator (NEO) spike detection algorithms in Intra-cortical Brain Machine Interfaces (iBMIs). Results show an average increase in decoding performance of approx. 5% in monkey A across 28 sessions recorded over 6 days and approx. 2% in monkey B across 35 sessions recorded over 8 days when using NEO o… ▽ More This paper presents an experiment based comparison of absolute threshold (AT) and non-linear energy operator (NEO) spike detection algorithms in Intra-cortical Brain Machine Interfaces (iBMIs). Results show an average increase in decoding performance of approx. 5% in monkey A across 28 sessions recorded over 6 days and approx. 2% in monkey B across 35 sessions recorded over 8 days when using NEO over AT. To the best of our knowledge, this is the first ever reported comparison of spike detection algorithms in an iBMI experimental framework involving two monkeys. Based on the improvements observed in an experimental setting backed by previously reported improvements in simulation studies, we advocate switching from state of the art spike detection technique - AT to NEO. △ Less

Submitted 11 December, 2018; originally announced December 2018.

Comments: accepted at NER (Neural Engineering Conference) - 2019

arXiv:1812.03991 [pdf, ps, other]

Real-time Closed Loop Neural Decoding on a Neuromorphic Chip

Authors: Shoeb Shaikh, Rosa So, Tafadzwa Sibindi, Camilo Libedinsky, Arindam Basu

Abstract: This paper presents for the first time a real-time closed loop neuromorphic decoder chip-driven intra-cortical brain machine interface (iBMI) in a non-human primate (NHP) based experimental setup. Decoded results show trial success rates and mean times to target comparable to those obtained by hand-controlled joystick. Neural control trial success rates of approximately 96% of those obtained by ha… ▽ More This paper presents for the first time a real-time closed loop neuromorphic decoder chip-driven intra-cortical brain machine interface (iBMI) in a non-human primate (NHP) based experimental setup. Decoded results show trial success rates and mean times to target comparable to those obtained by hand-controlled joystick. Neural control trial success rates of approximately 96% of those obtained by hand-controlled joystick have been demonstrated. Also, neural control has shown mean target reach speeds of approximately 85% of those obtained by hand-controlled joystick . These results pave the way for fast and accurate, fully implantable neuromorphic neural decoders in iBMIs. △ Less

Submitted 10 December, 2018; originally announced December 2018.

Comments: accepted at Neural Engineering Conference (NER), 2019

Showing 1–4 of 4 results for author: Libedinsky, C