Skip to main content

Showing 1–9 of 9 results for author: Gopalakrishnan, A

.
  1. arXiv:2405.17283  [pdf, other

    cs.LG cs.NE

    Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery

    Authors: Anand Gopalakrishnan, Aleksandar Stanić, Jürgen Schmidhuber, Michael Curtis Mozer

    Abstract: Current state-of-the-art synchrony-based models encode object bindings with complex-valued activations and compute with real-valued weights in feedforward architectures. We argue for the computational advantages of a recurrent architecture with complex-valued weights. We propose a fully convolutional autoencoder, SynCx, that performs iterative constraint satisfaction: at each iteration, a hidden l… ▽ More

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: minor typo fixed

  2. arXiv:2311.07534  [pdf, other

    cs.SD cs.LG eess.AS

    Unsupervised Musical Object Discovery from Audio

    Authors: Joonsu Gha, Vincent Herrmann, Benjamin Grewe, Jürgen Schmidhuber, Anand Gopalakrishnan

    Abstract: Current object-centric learning models such as the popular SlotAttention architecture allow for unsupervised visual scene decomposition. Our novel MusicSlots method adapts SlotAttention to the audio domain, to achieve unsupervised music decomposition. Since concepts of opacity and occlusion in vision have no auditory analogues, the softmax normalization of alpha masks in the decoders of visual obj… ▽ More

    Submitted 14 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to Machine Learning for Audio Workshop, NeurIPS 2023

  3. arXiv:2305.19044  [pdf, other

    cs.LG

    Exploring the Promise and Limits of Real-Time Recurrent Learning

    Authors: Kazuki Irie, Anand Gopalakrishnan, Jürgen Schmidhuber

    Abstract: Real-time recurrent learning (RTRL) for sequence-processing recurrent neural networks (RNNs) offers certain conceptual advantages over backpropagation through time (BPTT). RTRL requires neither caching past activations nor truncating context, and enables online learning. However, RTRL's time and space complexity make it impractical. To overcome this problem, most recent work on RTRL focuses on app… ▽ More

    Submitted 28 February, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted to ICLR 2024

  4. arXiv:2305.17066  [pdf, other

    cs.AI cs.CL cs.CV cs.LG cs.MA

    Mindstorms in Natural Language-Based Societies of Mind

    Authors: Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, **jie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-** Fan, Bernard Ghanem , et al. (1 additional authors not shown)

    Abstract: Both Minsky's "society of mind" and Schmidhuber's "learning to think" inspire diverse societies of large multimodal neural networks (NNs) that solve problems by interviewing each other in a "mindstorm." Recent implementations of NN-based societies of minds consist of large language models (LLMs) and other NN-based experts communicating through a natural language interface. In doing so, they overco… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 9 pages in main text + 7 pages of references + 38 pages of appendices, 14 figures in main text + 13 in appendices, 7 tables in appendices

    MSC Class: 68T07 ACM Class: I.2.6; I.2.11

  5. arXiv:2305.15001  [pdf, other

    cs.LG cs.AI cs.CV

    Contrastive Training of Complex-Valued Autoencoders for Object Discovery

    Authors: Aleksandar Stanić, Anand Gopalakrishnan, Kazuki Irie, Jürgen Schmidhuber

    Abstract: Current state-of-the-art object-centric models use slots and attention-based routing for binding. However, this class of models has several conceptual limitations: the number of slots is hardwired; all slots have equal capacity; training has high computational cost; there are no object-level relational factors within slots. Synchrony-based models in principle can address these limitations by using… ▽ More

    Submitted 9 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: accepted to NeurIPS 2023

  6. arXiv:2203.13573  [pdf, other

    cs.LG cs.AI cs.NE

    Unsupervised Learning of Temporal Abstractions with Slot-based Transformers

    Authors: Anand Gopalakrishnan, Kazuki Irie, Jürgen Schmidhuber, Sjoerd van Steenkiste

    Abstract: The discovery of reusable sub-routines simplifies decision-making and planning in complex reinforcement learning problems. Previous approaches propose to learn such temporal abstractions in a purely unsupervised fashion through observing state-action trajectories gathered from executing a policy. However, a current limitation is that they process each trajectory in an entirely sequential manner, w… ▽ More

    Submitted 22 November, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: accepted to Neural Computation journal

  7. arXiv:2011.12930  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Unsupervised Object Keypoint Learning using Local Spatial Predictability

    Authors: Anand Gopalakrishnan, Sjoerd van Steenkiste, Jürgen Schmidhuber

    Abstract: We propose PermaKey, a novel approach to representation learning based on object keypoints. It leverages the predictability of local image regions from spatial neighborhoods to identify salient regions that correspond to object parts, which are then converted to keypoints. Unlike prior approaches, it utilizes predictability to discover object keypoints, an intrinsic property of objects. This ensur… ▽ More

    Submitted 8 March, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: Accepted to ICLR 2021

  8. arXiv:1809.03036  [pdf, ps, other

    cs.CV

    A Neural Temporal Model for Human Motion Prediction

    Authors: Anand Gopalakrishnan, Ankur Mali, Dan Kifer, C. Lee Giles, Alexander G. Ororbia

    Abstract: We propose novel neural temporal models for predicting and synthesizing human motion, achieving state-of-the-art in modeling long-term motion trajectories while being competitive with prior work in short-term prediction and requiring significantly less computation. Key aspects of our proposed system include: 1) a novel, two-level processing architecture that aids in generating planned trajectories… ▽ More

    Submitted 22 November, 2019; v1 submitted 9 September, 2018; originally announced September 2018.

    Comments: accepted to cvpr 2019

    Journal ref: In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 12116-12125. 2019

  9. arXiv:0810.0914  [pdf, ps, other

    math.ST

    Probability models characterized by generalized reversed lack of memory property

    Authors: Asha Gopalakrishnan, Rejeesh C. John

    Abstract: A binary operator * over real numbers is said to be associative if $(x*y)*z=x*(y*z)$ and is said to be reducible if $x*y=x*z$ or $y*w=z*w$ if and only if $z=y$. The operation is said to have an identity element $\tilde{e}$ if $x*\tilde{e}=x$. In this paper a characterization of a subclass of the reversed generalized Pareto distribution (Castillo and Hadi (1995)) in terms of the reversed lack of… ▽ More

    Submitted 6 October, 2008; originally announced October 2008.

    Comments: Submitted to the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-EJS-EJS_2008_307