Skip to main content

Showing 1–8 of 8 results for author: Herrmann, V

.
  1. arXiv:2403.11998  [pdf, other

    cs.LG

    Learning Useful Representations of Recurrent Neural Network Weight Matrices

    Authors: Vincent Herrmann, Francesco Faccio, Jürgen Schmidhuber

    Abstract: Recurrent Neural Networks (RNNs) are general-purpose parallel-sequential computers. The program of an RNN is its weight matrix. How to learn useful representations of RNN weights that facilitate RNN analysis as well as downstream tasks? While the mechanistic approach directly looks at some RNN's weights to predict its behavior, the functionalist approach analyzes its overall functionality-specific… ▽ More

    Submitted 18 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    ACM Class: I.2.6

  2. arXiv:2311.07534  [pdf, other

    cs.SD cs.LG eess.AS

    Unsupervised Musical Object Discovery from Audio

    Authors: Joonsu Gha, Vincent Herrmann, Benjamin Grewe, Jürgen Schmidhuber, Anand Gopalakrishnan

    Abstract: Current object-centric learning models such as the popular SlotAttention architecture allow for unsupervised visual scene decomposition. Our novel MusicSlots method adapts SlotAttention to the audio domain, to achieve unsupervised music decomposition. Since concepts of opacity and occlusion in vision have no auditory analogues, the softmax normalization of alpha masks in the decoders of visual obj… ▽ More

    Submitted 14 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to Machine Learning for Audio Workshop, NeurIPS 2023

  3. arXiv:2305.17066  [pdf, other

    cs.AI cs.CL cs.CV cs.LG cs.MA

    Mindstorms in Natural Language-Based Societies of Mind

    Authors: Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, **jie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-** Fan, Bernard Ghanem , et al. (1 additional authors not shown)

    Abstract: Both Minsky's "society of mind" and Schmidhuber's "learning to think" inspire diverse societies of large multimodal neural networks (NNs) that solve problems by interviewing each other in a "mindstorm." Recent implementations of NN-based societies of minds consist of large language models (LLMs) and other NN-based experts communicating through a natural language interface. In doing so, they overco… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 9 pages in main text + 7 pages of references + 38 pages of appendices, 14 figures in main text + 13 in appendices, 7 tables in appendices

    MSC Class: 68T07 ACM Class: I.2.6; I.2.11

  4. arXiv:2212.14374  [pdf, other

    cs.LG cs.AI

    Learning One Abstract Bit at a Time Through Self-Invented Experiments Encoded as Neural Networks

    Authors: Vincent Herrmann, Louis Kirsch, Jürgen Schmidhuber

    Abstract: There are two important things in science: (A) Finding answers to given questions, and (B) Coming up with good questions. Our artificial scientists not only learn to answer given questions, but also continually invent new questions, by proposing hypotheses to be verified or falsified through potentially complex and time-consuming experiments, including thought experiments akin to those of mathemat… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: 20 pages, 6 figures

  5. arXiv:2211.12423  [pdf, other

    cs.CL cs.AI cs.LG cs.MM cs.NE cs.SD eess.AS

    On Narrative Information and the Distillation of Stories

    Authors: Dylan R. Ashley, Vincent Herrmann, Zachary Friggstad, Jürgen Schmidhuber

    Abstract: The act of telling stories is a fundamental part of what it means to be human. This work introduces the concept of narrative information, which we define to be the overlap in information space between a story and the items that compose the story. Using contrastive learning methods, we show how modern artificial neural networks can be leveraged to distill stories and extract a representation of the… ▽ More

    Submitted 13 February, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: presented in the Information-Theoretic Principles in Cognitive Systems Workshop at the 36th Conference on Neural Information Processing Systems; 4 pages in main text + 2 pages of references + 8 pages of appendices, 2 figures in main text + 3 in appendices, 1 table in main text, 2 algorithms in appendices; source code available at https://github.com/dylanashley/story-distiller

    MSC Class: 68T07 (Primary) 68P30; 68W50; 94A15 (Secondary) ACM Class: H.1.1; H.5.5; I.2.6; I.5.1; J.5

  6. arXiv:2207.01570  [pdf, other

    cs.LG stat.ML

    Goal-Conditioned Generators of Deep Policies

    Authors: Francesco Faccio, Vincent Herrmann, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Goal-conditioned Reinforcement Learning (RL) aims at learning optimal policies, given goals encoded in special command inputs. Here we study goal-conditioned neural nets (NNs) that learn to generate deep NN policies in form of context-specific weight matrices, similar to Fast Weight Programmers and other methods from the 1990s. Using context commands of the form "generate a policy that achieves a… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Preprint. Under Review

  7. arXiv:2207.01566  [pdf, other

    cs.LG stat.ML

    General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States

    Authors: Francesco Faccio, Aditya Ramesh, Vincent Herrmann, Jean Harb, Jürgen Schmidhuber

    Abstract: Learning to evaluate and improve policies is a core problem of Reinforcement Learning (RL). Traditional RL algorithms learn a value function defined for a single policy. A recently explored competitive alternative is to learn a single value function for many policies. Here we combine the actor-critic architecture of Parameter-Based Value Functions and the policy embedding of Policy Evaluation Netw… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Preprint. Under review

  8. arXiv:2111.02216  [pdf, other

    cs.CL cs.LG cs.MM cs.SD eess.AS

    Automatic Embedding of Stories Into Collections of Independent Media

    Authors: Dylan R. Ashley, Vincent Herrmann, Zachary Friggstad, Kory W. Mathewson, Jürgen Schmidhuber

    Abstract: We look at how machine learning techniques that derive properties of items in a collection of independent media can be used to automatically embed stories into such collections. To do so, we use models that extract the tempo of songs to make a music playlist follow a narrative arc. Our work specifies an open-source tool that uses pre-trained neural network models to extract the global tempo of a s… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 2 pages in main text + 1 page of references + 6 pages of appendices, 2 figures in main text + 3 figures in appendices, 1 algorithm in appendices; source code available at https://gist.github.com/dylanashley/1387a99deb85bfc0bce11286810cd98b

    ACM Class: H.5.5; I.2.6; J.5