-
Logical Activation Functions: Logit-space equivalents of Probabilistic Boolean Operators
Authors:
Scott C. Lowe,
Robert Earle,
Jason d'Eon,
Thomas Trappenberg,
Sageev Oore
Abstract:
The choice of activation functions and their motivation is a long-standing issue within the neural network community. Neuronal representations within artificial neural networks are commonly understood as logits, representing the log-odds score of the presence of features within the stimulus. We derive logit-space operators equivalent to the probabilistic Boolean logic gates AND, OR, and XNOR for independent probabilities. Such theories are important to formalize more complex dendritic operations in real neurons, and these operations can be used as activation functions within a neural network, introducing probabilistic Boolean logic as the core operation of the neural network. Since these functions involve taking multiple exponentials and logarithms, they are computationally expensive and not well suited to be directly used within neural networks. Consequently, we construct efficient approximations named $\text{AND}_\text{AIL}$ (the AND operator Approximate for Independent Logits), $\text{OR}_\text{AIL}$, and $\text{XNOR}_\text{AIL}$, which utilize only comparison and addition operations, have well-behaved gradients, and can be deployed as activation functions in neural networks. Like MaxOut, $\text{AND}_\text{AIL}$ and $\text{OR}_\text{AIL}$ are generalizations of ReLU to two dimensions. While our primary aim is to formalize dendritic computations within a logit-space probabilistic-Boolean framework, we deploy these new activation functions, both in isolation and in conjunction, to demonstrate their effectiveness on a variety of tasks including image classification, transfer learning, abstract reasoning, and compositional zero-shot learning.
Submitted 29 November, 2022; v1 submitted 22 October, 2021;
originally announced October 2021.
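As a rough illustration of the exact logit-space operators described in the abstract above, the sketch below maps two logits to probabilities, applies the Boolean gate for independent events, and maps the result back to a logit. The function names (`and_logit`, `or_logit`, `xnor_logit`) are hypothetical and are not taken from the paper; the efficient $\text{AND}_\text{AIL}$, $\text{OR}_\text{AIL}$, and $\text{XNOR}_\text{AIL}$ approximations are not reproduced here.

```python
import math


def sigmoid(x: float) -> float:
    """Map a logit (log-odds) to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def logit(p: float) -> float:
    """Map a probability in (0, 1) back to a logit."""
    return math.log(p / (1.0 - p))


def and_logit(x: float, y: float) -> float:
    """Logit of P(A and B) for independent A, B with logits x, y."""
    return logit(sigmoid(x) * sigmoid(y))


def or_logit(x: float, y: float) -> float:
    """Logit of P(A or B) = 1 - (1 - P(A)) * (1 - P(B))."""
    return logit(1.0 - (1.0 - sigmoid(x)) * (1.0 - sigmoid(y)))


def xnor_logit(x: float, y: float) -> float:
    """Logit of P(A == B) = P(A)P(B) + (1 - P(A))(1 - P(B))."""
    px, py = sigmoid(x), sigmoid(y)
    return logit(px * py + (1.0 - px) * (1.0 - py))


if __name__ == "__main__":
    # Two maximally uncertain inputs (logit 0, i.e. probability 0.5).
    print(and_logit(0.0, 0.0))   # logit of 0.25
    print(or_logit(0.0, 0.0))    # logit of 0.75
    print(xnor_logit(0.0, 0.0))  # logit of 0.5
```

Each call costs several exponentials and a logarithm, which is the expense the abstract cites as the motivation for cheaper approximations. This naive version is also only suitable for moderate logit magnitudes, since the intermediate probabilities saturate for large inputs.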
-
Musical Speech: A Transformer-based Composition Tool
Authors:
Jason d'Eon,
Sri Harsha Dumpala,
Chandramouli Shama Sastry,
Dani Oore,
Sageev Oore
Abstract:
In this paper, we propose a new compositional tool that will generate a musical outline of speech recorded/provided by the user for use as a musical building block in their compositions. The tool allows any user to use their own speech to generate musical material, while still being able to hear the direct connection between their recorded speech and the resulting music. The tool is built on our proposed pipeline. This pipeline begins with speech-based signal processing, after which some simple musical heuristics are applied, and finally these pre-processed signals are passed through Transformer models trained on new musical tasks. We illustrate the effectiveness of our pipeline -- which does not require a paired dataset for training -- through examples of music created by musicians making use of our tool.
Submitted 2 August, 2021;
originally announced August 2021.
-
The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models
Authors:
Greg d'Eon,
Jason d'Eon,
James R. Wright,
Kevin Leyton-Brown
Abstract:
Supervised learning models often make systematic errors on rare subsets of the data. When these subsets correspond to explicit labels in the data (e.g., gender, race) such poor performance can be identified straightforwardly. This paper introduces a method for discovering systematic errors that do not correspond to such explicitly labelled subgroups. The key idea is that similar inputs tend to have similar representations in the final hidden layer of a neural network. We leverage this structure by "shining a spotlight" on this representation space to find contiguous regions where the model performs poorly. We show that the spotlight surfaces semantically meaningful areas of weakness in a wide variety of existing models spanning computer vision, NLP, and recommender systems.
Submitted 15 October, 2021; v1 submitted 1 July, 2021;
originally announced July 2021.
-
Algebraic Structure of the Varikon Box
Authors:
Jason d'Eon,
Chrystopher L. Nehaniv
Abstract:
The 15-Puzzle is a well-studied permutation puzzle. This paper explores the group structure of a three-dimensional variant of the 15-Puzzle known as the Varikon Box, with the goal of providing a heuristic that would help a human solve it while minimizing the number of moves. First, we show by a parity argument which configurations of the puzzle are reachable. We define a generating set based on the three dimensions of movement, which generates a group that acts on the puzzle configurations, and we explore the structure of this group. Finally, we show a heuristic for solving the puzzle by writing an element of the symmetry group as a word in terms of a generating set, and we compute the shortest possible word for each puzzle configuration.
Submitted 1 June, 2020;
originally announced June 2020.