
Showing 1–4 of 4 results for author: Abolafia, D A

  1. arXiv:2003.04227  [pdf, other]

    cs.LG cs.AI

    Towards Modular Algorithm Induction

    Authors: Daniel A. Abolafia, Rishabh Singh, Manzil Zaheer, Charles Sutton

    Abstract: We present a modular neural network architecture Main that learns algorithms given a set of input-output examples. Main consists of a neural controller that interacts with a variable-length input tape and learns to compose modules together with their corresponding argument choices. Unlike previous approaches, Main uses a general domain-agnostic mechanism for selection of modules and their argument…

    Submitted 27 February, 2020; originally announced March 2020.

    Comments: 10 pages, 4 figures, 2 tables

  2. arXiv:1810.05148  [pdf, other]

    stat.ML cs.AI cs.LG cs.NE

    Bayesian Deep Convolutional Networks with Many Channels are Gaussian Processes

    Authors: Roman Novak, Lechao Xiao, Jaehoon Lee, Yasaman Bahri, Greg Yang, Jiri Hron, Daniel A. Abolafia, Jeffrey Pennington, Jascha Sohl-Dickstein

    Abstract: There is a previously identified equivalence between wide fully connected neural networks (FCNs) and Gaussian processes (GPs). This equivalence enables, for instance, test set predictions that would have resulted from a fully Bayesian, infinitely wide trained FCN to be computed without ever instantiating the FCN, but by instead evaluating the corresponding GP. In this work, we derive an analogous…

    Submitted 21 August, 2020; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: Published as a conference paper at ICLR 2019

  3. arXiv:1802.08760  [pdf, other]

    stat.ML cs.AI cs.LG cs.NE

    Sensitivity and Generalization in Neural Networks: an Empirical Study

    Authors: Roman Novak, Yasaman Bahri, Daniel A. Abolafia, Jeffrey Pennington, Jascha Sohl-Dickstein

    Abstract: In practice it is often found that large over-parameterized neural networks generalize better than their smaller counterparts, an observation that appears to conflict with classical notions of function complexity, which typically favor smaller models. In this work, we investigate this tension between complexity and generalization through an extensive empirical exploration of two natural metrics of…

    Submitted 18 June, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

  4. arXiv:1801.03526  [pdf, ps, other]

    cs.AI

    Neural Program Synthesis with Priority Queue Training

    Authors: Daniel A. Abolafia, Mohammad Norouzi, Jonathan Shen, Rui Zhao, Quoc V. Le

    Abstract: We consider the task of program synthesis in the presence of a reward function over the output of programs, where the goal is to find programs with maximal rewards. We employ an iterative optimization scheme, where we train an RNN on a dataset of K best programs from a priority queue of the generated programs so far. Then, we synthesize new programs and add them to the priority queue by sampling f…

    Submitted 23 March, 2018; v1 submitted 10 January, 2018; originally announced January 2018.