Skip to main content

Showing 1–9 of 9 results for author: Banburski, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.11536  [pdf, other

    cs.AI cs.LG

    Neural-guided, Bidirectional Program Search for Abstraction and Reasoning

    Authors: Simon Alford, Anshula Gandhi, Akshay Rangamani, Andrzej Banburski, Tony Wang, Sylee Dandekar, John Chin, Tomaso Poggio, Peter Chin

    Abstract: One of the challenges facing artificial intelligence research today is designing systems capable of utilizing systematic reasoning to generalize to new tasks. The Abstraction and Reasoning Corpus (ARC) measures such a capability through a set of visual reasoning tasks. In this paper we report incremental progress on ARC and lay the foundations for two approaches to abstraction and reasoning not ba… ▽ More

    Submitted 26 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at Complex Networks 2021

  2. arXiv:2107.10199  [pdf, other

    cs.LG cs.AI stat.ML

    Distribution of Classification Margins: Are All Data Equal?

    Authors: Andrzej Banburski, Fernanda De La Torre, Nishka Pant, Ishana Shastri, Tomaso Poggio

    Abstract: Recent theoretical results show that gradient descent on deep neural networks under exponential loss functions locally maximizes classification margin, which is equivalent to minimizing the norm of the weight matrices under margin constraints. This property of the solution however does not fully characterize the generalization performance. We motivate theoretically and show empirically that the ar… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Previously online as CBMM Memo 115 on the CBMM MIT site

  3. arXiv:2006.16427  [pdf, other

    cs.LG cs.CV stat.ML

    Biologically Inspired Mechanisms for Adversarial Robustness

    Authors: Manish V. Reddy, Andrzej Banburski, Nishka Pant, Tomaso Poggio

    Abstract: A convolutional neural network strongly robust to adversarial perturbations at reasonable computational and performance cost has not yet been demonstrated. The primate visual ventral stream seems to be robust to small perturbations in visual stimuli but the underlying mechanisms that give rise to this robust perception are not understood. In this work, we investigate the role of two biologically p… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 25 pages, 15 figures

  4. arXiv:2006.13915  [pdf, other

    cs.LG eess.IV q-bio.NC stat.ML

    Hierarchically Compositional Tasks and Deep Convolutional Networks

    Authors: Arturo Deza, Qianli Liao, Andrzej Banburski, Tomaso Poggio

    Abstract: The main success stories of deep learning, starting with ImageNet, depend on deep convolutional networks, which on certain tasks perform significantly better than traditional shallow classifiers, such as support vector machines, and also better than deep fully connected networks; but what is so special about deep convolutional networks? Recent results in approximation theory proved an exponential… ▽ More

    Submitted 25 March, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: A pre-print. Currently Under Review

    Report number: MIT Center for Brains, Minds and Machines (CBMM) Memo #109

  5. arXiv:1912.06190  [pdf, other

    cs.LG stat.ML

    Double descent in the condition number

    Authors: Tomaso Poggio, Gil Kur, Andrzej Banburski

    Abstract: In solving a system of $n$ linear equations in $d$ variables $Ax=b$, the condition number of the $n,d$ matrix $A$ measures how much errors in the data $b$ affect the solution $x$. Estimates of this type are important in many inverse problems. An example is machine learning where the key task is to estimate an underlying function from a set of measurements at random points in a high dimensional spa… ▽ More

    Submitted 28 April, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: Removed parts relating to kernel regression to streamline the presentation, fixed some typos

  6. arXiv:1908.09375  [pdf, other

    cs.LG stat.ML

    Theoretical Issues in Deep Networks: Approximation, Optimization and Generalization

    Authors: Tomaso Poggio, Andrzej Banburski, Qianli Liao

    Abstract: While deep learning is successful in a number of applications, it is not yet well understood theoretically. A satisfactory theoretical characterization of deep learning however, is beginning to emerge. It covers the following questions: 1) representation power of deep networks 2) optimization of the empirical risk 3) generalization properties of gradient descent techniques --- why the expected err… ▽ More

    Submitted 25 August, 2019; originally announced August 2019.

    Comments: arXiv admin note: text overlap with arXiv:1611.00740

  7. arXiv:1903.04991  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Theory III: Dynamics and Generalization in Deep Networks

    Authors: Andrzej Banburski, Qianli Liao, Brando Miranda, Lorenzo Rosasco, Fernanda De La Torre, Jack Hidary, Tomaso Poggio

    Abstract: The key to generalization is controlling the complexity of the network. However, there is no obvious control of complexity -- such as an explicit regularization term -- in the training of deep networks for classification. We will show that a classical form of norm control -- but kind of hidden -- is present in deep networks trained with gradient descent techniques on exponential-type losses. In pa… ▽ More

    Submitted 10 April, 2020; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: 47 pages, 11 figures. This replaces previous versions of Theory III, that appeared on Arxiv [arXiv:1806.11379, arXiv:1801.00173] or on the CBMM site. v5: Changes throughout the paper to the presentation and tightening some of the statements

  8. arXiv:1807.09659  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    A Surprising Linear Relationship Predicts Test Performance in Deep Networks

    Authors: Qianli Liao, Brando Miranda, Andrzej Banburski, Jack Hidary, Tomaso Poggio

    Abstract: Given two networks with the same training loss on a dataset, when would they have drastically different test losses and errors? Better understanding of this question of generalization may improve practical applications of deep networks. In this paper we show that with cross-entropy loss it is surprisingly simple to induce significantly different generalization performances for two networks that ha… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

  9. arXiv:1806.11379  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Theory IIIb: Generalization in Deep Networks

    Authors: Tomaso Poggio, Qianli Liao, Brando Miranda, Andrzej Banburski, Xavier Boix, Jack Hidary

    Abstract: A main puzzle of deep neural networks (DNNs) revolves around the apparent absence of "overfitting", defined in this paper as follows: the expected error does not get worse when increasing the number of neurons or of iterations of gradient descent. This is surprising because of the large capacity demonstrated by DNNs to fit randomly labeled data and the absence of explicit regularization. Recent re… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

    Comments: 38 pages, 7 figures