Skip to main content

Showing 1–10 of 10 results for author: Glorot, X

Searching in archive cs. Search in all archives.
.
  1. arXiv:2112.10664  [pdf, other

    cs.AI cs.LO

    Proving Theorems using Incremental Learning and Hindsight Experience Replay

    Authors: Eser Aygün, Laurent Orseau, Ankit Anand, Xavier Glorot, Vlad Firoiu, Lei M. Zhang, Doina Precup, Shibl Mourad

    Abstract: Traditional automated theorem provers for first-order logic depend on speed-optimized search and many handcrafted heuristics that are designed to work best over a wide range of domains. Machine learning approaches in literature either depend on these traditional provers to bootstrap themselves or fall short on reaching comparable performance. In this paper, we propose a general incremental learnin… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: 16 pages, 2 figures

    ACM Class: I.2.3

  2. arXiv:2103.03798  [pdf, other

    cs.AI

    Training a First-Order Theorem Prover from Synthetic Data

    Authors: Vlad Firoiu, Eser Aygun, Ankit Anand, Zafarali Ahmed, Xavier Glorot, Laurent Orseau, Lei Zhang, Doina Precup, Shibl Mourad

    Abstract: A major challenge in applying machine learning to automated theorem proving is the scarcity of training data, which is a key ingredient in training successful deep learning models. To tackle this problem, we propose an approach that relies on training purely with synthetically generated theorems, without any human data aside from axioms. We use these theorems to train a neurally-guided saturation-… ▽ More

    Submitted 6 April, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

  3. arXiv:2006.11259  [pdf, other

    cs.LO cs.LG

    Learning to Prove from Synthetic Theorems

    Authors: Eser Aygün, Zafarali Ahmed, Ankit Anand, Vlad Firoiu, Xavier Glorot, Laurent Orseau, Doina Precup, Shibl Mourad

    Abstract: A major challenge in applying machine learning to automated theorem proving is the scarcity of training data, which is a key ingredient in training successful deep learning models. To tackle this problem, we propose an approach that relies on training with synthetic theorems, generated from a set of axioms. We show that such theorems can be used to train an automated prover and that the learned pr… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 17 pages, 6 figures, submitted to NeurIPS 2020

    ACM Class: I.2.3

  4. arXiv:1606.05579  [pdf, other

    stat.ML cs.LG q-bio.NC

    Early Visual Concept Learning with Unsupervised Deep Learning

    Authors: Irina Higgins, Loic Matthey, Xavier Glorot, Arka Pal, Benigno Uria, Charles Blundell, Shakir Mohamed, Alexander Lerchner

    Abstract: Automated discovery of early visual concepts from raw image data is a major open challenge in AI research. Addressing this problem, we propose an unsupervised approach for learning disentangled representations of the underlying factors of variation. We draw inspiration from neuroscience, and show how this can be achieved in an unsupervised generative model by applying the same learning pressures a… ▽ More

    Submitted 20 September, 2016; v1 submitted 17 June, 2016; originally announced June 2016.

  5. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  6. arXiv:1301.3485  [pdf, ps, other

    cs.LG

    A Semantic Matching Energy Function for Learning with Multi-relational Data

    Authors: Xavier Glorot, Antoine Bordes, Jason Weston, Yoshua Bengio

    Abstract: Large-scale relational learning becomes crucial for handling the huge amounts of structured data generated daily in many application domains ranging from computational biology or information retrieval, to natural language processing. In this paper, we present a new neural network architecture designed to embed multi-relational graphs into a flexible continuous vector space in which the original da… ▽ More

    Submitted 21 March, 2013; v1 submitted 15 January, 2013; originally announced January 2013.

  7. arXiv:1107.3663  [pdf, other

    cs.AI

    Towards Open-Text Semantic Parsing via Multi-Task Learning of Structured Embeddings

    Authors: Antoine Bordes, Xavier Glorot, Jason Weston, Yoshua Bengio

    Abstract: Open-text (or open-domain) semantic parsers are designed to interpret any statement in natural language by inferring a corresponding meaning representation (MR). Unfortunately, large scale systems cannot be easily machine-learned due to lack of directly supervised data. We propose here a method that learns to assign MRs to a wide range of text (using a dictionary of more than 70,000 words, which a… ▽ More

    Submitted 19 July, 2011; originally announced July 2011.

  8. arXiv:1104.4153  [pdf, other

    cs.AI

    Learning invariant features through local space contraction

    Authors: Salah Rifai, Xavier Muller, Xavier Glorot, Gregoire Mesnil, Yoshua Bengio, Pascal Vincent

    Abstract: We present in this paper a novel approach for training deterministic auto-encoders. We show that by adding a well chosen penalty term to the classical reconstruction cost function, we can achieve results that equal or surpass those attained by other regularized auto-encoders as well as denoising auto-encoders on a range of datasets. This penalty term corresponds to the Frobenius norm of the Jacobi… ▽ More

    Submitted 20 April, 2011; originally announced April 2011.

  9. arXiv:1104.3250  [pdf, other

    cs.AI

    Adding noise to the input of a model trained with a regularized objective

    Authors: Salah Rifai, Xavier Glorot, Yoshua Bengio, Pascal Vincent

    Abstract: Regularization is a well studied problem in the context of neural networks. It is usually used to improve the generalization performance when the number of input samples is relatively small or heavily contaminated with noise. The regularization of a parametric model can be achieved in different manners some of which are early stop** (Morgan and Bourlard, 1990), weight decay, output smoothing tha… ▽ More

    Submitted 16 April, 2011; originally announced April 2011.

  10. arXiv:1009.3589  [pdf, other

    cs.LG cs.CV cs.NE

    Deep Self-Taught Learning for Handwritten Character Recognition

    Authors: Frédéric Bastien, Yoshua Bengio, Arnaud Bergeron, Nicolas Boulanger-Lewandowski, Thomas Breuel, Youssouf Chherawala, Moustapha Cisse, Myriam Côté, Dumitru Erhan, Jeremy Eustache, Xavier Glorot, Xavier Muller, Sylvain Pannetier Lebeuf, Razvan Pascanu, Salah Rifai, Francois Savard, Guillaume Sicard

    Abstract: Recent theoretical and empirical work in statistical machine learning has demonstrated the importance of learning algorithms for deep architectures, i.e., function classes obtained by composing multiple non-linear transformations. Self-taught learning (exploiting unlabeled examples or examples from other distributions) has already been applied to deep learners, but mostly to show the advantage of… ▽ More

    Submitted 18 September, 2010; originally announced September 2010.

    Report number: 1353, Dept. IRO, U. Montreal MSC Class: 68T05 ACM Class: I.2.6