Showing 1–15 of 15 results for author: Becigneul, G

  1. arXiv:2406.10256  [pdf, other]

    cs.CL cs.AI cs.LG

    Explicit Word Density Estimation for Language Modelling

    Authors: Jovan Andonov, Octavian Ganea, Paulina Grnarova, Gary Bécigneul, Thomas Hofmann

    Abstract: Language Modelling has been a central part of Natural Language Processing for a very long time and in the past few years LSTM-based language models have been the go-to method for commercial language modeling. Recently, it has been shown that when looking at language modelling from a matrix factorization point of view, the final Softmax layer limits the expressiveness of the model, by putting an up…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Master's thesis

  2. arXiv:2008.02641  [pdf, other]

    cs.LG cs.IT stat.ME stat.ML

    Bloom Origami Assays: Practical Group Testing

    Authors: Louis Abraham, Gary Becigneul, Benjamin Coleman, Bernhard Scholkopf, Anshumali Shrivastava, Alexander Smola

    Abstract: We study the problem usually referred to as group testing in the context of COVID-19. Given n samples collected from patients, how should we select and test mixtures of samples to maximize information and minimize the number of tests? Group testing is a well-studied problem with several appealing solutions, but recent biological studies impose practical constraints for COVID-19 that are incompatib…

    Submitted 21 July, 2020; originally announced August 2020.

    Comments: arXiv admin note: text overlap with arXiv:2005.06413

  3. arXiv:2006.04804  [pdf, other]

    stat.ML cs.LG

    Optimal Transport Graph Neural Networks

    Authors: Benson Chen, Gary Bécigneul, Octavian-Eugen Ganea, Regina Barzilay, Tommi Jaakkola

    Abstract: Current graph neural network (GNN) architectures naively average or sum node embeddings into an aggregated graph representation -- potentially losing structural or semantic information. We here introduce OT-GNN, a model that computes graph embeddings using parametric prototypes that highlight key facets of different graph aspects. Towards this goal, we successfully combine optimal transport (OT) w…

    Submitted 8 October, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

  4. arXiv:2005.06413  [pdf, ps, other]

    stat.ME cs.LG math.PR stat.ML

    Crackovid: Optimizing Group Testing

    Authors: Louis Abraham, Gary Bécigneul, Bernhard Schölkopf

    Abstract: We study the problem usually referred to as group testing in the context of COVID-19. Given $n$ samples taken from patients, how should we select mixtures of samples to be tested, so as to maximize information and minimize the number of tests? We consider both adaptive and non-adaptive strategies, and take a Bayesian approach with a prior both for infection of patients and test errors. We start by…

    Submitted 13 May, 2020; originally announced May 2020.

  5. arXiv:2002.08665  [pdf, other]

    cs.LG stat.ML

    Computationally Tractable Riemannian Manifolds for Graph Embeddings

    Authors: Calin Cruceru, Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Representing graphs as sets of node embeddings in certain curved Riemannian manifolds has recently gained momentum in machine learning due to their desirable geometric inductive biases, e.g., hierarchical structures benefit from hyperbolic geometry. However, going beyond embedding spaces of constant sectional curvature, while potentially more representationally powerful, proves to be challenging a…

    Submitted 6 June, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Submitted to the Thirty-fourth Conference on Neural Information Processing Systems

  6. arXiv:1911.08411  [pdf, other]

    cs.LG stat.ML

    Mixed-curvature Variational Autoencoders

    Authors: Ondrej Skopek, Octavian-Eugen Ganea, Gary Bécigneul

    Abstract: Euclidean geometry has historically been the typical "workhorse" for machine learning applications due to its power and simplicity. However, it has recently been shown that geometric spaces with constant non-zero curvature improve representations and performance on a variety of data types and downstream tasks. Consequently, generative models like Variational Autoencoders (VAEs) have been successfu…

    Submitted 12 February, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: ICLR 2020 camera ready version

    Journal ref: International Conference on Learning Representations (ICLR) 2020

  7. arXiv:1911.05076  [pdf, other]

    cs.LG cs.AI

    Constant Curvature Graph Convolutional Networks

    Authors: Gregor Bachmann, Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Interest has been rising lately towards methods representing data in non-Euclidean spaces, e.g. hyperbolic or spherical, that provide specific inductive biases useful for certain real-world data properties, e.g. scale-free, hierarchical or cyclical. However, the popular graph neural networks are currently limited in modeling data only via Euclidean geometry and associated vector space operations.…

    Submitted 19 May, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

  8. arXiv:1907.10430   

    cs.LG stat.ML

    Noise Contrastive Variational Autoencoders

    Authors: Octavian-Eugen Ganea, Yashas Annadani, Gary Bécigneul

    Abstract: We take steps towards understanding the "posterior collapse (PC)" difficulty in variational autoencoders (VAEs), i.e. a degenerate optimum in which the latent codes become independent of their corresponding inputs. We rely on calculus of variations and theoretically explore a few popular VAE models, showing that PC always occurs for non-parametric encoders and decoders. Inspired by the popular noi…

    Submitted 31 July, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: There is a mistake common to all the main proofs. In summary, what we find are saddle points or global maxima of the respective loss functions and not the global minima. We apologize for this

  9. arXiv:1902.08077  [pdf, other]

    cs.LG stat.ML

    Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities

    Authors: Octavian-Eugen Ganea, Sylvain Gelly, Gary Bécigneul, Aliaksei Severyn

    Abstract: The Softmax function on top of a final linear layer is the de facto method to output probability distributions in neural networks. In many applications such as language models or text generation, this model has to produce distributions over large output vocabularies. Recently, this has been shown to have limited representational capacity due to its connection with the rank bottleneck in matrix fac…

    Submitted 13 May, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Journal ref: ICML 2019

  10. arXiv:1810.06546  [pdf, other]

    cs.CL

    Poincaré GloVe: Hyperbolic Word Embeddings

    Authors: Alexandru Tifrea, Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Words are not created equal. In fact, they form an aristocratic graph with a latent hierarchical structure that the next generation of unsupervised learned word embeddings should reveal. In this paper, justified by the notion of delta-hyperbolicity or tree-likeliness of a space, we propose to embed words in a Cartesian product of hyperbolic spaces which we theoretically connect to the Gaussian wor…

    Submitted 22 November, 2018; v1 submitted 15 October, 2018; originally announced October 2018.

  11. arXiv:1810.00760  [pdf, other]

    cs.LG stat.ML

    Riemannian Adaptive Optimization Methods

    Authors: Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Several first order stochastic optimization methods commonly used in the Euclidean domain such as stochastic gradient descent (SGD), accelerated gradient descent or variance reduced methods have already been adapted to certain Riemannian settings. However, some of the most popular of these optimization tools - namely Adam, Adagrad and the more recent Amsgrad - remain to be generalized to Riemanni…

    Submitted 17 February, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: Accepted at International Conference on Learning Representations (ICLR), 2019

  12. arXiv:1805.09112  [pdf, other]

    cs.LG stat.ML

    Hyperbolic Neural Networks

    Authors: Octavian-Eugen Ganea, Gary Bécigneul, Thomas Hofmann

    Abstract: Hyperbolic spaces have recently gained momentum in the context of machine learning due to their high capacity and tree-likeliness properties. However, the representational power of hyperbolic geometry is not yet on par with Euclidean geometry, mostly because of the absence of corresponding hyperbolic neural network layers. This makes it hard to use hyperbolic embeddings in downstream tasks. Here,…

    Submitted 28 June, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

  13. arXiv:1804.01882  [pdf, other]

    cs.LG stat.ML

    Hyperbolic Entailment Cones for Learning Hierarchical Embeddings

    Authors: Octavian-Eugen Ganea, Gary Bécigneul, Thomas Hofmann

    Abstract: Learning graph representations via low-dimensional embeddings that preserve relevant network properties is an important class of problems in machine learning. We here present a novel method to embed directed acyclic graphs. Following prior work, we first advocate for using hyperbolic spaces which provably model tree-like structures better than Euclidean geometry. Second, we view hierarchical relat…

    Submitted 6 June, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: International Conference on Machine Learning (ICML) 2018

  14. arXiv:1710.11386  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Parametrizing filters of a CNN with a GAN

    Authors: Yannic Kilcher, Gary Becigneul, Thomas Hofmann

    Abstract: It is commonly agreed that the use of relevant invariances as a good statistical bias is important in machine-learning. However, most approaches that explicitly incorporate invariances into a model architecture only make use of very simple transformations, such as translations and rotations. Hence, there is a need for methods to model and extract richer transformations that capture much higher-lev…

    Submitted 31 October, 2017; originally announced October 2017.

  15. arXiv:1703.06726  [pdf, ps, other]

    cs.LG

    On the effect of pooling on the geometry of representations

    Authors: Gary Bécigneul

    Abstract: In machine learning and neuroscience, certain computational structures and algorithms are known to yield disentangled representations without us understanding why, the most striking examples being perhaps convolutional neural networks and the ventral stream of the visual cortex in humans and primates. As for the latter, it was conjectured that representations may be disentangled by being flattened…

    Submitted 20 March, 2017; originally announced March 2017.