Skip to main content

Showing 1–23 of 23 results for author: Ganea, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10256  [pdf, other

    cs.CL cs.AI cs.LG

    Explicit Word Density Estimation for Language Modelling

    Authors: Jovan Andonov, Octavian Ganea, Paulina Grnarova, Gary Bécigneul, Thomas Hofmann

    Abstract: Language Modelling has been a central part of Natural Language Processing for a very long time and in the past few years LSTM-based language models have been the go-to method for commercial language modeling. Recently, it has been shown that when looking at language modelling from a matrix factorization point of view, the final Softmax layer limits the expressiveness of the model, by putting an up… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Master's thesis

  2. arXiv:2202.05146  [pdf, other

    q-bio.BM cs.LG

    EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

    Authors: Hannes Stärk, Octavian-Eugen Ganea, Lagnajit Pattanaik, Regina Barzilay, Tommi Jaakkola

    Abstract: Predicting how a drug-like molecule binds to a specific protein target is a core problem in drug discovery. An extremely fast computational binding method would enable key applications such as fast virtual screening or drug engineering. Existing methods are computationally expensive as they rely on heavy candidate sampling coupled with scoring, ranking, and fine-tuning steps. We challenge this par… ▽ More

    Submitted 4 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 39th International Conference on Machine Learning (ICML 2022). Also accepted at ICLR 2022 GTRL and at ICLR 2022 MLDD as spotlight

    Journal ref: 39th International Conference on Machine Learning (ICML 2022)

  3. arXiv:2111.07786  [pdf, other

    cs.AI cs.LG

    Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking

    Authors: Octavian-Eugen Ganea, Xinyuan Huang, Charlotte Bunne, Yatao Bian, Regina Barzilay, Tommi Jaakkola, Andreas Krause

    Abstract: Protein complex formation is a central problem in biology, being involved in most of the cell's processes, and essential for applications, e.g. drug design or protein engineering. We tackle rigid body protein-protein docking, i.e., computationally predicting the 3D structure of a protein-protein complex from the individual unbound structures, assuming no conformational change within the proteins h… ▽ More

    Submitted 15 March, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Journal ref: Spotlight at ICLR 2022: International Conference on Learning Representations

  4. arXiv:2110.06197  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.comp-ph

    Crystal Diffusion Variational Autoencoder for Periodic Material Generation

    Authors: Tian Xie, Xiang Fu, Octavian-Eugen Ganea, Regina Barzilay, Tommi Jaakkola

    Abstract: Generating the periodic structure of stable materials is a long-standing challenge for the material design community. This task is difficult because stable materials only exist in a low-dimensional subspace of all possible periodic arrangements of atoms: 1) the coordinates must lie in the local energy minimum defined by quantum mechanics, and 2) global stability also requires the structure to foll… ▽ More

    Submitted 14 March, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted to ICLR 2022. Code and data are publicly available at https://github.com/txie-93/cdvae

  5. arXiv:2106.07802  [pdf, other

    physics.chem-ph cs.LG

    GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles

    Authors: Octavian-Eugen Ganea, Lagnajit Pattanaik, Connor W. Coley, Regina Barzilay, Klavs F. Jensen, William H. Green, Tommi S. Jaakkola

    Abstract: Prediction of a molecule's 3D conformer ensemble from the molecular graph holds a key role in areas of cheminformatics and drug discovery. Existing generative models have several drawbacks including lack of modeling important molecular geometry elements (e.g. torsion angles), separate optimization stages prone to error accumulation, and the need for structure fine-tuning based on approximate class… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  6. arXiv:2012.00094  [pdf, other

    q-bio.QM cs.LG

    Message Passing Networks for Molecules with Tetrahedral Chirality

    Authors: Lagnajit Pattanaik, Octavian-Eugen Ganea, Ian Coley, Klavs F. Jensen, William H. Green, Connor W. Coley

    Abstract: Molecules with identical graph connectivity can exhibit different physical and biological properties if they exhibit stereochemistry-a spatial structural characteristic. However, modern neural architectures designed for learning structure-property relationships from molecular structures treat molecules as graph-structured data and therefore are invariant to stereochemistry. Here, we develop two cu… ▽ More

    Submitted 4 December, 2020; v1 submitted 23 November, 2020; originally announced December 2020.

  7. arXiv:2006.04804  [pdf, other

    stat.ML cs.LG

    Optimal Transport Graph Neural Networks

    Authors: Benson Chen, Gary Bécigneul, Octavian-Eugen Ganea, Regina Barzilay, Tommi Jaakkola

    Abstract: Current graph neural network (GNN) architectures naively average or sum node embeddings into an aggregated graph representation -- potentially losing structural or semantic information. We here introduce OT-GNN, a model that computes graph embeddings using parametric prototypes that highlight key facets of different graph aspects. Towards this goal, we successfully combine optimal transport (OT) w… ▽ More

    Submitted 8 October, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

  8. arXiv:2004.03459  [pdf, other

    cs.CV cs.LG stat.ML

    Hierarchical Image Classification using Entailment Cone Embeddings

    Authors: Ankit Dhall, Anastasia Makarova, Octavian Ganea, Dario Pavllo, Michael Greeff, Andreas Krause

    Abstract: Image classification has been studied extensively, but there has been limited work in using unconventional, external guidance other than traditional image-label pairs for training. We present a set of methods for leveraging information about the semantic hierarchy embedded in class labels. We first inject label-hierarchy knowledge into an arbitrary CNN-based classifier and empirically show that av… ▽ More

    Submitted 25 April, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: Accepted in the CVPR 2020 Workshop on Differential Geometry in Computer Vision and Machine Learning

  9. arXiv:2002.08665  [pdf, other

    cs.LG stat.ML

    Computationally Tractable Riemannian Manifolds for Graph Embeddings

    Authors: Calin Cruceru, Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Representing graphs as sets of node embeddings in certain curved Riemannian manifolds has recently gained momentum in machine learning due to their desirable geometric inductive biases, e.g., hierarchical structures benefit from hyperbolic geometry. However, going beyond embedding spaces of constant sectional curvature, while potentially more representationally powerful, proves to be challenging a… ▽ More

    Submitted 6 June, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Submitted to the Thirty-fourth Conference on Neural Information Processing Systems

  10. arXiv:1911.08411  [pdf, other

    cs.LG stat.ML

    Mixed-curvature Variational Autoencoders

    Authors: Ondrej Skopek, Octavian-Eugen Ganea, Gary Bécigneul

    Abstract: Euclidean geometry has historically been the typical "workhorse" for machine learning applications due to its power and simplicity. However, it has recently been shown that geometric spaces with constant non-zero curvature improve representations and performance on a variety of data types and downstream tasks. Consequently, generative models like Variational Autoencoders (VAEs) have been successfu… ▽ More

    Submitted 12 February, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: ICLR 2020 camera ready version

    Journal ref: International Conference on Learning Representations (ICLR) 2020

  11. arXiv:1911.05076  [pdf, other

    cs.LG cs.AI

    Constant Curvature Graph Convolutional Networks

    Authors: Gregor Bachmann, Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Interest has been rising lately towards methods representing data in non-Euclidean spaces, e.g. hyperbolic or spherical, that provide specific inductive biases useful for certain real-world data properties, e.g. scale-free, hierarchical or cyclical. However, the popular graph neural networks are currently limited in modeling data only via Euclidean geometry and associated vector space operations.… ▽ More

    Submitted 19 May, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

  12. arXiv:1907.10430   

    cs.LG stat.ML

    Noise Contrastive Variational Autoencoders

    Authors: Octavian-Eugen Ganea, Yashas Annadani, Gary Bécigneul

    Abstract: We take steps towards understanding the "posterior collapse (PC)" difficulty in variational autoencoders (VAEs),~i.e. a degenerate optimum in which the latent codes become independent of their corresponding inputs. We rely on calculus of variations and theoretically explore a few popular VAE models, showing that PC always occurs for non-parametric encoders and decoders. Inspired by the popular noi… ▽ More

    Submitted 31 July, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: There is a mistake common to all the main proofs. In summary, what we find are saddle points or global maxima of the respective loss functions and not the global minima. We apologize for this

  13. arXiv:1902.08077  [pdf, other

    cs.LG stat.ML

    Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities

    Authors: Octavian-Eugen Ganea, Sylvain Gelly, Gary Bécigneul, Aliaksei Severyn

    Abstract: The Softmax function on top of a final linear layer is the de facto method to output probability distributions in neural networks. In many applications such as language models or text generation, this model has to produce distributions over large output vocabularies. Recently, this has been shown to have limited representational capacity due to its connection with the rank bottleneck in matrix fac… ▽ More

    Submitted 13 May, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Journal ref: ICML 2019

  14. arXiv:1810.06546  [pdf, other

    cs.CL

    Poincaré GloVe: Hyperbolic Word Embeddings

    Authors: Alexandru Tifrea, Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Words are not created equal. In fact, they form an aristocratic graph with a latent hierarchical structure that the next generation of unsupervised learned word embeddings should reveal. In this paper, justified by the notion of delta-hyperbolicity or tree-likeliness of a space, we propose to embed words in a Cartesian product of hyperbolic spaces which we theoretically connect to the Gaussian wor… ▽ More

    Submitted 22 November, 2018; v1 submitted 15 October, 2018; originally announced October 2018.

  15. arXiv:1810.00760  [pdf, other

    cs.LG stat.ML

    Riemannian Adaptive Optimization Methods

    Authors: Gary Bécigneul, Octavian-Eugen Ganea

    Abstract: Several first order stochastic optimization methods commonly used in the Euclidean domain such as stochastic gradient descent (SGD), accelerated gradient descent or variance reduced methods have already been adapted to certain Riemannian settings. However, some of the most popular of these optimization tools - namely Adam , Adagrad and the more recent Amsgrad - remain to be generalized to Riemanni… ▽ More

    Submitted 17 February, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: Accepted at International Conference on Learning Representations (ICLR), 2019

  16. arXiv:1809.08621  [pdf, ps, other

    cs.CL

    Learning and Evaluating Sparse Interpretable Sentence Embeddings

    Authors: Valentin Trifonov, Octavian-Eugen Ganea, Anna Potapenko, Thomas Hofmann

    Abstract: Previous research on word embeddings has shown that sparse representations, which can be either learned on top of existing dense embeddings or obtained through model constraints during training time, have the benefit of increased interpretability properties: to some degree, each dimension can be understood by a human and associated with a recognizable feature in the data. In this paper, we transfe… ▽ More

    Submitted 25 September, 2018; v1 submitted 23 September, 2018; originally announced September 2018.

    Comments: Will be presented at the workshop "Analyzing and interpreting neural networks for NLP", collocated with the EMNLP 2018 conference in Brussels

  17. arXiv:1808.07699  [pdf, other

    cs.CL

    End-to-End Neural Entity Linking

    Authors: Nikolaos Kolitsas, Octavian-Eugen Ganea, Thomas Hofmann

    Abstract: Entity Linking (EL) is an essential task for semantic text understanding and information extraction. Popular methods separately address the Mention Detection (MD) and Entity Disambiguation (ED) stages of EL, without leveraging their mutual dependency. We here propose the first neural end-to-end EL system that jointly discovers and links entities in a text document. The main idea is to consider all… ▽ More

    Submitted 29 August, 2018; v1 submitted 23 August, 2018; originally announced August 2018.

    Comments: Full paper at CoNLL 2018: Conference on Computational Natural Language Learning

  18. arXiv:1805.09112  [pdf, other

    cs.LG stat.ML

    Hyperbolic Neural Networks

    Authors: Octavian-Eugen Ganea, Gary Bécigneul, Thomas Hofmann

    Abstract: Hyperbolic spaces have recently gained momentum in the context of machine learning due to their high capacity and tree-likeliness properties. However, the representational power of hyperbolic geometry is not yet on par with Euclidean geometry, mostly because of the absence of corresponding hyperbolic neural network layers. This makes it hard to use hyperbolic embeddings in downstream tasks. Here,… ▽ More

    Submitted 28 June, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

  19. arXiv:1804.01882  [pdf, other

    cs.LG stat.ML

    Hyperbolic Entailment Cones for Learning Hierarchical Embeddings

    Authors: Octavian-Eugen Ganea, Gary Bécigneul, Thomas Hofmann

    Abstract: Learning graph representations via low-dimensional embeddings that preserve relevant network properties is an important class of problems in machine learning. We here present a novel method to embed directed acyclic graphs. Following prior work, we first advocate for using hyperbolic spaces which provably model tree-like structures better than Euclidean geometry. Second, we view hierarchical relat… ▽ More

    Submitted 6 June, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: International Conference on Machine Learning (ICML) 2018

  20. arXiv:1801.02607  [pdf, other

    cs.IR

    Web2Text: Deep Structured Boilerplate Removal

    Authors: Thijs Vogels, Octavian-Eugen Ganea, Carsten Eickhoff

    Abstract: Web pages are a valuable source of information for many natural language processing and information retrieval tasks. Extracting the main content from those documents is essential for the performance of derived applications. To address this issue, we introduce a novel model that performs sequence labeling to collectively classify all text blocks in an HTML page as either boilerplate or main content… ▽ More

    Submitted 27 March, 2018; v1 submitted 8 January, 2018; originally announced January 2018.

    Comments: To appear in ECIR 2018

  21. arXiv:1704.04920  [pdf, other

    cs.CL

    Deep Joint Entity Disambiguation with Local Neural Attention

    Authors: Octavian-Eugen Ganea, Thomas Hofmann

    Abstract: We propose a novel deep learning model for joint document-level entity disambiguation, which leverages learned neural representations. Key components are entity embeddings, a neural attention mechanism over local context windows, and a differentiable joint inference stage for disambiguation. Our approach thereby combines benefits of deep learning with more traditional approaches such as graphical… ▽ More

    Submitted 31 July, 2017; v1 submitted 17 April, 2017; originally announced April 2017.

    Comments: Conference on Empirical Methods in Natural Language Processing (EMNLP) 2017 long paper

  22. arXiv:1702.06589  [pdf, other

    cs.CL

    Neural Multi-Step Reasoning for Question Answering on Semi-Structured Tables

    Authors: Till Haug, Octavian-Eugen Ganea, Paulina Grnarova

    Abstract: Advances in natural language processing tasks have gained momentum in recent years due to the increasingly popular neural network methods. In this paper, we explore deep learning techniques for answering multi-step reasoning questions that operate on semi-structured tables. Challenges here arise from the level of logical compositionality expressed by questions, as well as the domain openness. Our… ▽ More

    Submitted 22 March, 2018; v1 submitted 21 February, 2017; originally announced February 2017.

    Journal ref: European Conference on Information Retrieval (ECIR) 2018

  23. arXiv:1509.02301  [pdf, other

    cs.CL

    Probabilistic Bag-Of-Hyperlinks Model for Entity Linking

    Authors: Octavian-Eugen Ganea, Marina Ganea, Aurelien Lucchi, Carsten Eickhoff, Thomas Hofmann

    Abstract: Many fundamental problems in natural language processing rely on determining what entities appear in a given text. Commonly referenced as entity linking, this step is a fundamental component of many NLP tasks such as text understanding, automatic summarization, semantic search or machine translation. Name ambiguity, word polysemy, context dependencies and a heavy-tailed distribution of entities co… ▽ More

    Submitted 29 January, 2016; v1 submitted 8 September, 2015; originally announced September 2015.