Showing 1–45 of 45 results for author: Nickel, M

Searching in archive cs.
  1. arXiv:2406.01611

    cs.IR cs.LG stat.ML

    System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes

    Authors: Arpit Agarwal, Nicolas Usunier, Alessandro Lazaric, Maximilian Nickel

    Abstract: Recommender systems are an important part of the modern human experience whose influence ranges from the food we eat to the news we read. Yet, there is still debate as to what extent recommendation platforms are aligned with the user goals. A core issue fueling this debate is the challenge of inferring a user utility based on engagement signals such as likes, shares, watch time etc., which are the…

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: Accepted at FAccT'24

  2. arXiv:2405.09409

    cs.CV cs.DC

    Real-World Federated Learning in Radiology: Hurdles to overcome and Benefits to gain

    Authors: Markus R. Bujotzek, Ünal Akünal, Stefan Denner, Peter Neher, Maximilian Zenk, Eric Frodl, Astha Jaiswal, Moon Kim, Nicolai R. Krekiehn, Manuel Nickel, Richard Ruppel, Marcus Both, Felix Döllinger, Marcel Opitz, Thorsten Persigehl, Jens Kleesiek, Tobias Penzkofer, Klaus Maier-Hein, Rickmer Braren, Andreas Bucher

    Abstract: Objective: Federated Learning (FL) enables collaborative model training while keeping data locally. Currently, most FL studies in radiology are conducted in simulated environments due to numerous hurdles impeding its translation into practice. The few existing real-world FL initiatives rarely communicate specific measures taken to overcome these hurdles, leaving behind a significant knowledge gap.…

    Submitted 15 May, 2024; originally announced May 2024.

  3. arXiv:2405.03732

    eess.IV cs.AI cs.CV cs.LG

    Accelerated MR Cholangiopancreatography with Deep Learning-based Reconstruction

    Authors: Jinho Kim, Marcel Dominik Nickel, Florian Knoll

    Abstract: This study accelerates MR cholangiopancreatography (MRCP) acquisitions using deep learning-based (DL) reconstruction at 3T and 0.55T. Thirty healthy volunteers underwent conventional two-fold MRCP scans at field strengths of 3T or 0.55T. We trained a variational network (VN) using retrospectively six-fold undersampled data obtained at 3T. We then evaluated our method against standard techniques su…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 20 pages, 6 figures, 2 tables

  4. arXiv:2312.04823

    cs.CV cs.AI cs.IT cs.LG

    Assessing Neural Network Representations During Training Using Noise-Resilient Diffusion Spectral Entropy

    Authors: Danqi Liao, Chen Liu, Benjamin W. Christensen, Alexander Tong, Guillaume Huguet, Guy Wolf, Maximilian Nickel, Ian Adelstein, Smita Krishnaswamy

    Abstract: Entropy and mutual information in neural networks provide rich information on the learning process, but they have proven difficult to compute reliably in high dimensions. Indeed, in noisy and high-dimensional data, traditional estimates in ambient dimensions approach a fixed entropy and are prohibitively hard to compute. To address these issues, we leverage data geometry to access the underlying m…

    Submitted 3 December, 2023; originally announced December 2023.

    Journal ref: ICML 2023 Workshop on Topology, Algebra, and Geometry in Machine Learning

  5. arXiv:2310.02233

    stat.ML cs.LG math.OC

    Generalized Schrödinger Bridge Matching

    Authors: Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

    Abstract: Modern distribution matching algorithms for training diffusion or flow models directly prescribe the time evolution of the marginal distributions between two boundary distributions. In this work, we consider a generalized distribution matching setup, where these marginals are only implicitly described as a solution to some task-specific objective function. The problem setup, known as the Generaliz…

    Submitted 18 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready

  6. arXiv:2309.09924

    cs.LG eess.SP stat.ML

    Learning graph geometry and topology using dynamical systems based message-passing

    Authors: Dhananjay Bhaskar, Yanlei Zhang, Charles Xu, Xingzhi Sun, Oluwadamilola Fasina, Guy Wolf, Maximilian Nickel, Michael Perlmutter, Smita Krishnaswamy

    Abstract: In this paper we introduce DYMAG: a message passing paradigm for GNNs built on the expressive power of continuous, multiscale graph-dynamics. Standard discrete-time message passing algorithms implicitly make use of simplistic graph dynamics and aggregation schemes which limit their ability to capture fundamental graph topological properties. By contrast, DYMAG makes use of complex graph dynamics b…

    Submitted 12 June, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

  7. arXiv:2307.05775

    cs.LG cs.SI

    Weisfeiler and Leman Go Measurement Modeling: Probing the Validity of the WL Test

    Authors: Arjun Subramonian, Adina Williams, Maximilian Nickel, Yizhou Sun, Levent Sagun

    Abstract: The expressive power of graph neural networks is usually measured by comparing how many pairs of graphs or nodes an architecture can possibly distinguish as non-isomorphic to those distinguishable by the $k$-dimensional Weisfeiler-Leman ($k$-WL) test. In this paper, we uncover misalignments between graph machine learning practitioners' conceptualizations of expressive power and $k$-WL through a sy…

    Submitted 31 March, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

  8. arXiv:2306.06626

    cs.LG stat.ML

    On Kinetic Optimal Probability Paths for Generative Models

    Authors: Neta Shaul, Ricky T. Q. Chen, Maximilian Nickel, Matt Le, Yaron Lipman

    Abstract: Recent successful generative models are trained by fitting a neural network to an a-priori defined tractable probability density path taking noise to training examples. In this paper we investigate the space of Gaussian probability paths, which includes diffusion paths as an instance, and look for an optimal member in some useful sense. In particular, minimizing the Kinetic Energy (KE) of a path i…

    Submitted 11 June, 2023; originally announced June 2023.

  9. arXiv:2306.06062

    cs.CV cs.LG

    Neural FIM for learning Fisher Information Metrics from point cloud data

    Authors: Oluwadamilola Fasina, Guillaume Huguet, Alexander Tong, Yanlei Zhang, Guy Wolf, Maximilian Nickel, Ian Adelstein, Smita Krishnaswamy

    Abstract: Although data diffusion embeddings are ubiquitous in unsupervised learning and have proven to be a viable technique for uncovering the underlying intrinsic geometry of data, diffusion embeddings are inherently limited due to their discrete nature. To this end, we propose neural FIM, a method for computing the Fisher information metric (FIM) from point cloud data - allowing for a continuous manifol…

    Submitted 11 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 13 pages, 11 figures, 1 table

  10. Group fairness without demographics using social networks

    Authors: David Liu, Virginie Do, Nicolas Usunier, Maximilian Nickel

    Abstract: Group fairness is a popular approach to prevent unfavorable treatment of individuals based on sensitive attributes such as race, gender, and disability. However, the reliance of group fairness on access to discrete group information raises several limitations and concerns, especially with regard to privacy, intersectionality, and unforeseen biases. In this work, we propose a "group-free" measure o…

    Submitted 18 May, 2023; originally announced May 2023.

  11. arXiv:2304.09172

    cs.CV cs.LG

    Hyperbolic Image-Text Representations

    Authors: Karan Desai, Maximilian Nickel, Tanmay Rajpurohit, Justin Johnson, Ramakrishna Vedantam

    Abstract: Visual and linguistic concepts naturally organize themselves in a hierarchy, where a textual concept "dog" entails all images that contain dogs. Despite being intuitive, current large-scale vision and language models such as CLIP do not explicitly capture such hierarchy. We propose MERU, a contrastive model that yields hyperbolic representations of images and text. Hyperbolic spaces have suitable…

    Submitted 18 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: ICML 2023 (v3: Add link to code in abstract)

  12. arXiv:2212.13659

    cs.LG stat.ML

    Latent Discretization for Continuous-time Sequence Compression

    Authors: Ricky T. Q. Chen, Matthew Le, Matthew Muckley, Maximilian Nickel, Karen Ullrich

    Abstract: Neural compression offers a domain-agnostic approach to creating codecs for lossy or lossless compression via deep generative models. For sequence compression, however, most deep sequence models have costs that scale with the sequence length rather than the sequence complexity. In this work, we instead treat data sequences as observations from an underlying continuous-time process and learn how to…

    Submitted 27 December, 2022; originally announced December 2022.

  13. arXiv:2210.02747

    cs.LG cs.AI stat.ML

    Flow Matching for Generative Modeling

    Authors: Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, Matt Le

    Abstract: We introduce a new paradigm for generative modeling built on Continuous Normalizing Flows (CNFs), allowing us to train CNFs at unprecedented scale. Specifically, we present the notion of Flow Matching (FM), a simulation-free approach for training CNFs based on regressing vector fields of fixed conditional probability paths. Flow Matching is compatible with a general family of Gaussian probability…

    Submitted 8 February, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

  14. arXiv:2207.04711

    stat.ML cs.LG

    Matching Normalizing Flows and Probability Paths on Manifolds

    Authors: Heli Ben-Hamu, Samuel Cohen, Joey Bose, Brandon Amos, Aditya Grover, Maximilian Nickel, Ricky T. Q. Chen, Yaron Lipman

    Abstract: Continuous Normalizing Flows (CNFs) are a class of generative models that transform a prior distribution to a model distribution by solving an ordinary differential equation (ODE). We propose to train CNFs on manifolds by minimizing probability path divergence (PPD), a novel family of divergences between the probability density path generated by the CNF and a target probability density path. PPD i…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ICML 2022

  15. arXiv:2203.06832

    cs.LG stat.ML

    Semi-Discrete Normalizing Flows through Differentiable Tessellation

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: Mapping between discrete and continuous distributions is a difficult task and many have had to resort to heuristical approaches. We propose a tessellation-based approach that directly learns quantization boundaries in a continuous space, complete with exact likelihood evaluations. This is done through constructing normalizing flows on convex polytopes parameterized using a simple homeomorphism wit…

    Submitted 11 December, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

    Journal ref: NeurIPS 2022

  16. arXiv:2203.06215

    cs.CV cs.AI

    Can I see an Example? Active Learning the Long Tail of Attributes and Relations

    Authors: Tyler L. Hayes, Maximilian Nickel, Christopher Kanan, Ludovic Denoyer, Arthur Szlam

    Abstract: There has been significant progress in creating machine learning models that identify objects in scenes along with their associated attributes and relationships; however, there is a large gap between the best models and human capabilities. One of the major reasons for this gap is the difficulty in collecting sufficient amounts of annotated relations and attributes for training these systems. While…

    Submitted 7 October, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: To appear in the British Machine Vision Conference (BMVC-2022)

  17. arXiv:2108.08052

    stat.ML cs.AI cs.LG

    Moser Flow: Divergence-based Generative Modeling on Manifolds

    Authors: Noam Rozen, Aditya Grover, Maximilian Nickel, Yaron Lipman

    Abstract: We are interested in learning generative models for complex geometries described via manifolds, such as spheres, tori, and other implicit surfaces. Current extensions of existing (Euclidean) generative models are restricted to specific geometries and typically suffer from high computational costs. We introduce Moser Flow (MF), a new class of generative models within the family of continuous normal…

    Submitted 2 November, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

  18. arXiv:2106.15825

    cs.CL

    O2D2: Out-Of-Distribution Detector to Capture Undecidable Trials in Authorship Verification

    Authors: Benedikt Boenninghoff, Robert M. Nickel, Dorothea Kolossa

    Abstract: The PAN 2021 authorship verification (AV) challenge is part of a three-year strategy, moving from a cross-topic/closed-set AV task to a cross-topic/open-set AV task over a collection of fanfiction texts. In this work, we present a novel hybrid neural-probabilistic framework that is designed to tackle the challenges of the 2021 task. Our system is based on our 2020 winning submission, with updates…

    Submitted 30 July, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: PAN@CLEF 2021

  19. arXiv:2106.11196

    cs.CL

    Self-Calibrating Neural-Probabilistic Model for Authorship Verification Under Covariate Shift

    Authors: Benedikt Boenninghoff, Dorothea Kolossa, Robert M. Nickel

    Abstract: We are addressing two fundamental problems in authorship verification (AV): Topic variability and miscalibration. Variations in the topic of two disputed texts are a major cause of error for most AV systems. In addition, it is observed that the underlying probability estimates produced by deep learning AV mechanisms oftentimes do not match the actual case counts in the respective training data. As…

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: 12th International Conference of the CLEF Association, 2021

  20. arXiv:2105.09378

    eess.IV cs.CV

    Robust partial Fourier reconstruction for diffusion-weighted imaging using a recurrent convolutional neural network

    Authors: Fasil Gadjimuradov, Thomas Benkert, Marcel Dominik Nickel, Andreas Maier

    Abstract: Purpose: To develop an algorithm for robust partial Fourier (PF) reconstruction applicable to diffusion-weighted (DW) images with non-smooth phase variations. Methods: Based on an unrolled proximal splitting algorithm, a neural network architecture is derived which alternates between data consistency operations and regularization implemented by recurrent convolutions. In order to exploit correla…

    Submitted 9 January, 2022; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: Revisions made as required for appearance in Magnetic Resonance in Medicine

  21. arXiv:2103.01173

    cs.SD cs.LG eess.AS

    Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering

    Authors: Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

    Abstract: The detection of voiced speech, the estimation of the fundamental frequency, and the tracking of pitch values over time are crucial subtasks for a variety of speech processing techniques. Many different algorithms have been developed for each of the three subtasks. We present a new algorithm that integrates the three subtasks into a single procedure. The algorithm can be applied to pre-recorded sp…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Speech Communication; 12. ITG Symposium, 5-7 Oct. 2016

  22. arXiv:2011.04583

    cs.LG

    Neural Spatio-Temporal Point Processes

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: We propose a new class of parameterizations for spatio-temporal point processes which leverage Neural ODEs as a computational method and enable flexible, high-fidelity models of discrete events that are localized in continuous time and space. Central to our approach is a combination of continuous-time neural networks with two novel neural architectures, i.e., Jump and Attentive Continuous-time Nor…

    Submitted 17 March, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Journal ref: ICLR 2021

  23. arXiv:2011.03902

    cs.LG stat.ML

    Learning Neural Event Functions for Ordinary Differential Equations

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: The existing Neural ODE formulation relies on an explicit knowledge of the termination time. We extend Neural ODEs to implicitly defined termination criteria modeled by neural event functions, which can be chained together and differentiated through. Neural Event ODEs are capable of modeling discrete and instantaneous changes in a continuous-time system, without prior knowledge of when these chang…

    Submitted 27 October, 2021; v1 submitted 7 November, 2020; originally announced November 2020.

    Journal ref: ICLR 2021

  24. arXiv:2010.02855

    cs.AI cs.LG

    CURI: A Benchmark for Productive Concept Learning Under Uncertainty

    Authors: Ramakrishna Vedantam, Arthur Szlam, Maximilian Nickel, Ari Morcos, Brenden Lake

    Abstract: Humans can learn and reason under substantial uncertainty in a space of infinitely many concepts, including structured relational concepts ("a scene with objects that have the same color") and ad-hoc categories defined through goals ("objects that could fall on one's head"). In contrast, standard classification benchmarks: 1) consider only a fixed set of category labels, 2) do not evaluate composi…

    Submitted 6 October, 2020; originally announced October 2020.

  25. arXiv:2008.10105

    cs.CL cs.LG

    Deep Bayes Factor Scoring for Authorship Verification

    Authors: Benedikt Boenninghoff, Julian Rupp, Robert M. Nickel, Dorothea Kolossa

    Abstract: The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts. Fanfiction is a fan-written extension of a storyline in which a so-called fandom topic describes the principal subject of the document. The data provided in the PAN 2020 AV task is quite challenging because authors of texts across multiple/different fandom topics a…

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: CLEF 2020 Labs and Workshops, Notebook Papers, September 2020. CEUR-WS.org

  26. arXiv:2006.10605

    stat.ML cs.LG

    Riemannian Continuous Normalizing Flows

    Authors: Emile Mathieu, Maximilian Nickel

    Abstract: Normalizing flows have shown great promise for modelling flexible probability distributions in a computationally tractable way. However, whilst data is often naturally described on Riemannian manifolds such as spheres, torii, and hyperbolic spaces, most normalizing flows implicitly assume a flat geometry, making them either misspecified or ill-suited in these situations. To overcome this problem,…

    Submitted 9 December, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: camera ready NeurIPS 2020

  27. arXiv:2005.13930

    cs.LG cs.CL stat.ML

    Variational Autoencoder with Embedded Student-$t$ Mixture Model for Authorship Attribution

    Authors: Benedikt Boenninghoff, Steffen Zeiler, Robert M. Nickel, Dorothea Kolossa

    Abstract: Traditional computational authorship attribution describes a classification task in a closed-set scenario. Given a finite set of candidate authors and corresponding labeled texts, the objective is to determine which of the authors has written another set of anonymous or disputed texts. In this work, we propose a probabilistic autoencoding framework to deal with this supervised classification task.…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: Preprint

  28. arXiv:2003.05848

    cs.CV

    CPS++: Improving Class-level 6D Pose and Shape Estimation From Monocular Images With Self-Supervised Learning

    Authors: Fabian Manhardt, Gu Wang, Benjamin Busam, Manuel Nickel, Sven Meier, Luca Minciullo, Xiangyang Ji, Nassir Navab

    Abstract: Contemporary monocular 6D pose estimation methods can only cope with a handful of object instances. This naturally hampers possible applications as, for instance, robots seamlessly integrated in everyday processes necessarily require the ability to work with hundreds of different objects. To tackle this problem of immanent practical relevance, we propose a novel method for class-level monocular 6D…

    Submitted 11 September, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

  29. arXiv:2002.12501

    cs.LG cs.SI stat.ML

    Learning Multivariate Hawkes Processes at Scale

    Authors: Maximilian Nickel, Matthew Le

    Abstract: Multivariate Hawkes Processes (MHPs) are an important class of temporal point processes that have enabled key advances in understanding and predicting social information systems. However, due to their complex modeling of temporal dependencies, MHPs have proven to be notoriously difficult to scale, what has limited their applications to relatively small domains. In this work, we propose a novel mod…

    Submitted 27 February, 2020; originally announced February 2020.

  30. arXiv:1910.12892

    cs.LG stat.ML

    Hyperbolic Graph Neural Networks

    Authors: Qi Liu, Maximilian Nickel, Douwe Kiela

    Abstract: Learning from graph-structured data is an important task in machine learning and artificial intelligence, for which Graph Neural Networks (GNNs) have shown great promise. Motivated by recent advances in geometric representation learning, we propose a novel GNN architecture for learning representations on Riemannian manifolds with differentiable exponential and logarithmic maps. We develop a scalab…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: Published at NeurIPS 2019

  31. arXiv:1910.08144

    cs.CL

    Explainable Authorship Verification in Social Media via Attention-based Similarity Learning

    Authors: Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, Robert M. Nickel

    Abstract: Authorship verification is the task of analyzing the linguistic patterns of two or more texts to determine whether they were written by the same author or not. The analysis is traditionally performed by experts who consider linguistic features, which include spelling mistakes, grammatical inconsistencies, and stylistics for example. Machine learning algorithms, on the other hand, can be trained to…

    Submitted 19 November, 2019; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted for 2019 IEEE International Conference on Big Data (IEEE Big Data 2019)

  32. arXiv:1908.07844

    cs.CL cs.LG stat.ML

    Similarity Learning for Authorship Verification in Social Media

    Authors: Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

    Abstract: Authorship verification tries to answer the question if two documents with unknown authors were written by the same author or not. A range of successful technical approaches has been proposed for this task, many of which are based on traditional linguistic features such as n-grams. These algorithms achieve good results for certain types of written documents like books and novels. Forensic authorsh…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 5 pages, 3 figures, 1 table, presented on ICASSP 2019 in Brighton, UK

  33. arXiv:1905.05908

    cs.CV

    Task-Driven Modular Networks for Zero-Shot Compositional Learning

    Authors: Senthil Purushwalkam, Maximilian Nickel, Abhinav Gupta, Marc'Aurelio Ranzato

    Abstract: One of the hallmarks of human intelligence is the ability to compose learned knowledge into novel concepts which can be recognized without a single training example. In contrast, current state-of-the-art methods require hundreds of training examples for each possible category to build reliable and accurate classifiers. To alleviate this striking difference in efficiency, we propose a task-driven m…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: http://www.cs.cmu.edu/~spurushw/projects/compositional.html

  34. arXiv:1902.00913

    cs.CL

    Inferring Concept Hierarchies from Text Corpora via Hyperbolic Embeddings

    Authors: Matt Le, Stephen Roller, Laetitia Papaxanthos, Douwe Kiela, Maximilian Nickel

    Abstract: We consider the task of inferring is-a relationships from large text corpora. For this purpose, we propose a new method combining hyperbolic embeddings and Hearst patterns. This approach allows us to set appropriate constraints for inferring concept hierarchies from distributional contexts while also being able to predict missing is-a relationships and to correct wrong extractions. Moreover -- and…

    Submitted 3 February, 2019; originally announced February 2019.

  35. arXiv:1806.03417

    cs.AI cs.LG stat.ML

    Learning Continuous Hierarchies in the Lorentz Model of Hyperbolic Geometry

    Authors: Maximilian Nickel, Douwe Kiela

    Abstract: We are concerned with the discovery of hierarchical relationships from large-scale unstructured similarity scores. For this purpose, we study different models of hyperbolic space and find that learning embeddings in the Lorentz model is substantially more efficient than in the Poincaré-ball model. We show that the proposed approach allows us to learn high-quality embeddings of large taxonomies whi…

    Submitted 8 July, 2018; v1 submitted 9 June, 2018; originally announced June 2018.

    Comments: Accepted at ICML'18

    ACM Class: I.2.0

  36. arXiv:1806.03191

    cs.CL

    Hearst Patterns Revisited: Automatic Hypernym Detection from Large Text Corpora

    Authors: Stephen Roller, Douwe Kiela, Maximilian Nickel

    Abstract: Methods for unsupervised hypernym detection may broadly be categorized according to two paradigms: pattern-based and distributional methods. In this paper, we study the performance of both approaches on several hypernymy tasks and find that simple pattern-based methods consistently outperform distributional methods on common benchmark datasets. Our results show that pattern-based models provide im…

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: Accepted as a short paper to ACL 2018

  37. arXiv:1711.09825

    cs.CV cs.IR cs.LG

    Separating Self-Expression and Visual Content in Hashtag Supervision

    Authors: Andreas Veit, Maximilian Nickel, Serge Belongie, Laurens van der Maaten

    Abstract: The variety, abundance, and structured nature of hashtags make them an interesting data source for training vision models. For instance, hashtags have the potential to significantly reduce the problem of manual supervision and annotation when learning vision models for a large number of concepts. However, a key challenge when learning from hashtags is that they are inherently subjective because th…

    Submitted 27 November, 2017; originally announced November 2017.

  38. arXiv:1710.10881

    stat.ML cs.LG

    Fast Linear Model for Knowledge Graph Embeddings

    Authors: Armand Joulin, Edouard Grave, Piotr Bojanowski, Maximilian Nickel, Tomas Mikolov

    Abstract: This paper shows that a simple baseline based on a Bag-of-Words (BoW) representation learns surprisingly good knowledge graph embeddings. By casting knowledge base completion and question answering as supervised classification problems, we observe that modeling co-occurences of entities and relations leads to state-of-the-art performance with a training time of a few minutes using the open sourced…

    Submitted 30 October, 2017; originally announced October 2017.

    Comments: Submitted AKBC 2017

  39. arXiv:1707.06320

    cs.CL cs.CV

    Learning Visually Grounded Sentence Representations

    Authors: Douwe Kiela, Alexis Conneau, Allan Jabri, Maximilian Nickel

    Abstract: We introduce a variety of models, trained on a supervised image captioning corpus to predict the image features for a given caption, to perform sentence representation grounding. We train a grounded sentence encoder that achieves good performance on COCO caption and image retrieval and subsequently show that this encoder can successfully be transferred to various NLP tasks, with improved performan…

    Submitted 4 June, 2018; v1 submitted 19 July, 2017; originally announced July 2017.

    Comments: Published at NAACL-18

  40. arXiv:1707.01475

    cs.LG stat.ML

    Complex and Holographic Embeddings of Knowledge Graphs: A Comparison

    Authors: Théo Trouillon, Maximilian Nickel

    Abstract: Embeddings of knowledge graphs have received significant attention due to their excellent performance for tasks like link prediction and entity resolution. In this short paper, we are providing a comparison of two state-of-the-art knowledge graph embeddings for which their equivalence has recently been established, i.e., ComplEx and HolE [Nickel, Rosasco, and Poggio, 2016; Trouillon et al., 2016;…

    Submitted 23 July, 2017; v1 submitted 5 July, 2017; originally announced July 2017.

  41. arXiv:1705.08039

    cs.AI cs.LG stat.ML

    Poincaré Embeddings for Learning Hierarchical Representations

    Authors: Maximilian Nickel, Douwe Kiela

    Abstract: Representation learning has become an invaluable approach for learning from symbolic data such as text and graphs. However, while complex symbolic datasets often exhibit a latent hierarchical structure, state-of-the-art methods typically learn embeddings in Euclidean vector spaces, which do not account for this property. For this purpose, we introduce a new approach for learning hierarchical repre…

    Submitted 26 May, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

  42. arXiv:1609.03145

    cs.AI

    Relational Models

    Authors: Volker Tresp, Maximilian Nickel

    Abstract: We provide a survey on relational models. Relational models describe complete networked domains by taking into account global dependencies in the data. Relational models can lead to more accurate predictions if compared to non-relational machine learning approaches. Relational models typically are based on probabilistic graphical models, e.g., Bayesian networks, Markov networks, or latent variab…

    Submitted 11 September, 2016; originally announced September 2016.

  43. arXiv:1510.04935

    cs.AI cs.LG stat.ML

    Holographic Embeddings of Knowledge Graphs

    Authors: Maximilian Nickel, Lorenzo Rosasco, Tomaso Poggio

    Abstract: Learning embeddings of entities and relations is an efficient and versatile method to perform machine learning on relational data such as knowledge graphs. In this work, we propose holographic embeddings (HolE) to learn compositional vector space representations of entire knowledge graphs. The proposed method is related to holographic models of associative memory in that it employs circular correl…

    Submitted 7 December, 2015; v1 submitted 16 October, 2015; originally announced October 2015.

    Comments: To appear in AAAI-16

    ACM Class: I.2.6; I.2.4

  44. A Review of Relational Machine Learning for Knowledge Graphs

    Authors: Maximilian Nickel, Kevin Murphy, Volker Tresp, Evgeniy Gabrilovich

    Abstract: Relational machine learning studies methods for the statistical analysis of relational, or graph-structured, data. In this paper, we provide a review of how such statistical models can be "trained" on large knowledge graphs, and then used to predict new facts about the world (which is equivalent to predicting new edges in the graph). In particular, we discuss two fundamentally different kinds of s…

    Submitted 28 September, 2015; v1 submitted 2 March, 2015; originally announced March 2015.

    Comments: To appear in Proceedings of the IEEE

  45. arXiv:1306.2084

    stat.ML cs.LG

    Logistic Tensor Factorization for Multi-Relational Data

    Authors: Maximilian Nickel, Volker Tresp

    Abstract: Tensor factorizations have become increasingly popular approaches for various learning tasks on structured data. In this work, we extend the RESCAL tensor factorization, which has shown state-of-the-art results for multi-relational learning, to account for the binary nature of adjacency tensors. We study the improvements that can be gained via this approach on various benchmark datasets and show t…

    Submitted 9 June, 2013; originally announced June 2013.

    Comments: Accepted at ICML 2013 Workshop "Structured Learning: Inferring Graphs from Structured and Unstructured Inputs" (SLG 2013)