Showing 1–26 of 26 results for author: Cremers, D

Searching in archive stat.
  1. arXiv:2402.17641  [pdf, other]

    cs.LG cs.AI cs.CL math.OC stat.ML

    Variational Learning is Effective for Large Deep Networks

    Authors: Yuesong Shen, Nico Daheim, Bai Cong, Peter Nickl, Gian Maria Marconi, Clement Bazan, Rio Yokota, Iryna Gurevych, Daniel Cremers, Mohammad Emtiyaz Khan, Thomas Möllenhoff

    Abstract: We give extensive empirical evidence against the common belief that variational learning is ineffective for large neural networks. We show that an optimizer called Improved Variational Online Newton (IVON) consistently matches or outperforms Adam for training large networks such as GPT-2 and ResNets from scratch. IVON's computational costs are nearly identical to Adam but its predictive uncertaint…

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Published at International Conference on Machine Learning (ICML), 2024. The first two authors contributed equally. Code is available here: https://github.com/team-approx-bayes/ivon

  2. arXiv:2307.07753  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks

    Authors: Dominik Schnaus, Jongseok Lee, Daniel Cremers, Rudolph Triebel

    Abstract: In this work, we propose a novel prior learning method for advancing generalization and uncertainty estimation in deep neural networks. The key idea is to exploit scalable and structured posteriors of neural networks as informative priors with generalization guarantees. Our learned priors provide expressive probabilistic representations at large scale, like Bayesian counterparts of pre-trained mod…

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: Accepted to ICML 2023

  3. arXiv:2212.02988  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    PRISM: Probabilistic Real-Time Inference in Spatial World Models

    Authors: Atanas Mirchev, Baris Kayalibay, Ahmed Agha, Patrick van der Smagt, Daniel Cremers, Justin Bayer

    Abstract: We introduce PRISM, a method for real-time filtering in a probabilistic generative model of agent motion and visual perception. Previous approaches either lack uncertainty estimates for the map and agent state, do not run in real-time, do not have a dense scene representation or do not model agent dynamics. Our solution reconciles all of these aspects. We start from a predefined state-space model…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Will appear in PMLR, CoRL 2022

  4. arXiv:2210.15575  [pdf, other]

    cs.LG cs.AI stat.ML

    A Graph Is More Than Its Nodes: Towards Structured Uncertainty-Aware Learning on Graphs

    Authors: Hans Hao-Hsun Hsu, Yuesong Shen, Daniel Cremers

    Abstract: Current graph neural networks (GNNs) that tackle node classification on graphs tend to only focus on nodewise scores and are solely evaluated by nodewise metrics. This limits uncertainty estimation on graphs since nodewise marginals do not fully characterize the joint distribution given the graph structure. In this work, we propose novel edgewise metrics, namely the edgewise expected calibration e…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Presented at NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022)

  5. arXiv:2110.00053  [pdf, other]

    math.OC cs.CV cs.LG stat.ML

    Sparse Quadratic Optimisation over the Stiefel Manifold with Application to Permutation Synchronisation

    Authors: Florian Bernard, Daniel Cremers, Johan Thunberg

    Abstract: We address the non-convex optimisation problem of finding a sparse matrix on the Stiefel manifold (matrices with mutually orthogonal columns of unit length) that maximises (or minimises) a quadratic objective function. Optimisation problems on the Stiefel manifold occur for example in spectral relaxations of various combinatorial problems, such as graph matching, clustering, or permutation synchro…

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: To appear at NeurIPS 2021

  6. arXiv:2107.13059  [pdf, other]

    cs.LG stat.ML

    Explicit Pairwise Factorized Graph Neural Network for Semi-Supervised Node Classification

    Authors: Yu Wang, Yuesong Shen, Daniel Cremers

    Abstract: Node features and structural information of a graph are both crucial for semi-supervised node classification problems. A variety of graph neural network (GNN) based approaches have been proposed to tackle these problems, which typically determine output labels through feature aggregation. This can be problematic, as it implies conditional independence of output nodes given hidden representations,…

    Submitted 27 July, 2021; originally announced July 2021.

  7. arXiv:2012.10988  [pdf, other]

    cs.LG cs.AI stat.ML

    Post-hoc Uncertainty Calibration for Domain Drift Scenarios

    Authors: Christian Tomani, Sebastian Gruber, Muhammed Ebrar Erdem, Daniel Cremers, Florian Buettner

    Abstract: We address the problem of uncertainty calibration. While standard deep neural networks typically yield uncalibrated predictions, calibrated confidence scores that are representative of the true likelihood of a prediction can be achieved using post-hoc calibration methods. However, to date the focus of these approaches has been on in-domain calibration. Our contribution is two-fold. First, we show…

    Submitted 23 June, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

    Comments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021. Code available at https://github.com/tochris/calibration-domain-drift

  8. arXiv:2007.07029  [pdf, ps, other]

    q-bio.BM cs.LG q-bio.QM stat.ML

    Deep Learning for Virtual Screening: Five Reasons to Use ROC Cost Functions

    Authors: Vladimir Golkov, Alexander Becker, Daniel T. Plop, Daniel Čuturilo, Neda Davoudi, Jeffrey Mendenhall, Rocco Moretti, Jens Meiler, Daniel Cremers

    Abstract: Computer-aided drug discovery is an essential component of modern drug development. Therein, deep learning has become an important tool for rapid screening of billions of molecules in silico for potential hits containing desired chemical features. Despite its importance, substantial challenges persist in training these models, such as severe class imbalance, high decision thresholds, and lack of g…

    Submitted 25 June, 2020; originally announced July 2020.

    Comments: 10 pages

    MSC Class: 68T07 (Primary) 62H30; 92E99; 68T10; 62F07 (Secondary)

    ACM Class: G.3; I.2.1; I.2.6; I.5.1; J.3

  9. arXiv:2006.16856  [pdf, other]

    cs.LG stat.ML

    A Chain Graph Interpretation of Real-World Neural Networks

    Authors: Yuesong Shen, Daniel Cremers

    Abstract: The last decade has witnessed a boom of deep learning research and applications achieving state-of-the-art results in various domains. However, most advances have been established empirically, and their theoretical analysis remains lacking. One major issue is that our current interpretation of neural networks (NNs) as function approximators is too generic to support in-depth analysis. In this pape…

    Submitted 6 October, 2020; v1 submitted 30 June, 2020; originally announced June 2020.

  10. arXiv:2006.12456  [pdf, other]

    cs.LG cs.CV stat.ML

    Effective Version Space Reduction for Convolutional Neural Networks

    Authors: Jiayu Liu, Ioannis Chiotellis, Rudolph Triebel, Daniel Cremers

    Abstract: In active learning, sampling bias could pose a serious inconsistency problem and hinder the algorithm from finding the optimal hypothesis. However, many methods for neural networks are hypothesis space agnostic and do not address this problem. We examine active learning with convolutional neural networks through the principled lens of version space reduction. We identify the connection between two…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: 22 pages, 8 figures, to be published in the Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2020

    ACM Class: I.2.6; G.3; I.5.1

  11. arXiv:1912.02160  [pdf, other]

    cs.LG stat.ML

    Informative GANs via Structured Regularization of Optimal Transport

    Authors: Pierre Bréchet, Tao Wu, Thomas Möllenhoff, Daniel Cremers

    Abstract: We tackle the challenge of disentangled representation learning in generative adversarial networks (GANs) from the perspective of regularized optimal transport (OT). Specifically, a smoothed OT loss gives rise to an implicit transportation plan between the latent space and the data space. Based on this theoretical observation, we exploit a structured regularization on the transportation plan to en…

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: Presented at the Optimal Transport and Machine Learning Workshop, NeurIPS 2019

  12. arXiv:1910.14594  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Deep Learning for 2D and 3D Rotatable Data: An Overview of Methods

    Authors: Luca Della Libera, Vladimir Golkov, Yue Zhu, Arman Mielke, Daniel Cremers

    Abstract: Convolutional networks are successful due to their equivariance/invariance under translations. However, rotatable data such as images, volumes, shapes, or point clouds require processing with equivariance/invariance under rotations in cases where the rotational orientation of the coordinate system does not affect the meaning of the data (e.g. object classification). On the other hand, estimation/p…

    Submitted 22 November, 2021; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: Improved Definition 1, improved and merged Sections 3.3-3.4, minor additional changes

    MSC Class: 62M45; 68T45; 62H35; 65D18; 68U10

    ACM Class: I.2.6; I.5.1; G.3

  13. arXiv:1907.11025  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    Towards Generalizing Sensorimotor Control Across Weather Conditions

    Authors: Qadeer Khan, Patrick Wenzel, Daniel Cremers, Laura Leal-Taixé

    Abstract: The ability of deep learning models to generalize well across different scenarios depends primarily on the quality and quantity of annotated data. Labeling large amounts of data for all possible scenarios that a model may encounter would not be feasible; if even possible. We propose a framework to deal with limited labeled training data and demonstrate it on the application of vision-based vehicle…

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: Accepted for publication in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  14. arXiv:1905.04730  [pdf, other]

    cs.LG cs.CV stat.ML

    Flat Metric Minimization with Applications in Generative Modeling

    Authors: Thomas Möllenhoff, Daniel Cremers

    Abstract: We take the novel perspective to view data not as a probability distribution but rather as a current. Primarily studied in the field of geometric measure theory, $k$-currents are continuous linear functionals acting on compactly supported smooth differential forms and can be understood as a generalized notion of oriented $k$-dimensional manifold. By moving from distributions (which are $0$-current…

    Submitted 12 May, 2019; originally announced May 2019.

  15. arXiv:1905.03389  [pdf, other]

    cs.NE cs.AI cs.CV cs.LG stat.ML

    Learning to Evolve

    Authors: Jan Schuchardt, Vladimir Golkov, Daniel Cremers

    Abstract: Evolution and learning are two of the fundamental mechanisms by which life adapts in order to survive and to transcend limitations. These biological phenomena inspired successful computational methods such as evolutionary algorithms and deep learning. Evolution relies on random mutations and on random genetic recombination. Here we show that learning to evolve, i.e. learning to mutate and recombin…

    Submitted 8 May, 2019; originally announced May 2019.

    MSC Class: 62M45; 68T05; 68W25; 68T20; 90C40; 91A22; 92D15; 92D25

    ACM Class: G.1.6; I.2.6; I.2.8; G.3; I.5.1

  16. arXiv:1904.03081  [pdf, other]

    cs.LG cs.CV stat.ML

    Controlling Neural Networks via Energy Dissipation

    Authors: Michael Moeller, Thomas Möllenhoff, Daniel Cremers

    Abstract: The last decade has shown a tremendous success in solving various computer vision problems with the help of deep learning techniques. Lately, many works have demonstrated that learning-based approaches with suitable network architectures even exhibit superior performance for the solution of (ill-posed) image reconstruction problems such as deblurring, super-resolution, or medical image reconstruct…

    Submitted 20 August, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Published as a conference paper at ICCV 2019, Seoul

  17. arXiv:1902.01785  [pdf, other]

    cs.LG stat.ML

    Homogeneous Linear Inequality Constraints for Neural Network Activations

    Authors: Thomas Frerix, Matthias Nießner, Daniel Cremers

    Abstract: We propose a method to impose homogeneous linear inequality constraints of the form $Ax\leq 0$ on neural network activations. The proposed method allows a data-driven training approach to be combined with modeling prior knowledge about the task. One way to achieve this task is by means of a projection step at test time after unconstrained training. However, this is an expensive operation. By direc…

    Submitted 28 May, 2020; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: CVPR 2020 DeepVision Workshop

  18. arXiv:1902.00057  [pdf, other]

    cs.LG stat.ML

    Probabilistic Discriminative Learning with Layered Graphical Models

    Authors: Yuesong Shen, Tao Wu, Csaba Domokos, Daniel Cremers

    Abstract: Probabilistic graphical models are traditionally known for their successes in generative modeling. In this work, we advocate layered graphical models (LGMs) for probabilistic discriminative learning. To this end, we design LGMs in close analogy to neural networks (NNs), that is, they have deep hierarchical structures and convolutional or local connections between layers. Equipped with tensorized t…

    Submitted 31 January, 2019; originally announced February 2019.

  19. arXiv:1807.01001  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Modular Vehicle Control for Transferring Semantic Information Between Weather Conditions Using GANs

    Authors: Patrick Wenzel, Qadeer Khan, Daniel Cremers, Laura Leal-Taixé

    Abstract: Even though end-to-end supervised learning has shown promising results for sensorimotor control of self-driving cars, its performance is greatly affected by the weather conditions under which it was trained, showing poor generalization to unseen conditions. In this paper, we show how knowledge can be transferred using semantic maps to new weather conditions without the need to obtain new ground tr…

    Submitted 1 October, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 2nd Conference on Robot Learning (CoRL 2018), Zürich, Switzerland

  20. arXiv:1806.02997  [pdf, other]

    stat.ML cs.AI cs.CV cs.LG cs.NE

    q-Space Novelty Detection with Variational Autoencoders

    Authors: Aleksei Vasilev, Vladimir Golkov, Marc Meissner, Ilona Lipp, Eleonora Sgarlata, Valentina Tomassini, Derek K. Jones, Daniel Cremers

    Abstract: In machine learning, novelty detection is the task of identifying novel unseen data. During training, only samples from the normal class are available. Test samples are classified as normal or abnormal by assignment of a novelty score. Here we propose novelty detection methods based on training variational autoencoders (VAEs) on normal data. Since abnormal samples are not used during training, we…

    Submitted 25 October, 2018; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: 11 pages, 2 figures

    MSC Class: 62F15; 62G07; 62M45; 68T30

    ACM Class: G.3; H.3.3; I.2.4; I.2.6; I.4.6; I.5; I.5.4; J.3

  21. arXiv:1801.07648  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Clustering with Deep Learning: Taxonomy and New Methods

    Authors: Elie Aljalbout, Vladimir Golkov, Yawar Siddiqui, Maximilian Strobel, Daniel Cremers

    Abstract: Clustering methods based on deep neural networks have proven promising for clustering real-world data because of their high representational power. In this paper, we propose a systematic taxonomy of clustering methods that utilize deep neural networks. We base our taxonomy on a comprehensive review of recent work and validate the taxonomy in a case study. In this case study, we show that the taxon…

    Submitted 13 September, 2018; v1 submitted 23 January, 2018; originally announced January 2018.

    MSC Class: 62H30; 62M45; 91C20

    ACM Class: H.3.3; I.2.6; I.5; I.5.3; I.5.4

  22. What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?

    Authors: Nikolaus Mayer, Eddy Ilg, Philipp Fischer, Caner Hazirbas, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox

    Abstract: The finding that very large networks can be trained efficiently and reliably has led to a paradigm shift in computer vision from engineered solutions to learning formulations. As a result, the research challenge shifts from devising algorithms to creating suitable and abundant training data for supervised learning. How to efficiently create such training data? The dominant data acquisition method…

    Submitted 22 March, 2018; v1 submitted 19 January, 2018; originally announced January 2018.

    Comments: added references (UCL dataset); added IJCV copyright information

  23. arXiv:1801.05413  [pdf, other]

    math.OC cs.LG stat.ML

    Combinatorial Preconditioners for Proximal Algorithms on Graphs

    Authors: Thomas Möllenhoff, Zhenzhang Ye, Tao Wu, Daniel Cremers

    Abstract: We present a novel preconditioning technique for proximal optimization methods that relies on graph algorithms to construct effective preconditioners. Such combinatorial preconditioners arise from partitioning the graph into forests. We prove that certain decompositions lead to a theoretically optimal condition number. We also show how ideal decompositions can be realized using matroid partitionin…

    Submitted 21 February, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

    Comments: Published as a conference paper at AISTATS 2018

  24. arXiv:1710.10686  [pdf, ps, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Regularization for Deep Learning: A Taxonomy

    Authors: Jan Kukačka, Vladimir Golkov, Daniel Cremers

    Abstract: Regularization is one of the crucial ingredients of deep learning, yet the term regularization has various definitions, and regularization methods are often studied separately from each other. In our work we present a systematic, unifying taxonomy to categorize existing methods. We distinguish methods that affect data, network architectures, error terms, regularization terms, and optimization proc…

    Submitted 29 October, 2017; originally announced October 2017.

    MSC Class: 62M45

    ACM Class: I.2.6; I.5

  25. arXiv:1704.04039  [pdf, other]

    q-bio.BM cs.LG q-bio.QM stat.ML

    3D Deep Learning for Biological Function Prediction from Physical Fields

    Authors: Vladimir Golkov, Marcin J. Skwark, Atanas Mirchev, Georgi Dikov, Alexander R. Geanes, Jeffrey Mendenhall, Jens Meiler, Daniel Cremers

    Abstract: Predicting the biological function of molecules, be it proteins or drug-like compounds, from their atomic structure is an important and long-standing problem. Function is dictated by structure, since it is by spatial interactions that molecules interact with each other, both in terms of steric complementarity, as well as intermolecular forces. Thus, the electron density field and electrostatic pot…

    Submitted 13 April, 2017; originally announced April 2017.

    ACM Class: I.2.6; J.3

  26. arXiv:1512.02134  [pdf, other]

    cs.CV cs.LG stat.ML

    A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

    Authors: Nikolaus Mayer, Eddy Ilg, Philip Häusser, Philipp Fischer, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox

    Abstract: Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we pro…

    Submitted 7 December, 2015; originally announced December 2015.

    Comments: Includes supplementary material

    ACM Class: I.2.6; I.2.10; I.4.8