Showing 1–18 of 18 results for author: Brox, T

Searching in archive stat.

  1. arXiv:2211.01842

    cs.LG cs.AI cs.CV stat.ML

    Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars

    Authors: Simon Schrodi, Danny Stoll, Binxin Ru, Rhea Sukthanker, Thomas Brox, Frank Hutter

    Abstract: The discovery of neural architectures from simple building blocks is a long-standing goal of Neural Architecture Search (NAS). Hierarchical search spaces are a promising step towards this goal but lack a unifying search space design framework and typically only search over some limited aspect of architectures. In this work, we introduce a unifying search space design framework based on context-fre…

    Submitted 8 December, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2023

  2. arXiv:2207.09239

    cs.LG stat.ML

    Assaying Out-Of-Distribution Generalization in Transfer Learning

    Authors: Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

    Abstract: Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations. While sharing the same aspirational goal, these approaches have never been tested under the same experimental conditions…

    Submitted 21 October, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

  3. arXiv:2110.06562

    cs.CV cs.LG stat.ML

    Unsupervised Object Learning via Common Fate

    Authors: Matthias Tangemann, Steffen Schneider, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Thomas Brox, Matthias Kümmerer, Matthias Bethge, Bernhard Schölkopf

    Abstract: Learning generative object models from unlabelled videos is a long standing problem and required for causal scene modeling. We decompose this problem into three easier subtasks, and provide candidate solutions for each of them. Inspired by the Common Fate Principle of Gestalt Psychology, we first extract (noisy) masks of moving objects via unsupervised motion segmentation. Second, generative model…

    Submitted 15 May, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Published at CLeaR 2023

  4. arXiv:2107.04369

    cs.LG stat.ML

    Multi-headed Neural Ensemble Search

    Authors: Ashwin Raaghav Narayanan, Arber Zela, Tonmoy Saikia, Thomas Brox, Frank Hutter

    Abstract: Ensembles of CNN models trained with different seeds (also known as Deep Ensembles) are known to achieve superior performance over a single copy of the CNN. Neural Ensemble Search (NES) can further boost performance by adding architectural diversity. However, the scope of NES remains prohibitive under limited computational resources. In this work, we extend NES to multi-headed ensembles, which con…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: 8 pages, 12 figures, 3 tables

  5. arXiv:2106.14999

    stat.ML cs.LG

    Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

    Authors: Chaithanya Kumar Mummadi, Robin Hutmacher, Kilian Rambach, Evgeny Levinkov, Thomas Brox, Jan Hendrik Metzen

    Abstract: Deep neural networks often exhibit poor performance on data that is unlikely under the train-time data distribution, for instance data affected by corruptions. Previous works demonstrate that test-time adaptation to data shift, for instance using entropy minimization, effectively improves performance on such shifted distributions. This paper focuses on the fully test-time adaptation setting, where…

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 16 pages, 5 figures, 7 tables

  6. arXiv:2007.02701

    cs.LG cs.AI stat.ML

    Scaling Imitation Learning in Minecraft

    Authors: Artemij Amiranashvili, Nicolai Dorka, Wolfram Burgard, Vladlen Koltun, Thomas Brox

    Abstract: Imitation learning is a powerful family of techniques for learning sensorimotor coordination in immersive environments. We apply imitation learning to attain state-of-the-art performance on hard exploration problems in the Minecraft environment. We report experiments that highlight the influence of network architecture, loss function, and data augmentation. An early version of our approach reached…

    Submitted 6 July, 2020; originally announced July 2020.

  7. arXiv:1910.01842

    cs.CV cs.LG stat.ML

    SELF: Learning to Filter Noisy Labels with Self-Ensembling

    Authors: Duc Tam Nguyen, Chaithanya Kumar Mummadi, Thi Phuong Nhung Ngo, Thi Hoai Phuong Nguyen, Laura Beggel, Thomas Brox

    Abstract: Deep neural networks (DNNs) have been shown to over-fit a dataset when being trained with noisy labels for a long enough time. To overcome this problem, we present a simple and effective method self-ensemble label filtering (SELF) to progressively filter out the wrong labels during training. Our method improves the task performance by gradually allowing supervision only from the potentially non-no…

    Submitted 4 October, 2019; originally announced October 2019.

  8. arXiv:1909.09656

    cs.LG cs.AI cs.CV stat.ML

    Understanding and Robustifying Differentiable Architecture Search

    Authors: Arber Zela, Thomas Elsken, Tonmoy Saikia, Yassine Marrakchi, Thomas Brox, Frank Hutter

    Abstract: Differentiable Architecture Search (DARTS) has attracted a lot of attention due to its simplicity and small search costs achieved by a continuous relaxation and an approximation of the resulting bi-level optimization problem. However, DARTS does not work robustly for new problems: we identify a wide range of search spaces for which DARTS yields degenerate architectures with very poor test performa…

    Submitted 28 January, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: In: International Conference on Learning Representations (ICLR 2020); 28 pages, 30 figures

  9. arXiv:1908.03463

    stat.ML cs.LG

    Group Pruning using a Bounded-Lp norm for Group Gating and Regularization

    Authors: Chaithanya Kumar Mummadi, Tim Genewein, Dan Zhang, Thomas Brox, Volker Fischer

    Abstract: Deep neural networks achieve state-of-the-art results on several tasks while increasing in complexity. It has been shown that neural networks can be pruned during training by imposing sparsity inducing regularizers. In this paper, we investigate two techniques for group-wise pruning during training in order to improve network efficiency. We propose a gating factor after every convolutional layer t…

    Submitted 9 August, 2019; originally announced August 2019.

    Comments: German Conference on Pattern Recognition (GCPR) 2019, 12 main pages, 3 pages of appendix, 4 figures, 2 tables

  10. arXiv:1906.00216

    cs.LG cs.CV stat.ML

    Robust Learning Under Label Noise With Iterative Noise-Filtering

    Authors: Duc Tam Nguyen, Thi-Phuong-Nhung Ngo, Zhongyu Lou, Michael Klar, Laura Beggel, Thomas Brox

    Abstract: We consider the problem of training a model under the presence of label noise. Current approaches identify samples with potentially incorrect labels and reduce their influence on the learning process by either assigning lower weights to them or completely removing them from the training set. In the first case the model however still learns from noisy labels; in the latter approach, good training d…

    Submitted 1 June, 2019; originally announced June 2019.

  11. arXiv:1902.05605

    cs.LG stat.ML

    CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity

    Authors: Aditya Bhatt, Daniel Palenicek, Boris Belousov, Max Argus, Artemij Amiranashvili, Thomas Brox, Jan Peters

    Abstract: Sample efficiency is a crucial problem in deep reinforcement learning. Recent algorithms, such as REDQ and DroQ, found a way to improve the sample efficiency by increasing the update-to-data (UTD) ratio to 20 gradient update steps on the critic per environment sample. However, this comes at the expense of a greatly increased computational cost. To reduce this computational burden, we introduce Cro…

    Submitted 25 March, 2024; v1 submitted 14 February, 2019; originally announced February 2019.

    Comments: Published at ICLR 2024. Project page at http://aditya.bhatts.org/CrossQ and code release at https://github.com/adityab/CrossQ

  12. arXiv:1901.03162

    cs.LG cs.AI cs.CV stat.ML

    Motion Perception in Reinforcement Learning with Dynamic Objects

    Authors: Artemij Amiranashvili, Alexey Dosovitskiy, Vladlen Koltun, Thomas Brox

    Abstract: In dynamic environments, learned controllers are supposed to take motion into account when selecting the action to be taken. However, in existing reinforcement learning works motion is rarely treated explicitly; it is rather assumed that the controller learns the necessary motion representation from temporal stacks of frames implicitly. In this paper, we show that for continuous control tasks lear…

    Submitted 1 February, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

  13. arXiv:1812.03705

    cs.CV cs.CR cs.LG stat.ML

    Defending Against Universal Perturbations With Shared Adversarial Training

    Authors: Chaithanya Kumar Mummadi, Thomas Brox, Jan Hendrik Metzen

    Abstract: Classifiers such as deep neural networks have been shown to be vulnerable against adversarial perturbations on problems with high-dimensional input space. While adversarial training improves the robustness of image classifiers against such adversarial perturbations, it leaves them sensitive to perturbations on a non-negligible fraction of the inputs. In this work, we show that adversarial training…

    Submitted 13 August, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: ICCV 2019, 8 main pages, 9 appendix pages, 16 figures, 2 tables

  14. arXiv:1806.01175

    cs.LG cs.AI stat.ML

    TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning

    Authors: Artemij Amiranashvili, Alexey Dosovitskiy, Vladlen Koltun, Thomas Brox

    Abstract: Our understanding of reinforcement learning (RL) has been shaped by theoretical and empirical results that were obtained decades ago using tabular representations and linear function approximators. These results suggest that RL methods that use temporal differencing (TD) are superior to direct Monte Carlo estimation (MC). How do these results hold up in deep RL, which deals with perceptually compl…

    Submitted 4 June, 2018; originally announced June 2018.

  15. What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?

    Authors: Nikolaus Mayer, Eddy Ilg, Philipp Fischer, Caner Hazirbas, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox

    Abstract: The finding that very large networks can be trained efficiently and reliably has led to a paradigm shift in computer vision from engineered solutions to learning formulations. As a result, the research challenge shifts from devising algorithms to creating suitable and abundant training data for supervised learning. How to efficiently create such training data? The dominant data acquisition method…

    Submitted 22 March, 2018; v1 submitted 19 January, 2018; originally announced January 2018.

    Comments: added references (UCL dataset); added IJCV copyright information

  16. arXiv:1704.05712

    stat.ML cs.AI cs.CV cs.LG cs.NE

    Universal Adversarial Perturbations Against Semantic Image Segmentation

    Authors: Jan Hendrik Metzen, Mummadi Chaithanya Kumar, Thomas Brox, Volker Fischer

    Abstract: While deep learning is remarkably successful on perceptual tasks, it was also shown to be vulnerable to adversarial perturbations of the input. These perturbations denote noise added to the input that was generated specifically to fool the system while being quasi-imperceptible for humans. More severely, there even exist universal perturbations that are input-agnostic but fool the network on the m…

    Submitted 31 July, 2017; v1 submitted 19 April, 2017; originally announced April 2017.

    Comments: Final version for ICCV including supplementary material

  17. arXiv:1703.01101

    stat.ML cs.CR cs.CV cs.LG cs.NE

    Adversarial Examples for Semantic Image Segmentation

    Authors: Volker Fischer, Mummadi Chaithanya Kumar, Jan Hendrik Metzen, Thomas Brox

    Abstract: Machine learning methods in general and Deep Neural Networks in particular have shown to be vulnerable to adversarial perturbations. So far this phenomenon has mainly been studied in the context of whole-image classification. In this contribution, we analyse how adversarial perturbations can affect the task of semantic segmentation. We show how existing adversarial attackers can be transferred to…

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: ICLR 2017 workshop submission

  18. arXiv:1512.02134

    cs.CV cs.LG stat.ML

    A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

    Authors: Nikolaus Mayer, Eddy Ilg, Philip Häusser, Philipp Fischer, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox

    Abstract: Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we pro…

    Submitted 7 December, 2015; originally announced December 2015.

    Comments: Includes supplementary material

    ACM Class: I.2.6; I.2.10; I.4.8