Skip to main content

Showing 1–19 of 19 results for author: Kindermans, P

.
  1. arXiv:2308.11606  [pdf, other

    cs.CV cs.CL

    StoryBench: A Multifaceted Benchmark for Continuous Story Visualization

    Authors: Emanuele Bugliarello, Hernan Moraldo, Ruben Villegas, Mohammad Babaeizadeh, Mohammad Taghi Saffar, Han Zhang, Dumitru Erhan, Vittorio Ferrari, Pieter-Jan Kindermans, Paul Voigtlaender

    Abstract: Generating video stories from text prompts is a complex task. In addition to having high visual quality, videos need to realistically adhere to a sequence of text prompts whilst being consistent throughout the frames. Creating a benchmark for video generation requires data annotated over time, which contrasts with the single caption used often in video datasets. To fill this gap, we collect compre… ▽ More

    Submitted 12 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: NeurIPS D&B 2023

  2. arXiv:2210.02399  [pdf, other

    cs.CV cs.AI

    Phenaki: Variable Length Video Generation From Open Domain Textual Description

    Authors: Ruben Villegas, Mohammad Babaeizadeh, Pieter-Jan Kindermans, Hernan Moraldo, Han Zhang, Mohammad Taghi Saffar, Santiago Castro, Julius Kunze, Dumitru Erhan

    Abstract: We present Phenaki, a model capable of realistic video synthesis, given a sequence of textual prompts. Generating videos from text is particularly challenging due to the computational cost, limited quantities of high quality text-video data and variable length of videos. To address these issues, we introduce a new model for learning video representation which compresses the video to a small repres… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  3. arXiv:2204.11985  [pdf, other

    cs.LG cs.CR

    When adversarial examples are excusable

    Authors: Pieter-Jan Kindermans, Charles Staats

    Abstract: Neural networks work remarkably well in practice and theoretically they can be universal approximators. However, they still make mistakes and a specific type of them called adversarial errors seem inexcusable to humans. In this work, we analyze both test errors and adversarial errors on a well controlled but highly non-linear visual classification problem. We find that, when approximating training… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  4. arXiv:2204.07615  [pdf, other

    cs.LG stat.ML

    TabNAS: Rejection Sampling for Neural Architecture Search on Tabular Datasets

    Authors: Chengrun Yang, Gabriel Bender, Hanxiao Liu, Pieter-Jan Kindermans, Madeleine Udell, Yifeng Lu, Quoc Le, Da Huang

    Abstract: The best neural architecture for a given machine learning problem depends on many factors: not only the complexity and structure of the dataset, but also on resource constraints including latency, compute, energy consumption, etc. Neural architecture search (NAS) for tabular datasets is an important but under-explored problem. Previous NAS algorithms designed for image search spaces incorporate re… ▽ More

    Submitted 20 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2022; 30 pages, 15 figures, 7 tables

  5. arXiv:2008.08178  [pdf, other

    cs.CV

    Discovering Multi-Hardware Mobile Models via Architecture Search

    Authors: Grace Chu, Okan Arikan, Gabriel Bender, Weijun Wang, Achille Brighton, Pieter-Jan Kindermans, Hanxiao Liu, Berkin Akin, Suyog Gupta, Andrew Howard

    Abstract: Hardware-aware neural architecture designs have been predominantly focusing on optimizing model performance on single hardware and model development complexity, where another important factor, model deployment complexity, has been largely ignored. In this paper, we argue that, for applications that may be deployed on multiple hardware, having different single-hardware models across the deployed ha… ▽ More

    Submitted 23 April, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: CVPR Workshop 2021

  6. arXiv:2008.06120  [pdf, other

    cs.LG cs.CV stat.ML

    Can weight sharing outperform random architecture search? An investigation with TuNAS

    Authors: Gabriel Bender, Hanxiao Liu, Bo Chen, Grace Chu, Shuyang Cheng, Pieter-Jan Kindermans, Quoc Le

    Abstract: Efficient Neural Architecture Search methods based on weight sharing have shown good promise in democratizing Neural Architecture Search for computer vision models. There is, however, an ongoing debate whether these efficient methods are significantly better than random search. Here we perform a thorough comparison between efficient and random search methods on a family of progressively larger and… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Published at CVPR 2020

    ACM Class: I.2.10

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 14323-14332

  7. arXiv:2004.14525  [pdf, other

    cs.CV

    MobileDets: Searching for Object Detection Architectures for Mobile Accelerators

    Authors: Yunyang Xiong, Hanxiao Liu, Suyog Gupta, Berkin Akin, Gabriel Bender, Yongzhe Wang, Pieter-Jan Kindermans, Mingxing Tan, Vikas Singh, Bo Chen

    Abstract: Inverted bottleneck layers, which are built upon depthwise convolutions, have been the predominant building blocks in state-of-the-art object detection models on mobile devices. In this work, we investigate the optimality of this design pattern over a broad range of mobile accelerators by revisiting the usefulness of regular convolutions. We discover that regular convolutions are a potent componen… ▽ More

    Submitted 30 March, 2021; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: Accepted at CVPR 2021; Code and models are available in the TensorFlow Object Detection API: https://github.com/tensorflow/models/tree/master/research/object_detection

  8. arXiv:2003.11142  [pdf, other

    cs.CV

    BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models

    Authors: Jiahui Yu, Pengchong **, Hanxiao Liu, Gabriel Bender, Pieter-Jan Kindermans, Mingxing Tan, Thomas Huang, Xiaodan Song, Ruoming Pang, Quoc Le

    Abstract: Neural architecture search (NAS) has shown promising results discovering models that are both accurate and fast. For NAS, training a one-shot model has become a popular strategy to rank the relative quality of different architectures (child models) using a single set of shared weights. However, while one-shot model weights can effectively rank different network architectures, the absolute accuraci… ▽ More

    Submitted 16 July, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

    Comments: Accepted in ECCV 2020

  9. arXiv:1912.00848  [pdf, other

    cs.LG stat.ML

    Neural Predictor for Neural Architecture Search

    Authors: Wei Wen, Hanxiao Liu, Hai Li, Yiran Chen, Gabriel Bender, Pieter-Jan Kindermans

    Abstract: Neural Architecture Search methods are effective but often use complex algorithms to come up with the best architecture. We propose an approach with three basic steps that is conceptually much simpler. First we train N random architectures to generate N (architecture, validation accuracy) pairs and use them to train a regression model that predicts accuracy based on the architecture. Next, we use… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

  10. arXiv:1808.04260  [pdf, other

    cs.LG stat.ML

    iNNvestigate neural networks!

    Authors: Maximilian Alber, Sebastian Lapuschkin, Philipp Seegerer, Miriam Hägele, Kristof T. Schütt, Grégoire Montavon, Wojciech Samek, Klaus-Robert Müller, Sven Dähne, Pieter-Jan Kindermans

    Abstract: In recent years, deep neural networks have revolutionized many application domains of machine learning and are key components of many critical decision or predictive processes. Therefore, it is crucial that domain specialists can understand and analyze actions and pre- dictions, even of the most complex neural network architectures. Despite these arguments neural networks are often treated as blac… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

  11. arXiv:1808.02822  [pdf, other

    cs.NE cs.LG stat.ML

    Backprop Evolution

    Authors: Maximilian Alber, Irwan Bello, Barret Zoph, Pieter-Jan Kindermans, Prajit Ramachandran, Quoc Le

    Abstract: The back-propagation algorithm is the cornerstone of deep learning. Despite its importance, few variations of the algorithm have been attempted. This work presents an approach to discover new variations of the back-propagation equation. We use a domain specific lan- guage to describe update equations as a list of primitive functions. An evolution-based method is used to discover new propagation ru… ▽ More

    Submitted 8 August, 2018; originally announced August 2018.

  12. arXiv:1806.10758  [pdf, other

    cs.LG cs.AI stat.ML

    A Benchmark for Interpretability Methods in Deep Neural Networks

    Authors: Sara Hooker, Dumitru Erhan, Pieter-Jan Kindermans, Been Kim

    Abstract: We propose an empirical measure of the approximate accuracy of feature importance estimates in deep neural networks. Our results across several large-scale image classification datasets show that many popular interpretability methods produce estimates of feature importance that are not better than a random designation of feature importance. Only certain ensemble based approaches---VarGrad and Smoo… ▽ More

    Submitted 4 November, 2019; v1 submitted 27 June, 2018; originally announced June 2018.

    Comments: In NeurIPS 2019

  13. arXiv:1712.06113  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci

    SchNet - a deep learning architecture for molecules and materials

    Authors: Kristof T. Schütt, Huziel E. Sauceda, Pieter-Jan Kindermans, Alexandre Tkatchenko, Klaus-Robert Müller

    Abstract: Deep learning has led to a paradigm shift in artificial intelligence, including web, text and image search, speech recognition, as well as bioinformatics, with growing impact in chemical physics. Machine learning in general and deep learning in particular is ideally suited for representing quantum-mechanical interactions, enabling to model nonlinear potential-energy surfaces or enhancing the explo… ▽ More

    Submitted 22 March, 2018; v1 submitted 17 December, 2017; originally announced December 2017.

  14. arXiv:1711.00867  [pdf, other

    stat.ML cs.LG

    The (Un)reliability of saliency methods

    Authors: Pieter-Jan Kindermans, Sara Hooker, Julius Adebayo, Maximilian Alber, Kristof T. Schütt, Sven Dähne, Dumitru Erhan, Been Kim

    Abstract: Saliency methods aim to explain the predictions of deep neural networks. These methods lack reliability when the explanation is sensitive to factors that do not contribute to the model prediction. We use a simple and common pre-processing step ---adding a constant shift to the input data--- to show that a transformation with no effect on the model can cause numerous methods to incorrectly attribut… ▽ More

    Submitted 2 November, 2017; originally announced November 2017.

  15. arXiv:1711.00489  [pdf, other

    cs.LG cs.CV cs.DC stat.ML

    Don't Decay the Learning Rate, Increase the Batch Size

    Authors: Samuel L. Smith, Pieter-Jan Kindermans, Chris Ying, Quoc V. Le

    Abstract: It is common practice to decay the learning rate. Here we show one can usually obtain the same learning curve on both training and test sets by instead increasing the batch size during training. This procedure is successful for stochastic gradient descent (SGD), SGD with momentum, Nesterov momentum, and Adam. It reaches equivalent test accuracies after the same number of training epochs, but with… ▽ More

    Submitted 23 February, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 11 pages, 8 figures. Published as a conference paper at ICLR 2018

  16. arXiv:1706.08566  [pdf, other

    stat.ML physics.chem-ph

    SchNet: A continuous-filter convolutional neural network for modeling quantum interactions

    Authors: Kristof T. Schütt, Pieter-Jan Kindermans, Huziel E. Sauceda, Stefan Chmiela, Alexandre Tkatchenko, Klaus-Robert Müller

    Abstract: Deep learning has the potential to revolutionize quantum chemistry as it is ideally suited to learn representations for structured data and speed up the exploration of chemical space. While convolutional neural networks have proven to be the first choice for images, audio and video data, the atoms in molecules are not restricted to a grid. Instead, their precise locations contain essential physica… ▽ More

    Submitted 19 December, 2017; v1 submitted 26 June, 2017; originally announced June 2017.

    Journal ref: Advances in Neural Information Processing Systems 30 (2017), pp. 992-1002

  17. arXiv:1705.05598  [pdf, other

    stat.ML cs.LG

    Learning how to explain neural networks: PatternNet and PatternAttribution

    Authors: Pieter-Jan Kindermans, Kristof T. Schütt, Maximilian Alber, Klaus-Robert Müller, Dumitru Erhan, Been Kim, Sven Dähne

    Abstract: DeConvNet, Guided BackProp, LRP, were invented to better understand deep neural networks. We show that these methods do not produce the theoretically correct explanation for a linear model. Yet they are used on multi-layer networks with millions of parameters. This is a cause for concern since linear models are simple neural networks. We argue that explanation methods for neural nets should work r… ▽ More

    Submitted 24 October, 2017; v1 submitted 16 May, 2017; originally announced May 2017.

  18. arXiv:1701.07213  [pdf, other

    stat.ML cs.HC q-bio.NC

    Learning from Label Proportions in Brain-Computer Interfaces: Online Unsupervised Learning with Guarantees

    Authors: D Hübner, T Verhoeven, K Schmid, K-R Müller, M Tangermann, P-J Kindermans

    Abstract: Objective: Using traditional approaches, a Brain-Computer Interface (BCI) requires the collection of calibration data for new subjects prior to online use. Calibration time can be reduced or eliminated e.g.~by transfer of a pre-trained classifier or unsupervised adaptive classification methods which learn from scratch and adapt over time. While such heuristics work well in practice, none of them c… ▽ More

    Submitted 25 January, 2017; originally announced January 2017.

    Comments: The EEG data of 13 subjects is freely available online at: http://doi.org/10.5281/zenodo.192684

  19. arXiv:1611.07270  [pdf, ps, other

    stat.ML cs.LG

    Investigating the influence of noise and distractors on the interpretation of neural networks

    Authors: Pieter-Jan Kindermans, Kristof Schütt, Klaus-Robert Müller, Sven Dähne

    Abstract: Understanding neural networks is becoming increasingly important. Over the last few years different types of visualisation and explanation methods have been proposed. However, none of them explicitly considered the behaviour in the presence of noise and distracting elements. In this work, we will show how noise and distracting dimensions can influence the result of an explanation model. This gives… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems