Skip to main content

Showing 1–42 of 42 results for author: Bayer, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16066  [pdf, other

    cs.HC cs.LG cs.SI

    Social Media Use is Predictable from App Sequences: Using LSTM and Transformer Neural Networks to Model Habitual Behavior

    Authors: Heinrich Peters, Joseph B. Bayer, Sandra C. Matz, Yikun Chi, Sumer S. Vaid, Gabriella M. Harari

    Abstract: The present paper introduces a novel approach to studying social media habits through predictive modeling of sequential smartphone user behaviors. While much of the literature on media and technology habits has relied on self-report questionnaires and simple behavioral frequency measures, we examine an important yet understudied aspect of media and technology habits: their embeddedness in repetiti… ▽ More

    Submitted 23 June, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  2. arXiv:2402.11093  [pdf, other

    cs.CV cs.LG

    Modular Graph Extraction for Handwritten Circuit Diagram Images

    Authors: Johannes Bayer, Leo van Waveren, Andreas Dengel

    Abstract: As digitization in engineering progressed, circuit diagrams (also referred to as schematics) are typically developed and maintained in computer-aided engineering (CAE) systems, thus allowing for automated verification, simulation and further processing in downstream engineering steps. However, apart from printed legacy schematics, hand-drawn circuit diagrams are still used today in the educational… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: As submitted to ICDAR24; 11 pages, 9 figures, 1 table

  3. Utilizing dataset affinity prediction in object detection to assess training data

    Authors: Stefan Becker, Jens Bayer, Ronny Hug, Wolfgang Hübner, Michael Arens

    Abstract: Data pooling offers various advantages, such as increasing the sample size, improving generalization, reducing sampling bias, and addressing data sparsity and quality, but it is not straightforward and may even be counterproductive. Assessing the effectiveness of pooling datasets in a principled manner is challenging due to the difficulty in estimating the overall information content of individual… ▽ More

    Submitted 8 May, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted at the International Conference on Robotics, Computer Vision and Intelligent Systems (ROBOVIS) 2024

  4. arXiv:2311.08093  [pdf

    cs.CL cs.AI

    Spot: A Natural Language Interface for Geospatial Searches in OSM

    Authors: Lynn Khellaf, Ipek Baris Schlicht, Julia Bayer, Ruben Bouwmeester, Tilman Miraß, Tilman Wagner

    Abstract: Investigative journalists and fact-checkers have found OpenStreetMap (OSM) to be an invaluable resource for their work due to its extensive coverage and intricate details of various locations, which play a crucial role in investigating news scenes. Despite its value, OSM's complexity presents considerable accessibility and usability challenges, especially for those without a technical background.… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: To be published in the Proceedings of the OSM Science 2023

  5. arXiv:2306.10963  [pdf, other

    cs.CV

    Eigenpatches -- Adversarial Patches from Principal Components

    Authors: Jens Bayer, Stefan Becker, David Münch, Michael Arens

    Abstract: Adversarial patches are still a simple yet powerful white box attack that can be used to fool object detectors by suppressing possible detections. The patches of these so-called evasion attacks are computational expensive to produce and require full access to the attacked detector. This paper addresses the problem of computational expensiveness by analyzing 375 generated patches, calculating the p… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  6. arXiv:2306.09074  [pdf, other

    cs.LO math.CT math.LO

    Category Theory in Isabelle/HOL as a Basis for Meta-logical Investigation

    Authors: Jonas Bayer, Aleksey Gonus, Christoph Benzmüller, Dana S. Scott

    Abstract: This paper presents meta-logical investigations based on category theory using the proof assistant Isabelle/HOL. We demonstrate the potential of a free logic based shallow semantic embedding of category theory by providing a formalization of the notion of elementary topoi. Additionally, we formalize symmetrical monoidal closed categories expressing the denotational semantic model of intuitionistic… ▽ More

    Submitted 16 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 15 pages. Preprint of paper accepted for CICM 2023 conference

    MSC Class: 68T15; 03B35; 03B80; 03B15; 08A05; 03C10; 03C68; 03C75; 20B05; 54H20 ACM Class: F.4; I.2.3

    Journal ref: Intelligent Computer Mathematics (CICM 2023). Lecture Notes in Computer Science, vol 14101, pp. 69-83. Springer, Cham

  7. arXiv:2304.10246  [pdf, other

    cs.LG cs.RO eess.SY

    Filter-Aware Model-Predictive Control

    Authors: Baris Kayalibay, Atanas Mirchev, Ahmed Agha, Patrick van der Smagt, Justin Bayer

    Abstract: Partially-observable problems pose a trade-off between reducing costs and gathering information. They can be solved optimally by planning in belief space, but that is often prohibitively expensive. Model-predictive control (MPC) takes the alternative approach of using a state estimator to form a belief over the state, and then plan in state space. This ignores potential future observations during… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  8. arXiv:2301.03155  [pdf, other

    cs.CV

    Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images

    Authors: Johannes Bayer, Amit Kumar Roy, Andreas Dengel

    Abstract: Handwritten circuit diagrams from educational scenarios or historic sources usually exist on analogue media. For deriving their functional principles or flaws automatically, they need to be digitized, extracting their electrical graph. Recently, the base technologies for automated pipelines facilitating this process shifted from computer vision to machine learning. This paper describes an approach… ▽ More

    Submitted 18 January, 2023; v1 submitted 8 January, 2023; originally announced January 2023.

    Comments: As submitted to ICPRAM23

  9. Study on Domain Name System (DNS) Abuse: Technical Report

    Authors: Jan Bayer, Yevheniya Nosyk, Olivier Hureau, Simon Fernandez, Ivett Paulovics, Andrzej Duda, Maciej Korczyński

    Abstract: A safe and secure Domain Name System (DNS) is of paramount importance for the digital economy and society. Malicious activities on the DNS, generally referred to as "DNS abuse" are frequent and severe problems affecting online security and undermining users' trust in the Internet. The proposed definition of DNS abuse is as follows: Domain Name System (DNS) abuse is any activity that makes use of d… ▽ More

    Submitted 17 December, 2022; originally announced December 2022.

  10. arXiv:2212.02988  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    PRISM: Probabilistic Real-Time Inference in Spatial World Models

    Authors: Atanas Mirchev, Baris Kayalibay, Ahmed Agha, Patrick van der Smagt, Daniel Cremers, Justin Bayer

    Abstract: We introduce PRISM, a method for real-time filtering in a probabilistic generative model of agent motion and visual perception. Previous approaches either lack uncertainty estimates for the map and agent state, do not run in real-time, do not have a dense scene representation or do not model agent dynamics. Our solution reconciles all of these aspects. We start from a predefined state-space model… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Will appear in PMLR, CoRL 2022

  11. arXiv:2209.05533  [pdf, other

    cs.OH cs.AR

    Functional Component Descriptions for Electrical Circuits based on Semantic Technology Reasoning

    Authors: Johannes Bayer, Mina Karami Zadeh, Markus Schröder, Andreas Dengel

    Abstract: Circuit diagrams have been used in electrical engineering for decades to describe the wiring of devices and facilities. They depict electrical components in a symbolic and graph-based manner. While the circuit design is usually performed electronically, there are still legacy paper-based diagrams that require digitization in order to be used in CAE systems. Generally, knowledge on specific circuit… ▽ More

    Submitted 23 June, 2022; originally announced September 2022.

    Comments: 5 pages, 8 figures

  12. arXiv:2207.04779  [pdf, ps, other

    math.HO cs.LO

    Mathematical Proof Between Generations

    Authors: Jonas Bayer, Christoph Benzmüller, Kevin Buzzard, Marco David, Leslie Lamport, Yuri Matiyasevich, Lawrence Paulson, Dierk Schleicher, Benedikt Stock, Efim Zelmanov

    Abstract: A proof is one of the most important concepts of mathematics. However, there is a striking difference between how a proof is defined in theory and how it is used in practice. This puts the unique status of mathematics as exact science into peril. Now may be the time to reconcile theory and practice, i.e. precision and intuition, through the advent of computer proof assistants. For the most time th… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 17 pages, 1 figure

    Journal ref: Notices of the American Mathematical Society (January 2024), Vol. 71, No. 1, pp. 79-92

  13. arXiv:2201.10335  [pdf, other

    cs.LG

    Tracking and Planning with Spatial World Models

    Authors: Baris Kayalibay, Atanas Mirchev, Patrick van der Smagt, Justin Bayer

    Abstract: We introduce a method for real-time navigation and tracking with differentiably rendered world models. Learning models for control has led to impressive results in robotics and computer games, but this success has yet to be extended to vision-based navigation. To address this, we transfer advances in the emergent field of differentiable rendering to model-based control. We do this by planning in a… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  14. arXiv:2110.05911  [pdf, other

    cs.RO

    System for multi-robotic exploration of underground environments CTU-CRAS-NORLAB in the DARPA Subterranean Challenge

    Authors: Tomáš Rouček, Martin Pecka, Petr Čížek, Tomáš Petříček, Jan Bayer, Vojtěch Šalanský, Teymur Azayev, Daniel Heřt, Matěj Petrlík, Tomáš Báča, Vojtěch Spurný, Vít Krátký, Pavel Petráček, Dominic Baril, Maxime Vaidis, Vladimír Kubelka, François Pomerleau, Jan Faigl, Karel Zimmermann, Martin Saska, Tomáš Svoboda, Tomáš Krajník

    Abstract: We present a field report of CTU-CRAS-NORLAB team from the Subterranean Challenge (SubT) organised by the Defense Advanced Research Projects Agency (DARPA). The contest seeks to advance technologies that would improve the safety and efficiency of search-and-rescue operations in GPS-denied environments. During the contest rounds, teams of mobile robots have to find specific objects while operating… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: This paper have already been accepted to be published Filed Robotics special issue about DARPA SubT challange

  15. arXiv:2108.11767  [pdf, other

    cs.CV

    A Comparison of Deep Saliency Map Generators on Multispectral Data in Object Detection

    Authors: Jens Bayer, David Münch, Michael Arens

    Abstract: Deep neural networks, especially convolutional deep neural networks, are state-of-the-art methods to classify, segment or even generate images, movies, or sounds. However, these methods lack of a good semantic understanding of what happens internally. The question, why a COVID-19 detector has classified a stack of lung-ct images as positive, is sometimes more interesting than the overall specifici… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 12 pages, 11 figures

  16. arXiv:2107.10373  [pdf, other

    cs.CV

    A Public Ground-Truth Dataset for Handwritten Circuit Diagram Images

    Authors: Felix Thoma, Johannes Bayer, Yakun Li

    Abstract: The development of digitization methods for line drawings (especially in the area of electrical engineering) relies on the availability of publicly available training and evaluation data. This paper presents such an image set along with annotations. The dataset consists of 1152 images of 144 circuits by 12 drafters and 48 563 annotations. Each of these images depicts an electrical circuit diagram,… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: 6 pages, 3 figures, raw version as submitted to GREC2021

  17. Beginners' Quest to Formalize Mathematics: A Feasibility Study in Isabelle

    Authors: Jonas Bayer, Marco David, Abhik Pal, Benedikt Stock

    Abstract: How difficult are interactive theorem provers to use? We respond by reviewing the formalization of Hilbert's tenth problem in Isabelle/HOL carried out by an undergraduate research group at Jacobs University Bremen. We argue that, as demonstrated by our example, proof assistants are feasible for beginners to formalize mathematics. With the aim to make the field more accessible, we also survey hurdl… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: 11 pages, 1 figure; Published as a conference paper at CICM 2019

    Journal ref: Intelligent Computer Mathematics (CICM 2019). Lecture Notes in Computer Science, vol 11617, pp. 16-27. Springer, Cham

  18. arXiv:2101.07046  [pdf, other

    cs.LG stat.ML

    Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models

    Authors: Justin Bayer, Maximilian Soelch, Atanas Mirchev, Baris Kayalibay, Patrick van der Smagt

    Abstract: Amortised inference enables scalable learning of sequential latent-variable models (LVMs) with the evidence lower bound (ELBO). In this setting, variational posteriors are often only partially conditioned. While the true posteriors depend, e.g., on the entire sequence of observations, approximate posteriors are only informed by past observations. This mimics the Bayesian filter -- a mixture of smo… ▽ More

    Submitted 17 March, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Published as a conference paper at ICLR 2021 (Poster)

  19. arXiv:2006.10178  [pdf, other

    stat.ML cs.CV cs.LG

    Variational State-Space Models for Localisation and Dense 3D Map** in 6 DoF

    Authors: Atanas Mirchev, Baris Kayalibay, Patrick van der Smagt, Justin Bayer

    Abstract: We solve the problem of 6-DoF localisation and 3D dense reconstruction in spatial environments as approximate Bayesian inference in a deep state-space model. Our approach leverages both learning and domain knowledge from multiple-view geometry and rigid-body dynamics. This results in an expressive predictive model of the world, often missing in current state-of-the-art visual SLAM solutions. The c… ▽ More

    Submitted 15 March, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Update for ICLR2021

  20. arXiv:2003.01719  [pdf, other

    cs.CV cs.LG eess.IV

    Image-based OoD-Detector Principles on Graph-based Input Data in Human Action Recognition

    Authors: Jens Bayer, David Münch, Michael Arens

    Abstract: Living in a complex world like ours makes it unacceptable that a practical implementation of a machine learning system assumes a closed world. Therefore, it is necessary for such a learning-based system in a real world environment, to be aware of its own capabilities and limits and to be able to distinguish between confident and unconfident results of the inference, especially if the sample cannot… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

  21. arXiv:2002.04881  [pdf, other

    stat.ML cs.LG

    Learning Flat Latent Manifolds with VAEs

    Authors: Nutan Chen, Alexej Klushyn, Francesco Ferroni, Justin Bayer, Patrick van der Smagt

    Abstract: Measuring the similarity between data points often requires domain knowledge, which can in parts be compensated by relying on unsupervised methods such as latent-variable models, where similarity/distance is estimated in a more compact latent space. Prevalent is the use of the Euclidean metric, which has the drawback of ignoring information about similarity of data stored in the decoder, as captur… ▽ More

    Submitted 12 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Thirty-seventh International Conference on Machine Learning (ICML) 2020

    Journal ref: International Conference on Machine Learning 2020

  22. arXiv:1910.06205  [pdf, other

    stat.ML cs.CV cs.LG

    Variational Tracking and Prediction with Generative Disentangled State-Space Models

    Authors: Adnan Akhundov, Maximilian Soelch, Justin Bayer, Patrick van der Smagt

    Abstract: We address tracking and prediction of multiple moving objects in visual data streams as inference and sampling in a disentangled latent state-space model. By encoding objects separately and including explicit position information in the latent state space, we perform tracking via amortized variational Bayesian inference of the respective latent positions. Inference is implemented in a modular neur… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

  23. arXiv:1908.08750  [pdf, other

    stat.ML cs.LG

    Increasing the Generalisation Capacity of Conditional VAEs

    Authors: Alexej Klushyn, Nutan Chen, Botond Cseke, Justin Bayer, Patrick van der Smagt

    Abstract: We address the problem of one-to-many map**s in supervised learning, where a single instance has many different solutions of possibly equal cost. The framework of conditional variational autoencoders describes a class of methods to tackle such structured-prediction tasks by means of latent variables. We propose to incentivise informative latent representations for increasing the generalisation c… ▽ More

    Submitted 10 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

  24. On Deep Set Learning and the Choice of Aggregations

    Authors: Maximilian Soelch, Adnan Akhundov, Patrick van der Smagt, Justin Bayer

    Abstract: Recently, it has been shown that many functions on sets can be represented by sum decompositions. These decompositons easily lend themselves to neural approximations, extending the applicability of neural nets to set-valued inputs---Deep Set learning. This work investigates a core component of Deep Set architecture: aggregation functions. We suggest and examine alternatives to commonly used aggreg… ▽ More

    Submitted 8 April, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

  25. arXiv:1901.04436  [pdf, other

    stat.ML cs.LG

    Bayesian Learning of Neural Network Architectures

    Authors: Georgi Dikov, Patrick van der Smagt, Justin Bayer

    Abstract: In this paper we propose a Bayesian method for estimating architectural parameters of neural networks, namely layer size and network depth. We do this by learning concrete distributions over these parameters. Our results show that regular networks with a learnt structure can generalise better on small datasets, while fully stochastic networks can be more robust to parameter initialisation. The pro… ▽ More

    Submitted 27 January, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

    Comments: The 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

  26. arXiv:1812.08284  [pdf, other

    stat.ML cs.LG

    Fast Approximate Geodesics for Deep Generative Models

    Authors: Nutan Chen, Francesco Ferroni, Alexej Klushyn, Alexandros Paraschos, Justin Bayer, Patrick van der Smagt

    Abstract: The length of the geodesic between two data points along a Riemannian manifold, induced by a deep generative model, yields a principled measure of similarity. Current approaches are limited to low-dimensional latent spaces, due to the computational complexity of solving a non-convex optimisation problem. We propose finding shortest paths in a finite graph of samples from the aggregate approximate… ▽ More

    Submitted 23 May, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 28th International Conference on Artificial Neural Networks, 2019

    Journal ref: 28th International Conference on Artificial Neural Networks, 2019

  27. arXiv:1805.07206  [pdf, other

    stat.ML cs.LG

    Approximate Bayesian inference in spatial environments

    Authors: Atanas Mirchev, Baris Kayalibay, Maximilian Soelch, Patrick van der Smagt, Justin Bayer

    Abstract: Model-based approaches bear great promise for decision making of agents interacting with the physical world. In the context of spatial environments, different types of problems such as localisation, map**, navigation or autonomous exploration are typically adressed with specialised methods, often relying on detailed knowledge of the system at hand. We express these tasks as probabilistic inferen… ▽ More

    Submitted 20 June, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: Preprint of publication at RSS 2019

  28. arXiv:1711.01204  [pdf, other

    stat.ML cs.LG

    Metrics for Deep Generative Models

    Authors: Nutan Chen, Alexej Klushyn, Richard Kurle, Xueyan Jiang, Justin Bayer, Patrick van der Smagt

    Abstract: Neural samplers such as variational autoencoders (VAEs) or generative adversarial networks (GANs) approximate distributions by transforming samples from a simple random source---the latent space---to samples from a more complex distribution represented by a dataset. While the manifold hypothesis implies that the density induced by a dataset contains large regions of low density, the training crite… ▽ More

    Submitted 8 February, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Published on the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 2018

    Journal ref: The 21st International Conference on Artificial Intelligence and Statistics, 2018

  29. arXiv:1606.07312  [pdf, other

    cs.RO cs.LG stat.ML

    Unsupervised preprocessing for Tactile Data

    Authors: Maximilian Karl, Justin Bayer, Patrick van der Smagt

    Abstract: Tactile information is important for grip**, stable grasp, and in-hand manipulation, yet the complexity of tactile data prevents widespread use of such sensors. We make use of an unsupervised learning algorithm that transforms the complex tactile data into a compact, latent representation without the need to record ground truth reference data. These compact representations can either be used dir… ▽ More

    Submitted 23 June, 2016; originally announced June 2016.

  30. arXiv:1606.06588  [pdf, other

    cs.RO cs.LG

    ML-based tactile sensor calibration: A universal approach

    Authors: Maximilian Karl, Artur Lohrer, Dhananjay Shah, Frederik Diehl, Max Fiedler, Saahil Ognawala, Justin Bayer, Patrick van der Smagt

    Abstract: We study the responses of two tactile sensors, the fingertip sensor from the iCub and the BioTac under different external stimuli. The question of interest is to which degree both sensors i) allow the estimation of force exerted on the sensor and ii) enable the recognition of differing degrees of curvature. Making use of a force controlled linear motor affecting the tactile sensors we acquire seve… ▽ More

    Submitted 21 June, 2016; originally announced June 2016.

  31. arXiv:1605.06432  [pdf, other

    stat.ML cs.LG eess.SY

    Deep Variational Bayes Filters: Unsupervised Learning of State Space Models from Raw Data

    Authors: Maximilian Karl, Maximilian Soelch, Justin Bayer, Patrick van der Smagt

    Abstract: We introduce Deep Variational Bayes Filters (DVBF), a new method for unsupervised learning and identification of latent Markovian state space models. Leveraging recent advances in Stochastic Gradient Variational Bayes, DVBF can overcome intractable inference distributions via variational inference. Thus, it can handle highly nonlinear input data with temporal and spatial dependencies such as image… ▽ More

    Submitted 3 March, 2017; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: Published as a conference paper at ICLR 2017

  32. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  33. arXiv:1602.07109  [pdf, other

    stat.ML cs.LG

    Variational Inference for On-line Anomaly Detection in High-Dimensional Time Series

    Authors: Maximilian Soelch, Justin Bayer, Marvin Ludersdorfer, Patrick van der Smagt

    Abstract: Approximate variational inference has shown to be a powerful tool for modeling unknown complex probability distributions. Recent advances in the field allow us to learn probabilistic models of sequences that actively exploit spatial and temporal structure. We apply a Stochastic Recurrent Network (STORN) to learn robot time series data. Our evaluation demonstrates that we can robustly detect anomal… ▽ More

    Submitted 14 June, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

    Comments: Accepted as workshop paper at ICLR 2016; accepted as workshop paper for anomaly detection workshop at ICML 2016

  34. arXiv:1509.08455  [pdf, other

    stat.ML cs.LG

    Efficient Empowerment

    Authors: Maximilian Karl, Justin Bayer, Patrick van der Smagt

    Abstract: Empowerment quantifies the influence an agent has on its environment. This is formally achieved by the maximum of the expected KL-divergence between the distribution of the successor state conditioned on a specific action and a distribution where the actions are marginalised out. This is a natural candidate for an intrinsic reward signal in the context of reinforcement learning: the agent will pla… ▽ More

    Submitted 28 September, 2015; originally announced September 2015.

  35. arXiv:1507.05331  [pdf, ps, other

    stat.ML cs.LG

    Fast Adaptive Weight Noise

    Authors: Justin Bayer, Maximilian Karl, Daniela Korhammer, Patrick van der Smagt

    Abstract: Marginalising out uncertain quantities within the internal representations or parameters of neural networks is of central importance for a wide range of learning techniques, such as empirical, variational or full Bayesian methods. We set out to generalise fast dropout (Wang & Manning, 2013) to cover a wider variety of noise processes in neural networks. This leads to an efficient calculation of th… ▽ More

    Submitted 19 July, 2015; originally announced July 2015.

  36. arXiv:1411.7610  [pdf, other

    stat.ML cs.LG

    Learning Stochastic Recurrent Networks

    Authors: Justin Bayer, Christian Osendorfer

    Abstract: Leveraging advances in variational inference, we propose to enhance recurrent neural networks with latent variables, resulting in Stochastic Recurrent Networks (STORNs). The model i) can be trained with stochastic gradient methods, ii) allows structured and multi-modal conditionals at each time step, iii) features a reliable estimator of the marginal likelihood and iv) is a generalisation of deter… ▽ More

    Submitted 5 March, 2015; v1 submitted 27 November, 2014; originally announced November 2014.

    Comments: Submitted to conference track of ICLR 2015

  37. arXiv:1410.5684  [pdf, other

    stat.ML cs.LG

    Regularizing Recurrent Networks - On Injected Noise and Norm-based Methods

    Authors: Saahil Ognawala, Justin Bayer

    Abstract: Advancements in parallel processing have lead to a surge in multilayer perceptrons' (MLP) applications and deep learning in the past decades. Recurrent Neural Networks (RNNs) give additional representational power to feedforward MLPs by providing a way to treat sequential data. However, RNNs are hard to train using conventional error backpropagation methods because of the difficulty in relating in… ▽ More

    Submitted 21 October, 2014; originally announced October 2014.

  38. arXiv:1406.1655   

    stat.ML cs.LG

    Variational inference of latent state sequences using Recurrent Networks

    Authors: Justin Bayer, Christian Osendorfer

    Abstract: Recent advances in the estimation of deep directed graphical models and recurrent networks let us contribute to the removal of a blind spot in the area of probabilistc modelling of time series. The proposed methods i) can infer distributed latent state-space trajectories with nonlinear transitions, ii) scale to large data sets thanks to the use of a stochastic objective and fast, approximate infer… ▽ More

    Submitted 30 September, 2014; v1 submitted 6 June, 2014; originally announced June 2014.

    Comments: This paper has been withdrawn due to a derivation/implementation error and the resulting invalidation of the results

  39. arXiv:1311.0701  [pdf, other

    stat.ML cs.LG cs.NE

    On Fast Dropout and its Applicability to Recurrent Networks

    Authors: Justin Bayer, Christian Osendorfer, Daniela Korhammer, Nutan Chen, Sebastian Urban, Patrick van der Smagt

    Abstract: Recurrent Neural Networks (RNNs) are rich models for the processing of sequential data. Recent work on advancing the state of the art has been focused on the optimization or modelling of RNNs, mostly motivated by adressing the problems of the vanishing and exploding gradients. The control of overfitting has seen considerably less attention. This paper contributes to that by analyzing fast dropout,… ▽ More

    Submitted 5 March, 2014; v1 submitted 4 November, 2013; originally announced November 2013.

    Comments: The experiments for the Penn Treebank corpus were erroneous and have been stripped from this version

  40. arXiv:1304.7948  [pdf, ps, other

    cs.CV

    Convolutional Neural Networks learn compact local image descriptors

    Authors: Christian Osendorfer, Justin Bayer, Patrick van der Smagt

    Abstract: A standard deep convolutional neural network paired with a suitable loss function learns compact local image descriptors that perform comparably to state-of-the art approaches.

    Submitted 2 June, 2013; v1 submitted 30 April, 2013; originally announced April 2013.

  41. arXiv:1301.2840  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Feature Learning for low-level Local Image Descriptors

    Authors: Christian Osendorfer, Justin Bayer, Sebastian Urban, Patrick van der Smagt

    Abstract: Unsupervised feature learning has shown impressive results for a wide range of input modalities, in particular for object classification tasks in computer vision. Using a large amount of unlabeled data, unsupervised feature learning methods are utilized to construct high-level representations that are discriminative enough for subsequently trained supervised classification algorithms. However, it… ▽ More

    Submitted 25 April, 2013; v1 submitted 13 January, 2013; originally announced January 2013.

  42. arXiv:1109.2034  [pdf, other

    cs.NE cs.LG

    Learning Sequence Neighbourhood Metrics

    Authors: Justin Bayer, Christian Osendorfer, Patrick van der Smagt

    Abstract: Recurrent neural networks (RNNs) in combination with a pooling operator and the neighbourhood components analysis (NCA) objective function are able to detect the characterizing dynamics of sequences and embed them into a fixed-length vector space of arbitrary dimensionality. Subsequently, the resulting features are meaningful and can be used for visualization or nearest neighbour classification in… ▽ More

    Submitted 22 August, 2013; v1 submitted 9 September, 2011; originally announced September 2011.

    Comments: Artificial Neural Networks and Machine Learning ICANN 2012 Springer Berlin Heidelberg 2012. 531-538