Showing 1–19 of 19 results for author: Gepperth, A

  1. arXiv:2311.11908

    cs.LG cs.AI cs.CV

    Continual Learning: Applications and the Road Forward

    Authors: Eli Verwimp, Rahaf Aljundi, Shai Ben-David, Matthias Bethge, Andrea Cossu, Alexander Gepperth, Tyler L. Hayes, Eyke Hüllermeier, Christopher Kanan, Dhireesha Kudithipudi, Christoph H. Lampert, Martin Mundt, Razvan Pascanu, Adrian Popescu, Andreas S. Tolias, Joost van de Weijer, Bing Liu, Vincenzo Lomonaco, Tinne Tuytelaars, Gido M. van de Ven

    Abstract: Continual learning is a subfield of machine learning, which aims to allow machine learning models to continuously learn on new data, by accumulating knowledge without forgetting what was learned in the past. In this work, we take a step back, and ask: "Why should one care about continual learning in the first place?". We set the stage by examining recent continual learning papers published at four…

    Submitted 28 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Transactions on Machine Learning Research (TMLR), 2024

  2. arXiv:2308.15157

    cs.LG cs.NE eess.SY

    On the improvement of model-predictive controllers

    Authors: L. Féret, A. Gepperth, S. Lambeck

    Abstract: This article investigates synthetic model-predictive control (MPC) problems to demonstrate that an increased precision of the internal prediction model (PM) automatically entails an improvement of the controller as a whole. In contrast to reinforcement learning (RL), MPC uses the PM to predict subsequent states of the controlled system (CS), instead of directly recommending suitable actions. To ass…

    Submitted 29 August, 2023; originally announced August 2023.

  3. arXiv:2308.13778

    cs.LG

    Large-scale gradient-based training of Mixtures of Factor Analyzers

    Authors: Alexander Gepperth

    Abstract: Gaussian Mixture Models (GMMs) are a standard tool in data analysis. However, they face problems when applied to high-dimensional data (e.g., images) due to the size of the required full covariance matrices (CMs), whereas the use of diagonal or spherical CMs often imposes restrictions that are too severe. The Mixture of Factor Analyzers (MFA) model is an important extension of GMMs, which allows t…

    Submitted 26 August, 2023; originally announced August 2023.

  4. arXiv:2303.13157

    cs.LG cs.AI

    Adiabatic replay for continual learning

    Authors: Alexander Krawczyk, Alexander Gepperth

    Abstract: Conventional replay-based approaches to continual learning (CL) require, for each learning phase with new data, the replay of samples representing all of the previously learned knowledge in order to avoid catastrophic forgetting. Since the amount of learned knowledge grows over time in CL problems, generative replay spends an increasing amount of time just re-learning what is already known. In thi…

    Submitted 23 March, 2023; originally announced March 2023.

  5. arXiv:2208.14307

    cs.LG

    Beyond Supervised Continual Learning: a Review

    Authors: Benedikt Bagus, Alexander Gepperth, Timothée Lesort

    Abstract: Continual Learning (CL, sometimes also termed incremental learning) is a flavor of machine learning where the usual assumption of stationary data distribution is relaxed or omitted. When naively applying, e.g., DNNs in CL problems, changes in the data distribution can cause the so-called catastrophic forgetting (CF) effect: an abrupt loss of previous knowledge. Although many significant contributi…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at the ESANN2022, 19 pages, 1 figure

  6. arXiv:2206.03934

    cs.LG

    A Study of Continual Learning Methods for Q-Learning

    Authors: Benedikt Bagus, Alexander Gepperth

    Abstract: We present an empirical study on the use of continual learning (CL) methods in a reinforcement learning (RL) scenario, which, to the best of our knowledge, has not been described before. CL is a very active recent research topic concerned with machine learning under non-stationary data distributions. Although this naturally applies to RL, the use of dedicated CL methods is still uncommon. This may…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted at the IJCNN2022, 9 pages, 9 figures

  7. arXiv:2203.11034

    cs.LG

    A new perspective on probabilistic image modeling

    Authors: Alexander Gepperth

    Abstract: We present the Deep Convolutional Gaussian Mixture Model (DCGMM), a new probabilistic approach for image modeling capable of density estimation, sampling and tractable inference. DCGMM instances exhibit a CNN-like layered structure, in which the principal building blocks are convolutional Gaussian Mixture (cGMM) layers. A key innovation w.r.t. related models like sum-product networks (SPNs) and pr…

    Submitted 21 March, 2022; originally announced March 2022.

  8. An Investigation of Replay-based Approaches for Continual Learning

    Authors: Benedikt Bagus, Alexander Gepperth

    Abstract: Continual learning (CL) is a major challenge of machine learning (ML) and describes the ability to learn several tasks sequentially without catastrophic forgetting (CF). Recent works indicate that CL is a complex topic, even more so when real-world scenarios with multiple constraints are involved. Several solution classes have been proposed, of which so-called replay-based approaches seem very pro…

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: Accepted at the IJCNN2021, 9 pages, 1 figure

  9. arXiv:2104.12686

    cs.CV cs.LG

    Image Modeling with Deep Convolutional Gaussian Mixture Models

    Authors: Alexander Gepperth, Benedikt Pfülb

    Abstract: In this conceptual work, we present Deep Convolutional Gaussian Mixture Models (DCGMMs): a new formulation of deep hierarchical Gaussian Mixture Models (GMMs) that is particularly suitable for describing and generating images. Vanilla (i.e., flat) GMMs require a very large number of components to describe images well, leading to long training times and memory issues. DCGMMs avoid this by a stacked…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: accepted at IJCNN2021, 9 pages, 7 figures

  10. arXiv:2104.09240

    cs.LG stat.ML

    Continual Learning with Fully Probabilistic Models

    Authors: Benedikt Pfülb, Alexander Gepperth, Benedikt Bagus

    Abstract: We present an approach for continual learning (CL) that is based on fully probabilistic (or generative) models of machine learning. In contrast to, e.g., GANs that are "generative" in the sense that they can generate samples, fully probabilistic models aim at modeling the data distribution directly. Consequently, they provide functionalities that are highly relevant for continual learning, such as…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted as Findings at the CLVISION2021 workshop, 11 pages, 6 figures

  11. arXiv:2104.09220

    cs.LG

    Overcoming Catastrophic Forgetting with Gaussian Mixture Replay

    Authors: Benedikt Pfülb, Alexander Gepperth

    Abstract: We present Gaussian Mixture Replay (GMR), a rehearsal-based approach for continual learning (CL) based on Gaussian Mixture Models (GMM). CL approaches are intended to tackle the problem of catastrophic forgetting (CF), which occurs for Deep Neural Networks (DNNs) when sequentially training them on successive sub-tasks. GMR mitigates CF by generating samples from previous tasks and merging them wit…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: accepted at IJCNN2021, 9 pages, 12 figures

  12. arXiv:2009.11710

    cs.LG stat.ML

    A Rigorous Link Between Self-Organizing Maps and Gaussian Mixture Models

    Authors: Alexander Gepperth, Benedikt Pfülb

    Abstract: This work presents a mathematical treatment of the relation between Self-Organizing Maps (SOMs) and Gaussian Mixture Models (GMMs). We show that energy-based SOM models can be interpreted as performing gradient descent, minimizing an approximation to the GMM log-likelihood that is particularly valid for high data dimensionalities. The SOM-like decrease of the neighborhood radius can be understood…

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 10 pages, 2 figures, submitted and accepted at International Conference on Artificial Neural Networks (ICANN) 2020

  13. arXiv:1912.09379

    cs.LG stat.ML

    Gradient-based training of Gaussian Mixture Models for High-Dimensional Streaming Data

    Authors: Alexander Gepperth, Benedikt Pfülb

    Abstract: We present an approach for efficiently training Gaussian Mixture Model (GMM) by Stochastic Gradient Descent (SGD) with non-stationary, high-dimensional streaming data. Our training scheme does not require data-driven parameter initialization (e.g., k-means) and can thus be trained based on a random initialization. Furthermore, the approach allows mini-batch sizes as low as 1, which are typical for…

    Submitted 2 July, 2021; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: 17 pages, 4 figures, preprint Neural Processing Letters

  14. A Study of Deep Learning for Network Traffic Data Forecasting

    Authors: Benedikt Pfülb, Christoph Hardegen, Alexander Gepperth, Sebastian Rieger

    Abstract: We present a study of deep learning applied to the domain of network traffic data forecasting. This is a very important ingredient for network traffic engineering, e.g., intelligent routing, which can optimize network performance, especially in large networks. In a nutshell, we wish to predict, in advance, the bit rate for a transmission, based on low-dimensional connection metadata ("flows") that…

    Submitted 12 September, 2019; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: 16 pages, 12 figures, 28th International Conference on Artificial Neural Networks (ICANN 2019)

  15. arXiv:1905.08101

    cs.LG stat.ML

    A comprehensive, application-oriented study of catastrophic forgetting in DNNs

    Authors: B. Pfülb, A. Gepperth

    Abstract: We present a large-scale empirical study of catastrophic forgetting (CF) in modern Deep Neural Network (DNN) models that perform sequential (or: incremental) learning. A new experimental protocol is proposed that enforces typical constraints encountered in application scenarios. As the investigation is empirical, we evaluate CF behavior on the hitherto largest number of visual classification datas…

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: 14 pages, 12 + 23 figures, ICLR 2019, Seventh International Conference on Learning Representations

    Journal ref: ICLR 2019 International Conference on Learning Representations

  16. Catastrophic forgetting: still a problem for DNNs

    Authors: B. Pfülb, A. Gepperth, S. Abdullah, A. Kilian

    Abstract: We investigate the performance of DNNs when trained on class-incremental visual problems consisting of initial training, followed by retraining with added visual classes. Catastrophic forgetting (CF) behavior is measured using a new evaluation procedure that aims at an application-oriented view of incremental learning. In particular, it imposes that model selection must be performed on the initial…

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: 10 pages, 11 figures, Artificial Neural Networks and Machine Learning - ICANN 2018

  17. arXiv:1810.12069

    cs.LG cs.AI stat.ML

    Marginal Replay vs Conditional Replay for Continual Learning

    Authors: Timothée Lesort, Alexander Gepperth, Andrei Stoian, David Filliat

    Abstract: We present a new replay-based method of continual classification learning that we term "conditional replay" which generates samples and labels together by sampling from a distribution conditioned on the class. We compare conditional replay to another replay-based continual learning paradigm (which we term "marginal replay") that generates samples independently of their class and assigns labels i…

    Submitted 1 July, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

  18. arXiv:1601.01157

    cs.LG

    A simple technique for improving multi-class classification with neural networks

    Authors: Thomas Kopinski, Alexander Gepperth, Uwe Handmann

    Abstract: We present a novel method to perform multi-class pattern classification with neural networks and test it on a challenging 3D hand gesture recognition problem. Our method consists of a standard one-against-all (OAA) classification, followed by another network layer classifying the resulting class scores, possibly augmented by the original raw input vector. This allows the network to disambiguate ha…

    Submitted 6 January, 2016; originally announced January 2016.

    Comments: European Symposium on artificial neural networks (ESANN), Jun 2015, Bruges, Belgium

  19. A pragmatic approach to multi-class classification

    Authors: Thomas Kopinski, Stéphane Magand, Uwe Handmann, Alexander Gepperth

    Abstract: We present a novel hierarchical approach to multi-class classification which is generic in that it can be applied to different classification models (e.g., support vector machines, perceptrons), and makes no explicit assumptions about the probabilistic structure of the problem as it is usually done in multi-class classification. By adding a cascade of additional classifiers, each of which receives…

    Submitted 6 January, 2016; originally announced January 2016.

    Comments: European Symposium on artificial neural networks (ESANN), Apr 2015, Bruges, Belgium. 2015