Showing 1–17 of 17 results for author: Lücke, J

Searching in archive stat.
  1. arXiv:2311.01888  [pdf, other]

    stat.ML cs.LG

    Learning Sparse Codes with Entropy-Based ELBOs

    Authors: Dmytro Velychko, Simon Damm, Asja Fischer, Jörg Lücke

    Abstract: Standard probabilistic sparse coding assumes a Laplace prior, a linear mapping from latents to observables, and Gaussian observable distributions. We here derive a solely entropy-based learning objective for the parameters of standard sparse coding. The novel variational objective has the following features: (A) unlike MAP approximations, it uses non-trivial posterior approximations for probabilis…

    Submitted 9 April, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  2. arXiv:2209.03077  [pdf, ps, other]

    stat.ML cs.IT cs.LG math.PR math.ST

    On the Convergence of the ELBO to Entropy Sums

    Authors: Jörg Lücke, Jan Warnken

    Abstract: The variational lower bound (a.k.a. ELBO or free energy) is the central objective for many established as well as many novel algorithms for unsupervised learning. During learning such algorithms change model parameters to increase the variational lower bound. Learning usually proceeds until parameters have converged to values close to a stationary point of the learning dynamics. In this purely the…

    Submitted 29 April, 2024; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 50 pages

    MSC Class: 62-08 ACM Class: G.3

  3. arXiv:2012.12294  [pdf, other]

    stat.ML cs.LG cs.NE

    Evolutionary Variational Optimization of Generative Models

    Authors: Jakob Drefs, Enrico Guiraud, Jörg Lücke

    Abstract: We combine two popular optimization approaches to derive learning algorithms for generative models: variational optimization and evolutionary algorithms. The combination is realized for generative models with discrete latents by using truncated posteriors as the family of variational distributions. The variational parameters of truncated posteriors are sets of latent states. By interpreting these…

    Submitted 16 April, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    MSC Class: 65C20; 68W50 ACM Class: G.3.0; I.2.6; I.4.0; I.5.1

    Journal ref: Journal of Machine Learning Research 23(21):1-51, 2022

  4. Direct Evolutionary Optimization of Variational Autoencoders With Binary Latents

    Authors: Enrico Guiraud, Jakob Drefs, Jörg Lücke

    Abstract: Discrete latent variables are considered important for real world data, which has motivated research on Variational Autoencoders (VAEs) with discrete latents. However, standard VAE training is not possible in this case, which has motivated different strategies to manipulate discrete distributions in order to train discrete VAEs similarly to conventional ones. Here we ask if it is also possible to…

    Submitted 24 March, 2023; v1 submitted 27 November, 2020; originally announced November 2020.

    MSC Class: 65C20; 68T07 ACM Class: G.3.0; I.2.6; I.4.0; I.5.1

  5. arXiv:2010.14860  [pdf, other]

    stat.ML cs.LG

    The ELBO of Variational Autoencoders Converges to a Sum of Three Entropies

    Authors: Simon Damm, Dennis Forster, Dmytro Velychko, Zhenwen Dai, Asja Fischer, Jörg Lücke

    Abstract: The central objective function of a variational autoencoder (VAE) is its variational lower bound (the ELBO). Here we show that for standard (i.e., Gaussian) VAEs the ELBO converges to a value given by the sum of three entropies: the (negative) entropy of the prior distribution, the expected (negative) entropy of the observable distribution, and the average entropy of the variational distributions…

    Submitted 20 April, 2023; v1 submitted 28 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS), PMLR 206:3931-3960, 2023

  6. arXiv:2003.02214  [pdf, other]

    cs.LG stat.ML

    Generic Unsupervised Optimization for a Latent Variable Model With Exponential Family Observables

    Authors: Hamid Mousavi, Jakob Drefs, Florian Hirschberger, Jörg Lücke

    Abstract: Latent variable models (LVMs) represent observed variables by parameterized functions of latent variables. Prominent examples of LVMs for unsupervised learning are probabilistic PCA or probabilistic SC which both assume a weighted linear summation of the latents to determine the mean of a Gaussian distribution for the observables. In many cases, however, observables do not follow a Gaussian distri…

    Submitted 15 December, 2023; v1 submitted 4 March, 2020; originally announced March 2020.

    Journal ref: Journal of Machine Learning Research, 24(285), 1-59 (2023)

  7. arXiv:1908.06843  [pdf, ps, other]

    eess.SP cs.LG stat.ML

    ProSper -- A Python Library for Probabilistic Sparse Coding with Non-Standard Priors and Superpositions

    Authors: Georgios Exarchakis, Jörg Bornschein, Abdul-Saboor Sheikh, Zhenwen Dai, Marc Henniges, Jakob Drefs, Jörg Lücke

    Abstract: ProSper is a python library containing probabilistic algorithms to learn dictionaries. Given a set of data points, the implemented algorithms seek to learn the elementary components that have generated the data. The library widens the scope of dictionary learning approaches beyond implementations of standard approaches such as ICA, NMF or standard L1 sparse coding. The implemented algorithms are e…

    Submitted 1 August, 2019; originally announced August 2019.

  8. Large Scale Clustering with Variational EM for Gaussian Mixture Models

    Authors: Florian Hirschberger, Dennis Forster, Jörg Lücke

    Abstract: This paper represents a preliminary (pre-reviewing) version of a sublinear variational algorithm for isotropic Gaussian mixture models (GMMs). Further developments of the algorithm for GMMs with diagonal covariance matrices (instead of isotropic clusters) and their corresponding benchmarking results have been published by TPAMI (doi:10.1109/TPAMI.2021.3133763) in the paper "A Variational EM Accele…

    Submitted 21 June, 2022; v1 submitted 1 October, 2018; originally announced October 2018.

  9. arXiv:1712.08104  [pdf, other]

    stat.ML

    Truncated Variational Sampling for "Black Box" Optimization of Generative Models

    Authors: Jörg Lücke, Zhenwen Dai, Georgios Exarchakis

    Abstract: We investigate the optimization of two probabilistic generative models with binary latent variables using a novel variational EM approach. The approach distinguishes itself from previous variational approaches by using latent states as variational parameters. Here we use efficient and general purpose sampling procedures to vary the latent states, and investigate the "black box" applicability of th…

    Submitted 22 February, 2018; v1 submitted 21 December, 2017; originally announced December 2017.

  10. arXiv:1711.03431  [pdf, other]

    stat.ML

    Can clustering scale sublinearly with its clusters? A variational EM acceleration of GMMs and $k$-means

    Authors: Dennis Forster, Jörg Lücke

    Abstract: One iteration of standard $k$-means (i.e., Lloyd's algorithm) or standard EM for Gaussian mixture models (GMMs) scales linearly with the number of clusters $C$, data points $N$, and data dimensionality $D$. In this study, we explore whether one iteration of $k$-means or EM for GMMs can scale sublinearly with $C$ at run-time, while improving the clustering objective remains effective. The tool we a…

    Submitted 17 April, 2018; v1 submitted 9 November, 2017; originally announced November 2017.

  11. $k$-means as a variational EM approximation of Gaussian mixture models

    Authors: Jörg Lücke, Dennis Forster

    Abstract: We show that $k$-means (Lloyd's algorithm) is obtained as a special case when truncated variational EM approximations are applied to Gaussian Mixture Models (GMM) with isotropic Gaussians. In contrast to the standard way to relate $k$-means and GMMs, the provided derivation shows that it is not required to consider Gaussians with small variances or the limit case of zero variances. There are a num…

    Submitted 6 June, 2019; v1 submitted 16 April, 2017; originally announced April 2017.

    MSC Class: 62H30

    Journal ref: Pattern Recognition Letters, 125: 349-356, 2019

  12. arXiv:1702.01997  [pdf, other]

    stat.ML cs.LG

    Truncated Variational EM for Semi-Supervised Neural Simpletrons

    Authors: Dennis Forster, Jörg Lücke

    Abstract: Inference and learning for probabilistic generative networks is often very challenging and typically prevents scalability to as large networks as used for deep discriminative approaches. To obtain efficiently trainable, large-scale and well performing generative networks for semi-supervised learning, we here combine two recent developments: a neural network reformulation of hierarchical Poisson mi…

    Submitted 7 February, 2017; originally announced February 2017.

  13. arXiv:1610.03113  [pdf, other]

    stat.ML

    Truncated Variational Expectation Maximization

    Authors: Jörg Lücke

    Abstract: We derive a novel variational expectation maximization approach based on truncated posterior distributions. Truncated distributions are proportional to exact posteriors within subsets of a discrete state space and equal zero otherwise. The treatment of the distributions' subsets as variational parameters distinguishes the approach from previous variational approaches. The specific structure of tru…

    Submitted 11 July, 2019; v1 submitted 10 October, 2016; originally announced October 2016.

  14. Neural Simpletrons - Minimalistic Directed Generative Networks for Learning with Few Labels

    Authors: Dennis Forster, Abdul-Saboor Sheikh, Jörg Lücke

    Abstract: Classifiers for the semi-supervised setting often combine strong supervised models with additional learning objectives to make use of unlabeled data. This results in powerful though very complex models that are hard to train and that demand additional labels for optimal parameter tuning, which are often not given when labeled data is very sparse. We here study a minimalistic multi-layer generative…

    Submitted 18 November, 2016; v1 submitted 28 June, 2015; originally announced June 2015.

    Journal ref: Neural Computation Volume 30, Issue 8, August 2018, p.2113-2174

  15. GP-select: Accelerating EM using adaptive subspace preselection

    Authors: Jacquelyn A. Shelton, Jan Gasthaus, Zhenwen Dai, Joerg Luecke, Arthur Gretton

    Abstract: We propose a nonparametric procedure to achieve fast inference in generative graphical models when the number of latent states is very large. The approach is based on iterative latent variable preselection, where we alternate between learning a 'selection function' to reveal the relevant latent variables, and use this to obtain a compact approximation of the posterior distribution for EM; this can…

    Submitted 17 July, 2016; v1 submitted 10 December, 2014; originally announced December 2014.

  16. arXiv:1211.3589  [pdf, other]

    stat.ML

    A Truncated EM Approach for Spike-and-Slab Sparse Coding

    Authors: Abdul-Saboor Sheikh, Jacquelyn A. Shelton, Jörg Lücke

    Abstract: We study inference and learning based on a sparse coding model with `spike-and-slab' prior. As in standard sparse coding, the model used assumes independent latent sources that linearly combine to generate data points. However, instead of using a standard sparse prior such as a Laplace distribution, we study the application of a more flexible `spike-and-slab' distribution which models the absence…

    Submitted 3 September, 2014; v1 submitted 15 November, 2012; originally announced November 2012.

    Comments: To appear in JMLR (2014)

    Journal ref: Journal of Machine Learning Research, 15:2653-2687, 2014

  17. arXiv:1105.2493  [pdf, other]

    stat.ML

    Closed-form EM for Sparse Coding and its Application to Source Separation

    Authors: Jörg Lücke, Abdul-Saboor Sheikh

    Abstract: We define and discuss the first sparse coding algorithm based on closed-form EM updates and continuous latent variables. The underlying generative model consists of a standard `spike-and-slab' prior and a Gaussian noise model. Closed-form solutions for E- and M-step equations are derived by generalizing probabilistic PCA. The resulting EM algorithm can take all modes of a potentially multi-modal p…

    Submitted 2 March, 2012; v1 submitted 12 May, 2011; originally announced May 2011.

    Comments: joint first authorship

    Journal ref: Lücke, J. and Sheikh, A.-S. Proc. LVA/ICA, LNCS pp. 213-221, 2012