Skip to main content

Showing 1–30 of 30 results for author: Opper, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.03549  [pdf, other

    stat.ML cs.LG math.DS math.PR

    Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models

    Authors: Ludwig Winkler, Lorenz Richter, Manfred Opper

    Abstract: Generative modeling via stochastic processes has led to remarkable empirical results as well as to recent advances in their theoretical understanding. In principle, both space and time of the processes can be discrete or continuous. In this work, we study time-continuous Markov jump processes on discrete state spaces and investigate their correspondence to state-continuous diffusion processes give… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2404.01860  [pdf, other

    cs.CL

    Self-StrAE at SemEval-2024 Task 1: Making Self-Structuring AutoEncoders Learn More With Less

    Authors: Mattia Opper, N. Siddharth

    Abstract: This paper presents two simple improvements to the Self-Structuring AutoEncoder (Self-StrAE). Firstly, we show that including reconstruction to the vocabulary as an auxiliary objective improves representation quality. Secondly, we demonstrate that increasing the number of independent channels leads to significant improvements in embedding quality, while simultaneously reducing the number of parame… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: SemEval 2024

  3. arXiv:2402.08676  [pdf, other

    cs.LG cs.IT

    A Convergence Analysis of Approximate Message Passing with Non-Separable Functions and Applications to Multi-Class Classification

    Authors: Burak Çakmak, Yue M. Lu, Manfred Opper

    Abstract: Motivated by the recent application of approximate message passing (AMP) to the analysis of convex optimizations in multi-class classifications [Loureiro, et. al., 2021], we present a convergence analysis of AMP dynamics with non-separable multivariate nonlinearities. As an application, we present a complete (and independent) analysis of the motivated convex optimization problem.

    Submitted 13 February, 2024; originally announced February 2024.

  4. arXiv:2311.00128  [pdf, other

    cs.CL

    On the effect of curriculum learning with developmental data for grammar acquisition

    Authors: Mattia Opper, J. Morrison, N. Siddharth

    Abstract: This work explores the degree to which grammar acquisition is driven by language `simplicity' and the source modality (speech vs. text) of data. Using BabyBERTa as a probe, we find that grammar acquisition is largely driven by exposure to speech data, and in particular through exposure to two of the BabyLM training corpora: AO-Childes and Open Subtitles. We arrive at this finding by examining vari… ▽ More

    Submitted 3 November, 2023; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: CoNLL-CMCL Shared Task BabyLM Challenge 2023

  5. arXiv:2310.17638  [pdf, other

    cs.LG stat.ML

    Generative Fractional Diffusion Models

    Authors: Gabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek

    Abstract: We introduce the first continuous-time score-based generative model that leverages fractional diffusion processes for its underlying dynamics. Although diffusion models have excelled at capturing data distributions, they still suffer from various limitations such as slow convergence, mode-collapse on imbalanced data, and lack of diversity. These issues are partially linked to the use of light-tail… ▽ More

    Submitted 24 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    ACM Class: I.2.4; F.4.1; G.3

  6. arXiv:2310.12975  [pdf, other

    cs.LG cs.AI cs.CV stat.AP stat.ML

    Variational Inference for SDEs Driven by Fractional Noise

    Authors: Rembert Daems, Manfred Opper, Guillaume Crevecoeur, Tolga Birdal

    Abstract: We present a novel variational framework for performing inference in (neural) stochastic differential equations (SDEs) driven by Markov-approximate fractional Brownian motion (fBM). SDEs offer a versatile tool for modeling real-world continuous-time dynamic systems with inherent noise and randomness. Combining SDEs with the powerful inference capabilities of variational methods, enables the learni… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 24 pages, under review

  7. arXiv:2305.05588  [pdf, other

    cs.CL cs.LG

    StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure

    Authors: Mattia Opper, Victor Prokhorov, N. Siddharth

    Abstract: This work presents StrAE: a Structured Autoencoder framework that through strict adherence to explicit structure, and use of a novel contrastive objective over tree-structured representations, enables effective learning of multi-level representations. Through comparison over different forms of structure, we verify that our results are directly attributable to the informativeness of the structure p… ▽ More

    Submitted 25 October, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Main

  8. arXiv:2304.12290  [pdf, other

    cs.IT

    Joint Message Detection and Channel Estimation for Unsourced Random Access in Cell-Free User-Centric Wireless Networks

    Authors: Burak Çakmak, Eleni Gkiouzepi, Manfred Opper, Giuseppe Caire

    Abstract: We consider unsourced random access (uRA) in a cell-free (CF) user-centric wireless network, where a large number of potential users compete for a random access slot, while only a finite subset is active. The random access users transmit codewords of length $L$ symbols from a shared codebook, which are received by $B$ geographically distributed radio units (RUs) equipped with $M$ antennas each. Ou… ▽ More

    Submitted 5 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 45 pages, 9 figures, submitted to the IEEE Transactions on Information Theory

  9. Analysis of Random Sequential Message Passing Algorithms for Approximate Inference

    Authors: Burak Çakmak, Yue M. Lu, Manfred Opper

    Abstract: We analyze the dynamics of a random sequential message passing algorithm for approximate inference with large Gaussian latent variable models in a student-teacher scenario. To model nontrivial dependencies between the latent variables, we assume random covariance matrices drawn from rotation invariant ensembles. Moreover, we consider a model mismatching setting, where the teacher model and the one… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

  10. arXiv:2107.10066  [pdf, other

    stat.ML cs.LG

    Adaptive Inducing Points Selection For Gaussian Processes

    Authors: Théo Galy-Fajou, Manfred Opper

    Abstract: Gaussian Processes (\textbf{GPs}) are flexible non-parametric models with strong probabilistic interpretation. While being a standard choice for performing inference on time series, GPs have few techniques to work in a streaming setting. \cite{bui2017streaming} developed an efficient variational approach to train online GPs by using sparsity techniques: The whole set of observations is approximate… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Accepted at Continual Learning Workshop - ICML 2020 : https://sites.google.com/view/cl-icml/home

  11. arXiv:2105.09618  [pdf, other

    stat.ML cs.AI cs.LG stat.AP

    Nonlinear Hawkes Process with Gaussian Process Self Effects

    Authors: Noa Malem-Shinitski, Cesar Ojeda, Manfred Opper

    Abstract: Traditionally, Hawkes processes are used to model time--continuous point processes with history dependence. Here we propose an extended model where the self--effects are of both excitatory and inhibitory type and follow a Gaussian Process. Whereas previous work either relies on a less flexible parameterization of the model, or requires a large amount of data, our formulation allows for both a flex… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

  12. arXiv:2101.01571  [pdf, ps, other

    cond-mat.dis-nn cs.LG

    Exact solution to the random sequential dynamics of a message passing algorithm

    Authors: Burak Çakmak, Manfred Opper

    Abstract: We analyze the random sequential dynamics of a message passing algorithm for Ising models with random interactions in the large system limit. We derive exact results for the two-time correlation functions and the speed of convergence. The {\em de Almedia-Thouless} stability criterion of the static problem is found to be necessary and sufficient for the global convergence of the random sequential d… ▽ More

    Submitted 2 March, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in Physical Review E Letter

    Journal ref: Phys. Rev. E 103, 030101 (2021)

  13. arXiv:2005.01560  [pdf, ps, other

    cs.LG cond-mat.dis-nn stat.ML

    A Dynamical Mean-Field Theory for Learning in Restricted Boltzmann Machines

    Authors: Burak Çakmak, Manfred Opper

    Abstract: We define a message-passing algorithm for computing magnetizations in Restricted Boltzmann machines, which are Ising models on bipartite graphs introduced as neural network models for probability distributions over spin configurations. To model nontrivial statistical dependencies between the spins' couplings, we assume that the rectangular coupling matrix is drawn from an arbitrary bi-rotation inv… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: 29 pages, 2 figures

  14. arXiv:2002.11451  [pdf, other

    stat.ML cs.LG

    Automated Augmented Conjugate Inference for Non-conjugate Gaussian Process Models

    Authors: Théo Galy-Fajou, Florian Wenzel, Manfred Opper

    Abstract: We propose automated augmented conjugate inference, a new inference method for non-conjugate Gaussian processes (GP) models. Our method automatically constructs an auxiliary variable augmentation that renders the GP model conditionally conjugate. Building on the conjugate structure of the augmented model, we develop two inference methods. First, a fast and scalable stochastic variational inference… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: Accepted at AISTATS 2020

  15. arXiv:2002.02533  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn cs.LG stat.ML

    Understanding the dynamics of message passing algorithms: a free probability heuristics

    Authors: Manfred Opper, Burak Çakmak

    Abstract: We use freeness assumptions of random matrix theory to analyze the dynamical behavior of inference algorithms for probabilistic models with dense coupling matrices in the limit of large systems. For a toy Ising model, we are able to recover previous results such as the property of vanishing effective memories and the analytical convergence rate of the algorithm.

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: 11 pages, 2 figures. Presented at the conference "Random Matrix Theory: Applications in the Information Era'' 2019 Kraków

  16. arXiv:2001.04918  [pdf, ps, other

    cs.LG cond-mat.dis-nn stat.ML

    Analysis of Bayesian Inference Algorithms by the Dynamical Functional Approach

    Authors: Burak Çakmak, Manfred Opper

    Abstract: We analyze the dynamics of an algorithm for approximate inference with large Gaussian latent variable models in a student-teacher scenario. To model nontrivial dependencies between the latent variables, we assume random covariance matrices drawn from rotation invariant ensembles. For the case of perfect data-model matching, the knowledge of static order parameters derived from the replica method a… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 25 pages, 2 figures

  17. Tightening Bounds for Variational Inference by Revisiting Perturbation Theory

    Authors: Robert Bamler, Cheng Zhang, Manfred Opper, Stephan Mandt

    Abstract: Variational inference has become one of the most widely used methods in latent variable modeling. In its basic form, variational inference employs a fully factorized variational distribution and minimizes its KL divergence to the posterior. As the minimization can only be carried out approximately, this approximation induces a bias. In this paper, we revisit perturbation theory as a powerful way o… ▽ More

    Submitted 30 September, 2019; originally announced October 2019.

    Comments: To appear in Journal of Statistical Mechanics: Theory and Experiment (JSTAT), 2019

  18. arXiv:1905.09670  [pdf, other

    stat.ML cs.LG

    Multi-Class Gaussian Process Classification Made Conjugate: Efficient Inference via Data Augmentation

    Authors: Théo Galy-Fajou, Florian Wenzel, Christian Donner, Manfred Opper

    Abstract: We propose a new scalable multi-class Gaussian process classification approach building on a novel modified softmax likelihood function. The new likelihood has two benefits: it leads to well-calibrated uncertainty estimates and allows for an efficient latent variable augmentation. The augmented model has the advantage that it is conditionally conjugate leading to a fast variational inference metho… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

    Comments: Accepted at UAI 2019

  19. arXiv:1901.08583  [pdf, ps, other

    cond-mat.dis-nn cs.LG

    Memory-free dynamics for the TAP equations of Ising models with arbitrary rotation invariant ensembles of random coupling matrices

    Authors: Burak Çakmak, Manfred Opper

    Abstract: We propose an iterative algorithm for solving the Thouless-Anderson-Palmer (TAP) equations of Ising models with arbitrary rotation invariant (random) coupling matrices. In the thermodynamic limit, we prove by means of the dynamical functional method that the proposed algorithm converges when the so-called de Almeida Thouless (AT) criterion is fulfilled. Moreover, we give exact analytical expressio… ▽ More

    Submitted 7 March, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: 14 pages, 6 figures, the extended version of the previous preprint arXiv:1901.08583v1, both authors are co-first authors

    Journal ref: Phys. Rev. E 99, 062140 (2019)

  20. arXiv:1808.00831  [pdf, other

    stat.ML cs.LG

    Efficient Bayesian Inference of Sigmoidal Gaussian Cox Processes

    Authors: Christian Donner, Manfred Opper

    Abstract: We present an approximate Bayesian inference approach for estimating the intensity of an inhomogeneous Poisson process, where the intensity function is modelled using a Gaussian process (GP) prior via a sigmoid link function. Augmenting the model using a latent marked Poisson process and Pólya--Gamma random variables we obtain a representation of the likelihood which is conjugate to the GP prior.… ▽ More

    Submitted 3 May, 2019; v1 submitted 2 August, 2018; originally announced August 2018.

    Comments: 34 pages; 6 figures

    MSC Class: 60G55

    Journal ref: Journal of Machine Learning Research, year 2018, volume 19,number 67, pages 1-34

  21. arXiv:1805.11494  [pdf, ps, other

    stat.ML cs.LG

    Efficient Bayesian Inference for a Gaussian Process Density Model

    Authors: Christian Donner, Manfred Opper

    Abstract: We reconsider a nonparametric density model based on Gaussian processes. By augmenting the model with latent Pólya--Gamma random variables and a latent marked Poisson process we obtain a new likelihood which is conjugate to the model's Gaussian process prior. The augmented posterior allows for efficient inference by Gibbs sampling and an approximate variational mean field approach. For the latter… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: 11 pages, 5 figures

    MSC Class: 62G07; 60G15

  22. arXiv:1803.04497  [pdf, other

    cs.SE cs.LG stat.ML

    Automated software vulnerability detection with machine learning

    Authors: Jacob A. Harer, Louis Y. Kim, Rebecca L. Russell, Onur Ozdemir, Leonard R. Kosta, Akshay Rangamani, Lei H. Hamilton, Gabriel I. Centeno, Jonathan R. Key, Paul M. Ellingwood, Erik Antelman, Alan Mackay, Marc W. McConley, Jeffrey M. Opper, Peter Chin, Tomo Lazovich

    Abstract: Thousands of security vulnerabilities are discovered in production software each year, either reported publicly to the Common Vulnerabilities and Exposures database or discovered internally in proprietary code. Vulnerabilities often manifest themselves in subtle ways that are not obvious to code reviewers or the developers themselves. With the wealth of open source code available for analysis, the… ▽ More

    Submitted 2 August, 2018; v1 submitted 14 February, 2018; originally announced March 2018.

  23. arXiv:1802.06383  [pdf, other

    stat.ML cs.LG

    Efficient Gaussian Process Classification Using Polya-Gamma Data Augmentation

    Authors: Florian Wenzel, Theo Galy-Fajou, Christan Donner, Marius Kloft, Manfred Opper

    Abstract: We propose a scalable stochastic variational approach to GP classification building on Polya-Gamma data augmentation and inducing points. Unlike former approaches, we obtain closed-form updates based on natural gradients that lead to efficient optimization. We evaluate the algorithm on real-world datasets containing up to 11 million data points and demonstrate that it is up to two orders of magnit… ▽ More

    Submitted 27 November, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

  24. arXiv:1801.05411  [pdf, other

    cs.IT cs.LG

    Expectation Propagation for Approximate Inference: Free Probability Framework

    Authors: Burak Çakmak, Manfred Opper

    Abstract: We study asymptotic properties of expectation propagation (EP) -- a method for approximate inference originally developed in the field of machine learning. Applied to generalized linear models, EP iteratively computes a multivariate Gaussian approximation to the exact posterior distribution. The computational complexity of the repeated update of covariance matrices severely limits the application… ▽ More

    Submitted 9 May, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

    Comments: Both authors are co-first authors. The main body of this paper is accepted for publication in the proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT)

  25. arXiv:1709.07433  [pdf, other

    stat.ML cs.LG

    Perturbative Black Box Variational Inference

    Authors: Robert Bamler, Cheng Zhang, Manfred Opper, Stephan Mandt

    Abstract: Black box variational inference (BBVI) with reparameterization gradients triggered the exploration of divergence measures other than the Kullback-Leibler (KL) divergence, such as alpha divergences. In this paper, we view BBVI with generalized divergences as a form of estimating the marginal likelihood via biased importance sampling. The choice of divergence determines a bias-variance trade-off bet… ▽ More

    Submitted 6 January, 2018; v1 submitted 21 September, 2017; originally announced September 2017.

    Comments: In the proceedings of Advances in Neural Information Processing Systems (NIPS 2017)

  26. arXiv:1705.04284  [pdf, ps, other

    cs.IT

    Dynamical Functional Theory for Compressed Sensing

    Authors: Burak Çakmak, Manfred Opper, Ole Winther, Bernard H. Fleury

    Abstract: We introduce a theoretical approach for designing generalizations of the approximate message passing (AMP) algorithm for compressed sensing which are valid for large observation matrices that are drawn from an invariant random matrix ensemble. By design, the fixed points of the algorithm obey the Thouless-Anderson-Palmer (TAP) equations corresponding to the ensemble. Using a dynamical functional a… ▽ More

    Submitted 11 May, 2017; originally announced May 2017.

    Comments: 5 pages, accepted for ISIT 2017

  27. arXiv:1608.06602  [pdf, other

    cs.IT cs.LG

    Self-Averaging Expectation Propagation

    Authors: Burak Çakmak, Manfred Opper, Bernard H. Fleury, Ole Winther

    Abstract: We investigate the problem of approximate Bayesian inference for a general class of observation models by means of the expectation propagation (EP) framework for large systems under some statistical assumptions. Our approach tries to overcome the numerical bottleneck of EP caused by the inversion of large matrices. Assuming that the measurement matrices are realizations of specific types of ensemb… ▽ More

    Submitted 23 August, 2016; originally announced August 2016.

    Comments: 12 pages

  28. arXiv:1509.01229  [pdf, ps, other

    cond-mat.dis-nn cs.IT

    A Theory of Solving TAP Equations for Ising Models with General Invariant Random Matrices

    Authors: Manfred Opper, Burak Çakmak, Ole Winther

    Abstract: We consider the problem of solving TAP mean field equations by iteration for Ising model with coupling matrices that are drawn at random from general invariant ensembles. We develop an analysis of iterative algorithms using a dynamical functional approach that in the thermodynamic limit yields an effective dynamics of a single variable trajectory. Our main novel contribution is the expression for… ▽ More

    Submitted 28 March, 2016; v1 submitted 3 September, 2015; originally announced September 2015.

    Comments: 27 pages, 6 Figures Published in Journal of Physics A: Mathematical and Theoretical, Volume 49, Number 11, 2016

  29. arXiv:1406.7179  [pdf, other

    stat.ML cs.IT q-bio.NC

    Optimal Population Codes for Control and Estimation

    Authors: Alex Susemihl, Ron Meir, Manfred Opper

    Abstract: Agents acting in the natural world aim at selecting appropriate actions based on noisy and partial sensory observations. Many behaviors leading to decision mak- ing and action selection in a closed loop setting are naturally phrased within a control theoretic framework. Within the framework of optimal Control Theory, one is usually given a cost function which is minimized by selecting a control la… ▽ More

    Submitted 27 June, 2014; originally announced June 2014.

    Comments: 9 Pages, 4 figures

  30. arXiv:1309.3103  [pdf, ps, other

    stat.ML cs.LG

    Temporal Autoencoding Improves Generative Models of Time Series

    Authors: Chris Häusler, Alex Susemihl, Martin P Nawrot, Manfred Opper

    Abstract: Restricted Boltzmann Machines (RBMs) are generative models which can learn useful representations from samples of a dataset in an unsupervised fashion. They have been widely employed as an unsupervised pre-training method in machine learning. RBMs have been modified to model time series in two main ways: The Temporal RBM stacks a number of RBMs laterally and introduces temporal dependencies betwee… ▽ More

    Submitted 12 September, 2013; originally announced September 2013.