Search | arXiv e-print repository

Convergence rates of particle approximation of forward-backward splitting algorithm for granular medium equations

Authors: Matej Benko, Iwona Chlebicka, Jørgen Endal, Błażej Miasojedow

Abstract: We study the spatially homogeneous granular medium equation \[\partial_tμ=\rm{div}(μ\nabla V)+\rm{div}(μ(\nabla W \ast μ))+Δμ\,,\] within a large and natural class of the confinement potentials $V$ and interaction potentials $W$. The considered problem do not need to assume that $\nabla V$ or $\nabla W$ are globally Lipschitz. With the aim of providing particle approximation of solutions, we design efficient forward-backward splitting algorithms. Sharp convergence rates in terms of the Wasserstein distance are provided.

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2403.19070 [pdf, ps, other]

Stability of solutions of the porous medium equation with growth with respect to the diffusion exponent

Authors: Tomasz Dębiec, Piotr Gwiazda, Błażej Miasojedow, Zuzanna Szymańska

Abstract: We consider a macroscopic model for the growth of living tissues incorporating pressure-driven dispersal and pressure-modulated proliferation. Assuming a power-law relation between the mechanical pressure and the cell density, the model can be expressed as the porous medium equation with a growth term. We prove Hölder continuous dependence of the solutions of the model on the diffusion exponent. The main difficulty lies in the degeneracy of the porous medium equations at vacuum. To deal with this issue, we first regularise the equation by shifting the initial data away from zero and then optimise the stability estimate derived in the regular setting.

Submitted 27 March, 2024; originally announced March 2024.

arXiv:2304.02109 [pdf, ps, other]

Solidarity of Gibbs Samplers: the spectral gap

Authors: Iwona Chlebicka, Krzysztof Łatuszyński, Błażej Miasojedow

Abstract: Gibbs samplers are preeminent Markov chain Monte Carlo algorithms used in computational physics and statistical computing. Yet, their most fundamental properties, such as relations between convergence characteristics of their various versions, are not well understood. In this paper we prove the solidarity of their spectral gaps: if any of the random scan or $d!$ deterministic scans has a spectral gap then all of them have. Our methods rely on geometric interpretation of the Gibbs samplers as alternating projection algorithms and analysis of the rate of convergence in the von Neumann--Halperin method of cyclic alternating projections. In addition, we provide a quantitative result: if the spectral gap of the random scan Gibbs sampler scales polynomially with dimension, so does the spectral gap of any of the deterministic scans.

Submitted 4 April, 2023; originally announced April 2023.
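To make the two scan orders named in the abstract above concrete, here is a minimal sketch of a deterministic-scan and a random-scan Gibbs sampler. The target (a standard bivariate normal with correlation `rho`), the function names and all parameters are illustrative assumptions and are not taken from the paper; the sketch only shows what the two variants do, not the spectral-gap analysis.

```python
import random


def gibbs_bivariate_normal(rho, n_steps, scan="deterministic", seed=0):
    """Gibbs sampler for a standard bivariate normal with correlation rho.

    Illustrative target only: the full conditionals are
    x | y ~ N(rho * y, 1 - rho^2) and y | x ~ N(rho * x, 1 - rho^2).
    """
    rng = random.Random(seed)
    cond_sd = (1.0 - rho * rho) ** 0.5
    x, y = 0.0, 0.0
    samples = []
    for _ in range(n_steps):
        if scan == "deterministic":
            # Deterministic scan: update every coordinate in a fixed order.
            x = rng.gauss(rho * y, cond_sd)
            y = rng.gauss(rho * x, cond_sd)
        else:
            # Random scan: update one coordinate chosen uniformly at random.
            if rng.random() < 0.5:
                x = rng.gauss(rho * y, cond_sd)
            else:
                y = rng.gauss(rho * x, cond_sd)
        samples.append((x, y))
    return samples


def sample_correlation(samples):
    """Empirical correlation of the two coordinates."""
    n = len(samples)
    mean_x = sum(s[0] for s in samples) / n
    mean_y = sum(s[1] for s in samples) / n
    cov = sum((s[0] - mean_x) * (s[1] - mean_y) for s in samples) / n
    var_x = sum((s[0] - mean_x) ** 2 for s in samples) / n
    var_y = sum((s[1] - mean_y) ** 2 for s in samples) / n
    return cov / (var_x * var_y) ** 0.5
```

Both variants leave the same target distribution invariant, so for a long run the empirical correlation from either scan should be close to `rho`.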

arXiv:2303.05877 [pdf, other]

Absence and presence of Lavrentiev's phenomenon for double phase functionals upon every choice of exponents

Authors: Michał Borowski, Iwona Chlebicka, Filomena De Filippis, Błażej Miasojedow

Abstract: We study classes of weights ensuring the absence and presence of the Lavrentiev's phenomenon for double phase functionals upon every choice of exponents. We introduce a new sharp scale for weights for which there is no Lavrentiev's phenomenon up to a counterexample we provide. This scale embraces the sharp range for $α$-Hölder continuous weights. Moreover, it allows excluding the gap for every choice of exponents $q,p>1$.

Submitted 10 March, 2023; originally announced March 2023.

arXiv:2210.15217 [pdf, ps, other]

Absence of Lavrentiev's gap for anisotropic functionals

Authors: Michał Borowski, Iwona Chlebicka, Błażej Miasojedow

Abstract: We establish the absence of the Lavrentiev gap between Sobolev and smooth maps for a non-autonomous variational problem of a general structure, where the integrand is assumed to be controlled by a function which is convex and anisotropic with respect to the last variable. This fact results from new results on good approximation properties of the natural underlying unconventional function space. Scalar and vector-valued problems are studied.

Submitted 27 October, 2022; originally announced October 2022.

MSC Class: 46E30

arXiv:2209.05618 [pdf, ps, other]

Boundedness of Wolff-type potentials and applications to PDEs

Authors: Michał Borowski, Iwona Chlebicka, Błażej Miasojedow

Abstract: We provide a short proof of a sharp rearrangement estimate for a generalized version of a potential of Wolff--Havin--Maz'ya type. As a consequence, we prove a reduction principle for that integral operators, that is, a characterization of those rearrangement invariant spaces between which the potentials are bounded via a one-dimensional inequality of Hardy-type. Since the special case of the mentioned potential is known to control precisely very weak solutions to a broad class of quasilinear elliptic PDEs of non-standard growth, we infer the local regularity properties of the solutions in rearrangement invariant spaces for prescribed classes of data.

Submitted 12 October, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

arXiv:2107.10952 [pdf, other]

doi 10.3847/1538-4357/ac1748

Predicting the redshift of gamma-ray loud AGNs using supervised machine learning

Authors: Maria Giovanna Dainotti, Malgorzata Bogdan, Aditya Narendra, Spencer James Gibson, Blazej Miasojedow, Ioannis Liodakis, Agnieszka Pollo, Trevor Nelson, Kamil Wozniak, Zooey Nguyen, Johan Larrson

Abstract: AGNs are very powerful galaxies characterized by extremely bright emissions coming out from their central massive black holes. Knowing the redshifts of AGNs provides us with an opportunity to determine their distance to investigate important astrophysical problems such as the evolution of the early stars, their formation along with the structure of early galaxies. The redshift determination is challenging because it requires detailed follow-up of multi-wavelength observations, often involving various astronomical facilities. Here, we employ machine learning algorithms to estimate redshifts from the observed gamma-ray properties and photometric data of gamma-ray loud AGN from the Fourth Fermi-LAT Catalog. The prediction is obtained with the Superlearner algorithm, using LASSO selected set of predictors. We obtain a tight correlation, with a Pearson Correlation Coefficient of 71.3% between the inferred and the observed redshifts, an average Δz_norm = 11.6 x 10^-4. We stress that notwithstanding the small sample of gamma-ray loud AGNs, we obtain a reliable predictive model using Superlearner, which is an ensemble of several machine learning models.

Submitted 22 July, 2021; originally announced July 2021.

Comments: 29 pages, 19 Figures with a total of 39 panels

arXiv:2106.05955 [pdf, other]

doi 10.1098/rsos.211279

Bayesian inference of a non-local proliferation model

Authors: Zuzanna Szymańska, Jakub Skrzeczkowski, Błażej Miasojedow, Piotr Gwiazda

Abstract: From a systems biology perspective the majority of cancer models, although interesting and providing a qualitative explanation of some problems, have a major disadvantage in that they usually miss a genuine connection with experimental data. Having this in mind, in this paper, we aim at contributing to the improvement of many cancer models which contain a proliferation term. To this end, we propose a new non-local model of cell proliferation. We select data which are suitable to perform a Bayesian inference for unknown parameters and we provide a discussion on the range of applicability of the model. Furthermore, we provide proof of the stability of a posteriori distributions in total variation norm which exploits the theory of spaces of measures equipped with the weighted flat norm. In a companion paper, we provide a detailed proof of the well-posedness of the problem and we investigate the convergence of the EBT algorithm applied to solve the equation.

Submitted 12 August, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

arXiv:2106.05115 [pdf, ps, other]

Convergence of the EBT method for a non-local model of cell proliferation with discontinuous interaction kernel

Authors: Piotr Gwiazda, Błażej Miasojedow, Jakub Skrzeczkowski, Zuzanna Szymańska

Abstract: We consider the EBT algorithm (a particle method) for the non-local equation with a discontinuous interaction kernel. The main difficulty lies in the low regularity of the kernel which is not Lipschitz continuous, thus preventing the application of standard arguments. Therefore, we use the radial symmetry of the problem instead and transform it using spherical coordinates. The resulting equation has a Lipschitz kernel with only one singularity at zero. We introduce a new weighted flat norm and prove that the particle method converges in this norm. We also comment on the two-dimensional case which requires the application of the theory of measure spaces on general metric spaces and present numerical simulations confirming the theoretical results. In a companion paper, we apply the Bayesian method to fit parameters to this model and study its theoretical properties.

Submitted 22 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

arXiv:2006.07648 [pdf, ps, other]

Structure learning for CTBN's via penalized maximum likelihood methods

Authors: Maryia Shpak, Błażej Miasojedow, Wojciech Rejchel

Abstract: The continuous-time Bayesian networks (CTBNs) represent a class of stochastic processes, which can be used to model complex phenomena, for instance, they can describe interactions occurring in living processes, in social science models or in medicine. The literature on this topic is usually focused on the case when the dependence structure of a system is known and we are to determine conditional transition intensities (parameters of the network). In the paper, we study the structure learning problem, which is a more challenging task and the existing research on this topic is limited. The approach, which we propose, is based on a penalized likelihood method. We prove that our algorithm, under mild regularity conditions, recognizes the dependence structure of the graph with high probability. We also investigate the properties of the procedure in numerical studies to demonstrate its effectiveness.

Submitted 13 June, 2020; originally announced June 2020.

arXiv:1909.06631 [pdf, other]

Adaptive Bayesian SLOPE -- High-dimensional Model Selection with Missing Values

Authors: Wei Jiang, Malgorzata Bogdan, Julie Josse, Blazej Miasojedow, Veronika Rockova, TraumaBase Group

Abstract: We consider the problem of variable selection in high-dimensional settings with missing observations among the covariates. To address this relatively understudied problem, we propose a new synergistic procedure -- adaptive Bayesian SLOPE -- which effectively combines the SLOPE method (sorted $l_1$ regularization) together with the Spike-and-Slab LASSO method. We position our approach within a Bayesian framework which allows for simultaneous variable selection and parameter estimation, despite the missing values. As with the Spike-and-Slab LASSO, the coefficients are regarded as arising from a hierarchical model consisting of two groups: (1) the spike for the inactive and (2) the slab for the active. However, instead of assigning independent spike priors for each covariate, here we deploy a joint "SLOPE" spike prior which takes into account the ordering of coefficient magnitudes in order to control for false discoveries. Through extensive simulations, we demonstrate satisfactory performance in terms of power, FDR and estimation bias under a wide range of scenarios. Finally, we analyze a real dataset consisting of patients from Paris hospitals who underwent a severe trauma, where we show excellent performance in predicting platelet levels. Our methodology has been implemented in C++ and wrapped into an R package ABSLOPE for public use.

Submitted 6 November, 2019; v1 submitted 14 September, 2019; originally announced September 2019.

Comments: R package https://github.com/wjiang94/ABSLOPE
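The "sorted $l_1$ regularization" mentioned in the abstract above can be illustrated with a short sketch of the sorted-L1 penalty as it is usually defined for SLOPE: the largest coefficient magnitude is paired with the largest tuning parameter, the second largest with the second largest, and so on. The function name and the example values are hypothetical, and this is not the ABSLOPE package's code; it shows only the penalty, not the Bayesian procedure.

```python
def sorted_l1_penalty(beta, lambdas):
    """Sorted-L1 penalty: sum_i lambdas[i] * |beta|_(i).

    |beta|_(1) >= |beta|_(2) >= ... are the coefficient magnitudes in
    decreasing order; lambdas must be non-increasing and non-negative.
    """
    if len(beta) != len(lambdas):
        raise ValueError("beta and lambdas must have the same length")
    if any(lam < 0 for lam in lambdas):
        raise ValueError("lambdas must be non-negative")
    if any(lambdas[i] < lambdas[i + 1] for i in range(len(lambdas) - 1)):
        raise ValueError("lambdas must be non-increasing")
    magnitudes = sorted((abs(b) for b in beta), reverse=True)
    return sum(lam * mag for lam, mag in zip(lambdas, magnitudes))


# Magnitudes sorted: 3, 2, 1, so the penalty is 3*3 + 2*2 + 1*1 = 14.
print(sorted_l1_penalty([1.0, -3.0, 2.0], [3.0, 2.0, 1.0]))
```

When all entries of `lambdas` are equal, the penalty reduces to the ordinary LASSO penalty, which is one way to see SLOPE as a generalization of it.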

arXiv:1907.05074 [pdf, other]

Gamma-ray Bursts as distance indicators through a machine learning approach

Authors: Maria Dainotti, Vahé Petrosian, Malgorzata Bogdan, Blazej Miasojedow, Shigehiro Nagataki, Trevor Hastie, Zooey Nuyngen, Sankalp Gilda, Xavier Hernandez, Dominika Krol

Abstract: Gamma-ray bursts (GRBs) are spectacularly energetic events, with the potential to inform on the early universe and its evolution, once their redshifts are known. Unfortunately, determining redshifts is a painstaking procedure requiring detailed follow-up multi-wavelength observations often involving various astronomical facilities, which have to be rapidly pointed at these serendipitous events. Here we use Machine Learning algorithms to infer redshifts from a collection of observed temporal and spectral features of GRBs. We obtained a very high correlation coefficient ($0.96$) between the inferred and the observed redshifts, and a small dispersion (with a mean square error of $0.003$) in the test set. The addition of plateau afterglow parameters improves the predictions by $61.4\%$ compared to previous results. The GRB luminosity function and cumulative density rate evolutions, obtained from predicted and observed redshift are in excellent agreement indicating that GRBs are effective distance indicators and a reliable step for the cosmic distance ladder.

Submitted 11 July, 2019; originally announced July 2019.

Comments: 22 pages, 5 figures to be submitted

arXiv:1903.11372 [pdf, other]

doi 10.1186/s12859-019-3118-5

Jaccard/Tanimoto similarity test and estimation methods

Authors: Neo Christopher Chung, Błażej Miasojedow, Michał Startek, Anna Gambin

Abstract: Binary data are used in a broad area of biological sciences. Using binary presence-absence data, we can evaluate species co-occurrences that help elucidate relationships among organisms and environments. To summarize similarity between occurrences of species, we routinely use the Jaccard/Tanimoto coefficient, which is the ratio of their intersection to their union. It is natural, then, to identify statistically significant Jaccard/Tanimoto coefficients, which suggest non-random co-occurrences of species. However, statistical hypothesis testing using this similarity coefficient has been seldom used or studied. We introduce a hypothesis test for similarity for biological presence-absence data, using the Jaccard/Tanimoto coefficient. Several key improvements are presented including unbiased estimation of expectation and centered Jaccard/Tanimoto coefficients, that account for occurrence probabilities. We derived the exact and asymptotic solutions and developed the bootstrap and measurement concentration algorithms to compute statistical significance of binary similarity. Comprehensive simulation studies demonstrate that our proposed methods produce accurate p-values and false discovery rates. The proposed estimation methods are orders of magnitude faster than the exact solution. The proposed methods are implemented in an open source R package called jaccard (https://cran.r-project.org/package=jaccard).
We introduce a suite of statistical methods for the Jaccard/Tanimoto similarity coefficient, that enable straightforward incorporation of probabilistic measures in analysis for species co-occurrences. Due to their generality, the proposed methods and implementations are applicable to a wide range of binary data arising from genomics, biochemistry, and other areas of science.

Submitted 27 March, 2019; originally announced March 2019.

MSC Class: 62F03; 62F40; 62P10; 62-07; 62E17 ACM Class: G.3; H.2.8; D.2.4

Journal ref: BMC Bioinformatics (2019) 20(Suppl 15): 644
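The coefficient described in the abstract above (the ratio of intersection to union of two presence-absence vectors) can be sketched in a few lines. The significance test shown here is a plain permutation test, swapped in for illustration; it is not one of the paper's exact, asymptotic, bootstrap or measure-concentration methods, and it is not the API of the `jaccard` R package. Function names, the empty-union convention and the example data are assumptions.

```python
import random


def jaccard(x, y):
    """Jaccard/Tanimoto coefficient of two equal-length binary vectors."""
    intersection = sum(1 for a, b in zip(x, y) if a and b)
    union = sum(1 for a, b in zip(x, y) if a or b)
    # Assumed convention: two all-zero vectors get similarity 0.
    return intersection / union if union else 0.0


def permutation_p_value(x, y, n_perm=2000, seed=0):
    """One-sided permutation p-value for a large Jaccard coefficient.

    Shuffling y breaks any association with x while keeping the number
    of presences in each vector fixed.
    """
    rng = random.Random(seed)
    observed = jaccard(x, y)
    y_perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)
        if jaccard(x, y_perm) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


x = [1, 1, 0, 0, 1]
y = [1, 0, 0, 1, 1]
print(jaccard(x, y))  # intersection 2, union 4, so 0.5
```

A permutation test of this kind becomes expensive for long vectors and many pairs, which is the sort of cost the faster estimation methods announced in the abstract are meant to avoid.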

arXiv:1902.00629 [pdf, ps, other]

Non-asymptotic Analysis of Biased Stochastic Approximation Scheme

Authors: Belhal Karimi, Blazej Miasojedow, Eric Moulines, Hoi-To Wai

Abstract: Stochastic approximation (SA) is a key method used in statistical learning. Recently, its non-asymptotic convergence analysis has been considered in many papers. However, most of the prior analyses are made under restrictive assumptions such as unbiased gradient estimates and convex objective function, which significantly limit their applications to sophisticated tasks such as online and reinforcement learning. These restrictions are all essentially relaxed in this work. In particular, we analyze a general SA scheme to minimize a non-convex, smooth objective function. We consider update procedure whose drift term depends on a state-dependent Markov chain and the mean field is not necessarily of gradient type, covering approximate second-order method and allowing asymptotic bias for the one-step updates. We illustrate these settings with the online EM algorithm and the policy-gradient method for average reward maximization in reinforcement learning.

Submitted 16 June, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

Comments: Accepted to COLT 2019; 32 pages. Minor updates in Section 3.2

arXiv:1808.02721 [pdf, ps, other]

Asymptotics of maximum likelihood estimators based on Markov chain Monte Carlo methods

Authors: Błażej Miasojedow, Wojciech Niemiro, Wojciech Rejchel

Abstract: In many complex statistical models maximum likelihood estimators cannot be calculated. In the paper we solve this problem using Markov chain Monte Carlo approximation of the true likelihood. In the main result we prove asymptotic normality of the estimator, when both sample sizes (the initial and Monte Carlo one) tend to infinity. Our result can be applied to models with intractable norming constants and missing data models.

Submitted 8 August, 2018; originally announced August 2018.

Comments: arXiv admin note: text overlap with arXiv:1412.6371

arXiv:1805.01916 [pdf, ps, other]

Analysis of nonsmooth stochastic approximation: the differential inclusion approach

Authors: Szymon Majewski, Błażej Miasojedow, Eric Moulines

Abstract: In this paper we address the convergence of stochastic approximation when the functions to be minimized are not convex and nonsmooth. We show that the "mean-limit" approach to the convergence which leads, for smooth problems, to the ODE approach can be adapted to the non-smooth case. The limiting dynamical system may be shown to be, under appropriate assumption, a differential inclusion. Our results expand earlier works in this direction by Benaim et al. (2005) and provide a general framework for proving convergence for unconstrained and constrained stochastic approximation problems, with either explicit or implicit updates. In particular, our results allow us to establish the convergence of stochastic subgradient and proximal stochastic gradient descent algorithms arising in a large class of deep learning and high-dimensional statistical inference with sparsity inducing penalties.

Submitted 4 May, 2018; originally announced May 2018.

arXiv:1802.09188 [pdf, other]

Analysis of Langevin Monte Carlo via convex optimization

Authors: Alain Durmus, Szymon Majewski, Błażej Miasojedow

Abstract: In this paper, we provide new insights on the Unadjusted Langevin Algorithm. We show that this method can be formulated as a first order optimization algorithm of an objective functional defined on the Wasserstein space of order $2$. Using this interpretation and techniques borrowed from convex optimization, we give a non-asymptotic analysis of this method to sample from logconcave smooth target distribution on $\mathbb{R}^d$. Based on this interpretation, we propose two new methods for sampling from a non-smooth target distribution, which we analyze as well. Besides, these new algorithms are natural extensions of the Stochastic Gradient Langevin Dynamics (SGLD) algorithm, which is a popular extension of the Unadjusted Langevin Algorithm. Similar to SGLD, they only rely on approximations of the gradient of the target log density and can be used for large-scale Bayesian inference.

Submitted 28 March, 2018; v1 submitted 26 February, 2018; originally announced February 2018.
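For readers unfamiliar with the Unadjusted Langevin Algorithm named in the abstract above, here is a minimal one-dimensional sketch of its standard update, x_{k+1} = x_k - γ ∇U(x_k) + sqrt(2γ) ξ_k with ξ_k standard normal, where the target density is proportional to exp(-U). The function name, the step size and the Gaussian example target are illustrative assumptions; the sketch does not implement the paper's new non-smooth variants or its Wasserstein-space analysis.

```python
import random


def ula(grad_U, x0, step, n_steps, seed=0):
    """Unadjusted Langevin Algorithm in one dimension.

    grad_U is the gradient of the potential U, where the target density
    is proportional to exp(-U). No Metropolis correction is applied, so
    the chain is biased for any fixed step size.
    """
    rng = random.Random(seed)
    noise_scale = (2.0 * step) ** 0.5
    x = x0
    samples = []
    for _ in range(n_steps):
        # Gradient step on U plus injected Gaussian noise.
        x = x - step * grad_U(x) + noise_scale * rng.gauss(0.0, 1.0)
        samples.append(x)
    return samples


# Example target: standard normal, U(x) = x^2 / 2, so grad U(x) = x.
samples = ula(lambda x: x, x0=0.0, step=0.1, n_steps=20000)
mean = sum(samples) / len(samples)
variance = sum((s - mean) ** 2 for s in samples) / len(samples)
```

For this Gaussian example the chain's stationary variance works out to 1 / (1 - step / 2), slightly above the target's variance of 1, which is the discretization bias that a non-asymptotic analysis has to quantify.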

arXiv:1708.00234 [pdf, other]

Assigning peaks and modeling ETD in top-down mass spectrometry

Authors: Mateusz Krzysztof Łącki, Frederik Lermyte, Błażej Miasojedow, Mikołaj Olszański, Michał Startek, Frank Sobott, Dirk Valkenborg, Anna Gambin

Abstract: Among many techniques of modern mass spectrometry, the top down methods are becoming continuously more popular in the overall strive to describe the proteome. These techniques are based on fragmentation of ions inside mass spectrometers instead of being proteolytically digested. In some of these techniques, the fragmentation is induced by electron transfer. It can trigger several concurring reactions: electron transfer dissociation, electron transfer without dissociation, and proton transfer reaction. The evaluation of the extent of these reactions is important for the proper understanding of the functioning of the instrument and, what is even more important, to know if it can be used to reveal important structural information. We present a workflow for assigning peaks and interpreting the results of electron transfer driven reactions. We also present software written in Python and available under GNU v3 license.

Submitted 25 August, 2017; v1 submitted 1 August, 2017; originally announced August 2017.

arXiv:1707.01660 [pdf, other]

Particle MCMC with Poisson Resampling: Parallelization and Continuous Time Models

Authors: Tomasz Cąkała, Błażej Miasojedow, Wojciech Niemiro

Abstract: We introduce a new version of particle filter in which the number of "children" of a particle at a given time has a Poisson distribution. As a result, the number of particles is random and varies with time. An advantage of this scheme is that descendants of different particles can evolve independently. It makes easy to parallelize computations. Moreover, particle filter with Poisson resampling is readily adapted to the case when a hidden process is a continuous time, piecewise deterministic semi-Markov process. We show that the basic techniques of particle MCMC, namely particle independent Metropolis-Hastings, particle Gibbs Sampler and its version with ancestor sampling, work under our Poisson resampling scheme. Our version of particle Gibbs Sampler is uniformly ergodic under the same assumptions as its standard counterpart. We present simulation results which indicate that our algorithms can compete with the existing methods.

Submitted 2 August, 2019; v1 submitted 6 July, 2017; originally announced July 2017.
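The resampling idea in the abstract above, where each particle independently draws a Poisson number of children, can be sketched as follows. The choice of Poisson mean (target population size times the particle's normalized weight), the function names and the Knuth-style sampler are assumptions made for illustration; the paper's actual scheme may set the rates differently.

```python
import math
import random


def poisson(lam, rng):
    """Draw from Poisson(lam) with Knuth's multiplication method.

    Adequate for moderate lam; exp(-lam) underflows for very large lam.
    """
    threshold = math.exp(-lam)
    count, product = 0, 1.0
    while True:
        product *= rng.random()
        if product <= threshold:
            return count
        count += 1


def poisson_resample(particles, weights, target_size, rng):
    """Give each particle an independent Poisson number of children.

    Assumed rate: target_size * (weight / total weight). The size of the
    new population is random, with expected value target_size.
    """
    total = sum(weights)
    children = []
    for particle, weight in zip(particles, weights):
        n_children = poisson(target_size * weight / total, rng)
        children.extend([particle] * n_children)
    return children


rng = random.Random(0)
new_population = poisson_resample(["a", "b", "c"], [0.0, 1.0, 3.0], 100, rng)
```

Because each particle's offspring count is drawn without reference to the others, the loop body can run independently per particle, which is the property the abstract points to as making parallelization easy.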

arXiv:1612.07497 [pdf, other]

Sparse estimation in Ising Model via penalized Monte Carlo methods

Authors: Błażej Miasojedow, Wojciech Rejchel

Abstract: We consider a problem of model selection in high-dimensional binary Markov random fields. The usefulness of the Ising model in studying systems of complex interactions has been confirmed in many papers. The main drawback of this model is the intractable norming constant that makes estimation of parameters very challenging. In the paper we propose a Lasso penalized version of the Monte Carlo maximum likelihood method. We prove that our algorithm, under mild regularity conditions, recognizes the true dependence structure of the graph with high probability. The efficiency of the proposed method is also investigated via simulation studies.

Submitted 10 December, 2018; v1 submitted 22 December, 2016; originally announced December 2016.

arXiv:1606.08160 [pdf, ps, other]

Geometric ergodicity of Rao and Teh's algorithm for Markov jump processes and CTBNs

Authors: Błażej Miasojedow, Wojciech Niemiro

Abstract: Rao and Teh (2012, 2013) introduced an efficient MCMC algorithm for sampling from the posterior distribution of a hidden Markov jump process. The algorithm is based on the idea of sampling virtual jumps. In the present paper we show that the Markov chain generated by Rao and Teh's algorithm is geometrically ergodic. To this end we establish a geometric drift condition towards a small set. A similar result is also proved for a special version of the algorithm, used for probabilistic inference in Continuous Time Bayesian Networks.

Submitted 27 June, 2016; originally announced June 2016.

arXiv:1602.08861 [pdf, other]

Bayesian inference for age-structured population model of infectious disease with application to varicella in Poland

Authors: Piotr Gwiazda, Błażej Miasojedow, Magdalena Rosińska

Abstract: The dynamics of infectious disease transmission is often best understood by taking into account the structure of the population with respect to specific features, for example age or immunity level. The practical utility of such models depends on appropriate calibration with the observed data. Here, we discuss the Bayesian approach to data assimilation in the case of a two-state age-structured model. Models of this kind are frequently used to describe the disease dynamics (i.e. force of infection) based on prevalence data collected at several time points. We demonstrate that, in the case when the explicit solution to the model equation is known, accounting for the data collection process in the Bayesian framework makes it possible to obtain an unbiased posterior distribution for the parameters determining the force of infection. We further show analytically and through numerical tests that the posterior distribution of these parameters is stable with respect to cohort approximation (Escalator Boxcar Train) to the solution. Finally, we apply the technique to calibrate the model based on observed sero-prevalence of varicella in Poland.

Submitted 20 June, 2016; v1 submitted 29 February, 2016; originally announced February 2016.

arXiv:1512.00736 [pdf, ps, other]

Geometric ergodicity of Rao and Teh's algorithm for Markov jump processes

Authors: Błażej Miasojedow, Wojciech Niemiro

Abstract: Rao and Teh (2013) introduced an efficient MCMC algorithm for sampling from the posterior distribution of a hidden Markov jump process. The algorithm is based on the idea of sampling virtual jumps. In the present paper we show that the Markov chain generated by Rao and Teh's algorithm is geometrically ergodic. To this end we establish a geometric drift condition towards a small set.

Submitted 2 December, 2015; originally announced December 2015.

arXiv:1505.01434 [pdf, other]

Particle Gibbs algorithms for Markov jump processes

Authors: Blazej Miasojedow, Wojciech Niemiro

Abstract: In the present paper we propose a new MCMC algorithm for sampling from the posterior distribution of the hidden trajectory of a Markov jump process. Our algorithm is based on the idea of exploiting virtual jumps, introduced by Rao and Teh (2013). The main novelty is that our algorithm uses particle Gibbs with ancestor sampling (PGAS) to update the skeleton, while Rao and Teh use forward filtering backward sampling (FFBS). In contrast to previous methods, our algorithm can be implemented even if the state space is infinite. In addition, the cost of a single step of the proposed algorithm does not depend on the size of the state space. The computational cost of our method is of order $\mathcal{O}(N\mathbb{E}(n))$, where $N$ is the number of particles used in the PGAS algorithm and $\mathbb{E}(n)$ is the expected number of jumps (together with virtual ones). The cost of the algorithm of Rao and Teh is of order $\mathcal{O}(|\mathcal{X}|^2\mathbb{E}(n))$, where $|\mathcal{X}|$ is the size of the state space. Simulation results show that our algorithm with PGAS converges slightly slower than the algorithm with FFBS if the size of the state space is not big. However, if the size of the state space increases, the proposed method outperforms existing ones. We give special attention to a hierarchical version of our algorithm which can be applied to continuous time Bayesian networks (CTBNs).

Submitted 6 May, 2015; originally announced May 2015.

arXiv:1412.6371 [pdf, ps, other]

Asymptotics of Monte Carlo maximum likelihood estimators

Authors: Blazej Miasojedow, Wojciech Niemiro, Jan Palczewski, Wojciech Rejchel

Abstract: We describe Monte Carlo approximation to the maximum likelihood estimator in models with intractable norming constants and explanatory variables. We consider both sources of randomness (due to the initial sample and to Monte Carlo simulations) and prove asymptotical normality of the estimator.

Submitted 19 December, 2014; originally announced December 2014.

Journal ref: Probability and Mathematical Statistics, 36(2), 2016

arXiv:1412.6370 [pdf, ps, other]

doi 10.1007/978-3-319-18781-5_14

Adaptive Monte Carlo Maximum Likelihood

Authors: Blazej Miasojedow, Wojciech Niemiro, Jan Palczewski, Wojciech Rejchel

Abstract: We consider Monte Carlo approximations to the maximum likelihood estimator in models with intractable norming constants. This paper deals with adaptive Monte Carlo algorithms, which adjust control parameters in the course of simulation. We examine asymptotics of adaptive importance sampling and a new algorithm, which uses resampling and MCMC. This algorithm is designed to reduce problems with degeneracy of importance weights. Our analysis is based on martingale limit theorems. We also describe how adaptive maximization algorithms of Newton-Raphson type can be combined with the resampling techniques. The paper includes results of a small scale simulation study in which we compare the performance of adaptive and non-adaptive Monte Carlo maximum likelihood algorithms.

Submitted 19 December, 2014; originally announced December 2014.

arXiv:1406.3218 [pdf, other]

State dependent swap strategies and adaptive adjusting of number of temperatures in Parallel Tempering algorithms

Authors: Mateusz Krzysztof Łącki, Błażej Miasojedow

Abstract: In this paper we present extensions to the original adaptive parallel tempering algorithm. Two different approaches are presented. In the first one we introduce state-dependent strategies using current information to perform a swap step. It encompasses a wide family of potential moves including the standard one and Equi Energy type move, without any loss in tractability. In the second one, we introduce online adjustment of the number of temperatures. Numerical experiments demonstrate the effectiveness of the proposed method.

Submitted 24 June, 2014; v1 submitted 12 June, 2014; originally announced June 2014.

arXiv:1403.4035 [pdf, other]

Metropolis-type algorithms for Continuous Time Bayesian Networks

Authors: Blazej Miasojedow, Wojciech Niemiro, John Noble, Krzysztof Opalski

Abstract: We present a Metropolis-Hastings Markov chain Monte Carlo (MCMC) algorithm for detecting hidden variables in a continuous time Bayesian network (CTBN), which uses reversible jumps in the sense defined by Green (1995). In common with several Monte Carlo algorithms, one of the most recent and important being that of Rao and Teh (2013), our algorithm exploits uniformization techniques under which a continuous time Markov process can be represented as a marked Poisson process. We exploit this in a novel way. We show that our MCMC algorithm can be more efficient than those of likelihood weighting type, as in Nodelman et al. (2003) and Fan et al. (2010), and that our algorithm broadens the class of important examples that can be treated effectively.

Submitted 17 March, 2014; originally announced March 2014.

Comments: 3 figures

arXiv:1212.5517 [pdf, ps, other]

doi 10.3150/13-BEJ546

Optimal scaling for the transient phase of Metropolis Hastings algorithms: The longtime behavior

Authors: Benjamin Jourdain, Tony Lelièvre, Błażej Miasojedow

Abstract: We consider the Random Walk Metropolis algorithm on $\mathbb{R}^n$ with Gaussian proposals, and when the target probability measure is the $n$-fold product of a one-dimensional law. It is well known (see Roberts et al. (Ann. Appl. Probab. 7 (1997) 110-120)) that, in the limit $n\to\infty$, starting at equilibrium and for an appropriate scaling of the variance and of the timescale as a function of the dimension $n$, a diffusive limit is obtained for each component of the Markov chain. In Jourdain et al. (Optimal scaling for the transient phase of the random walk Metropolis algorithm: The mean-field limit (2012) Preprint), we generalize this result when the initial distribution is not the target probability measure. The obtained diffusive limit is the solution to a stochastic differential equation nonlinear in the sense of McKean. In the present paper, we prove convergence to equilibrium for this equation. We discuss practical counterparts in order to optimize the variance of the proposal distribution to accelerate convergence to equilibrium. Our analysis confirms the interest of the constant acceptance rate strategy (with acceptance rate between $1/4$ and $1/3$) first suggested in Roberts et al. (Ann. Appl. Probab. 7 (1997) 110-120). We also address scaling of the Metropolis-Adjusted Langevin Algorithm. When starting at equilibrium, a diffusive limit for an optimal scaling of the variance is obtained in Roberts and Rosenthal (J. R. Stat. Soc. Ser. B. Stat. Methodol. 60 (1998) 255-268). In the transient case, we obtain formally that the optimal variance scales very differently in $n$ depending on the sign of a moment of the distribution, which vanishes at equilibrium. This suggests that it is difficult to derive practical recommendations for MALA from such asymptotic results.

Submitted 21 October, 2014; v1 submitted 21 December, 2012; originally announced December 2012.

Comments: Published at http://dx.doi.org/10.3150/13-BEJ546 in Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

Report number: IMS-BEJ-BEJ546

Journal ref: Bernoulli, Vol. 20, No. 4, 1930-1978 (2014)

arXiv:1210.7639 [pdf, ps, other]

doi 10.1214/14-AAP1048

Optimal scaling for the transient phase of the random walk Metropolis algorithm: The mean-field limit

Authors: Benjamin Jourdain, Tony Lelièvre, Błażej Miasojedow

Abstract: We consider the random walk Metropolis algorithm on $\mathbb{R}^n$ with Gaussian proposals, and when the target probability measure is the $n$-fold product of a one-dimensional law. In the limit $n\to\infty$, it is well known (see [Ann. Appl. Probab. 7 (1997) 110-120]) that, when the variance of the proposal scales inversely proportional to the dimension $n$ whereas time is accelerated by the factor $n$, a diffusive limit is obtained for each component of the Markov chain if this chain starts at equilibrium. This paper extends this result when the initial distribution is not the target probability measure. Remarking that the interaction between the components of the chain due to the common acceptance/rejection of the proposed moves is of mean-field type, we obtain a propagation of chaos result under the same scaling as in the stationary case. This proves that, in terms of the dimension $n$, the same scaling holds for the transient phase of the Metropolis-Hastings algorithm as near stationarity. The diffusive and mean-field limit of each component is a diffusion process nonlinear in the sense of McKean. This opens the route to new investigations of the optimal choice for the variance of the proposal distribution in order to accelerate convergence to equilibrium (see [Optimal scaling for the transient phase of Metropolis-Hastings algorithms: The longtime behavior Bernoulli (2014) To appear]).

Submitted 19 June, 2015; v1 submitted 29 October, 2012; originally announced October 2012.

Comments: Published at http://dx.doi.org/10.1214/14-AAP1048 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AAP-AAP1048

Journal ref: Annals of Applied Probability 2015, Vol. 25, No. 4, 2263-2300

arXiv:1205.1076 [pdf, ps, other]

Adaptive parallel tempering algorithm

Authors: Blazej Miasojedow, Eric Moulines, Matti Vihola

Abstract: Parallel tempering is a generic Markov chain Monte Carlo sampling method which allows good mixing with multimodal target distributions, where conventional Metropolis-Hastings algorithms often fail. The mixing properties of the sampler depend strongly on the choice of tuning parameters, such as the temperature schedule and the proposal distribution used for local exploration. We propose an adaptive algorithm which tunes both the temperature schedule and the parameters of the random-walk Metropolis kernel automatically. We prove the convergence of the adaptation and a strong law of large numbers for the algorithm. We illustrate the performance of our method with examples. Our empirical findings indicate that the algorithm can cope well with different kinds of scenarios without prior tuning.

Submitted 4 May, 2012; originally announced May 2012.

Comments: 33 pages, 3 figures

arXiv:1201.2265 [pdf, ps, other]

Hoeffding's inequalities for geometrically ergodic Markov chains on general state space

Authors: Błażej Miasojedow

Abstract: We consider a Markov chain with a spectral gap in $L^2$ space. Assume that $f$ is a bounded function. Then the probabilities of large deviations of the average along a trajectory satisfy Hoeffding-type inequalities. These bounds depend only on the stationary mean, the spectral gap and the end-points of the support of $f$.

Submitted 13 June, 2013; v1 submitted 11 January, 2012; originally announced January 2012.

MSC Class: 60J05 ACM Class: G.3

arXiv:1106.4739 [pdf, ps, other]

doi 10.3150/12-BEJ442

Nonasymptotic bounds on the estimation error of MCMC algorithms

Authors: Krzysztof Łatuszyński, Błażej Miasojedow, Wojciech Niemiro

Abstract: We address the problem of upper bounding the mean square error of MCMC estimators. Our analysis is nonasymptotic. We first establish a general result valid for essentially all ergodic Markov chains encountered in Bayesian computation and a possibly unbounded target function $f$. The bound is sharp in the sense that the leading term is exactly $σ_{\mathrm {as}}^2(P,f)/n$, where $σ_{\mathrm{as}}^2(P,f)$ is the CLT asymptotic variance. Next, we proceed to specific additional assumptions and give explicit computable bounds for geometrically and polynomially ergodic Markov chains under quantitative drift conditions. As a corollary, we provide results on confidence estimation.

Submitted 11 December, 2013; v1 submitted 23 June, 2011; originally announced June 2011.

Comments: Published at http://dx.doi.org/10.3150/12-BEJ442 in Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm). arXiv admin note: text overlap with arXiv:0907.4915

Report number: IMS-BEJ-BEJ442

Journal ref: Bernoulli 2013, Vol. 19, No. 5A, 2033-2066

arXiv:1101.5837 [pdf, ps, other]

Nonasymptotic bounds on the mean square error for MCMC estimates via renewal techniques

Authors: Krzysztof Latuszynski, Blazej Miasojedow, Wojciech Niemiro

Abstract: Nummelin's split chain construction allows one to decompose a Markov chain Monte Carlo (MCMC) trajectory into i.i.d. "excursions". Regenerative MCMC algorithms based on this technique use a random number of samples. They have been proposed as a promising alternative to the usual fixed length simulation [25, 33, 14]. In this note we derive nonasymptotic bounds on the mean square error (MSE) of regenerative MCMC estimates via techniques of renewal theory and sequential statistics. These results are applied to construct confidence intervals. We then focus on two cases of particular interest: chains satisfying the Doeblin condition and a geometric drift condition. Available explicit nonasymptotic results are compared for different schemes of MCMC simulation.

Submitted 12 May, 2011; v1 submitted 30 January, 2011; originally announced January 2011.

Journal ref: for MCQMC 2010 Conference Proceeding

arXiv:0907.4915 [pdf, other]

Nonasymptotic bounds on the estimation error for regenerative MCMC algorithms

Authors: Krzysztof Latuszynski, Blazej Miasojedow, Wojciech Niemiro

Abstract: MCMC methods are used in Bayesian statistics not only to sample from posterior distributions but also to estimate expectations. Underlying functions are most often defined on a continuous state space and can be unbounded. We consider a regenerative setting and Monte Carlo estimators based on i.i.d. blocks of a Markov chain trajectory. The main result is an inequality for the mean square error. We also consider confidence bounds. We first derive the results in terms of the asymptotic variance and then bound the asymptotic variance for both uniformly ergodic and geometrically ergodic Markov chains.

Submitted 28 July, 2009; originally announced July 2009.

Comments: 29 pages, 2 figures

Report number: CRiSM research paper 09-23

Showing 1–35 of 35 results for author: Miasojedow, B