Skip to main content

Showing 1–50 of 61 results for author: Kontoyiannis, I

.
  1. arXiv:2404.09605  [pdf, ps, other

    cs.IT math.ST

    Finite-sample expansions for the optimal error probability in asymmetric binary hypothesis testing

    Authors: Valentinian Lungu, Ioannis Kontoyiannis

    Abstract: The problem of binary hypothesis testing between two probability measures is considered. New sharp bounds are derived for the best achievable error probability of such tests based on independent and identically distributed observations. Specifically, the asymmetric version of the problem is examined, where different requirements are placed on the two error probabilities. Accurate nonasymptotic exp… ▽ More

    Submitted 29 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2404.06632  [pdf, other

    math.PR cs.IT

    Relative entropy bounds for sampling with and without replacement

    Authors: Oliver Johnson, Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: Sharp, nonasymptotic bounds are obtained for the relative entropy between the distributions of sampling with and without replacement from an urn with balls of $c\geq 2$ colors. Our bounds are asymptotically tight in certain regimes and, unlike previous results, they depend on the number of balls of each colour in the urn. The connection of these results with finite de Finetti-style theorems is exp… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 17 pages, 1 figure

    MSC Class: 60E05 (Primary) 60G09 (Secondary)

  3. arXiv:2403.07209  [pdf, ps, other

    cs.IT math.PR

    The entropic doubling constant and robustness of Gaussian codebooks for additive-noise channels

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: Entropy comparison inequalities are obtained for the differential entropy $h(X+Y)$ of the sum of two independent random vectors $X,Y$, when one is replaced by a Gaussian. For identically distributed random vectors $X,Y$, these are closely related to bounds on the entropic doubling constant, which quantifies the entropy increase when adding an independent copy of a random vector to itself. Conseque… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 23 pages, no figures

  4. arXiv:2308.00913  [pdf, other

    stat.ME econ.EM stat.ML

    The Bayesian Context Trees State Space Model for time series modelling and forecasting

    Authors: Ioannis Papageorgiou, Ioannis Kontoyiannis

    Abstract: A hierarchical Bayesian framework is introduced for develo** rich mixture models for real-valued time series, partly motivated by important applications in financial time series analysis. At the top level, meaningful discrete states are identified as appropriately quantised values of some of the most recent samples. These observable states are described as a discrete context-tree model. At the b… ▽ More

    Submitted 10 October, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2106.03023

  5. arXiv:2305.14131  [pdf, other

    stat.ME cs.IT math.ST q-bio.NC

    Temporally Causal Discovery Tests for Discrete Time Series and Neural Spike Trains

    Authors: A. Theocharous, G. G. Gregoriou, P. Sapountzis, I. Kontoyiannis

    Abstract: We consider the problem of detecting causal relationships between discrete time series, in the presence of potential confounders. A hypothesis test is introduced for identifying the temporally causal influence of $(x_n)$ on $(y_n)$, causally conditioned on a possibly confounding third time series $(z_n)$. Under natural Markovian modeling assumptions, it is shown that the null hypothesis, correspon… ▽ More

    Submitted 17 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 31 pages, 4 figures

  6. arXiv:2305.05931  [pdf, other

    math.PR stat.ME

    Generalised shot noise representations of stochastic systems driven by non-Gaussian Lévy processes

    Authors: Marcos Tapia Costa, Ioannis Kontoyiannis, Simon Godsill

    Abstract: We consider the problem of obtaining effective representations for the solutions of linear, vector-valued stochastic differential equations (SDEs) driven by non-Gaussian pure-jump Lévy processes, and we show how such representations lead to efficient simulation methods. The processes considered constitute a broad class of models that find application across the physical and biological sciences, ma… ▽ More

    Submitted 7 November, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: 34 pages, 14 figures

  7. arXiv:2304.05360  [pdf, ps, other

    cs.IT math.PR quant-ph

    A Third Information-Theoretic Approach to Finite de Finetti Theorems

    Authors: Mario Berta, Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: A new finite form of de Finetti's representation theorem is established using elementary information-theoretic tools. The distribution of the first $k$ random variables in an exchangeable vector of $n\geq k$ random variables is close to a mixture of product distributions. Closeness is measured in terms of the relative entropy and an explicit bound is provided. This bound is tighter than those obta… ▽ More

    Submitted 25 April, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: 11 pages, no figures. In the second version the introduction is slightly extended, two new references and Section 2.4 have been added

  8. arXiv:2212.06705  [pdf, other

    stat.ME cs.IT

    Truly Bayesian Entropy Estimation

    Authors: Ioannis Papageorgiou, Ioannis Kontoyiannis

    Abstract: Estimating the entropy rate of discrete time series is a challenging problem with important applications in numerous areas including neuroscience, genomics, image processing and natural language processing. A number of approaches have been developed for this task, typically based either on universal data compression algorithms, or on statistical estimators of the underlying process distribution. I… ▽ More

    Submitted 21 March, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.02239

  9. arXiv:2211.02676  [pdf, other

    cs.IT math.ST

    Context-tree weighting and Bayesian Context Trees: Asymptotic and non-asymptotic justifications

    Authors: Ioannis Kontoyiannis

    Abstract: The Bayesian Context Trees (BCT) framework is a recently introduced, general collection of statistical and algorithmic tools for modelling, analysis and inference with discrete-valued time series. The foundation of this development is built in part on some well-known information-theoretic ideas and techniques, including Rissanen's tree sources and Willems et al.'s context-tree weighting algorithm.… ▽ More

    Submitted 5 September, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Minor corrections, references added. To appear in the IEEE Transactions on Information Theory

  10. arXiv:2204.05033  [pdf, ps, other

    math.PR cs.IT

    Information in probability: Another information-theoretic proof of a finite de Finetti theorem

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: We recall some of the history of the information-theoretic approach to deriving core results in probability theory and indicate parts of the recent resurgence of interest in this area with current progress along several interesting directions. Then we give a new information-theoretic proof of a finite version of de Finetti's classical representation theorem for finite-valued random variables. We d… ▽ More

    Submitted 26 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Final version, to be published as part of a Festschrift volume in the Springer "Lecture Notes in Mathematics" series

  11. arXiv:2203.04341  [pdf, other

    stat.ME stat.AP stat.ML

    Change-point Detection and Segmentation of Discrete Data using Bayesian Context Trees

    Authors: Valentinian Lungu, Ioannis Papageorgiou, Ioannis Kontoyiannis

    Abstract: A new Bayesian modelling framework is introduced for piece-wise homogeneous variable-memory Markov chains, along with a collection of effective algorithmic tools for change-point detection and segmentation of discrete time series. Building on the recently introduced Bayesian Context Trees (BCT) framework, the distributions of different segments in a discrete time series are described as variable-m… ▽ More

    Submitted 13 May, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Link to R-package: https://CRAN.R-project.org/package=BCT

  12. arXiv:2202.02239  [pdf, other

    stat.ME cs.IT math.ST

    Posterior Representations for Bayesian Context Trees: Sampling, Estimation and Convergence

    Authors: Ioannis Papageorgiou, Ioannis Kontoyiannis

    Abstract: We revisit the Bayesian Context Trees (BCT) modelling framework for discrete time series, which was recently found to be very effective in numerous tasks including model selection, estimation and prediction. A novel representation of the induced posterior distribution on model space is derived in terms of a simple branching process, and several consequences of this are explored in theory and in pr… ▽ More

    Submitted 20 March, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

  13. arXiv:2110.14427  [pdf, other

    math.ST cs.LG

    The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning

    Authors: Vivek Borkar, Shuhang Chen, Adithya Devraj, Ioannis Kontoyiannis, Sean Meyn

    Abstract: The paper concerns the stochastic approximation recursion, \[ θ_{n+1}= θ_n + α_{n + 1} f(θ_n, Φ_{n+1}) \,,\quad n\ge 0, \] where the {\em estimates} $θ_n\in\Re^d$ and $ \{ Φ_n \}$ is a Markov chain on a general state space. In addition to standard Lipschitz assumptions and conditions on the vanishing step-size sequence, it is assumed that the associated \textit{mean flow}… ▽ More

    Submitted 21 February, 2024; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 2 figures

    MSC Class: 62L20; 60F17; 68T05

  14. arXiv:2106.03023  [pdf, other

    stat.ME stat.AP stat.ML

    Context-tree weighting for real-valued time series: Bayesian inference with hierarchical mixture models

    Authors: Ioannis Papageorgiou, Ioannis Kontoyiannis

    Abstract: Real-valued time series are ubiquitous in the sciences and engineering. In this work, a general, hierarchical Bayesian modelling framework is developed for building mixture models for times series. This development is based, in part, on the use of context trees, and it includes a collection of effective algorithmic tools for learning and inference. A discrete context (or 'state') is extracted for… ▽ More

    Submitted 14 April, 2023; v1 submitted 5 June, 2021; originally announced June 2021.

  15. arXiv:2106.00514  [pdf, ps, other

    math.PR cs.IT

    Entropy and the Discrete Central Limit Theorem

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: A strengthened version of the central limit theorem for discrete random variables is established, relying only on information-theoretic tools and elementary arguments. It is shown that the relative entropy between the standardised sum of $n$ independent and identically distributed lattice random variables and an appropriately discretised Gaussian, vanishes as $n\to\infty$.

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 15 pages

    MSC Class: 60F05; 94A17; 60E15

  16. arXiv:2105.13762  [pdf, other

    cs.LG cs.SI stat.AP

    The Feature-First Block Model

    Authors: Lawrence Tray, Ioannis Kontoyiannis

    Abstract: Labelled networks are an important class of data, naturally appearing in numerous applications in science and engineering. A typical inference goal is to determine how the vertex labels (or features) affect the network's structure. In this work, we introduce a new generative model, the feature-first block model (FFBM), that facilitates the use of rich queries on labelled networks. We develop a Bay… ▽ More

    Submitted 16 November, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  17. arXiv:2104.06857  [pdf

    q-bio.PE math.PR

    Population-scale testing can suppress the spread of infectious disease

    Authors: Jussi Taipale, Ioannis Kontoyiannis, Sten Linnarsson

    Abstract: Major advances in public health have resulted from disease prevention. However, prevention of a new infectious disease by vaccination or pharmaceuticals is made difficult by the slow process of vaccine and drug development. We propose an additional intervention that allows rapid control of emerging infectious diseases, and can also be used to eradicate diseases that rely almost exclusively on huma… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: This paper is based, in part, on an earlier manuscript, that appears as medRxiv 2020.04.27.20078329. This is a significantly extended version, including a new and more extensive mathematical analysis. The present manuscript was written in September 2020. The form included here includes some additional bibliographical references

  18. arXiv:2104.03882  [pdf, ps, other

    cs.IT math.PR

    An Information-Theoretic Proof of a Finite de Finetti Theorem

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: A finite form of de Finetti's representation theorem is established using elementary information-theoretic tools: The distribution of the first $k$ random variables in an exchangeable binary vector of length $n\geq k$ is close to a mixture of product distributions. Closeness is measured in terms of the relative entropy and an explicit bound is provided.

    Submitted 25 June, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: 5 pages. Revised version with some minor typos fixed and discussion slightly expanded

  19. arXiv:2007.15981  [pdf, other

    cs.IT math.CO math.PR

    Compression and Symmetry of Small-World Graphs and Structures

    Authors: Ioannis Kontoyiannis, Yi Heng Lim, Katia Papakonstantinopoulou, Wojtek Szpankowski

    Abstract: For various purposes and, in particular, in the context of data compression, a graph can be examined at three levels. Its structure can be described as the unlabeled version of the graph; then the labeling of its structure can be added; and finally, given then structure and labeling, the contents of the labels can be described. Determining the amount of information present at each level and quanti… ▽ More

    Submitted 22 November, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

    Comments: 21 pages, 1 figure

  20. arXiv:2007.14900  [pdf, other

    stat.ME cs.IT stat.AP stat.CO

    Bayesian Context Trees: Modelling and exact inference for discrete time series

    Authors: Ioannis Kontoyiannis, Lambros Mertzanis, Athina Panotopoulou, Ioannis Papageorgiou, Maria Skoularidou

    Abstract: We develop a new Bayesian modelling framework for the class of higher-order, variable-memory Markov chains, and introduce an associated collection of methodological tools for exact inference with discrete time series. We show that a version of the context tree weighting algorithm can compute the prior predictive likelihood exactly (averaged over both models and parameters), and two related algorit… ▽ More

    Submitted 6 February, 2022; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 53 pages, 22 figures, small stylistic changes. The associated R package "BCT" is available at CRAN.R-project.org/package=BCT

  21. Sharp Second-Order Pointwise Asymptotics for Lossless Compression with Side Information

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: The problem of determining the best achievable performance of arbitrary lossless compression algorithms is examined, when correlated side information is available at both the encoder and decoder. For arbitrary source-side information pairs, the conditional information density is shown to provide a sharp asymptotic lower bound for the description lengths achieved by an arbitrary sequence of compres… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: 20 pages, no figures. Based on part of arXiv:1912.05734v1

  22. arXiv:2001.05513  [pdf, other

    math.ST stat.ME stat.ML

    Optimal rates for independence testing via $U$-statistic permutation tests

    Authors: Thomas B. Berrett, Ioannis Kontoyiannis, Richard J. Samworth

    Abstract: We study the problem of independence testing given independent and identically distributed pairs taking values in a $σ$-finite, separable measure space. Defining a natural measure of dependence $D(f)$ as the squared $L^2$-distance between a joint density $f$ and the product of its marginals, we first show that there is no valid test of independence that is uniformly consistent against alternatives… ▽ More

    Submitted 6 November, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: 58 pages, 4 figures

    MSC Class: 62C20; 62G10; 62H20

  23. arXiv:1912.12524  [pdf, other

    math.PR cs.IT stat.ME

    The Lévy State Space Model

    Authors: Simon Godsill, Marina Riabiz, Ioannis Kontoyiannis

    Abstract: In this paper we introduce a new class of state space models based on shot-noise simulation representations of non-Gaussian Lévy-driven linear systems, represented as stochastic differential equations. In particular a conditionally Gaussian version of the models is proposed that is able to capture heavy-tailed non-Gaussianity while retaining tractability for inference procedures. We focus on a can… ▽ More

    Submitted 8 January, 2020; v1 submitted 28 December, 2019; originally announced December 2019.

    Comments: V1:8 pages, 4 figures. V2: References updated

  24. arXiv:1912.05734  [pdf, other

    cs.IT

    Fundamental Limits of Lossless Data Compression with Side Information

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: The problem of lossless data compression with side information available to both the encoder and the decoder is considered. The finite-blocklength fundamental limits of the best achievable performance are defined, in two different versions of the problem: Reference-based compression, when a single side information string is used repeatedly in compressing different source messages, and pair-based c… ▽ More

    Submitted 21 February, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: 26 pages, 1 figure. Revised, shorter, final version, focusing primarily on nonasymptotic results. This version has been accepted for publication in IEEE Transactions on Information Theory

  25. arXiv:1812.11137  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Differential Temporal Difference Learning

    Authors: Adithya M. Devraj, Ioannis Kontoyiannis, Sean P. Meyn

    Abstract: Value functions derived from Markov decision processes arise as a central component of algorithms as well as performance metrics in many statistics and engineering applications of machine learning techniques. Computation of the solution to the associated Bellman equations is challenging in most practical cases of interest. A popular class of approximation techniques, known as Temporal Difference (… ▽ More

    Submitted 27 February, 2020; v1 submitted 28 December, 2018; originally announced December 2018.

    Comments: Preliminary versions of some of the results in this article were submitted as arXiv:1604.01828

    MSC Class: 93E20; 93E35; 60J20

  26. arXiv:1808.03830  [pdf, other

    cs.IT math.PR

    A Simple Network of Nodes Moving on the Circle

    Authors: Dimitris Cheliotis, Ioannis Kontoyiannis, Michail Loulakis, Stavros Toumpis

    Abstract: Two simple Markov processes are examined, one in discrete and one in continuous time, arising from idealized versions of a transmission protocol for mobile, delay-tolerant networks. We consider two independent walkers moving with constant speed on either the discrete or continuous circle, and changing directions at independent geometric (respectively, exponential) times. One of the walkers carries… ▽ More

    Submitted 4 March, 2020; v1 submitted 11 August, 2018; originally announced August 2018.

    Comments: Preliminary versions of some of the present results appeared in ISIT 2017 and SPAWC 2018

    Journal ref: Random Structures & Algorithms 57 (2), 317-338 (2020)

  27. arXiv:1802.10065  [pdf, other

    math.PR cs.IT stat.ME

    Nonasymptotic Gaussian Approximation for Inference with Stable Noise

    Authors: Marina Riabiz, Tohid Ardeshiri, Ioannis Kontoyiannis, Simon Godsill

    Abstract: The results of a series of theoretical studies are reported, examining the convergence rate for different approximate representations of $α$-stable distributions. Although they play a key role in modelling random processes with jumps and discontinuities, the use of $α$-stable distributions in inference often leads to analytically intractable problems. The LePage series, which is a probabilistic re… ▽ More

    Submitted 1 January, 2020; v1 submitted 27 February, 2018; originally announced February 2018.

    Comments: V1: 41 pages, 16 figures. V2: Text typos fixed; redundant figures from main text and appendices removed; added references in section I; changed section VI and its proofs in Appendices C-D-E; improved section IX; removed section X (Discussion and Conclusion), reference style changed. V3: title in the metadata updated. V4: Updated Theorem 1 and its proof, global revision

  28. arXiv:1801.02229  [pdf, ps, other

    cs.IT

    Packet Speed and Cost in Mobile Wireless Delay-Tolerant Networks

    Authors: Riccardo Cavallari, Stavros Toumpis, Roberto Verdone, Ioannis Kontoyiannis

    Abstract: A mobile wireless delay-tolerant network (DTN) model is proposed and analyzed, in which infinitely many nodes are initially placed on R^2 according to a uniform Poisson point process (PPP) and subsequently travel, independently of each other, along trajectories comprised of line segments, changing travel direction at time instances that form a Poisson process, each time selecting a new travel dire… ▽ More

    Submitted 28 February, 2018; v1 submitted 7 January, 2018; originally announced January 2018.

    Comments: Submitted to the IEEE Transactions on Information Theory

  29. arXiv:1711.03652  [pdf, ps, other

    math.PR

    Geometric Ergodicity in a Weighted Sobolev Space

    Authors: Adithya Devraj, Ioannis Kontoyiannis, Sean Meyn

    Abstract: For a discrete-time Markov chain $\{X(t)\}$ evolving on $\Re^\ell$ with transition kernel $P$, natural, general conditions are developed under which the following are established: 1. The transition kernel $P$ has a purely discrete spectrum, when viewed as a linear operator on a weighted Sobolev space $L_\infty^{v,1}$ of functions with norm,… ▽ More

    Submitted 18 July, 2019; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: 33 pages; The paper has been accepted for publication in the Annals of Probability

    MSC Class: 60J05; 60J35; 37A30; 47H20

  30. arXiv:1601.04255  [pdf, ps, other

    math.PR

    Thinning and Information Projections

    Authors: Peter Harremoës, Oliver Johnson, Ioannis Kontoyiannis

    Abstract: In this paper we establish lower bounds on information divergence of a distribution on the integers from a Poisson distribution. These lower bounds are tight and in the cases where a rate of convergence in the Law of Thin Numbers can be computed the rate is determined by the lower bounds proved in this paper. General techniques for getting lower bounds in terms of moments are developed. The result… ▽ More

    Submitted 17 January, 2016; originally announced January 2016.

    MSC Class: 60F99; 94A11

  31. arXiv:1512.00523  [pdf, ps, other

    math.PR

    On the $f$-Norm Ergodicity of Markov Processes in Continuous Time

    Authors: I. Kontoyiannis, S. P. Meyn

    Abstract: Consider a Markov process $\{Φ(t) : t\geq 0\}$ evolving on a Polish space ${\sf X}$. A version of the $f$-Norm Ergodic Theorem is obtained: Suppose that the process is $ψ$-irreducible and aperiodic. For a given function $f\colon{\sf X}:\to[1,\infty)$, under suitable conditions on the process the following are equivalent: \begin{enumerate} \item[(i)] There is a unique invariant probability measure… ▽ More

    Submitted 1 December, 2015; originally announced December 2015.

    MSC Class: 60J25; 37A30; 47H99

  32. arXiv:1508.04089  [pdf, ps, other

    cs.IT math.CO math.PR

    Entropy bounds on abelian groups and the Ruzsa divergence

    Authors: Mokshay Madiman, Ioannis Kontoyiannis

    Abstract: Over the past few years, a family of interesting new inequalities for the entropies of sums and differences of random variables has been developed by Ruzsa, Tao and others, motivated by analogous results in additive combinatorics. The present work extends these earlier results to the case of random variables taking values in $\mathbb{R}^n$ or, more generally, in arbitrary locally compact and Polis… ▽ More

    Submitted 26 October, 2015; v1 submitted 17 August, 2015; originally announced August 2015.

    Comments: 26 pages. Changes in v2: Added Section V on entropies of products of random variables, corrected several typos, added some references

    Journal ref: IEEE Transactions on Information Theory, vol. 64, no. 1, pp. 77-92, January 2018

  33. arXiv:1507.01234  [pdf, ps, other

    cs.IT math.ST

    Estimating the Directed Information and Testing for Causality

    Authors: Ioannis Kontoyiannis, Maria Skoularidou

    Abstract: The problem of estimating the directed information rate between two discrete processes $\{X_n\}$ and $\{Y_n\}$ via the plug-in (or maximum-likelihood) estimator is considered. When the joint process $\{(X_n,Y_n)\}$ is a Markov chain of a given memory length, the plug-in estimator is shown to be asymptotically Gaussian and to converge at the optimal rate $O(1/\sqrt{n})$ under appropriate conditions… ▽ More

    Submitted 31 March, 2016; v1 submitted 5 July, 2015; originally announced July 2015.

    Comments: Minor typos corrected, reviewers' comments addressed

  34. arXiv:1212.2668  [pdf, other

    cs.IT math.PR

    Lossless Data Compression at Finite Blocklengths

    Authors: Ioannis Kontoyiannis, Sergio Verdu

    Abstract: This paper provides an extensive study of the behavior of the best achievable rate (and other related fundamental limits) in variable-length lossless compression. In the non-asymptotic regime, the fundamental limits of fixed-to-variable lossless compression with and without prefix constraints are shown to be tightly coupled. Several precise, quantitative bounds are derived, connecting the distribu… ▽ More

    Submitted 11 December, 2012; originally announced December 2012.

  35. arXiv:1206.0489  [pdf, ps, other

    cs.IT math.CO math.PR

    Sumset and Inverse Sumset Inequalities for Differential Entropy and Mutual Information

    Authors: Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: The sumset and inverse sumset theories of Freiman, Plünnecke and Ruzsa, give bounds connecting the cardinality of the sumset $A+B=\{a+b\;;\;a\in A,\,b\in B\}$ of two discrete sets $A,B$, to the cardinalities (or the finer structure) of the original sets $A,B$. For example, the sum-difference bound of Ruzsa states that, $|A+B|\,|A|\,|B|\leq|A-B|^3$, where the difference set… ▽ More

    Submitted 3 June, 2012; originally announced June 2012.

    Comments: 23 pages

    Journal ref: IEEE Transactions on Information Theory, vol. 60, no. 8, pp. 4503-4514, August 2014

  36. arXiv:1008.1355  [pdf, ps, other

    stat.CO math.PR math.ST

    Control Variates for Reversible MCMC Samplers

    Authors: Petros Dellaportas, Ioannis Kontoyiannis

    Abstract: A general methodology is introduced for the construction and effective application of control variates to estimation problems involving data from reversible MCMC samplers. We propose the use of a specific class of functions as control variates, and we introduce a new, consistent estimator for the values of the coefficients of the optimal linear combination of these functions. The form and proposed… ▽ More

    Submitted 7 August, 2010; originally announced August 2010.

    Comments: 44 pages; 6 figures; 5 tables

  37. arXiv:1004.3692  [pdf, ps, other

    math.PR cs.IT

    Compound Poisson Approximation via Information Functionals

    Authors: A. D. Barbour, Oliver Johnson, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: An information-theoretic development is given for the problem of compound Poisson approximation, which parallels earlier treatments for Gaussian and Poisson approximation. Let $P_{S_n}$ be the distribution of a sum $S_n=\Sumn Y_i$ of independent integer-valued random variables $Y_i$. Nonasymptotic bounds are derived for the distance between $P_{S_n}$ and an appropriately chosen compound Poisson la… ▽ More

    Submitted 21 April, 2010; originally announced April 2010.

    Comments: 27 pages

    Journal ref: Electronic Journal of Probability, Vol 15, Paper no. 42, pages 1344-1369, 2010

  38. arXiv:0912.0581  [pdf, ps, other

    math.CO cs.IT math.PR

    Log-concavity, ultra-log-concavity, and a maximum entropy property of discrete compound Poisson measures

    Authors: Oliver Johnson, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: Sufficient conditions are developed, under which the compound Poisson distribution has maximal entropy within a natural class of probability measures on the nonnegative integers. Recently, one of the authors [O. Johnson, {\em Stoch. Proc. Appl.}, 2007] used a semigroup approach to show that the Poisson has maximal entropy among all ultra-log-concave distributions with fixed mean. We show via a non… ▽ More

    Submitted 27 September, 2011; v1 submitted 3 December, 2009; originally announced December 2009.

    Comments: 30 pages. This submission supersedes arXiv:0805.4112v1. Changes in v2: Updated references, typos corrected

    MSC Class: 94A17; 60E07; 60E15

    Journal ref: Discrete Applied Mathematics, vol 161/9, pages 1232-1250, 2013

  39. arXiv:0907.4160  [pdf, ps, other

    stat.CO math.PR

    Notes on Using Control Variates for Estimation with Reversible MCMC Samplers

    Authors: Ioannis Kontoyiannis, Petros Dellaportas

    Abstract: A general methodology is presented for the construction and effective use of control variates for reversible MCMC samplers. The values of the coefficients of the optimal linear combination of the control variates are computed, and adaptive, consistent MCMC estimators are derived for these optimal coefficients. All methodological and asymptotic arguments are rigorously justified. Numerous MCMC simu… ▽ More

    Submitted 4 May, 2010; v1 submitted 24 July, 2009; originally announced July 2009.

  40. arXiv:0906.5322  [pdf, ps, other

    math.PR math.SP

    Geometric Ergodicity and the Spectral Gap of Non-Reversible Markov Chains

    Authors: Ioannis Kontoyiannis, Sean P. Meyn

    Abstract: We argue that the spectral theory of non-reversible Markov chains may often be more effectively cast within the framework of the naturally associated weighted-$L_\infty$ space $L_\infty^V$, instead of the usual Hilbert space $L_2=L_2(π)$, where $π$ is the invariant measure of the chain. This observation is, in part, based on the following results. A discrete-time Markov chain with values in a ge… ▽ More

    Submitted 29 June, 2009; originally announced June 2009.

    MSC Class: 60J05; 60J10; 37A30; 37A25

  41. Thinning, Entropy and the Law of Thin Numbers

    Authors: Peter Harremoes, Oliver Johnson, Ioannis Kontoyiannis

    Abstract: Renyi's "thinning" operation on a discrete random variable is a natural discrete analog of the scaling operation for continuous random variables. The properties of thinning are investigated in an information-theoretic context, especially in connection with information-theoretic inequalities related to Poisson approximation results. The classical Binomial-to-Poisson convergence (sometimes referre… ▽ More

    Submitted 3 June, 2009; originally announced June 2009.

    Journal ref: IEEE Transactions on Information Theory, Vol 56/9, 2010, pages 4228-4244

  42. arXiv:0906.0259  [pdf, ps, other

    math.PR

    Approximating a Diffusion by a Hidden Markov Model

    Authors: Ioannis Kontoyiannis, Sean P. Meyn

    Abstract: For a wide class of continuous-time Markov processes, including all irreducible hypoelliptic diffusions evolving on an open, connected subset of $\RL^d$, the following are shown to be equivalent: (i) The process satisfies (a slightly weaker version of) the classical Donsker-Varadhan conditions; (ii) The transition semigroup of the process can be approximated by a finite-state hidden Markov model,… ▽ More

    Submitted 25 April, 2016; v1 submitted 1 June, 2009; originally announced June 2009.

    Comments: 28 pages

  43. arXiv:0904.3340  [pdf, ps, other

    cs.IT

    Lossy Compression in Near-Linear Time via Efficient Random Codebooks and Databases

    Authors: Chris Gioran, Ioannis Kontoyiannis

    Abstract: The compression-complexity trade-off of lossy compression algorithms that are based on a random codebook or a random database is examined. Motivated, in part, by recent results of Gupta-Verdú-Weissman (GVW) and their underlying connections with the pattern-matching scheme of Kontoyiannis' lossy Lempel-Ziv algorithm, we introduce a non-universal version of the lossy Lempel-Ziv method (termed LLZ)… ▽ More

    Submitted 21 April, 2009; originally announced April 2009.

    Comments: 23 pages, four figures, four tables

  44. arXiv:0805.4112  [pdf, ps, other

    cs.IT math.PR

    On the entropy and log-concavity of compound Poisson measures

    Authors: Oliver Johnson, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: Motivated, in part, by the desire to develop an information-theoretic foundation for compound Poisson approximation limit theorems (analogous to the corresponding developments for the central limit theorem and for simple Poisson approximation), this work examines sufficient conditions under which the compound Poisson distribution has maximal entropy within a natural class of probability measures… ▽ More

    Submitted 27 May, 2008; originally announced May 2008.

    Report number: Superceded by arXiv:0912.0581 MSC Class: 62B10; 94A17

  45. Estimating the entropy of binary time series: Methodology, some theory and a simulation study

    Authors: Y. Gao, I. Kontoyiannis, E. Bienenstock

    Abstract: Partly motivated by entropy-estimation problems in neuroscience, we present a detailed and extensive comparison between some of the most popular and effective entropy estimation methods used in practice: The plug-in method, four different estimators based on the Lempel-Ziv (LZ) family of data compression algorithms, an estimator based on the Context-Tree Weighting (CTW) method, and the renewal e… ▽ More

    Submitted 29 February, 2008; originally announced February 2008.

    Comments: 34 pages, 3 figures

  46. arXiv:0710.5190  [pdf, ps, other

    q-bio.GN cs.IT

    Identifying statistical dependence in genomic sequences via mutual information estimates

    Authors: H. M. Aktulga, I. Kontoyiannis, L. A. Lyznik, L. Szpankowski, A. Y. Grama, W. Szpankowski

    Abstract: Questions of understanding and quantifying the representation and amount of information in organisms have become a central part of biological research, as they potentially hold the key to fundamental advances. In this paper, we demonstrate the use of information-theoretic tools for the task of identifying segments of biomolecules (DNA or RNA) that are statistically correlated. We develop a preci… ▽ More

    Submitted 26 October, 2007; originally announced October 2007.

    Comments: Preliminary version. Final version in EURASIP Journal on Bioinformatics and Systems Biology. See http://www.hindawi.com/journals/bsb/

  47. arXiv:0710.4117  [pdf, ps, other

    q-bio.NC cs.IT math.PR stat.AP

    From the entropy to the statistical structure of spike trains

    Authors: Yun Gao, Ioannis Kontoyiannis, Elie Bienenstock

    Abstract: We use statistical estimates of the entropy rate of spike train data in order to make inferences about the underlying structure of the spike train itself. We first examine a number of different parametric and nonparametric estimators (some known and some new), including the ``plug-in'' method, several versions of Lempel-Ziv-based compression algorithms, a maximum likelihood estimator tailored to… ▽ More

    Submitted 27 March, 2008; v1 submitted 22 October, 2007; originally announced October 2007.

    Journal ref: In Proceedings of the 2006 International Symposium on Information Theory, Seattle, WA, July 2006

  48. arXiv:0710.4076  [pdf, ps, other

    cs.IT math.NT math.PR

    Some information-theoretic computations related to the distribution of prime numbers

    Authors: Ioannis Kontoyiannis

    Abstract: We illustrate how elementary information-theoretic ideas may be employed to provide proofs for well-known, nontrivial results in number theory. Specifically, we give an elementary and fairly short proof of the following asymptotic result: The sum of (log p)/p, taken over all primes p not exceeding n, is asymptotic to log n as n tends to infinity. We also give finite-n bounds refining the above l… ▽ More

    Submitted 5 November, 2007; v1 submitted 22 October, 2007; originally announced October 2007.

    Comments: 10 pages; see also http://pages.cs.aueb.gr/users/yiannisk/

  49. Estimation of the Rate-Distortion Function

    Authors: M. T. Harrison, I. Kontoyiannis

    Abstract: Motivated by questions in lossy data compression and by theoretical considerations, we examine the problem of estimating the rate-distortion function of an unknown (not necessarily discrete-valued) source from empirical data. Our focus is the behavior of the so-called "plug-in" estimator, which is simply the rate-distortion function of the empirical distribution of the observed data. Sufficient… ▽ More

    Submitted 11 April, 2008; v1 submitted 2 February, 2007; originally announced February 2007.

    Comments: 18 pages, no figures [v2: removed an example with an error; corrected typos; a shortened version will appear in IEEE Trans. Inform. Theory]

    Journal ref: IEEE Transactions on Information Theory, 54 (2008): 3757-3762

  50. arXiv:math/0612040  [pdf, ps, other

    math.PR math.ST stat.CO

    Computable exponential bounds for screened estimation and simulation

    Authors: Ioannis Kontoyiannis, Sean P. Meyn

    Abstract: Suppose the expectation $E(F(X))$ is to be estimated by the empirical averages of the values of $F$ on independent and identically distributed samples $\{X_i\}$. A sampling rule called the "screened" estimator is introduced, and its performance is studied. When the mean $E(U(X))$ of a different function $U$ is known, the estimates are "screened," in that we only consider those which correspond t… ▽ More

    Submitted 22 August, 2008; v1 submitted 1 December, 2006; originally announced December 2006.

    Comments: Published in at http://dx.doi.org/10.1214/00-AAP492 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP492 MSC Class: 60C05; 60F10 (Primary) 60G05; 60E15 (Secondary)

    Journal ref: Annals of Applied Probability 2008, Vol. 18, No. 4, 1491-1518