Showing 1–40 of 40 results for author: Kontoyiannis, I

Searching in archive cs.
  1. arXiv:2404.09605  [pdf, ps, other]

    cs.IT math.ST

    Finite-sample expansions for the optimal error probability in asymmetric binary hypothesis testing

    Authors: Valentinian Lungu, Ioannis Kontoyiannis

    Abstract: The problem of binary hypothesis testing between two probability measures is considered. New sharp bounds are derived for the best achievable error probability of such tests based on independent and identically distributed observations. Specifically, the asymmetric version of the problem is examined, where different requirements are placed on the two error probabilities. Accurate nonasymptotic exp…

    Submitted 29 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2404.06632  [pdf, other]

    math.PR cs.IT

    Relative entropy bounds for sampling with and without replacement

    Authors: Oliver Johnson, Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: Sharp, nonasymptotic bounds are obtained for the relative entropy between the distributions of sampling with and without replacement from an urn with balls of $c\geq 2$ colours. Our bounds are asymptotically tight in certain regimes and, unlike previous results, they depend on the number of balls of each colour in the urn. The connection of these results with finite de Finetti-style theorems is exp…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 17 pages, 1 figure

    MSC Class: 60E05 (Primary) 60G09 (Secondary)
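
    As a rough numerical illustration of the quantity named in the abstract above, the sketch below computes a relative entropy for the two-colour case ($c=2$): $k$ draws without replacement (hypergeometric counts) against $k$ draws with replacement (binomial counts). The direction of the divergence, the function names and the urn sizes are assumptions made for illustration; none of them are taken from the paper, and the paper's bounds are not reproduced here.

```python
from math import comb, log

def hypergeom_pmf(j, n, m, k):
    """P(j red balls in k draws WITHOUT replacement; urn has n balls, m red)."""
    return comb(m, j) * comb(n - m, k - j) / comb(n, k)

def binom_pmf(j, k, p):
    """P(j red balls in k draws WITH replacement; red fraction p)."""
    return comb(k, j) * p**j * (1 - p) ** (k - j)

def relative_entropy_without_vs_with(n, m, k):
    """D(without || with) in nats, computed on the red-ball counts.

    Assumes 0 < m < n and 1 <= k <= n (illustrative restrictions).
    """
    p = m / n
    total = 0.0
    # Only counts j that are possible without replacement contribute.
    for j in range(max(0, k - (n - m)), min(k, m) + 1):
        h = hypergeom_pmf(j, n, m, k)
        if h > 0:
            total += h * log(h / binom_pmf(j, k, p))
    return total

if __name__ == "__main__":
    for k in (1, 10, 50):
        print(k, relative_entropy_without_vs_with(100, 30, k))
```

    A single draw gives zero divergence, and the value grows as the sample size $k$ approaches the urn size $n$.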

  3. arXiv:2403.07209  [pdf, ps, other]

    cs.IT math.PR

    The entropic doubling constant and robustness of Gaussian codebooks for additive-noise channels

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: Entropy comparison inequalities are obtained for the differential entropy $h(X+Y)$ of the sum of two independent random vectors $X,Y$, when one is replaced by a Gaussian. For identically distributed random vectors $X,Y$, these are closely related to bounds on the entropic doubling constant, which quantifies the entropy increase when adding an independent copy of a random vector to itself. Conseque…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 23 pages, no figures

  4. arXiv:2305.14131  [pdf, other]

    stat.ME cs.IT math.ST q-bio.NC

    Temporally Causal Discovery Tests for Discrete Time Series and Neural Spike Trains

    Authors: A. Theocharous, G. G. Gregoriou, P. Sapountzis, I. Kontoyiannis

    Abstract: We consider the problem of detecting causal relationships between discrete time series, in the presence of potential confounders. A hypothesis test is introduced for identifying the temporally causal influence of $(x_n)$ on $(y_n)$, causally conditioned on a possibly confounding third time series $(z_n)$. Under natural Markovian modeling assumptions, it is shown that the null hypothesis, correspon…

    Submitted 17 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 31 pages, 4 figures

  5. arXiv:2304.05360  [pdf, ps, other]

    cs.IT math.PR quant-ph

    A Third Information-Theoretic Approach to Finite de Finetti Theorems

    Authors: Mario Berta, Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: A new finite form of de Finetti's representation theorem is established using elementary information-theoretic tools. The distribution of the first $k$ random variables in an exchangeable vector of $n\geq k$ random variables is close to a mixture of product distributions. Closeness is measured in terms of the relative entropy and an explicit bound is provided. This bound is tighter than those obta…

    Submitted 25 April, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: 11 pages, no figures. In the second version the introduction is slightly extended, two new references and Section 2.4 have been added

  6. arXiv:2212.06705  [pdf, other]

    stat.ME cs.IT

    Truly Bayesian Entropy Estimation

    Authors: Ioannis Papageorgiou, Ioannis Kontoyiannis

    Abstract: Estimating the entropy rate of discrete time series is a challenging problem with important applications in numerous areas including neuroscience, genomics, image processing and natural language processing. A number of approaches have been developed for this task, typically based either on universal data compression algorithms, or on statistical estimators of the underlying process distribution. I…

    Submitted 21 March, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.02239

  7. arXiv:2211.02676  [pdf, other]

    cs.IT math.ST

    Context-tree weighting and Bayesian Context Trees: Asymptotic and non-asymptotic justifications

    Authors: Ioannis Kontoyiannis

    Abstract: The Bayesian Context Trees (BCT) framework is a recently introduced, general collection of statistical and algorithmic tools for modelling, analysis and inference with discrete-valued time series. The foundation of this development is built in part on some well-known information-theoretic ideas and techniques, including Rissanen's tree sources and Willems et al.'s context-tree weighting algorithm.…

    Submitted 5 September, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Minor corrections, references added. To appear in the IEEE Transactions on Information Theory

  8. arXiv:2204.05033  [pdf, ps, other]

    math.PR cs.IT

    Information in probability: Another information-theoretic proof of a finite de Finetti theorem

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: We recall some of the history of the information-theoretic approach to deriving core results in probability theory and indicate parts of the recent resurgence of interest in this area with current progress along several interesting directions. Then we give a new information-theoretic proof of a finite version of de Finetti's classical representation theorem for finite-valued random variables. We d…

    Submitted 26 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Final version, to be published as part of a Festschrift volume in the Springer "Lecture Notes in Mathematics" series

  9. arXiv:2202.02239  [pdf, other]

    stat.ME cs.IT math.ST

    Posterior Representations for Bayesian Context Trees: Sampling, Estimation and Convergence

    Authors: Ioannis Papageorgiou, Ioannis Kontoyiannis

    Abstract: We revisit the Bayesian Context Trees (BCT) modelling framework for discrete time series, which was recently found to be very effective in numerous tasks including model selection, estimation and prediction. A novel representation of the induced posterior distribution on model space is derived in terms of a simple branching process, and several consequences of this are explored in theory and in pr…

    Submitted 20 March, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

  10. arXiv:2110.14427  [pdf, other]

    math.ST cs.LG

    The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning

    Authors: Vivek Borkar, Shuhang Chen, Adithya Devraj, Ioannis Kontoyiannis, Sean Meyn

    Abstract: The paper concerns the stochastic approximation recursion, \[ θ_{n+1} = θ_n + α_{n+1} f(θ_n, Φ_{n+1}), \quad n\ge 0, \] where the estimates $θ_n$ take values in $\Re^d$ and $\{Φ_n\}$ is a Markov chain on a general state space. In addition to standard Lipschitz assumptions and conditions on the vanishing step-size sequence, it is assumed that the associated mean flow…

    Submitted 21 February, 2024; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 2 figures

    MSC Class: 62L20; 60F17; 68T05
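
    The recursion in the abstract above can be sketched in a toy scalar case. Everything specific below is an assumption chosen for illustration and is not taken from the paper: a two-state Markov chain $\{Φ_n\}$ on {0, 1}, the function $f(θ, φ) = φ - θ$, and step sizes $α_n = 1/n$. With these choices the iterate coincides with the running average of the chain, so it should settle near the stationary probability of state 1, which is 0.3/(0.3+0.1) = 0.75 for the hypothetical transition probabilities used.

```python
import random

def run_sa(n_steps, seed=0, p01=0.3, p10=0.1):
    """Run theta_{n+1} = theta_n + alpha_{n+1} * f(theta_n, Phi_{n+1}).

    Illustrative assumptions: f(theta, phi) = phi - theta, alpha_n = 1/n,
    and Phi is a two-state Markov chain with P(0->1)=p01, P(1->0)=p10.
    Returns the final estimate and the plain sample mean of the chain.
    """
    rng = random.Random(seed)
    phi = 0          # current state of the Markov chain
    theta = 0.0      # current estimate
    total = 0        # running sum of the chain, for comparison
    for n in range(1, n_steps + 1):
        # Advance the Markov chain by one step.
        u = rng.random()
        if phi == 0:
            phi = 1 if u < p01 else 0
        else:
            phi = 0 if u < p10 else 1
        # Stochastic approximation update with vanishing step size 1/n.
        theta += (1.0 / n) * (phi - theta)
        total += phi
    return theta, total / n_steps

if __name__ == "__main__":
    theta, avg = run_sa(200_000)
    print(theta, avg)
```

    The design choice $α_n = 1/n$ makes the connection to averaging exact; other vanishing step-size sequences would give different transient behaviour.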

  11. arXiv:2106.00514  [pdf, ps, other]

    math.PR cs.IT

    Entropy and the Discrete Central Limit Theorem

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: A strengthened version of the central limit theorem for discrete random variables is established, relying only on information-theoretic tools and elementary arguments. It is shown that the relative entropy between the standardised sum of $n$ independent and identically distributed lattice random variables and an appropriately discretised Gaussian, vanishes as $n\to\infty$.

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 15 pages

    MSC Class: 60F05; 94A17; 60E15

  12. arXiv:2105.13762  [pdf, other]

    cs.LG cs.SI stat.AP

    The Feature-First Block Model

    Authors: Lawrence Tray, Ioannis Kontoyiannis

    Abstract: Labelled networks are an important class of data, naturally appearing in numerous applications in science and engineering. A typical inference goal is to determine how the vertex labels (or features) affect the network's structure. In this work, we introduce a new generative model, the feature-first block model (FFBM), that facilitates the use of rich queries on labelled networks. We develop a Bay…

    Submitted 16 November, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  13. arXiv:2104.03882  [pdf, ps, other]

    cs.IT math.PR

    An Information-Theoretic Proof of a Finite de Finetti Theorem

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: A finite form of de Finetti's representation theorem is established using elementary information-theoretic tools: The distribution of the first $k$ random variables in an exchangeable binary vector of length $n\geq k$ is close to a mixture of product distributions. Closeness is measured in terms of the relative entropy and an explicit bound is provided.

    Submitted 25 June, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: 5 pages. Revised version with some minor typos fixed and discussion slightly expanded

  14. arXiv:2007.15981  [pdf, other]

    cs.IT math.CO math.PR

    Compression and Symmetry of Small-World Graphs and Structures

    Authors: Ioannis Kontoyiannis, Yi Heng Lim, Katia Papakonstantinopoulou, Wojtek Szpankowski

    Abstract: For various purposes and, in particular, in the context of data compression, a graph can be examined at three levels. Its structure can be described as the unlabeled version of the graph; then the labeling of its structure can be added; and finally, given the structure and labeling, the contents of the labels can be described. Determining the amount of information present at each level and quanti…

    Submitted 22 November, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

    Comments: 21 pages, 1 figure

  15. arXiv:2007.14900  [pdf, other]

    stat.ME cs.IT stat.AP stat.CO

    Bayesian Context Trees: Modelling and exact inference for discrete time series

    Authors: Ioannis Kontoyiannis, Lambros Mertzanis, Athina Panotopoulou, Ioannis Papageorgiou, Maria Skoularidou

    Abstract: We develop a new Bayesian modelling framework for the class of higher-order, variable-memory Markov chains, and introduce an associated collection of methodological tools for exact inference with discrete time series. We show that a version of the context tree weighting algorithm can compute the prior predictive likelihood exactly (averaged over both models and parameters), and two related algorit…

    Submitted 6 February, 2022; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 53 pages, 22 figures, small stylistic changes. The associated R package "BCT" is available at CRAN.R-project.org/package=BCT

  16. Sharp Second-Order Pointwise Asymptotics for Lossless Compression with Side Information

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: The problem of determining the best achievable performance of arbitrary lossless compression algorithms is examined, when correlated side information is available at both the encoder and decoder. For arbitrary source-side information pairs, the conditional information density is shown to provide a sharp asymptotic lower bound for the description lengths achieved by an arbitrary sequence of compres…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: 20 pages, no figures. Based on part of arXiv:1912.05734v1

  17. arXiv:1912.12524  [pdf, other]

    math.PR cs.IT stat.ME

    The Lévy State Space Model

    Authors: Simon Godsill, Marina Riabiz, Ioannis Kontoyiannis

    Abstract: In this paper we introduce a new class of state space models based on shot-noise simulation representations of non-Gaussian Lévy-driven linear systems, represented as stochastic differential equations. In particular a conditionally Gaussian version of the models is proposed that is able to capture heavy-tailed non-Gaussianity while retaining tractability for inference procedures. We focus on a can…

    Submitted 8 January, 2020; v1 submitted 28 December, 2019; originally announced December 2019.

    Comments: V1: 8 pages, 4 figures. V2: References updated

  18. arXiv:1912.05734  [pdf, other]

    cs.IT

    Fundamental Limits of Lossless Data Compression with Side Information

    Authors: Lampros Gavalakis, Ioannis Kontoyiannis

    Abstract: The problem of lossless data compression with side information available to both the encoder and the decoder is considered. The finite-blocklength fundamental limits of the best achievable performance are defined, in two different versions of the problem: Reference-based compression, when a single side information string is used repeatedly in compressing different source messages, and pair-based c…

    Submitted 21 February, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: 26 pages, 1 figure. Revised, shorter, final version, focusing primarily on nonasymptotic results. This version has been accepted for publication in IEEE Transactions on Information Theory

  19. arXiv:1812.11137  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    Differential Temporal Difference Learning

    Authors: Adithya M. Devraj, Ioannis Kontoyiannis, Sean P. Meyn

    Abstract: Value functions derived from Markov decision processes arise as a central component of algorithms as well as performance metrics in many statistics and engineering applications of machine learning techniques. Computation of the solution to the associated Bellman equations is challenging in most practical cases of interest. A popular class of approximation techniques, known as Temporal Difference (…

    Submitted 27 February, 2020; v1 submitted 28 December, 2018; originally announced December 2018.

    Comments: Preliminary versions of some of the results in this article were submitted as arXiv:1604.01828

    MSC Class: 93E20; 93E35; 60J20

  20. arXiv:1808.03830  [pdf, other]

    cs.IT math.PR

    A Simple Network of Nodes Moving on the Circle

    Authors: Dimitris Cheliotis, Ioannis Kontoyiannis, Michail Loulakis, Stavros Toumpis

    Abstract: Two simple Markov processes are examined, one in discrete and one in continuous time, arising from idealized versions of a transmission protocol for mobile, delay-tolerant networks. We consider two independent walkers moving with constant speed on either the discrete or continuous circle, and changing directions at independent geometric (respectively, exponential) times. One of the walkers carries…

    Submitted 4 March, 2020; v1 submitted 11 August, 2018; originally announced August 2018.

    Comments: Preliminary versions of some of the present results appeared in ISIT 2017 and SPAWC 2018

    Journal ref: Random Structures & Algorithms 57 (2), 317-338 (2020)

  21. arXiv:1802.10065  [pdf, other]

    math.PR cs.IT stat.ME

    Nonasymptotic Gaussian Approximation for Inference with Stable Noise

    Authors: Marina Riabiz, Tohid Ardeshiri, Ioannis Kontoyiannis, Simon Godsill

    Abstract: The results of a series of theoretical studies are reported, examining the convergence rate for different approximate representations of $α$-stable distributions. Although they play a key role in modelling random processes with jumps and discontinuities, the use of $α$-stable distributions in inference often leads to analytically intractable problems. The LePage series, which is a probabilistic re…

    Submitted 1 January, 2020; v1 submitted 27 February, 2018; originally announced February 2018.

    Comments: V1: 41 pages, 16 figures. V2: Text typos fixed; redundant figures from main text and appendices removed; added references in section I; changed section VI and its proofs in Appendices C-D-E; improved section IX; removed section X (Discussion and Conclusion), reference style changed. V3: title in the metadata updated. V4: Updated Theorem 1 and its proof, global revision

  22. arXiv:1801.02229  [pdf, ps, other]

    cs.IT

    Packet Speed and Cost in Mobile Wireless Delay-Tolerant Networks

    Authors: Riccardo Cavallari, Stavros Toumpis, Roberto Verdone, Ioannis Kontoyiannis

    Abstract: A mobile wireless delay-tolerant network (DTN) model is proposed and analyzed, in which infinitely many nodes are initially placed on R^2 according to a uniform Poisson point process (PPP) and subsequently travel, independently of each other, along trajectories comprised of line segments, changing travel direction at time instances that form a Poisson process, each time selecting a new travel dire…

    Submitted 28 February, 2018; v1 submitted 7 January, 2018; originally announced January 2018.

    Comments: Submitted to the IEEE Transactions on Information Theory

  23. arXiv:1508.04089  [pdf, ps, other]

    cs.IT math.CO math.PR

    Entropy bounds on abelian groups and the Ruzsa divergence

    Authors: Mokshay Madiman, Ioannis Kontoyiannis

    Abstract: Over the past few years, a family of interesting new inequalities for the entropies of sums and differences of random variables has been developed by Ruzsa, Tao and others, motivated by analogous results in additive combinatorics. The present work extends these earlier results to the case of random variables taking values in $\mathbb{R}^n$ or, more generally, in arbitrary locally compact and Polis…

    Submitted 26 October, 2015; v1 submitted 17 August, 2015; originally announced August 2015.

    Comments: 26 pages. Changes in v2: Added Section V on entropies of products of random variables, corrected several typos, added some references

    Journal ref: IEEE Transactions on Information Theory, vol. 64, no. 1, pp. 77-92, January 2018

  24. arXiv:1507.01234  [pdf, ps, other]

    cs.IT math.ST

    Estimating the Directed Information and Testing for Causality

    Authors: Ioannis Kontoyiannis, Maria Skoularidou

    Abstract: The problem of estimating the directed information rate between two discrete processes $\{X_n\}$ and $\{Y_n\}$ via the plug-in (or maximum-likelihood) estimator is considered. When the joint process $\{(X_n,Y_n)\}$ is a Markov chain of a given memory length, the plug-in estimator is shown to be asymptotically Gaussian and to converge at the optimal rate $O(1/\sqrt{n})$ under appropriate conditions…

    Submitted 31 March, 2016; v1 submitted 5 July, 2015; originally announced July 2015.

    Comments: Minor typos corrected, reviewers' comments addressed

  25. arXiv:1212.2668  [pdf, other]

    cs.IT math.PR

    Lossless Data Compression at Finite Blocklengths

    Authors: Ioannis Kontoyiannis, Sergio Verdu

    Abstract: This paper provides an extensive study of the behavior of the best achievable rate (and other related fundamental limits) in variable-length lossless compression. In the non-asymptotic regime, the fundamental limits of fixed-to-variable lossless compression with and without prefix constraints are shown to be tightly coupled. Several precise, quantitative bounds are derived, connecting the distribu…

    Submitted 11 December, 2012; originally announced December 2012.

  26. arXiv:1206.0489  [pdf, ps, other]

    cs.IT math.CO math.PR

    Sumset and Inverse Sumset Inequalities for Differential Entropy and Mutual Information

    Authors: Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: The sumset and inverse sumset theories of Freiman, Plünnecke and Ruzsa, give bounds connecting the cardinality of the sumset $A+B=\{a+b\;;\;a\in A,\,b\in B\}$ of two discrete sets $A,B$, to the cardinalities (or the finer structure) of the original sets $A,B$. For example, the sum-difference bound of Ruzsa states that, $|A+B|\,|A|\,|B|\leq|A-B|^3$, where the difference set…

    Submitted 3 June, 2012; originally announced June 2012.

    Comments: 23 pages

    Journal ref: IEEE Transactions on Information Theory, vol. 60, no. 8, pp. 4503-4514, August 2014
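
    The sum-difference bound quoted in the abstract above, $|A+B|\,|A|\,|B|\leq|A-B|^3$, can be checked numerically on small finite sets of integers. The sketch below is only a sanity check of that combinatorial inequality; the helper names and the random test sets are hypothetical, and the difference set is taken to be $A-B=\{a-b\;;\;a\in A,\,b\in B\}$, by analogy with the sumset definition given in the abstract.

```python
import random

def sumset(A, B):
    """A + B = {a + b : a in A, b in B}."""
    return {a + b for a in A for b in B}

def diffset(A, B):
    """A - B = {a - b : a in A, b in B}."""
    return {a - b for a in A for b in B}

def ruzsa_holds(A, B):
    """Check |A+B| |A| |B| <= |A-B|^3 for nonempty finite sets A, B."""
    return len(sumset(A, B)) * len(A) * len(B) <= len(diffset(A, B)) ** 3

def check_random_pairs(trials, seed=0, universe=50, max_size=12):
    """Test the bound on randomly drawn pairs of nonempty integer sets."""
    rng = random.Random(seed)
    for _ in range(trials):
        A = set(rng.sample(range(universe), rng.randint(1, max_size)))
        B = set(rng.sample(range(universe), rng.randint(1, max_size)))
        if not ruzsa_holds(A, B):
            return False
    return True

if __name__ == "__main__":
    print(check_random_pairs(1000))
```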

  27. arXiv:1004.3692  [pdf, ps, other]

    math.PR cs.IT

    Compound Poisson Approximation via Information Functionals

    Authors: A. D. Barbour, Oliver Johnson, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: An information-theoretic development is given for the problem of compound Poisson approximation, which parallels earlier treatments for Gaussian and Poisson approximation. Let $P_{S_n}$ be the distribution of a sum $S_n=\sum_{i=1}^n Y_i$ of independent integer-valued random variables $Y_i$. Nonasymptotic bounds are derived for the distance between $P_{S_n}$ and an appropriately chosen compound Poisson la…

    Submitted 21 April, 2010; originally announced April 2010.

    Comments: 27 pages

    Journal ref: Electronic Journal of Probability, Vol 15, Paper no. 42, pages 1344-1369, 2010

  28. arXiv:0912.0581  [pdf, ps, other]

    math.CO cs.IT math.PR

    Log-concavity, ultra-log-concavity, and a maximum entropy property of discrete compound Poisson measures

    Authors: Oliver Johnson, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: Sufficient conditions are developed, under which the compound Poisson distribution has maximal entropy within a natural class of probability measures on the nonnegative integers. Recently, one of the authors [O. Johnson, Stoch. Proc. Appl., 2007] used a semigroup approach to show that the Poisson has maximal entropy among all ultra-log-concave distributions with fixed mean. We show via a non…

    Submitted 27 September, 2011; v1 submitted 3 December, 2009; originally announced December 2009.

    Comments: 30 pages. This submission supersedes arXiv:0805.4112v1. Changes in v2: Updated references, typos corrected

    MSC Class: 94A17; 60E07; 60E15

    Journal ref: Discrete Applied Mathematics, vol 161/9, pages 1232-1250, 2013

  29. Thinning, Entropy and the Law of Thin Numbers

    Authors: Peter Harremoes, Oliver Johnson, Ioannis Kontoyiannis

    Abstract: Renyi's "thinning" operation on a discrete random variable is a natural discrete analog of the scaling operation for continuous random variables. The properties of thinning are investigated in an information-theoretic context, especially in connection with information-theoretic inequalities related to Poisson approximation results. The classical Binomial-to-Poisson convergence (sometimes referre…

    Submitted 3 June, 2009; originally announced June 2009.

    Journal ref: IEEE Transactions on Information Theory, Vol 56/9, 2010, pages 4228-4244

  30. arXiv:0904.3340  [pdf, ps, other]

    cs.IT

    Lossy Compression in Near-Linear Time via Efficient Random Codebooks and Databases

    Authors: Chris Gioran, Ioannis Kontoyiannis

    Abstract: The compression-complexity trade-off of lossy compression algorithms that are based on a random codebook or a random database is examined. Motivated, in part, by recent results of Gupta-Verdú-Weissman (GVW) and their underlying connections with the pattern-matching scheme of Kontoyiannis' lossy Lempel-Ziv algorithm, we introduce a non-universal version of the lossy Lempel-Ziv method (termed LLZ)…

    Submitted 21 April, 2009; originally announced April 2009.

    Comments: 23 pages, four figures, four tables

  31. arXiv:0805.4112  [pdf, ps, other]

    cs.IT math.PR

    On the entropy and log-concavity of compound Poisson measures

    Authors: Oliver Johnson, Ioannis Kontoyiannis, Mokshay Madiman

    Abstract: Motivated, in part, by the desire to develop an information-theoretic foundation for compound Poisson approximation limit theorems (analogous to the corresponding developments for the central limit theorem and for simple Poisson approximation), this work examines sufficient conditions under which the compound Poisson distribution has maximal entropy within a natural class of probability measures…

    Submitted 27 May, 2008; originally announced May 2008.

    Report number: Superseded by arXiv:0912.0581

    MSC Class: 62B10; 94A17

  32. Estimating the entropy of binary time series: Methodology, some theory and a simulation study

    Authors: Y. Gao, I. Kontoyiannis, E. Bienenstock

    Abstract: Partly motivated by entropy-estimation problems in neuroscience, we present a detailed and extensive comparison between some of the most popular and effective entropy estimation methods used in practice: The plug-in method, four different estimators based on the Lempel-Ziv (LZ) family of data compression algorithms, an estimator based on the Context-Tree Weighting (CTW) method, and the renewal e…

    Submitted 29 February, 2008; originally announced February 2008.

    Comments: 34 pages, 3 figures

  33. arXiv:0710.5190  [pdf, ps, other]

    q-bio.GN cs.IT

    Identifying statistical dependence in genomic sequences via mutual information estimates

    Authors: H. M. Aktulga, I. Kontoyiannis, L. A. Lyznik, L. Szpankowski, A. Y. Grama, W. Szpankowski

    Abstract: Questions of understanding and quantifying the representation and amount of information in organisms have become a central part of biological research, as they potentially hold the key to fundamental advances. In this paper, we demonstrate the use of information-theoretic tools for the task of identifying segments of biomolecules (DNA or RNA) that are statistically correlated. We develop a preci…

    Submitted 26 October, 2007; originally announced October 2007.

    Comments: Preliminary version. Final version in EURASIP Journal on Bioinformatics and Systems Biology. See http://www.hindawi.com/journals/bsb/

  34. arXiv:0710.4117  [pdf, ps, other]

    q-bio.NC cs.IT math.PR stat.AP

    From the entropy to the statistical structure of spike trains

    Authors: Yun Gao, Ioannis Kontoyiannis, Elie Bienenstock

    Abstract: We use statistical estimates of the entropy rate of spike train data in order to make inferences about the underlying structure of the spike train itself. We first examine a number of different parametric and nonparametric estimators (some known and some new), including the "plug-in" method, several versions of Lempel-Ziv-based compression algorithms, a maximum likelihood estimator tailored to…

    Submitted 27 March, 2008; v1 submitted 22 October, 2007; originally announced October 2007.

    Journal ref: In Proceedings of the 2006 International Symposium on Information Theory, Seattle, WA, July 2006

  35. arXiv:0710.4076  [pdf, ps, other]

    cs.IT math.NT math.PR

    Some information-theoretic computations related to the distribution of prime numbers

    Authors: Ioannis Kontoyiannis

    Abstract: We illustrate how elementary information-theoretic ideas may be employed to provide proofs for well-known, nontrivial results in number theory. Specifically, we give an elementary and fairly short proof of the following asymptotic result: The sum of (log p)/p, taken over all primes p not exceeding n, is asymptotic to log n as n tends to infinity. We also give finite-n bounds refining the above l…

    Submitted 5 November, 2007; v1 submitted 22 October, 2007; originally announced October 2007.

    Comments: 10 pages; see also http://pages.cs.aueb.gr/users/yiannisk/
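
    The asymptotic statement in the abstract above, that the sum of (log p)/p over primes p not exceeding n is asymptotic to log n, can be explored numerically. The sketch below is a plain numerical check using a sieve of Eratosthenes; it is not the information-theoretic argument of the paper, and the function names are hypothetical. The ratio of the sum to log n approaches 1 only slowly, because the difference between the two stays bounded rather than vanishing.

```python
from math import log

def primes_up_to(n):
    """Return all primes <= n using a sieve of Eratosthenes (n >= 1)."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0:2] = b"\x00\x00"                  # 0 and 1 are not prime
    for i in range(2, int(n**0.5) + 1):
        if sieve[i]:
            # Cross out multiples of i starting at i*i.
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(2, n + 1) if sieve[i]]

def log_p_over_p_sum(n):
    """Sum of (log p)/p over all primes p <= n."""
    return sum(log(p) / p for p in primes_up_to(n))

if __name__ == "__main__":
    for n in (10**3, 10**4, 10**5):
        s = log_p_over_p_sum(n)
        print(n, s, log(n), s / log(n))
```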

  36. Estimation of the Rate-Distortion Function

    Authors: M. T. Harrison, I. Kontoyiannis

    Abstract: Motivated by questions in lossy data compression and by theoretical considerations, we examine the problem of estimating the rate-distortion function of an unknown (not necessarily discrete-valued) source from empirical data. Our focus is the behavior of the so-called "plug-in" estimator, which is simply the rate-distortion function of the empirical distribution of the observed data. Sufficient…

    Submitted 11 April, 2008; v1 submitted 2 February, 2007; originally announced February 2007.

    Comments: 18 pages, no figures [v2: removed an example with an error; corrected typos; a shortened version will appear in IEEE Trans. Inform. Theory]

    Journal ref: IEEE Transactions on Information Theory, 54 (2008): 3757-3762

  37. arXiv:cs/0511009  [pdf, ps, other]

    cs.IT math.PR

    Mismatched codebooks and the role of entropy-coding in lossy data compression

    Authors: Ioannis Kontoyiannis, Rami Zamir

    Abstract: We introduce a universal quantization scheme based on random coding, and we analyze its performance. This scheme consists of a source-independent random codebook (typically mismatched to the source distribution), followed by optimal entropy-coding that is matched to the quantized codeword distribution. A single-letter formula is derived for the rate achieved by this scheme at a given distortio…

    Submitted 2 November, 2005; originally announced November 2005.

    Comments: 35 pages, 37 references, no figures. Submitted to IEEE Transactions on Information Theory

  38. arXiv:math/0103007  [pdf, ps, other]

    math.PR cs.IT

    Source Coding, Large Deviations, and Approximate Pattern Matching

    Authors: A. Dembo, I. Kontoyiannis

    Abstract: We present a development of parts of rate-distortion theory and pattern-matching algorithms for lossy data compression, centered around a lossy version of the Asymptotic Equipartition Property (AEP). This treatment closely parallels the corresponding development in lossless compression, a point of view that was advanced in an important paper of Wyner and Ziv in 1989. In the lossless case we rev…

    Submitted 1 March, 2001; originally announced March 2001.

    Comments: 48 pages, review paper

    MSC Class: 94A15; 60F10; 60G60

  39. arXiv:math/0009018  [pdf, ps, other]

    math.PR cs.IT

    Critical Behavior in Lossy Source Coding

    Authors: Amir Dembo, Ioannis Kontoyiannis

    Abstract: The following critical phenomenon was recently discovered. When a memoryless source is compressed using a variable-length fixed-distortion code, the fastest convergence rate of the (pointwise) compression ratio to the optimal $R(D)$ bits/symbol is either $O(\sqrt{n})$ or $O(\log n)$. We show it is always $O(\sqrt{n})$, except for discrete, uniformly distributed sources.

    Submitted 1 September, 2000; originally announced September 2000.

    Comments: 2 figures

  40. arXiv:math/9910062  [pdf, ps, other]

    math.PR cs.IT math.FA

    Efficient sphere-covering and converse measure concentration via generalized coding theorems

    Authors: Ioannis Kontoyiannis

    Abstract: Suppose A is a finite set equipped with a probability measure P and let M be a "mass" function on A. We give a probabilistic characterization of the most efficient way in which A^n can be almost-covered using spheres of a fixed radius. An almost-covering is a subset C_n of A^n, such that the union of the spheres centered at the points of C_n has probability close to one with respect to the pro…

    Submitted 27 September, 2000; v1 submitted 12 October, 1999; originally announced October 1999.

    Comments: 29 pages. See also http://www.stat.purdue.edu/~yiannis/

    MSC Class: 60E15; 28A35 (primary); 94A15; 60F10 (secondary)