Skip to main content

Showing 1–37 of 37 results for author: Harremoës, P

.
  1. arXiv:2306.16646  [pdf, ps, other

    cs.IT math.ST

    Universal Reverse Information Projections and Optimal E-statistics

    Authors: Tyron Lardy, Peter Grünwald, Peter Harremoës

    Abstract: Information projections have found important applications in probability theory, statistics, and related areas. In the field of hypothesis testing in particular, the reverse information projection (RIPr) has recently been shown to lead to so-called growth-rate optimal (GRO) e-statistics for testing simple alternatives against composite null hypotheses. However, the RIPr as well as the GRO criterio… ▽ More

    Submitted 4 December, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: A five-page abstract of this paper, containing a subset of the theorems but no proofs, was presented at ISIT 2023, Taipei

    MSC Class: 62B10 (primary); 94A17 (secondary)

  2. arXiv:2202.02668  [pdf, other

    cs.IT math.PR

    Unnormalized Measures in Information Theory

    Authors: Peter Harremoës

    Abstract: Information theory is built on probability measures and by definition a probability measure has total mass 1. Probability measures are used to model uncertainty, and one may ask how important it is that the total mass is one. We claim that the main reason to normalize measures is that probability measures are related to codes via Kraft's inequality. Using a minimum description length approach to s… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: 6 pages, 3 figures

    MSC Class: 94A17

  3. arXiv:2201.03707  [pdf, other

    stat.AP cs.IT

    Rate Distortion Theory for Descriptive Statistics

    Authors: Peter Harremoës

    Abstract: Rate distortion theory was developed for optimizing lossy compression of data, but it also has a lot of applications in statistics. In this paper we will see how rate distortion theory can be used to analyze a complicated data set involving orientations of early Islamic mosques. The analysis involves testing, identification of outliers, choice of compression rate, calculation of optimal reconstruc… ▽ More

    Submitted 16 February, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: 6 pages, 4 figures

    MSC Class: 94-10; 94A34

  4. arXiv:2002.03002  [pdf, other

    math.PR cs.IT

    Bounds on the Information Divergence for Hypergeometric Distributions

    Authors: Peter Harremoës, František Matúš

    Abstract: The hypergeometric distributions have many important applications, but they have not had sufficient attention in information theory. Hypergeometric distributions can be approximated by binomial distributions or Poisson distributions. In this paper we present upper and lower bounds on information divergence. These bounds are important for statistical testing and a better understanding of the notion… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 21 pages, 2 figures

    MSC Class: 60E15 94A17

  5. From Thermodynamic Sufficiency to Information Causality

    Authors: Peter Harremoës

    Abstract: The principle called information causality has been used to deduce Tsirelson's bound. In this paper we derive information causality from monotonicity of divergence and relate it to more basic principles related to measurements on thermodynamic systems. This principle is more fundamental in the sense that it can be formulated for both unipartite systems and multipartite systems while information ca… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 11 pages

    MSC Class: 81P16

  6. arXiv:1805.02234  [pdf, ps, other

    math.ST cs.IT

    Statistical Inference and Exact Saddle Point Approximations

    Authors: Peter Harremoës

    Abstract: Statistical inference may follow a frequentist approach or it may follow a Bayesian approach or it may use the minimum description length principle (MDL). Our goal is to identify situations in which these different approaches to statistical inference coincide. It is proved that for exponential families MDL and Bayesian inference coincide if and only if the renormalized saddle point approximation f… ▽ More

    Submitted 6 May, 2018; originally announced May 2018.

    Comments: 5 pages

    MSC Class: 62B10;

  7. arXiv:1707.03222  [pdf, ps, other

    math-ph quant-ph

    Entropy on Spin Factors

    Authors: Peter Harremoës

    Abstract: Recently it has been demonstrated that the Shannon entropy or the von Neuman entropy are the only entropy functions that generate a local Bregman divergences as long as the state space has rank 3 or higher. In this paper we will study the properties of Bregman divergences for convex bodies of rank 2. The two most important convex bodies of rank 2 can be identified with the bit and the qubit. We de… ▽ More

    Submitted 4 May, 2018; v1 submitted 11 July, 2017; originally announced July 2017.

    Comments: 30 pages, 6 figures

    MSC Class: 81P16

  8. arXiv:1701.06688  [pdf, ps, other

    cs.IT quant-ph

    Quantum Information on Spectral Sets

    Authors: Peter Harremoës

    Abstract: For convex optimization problems Bregman divergences appear as regret functions. Such regret functions can be defined on any convex set but if a sufficiency condition is added the regret function must be proportional to information divergence and the convex set must be spectral. Spectral set are sets where different orthogonal decompositions of a state into pure states have unique mixing coefficie… ▽ More

    Submitted 10 February, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

    Comments: 13 pages, 2 figures. arXiv admin note: text overlap with arXiv:1701.01010

    MSC Class: 81P16; 94B75

  9. arXiv:1701.01010  [pdf, other

    cs.IT cond-mat.stat-mech math.OC

    Divergence and Sufficiency for Convex Optimization

    Authors: Peter Harremoës

    Abstract: Logarithmic score and information divergence appear in information theory, statistics, statistical mechanics, and portfolio theory. We demonstrate that all these topics involve some kind of optimization that leads directly to regret functions and such regret functions are often given by a Bregman divergence. If the regret function also fulfills a sufficiency condition it must be proportional to in… ▽ More

    Submitted 10 April, 2017; v1 submitted 4 January, 2017; originally announced January 2017.

    Comments: 39 pages, 3 figures

    MSC Class: 94A17

  10. arXiv:1607.02259  [pdf, ps, other

    math-ph cs.IT

    Maximum Entropy and Sufficiency

    Authors: Peter Harremoës

    Abstract: The notion of Bregman divergence and sufficiency will be defined on general convex state spaces. It is demonstrated that only spectral sets can have a Bregman divergence that satisfies a sufficiency condition. Positive elements with trace 1 in a Jordan algebra are examples of spectral sets, and the most important example is the set of density matrices with complex entries. It is conjectured that i… ▽ More

    Submitted 3 September, 2016; v1 submitted 8 July, 2016; originally announced July 2016.

    MSC Class: 81P16; 94A17

  11. arXiv:1601.07593  [pdf, ps, other

    cs.IT q-fin.PM

    Sufficiency on the Stock Market

    Authors: Peter Harremoës

    Abstract: It is well-known that there are a number of relations between theoretical finance theory and information theory. Some of these relations are exact and some are approximate. In this paper we will explore some of these relations and determine under which conditions the relations are exact. It turns out that portfolio theory always leads to Bregman divergences. The Bregman divergence is only proporti… ▽ More

    Submitted 27 January, 2016; originally announced January 2016.

    MSC Class: 91B25

  12. Bounds on Tail Probabilities in Exponential families

    Authors: Peter Harremoës

    Abstract: In this paper we present various new inequalities for tail proabilities for distributions that are elements of the most improtant exponential families. These families include the Poisson distributions, the Gamma distributions, the binomial distributions, the negative binomial distributions and the inverse Gaussian distributions. All these exponential families have simple variance functions and the… ▽ More

    Submitted 8 February, 2016; v1 submitted 20 January, 2016; originally announced January 2016.

    Comments: 27 pages, 10 figures

    MSC Class: 60E15

    Journal ref: Kybernetika 51, 943-966 (2016)

  13. arXiv:1601.04255  [pdf, ps, other

    math.PR

    Thinning and Information Projections

    Authors: Peter Harremoës, Oliver Johnson, Ioannis Kontoyiannis

    Abstract: In this paper we establish lower bounds on information divergence of a distribution on the integers from a Poisson distribution. These lower bounds are tight and in the cases where a rate of convergence in the Law of Thin Numbers can be computed the rate is determined by the lower bounds proved in this paper. General techniques for getting lower bounds in terms of moments are developed. The result… ▽ More

    Submitted 17 January, 2016; originally announced January 2016.

    MSC Class: 60F99; 94A11

  14. arXiv:1507.07089  [pdf, other

    math.ST

    Proper Scoring and Sufficiency

    Authors: Peter Harremoës

    Abstract: Logarithmic score and information divergence appear in both information theory, statistics, statistical mechanics, and portfolio theory. We demonstrate that all these topics involve some kind of optimization that leads directly to the use of Bregman divergences. If a sufficiency condition is also fulfilled the Bregman divergence must be proportional to information divergence. The sufficiency condi… ▽ More

    Submitted 25 July, 2015; originally announced July 2015.

    Comments: Proceedings WITMSE 2015

    MSC Class: 62B10; 94A17

  15. arXiv:1502.04336  [pdf, ps, other

    cs.IT

    Lattices with non-Shannon Inequalities

    Authors: Peter Harremoës

    Abstract: We study the existence or absence of non-Shannon inequalities for variables that are related by functional dependencies. Although the power-set on four variables is the smallest Boolean lattice with non-Shannon inequalities there exist lattices with many more variables without non-Shannon inequalities. We search for conditions that ensures that no non-Shannon inequalities exist. It is demonstrated… ▽ More

    Submitted 15 February, 2015; originally announced February 2015.

    Comments: Ten pages. Submitted to ISIT 2015. The appendix will not appear in the proceedings

  16. arXiv:1402.0092  [pdf, other

    math.ST cs.IT

    Mutual information of Contingency Tables and Related Inequalities

    Authors: Peter Harremoës

    Abstract: For testing independence it is very popular to use either the $χ^{2}$-statistic or $G^{2}$-statistics (mutual information). Asymptotically both are $χ^{2}$-distributed so an obvious question is which of the two statistics that has a distribution that is closest to the $χ^{2}$-distribution. Surprisingly the distribution of mutual information is much better approximated by a $χ^{2}$-distribution tha… ▽ More

    Submitted 1 February, 2014; originally announced February 2014.

    Comments: A version without the appendix has been submitted to a conference

  17. arXiv:1305.4324  [pdf, ps, other

    cs.LG stat.ML

    Horizon-Independent Optimal Prediction with Log-Loss in Exponential Families

    Authors: Peter Bartlett, Peter Grunwald, Peter Harremoes, Fares Hedayati, Wojciech Kotlowski

    Abstract: We study online learning under logarithmic loss with regular parametric models. Hedayati and Bartlett (2012b) showed that a Bayesian prediction strategy with Jeffreys prior and sequential normalized maximum likelihood (SNML) coincide and are optimal if and only if the latter is exchangeable, and if and only if the optimal strategy can be calculated without knowing the time horizon in advance. They… ▽ More

    Submitted 19 May, 2013; originally announced May 2013.

    Comments: 23 pages

  18. arXiv:1301.6465  [pdf, ps, other

    cs.IT math.ST

    Extendable MDL

    Authors: Peter Harremoës

    Abstract: In this paper we show that combination of the minimum description length principle and a exchange-ability condition leads directly to the use of Jeffreys prior. This approach works in most cases even when Jeffreys prior cannot be normalized. Kraft's inequality links codes and distributions but a closer look at this inequality demonstrates that this link only makes sense when sequences are consider… ▽ More

    Submitted 19 May, 2013; v1 submitted 28 January, 2013; originally announced January 2013.

    Comments: 9 pages

    MSC Class: 62B10; 94A15

  19. arXiv:1206.6544  [pdf, ps, other

    cs.IT

    Minimum KL-divergence on complements of $L_1$ balls

    Authors: Daniel Berend, Peter Harremoës, Aryeh Kontorovich

    Abstract: Pinsker's widely used inequality upper-bounds the total variation distance $||P-Q||_1$ in terms of the Kullback-Leibler divergence $D(P||Q)$. Although in general a bound in the reverse direction is impossible, in many applications the quantity of interest is actually $D^*(P,\eps)$ --- defined, for an arbitrary fixed $P$, as the infimum of $D(P||Q)$ over all distributions $Q$ that are $\eps$-far aw… ▽ More

    Submitted 20 February, 2014; v1 submitted 27 June, 2012; originally announced June 2012.

    Comments: A previous version had the title "A Reverse Pinsker Inequality"

    MSC Class: 60F10; 94A15

  20. arXiv:1206.2459  [pdf, other

    cs.IT math.ST stat.ML

    Rényi Divergence and Kullback-Leibler Divergence

    Authors: Tim van Erven, Peter Harremoës

    Abstract: Rényi divergence is related to Rényi entropy much like Kullback-Leibler divergence is related to Shannon's entropy, and comes up in many settings. It was introduced by Rényi as a measure of information that satisfies almost the same axioms as Kullback-Leibler divergence, and depends on a parameter that is called its order. In particular, the Rényi divergence of order 1 equals the Kullback-Leibler… ▽ More

    Submitted 24 April, 2014; v1 submitted 12 June, 2012; originally announced June 2012.

    Comments: To appear in IEEE Transactions on Information Theory

  21. arXiv:1205.1005  [pdf, ps, other

    math.ST

    Some Refinements of Large Deviation Tail Probabilities

    Authors: Laszlo Gyorfi, Peter Harremoes, Gabor Tusnady

    Abstract: We study tail probabilities via some Gaussian approximations. Our results make refinements to large deviation theory. The proof builds on classical results by Bahadur and Rao. Binomial distributions and their tail probabilities are discussed in more detail.

    Submitted 4 May, 2012; originally announced May 2012.

    Comments: 7 pages

    MSC Class: 60F10; 60E15

  22. arXiv:1202.1125  [pdf, ps, other

    math.ST cs.IT

    Information Divergence is more chi squared distributed than the chi squared statistics

    Authors: Peter Harremoës, Gábor Tusnády

    Abstract: For testing goodness of fit it is very popular to use either the chi square statistic or G statistics (information divergence). Asymptotically both are chi square distributed so an obvious question is which of the two statistics that has a distribution that is closest to the chi square distribution. Surprisingly, when there is only one degree of freedom it seems like the distribution of informatio… ▽ More

    Submitted 17 June, 2012; v1 submitted 6 February, 2012; originally announced February 2012.

    Comments: 5 pages, accepted for presentation at ISIT 2012

    MSC Class: 62E15

  23. arXiv:1102.2536  [pdf, ps, other

    cs.IT math.PR

    Lower bounds on Information Divergence

    Authors: Peter Harremoës, Christophe Vignat

    Abstract: In this paper we establish lower bounds on information divergence from a distribution to certain important classes of distributions as Gaussian, exponential, Gamma, Poisson, geometric, and binomial. These lower bounds are tight and for several convergence theorems where a rate of convergence can be computed, this rate is determined by the lower bounds proved in this paper. General techniques for g… ▽ More

    Submitted 12 February, 2011; originally announced February 2011.

    Comments: Submitted for the conference ISIT 2011

    MSC Class: 94A15

  24. arXiv:1102.0418  [pdf, ps, other

    math.HO

    Is Zero a Natural Number?

    Authors: Peter Harremoës

    Abstract: It is argued that zero should be considered as a cardinal number but not an ordinal number. One should make a clear distinction between order types that are labels for well-ordered sets and ordinal numbers that are labels for the elements in these sets.

    Submitted 2 February, 2011; originally announced February 2011.

    MSC Class: 03E10

  25. arXiv:1007.0097  [pdf, ps, other

    cs.IT math.ST

    On Pairs of $f$-divergences and their Joint Range

    Authors: Peter Harremoës, Igor Vajda

    Abstract: We compare two f-divergences and prove that their joint range is the convex hull of the joint range for distributions supported on only two points. Some applications of this result are given.

    Submitted 1 July, 2010; originally announced July 2010.

    Comments: 7 pages, 4 figures

    MSC Class: 94A17; 26Dxx

  26. arXiv:1002.1493  [pdf, ps, other

    math.ST

    On Bahadur Efficiency of Power Divergence Statistics

    Authors: Peter Harremoës, Igor Vajda

    Abstract: It is proved that the information divergence statistic is infinitely more Bahadur efficient than the power divergence statistics of the orders $α>1$ as long as the sequence of alternatives is contiguous with respect to the sequence of null-hypotheses and the the number of observations per bin increases to infinity is not very slow. This improves the former result in Harremoës and Vajda (2008) wh… ▽ More

    Submitted 7 February, 2010; originally announced February 2010.

  27. arXiv:1001.4448  [pdf, ps, other

    cs.IT

    Rényi Divergence and Majorization

    Authors: Tim van Erven, Peter Harremoës

    Abstract: Rényi divergence is related to Rényi entropy much like information divergence (also called Kullback-Leibler divergence or relative entropy) is related to Shannon's entropy, and comes up in many settings. It was introduced by Rényi as a measure of information that satisfies almost the same axioms as information divergence. We review the most important properties of Rényi divergence, including its r… ▽ More

    Submitted 27 May, 2010; v1 submitted 25 January, 2010; originally announced January 2010.

    MSC Class: 94A17

  28. arXiv:1001.4432  [pdf, ps, other

    cs.IT math.ST

    Joint Range of f-divergences

    Authors: Peter Harremoës, Igor Vajda

    Abstract: We provide a general method for evaluation of the joint range of f-divergences for two different functions f. Via topological arguments we prove that the joint range for general distributions equals the convex hull of the joint range achieved by the distributions on a two-element set. The joint range technique provides important inequalities between different f-divergences with various application… ▽ More

    Submitted 27 May, 2010; v1 submitted 25 January, 2010; originally announced January 2010.

    Comments: Accepted for presentation at ISIT 2010

  29. Thinning, Entropy and the Law of Thin Numbers

    Authors: Peter Harremoes, Oliver Johnson, Ioannis Kontoyiannis

    Abstract: Renyi's "thinning" operation on a discrete random variable is a natural discrete analog of the scaling operation for continuous random variables. The properties of thinning are investigated in an information-theoretic context, especially in connection with information-theoretic inequalities related to Poisson approximation results. The classical Binomial-to-Poisson convergence (sometimes referre… ▽ More

    Submitted 3 June, 2009; originally announced June 2009.

    Journal ref: IEEE Transactions on Information Theory, Vol 56/9, 2010, pages 4228-4244

  30. arXiv:0904.2477  [pdf, other

    cs.IT math.PR

    Joint Range of Rényi Entropies

    Authors: Peter Harremoës

    Abstract: The exact range of the joined values of several Rényi entropies is determined. The method is based on topology with special emphasis on the orientation of the objects studied. Like in the case when only two orders of Rényi entropies are studied one can parametrize upper and lower bounds but an explicit formula for a tight upper or lower bound cannot be given.

    Submitted 16 April, 2009; originally announced April 2009.

    MSC Class: 94A17; 62B10

  31. arXiv:0903.5429  [pdf, ps, other

    math.PR

    Dutch Books and Combinatorial Games

    Authors: Peter Harremoes

    Abstract: The theory of combinatorial game (like board games) and the theory of social games (where one looks for Nash equilibria) are normally considered as two separate theories. Here we shall see what comes out of combining the ideas. The central idea is Conway's observation that real numbers can be interpreted as special types of combinatorial games. Therefore the payoff function of a social game is a c… ▽ More

    Submitted 27 May, 2010; v1 submitted 31 March, 2009; originally announced March 2009.

    MSC Class: 60A05; 91A46

  32. arXiv:0903.5426  [pdf, ps, other

    cs.IT math.ST

    Testing Goodness-of-Fit via Rate Distortion

    Authors: Peter Harremoes

    Abstract: A framework is developed using techniques from rate distortion theory in statistical testing. The idea is first to do optimal compression according to a certain distortion function and then use information divergence from the compressed empirical distribution to the compressed null hypothesis as statistic. Only very special cases have been studied in more detail, but they indicate that the appro… ▽ More

    Submitted 31 March, 2009; originally announced March 2009.

    MSC Class: 94A34; 62G10

  33. arXiv:0903.5399  [pdf, ps, other

    cs.IT

    Regret and Jeffreys Integrals in Exp. Families

    Authors: Peter Grunwald, Peter Harremoes

    Abstract: The problem of whether minimax redundancy, minimax regret and Jeffreys integrals are finite or infinite are discussed.

    Submitted 31 March, 2009; originally announced March 2009.

  34. arXiv:0901.0015  [pdf, other

    cs.IT math.PR

    Maximum Entropy on Compact Groups

    Authors: Peter Harremoes

    Abstract: On a compact group the Haar probability measure plays the role of uniform distribution. The entropy and rate distortion theory for this uniform distribution is studied. New results and simplified proofs on convergence of convolutions on compact groups are presented and they can be formulated as entropy increases to its maximum. Information theoretic techniques and Markov chains play a crucial ro… ▽ More

    Submitted 29 March, 2009; v1 submitted 30 December, 2008; originally announced January 2009.

    Journal ref: Entropy 2009, 11(2), 222-237

  35. Properties of Classical and Quantum Jensen-Shannon Divergence

    Authors: Jop Briët, Peter Harremoës

    Abstract: Jensen-Shannon divergence (JD) is a symmetrized and smoothed version of the most important divergence measure of information theory, Kullback divergence. As opposed to Kullback divergence it determines in a very direct way a metric; indeed, it is the square of a metric. We consider a family of divergence measures (JD_alpha for alpha>0), the Jensen divergences of order alpha, which generalize JD… ▽ More

    Submitted 14 April, 2009; v1 submitted 27 June, 2008; originally announced June 2008.

    Comments: 13 pages, LaTeX, expanded contents, added references and corrected typos

    Journal ref: Phys. Rev. A 79, 052311 (2009)

  36. Interpretations of Renyi Entropies And Divergences

    Authors: Peter Harremoes

    Abstract: In this paper a new operational definition of Renyi entropy and Renyi divergence is presented. Other operational definitions are mentioned.

    Submitted 30 September, 2005; originally announced October 2005.

    Comments: 10 pages, 1 figure

    MSC Class: 94A17; 82B99

  37. Entropy and the Law of Small Numbers

    Authors: Ioannis Kontoyiannis, Peter Harremoes, Oliver Johnson

    Abstract: Two new information-theoretic methods are introduced for establishing Poisson approximation inequalities. First, using only elementary information-theoretic techniques it is shown that, when $S_n=\sum_{i=1}^nX_i$ is the sum of the (possibly dependent) binary random variables $X_1,X_2,...,X_n$, with $E(X_i)=p_i$ and $E(S_n)=\la$, then \ben D(P_{S_n}\|\Pol)\leq \sum_{i=1}^n p_i^2 + \Big[\sum_{i=1}… ▽ More

    Submitted 17 November, 2004; v1 submitted 1 November, 2002; originally announced November 2002.

    Comments: 15 pages. To appear, IEEE Trans Inform Theory

    Journal ref: IEEE Transactions on Information Theory, Vol 51/2, 2005, pages 466-472