
Domain-Aware Augmentations for Unsupervised Online General Continual Learning

Authors: Nicolas Michel, Romain Negrel, Giovanni Chierchia, Jean-François Bercher

Abstract: Continual Learning has been challenging, especially when dealing with unsupervised scenarios such as Unsupervised Online General Continual Learning (UOGCL), where the learning agent has no prior knowledge of class boundaries or task change information. While previous research has focused on reducing forgetting in supervised setups, recent studies have shown that self-supervised learners are more resilient to forgetting. This paper proposes a novel approach that enhances memory usage for contrastive learning in UOGCL by defining and using stream-dependent data augmentations together with some implementation tricks. Our proposed method is simple yet effective, achieves state-of-the-art results compared to other unsupervised approaches in all considered setups, and reduces the gap between supervised and unsupervised continual learning. Our domain-aware augmentation procedure can be adapted to other replay-based methods, making it a promising strategy for continual learning.

Submitted 13 September, 2023; originally announced September 2023.

Comments: Accepted to BMVC'23

arXiv:2309.00462 [pdf, other]

New metrics for analyzing continual learners

Authors: Nicolas Michel, Giovanni Chierchia, Romain Negrel, Jean-François Bercher, Toshihiko Yamasaki

Abstract: Deep neural networks have shown remarkable performance when trained on independent and identically distributed data from a fixed set of classes. However, in real-world scenarios, it can be desirable to train models on a continuous stream of data where multiple classification tasks are presented sequentially. This scenario, known as Continual Learning (CL), poses challenges to standard learning algorithms, which struggle to maintain knowledge of old tasks while learning new ones. This stability-plasticity dilemma remains central to CL and multiple metrics have been proposed to adequately measure stability and plasticity separately. However, none considers the increasing difficulty of the classification task, which inherently results in performance loss for any model. In that sense, we analyze some limitations of current metrics and identify the presence of setup-induced forgetting. Therefore, we propose new metrics that account for the task's increasing difficulty. Through experiments on benchmark datasets, we demonstrate that our proposed metrics can provide new insights into the stability-plasticity trade-off achieved by models in the continual learning environment.

Submitted 1 September, 2023; originally announced September 2023.

Comments: 6 pages, presented at MIRU 2023

arXiv:2306.03364 [pdf, other]

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and von Mises-Fisher Distributions for Online Continual Learning

Authors: Nicolas Michel, Giovanni Chierchia, Romain Negrel, Jean-François Bercher

Abstract: We use the maximum a posteriori estimation principle for learning representations distributed on the unit sphere. We propose to use the angular Gaussian distribution, which corresponds to a Gaussian projected on the unit sphere, and derive the associated loss function. We also consider the von Mises-Fisher distribution, which is the conditional of a Gaussian in the unit sphere. The learned representations are pushed toward fixed directions, which are the prior means of the Gaussians, allowing for a learning strategy that is resilient to data drift. This makes it suitable for online continual learning, which is the problem of training neural networks on a continuous data stream, where multiple classification tasks are presented sequentially so that data from past tasks are no longer accessible, and data from the current task can be seen only once. To address this challenging scenario, we propose a memory-based representation learning technique equipped with our new loss functions. Our approach does not require negative data or knowledge of task boundaries and performs well with smaller batch sizes while being computationally efficient. We demonstrate with extensive experiments that the proposed method outperforms the current state-of-the-art methods on both standard evaluation scenarios and realistic scenarios with blurry task boundaries. For reproducibility, we use the same training pipeline for every compared method and share the code at https://github.com/Nicolas1203/ocl-fd.

Submitted 16 February, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

Comments: Fix some typo. Accepted to AAAI24

arXiv:2302.00004 [pdf, other]

doi 10.1109/ICCCNT54827.2022.9984543

Low Complexity Approaches for End-to-End Latency Prediction

Authors: Pierre Larrenie, Jean-François Bercher, Olivier Venard, Iyad Lahsen-Cherif

Abstract: Software Defined Networks have opened the door to statistical and AI-based techniques to improve the efficiency of networking, especially to ensure a certain Quality of Service (QoS) for specific applications by routing packets with awareness of the nature of the content (VoIP, video, files, etc.) and its needs (latency, bandwidth, etc.), so as to use network resources efficiently. Predicting various Key Performance Indicators (KPIs) at any level may handle such problems while preserving network bandwidth. The question addressed in this work is the design of efficient and low-cost algorithms for KPI prediction, implementable at the local level. We focus on end-to-end latency prediction, for which we illustrate our approaches and results on a public dataset from the recent international challenge on GNN [1]. We propose several low complexity, locally implementable approaches, achieving significantly lower wall time both for training and inference, with marginally worse prediction accuracy compared to state-of-the-art global GNN solutions.

Submitted 31 January, 2023; originally announced February 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2301.13536

Journal ref: 2022 13th International Conference on Computing Communication and Networking Technologies (ICCCNT), Oct 2022, Kharagpur, India. pp.1-6

arXiv:2301.13536 [pdf, other]

Low Complexity Adaptive Machine Learning Approaches for End-to-End Latency Prediction

Authors: Pierre Larrenie, Jean-François Bercher, Olivier Venard, Iyad Lahsen-Cherif

Abstract: Software Defined Networks have opened the door to statistical and AI-based techniques to improve the efficiency of networking, especially to ensure a certain Quality of Service (QoS) for specific applications by routing packets with awareness of the nature of the content (VoIP, video, files, etc.) and its needs (latency, bandwidth, etc.), so as to use network resources efficiently. Monitoring and predicting various Key Performance Indicators (KPIs) at any level may handle such problems while preserving network bandwidth. The question addressed in this work is the design of efficient, low-cost adaptive algorithms for KPI estimation, monitoring and prediction. We focus on end-to-end latency prediction, for which we illustrate our approaches and results on data obtained from a public generator provided after the recent international challenge on GNN [12]. In this paper, we improve our previously proposed low-cost estimators [6] by adding the adaptive dimension, and show that the performances are minimally modified while gaining the ability to track varying networks.

Submitted 31 January, 2023; originally announced January 2023.

Journal ref: 5th International Conference on Machine Learning for Networking (MLN'2022), Nov 2022, Paris, France

arXiv:2207.05615 [pdf, other]

Contrastive Learning for Online Semi-Supervised General Continual Learning

Authors: Nicolas Michel, Romain Negrel, Giovanni Chierchia, Jean-François Bercher

Abstract: We study Online Continual Learning with missing labels and propose SemiCon, a new contrastive loss designed for partly labeled data. We demonstrate its efficiency by devising a memory-based method trained on an unlabeled data stream, where every data point added to memory is labeled using an oracle. Our approach outperforms existing semi-supervised methods when few labels are available, and obtains similar results to state-of-the-art supervised methods while using only 2.6% of labels on Split-CIFAR10 and 10% of labels on Split-CIFAR100.

Submitted 22 November, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

Comments: Accepted at ICIP'22 Oral presentation

arXiv:1305.6215 [pdf, ps, other]

On some interrelations of generalized $q$-entropies and a generalized Fisher information, including a Cramér-Rao inequality

Authors: Jean-François Bercher

Abstract: In this communication, we describe some interrelations between generalized $q$-entropies and a generalized version of Fisher information. In information theory, the de Bruijn identity links the Fisher information and the derivative of the entropy. We show that this identity can be extended to generalized versions of entropy and Fisher information. More precisely, a generalized Fisher information naturally pops up in the expression of the derivative of the Tsallis entropy. This generalized Fisher information also appears as a special case of a generalized Fisher information for estimation problems. Indeed, we derive here a new Cramér-Rao inequality for the estimation of a parameter, which involves a generalized form of Fisher information. This generalized Fisher information reduces to the standard Fisher information as a particular case. In the case of a translation parameter, the general Cramér-Rao inequality leads to an inequality for distributions which is saturated by generalized $q$-Gaussian distributions. These generalized $q$-Gaussians are important in several areas of physics and mathematics. They are known to maximize the $q$-entropies subject to a moment constraint. The Cramér-Rao inequality shows that the generalized $q$-Gaussians also minimize the generalized Fisher information among distributions with a fixed moment. Similarly, the generalized $q$-Gaussians also minimize the generalized Fisher information among distributions with a given $q$-entropy.

Submitted 27 May, 2013; originally announced May 2013.

Journal ref: Applied Stochastic Models and Data Analysis, Mataro (Barcelona) : Spain (2013)

arXiv:1305.6213 [pdf, ps, other]

Some results on a $χ$-divergence, an extended Fisher information and generalized Cramér-Rao inequalities

Authors: Jean-François Bercher

Abstract: We propose a modified $χ^β$-divergence, give some of its properties, and show that this leads to the definition of a generalized Fisher information. We give generalized Cramér-Rao inequalities, involving this Fisher information, an extension of the Fisher information matrix, and arbitrary norms and power of the estimation error. In the case of a location parameter, we obtain new characterizations of the generalized $q$-Gaussians, for instance as the distribution with a given moment that minimizes the generalized Fisher information. Finally we indicate how the generalized Fisher information can lead to new uncertainty relations.

Submitted 27 May, 2013; originally announced May 2013.

Journal ref: Geometric Sciences of Information, Paris : France (2013)

arXiv:1305.5040 [pdf, ps, other]

doi 10.1016/j.physa.2013.03.062

Some properties of generalized Fisher information in the context of nonextensive thermostatistics

Authors: Jean-François Bercher

Abstract: We present two extended forms of Fisher information that fit well in the context of nonextensive thermostatistics. We show that there exists an interplay between these generalized Fisher information, the generalized $q$-Gaussian distributions and the $q$-entropies. The minimum of the generalized Fisher information among distributions with a fixed moment, or with a fixed $q$-entropy, is attained, in both cases, by a generalized $q$-Gaussian distribution. This complements the fact that the $q$-Gaussians maximize the $q$-entropies subject to a moment constraint, and yields new variational characterizations of the generalized $q$-Gaussians. We show that the generalized Fisher information naturally pop up in the expression of the time derivative of the $q$-entropies, for distributions satisfying a certain nonlinear heat equation. This result includes as a particular case the classical de Bruijn identity. Then we study further properties of the generalized Fisher information and of their minimization. We show that, though non additive, the generalized Fisher information of a combined system is upper bounded. In the case of mixing, we show that the generalized Fisher information is convex for $q\geq1.$ Finally, we show that the minimization of the generalized Fisher information subject to moment constraints satisfies a Legendre structure analogous to the Legendre structure of thermodynamics.

Submitted 4 June, 2013; v1 submitted 22 May, 2013; originally announced May 2013.

Journal ref: Physica A: Statistical Mechanics and its Applications 392, 15 (2013) 3140-3154

arXiv:1211.2008 [pdf, ps, other]

doi 10.1088/1751-8113/46/9/095303

On multidimensional generalized Cramér-Rao inequalities, uncertainty relations and characterizations of generalized $q$-Gaussian distributions

Authors: J. -F. Bercher

Abstract: In the present work, we show how the generalized Cramér-Rao inequality for the estimation of a parameter, presented in a recent paper, can be extended to the multidimensional case with general norms on $\mathbb{R}^{n}$, and to a wider context. As a particular case, we obtain a new multidimensional Cramér-Rao inequality which is saturated by generalized $q$-Gaussian distributions. We also give another related Cramér-Rao inequality, for a general norm, which is saturated as well by these distributions. Finally, we derive uncertainty relations from these Cramér-Rao inequalities. These uncertainty relations involve moments computed with respect to escort distributions, and we show that some of these relations are saturated by generalized $q$-Gaussian distributions. These results introduce extended versions of Fisher information, new Cramér-Rao inequalities, and new characterizations of generalized $q$-Gaussian distributions which are important in several areas of physics and mathematics.

Submitted 24 February, 2013; v1 submitted 8 November, 2012; originally announced November 2012.

MSC Class: 28D20; 94A17; 62B10; 39B62

Journal ref: J. Phys. A: Math. Theor. vol. 46, no 9, 095303, 2013

arXiv:1206.0567 [pdf, ps, other]

doi 10.1088/1751-8113/45/25/255303

On generalized Cramér-Rao inequalities, generalized Fisher informations and characterizations of generalized q-Gaussian distributions

Authors: J. -F. Bercher

Abstract: This paper deals with Cramér-Rao inequalities in the context of nonextensive statistics and in estimation theory. It gives characterizations of generalized q-Gaussian distributions, and introduces generalized versions of Fisher information. The contributions of this paper are (i) the derivation of new extended Cramér-Rao inequalities for the estimation of a parameter, involving general q-moments of the estimation error, (ii) the derivation of Cramér-Rao inequalities saturated by generalized q-Gaussian distributions, (iii) the definition of generalized Fisher informations, (iv) the identification and interpretation of some prior results, and finally, (v) the suggestion of new estimation methods.

Submitted 4 June, 2012; originally announced June 2012.

Journal ref: J. Phys. A: Math. Theor. 45 255303 2012

arXiv:1206.0561 [pdf, other]

doi 10.1016/j.physa.2012.04.024

A simple probabilistic construction yielding generalized entropies and divergences, escort distributions and q-Gaussians

Authors: J. -F. Bercher

Abstract: We give a simple probabilistic description of a transition between two states which leads to a generalized escort distribution. When the parameter of the distribution varies, it defines a parametric curve that we call an escort-path. The Rényi divergence appears as a natural by-product of the setting. We study the dynamics of the Fisher information on this path, and show in particular that the thermodynamic divergence is proportional to Jeffreys' divergence. Next, we consider the problem of inferring a distribution on the escort-path, subject to generalized moments constraints. We show that our setting naturally induces a rationale for the minimization of the Rényi information divergence. Then, we derive the optimum distribution as a generalized q-Gaussian distribution.

Submitted 4 June, 2012; originally announced June 2012.

arXiv:1203.1435 [pdf, ps, other]

doi 10.1063/1.4726197

On a (β,q)-generalized Fisher information and inequalities involving q-Gaussian distributions

Authors: J. -F. Bercher

Abstract: In the present paper, we would like to draw attention to a possible generalized Fisher information that fits well in the formalism of nonextensive thermostatistics. This generalized Fisher information is defined for densities on $\mathbb{R}^{n}.$ Just as the maximum Rényi or Tsallis entropy subject to an elliptic moment constraint is a generalized q-Gaussian, we show that the minimization of the generalized Fisher information also leads to a generalized q-Gaussian. This yields a generalized Cramér-Rao inequality. In addition, we show that the generalized Fisher information naturally pops up in a simple inequality that links the generalized entropies, the generalized Fisher information and an elliptic moment. Finally, we give an extended Stam inequality. In this series of results, the extremal functions are the generalized q-Gaussians. Thus, these results complement the classical characterization of the generalized q-Gaussian and introduce a generalized Fisher information as a new information measure in nonextensive thermostatistics.

Submitted 17 January, 2013; v1 submitted 7 March, 2012; originally announced March 2012.

Comments: v2: corrected equation (A5)

Journal ref: J. Math. Phys. 53, 063303 (2012)

arXiv:1109.3385 [pdf, ps, other]

doi 10.1016/j.physleta.2009.07.015

Source coding with escort distributions and Renyi entropy bounds

Authors: J. -F. Bercher

Abstract: We discuss the interest of escort distributions and Rényi entropy in the context of source coding. We first recall a source coding theorem by Campbell relating a generalized measure of length to the Rényi-Tsallis entropy. We show that the associated optimal codes can be obtained using considerations on escort-distributions. We propose a new family of measures of length involving escort-distributions and we show that these generalized lengths are also bounded below by the Rényi entropy. Furthermore, we obtain that the standard Shannon code lengths are optimum for the new generalized length measures, whatever the entropic index. Finally, we show that there exists in this setting an interplay between standard and escort distributions.

Submitted 15 September, 2011; originally announced September 2011.

Journal ref: Physics Letters A, vol. 373, no. 36, p. 3235-3238, 2009

arXiv:1109.3311 [pdf, ps, other]

doi 10.1016/j.physleta.2011.06.057

Escort entropies and divergences and related canonical distribution

Authors: J. -F. Bercher

Abstract: We discuss two families of two-parameter entropies and divergences, derived from the standard Rényi and Tsallis entropies and divergences. These divergences and entropies are found as divergences or entropies of escort distributions. Exploiting the nonnegativity of the divergences, we derive the expression of the canonical distribution associated to the new entropies and an observable given as an escort-mean value. We show that this canonical distribution extends, and smoothly connects, the results obtained in nonextensive thermodynamics for the standard and generalized mean value constraints.

Submitted 15 September, 2011; originally announced September 2011.

Journal ref: Physics Letters A, vol. 375, no. 33, p. 2969-2973, 2011

arXiv:0908.2532

Utilisation de la notion de copule en tomographie (Using the Notion of Copula in Tomography)

Authors: Doriano-Boris Pougaza, Ali Mohammad-Djafari, Jean-Francois Bercher

Abstract: An important problem in statistics is determining a joint probability distribution from its marginal distributions. In the two-dimensional case, the marginal distributions f1(x) and f2(y) are related to the joint distribution f(x,y) by integrals along horizontal and vertical lines (the two axes x and y). Thus, determining f(x,y) from f1(x) and f2(y) is an ill-posed inverse problem. In statistics, the notion of copula is introduced to obtain a solution to this problem. A similar problem in X-ray tomography is the reconstruction of an image f(x,y), representing the distribution of the density of a quantity inside an object, from its two horizontal and vertical projections, f1(x) and f2(y). There are also many methods for such problems based on the Radon transform. In this article, we show the links between the notion of copula and X-ray tomography and examine whether methods from one field can be used in the other.

Submitted 14 August, 2010; v1 submitted 18 August, 2009; originally announced August 2009.

Comments: This paper has been withdrawn by the author, please find it on his web page

arXiv:0812.1316 [pdf, ps, other]

Using the Notion of Copula in Tomography

Authors: Doriano-Boris Pougaza, A. Mohammad-Djafari, Jean-François Bercher

Abstract: In 1917 Johann Radon introduced the Radon transform, which was used in 1963 by A. M. Cormack for application in the context of tomographic image reconstruction. He proposed to reconstruct the spatial variation of the material density of the body from X-ray images (radiographies) for different directions. Independently, G. N. Hounsfield derived an algorithm and built the first medical CT scanner. Basically, the idea of X-ray CT is to get an image of the interior structure of an object by X-raying the object from many different directions. The mathematical problem is then estimating a multivariate function from its line integrals. Four years before Cormack's idea, Abe Sklar introduced a theory in the context of statistics called copula. In short, copulas are functions that link multivariate distributions to their univariate marginal functions. It appeared that copulas capture the entire dependence structure concerning the marginal functions and offer a wide range of parametric family models which could be used as a model for the joint distribution function. This statistical problem is the same as in tomography, because a marginal density is obtained from a line integral of its joint distribution. In the particular case where only horizontal and vertical projections are given, corresponding to two given marginal functions, we link the theory of copula to tomography via the Radon transform and Sklar's theorem. The result we propose seems to be new as a mathematical approach to solve this tomographic inverse problem.

Submitted 6 December, 2008; originally announced December 2008.

Comments: 60 pages, 81 figures

arXiv:0805.0129 [pdf, ps, other]

doi 10.1016/j.ins.2008.02.003

On some entropy functionals derived from Rényi information divergence

Authors: Jean-François Bercher

Abstract: We consider the maximum entropy problems associated with Rényi $Q$-entropy, subject to two kinds of constraints on expected values. The constraints considered are a constraint on the standard expectation, and a constraint on the generalized expectation as encountered in nonextensive statistics. The optimum maximum entropy probability distributions, which can exhibit a power-law behaviour, are derived and characterized. The Rényi entropy of the optimum distributions can be viewed as a function of the constraint. This defines two families of entropy functionals in the space of possible expected values. General properties of these functionals, including nonnegativity, minimum, convexity, are documented. Their relationships as well as numerical aspects are also discussed. Finally, we work out some specific cases for the reference measure $Q(x)$ and recover in a limit case some well-known entropies.

Submitted 1 May, 2008; originally announced May 2008.

Journal ref: Information Sciences 178, 12 (2008) 2489-2506

arXiv:0802.3110 [pdf, ps, other]

An entropic view of Pickands' theorem

Authors: J. -F. Bercher, C. Vignat

Abstract: It is shown that distributions arising in the Renyi-Tsallis maximum entropy setting are related to the Generalized Pareto Distributions (GPD) that are widely used for modeling the tails of distributions. The relevance of such modeling, as well as the ubiquity of GPD in practical situations, follows from the Balkema-De Haan-Pickands theorem on the distribution of excesses (over a high threshold). We provide an entropic view of this result, by showing that the distribution of a suitably normalized excess variable converges to the solution of a maximum Tsallis entropy, which is the GPD. This highlights the relevance of the so-called Tsallis distributions in many applications as well as some relevance to the use of the corresponding entropy.

Submitted 6 May, 2008; v1 submitted 21 February, 2008; originally announced February 2008.

Comments: 4 pages, accepted to ISIT08

arXiv:math-ph/0609077 [pdf, ps, other]

doi 10.1063/1.2423305

An amended MaxEnt formulation for deriving Tsallis factors, and associated issues

Authors: Jean-François Bercher

Abstract: An amended MaxEnt formulation for systems displaced from the conventional MaxEnt equilibrium is proposed. This formulation involves the minimization of the Kullback-Leibler divergence to a reference $Q$ (or maximization of Shannon $Q$-entropy), subject to a constraint that implicates a second reference distribution $P_{1}$ and tunes the new equilibrium. In this setting, the equilibrium distribution is the generalized escort distribution associated to $P_{1}$ and $Q$. The account of an additional constraint, an observable given by a statistical mean, leads to the maximization of Rényi/Tsallis $Q$-entropy subject to that constraint. Two natural scenarios for this observation constraint are considered, and the classical and generalized constraints of nonextensive statistics are recovered. The solutions to the maximization of Rényi $Q$-entropy subject to the two types of constraints are derived. These optimum distributions, which are Levy-like distributions, are self-referential. We then propose two 'alternate' (but effectively computable) dual functions, whose maximization enables the identification of the optimum parameters. Finally, a duality between solutions and the underlying Legendre structure are presented.

Submitted 27 September, 2006; originally announced September 2006.

Comments: Presented at MaxEnt2006, Paris, France, July 10-13, 2006

MSC Class: PACS: 05.30.-d; 05.20.-y; 05.70.Ce; 05.90.+m

Showing 1–20 of 20 results for author: Bercher, J