Search | arXiv e-print repository

Self-adhesivity in lattices of abstract conditional independence models

Authors: Tobias Boege, Janneke H. Bolt, Milan Studený

Abstract: We introduce an algebraic concept of the frame for abstract conditional independence (CI) models, together with basic operations with respect to which such a frame should be closed: copying and marginalization. Three standard examples of such frames are (discrete) probabilistic CI structures, semi-graphoids and structural semi-graphoids. We concentrate on those frames which are closed under the op… ▽ More We introduce an algebraic concept of the frame for abstract conditional independence (CI) models, together with basic operations with respect to which such a frame should be closed: copying and marginalization. Three standard examples of such frames are (discrete) probabilistic CI structures, semi-graphoids and structural semi-graphoids. We concentrate on those frames which are closed under the operation of set-theoretical intersection because, for these, the respective families of CI models are lattices. This allows one to apply the results from lattice theory and formal concept analysis to describe such families in terms of implications among CI statements. The central concept of this paper is that of self-adhesivity defined in algebraic terms, which is a combinatorial reflection of the self-adhesivity concept studied earlier in context of polymatroids and information theory. The generalization also leads to a self-adhesivity operator defined on the hyper-level of CI frames. We answer some of the questions related to this approach and raise other open questions. The core of the paper is in computations. The combinatorial approach to computation might overcome some memory and space limitation of software packages based on polyhedral geometry, in particular, if SAT solvers are utilized. We characterize some basic CI families over 4 variables in terms of canonical implications among CI statements. We apply our method in information-theoretical context to the task of entropic region demarcation over 5 variables. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: 32 pages, 4 figures

MSC Class: 62B10 (primary) 06A15; 68T27; 68V05; 05B35 (secondary)

arXiv:2103.02414 [pdf, ps, other]

doi 10.1007/s00186-022-00770-4

Facets of the cone of exact games

Authors: Milan Studený, Václav Kratochvíl

Abstract: The class of exact transferable utility coalitional games, introduced in 1972 by Schmeidler, has been studied both in the context of game theory and in the context of imprecise probabilities. We characterize the cone of exact games by describing the minimal set of linear inequalities defining this cone; these facet-defining inequalities for the exact cone appear to correspond to certain set system… ▽ More The class of exact transferable utility coalitional games, introduced in 1972 by Schmeidler, has been studied both in the context of game theory and in the context of imprecise probabilities. We characterize the cone of exact games by describing the minimal set of linear inequalities defining this cone; these facet-defining inequalities for the exact cone appear to correspond to certain set systems (= systems of coalitions). We noticed that non-empty proper coalitions having non-zero coefficients in these facet-defining inequalities form set systems with particular properties. More specifically, we introduce the concept of a semi-balanced system of coalitions, which generalizes the classic concept of a balanced coalitional system in cooperative game theory. The semi-balanced coalitional systems provide valid inequalities for the exact cone and minimal semi-balanced systems (in the sense of inclusion of set systems) characterize this cone. We also introduce basic classification of minimal semi-balanced systems, their pictorial representatives and a substantial concept of an indecomposable (minimal) semi-balanced system of coalitions. The main result of the paper is that indecomposable semi-balanced systems are in one-to-one correspondence with facet-defining inequalities for the exact cone. The secondary relevant result is the rebuttal of a former conjecture claiming that a coalitional game is exact iff it is totally balanced and its anti-dual is also totally balanced. We additionally characterize those inequalities which are facet-defining both for the exact cone and the cone of totally balanced games. △ Less

Submitted 3 March, 2021; originally announced March 2021.

Comments: 39 pages, to be submitted (probably) to Mathematical Methods of Operation Research

MSC Class: 91A12; 52B05; 90C57

Journal ref: Mathematical Methods of Operations Research 95 (2022) 35-80

arXiv:2012.04092 [pdf, ps, other]

doi 10.1109/TIT.2021.3104250

Conditional independence structures over four discrete random variables revisited: conditional Ingleton inequalities

Authors: Milan Studeny

Abstract: The paper deals with conditional linear information inequalities valid for entropy functions induced by discrete random variables. Specifically, the so-called conditional Ingleton inequalities are in the center of interest: these are valid under conditional independence assumptions on the inducing random variables. We discuss five inequalities of this particular type, four of which has appeared ea… ▽ More The paper deals with conditional linear information inequalities valid for entropy functions induced by discrete random variables. Specifically, the so-called conditional Ingleton inequalities are in the center of interest: these are valid under conditional independence assumptions on the inducing random variables. We discuss five inequalities of this particular type, four of which has appeared earlier in the literature. Besides the proof of the new fifth inequality, simpler proofs of (some of) former inequalities are presented. These five information inequalities are used to characterize all conditional independence structures induced by four discrete random variables. △ Less

Submitted 14 March, 2022; v1 submitted 7 December, 2020; originally announced December 2020.

MSC Class: 94A17 68T37 52B40

Journal ref: IEEE Transactions on Information Theory, vol. 67, no. 11, November 2021, pp. 7030 - 7049

arXiv:1302.6847 [pdf]

Semigraphoids Are Two-Antecedental Approximations of Stochastic Conditional Independence Models

Authors: Milan Studeny

Abstract: The semigraphoid closure of every couple of CI-statements (GI=conditional independence) is a stochastic CI-model. As a consequence of this result it is shown that every probabilistically sound inference rule for CI-model, having at most two antecedents, is derivable from the semigraphoid inference rules. This justifies the use of semigraphoids as approximations of stochastic CI-models in probabi… ▽ More The semigraphoid closure of every couple of CI-statements (GI=conditional independence) is a stochastic CI-model. As a consequence of this result it is shown that every probabilistically sound inference rule for CI-model, having at most two antecedents, is derivable from the semigraphoid inference rules. This justifies the use of semigraphoids as approximations of stochastic CI-models in probabilistic reasoning. The list of all 19 potential dominant elements of the mentioned semigraphoid closure is given as a byproduct. △ Less

Submitted 27 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

Report number: UAI-P-1994-PG-546-552

arXiv:1302.3606 [pdf]

On Separation Criterion and Recovery Algorithm for Chain Graphs

Authors: Milan Studeny

Abstract: Chain graphs give a natural unifying point of view on Markov and Bayesian networks and enlarge the potential of graphical models for description of conditional independence structures. In the paper a direct graphical separation criterion for chain graphs, called c-separation, which generalizes the d-separation criterion for Bayesian networks is introduced (recalled). It is equivalent to the clas… ▽ More Chain graphs give a natural unifying point of view on Markov and Bayesian networks and enlarge the potential of graphical models for description of conditional independence structures. In the paper a direct graphical separation criterion for chain graphs, called c-separation, which generalizes the d-separation criterion for Bayesian networks is introduced (recalled). It is equivalent to the classic moralization criterion for chain graphs and complete in sense that for every chain graph there exists a probability distribution satisfying exactly conditional independencies derivable from the chain graph by the c-separation criterion. Every class of Markov equivalent chain graphs can be uniquely described by a natural representative, called the largest chain graph. A recovery algorithm, which on basis of the (conditional) dependency model induced by an unknown chain graph finds the corresponding largest chain graph, is presented. △ Less

Submitted 13 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

Report number: UAI-P-1996-PG-509-516

arXiv:1301.7414 [pdf]

Bayesian Networks from the Point of View of Chain Graphs

Authors: Milan Studeny

Abstract: AThe paper gives a few arguments in favour of the use of chain graphs for description of probabilistic conditional independence structures. Every Bayesian network model can be equivalently introduced by means of a factorization formula with respect to a chain graph which is Markov equivalent to the Bayesian network. A graphical characterization of such graphs is given. The class of equivalent gr… ▽ More AThe paper gives a few arguments in favour of the use of chain graphs for description of probabilistic conditional independence structures. Every Bayesian network model can be equivalently introduced by means of a factorization formula with respect to a chain graph which is Markov equivalent to the Bayesian network. A graphical characterization of such graphs is given. The class of equivalent graphs can be represented by a distinguished graph which is called the largest chain graph. The factorization formula with respect to the largest chain graph is a basis of a proposal of how to represent the corresponding (discrete) probability distribution in a computer (i.e. parametrize it). This way does not depend on the choice of a particular Bayesian network from the class of equivalent networks and seems to be the most efficient way from the point of view of memory demands. A separation criterion for reading independency statements from a chain graph is formulated in a simpler way. It resembles the well-known d-separation criterion for Bayesian networks and can be implemented locally. △ Less

Submitted 30 January, 2013; originally announced January 2013.

Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

Report number: UAI-P-1998-PG-496-503

arXiv:1301.2282 [pdf]

On characterizing Inclusion of Bayesian Networks

Authors: Tomas Kocka, Remco R. Bouckaert, Milan Studeny

Abstract: Every directed acyclic graph (DAG) over a finite non-empty set of variables (= nodes) N induces an independence model over N, which is a list of conditional independence statements over N.The inclusion problem is how to characterize (in graphical terms) whether all independence statements in the model induced by a DAG K are in the model induced by a second DAG L. Meek (1997) conjectured that this… ▽ More Every directed acyclic graph (DAG) over a finite non-empty set of variables (= nodes) N induces an independence model over N, which is a list of conditional independence statements over N.The inclusion problem is how to characterize (in graphical terms) whether all independence statements in the model induced by a DAG K are in the model induced by a second DAG L. Meek (1997) conjectured that this inclusion holds iff there exists a sequence of DAGs from L to K such that only certain 'legal' arrow reversal and 'legal' arrow adding operations are performed to get the next DAG in the sequence.In this paper we give several characterizations of inclusion of DAG models and verify Meek's conjecture in the case that the DAGs K and L differ in at most one adjacency. As a warming up a rigorous proof of well-known graphical characterizations of equivalence of DAGs, which is a highly related problem, is given. △ Less

Submitted 10 January, 2013; originally announced January 2013.

Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

Report number: UAI-P-2001-PG-261-268

arXiv:1011.6664 [pdf, ps, other]

Learning restricted Bayesian network structures

Authors: Raymond Hemmecke, Silvia Lindner, Milan Studený

Abstract: Bayesian networks are basic graphical models, used widely both in statistics and artificial intelligence. These statistical models of conditional independence structure are described by acyclic directed graphs whose nodes correspond to (random) variables in consideration. A quite important topic is the learning of Bayesian network structures, which is determining the best fitting statistical model… ▽ More Bayesian networks are basic graphical models, used widely both in statistics and artificial intelligence. These statistical models of conditional independence structure are described by acyclic directed graphs whose nodes correspond to (random) variables in consideration. A quite important topic is the learning of Bayesian network structures, which is determining the best fitting statistical model on the basis of given data. Although there are learning methods based on statistical conditional independence tests, contemporary methods are mainly based on maximization of a suitable quality criterion that evaluates how good the graph explains the occurrence of the observed data. This leads to a nonlinear combinatorial optimization problem that is in general NP-hard to solve. In this paper we deal with the complexity of learning restricted Bayesian network structures, that is, we wish to find network structures of highest score within a given subset of all possible network structures. For this, we introduce a new unique algebraic representative for these structures, called the characteristic imset. We show that these imsets are always 0-1-vectors and that they have many nice properties that allow us to simplify long proofs for some known results and to easily establish new complexity results for learning restricted Bayes network structures. △ Less

Submitted 30 November, 2010; originally announced November 2010.

MSC Class: 62F15; 68T05; 90C05; 90C09; 90C10; 9090C27; 90C60;

Showing 1–8 of 8 results for author: Studený, M