Skip to main content

Showing 1–19 of 19 results for author: Chickering, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:1707.06742  [pdf, other

    cs.LG cs.AI cs.HC cs.SE stat.ML

    Machine Teaching: A New Paradigm for Building Machine Learning Systems

    Authors: Patrice Y. Simard, Saleema Amershi, David M. Chickering, Alicia Edelman Pelton, Soroush Ghorashi, Christopher Meek, Gonzalo Ramos, **a Suh, Johan Verwey, Mo Wang, John Wernsing

    Abstract: The current processes for building machine learning systems require practitioners with deep knowledge of machine learning. This significantly limits the number of machine learning systems that can be created and has led to a mismatch between the demand for machine learning systems and the ability for organizations to build them. We believe that in order to meet this growing demand for machine lear… ▽ More

    Submitted 10 August, 2017; v1 submitted 20 July, 2017; originally announced July 2017.

    Comments: Also available at: http://aka.ms/machineteachingpaper

    Report number: MSR-TR-2017-26

  2. arXiv:1506.02113  [pdf, other

    cs.LG cs.AI

    Selective Greedy Equivalence Search: Finding Optimal Bayesian Networks Using a Polynomial Number of Score Evaluations

    Authors: David Maxwell Chickering, Christopher Meek

    Abstract: We introduce Selective Greedy Equivalence Search (SGES), a restricted version of Greedy Equivalence Search (GES). SGES retains the asymptotic correctness of GES but, unlike GES, has polynomial performance guarantees. In particular, we show that when data are sampled independently from a distribution that is perfect with respect to a DAG ${\cal G}$ defined over the observable variables then, in the… ▽ More

    Submitted 5 June, 2015; originally announced June 2015.

    Comments: Full version of UAI paper

  3. arXiv:1409.4814  [pdf

    cs.AI cs.IR

    ICE: Enabling Non-Experts to Build Models Interactively for Large-Scale Lopsided Problems

    Authors: Patrice Simard, David Chickering, Aparna Lakshmiratan, Denis Charles, Leon Bottou, Carlos Garcia Jurado Suarez, David Grangier, Saleema Amershi, Johan Verwey, **a Suh

    Abstract: Quick interaction between a human teacher and a learning machine presents numerous benefits and challenges when working with web-scale data. The human teacher guides the machine towards accomplishing the task of interest. The learning machine leverages big data to find examples that maximize the training value of its interaction with the teacher. When the teacher is restricted to labeling examples… ▽ More

    Submitted 16 September, 2014; originally announced September 2014.

  4. arXiv:1302.6815  [pdf

    cs.AI

    Learning Bayesian Networks: The Combination of Knowledge and Statistical Data

    Authors: David Heckerman, Dan Geiger, David Maxwell Chickering

    Abstract: We describe algorithms for learning Bayesian networks from a combination of user knowledge and statistical data. The algorithms have two components: a scoring metric and a search procedure. The scoring metric takes a network structure, statistical data, and a user's prior knowledge, and returns a score proportional to the posterior probability of the network structure given the data. The search pr… ▽ More

    Submitted 16 May, 2015; v1 submitted 27 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

    Report number: UAI-P-1994-PG-293-301

  5. arXiv:1302.4938  [pdf

    cs.AI

    A Transformational Characterization of Equivalent Bayesian Network Structures

    Authors: David Maxwell Chickering

    Abstract: We present a simple characterization of equivalent Bayesian network structures based on local transformations. The significance of the characterization is twofold. First, we are able to easily prove several new invariant properties of theoretical interest for equivalent structures. Second, we use the characterization to derive an efficient algorithm that identifies all of the compelled edges in… ▽ More

    Submitted 20 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

    Report number: UAI-P-1995-PG-87-98

  6. arXiv:1302.3567  [pdf

    cs.LG cs.AI stat.ML

    Efficient Approximations for the Marginal Likelihood of Incomplete Data Given a Bayesian Network

    Authors: David Maxwell Chickering, David Heckerman

    Abstract: We discuss Bayesian methods for learning Bayesian networks when data sets are incomplete. In particular, we examine asymptotic approximations for the marginal likelihood of incomplete data given a Bayesian network. We consider the Laplace approximation and the less accurate but more efficient BIC/MDL approximation. We also consider approximations proposed by Draper (1993) and Cheeseman and Stutz (… ▽ More

    Submitted 16 May, 2015; v1 submitted 13 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

    Report number: UAI-P-1996-PG-158-168

  7. arXiv:1302.3566  [pdf

    cs.AI cs.LG stat.ML

    Learning Equivalence Classes of Bayesian Networks Structures

    Authors: David Maxwell Chickering

    Abstract: Approaches to learning Bayesian networks from data typically combine a scoring function with a heuristic search procedure. Given a Bayesian network structure, many of the scoring functions derived in the literature return a score for the entire equivalence class to which the structure belongs. When using such a scoring function, it is appropriate for the heuristic search algorithm to search over… ▽ More

    Submitted 13 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

    Report number: UAI-P-1996-PG-150-157

  8. arXiv:1302.1528  [pdf

    cs.LG cs.AI stat.ML

    A Bayesian Approach to Learning Bayesian Networks with Local Structure

    Authors: David Maxwell Chickering, David Heckerman, Christopher Meek

    Abstract: Recently several researchers have investigated techniques for using data to learn Bayesian networks containing compact representations for the conditional probability distributions (CPDs) stored at each node. The majority of this work has concentrated on using decision-tree representations for the CPDs. In addition, researchers typically apply non-Bayesian (or asymptotically Bayesian) scoring func… ▽ More

    Submitted 16 May, 2015; v1 submitted 6 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

    Report number: UAI-P-1997-PG-80-89

  9. arXiv:1301.7415  [pdf

    cs.LG cs.AI stat.ML

    Learning Mixtures of DAG Models

    Authors: Bo Thiesson, Christopher Meek, David Maxwell Chickering, David Heckerman

    Abstract: We describe computationally efficient methods for learning mixtures in which each component is a directed acyclic graphical model (mixtures of DAGs or MDAGs). We argue that simple search-and-score algorithms are infeasible for a variety of problems, and introduce a feasible approach in which parameter and structure search is interleaved and expected data is treated as real data. Our approach can b… ▽ More

    Submitted 16 May, 2015; v1 submitted 30 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

    Report number: UAI-P-1998-PG-504-513

  10. arXiv:1301.6685  [pdf

    cs.LG stat.ML

    Fast Learning from Sparse Data

    Authors: David Maxwell Chickering, David Heckerman

    Abstract: We describe two techniques that significantly improve the running time of several standard machine-learning algorithms when data is sparse. The first technique is an algorithm that effeciently extracts one-way and two-way counts--either real or expected-- from discrete data. Extracting such counts is a fundamental step in learning algorithms for constructing a variety of models including decision… ▽ More

    Submitted 16 May, 2015; v1 submitted 23 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI1999)

    Report number: UAI-P-1999-PG-109-115

  11. arXiv:1301.3862  [pdf

    cs.AI cs.IR cs.LG

    Dependency Networks for Collaborative Filtering and Data Visualization

    Authors: David Heckerman, David Maxwell Chickering, Christopher Meek, Robert Rounthwaite, Carl Kadie

    Abstract: We describe a graphical model for probabilistic relationships---an alternative to the Bayesian network---called a dependency network. The graph of a dependency network, unlike a Bayesian network, is potentially cyclic. The probability component of a dependency network, like a Bayesian network, is a set of conditional distributions, one for each node given its parents. We identify several basic… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-264-273

  12. arXiv:1301.3842  [pdf

    cs.AI

    A Decision Theoretic Approach to Targeted Advertising

    Authors: David Maxwell Chickering, David Heckerman

    Abstract: A simple advertising strategy that can be used to help increase sales of a product is to mail out special offers to selected potential customers. Because there is a cost associated with sending each offer, the optimal mailing strategy depends on both the benefit obtained from a purchase and how the offer affects the buying behavior of the customers. In this paper, we describe two methods for parti… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-82-88

  13. arXiv:1301.2320  [pdf

    cs.IR cs.AI cs.LG

    Using Temporal Data for Making Recommendations

    Authors: Andrew Zimdars, David Maxwell Chickering, Christopher Meek

    Abstract: We treat collaborative filtering as a univariate time series estimation problem: given a user's previous votes, predict the next vote. We describe two families of methods for transforming data to encode time order in ways amenable to off-the-shelf classification and density estimation tools, and examine the results of using these approaches on several real-world data sets. The improvements in p… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-580-588

  14. arXiv:1301.2279  [pdf

    cs.AI

    A Bayesian Approach to Tackling Hard Computational Problems

    Authors: Eric J. Horvitz, Yongshao Ruan, Carla P. Gomes, Henry Kautz, Bart Selman, David Maxwell Chickering

    Abstract: We are develo** a general framework for using learned Bayesian models for decision-theoretic control of search and reasoningalgorithms. We illustrate the approach on the specific task of controlling both general and domain-specific solvers on a hard class of structured constraint satisfaction problems. A successful strategyfor reducing the high (and even infinite) variance in running time typi… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-235-244

  15. arXiv:1301.0561  [pdf

    cs.AI

    Finding Optimal Bayesian Networks

    Authors: David Maxwell Chickering, Christopher Meek

    Abstract: In this paper, we derive optimality results for greedy Bayesian-network search algorithms that perform single-edge modifications at each step and use asymptotically consistent scoring criteria. Our results extend those of Meek (1997) and Chickering (2002), who demonstrate that in the limit of large datasets, if the generative distribution is perfect with respect to a DAG defined over the observabl… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-94-102

  16. arXiv:1212.2503  [pdf

    cs.AI stat.ML

    Practically Perfect

    Authors: Christopher Meek, David Maxwell Chickering

    Abstract: The property of perfectness plays an important role in the theory of Bayesian networks. First, the existence of perfect distributions for arbitrary sets of variables and directed acyclic graphs implies that various methods for reading independence from the structure of the graph (e.g., Pearl, 1988; Lauritzen, Dawid, Larsen & Leimer, 1990) are complete. Second, the asymptotic re… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-411-416

  17. arXiv:1212.2468  [pdf

    cs.LG cs.AI stat.ML

    Large-Sample Learning of Bayesian Networks is NP-Hard

    Authors: David Maxwell Chickering, Christopher Meek, David Heckerman

    Abstract: In this paper, we provide new complexity results for algorithms that learn discrete-variable Bayesian networks from data. Our results apply whenever the learning algorithm uses a scoring criterion that favors the simplest model able to represent the generative distribution exactly. Our results therefore hold whenever the learning algorithm uses a consistent scoring criterion and is… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-124-133

  18. arXiv:1209.2355  [pdf, other

    cs.LG cs.AI cs.IR math.ST

    Counterfactual Reasoning and Learning Systems

    Authors: Léon Bottou, Jonas Peters, Joaquin Quiñonero-Candela, Denis X. Charles, D. Max Chickering, Elon Portugaly, Dipankar Ray, Patrice Simard, Ed Snelson

    Abstract: This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system. Such predictions allow both humans and algorithms to select changes that improve both the short-term and long-term performance of such systems. This work is illustrated by experiments carried out on the ad… ▽ More

    Submitted 27 July, 2013; v1 submitted 11 September, 2012; originally announced September 2012.

    Comments: revised version

  19. arXiv:1207.4162  [pdf

    stat.AP cs.LG stat.ME

    ARMA Time-Series Modeling with Graphical Models

    Authors: Bo Thiesson, David Maxwell Chickering, David Heckerman, Christopher Meek

    Abstract: We express the classic ARMA time-series model as a directed graphical model. In doing so, we find that the deterministic relationships in the model make it effectively impossible to use the EM algorithm for learning model parameters. To remedy this problem, we replace the deterministic relationships with Gaussian distributions having a small variance, yielding the stochastic ARMA (ARMA) model. Thi… ▽ More

    Submitted 8 August, 2012; v1 submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-552-560