Showing 1–14 of 14 results for author: Chipman, H

  1. arXiv:2111.13737  [pdf, other]

    stat.ME

    Let's practice what we preach: Planning and interpreting simulation studies with design and analysis of experiments

    Authors: Hugh Chipman, Derek Bingham

    Abstract: Statisticians recommend the Design and Analysis of Experiments (DAE) for evidence-based research but often use tables to present their own simulation studies. Could DAE do better? We outline how DAE methods can be used to plan and analyze simulation studies. Tools for planning include fishbone diagrams, factorial and fractional factorial designs. Analysis is carried out via ANOVA, main-effect and…

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: 37 pages, 15 figures. Submitted to Canadian Journal of Statistics. For associated R code, see https://github.com/hughchipman/TablesAsDesigns

    MSC Class: 62K99 (Primary) 62K25 (Secondary)

  2. arXiv:1709.07542  [pdf, other]

    stat.ME

    Heteroscedastic BART Using Multiplicative Regression Trees

    Authors: Matthew Pratola, Hugh Chipman, Edward George, Robert McCulloch

    Abstract: BART (Bayesian Additive Regression Trees) has become increasingly popular as a flexible and scalable nonparametric regression approach for modern applied statistics problems. For the practitioner dealing with large and complex nonlinear response surfaces, its advantages include a matrix-free formulation and the lack of a requirement to prespecify a confining regression basis. Although flexible in…

    Submitted 9 July, 2018; v1 submitted 21 September, 2017; originally announced September 2017.

  3. arXiv:1612.01619  [pdf, other]

    stat.OT

    mBART: Multidimensional Monotone BART

    Authors: Hugh A. Chipman, Edward I. George, Robert E. McCulloch, Thomas S. Shively

    Abstract: For the discovery of regression relationships between $Y$ and a large set of $p$ potential predictors $x_1, \ldots, x_p$, the flexible nonparametric nature of BART (Bayesian Additive Regression Trees) allows for a much richer set of possibilities than restrictive parametric approaches. However, subject matter considerations sometimes warrant a minimal assumption of monotonicity in at least some of the…

    Submitted 8 October, 2021; v1 submitted 5 December, 2016; originally announced December 2016.

  4. arXiv:1507.01816  [pdf, other]

    stat.AP

    A Continuous-time Stochastic Block Model for Basketball Networks

    Authors: Lu Xin, Mu Zhu, Hugh Chipman

    Abstract: For professional basketball, finding valuable and suitable players is the key to building a winning team. To deal with such challenges, basketball managers, scouts and coaches are increasingly turning to analytics. Objective evaluation of players and teams has always been the top goal of basketball analytics. Typical statistical analytics mainly focuses on the box score and has developed various m…

    Submitted 23 July, 2016; v1 submitted 4 July, 2015; originally announced July 2015.

  5. arXiv:1309.3802  [pdf, other]

    stat.ME

    Monotone Function Estimation for Computer Experiments

    Authors: Shirin Golchi, Derek R. Bingham, Hugh Chipman, David A. Campbell

    Abstract: In statistical modeling of computer experiments, prior information is sometimes available about the underlying function. For example, the physical system simulated by the computer code may be known to be monotone with respect to some or all inputs. We develop a Bayesian approach to Gaussian process modelling capable of incorporating monotonicity information for computer model emulation. Markov chai…

    Submitted 14 June, 2014; v1 submitted 15 September, 2013; originally announced September 2013.

    Comments: 28 pages, 12 figures

  6. arXiv:1309.1906  [pdf]

    stat.CO

    Parallel Bayesian Additive Regression Trees

    Authors: Matthew T. Pratola, Hugh A. Chipman, James R. Gattiker, David M. Higdon, Robert McCulloch, William N. Rust

    Abstract: Bayesian Additive Regression Trees (BART) is a Bayesian approach to flexible non-linear regression which has been shown to be competitive with the best modern predictive methods such as those based on bagging and boosting. BART offers some advantages. For example, the stochastic search Markov Chain Monte Carlo (MCMC) algorithm can provide a more complete search of the model space and variation acr…

    Submitted 7 September, 2013; originally announced September 2013.

  7. GPfit: An R package for Gaussian Process Model Fitting using a New Optimization Algorithm

    Authors: Blake MacDonald, Pritam Ranjan, Hugh Chipman

    Abstract: Gaussian process (GP) models are commonly used statistical metamodels for emulating expensive computer simulators. Fitting a GP model can be numerically unstable if any pair of design points in the input space are close together. Ranjan, Haynes, and Karsten (2011) proposed a computationally stable approach for fitting GP models to deterministic computer simulators. They used a genetic algorithm ba…

    Submitted 3 May, 2013; originally announced May 2013.

    Comments: 20 pages, 17 images

    Journal ref: Journal of Statistical Software, 64 (12), 1-23, 2015

  8. arXiv:1304.4077  [pdf]

    stat.ME cs.CV cs.LG

    A new Bayesian ensemble of trees classifier for identifying multi-class labels in satellite images

    Authors: Reshu Agarwal, Pritam Ranjan, Hugh Chipman

    Abstract: Classification of satellite images is a key component of many remote sensing applications. One of the most important products of a raw satellite image is the classified map which labels the image pixels into meaningful classes. Though several parametric and non-parametric classifiers have been developed thus far, accurate labeling of the pixels still remains a challenge. In this paper, we propose…

    Submitted 31 May, 2013; v1 submitted 15 April, 2013; originally announced April 2013.

    Comments: 31 pages, 6 figures, 4 tables

  9. arXiv:1203.1269  [pdf, ps, other]

    stat.CO stat.ML

    A Short Note on Gaussian Process Modeling for Large Datasets using Graphics Processing Units

    Authors: Mark Franey, Pritam Ranjan, Hugh Chipman

    Abstract: The graphics processing unit (GPU) has emerged as a powerful and cost-effective processor for general performance computing. GPUs are capable of an order of magnitude more floating-point operations per second than modern central processing units (CPUs), and thus provide a great deal of promise for computationally intensive statistical applications. Fitting complex statistical models with…

    Submitted 21 July, 2012; v1 submitted 6 March, 2012; originally announced March 2012.

    Comments: 11 pages, 2 figures

  10. arXiv:1203.1078  [pdf, ps, other]

    stat.ME stat.ML

    Sequential Design for Computer Experiments with a Flexible Bayesian Additive Model

    Authors: Hugh Chipman, Pritam Ranjan, Weiwei Wang

    Abstract: In computer experiments, a mathematical model implemented on a computer is used to represent complex physical phenomena. These models, known as computer simulators, enable experimental study of a virtual representation of the complex phenomena. Simulators can be thought of as complex functions that take many inputs and provide an output. Often these simulators are themselves expensive to compute,…

    Submitted 1 July, 2012; v1 submitted 5 March, 2012; originally announced March 2012.

    Comments: 21 pages

  11. arXiv:1010.1437  [pdf, other]

    stat.ML cs.AI cs.SI stat.AP stat.ME

    Mixed-Membership Stochastic Block-Models for Transactional Networks

    Authors: Mahdi Shafiei, Hugh Chipman

    Abstract: Transactional network data can be thought of as a list of one-to-many communications (e.g., email) between nodes in a social network. Most social network models convert this type of data into binary relations between pairs of nodes. We develop a latent mixed membership model capable of modeling richer forms of transactional network data, including relations between more than two nodes. The model ca…

    Submitted 7 October, 2010; originally announced October 2010.

    Comments: 22 pages

  12. arXiv:1003.0804  [pdf, other]

    stat.ME stat.CO

    Branch and Bound Algorithms for Maximizing Expected Improvement Functions

    Authors: Mark Franey, Pritam Ranjan, Hugh Chipman

    Abstract: Deterministic computer simulations are often used as a replacement for complex physical experiments. Although less expensive than physical experimentation, computer codes can still be time-consuming to run. An effective strategy for exploring the response surface of the deterministic simulator is the use of an approximation to the computer code, such as a Gaussian process (GP) model, coupled wit…

    Submitted 3 March, 2010; originally announced March 2010.

    Comments: 26 pages, 14 figures, preprint submitted to the Journal of Statistical Planning and Inference

  13. arXiv:0806.3286  [pdf, ps, other]

    stat.ME stat.AP stat.ML

    BART: Bayesian additive regression trees

    Authors: Hugh A. Chipman, Edward I. George, Robert E. McCulloch

    Abstract: We develop a Bayesian "sum-of-trees" model where each tree is constrained by a regularization prior to be a weak learner, and fitting and inference are accomplished via an iterative Bayesian backfitting MCMC algorithm that generates samples from a posterior. Effectively, BART is a nonparametric Bayesian regression approach which uses dimensionally adaptive random basis elements. Motivated by ensem…

    Submitted 7 October, 2010; v1 submitted 19 June, 2008; originally announced June 2008.

    Comments: Published at http://dx.doi.org/10.1214/09-AOAS285 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS285

    Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 1, 266-298

  14. arXiv:0804.1325  [pdf, ps, other]

    stat.ML

    On the underestimation of model uncertainty by Bayesian K-nearest neighbors

    Authors: Wanhua Su, Hugh Chipman, Mu Zhu

    Abstract: When using the K-nearest neighbors method, one often ignores uncertainty in the choice of K. To account for such uncertainty, Holmes and Adams (2002) proposed a Bayesian framework for K-nearest neighbors (KNN). Their Bayesian KNN (BKNN) approach uses a pseudo-likelihood function, and standard Markov chain Monte Carlo (MCMC) techniques to draw posterior samples. Holmes and Adams (2002) focused on…

    Submitted 8 April, 2008; originally announced April 2008.