Showing 1–24 of 24 results for author: Shalizi, C R

  1. arXiv:2210.16224  [pdf, other]

    stat.AP

    Empirical Macroeconomics and DSGE Modeling in Statistical Perspective

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi

    Abstract: Dynamic stochastic general equilibrium (DSGE) models have been a ubiquitous, and controversial, part of macroeconomics for decades. In this paper, we approach DSGEs purely as statistical models. We do this by applying two common model validation checks to the canonical Smets and Wouters 2007 DSGE: (1) we simulate the model and see how well it can be estimated from its own simulation output, and (2…

    Submitted 31 October, 2022; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 36 pages, 21 figures, 7 tables

  2. arXiv:2205.13698  [pdf, other]

    stat.ME stat.ML

    Characterizing the robustness of Bayesian adaptive experimental designs to active learning bias

    Authors: Sabina J. Sloman, Daniel M. Oppenheimer, Stephen B. Broomell, Cosma Rohilla Shalizi

    Abstract: Bayesian adaptive experimental design is a form of active learning, which chooses samples to maximize the information they give about uncertain parameters. Prior work has shown that other forms of active learning can suffer from active learning bias, where unrepresentative sampling leads to inconsistent parameter estimates. We show that active learning bias can also afflict Bayesian adaptive exper…

    Submitted 28 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  3. arXiv:2203.09077  [pdf, other]

    stat.CO

    Evaluating Posterior Distributions by Selectively Breeding Prior Samples

    Authors: Cosma Rohilla Shalizi

    Abstract: Using Markov chain Monte Carlo to sample from posterior distributions was the key innovation which made Bayesian data analysis practical. Notoriously, however, MCMC is hard to tune, hard to diagnose, and hard to parallelize. This pedagogical note explores variants on a universal non-Markov-chain Monte Carlo scheme for sampling from posterior distributions. The basic idea is to draw parameter…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 16 pages, 2 figures, code included in text

  4. arXiv:2111.09220  [pdf, other]

    stat.ME nlin.CD physics.data-an

    A Note on Simulation-Based Inference by Matching Random Features

    Authors: Cosma Rohilla Shalizi

    Abstract: We can, and should, do statistical inference on simulation models by adjusting the parameters in the simulation so that the values of randomly chosen functions of the simulation output match the values of those same functions calculated on the data. Results from the "state-space reconstruction" or "geometry from a time series" literature in nonlinear dynamics indicate that just $2d+1$ such…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 41 pages, 14 figures

  5. arXiv:1912.03387  [pdf, other]

    math.ST stat.ME

    Conditional Mutual Information Estimation for Mixed Discrete and Continuous Variables with Nearest Neighbors

    Authors: Octavio César Mesner, Cosma Rohilla Shalizi

    Abstract: Fields like public health, public policy, and social science often want to quantify the degree of dependence between variables whose relationships take on unknown functional forms. Typically, in fact, researchers in these fields are attempting to evaluate causal theories, and so want to quantify dependence after conditioning on other variables that might explain, mediate or confound causal relatio…

    Submitted 6 December, 2019; originally announced December 2019.

  6. Bootstrapping Exchangeable Random Graphs

    Authors: Alden Green, Cosma Rohilla Shalizi

    Abstract: We introduce two new bootstraps for exchangeable random graphs. One, the "empirical graphon bootstrap", is based purely on resampling, while the other, the "histogram bootstrap", is a model-based "sieve" bootstrap. We show that both of them accurately approximate the sampling distributions of motif densities, i.e., of the normalized counts of the number of times fixed subgraphs appear in the netwo…

    Submitted 3 January, 2022; v1 submitted 2 November, 2017; originally announced November 2017.

    Journal ref: Electronic Journal of Statistics, vol. 16 (2022), pp. 1058--1095

  7. arXiv:1607.06565  [pdf, other]

    stat.ME cs.SI physics.soc-ph

    Estimating Causal Peer Influence in Homophilous Social Networks by Inferring Latent Locations

    Authors: Edward McFowland III, Cosma Rohilla Shalizi

    Abstract: Social influence cannot be identified from purely observational data on social networks, because such influence is generically confounded with latent homophily, i.e., with a node's network partners being informative about the node's attributes and therefore its behavior. If the network grows according to either a latent community (stochastic block) model, or a continuous latent space model, then l…

    Submitted 17 June, 2021; v1 submitted 22 July, 2016; originally announced July 2016.

    Comments: 35 pages, 4 figures

    Journal ref: Journal of the American Statistical Association (2022)

  8. arXiv:1506.02686  [pdf, other]

    stat.ML cs.LG

    The LICORS Cabinet: Nonparametric Algorithms for Spatio-temporal Prediction

    Authors: George D. Montanez, Cosma Rohilla Shalizi

    Abstract: Spatio-temporal data is intrinsically high dimensional, so unsupervised modeling is only feasible if we can exploit structure in the process. When the dynamics are local in both space and time, this structure can be exploited by splitting the global field into many lower-dimensional "light cones". We review light cone decompositions for predictive state reconstruction, introducing three simple lig…

    Submitted 14 September, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

  9. Regularized brain reading with shrinkage and smoothing

    Authors: Leila Wehbe, Aaditya Ramdas, Rebecca C. Steorts, Cosma Rohilla Shalizi

    Abstract: Functional neuroimaging measures how the brain responds to complex stimuli. However, sample sizes are modest, noise is substantial, and stimuli are high dimensional. Hence, direct estimates are inherently imprecise and call for regularization. We compare a suite of approaches which regularize via shrinkage: ridge regression, the elastic net (a generalization of ridge regression and the lasso), and…

    Submitted 4 February, 2016; v1 submitted 25 January, 2014; originally announced January 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS837 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS837

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 4, 1997-2022

  10. arXiv:1309.4859  [pdf, ps, other]

    stat.ML

    Predictive PAC Learning and Process Decompositions

    Authors: Cosma Rohilla Shalizi, Aryeh Kontorovich

    Abstract: We informally call a stochastic process learnable if it admits a generalization error approaching zero in probability for any concept class with finite VC-dimension (IID processes are the simplest example). A mixture of learnable processes need not be learnable itself, and certainly its generalization error need not decay at the same rate. In this paper, we argue that it is natural in predictive P…

    Submitted 19 September, 2013; originally announced September 2013.

    Comments: 9 pages, accepted in NIPS 2013

    Journal ref: Advances in Neural Information Processing Systems 26 [NIPS 2013], pp. 1619--1627

  11. arXiv:1212.0463  [pdf, other]

    math.ST cs.LG stat.ML

    Nonparametric risk bounds for time-series forecasting

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: We derive generalization error bounds for traditional time-series forecasting models. Our results hold for many standard forecasting tools including autoregressive models, moving average models, and, more generally, linear state-space models. These non-asymptotic bounds need only weak assumptions on the data-generating process, yet allow forecasters to select among competing models and to guarante…

    Submitted 10 September, 2016; v1 submitted 3 December, 2012; originally announced December 2012.

    Comments: 34 pages, 3 figures

    MSC Class: 62M20 (Primary) 91B84; 62G99 (Secondary)

    Journal ref: Journal of Machine Learning Research. (2017). Vol 18. p. 1-40

  12. arXiv:1211.3760  [pdf, other]

    stat.ME stat.ML

    Mixed LICORS: A Nonparametric Algorithm for Predictive State Reconstruction

    Authors: Georg M. Goerg, Cosma Rohilla Shalizi

    Abstract: We introduce 'mixed LICORS', an algorithm for learning nonlinear, high-dimensional dynamics from spatio-temporal data, suitable for both prediction and simulation. Mixed LICORS extends the recent LICORS algorithm (Goerg and Shalizi, 2012) from hard clustering of predictive distributions to a non-parametric, EM-like soft clustering. This retains the asymptotic predictive optimality of LICORS, but,…

    Submitted 2 May, 2013; v1 submitted 15 November, 2012; originally announced November 2012.

    Comments: 11 pages; AISTATS 2013

    Journal ref: AISTATS 2013, pp. 289--297

  13. arXiv:1207.3994  [pdf, other]

    cs.SI cond-mat.stat-mech math.ST physics.soc-ph stat.ML

    Model Selection for Degree-corrected Block Models

    Authors: Xiaoran Yan, Cosma Rohilla Shalizi, Jacob E. Jensen, Florent Krzakala, Cristopher Moore, Lenka Zdeborova, Pan Zhang, Yaojia Zhu

    Abstract: The proliferation of models for networks raises challenging problems of model selection: the data are sparse and globally dependent, and models are typically high-dimensional and have large numbers of latent variables. Together, these issues mean that the usual model-selection criteria do not work properly for networks. We illustrate these challenges, and show one way to resolve them, by consideri…

    Submitted 30 May, 2013; v1 submitted 17 July, 2012; originally announced July 2012.

    Journal ref: J. Stat. Mech. (2014) P05007

  14. arXiv:1206.2398  [pdf, other]

    stat.ME nlin.CG physics.data-an

    LICORS: Light Cone Reconstruction of States for Non-parametric Forecasting of Spatio-Temporal Systems

    Authors: Georg M. Goerg, Cosma Rohilla Shalizi

    Abstract: We present a new, non-parametric forecasting method for data where continuous values are observed discretely in space and time. Our method, "light-cone reconstruction of states" (LICORS), uses physical principles to identify predictive states which are local properties of the system, both in space and time. LICORS discovers the number of predictive states and their predictive distributions automat…

    Submitted 3 August, 2012; v1 submitted 11 June, 2012; originally announced June 2012.

    Comments: Main text: 30 pages; supplementary material: 12 pages; 5+2 figures

  15. arXiv:1111.3404  [pdf, ps, other]

    stat.ML

    Estimated VC dimension for risk bounds

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: Vapnik-Chervonenkis (VC) dimension is a fundamental measure of the generalization capacity of learning algorithms. However, apart from a few special cases, it is hard or impossible to calculate analytically. Vapnik et al. [10] proposed a technique for estimating the VC dimension empirically. While their approach behaves well in simulations, it could not be used to bound the generalization risk of…

    Submitted 14 November, 2011; originally announced November 2011.

    Comments: 11 pages

  16. arXiv:1106.0730  [pdf, ps, other]

    stat.ML cs.LG

    Rademacher complexity of stationary sequences

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi

    Abstract: We show how to control the generalization error of time series models wherein past values of the outcome are used to predict future values. The results are based on a generalization of standard i.i.d. concentration inequalities to dependent data without the mixing assumptions common in the time series setting. Our proof and the result are simpler than previous analyses with dependent data or stoch…

    Submitted 22 May, 2017; v1 submitted 3 June, 2011; originally announced June 2011.

    Comments: 15 pages, 1 figure

  17. arXiv:1103.0949  [pdf, other]

    stat.ML cs.LG physics.data-an stat.ME

    Adapting to Non-stationarity with Growing Expert Ensembles

    Authors: Cosma Rohilla Shalizi, Abigail Z. Jacobs, Kristina Lisa Klinkner, Aaron Clauset

    Abstract: When dealing with time series with complex non-stationarities, low retrospective regret on individual realizations is a more appropriate goal than low prospective risk in expectation. Online learning algorithms provide powerful guarantees of this form, and have often been proposed for use with non-stationary processes because of their ability to switch between different forecasters or "experts".…

    Submitted 28 June, 2011; v1 submitted 4 March, 2011; originally announced March 2011.

    Comments: 9 pages, 1 figure; CMU Statistics Technical Report. v2: Added empirical example, revised discussion of related work

  18. arXiv:1103.0942  [pdf, other]

    stat.ML cs.LG

    Generalization error bounds for stationary autoregressive models

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: We derive generalization error bounds for stationary univariate autoregressive (AR) models. We show that imposing stationarity is enough to control the Gaussian complexity without further regularization. This lets us use structural risk minimization for model selection. We demonstrate our methods by predicting interest rate movements.

    Submitted 3 June, 2011; v1 submitted 4 March, 2011; originally announced March 2011.

    Comments: 10 pages, 3 figures. CMU Statistics Technical Report

  19. arXiv:1103.0941  [pdf, ps, other]

    stat.ML cs.LG math.PR

    Estimating $β$-mixing coefficients

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: The literature on statistical learning for time series assumes the asymptotic independence or "mixing" of the data-generating process. These mixing assumptions are never tested, nor are there methods for estimating mixing rates from data. We give an estimator for the $β$-mixing rate based on a single stationary sample path and show it is $L_1$-risk consistent.

    Submitted 4 March, 2011; originally announced March 2011.

    Comments: 9 pages, accepted by AIStats. CMU Statistics Technical Report

    Journal ref: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), pp. 516--524

  20. arXiv:1102.4101  [pdf, ps, other]

    stat.AP physics.data-an physics.soc-ph

    Scaling and Hierarchy in Urban Economies

    Authors: Cosma Rohilla Shalizi

    Abstract: In several recent publications, Bettencourt, West and collaborators claim that properties of cities such as gross economic production, personal income, numbers of patents filed, number of crimes committed, etc., show super-linear power-law scaling with total population, while measures of resource use show sub-linear power-law scaling. Re-analysis of the gross economic production and personal income fo…

    Submitted 7 April, 2011; v1 submitted 20 February, 2011; originally announced February 2011.

    Comments: v1: 15 pages, 9 figures, combines main text and supporting information into one document. Submitted to PNAS. v2: Text re-arranged to comply with journal policies; added analysis with logistic (asymptotically constant) scaling relations; minor corrections

  21. arXiv:1004.4704  [pdf, other]

    stat.AP cs.SI physics.data-an physics.soc-ph

    Homophily and Contagion Are Generically Confounded in Observational Social Network Studies

    Authors: Cosma Rohilla Shalizi, Andrew C. Thomas

    Abstract: We consider processes on social networks that can potentially involve three factors: homophily, or the formation of social ties due to matching individual traits; social contagion, also known as social influence; and the causal effect of an individual's covariates on their behavior or other measurable responses. We show that, generically, all of these are confounded with each other. Distinguishing…

    Submitted 29 November, 2010; v1 submitted 27 April, 2010; originally announced April 2010.

    Comments: 27 pages, 9 figures. V2: Revised in response to referees. V3: Ditto

    Journal ref: Sociological Methods and Research, vol. 40 (2011), pp. 211--239

  22. arXiv:1004.3476  [pdf, ps, other]

    stat.ME physics.data-an q-bio.NC

    Approximate Methods for State-Space Models

    Authors: Shinsuke Koyama, Lucia Castellanos Pérez-Bolde, Cosma Rohilla Shalizi, Robert E. Kass

    Abstract: State-space models provide an important body of techniques for analyzing time-series, but their use requires estimating unobserved states. The optimal estimate of the state is its conditional expectation given the observation histories, and computing this expectation is hard when there are nonlinearities. Existing filtering methods, including sequential Monte Carlo, tend to be either inaccurate…

    Submitted 20 April, 2010; originally announced April 2010.

    Comments: 31 pages, 4 figures. Different pagination from journal version due to incompatible style files but same content; the supplemental file for the journal appears here as appendices B--E.

    Journal ref: Journal of the American Statistical Association, volume 105, 2010, pp. 170--180

  23. arXiv:1001.0036  [pdf, other]

    q-bio.NC cs.IT nlin.AO physics.data-an stat.ML

    The Computational Structure of Spike Trains

    Authors: Robert Haslinger, Kristina Lisa Klinkner, Cosma Rohilla Shalizi

    Abstract: Neurons perform computations, and convey the results of those computations through the statistical structure of their output spike trains. Here we present a practical method, grounded in the information-theoretic analysis of prediction, for inferring a minimal representation of that structure and for characterizing its complexity. Starting from spike trains, our approach finds their causal state…

    Submitted 30 December, 2009; originally announced January 2010.

    Comments: Somewhat different format from journal version but same content

    Journal ref: Neural Computation, vol. 22 (2010), pp. 121--157

  24. arXiv:0706.1062  [pdf, ps, other]

    physics.data-an cond-mat.dis-nn stat.AP stat.ME

    Power-law distributions in empirical data

    Authors: Aaron Clauset, Cosma Rohilla Shalizi, M. E. J. Newman

    Abstract: Power-law distributions occur in many situations of scientific interest and have significant consequences for our understanding of natural and man-made phenomena. Unfortunately, the detection and characterization of power laws is complicated by the large fluctuations that occur in the tail of the distribution -- the part of the distribution representing large but rare events -- and by the diffic…

    Submitted 2 February, 2009; v1 submitted 7 June, 2007; originally announced June 2007.

    Comments: 43 pages, 11 figures, 7 tables, 4 appendices; code available at http://www.santafe.edu/~aaronc/powerlaws/

    Journal ref: SIAM Review 51, 661-703 (2009)