Showing 1–16 of 16 results for author: Shalizi, C R

Searching in archive cs.
  1. arXiv:1711.02123

    math.ST cs.SI physics.soc-ph

    Consistency of Maximum Likelihood for Continuous-Space Network Models I

    Authors: Cosma Rohilla Shalizi, Dena Marie Asta

    Abstract: A very popular class of models for networks posits that each node is represented by a point in a continuous latent space, and that the probability of an edge between nodes is a decreasing function of the distance between them in this latent space. We study the embedding problem for these models, of recovering the latent positions from the observed graph. Assuming certain natural symmetry and smoot…

    Submitted 29 June, 2022; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 17 pages

  2. arXiv:1607.06565

    stat.ME cs.SI physics.soc-ph

    Estimating Causal Peer Influence in Homophilous Social Networks by Inferring Latent Locations

    Authors: Edward McFowland III, Cosma Rohilla Shalizi

    Abstract: Social influence cannot be identified from purely observational data on social networks, because such influence is generically confounded with latent homophily, i.e., with a node's network partners being informative about the node's attributes and therefore its behavior. If the network grows according to either a latent community (stochastic block) model, or a continuous latent space model, then l…

    Submitted 17 June, 2021; v1 submitted 22 July, 2016; originally announced July 2016.

    Comments: 35 pages, 4 figures

    Journal ref: Journal of the American Statistical Association (2022)

  3. arXiv:1506.02686

    stat.ML cs.LG

    The LICORS Cabinet: Nonparametric Algorithms for Spatio-temporal Prediction

    Authors: George D. Montanez, Cosma Rohilla Shalizi

    Abstract: Spatio-temporal data is intrinsically high dimensional, so unsupervised modeling is only feasible if we can exploit structure in the process. When the dynamics are local in both space and time, this structure can be exploited by splitting the global field into many lower-dimensional "light cones". We review light cone decompositions for predictive state reconstruction, introducing three simple lig…

    Submitted 14 September, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

  4. arXiv:1212.0463

    math.ST cs.LG stat.ML

    Nonparametric risk bounds for time-series forecasting

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: We derive generalization error bounds for traditional time-series forecasting models. Our results hold for many standard forecasting tools including autoregressive models, moving average models, and, more generally, linear state-space models. These non-asymptotic bounds need only weak assumptions on the data-generating process, yet allow forecasters to select among competing models and to guarante…

    Submitted 10 September, 2016; v1 submitted 3 December, 2012; originally announced December 2012.

    Comments: 34 pages, 3 figures

    MSC Class: 62M20 (Primary) 91B84; 62G99 (Secondary)

    Journal ref: Journal of Machine Learning Research. (2017). Vol 18. p. 1-40

  5. arXiv:1207.3994

    cs.SI cond-mat.stat-mech math.ST physics.soc-ph stat.ML

    Model Selection for Degree-corrected Block Models

    Authors: Xiaoran Yan, Cosma Rohilla Shalizi, Jacob E. Jensen, Florent Krzakala, Cristopher Moore, Lenka Zdeborova, Pan Zhang, Yaojia Zhu

    Abstract: The proliferation of models for networks raises challenging problems of model selection: the data are sparse and globally dependent, and models are typically high-dimensional and have large numbers of latent variables. Together, these issues mean that the usual model-selection criteria do not work properly for networks. We illustrate these challenges, and show one way to resolve them, by consideri…

    Submitted 30 May, 2013; v1 submitted 17 July, 2012; originally announced July 2012.

    Journal ref: J. Stat. Mech. (2014) P05007

  6. arXiv:1106.0730

    stat.ML cs.LG

    Rademacher complexity of stationary sequences

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi

    Abstract: We show how to control the generalization error of time series models wherein past values of the outcome are used to predict future values. The results are based on a generalization of standard i.i.d. concentration inequalities to dependent data without the mixing assumptions common in the time series setting. Our proof and the result are simpler than previous analyses with dependent data or stoch…

    Submitted 22 May, 2017; v1 submitted 3 June, 2011; originally announced June 2011.

    Comments: 15 pages, 1 figure

  7. arXiv:1103.0949

    stat.ML cs.LG physics.data-an stat.ME

    Adapting to Non-stationarity with Growing Expert Ensembles

    Authors: Cosma Rohilla Shalizi, Abigail Z. Jacobs, Kristina Lisa Klinkner, Aaron Clauset

    Abstract: When dealing with time series with complex non-stationarities, low retrospective regret on individual realizations is a more appropriate goal than low prospective risk in expectation. Online learning algorithms provide powerful guarantees of this form, and have often been proposed for use with non-stationary processes because of their ability to switch between different forecasters or ``experts''.…

    Submitted 28 June, 2011; v1 submitted 4 March, 2011; originally announced March 2011.

    Comments: 9 pages, 1 figure; CMU Statistics Technical Report. v2: Added empirical example, revised discussion of related work

  8. arXiv:1103.0942

    stat.ML cs.LG

    Generalization error bounds for stationary autoregressive models

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: We derive generalization error bounds for stationary univariate autoregressive (AR) models. We show that imposing stationarity is enough to control the Gaussian complexity without further regularization. This lets us use structural risk minimization for model selection. We demonstrate our methods by predicting interest rate movements.

    Submitted 3 June, 2011; v1 submitted 4 March, 2011; originally announced March 2011.

    Comments: 10 pages, 3 figures. CMU Statistics Technical Report

  9. arXiv:1103.0941

    stat.ML cs.LG math.PR

    Estimating $β$-mixing coefficients

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: The literature on statistical learning for time series assumes the asymptotic independence or ``mixing'' of the data-generating process. These mixing assumptions are never tested, nor are there methods for estimating mixing rates from data. We give an estimator for the $β$-mixing rate based on a single stationary sample path and show it is $L_1$-risk consistent.

    Submitted 4 March, 2011; originally announced March 2011.

    Comments: 9 pages, accepted by AIStats. CMU Statistics Technical Report

    Journal ref: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), pp. 516--524

  10. arXiv:1004.4704

    stat.AP cs.SI physics.data-an physics.soc-ph

    Homophily and Contagion Are Generically Confounded in Observational Social Network Studies

    Authors: Cosma Rohilla Shalizi, Andrew C. Thomas

    Abstract: We consider processes on social networks that can potentially involve three factors: homophily, or the formation of social ties due to matching individual traits; social contagion, also known as social influence; and the causal effect of an individual's covariates on their behavior or other measurable responses. We show that, generically, all of these are confounded with each other. Distinguishing…

    Submitted 29 November, 2010; v1 submitted 27 April, 2010; originally announced April 2010.

    Comments: 27 pages, 9 figures. V2: Revised in response to referees. V3: Ditto

    Journal ref: Sociological Methods and Research, vol. 40 (2011), pp. 211--239

  11. arXiv:1001.0036

    q-bio.NC cs.IT nlin.AO physics.data-an stat.ML

    The Computational Structure of Spike Trains

    Authors: Robert Haslinger, Kristina Lisa Klinkner, Cosma Rohilla Shalizi

    Abstract: Neurons perform computations, and convey the results of those computations through the statistical structure of their output spike trains. Here we present a practical method, grounded in the information-theoretic analysis of prediction, for inferring a minimal representation of that structure and for characterizing its complexity. Starting from spike trains, our approach finds their causal state…

    Submitted 30 December, 2009; originally announced January 2010.

    Comments: Somewhat different format from journal version but same content

    Journal ref: Neural Computation, vol. 22 (2010), pp. 121--157

  12. arXiv:0710.4911

    cs.CY physics.soc-ph

    Social Media as Windows on the Social Life of the Mind

    Authors: Cosma Rohilla Shalizi

    Abstract: This is a programmatic paper, marking out two directions in which the study of social media can contribute to broader problems of social science: understanding cultural evolution and understanding collective cognition. Under the first heading, I discuss some difficulties with the usual, adaptationist explanations of cultural phenomena, alternative explanations involving network diffusion effects…

    Submitted 25 October, 2007; originally announced October 2007.

    Comments: 6 pages, 1 figure, AAAI format, submitted to AAAI spring 2008 symposium on "Social Information Processing"

  13. arXiv:cs/0406011

    cs.LG math.ST nlin.CD physics.data-an

    Blind Construction of Optimal Nonlinear Recursive Predictors for Discrete Sequences

    Authors: Cosma Rohilla Shalizi, Kristina Lisa Shalizi

    Abstract: We present a new method for nonlinear prediction of discrete random sequences under minimal structural assumptions. We give a mathematical construction for optimal predictors of such processes, in the form of hidden Markov models. We then describe an algorithm, CSSR (Causal-State Splitting Reconstruction), which approximates the ideal predictor from data. We discuss the reliability of CSSR, its…

    Submitted 6 June, 2004; originally announced June 2004.

    Comments: 8 pages, 4 figures

    ACM Class: I.2.6

    Journal ref: pp. 504--511 in Max Chickering and Joseph Halpern (eds.), _Uncertainty in Artificial Intelligence: Proceedings of the Twentieth Conference_ (2004)

  14. arXiv:cs/0210025

    cs.LG cs.CL

    An Algorithm for Pattern Discovery in Time Series

    Authors: Cosma Rohilla Shalizi, Kristina Lisa Shalizi, James P. Crutchfield

    Abstract: We present a new algorithm for discovering patterns in time series and other sequential data. We exhibit a reliable procedure for building the minimal set of hidden, Markovian states that is statistically capable of producing the behavior exhibited in the data -- the underlying process's causal states. Unlike conventional methods for fitting hidden Markov models (HMMs) to data, our algorithm mak…

    Submitted 26 November, 2002; v1 submitted 28 October, 2002; originally announced October 2002.

    Comments: 26 pages, 5 figures; 5 tables; http://www.santafe.edu/projects/CompMech Added discussion of algorithm parameters; improved treatment of convergence and time complexity; added comparison to older methods

    Report number: SFI Working Paper 02-10-060

    ACM Class: I.2.6; H.1.1; E.4

  15. arXiv:nlin/0006025

    nlin.AO cond-mat.dis-nn cs.LG physics.data-an

    Information Bottlenecks, Causal States, and Statistical Relevance Bases: How to Represent Relevant Information in Memoryless Transduction

    Authors: Cosma Rohilla Shalizi, James P. Crutchfield

    Abstract: Discovering relevant, but possibly hidden, variables is a key step in constructing useful and predictive theories about the natural world. This brief note explains the connections between three approaches to this problem: the recently introduced information-bottleneck method, the computational mechanics approach to inferring optimal models, and Salmon's statistical relevance basis.

    Submitted 16 June, 2000; originally announced June 2000.

    Comments: 3 pages, no figures, submitted to PRE as a "brief report". Revision: added an acknowledgements section originally omitted by a LaTeX bug

    Journal ref: Advances in Complex Systems, vol. 5, pp. 91--95 (2002)

  16. arXiv:cs/0001027

    cs.LG cs.NE

    Pattern Discovery and Computational Mechanics

    Authors: Cosma Rohilla Shalizi, James P. Crutchfield

    Abstract: Computational mechanics is a method for discovering, describing and quantifying patterns, using tools from statistical physics. It constructs optimal, minimal models of stochastic processes and their underlying causal structures. These models tell us about the intrinsic computation embedded within a process---how it stores and transforms information. Here we summarize the mathematics of computat…

    Submitted 28 January, 2000; originally announced January 2000.

    Comments: 12 pages, 3 figures; submitted to the Proceedings of the 17th International Conference on Machine Learning (differs slightly in pagination and citation format from that version)

    Report number: SFI 00-01-008

    ACM Class: I.2.6; F.1.3; G.3; H.1.1