Showing 1–19 of 19 results for author: Shalizi, C R

Searching in archive math.
  1. arXiv:2203.09085  [pdf, ps, other]

    math.PR physics.data-an

    A Simple Non-Stationary Mean Ergodic Theorem, with Bonus Weak Law of Large Numbers

    Authors: Cosma Rohilla Shalizi

    Abstract: This brief pedagogical note re-proves a simple theorem on the convergence, in $L_2$ and in probability, of time averages of non-stationary time series to the mean of expectation values. The basic condition is that the sum of covariances grows sub-quadratically with the length of the time series. I make no claim to originality; the result is a widely, but unevenly, spread bit of folklore among users…

    Submitted 19 March, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: v2: Fixed notation to replace statements like $A_n \rightarrow m_n$ with ones like $A_n - m_n \rightarrow 0$; small wording changes and typo corrections in Remark 3

  2. arXiv:1912.03387  [pdf, other]

    math.ST stat.ME

    Conditional Mutual Information Estimation for Mixed Discrete and Continuous Variables with Nearest Neighbors

    Authors: Octavio César Mesner, Cosma Rohilla Shalizi

    Abstract: Fields like public health, public policy, and social science often want to quantify the degree of dependence between variables whose relationships take on unknown functional forms. Typically, in fact, researchers in these fields are attempting to evaluate causal theories, and so want to quantify dependence after conditioning on other variables that might explain, mediate or confound causal relatio…

    Submitted 6 December, 2019; originally announced December 2019.

  3. arXiv:1711.02834  [pdf, other]

    math.ST

    Bootstrapping Generalization Error Bounds for Time Series

    Authors: Robert Lunde, Cosma Rohilla Shalizi

    Abstract: We consider the problem of finding confidence intervals for the risk of forecasting the future of a stationary, ergodic stochastic process, using a model estimated from the past of the process. We show that a bootstrap procedure provides valid confidence intervals for the risk, when the data source is sufficiently mixing, and the loss function and the estimator are suitably smooth. Autoregressive…

    Submitted 29 November, 2017; v1 submitted 8 November, 2017; originally announced November 2017.

  4. arXiv:1711.02123  [pdf, ps, other]

    math.ST cs.SI physics.soc-ph

    Consistency of Maximum Likelihood for Continuous-Space Network Models I

    Authors: Cosma Rohilla Shalizi, Dena Marie Asta

    Abstract: A very popular class of models for networks posits that each node is represented by a point in a continuous latent space, and that the probability of an edge between nodes is a decreasing function of the distance between them in this latent space. We study the embedding problem for these models, of recovering the latent positions from the observed graph. Assuming certain natural symmetry and smoot…

    Submitted 29 June, 2022; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 17 pages

  5. arXiv:1709.09702  [pdf, other]

    math.ST

    Projective, Sparse, and Learnable Latent Position Network Models

    Authors: Neil A. Spencer, Cosma Rohilla Shalizi

    Abstract: When modeling network data using a latent position model, it is typical to assume that the nodes' positions are independently and identically distributed. However, this assumption implies the average node degree grows linearly with the number of nodes, which is inappropriate when the graph is thought to be sparse. We propose an alternative assumption -- that the latent positions are generated acco…

    Submitted 8 September, 2023; v1 submitted 27 September, 2017; originally announced September 2017.

    Comments: 70 pages, 2 figures

  6. arXiv:1411.1350  [pdf, other]

    math.ST

    Geometric Network Comparison

    Authors: Dena Asta, Cosma Rohilla Shalizi

    Abstract: Network analysis has a crucial need for tools to compare networks and assess the significance of differences between networks. We propose a principled statistical approach to network comparison that approximates networks as probability distributions on negatively curved manifolds. We outline the theory, as well as implement the approach on simulated networks.

    Submitted 5 November, 2014; originally announced November 2014.

  7. arXiv:1212.0463  [pdf, other]

    math.ST cs.LG stat.ML

    Nonparametric risk bounds for time-series forecasting

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: We derive generalization error bounds for traditional time-series forecasting models. Our results hold for many standard forecasting tools including autoregressive models, moving average models, and, more generally, linear state-space models. These non-asymptotic bounds need only weak assumptions on the data-generating process, yet allow forecasters to select among competing models and to guarante…

    Submitted 10 September, 2016; v1 submitted 3 December, 2012; originally announced December 2012.

    Comments: 34 pages, 3 figures

    MSC Class: 62M20 (Primary) 91B84; 62G99 (Secondary)

    Journal ref: Journal of Machine Learning Research. (2017). Vol 18. p. 1-40

  8. arXiv:1207.3994  [pdf, other]

    cs.SI cond-mat.stat-mech math.ST physics.soc-ph stat.ML

    Model Selection for Degree-corrected Block Models

    Authors: Xiaoran Yan, Cosma Rohilla Shalizi, Jacob E. Jensen, Florent Krzakala, Cristopher Moore, Lenka Zdeborova, Pan Zhang, Yaojia Zhu

    Abstract: The proliferation of models for networks raises challenging problems of model selection: the data are sparse and globally dependent, and models are typically high-dimensional and have large numbers of latent variables. Together, these issues mean that the usual model-selection criteria do not work properly for networks. We illustrate these challenges, and show one way to resolve them, by consideri…

    Submitted 30 May, 2013; v1 submitted 17 July, 2012; originally announced July 2012.

    Journal ref: J. Stat. Mech. (2014) P05007

  9. Consistency under sampling of exponential random graph models

    Authors: Cosma Rohilla Shalizi, Alessandro Rinaldo

    Abstract: The growing availability of network data and of scientific interest in distributed systems has led to the rapid development of statistical models of network structure. Typically, however, these are models for the entire network, while the data consists only of a sampled sub-network. Parameters for the whole network, which is what is of interest, are estimated by applying the model to the sub-netwo…

    Submitted 9 May, 2013; v1 submitted 13 November, 2011; originally announced November 2011.

    Comments: Published in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org) at http://dx.doi.org/10.1214/12-AOS1044

    Report number: IMS-AOS-AOS1044

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 2, 508-535

  10. Estimating beta-mixing coefficients via histograms

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: The literature on statistical learning for time series often assumes asymptotic independence or "mixing" of the data-generating process. These mixing assumptions are never tested, nor are there methods for estimating mixing coefficients from data. Additionally, for many common classes of processes (Markov processes, ARMA processes, etc.) general functional forms for various mixing rates are known,…

    Submitted 8 February, 2016; v1 submitted 27 September, 2011; originally announced September 2011.

    Comments: 30 pages, 8 figures. Longer version of arXiv:1103.0941 [stat.ML]

    Journal ref: Electron. J. Statist. 9 (2015), no. 2, 2855--2883

  11. arXiv:1103.0941  [pdf, ps, other]

    stat.ML cs.LG math.PR

    Estimating $β$-mixing coefficients

    Authors: Daniel J. McDonald, Cosma Rohilla Shalizi, Mark Schervish

    Abstract: The literature on statistical learning for time series assumes the asymptotic independence or ``mixing'' of the data-generating process. These mixing assumptions are never tested, nor are there methods for estimating mixing rates from data. We give an estimator for the $β$-mixing rate based on a single stationary sample path and show it is $L_1$-risk consistent.

    Submitted 4 March, 2011; originally announced March 2011.

    Comments: 9 pages, accepted by AIStats. CMU Statistics Technical Report

    Journal ref: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), pp. 516--524

  12. arXiv:1006.3868  [pdf, other]

    math.ST physics.data-an

    Philosophy and the practice of Bayesian statistics

    Authors: Andrew Gelman, Cosma Rohilla Shalizi

    Abstract: A substantial school in the philosophy of science identifies Bayesian inference with inductive inference and even rationality as such, and seems to be strengthened by the rise and practical success of Bayesian statistics. We argue that the most successful forms of Bayesian statistics do not actually support that particular philosophy but rather accord much better with sophisticated forms of hypoth…

    Submitted 28 June, 2011; v1 submitted 19 June, 2010; originally announced June 2010.

    Comments: 36 pages, 5 figures. v2: Fixed typo in caption of figure 1. v3: Further typo fixes. v4: Revised in response to referees

    Journal ref: British Journal of Mathematical and Statistical Psychology, vol. 66 (2013), pp. 8--38

  13. arXiv:0901.1342  [pdf, other]

    math.ST q-bio.PE

    Dynamics of Bayesian Updating with Dependent Data and Misspecified Models

    Authors: Cosma Rohilla Shalizi

    Abstract: Much is now known about the consistency of Bayesian updating on infinite-dimensional parameter spaces with independent or Markovian data. Necessary conditions for consistency include the prior putting enough weight on the correct neighborhoods of the data-generating distribution; various sufficient conditions further restrict the prior in ways analogous to capacity control in frequentist nonpara…

    Submitted 13 November, 2009; v1 submitted 11 January, 2009; originally announced January 2009.

    Comments: 36 pages, 1 figure. v2: typo fixes, minor formatting changes. v3: Improved notation, added references, new theorem on convergence rates. v4: minor changes to text, added references. v5: Minor typo corrections; matches journal version except for format details

    MSC Class: 62C10; 62G20; 62M09; 60F10; 62M05; 92D15; 94A17

    Journal ref: _Electronic Journal of Statistics_, vol. 3 (2009): 1039--1074

  14. arXiv:math/0701854  [pdf, ps, other]

    math.ST physics.data-an

    Maximum Likelihood Estimation for q-Exponential (Tsallis) Distributions

    Authors: Cosma Rohilla Shalizi

    Abstract: This expository note describes how to apply the method of maximum likelihood to estimate the parameters of the ``$q$-exponential'' distributions introduced by Tsallis and collaborators. It also describes the relationship of these distributions to the classical Pareto distributions.

    Submitted 31 January, 2007; v1 submitted 29 January, 2007; originally announced January 2007.

    Comments: 4 pages, 1 figure; accompanying R code available from http://bactra.org/research/tsallis-MLE/. V2: Added results on estimation from censored data, re-arranged introduction, minor corrections and wording changes throughout, updated code

    MSC Class: 62F10; 62P35

  15. arXiv:nlin/0508001  [pdf, ps, other]

    nlin.CG math.ST physics.data-an

    Automatic Filters for the Detection of Coherent Structure in Spatiotemporal Systems

    Authors: Cosma Rohilla Shalizi, Robert Haslinger, Jean-Baptiste Rouquier, Kristina Lisa Klinkner, Cristopher Moore

    Abstract: Most current methods for identifying coherent structures in spatially-extended systems rely on prior information about the form which those structures take. Here we present two new approaches to automatically filter the changing configurations of spatial dynamical systems and extract coherent structures. One, local sensitivity filtering, is a modification of the local Lyapunov exponent approach…

    Submitted 29 July, 2005; originally announced August 2005.

    Comments: 16 pages, 21 figures. Figures considerably compressed to fit arxiv requirements; write first author for higher-resolution versions

    Journal ref: Physical Review E 73 (2006): 036104

  16. arXiv:q-bio/0506009  [pdf, ps, other]

    q-bio.NC math.ST nlin.CD q-bio.QM

    Measuring Shared Information and Coordinated Activity in Neuronal Networks

    Authors: Kristina Lisa Klinkner, Cosma Rohilla Shalizi, Marcelo F. Camperi

    Abstract: Most nervous systems encode information about stimuli in the responding activity of large neuronal networks. This activity often manifests itself as dynamically coordinated sequences of action potentials. Since multiple electrode recordings are now a standard tool in neuroscience research, it is important to have a measure of such network-wide behavioral coordination and information sharing, app…

    Submitted 29 July, 2005; v1 submitted 7 June, 2005; originally announced June 2005.

    Comments: 8 pages, 6 figures

  17. arXiv:nlin/0409024  [pdf, ps, other]

    nlin.AO cond-mat.stat-mech math.ST nlin.CG physics.data-an

    Quantifying Self-Organization with Optimal Predictors

    Authors: Cosma Rohilla Shalizi, Kristina Lisa Shalizi, Robert Haslinger

    Abstract: Despite broad interest in self-organizing systems, there are few quantitative, experimentally-applicable criteria for self-organization. The existing criteria all give counter-intuitive results for important cases. In this Letter, we propose a new criterion, namely an internally-generated increase in the statistical complexity, the amount of information required for optimal prediction of the sys…

    Submitted 10 September, 2004; originally announced September 2004.

    Comments: Four pages, two color figures

    Journal ref: Physical Review Letters, vol. 93, no. 11 (10 September 2004), article 118701

  18. arXiv:cs/0406011  [pdf, ps, other]

    cs.LG math.ST nlin.CD physics.data-an

    Blind Construction of Optimal Nonlinear Recursive Predictors for Discrete Sequences

    Authors: Cosma Rohilla Shalizi, Kristina Lisa Shalizi

    Abstract: We present a new method for nonlinear prediction of discrete random sequences under minimal structural assumptions. We give a mathematical construction for optimal predictors of such processes, in the form of hidden Markov models. We then describe an algorithm, CSSR (Causal-State Splitting Reconstruction), which approximates the ideal predictor from data. We discuss the reliability of CSSR, its…

    Submitted 6 June, 2004; originally announced June 2004.

    Comments: 8 pages, 4 figures

    ACM Class: I.2.6

    Journal ref: pp. 504--511 in Max Chickering and Joseph Halpern (eds.), _Uncertainty in Artificial Intelligence: Proceedings of the Twentieth Conference_ (2004)

  19. arXiv:math/0305160  [pdf, ps, other]

    math.PR cond-mat.stat-mech nlin.CG physics.data-an

    Optimal Nonlinear Prediction of Random Fields on Networks

    Authors: Cosma Rohilla Shalizi

    Abstract: It is increasingly common to encounter time-varying random fields on networks (metabolic networks, sensor arrays, distributed computing, etc.). This paper considers the problem of optimal, nonlinear prediction of these fields, showing from an information-theoretic perspective that it is formally identical to the problem of finding minimal local sufficient statistics. I derive general properties…

    Submitted 16 June, 2003; v1 submitted 12 May, 2003; originally announced May 2003.

    Comments: 20 pages, 5 figures. For the conference "Discrete Models of Complex Systems" (Lyon, June, 2003). v2: Typos fixed, regenerated figures should now produce readable PDF output

    Journal ref: Discrete Mathematics and Theoretical Computer Science, vol. AB(DMCS), pp. 11--30 (2003)