Showing 1–43 of 43 results for author: Fox, E B

  1. arXiv:2402.17233  [pdf, other]

    cs.LG stat.AP stat.ME

    Hybrid$^2$ Neural ODE Causal Modeling and an Application to Glycemic Response

    Authors: Bob Junyi Zou, Matthew E. Levine, Dessi P. Zaharieva, Ramesh Johari, Emily B. Fox

    Abstract: Hybrid models composing mechanistic ODE-based dynamics with flexible and expressive neural network components have grown rapidly in popularity, especially in scientific domains where such ODE-based modeling offers important interpretability and validated causal grounding (e.g., for counterfactual reasoning). The incorporation of mechanistic models also provides inductive bias in standard blackbox…

    Submitted 11 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  2. arXiv:2312.03344  [pdf, other]

    cs.LG math.DS stat.AP stat.ML

    Interpretable Mechanistic Representations for Meal-level Glycemic Control in the Wild

    Authors: Ke Alexander Wang, Emily B. Fox

    Abstract: Diabetes encompasses a complex landscape of glycemic control that varies widely among individuals. However, current methods do not faithfully capture this variability at the meal level. On the one hand, expert-crafted features lack the flexibility of data-driven methods; on the other hand, learned representations tend to be uninterpretable which hampers clinical adoption. In this paper, we propose…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Proceedings of Machine Learning for Health (ML4H) 2023. Code available at: https://github.com/KeAWang/interpretable-cgm-representations

  3. arXiv:2305.01638  [pdf, other]

    cs.LG cs.CV stat.ML

    Sequence Modeling with Multiresolution Convolutional Memory

    Authors: Jiaxin Shi, Ke Alexander Wang, Emily B. Fox

    Abstract: Efficiently capturing the long-range patterns in sequential data sources salient to a given task -- such as classification and generative modeling -- poses a fundamental challenge. Popular approaches in the space trade off between the memory burden of brute-force enumeration and comparison, as in transformers, the computational burden of complicated sequential dependencies, as in recurrent neural n…

    Submitted 1 November, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ICML 2023, Source code: https://github.com/thjashin/multires-conv

  4. arXiv:2105.02675  [pdf, other]

    stat.ME cs.LG stat.ML

    Granger Causality: A Review and Recent Advances

    Authors: Ali Shojaie, Emily B. Fox

    Abstract: Introduced more than a half century ago, Granger causality has become a popular tool for analyzing time series data in many application domains, from economics and finance to genomics and neuroscience. Despite this popularity, the validity of this notion for inferring causal relationships among time series has remained the topic of continuous debate. Moreover, while the original definition was gen…

    Submitted 6 May, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: 40 pages, 12 figures

  5. arXiv:2104.12231  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Model-based metrics: Sample-efficient estimates of predictive model subpopulation performance

    Authors: Andrew C. Miller, Leon A. Gatys, Joseph Futoma, Emily B. Fox

    Abstract: Machine learning models $-$ now commonly developed to screen, diagnose, or predict health conditions $-$ are evaluated with a variety of performance metrics. An important first step in assessing the practical utility of a model is to evaluate its average performance over an entire population of interest. In many settings, it is also critical that the model makes good predictions within predefined…

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: 27 pages, 8 figures

  6. arXiv:2104.12219  [pdf, other]

    stat.ML cs.LG stat.ME

    Breiman's two cultures: You don't have to choose sides

    Authors: Andrew C. Miller, Nicholas J. Foti, Emily B. Fox

    Abstract: Breiman's classic paper casts data analysis as a choice between two cultures: data modelers and algorithmic modelers. Stated broadly, data modelers use simple, interpretable models with well-understood theoretical properties to analyze data. Algorithmic modelers prioritize predictive accuracy and use more flexible function approximations to analyze data. This dichotomy overlooks a third set of mod…

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: Commentary to appear in a special issue of Observational Studies, discussing Leo Breiman's paper "Statistical Modeling: The Two Cultures" (https://doi.org/10.1214/ss/1009213726)

  7. arXiv:2012.00110  [pdf, other]

    stat.ML cs.LG stat.AP

    Representing and Denoising Wearable ECG Recordings

    Authors: Jeffrey Chan, Andrew C. Miller, Emily B. Fox

    Abstract: Modern wearable devices are embedded with a range of noninvasive biomarker sensors that hold promise for improving detection and treatment of disease. One such sensor is the single-lead electrocardiogram (ECG) which measures electrical signals in the heart. The benefits of the sheer volume of ECG measurements with rich longitudinal structure made possible by wearables come at the price of potentia…

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: ML for Mobile Health Workshop, NeurIPS 2020

  8. arXiv:1911.05683  [pdf, other]

    cs.LG cs.HC stat.ML

    Modeling patterns of smartphone usage and their relationship to cognitive health

    Authors: Jonas Rauber, Emily B. Fox, Leon A. Gatys

    Abstract: The ubiquity of smartphone usage in many people's lives makes it a rich source of information about a person's mental and cognitive state. In this work we analyze 12 weeks of phone usage data from 113 older adults, 31 with diagnosed cognitive impairment and 82 without. We develop structured models of users' smartphone interactions to reveal differences in phone usage patterns between people with an…

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  9. arXiv:1905.07473  [pdf, other]

    cs.LG math.OC stat.ML

    Adaptively Truncating Backpropagation Through Time to Control Gradient Bias

    Authors: Christopher Aicher, Nicholas J. Foti, Emily B. Fox

    Abstract: Truncated backpropagation through time (TBPTT) is a popular method for learning in recurrent neural networks (RNNs) that saves computation and memory at the cost of bias by truncating backpropagation after a fixed number of lags. In practice, choosing the optimal truncation length is difficult: TBPTT will not converge if the truncation length is too small, or will converge slowly if it is too larg…

    Submitted 1 July, 2019; v1 submitted 17 May, 2019; originally announced May 2019.

  10. arXiv:1901.10568  [pdf, other]

    stat.ML cs.LG stat.CO

    Stochastic Gradient MCMC for Nonlinear State Space Models

    Authors: Christopher Aicher, Srshti Putcha, Christopher Nemeth, Paul Fearnhead, Emily B. Fox

    Abstract: State space models (SSMs) provide a flexible framework for modeling complex time series via a latent stochastic process. Inference for nonlinear, non-Gaussian SSMs is often tackled with particle methods that do not scale well to long time series. The challenge is two-fold: not only do computations scale linearly with time, as in the linear case, but particle filters additionally suffer from increa…

    Submitted 16 July, 2023; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: To appear in Bayesian Analysis

  11. arXiv:1810.09098  [pdf, other]

    stat.ML cs.LG stat.CO

    Stochastic Gradient MCMC for State Space Models

    Authors: Christopher Aicher, Yi-An Ma, Nicholas J. Foti, Emily B. Fox

    Abstract: State space models (SSMs) are a flexible approach to modeling complex time series. However, inference in SSMs is often computationally prohibitive for long time series. Stochastic gradient MCMC (SGMCMC) is a popular method for scalable Bayesian inference for large independent data. Unfortunately when applied to dependent data, such as in SSMs, SGMCMC's stochastic gradient estimates are biased as t…

    Submitted 9 July, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

  12. arXiv:1807.07621  [pdf, other]

    stat.ML cs.LG stat.CO

    Approximate Collapsed Gibbs Clustering with Expectation Propagation

    Authors: Christopher Aicher, Emily B. Fox

    Abstract: We develop a framework for approximating collapsed Gibbs sampling in generative latent variable cluster models. Collapsed Gibbs is a popular MCMC method, which integrates out variables in the posterior to improve mixing. Unfortunately for many complex models, integrating out these variables is either analytically or computationally intractable. We efficiently approximate the necessary collapsed Gi…

    Submitted 19 July, 2018; originally announced July 2018.

  13. arXiv:1806.09060  [pdf, other]

    cs.LG stat.ML

    Disentangled VAE Representations for Multi-Aspect and Missing Data

    Authors: Samuel K. Ainsworth, Nicholas J. Foti, Emily B. Fox

    Abstract: Many problems in machine learning and related application areas are fundamentally variants of conditional modeling and sampling across multi-aspect data, either multi-view, multi-modal, or simply multi-group. For example, sampling from the distribution of English sentences conditioned on a given French sentence or sampling audio waveforms conditioned on a given piece of text. Central to many of th…

    Submitted 23 June, 2018; originally announced June 2018.

  14. arXiv:1806.07137  [pdf, other]

    stat.CO cs.LG stat.ML

    Large-Scale Stochastic Sampling from the Probability Simplex

    Authors: Jack Baker, Paul Fearnhead, Emily B. Fox, Christopher Nemeth

    Abstract: Stochastic gradient Markov chain Monte Carlo (SGMCMC) has become a popular method for scalable Bayesian inference. These methods are based on sampling a discrete-time approximation to a continuous time process, such as the Langevin diffusion. When applied to distributions defined on a constrained space the time-discretization error can dominate when we are near the boundary of the space. We demons…

    Submitted 26 October, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted to Advances in Neural Information Processing Systems (2018)

  15. arXiv:1711.08392  [pdf, other]

    stat.ML

    An Efficient ADMM Algorithm for Structural Break Detection in Multivariate Time Series

    Authors: Alex Tank, Emily B. Fox, Ali Shojaie

    Abstract: We present an efficient alternating direction method of multipliers (ADMM) algorithm for segmenting a multivariate non-stationary time series with structural breaks into stationary regions. We draw from recent work where the series is assumed to follow a vector autoregressive model within segments and a convex estimation procedure may be formulated using group fused lasso penalties. Our ADMM appro…

    Submitted 25 June, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Accepted to the NIPS Time Series Workshop 2017

  16. arXiv:1711.08160  [pdf, other]

    stat.ML

    An Interpretable and Sparse Neural Network Model for Nonlinear Granger Causality Discovery

    Authors: Alex Tank, Ian Cover, Nicholas J. Foti, Ali Shojaie, Emily B. Fox

    Abstract: While most classical approaches to Granger causality detection repose upon linear time series assumptions, many interactions in neuroscience and economics applications are nonlinear. We develop an approach to nonlinear Granger causality detection using multilayer perceptrons where the input to the network is the past time lags of all series and the output is the future value of a single series. A…

    Submitted 25 June, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Accepted to the NIPS Time Series Workshop 2017

  17. arXiv:1710.00578  [pdf, other]

    stat.CO stat.AP stat.ML

    sgmcmc: An R Package for Stochastic Gradient Markov Chain Monte Carlo

    Authors: Jack Baker, Paul Fearnhead, Emily B. Fox, Christopher Nemeth

    Abstract: This paper introduces the R package sgmcmc, which can be used for Bayesian inference on problems with large datasets using stochastic gradient Markov chain Monte Carlo (SGMCMC). Traditional Markov chain Monte Carlo (MCMC) methods, such as Metropolis-Hastings, are known to run prohibitively slowly as the dataset size increases. SGMCMC solves this issue by only using a subset of data at each iterati…

    Submitted 13 April, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

  18. arXiv:1707.09380  [pdf, other]

    stat.AP

    Dynamics of homelessness in urban America

    Authors: Chris Glynn, Emily B. Fox

    Abstract: The relationship between housing costs and homelessness has important implications for the way that city and county governments respond to increasing homeless populations. Though many analyses in the public policy literature have examined inter-community variation in homelessness rates to identify causal mechanisms of homelessness (Byrne et al., 2013; Lee et al., 2003; Fargo et al., 2013), few stu…

    Submitted 28 July, 2017; originally announced July 2017.

    Comments: 54 pages, 36 figures

  19. arXiv:1706.05439  [pdf, other]

    stat.CO cs.LG stat.ML

    Control Variates for Stochastic Gradient MCMC

    Authors: Jack Baker, Paul Fearnhead, Emily B. Fox, Christopher Nemeth

    Abstract: It is well known that Markov chain Monte Carlo (MCMC) methods scale poorly with dataset size. A popular class of methods for solving this issue is stochastic gradient MCMC. These methods use a noisy estimate of the gradient of the log posterior, which reduces the per iteration computational cost of the algorithm. Despite this, there are a number of results suggesting that stochastic gradient Lange…

    Submitted 14 December, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

  20. arXiv:1706.04632  [pdf, other]

    stat.ML

    Stochastic Gradient MCMC Methods for Hidden Markov Models

    Authors: Yi-An Ma, Nicholas J. Foti, Emily B. Fox

    Abstract: Stochastic gradient MCMC (SG-MCMC) algorithms have proven useful in scaling Bayesian inference to large datasets under an assumption of i.i.d. data. We instead develop an SG-MCMC algorithm to learn the parameters of hidden Markov models (HMMs) for time-dependent data. There are two challenges to applying SG-MCMC in this setting: The latent discrete states, and needing to break dependencies when con…

    Submitted 14 June, 2017; originally announced June 2017.

  21. arXiv:1706.02781  [pdf, other]

    stat.ME

    Granger Causality Networks for Categorical Time Series

    Authors: Alex Tank, Emily B. Fox, Ali Shojaie

    Abstract: We present a new framework for learning Granger causality networks for multivariate categorical time series, based on the mixture transition distribution (MTD) model. Traditionally, MTD is plagued by a nonconvex objective, non-identifiability, and presence of many local optima. To circumvent these problems, we recast inference in the MTD as a convex problem. The new formulation facilitates the app…

    Submitted 8 June, 2017; originally announced June 2017.

  22. arXiv:1704.02519  [pdf, other]

    stat.ME

    Identifiability and Estimation of Structural Vector Autoregressive Models for Subsampled and Mixed Frequency Time Series

    Authors: Alex Tank, Emily B. Fox, Ali Shojaie

    Abstract: Causal inference in multivariate time series is challenging due to the fact that the sampling rate may not be as fast as the timescale of the causal interactions. In this context, we can view our observed series as a subsampled version of the desired series. Furthermore, due to technological and other limitations, series may be observed at different sampling rates, representing a mixed frequency s…

    Submitted 8 April, 2017; originally announced April 2017.

  23. Irreversible Samplers from Jump and Continuous Markov Processes

    Authors: Yi-An Ma, Emily B. Fox, Tianqi Chen, Lei Wu

    Abstract: In this paper, we propose irreversible versions of the Metropolis Hastings (MH) and Metropolis adjusted Langevin algorithm (MALA) with a main focus on the latter. For the former, we show how one can simply switch between different proposal and acceptance distributions upon rejection to obtain an irreversible jump sampler (I-Jump). The resulting algorithm has a simple implementation akin to MH, but…

    Submitted 12 March, 2018; v1 submitted 21 August, 2016; originally announced August 2016.

    Journal ref: Stat. Comput. (2018) 1-26

  24. arXiv:1506.04696  [pdf, other]

    math.ST stat.ME stat.ML

    A Complete Recipe for Stochastic Gradient MCMC

    Authors: Yi-An Ma, Tianqi Chen, Emily B. Fox

    Abstract: Many recent Markov chain Monte Carlo (MCMC) samplers leverage continuous dynamics to define a transition kernel that efficiently explores a target distribution. In tandem, a focus has been on devising scalable variants that subsample the data and use stochastic gradients in place of full-data gradients in the dynamic simulations. However, such stochastic gradient MCMC samplers have lagged behind t…

    Submitted 31 October, 2015; v1 submitted 15 June, 2015; originally announced June 2015.

  25. arXiv:1505.01164  [pdf, other]

    stat.AP stat.ME stat.ML

    Achieving a Hyperlocal Housing Price Index: Overcoming Data Sparsity by Bayesian Dynamical Modeling of Multiple Data Streams

    Authors: You Ren, Emily B. Fox, Andrew Bruce

    Abstract: Understanding how housing values evolve over time is important to policy makers, consumers and real estate professionals. Existing methods for constructing housing indices are computed at a coarse spatial granularity, such as metropolitan regions, which can mask or distort price dynamics apparent in local markets, such as neighborhoods and census tracts. A challenge in moving to estimates at, for…

    Submitted 5 May, 2015; originally announced May 2015.

  26. arXiv:1412.0694  [pdf, other]

    stat.ML

    Streaming Variational Inference for Bayesian Nonparametric Mixture Models

    Authors: Alex Tank, Nicholas J. Foti, Emily B. Fox

    Abstract: In theory, Bayesian nonparametric (BNP) models are well suited to streaming data scenarios due to their ability to adapt model complexity with the observed data. Unfortunately, such benefits have not been fully realized in practice; existing inference algorithms are either not applicable to streaming applications or not extensible to BNP models. For the special case of Dirichlet processes, streami…

    Submitted 21 April, 2015; v1 submitted 1 December, 2014; originally announced December 2014.

  27. arXiv:1411.1670  [pdf, other]

    stat.ML

    Stochastic Variational Inference for Hidden Markov Models

    Authors: Nicholas J. Foti, Jason Xu, Dillon Laird, Emily B. Fox

    Abstract: Variational inference algorithms have proven successful for Bayesian analysis in large data settings, with recent advances using stochastic variational inference (SVI). However, such methods have largely been studied in independent or exchangeable data settings. We develop an SVI algorithm to learn the parameters of hidden Markov models (HMMs) in a time-dependent data setting. The challenge in app…

    Submitted 6 November, 2014; originally announced November 2014.

    Comments: Appears in Advances in Neural Information Processing Systems (NIPS), 2014

  28. arXiv:1402.6951  [pdf, other]

    stat.ML q-bio.NC stat.AP

    Modeling the Complex Dynamics and Changing Correlations of Epileptic Events

    Authors: Drausin F. Wulsin, Emily B. Fox, Brian Litt

    Abstract: Patients with epilepsy can manifest short, sub-clinical epileptic "bursts" in addition to full-blown clinical seizures. We believe the relationship between these two classes of events---something not previously studied quantitatively---could yield important insights into the nature and intrinsic dynamics of seizures. A goal of our work is to parse these complex epileptic events into distinct dynam…

    Submitted 13 July, 2014; v1 submitted 27 February, 2014; originally announced February 2014.

  29. arXiv:1402.4862  [pdf, other]

    stat.ML cs.LG

    Learning the Parameters of Determinantal Point Process Kernels

    Authors: Raja Hafiz Affandi, Emily B. Fox, Ryan P. Adams, Ben Taskar

    Abstract: Determinantal point processes (DPPs) are well-suited for modeling repulsion and have proven useful in many applications where diversity is desired. While DPPs have many appealing properties, such as efficient sampling, learning the parameters of a DPP is still considered a difficult problem due to the non-convex nature of the likelihood function. In this paper, we propose using Bayesian methods to…

    Submitted 19 February, 2014; originally announced February 2014.

  30. arXiv:1402.4102  [pdf, other]

    stat.ME cs.LG stat.ML

    Stochastic Gradient Hamiltonian Monte Carlo

    Authors: Tianqi Chen, Emily B. Fox, Carlos Guestrin

    Abstract: Hamiltonian Monte Carlo (HMC) sampling methods provide a mechanism for defining distant proposals with high acceptance probabilities in a Metropolis-Hastings framework, enabling more efficient exploration of the state space than standard random-walk proposals. The popularity of such methods has grown significantly in recent years. However, a limitation of HMC methods is the required gradient compu…

    Submitted 12 May, 2014; v1 submitted 17 February, 2014; originally announced February 2014.

    Comments: ICML 2014 version

  31. arXiv:1401.1137  [pdf, other]

    stat.ME cs.SI math.ST stat.ML

    Sparse graphs using exchangeable random measures

    Authors: François Caron, Emily B. Fox

    Abstract: Statistical network modeling has focused on representing the graph as a discrete structure, namely the adjacency matrix, and considering the exchangeability of this array. In such cases, the Aldous-Hoover representation theorem (Aldous, 1981; Hoover, 1979) applies and informs us that the graph is necessarily either dense or empty. In this paper, we instead consider representing the graph as a mea…

    Submitted 27 March, 2015; v1 submitted 6 January, 2014; originally announced January 2014.

    Comments: New title. Extended version

  32. arXiv:1311.2971  [pdf, other]

    stat.ML cs.LG stat.ME

    Approximate Inference in Continuous Determinantal Point Processes

    Authors: Raja Hafiz Affandi, Emily B. Fox, Ben Taskar

    Abstract: Determinantal point processes (DPPs) are random point processes well-suited for modeling repulsion. In machine learning, the focus of DPP-based models has been on diverse subset selection from a discrete and finite base set. This discrete setting admits an efficient sampling algorithm based on the eigendecomposition of the defining kernel matrix. Recently, there has been growing interest in using…

    Submitted 12 November, 2013; originally announced November 2013.

  33. arXiv:1309.3533  [pdf, ps, other]

    stat.ME cs.LG stat.ML

    Mixed Membership Models for Time Series

    Authors: Emily B. Fox, Michael I. Jordan

    Abstract: In this article we discuss some of the consequences of the mixed membership perspective on time series analysis. In its most abstract form, a mixed membership model aims to associate an individual entity with some set of attributes based on a collection of observed data. Although much of the literature on mixed membership models considers the setting in which exchangeable collections of data are a…

    Submitted 13 September, 2013; originally announced September 2013.

  34. arXiv:1308.4747  [pdf, ps, other]

    stat.ME stat.ML

    Joint modeling of multiple time series via the beta process with application to motion capture segmentation

    Authors: Emily B. Fox, Michael C. Hughes, Erik B. Sudderth, Michael I. Jordan

    Abstract: We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our model discovers a latent set of dynamical behaviors shared among the sequences, and segments each time series into regions defined by a subset of these behaviors. Using a beta process prior, the size of the behavior set and the sharing pattern are both inferred from data. We develop Ma…

    Submitted 13 November, 2014; v1 submitted 21 August, 2013; originally announced August 2013.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) at http://dx.doi.org/10.1214/14-AOAS742 by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: text overlap with arXiv:1111.4226

    Report number: IMS-AOAS-AOAS742

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 3, 1281-1313

  35. arXiv:1304.6777  [pdf, ps, other]

    cs.SI physics.soc-ph stat.AP

    A Bayesian approach for predicting the popularity of tweets

    Authors: Tauhid Zaman, Emily B. Fox, Eric T. Bradlow

    Abstract: We predict the popularity of short messages called tweets created in the micro-blogging site known as Twitter. We measure the popularity of a tweet by the time-series path of its retweets, which is when people forward the tweet to others. We develop a probabilistic model for the evolution of the retweets using a Bayesian approach, and form predictions using only observations on the retweet times a…

    Submitted 24 November, 2014; v1 submitted 24 April, 2013; originally announced April 2013.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) at http://dx.doi.org/10.1214/14-AOAS741 by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS741

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 3, 1583-1611

  36. arXiv:1304.5642  [pdf, other]

    stat.AP stat.ME

    Spatio-Temporal Low Count Processes with Application to Violent Crime Events

    Authors: Sivan Aldor-Noiman, Lawrence D. Brown, Emily B. Fox, Robert A. Stine

    Abstract: There is significant interest in being able to predict where crimes will happen, for example to aid in the efficient tasking of police and other protective measures. We aim to model both the temporal and spatial dependencies often exhibited by violent crimes in order to make such predictions. The temporal variation of crimes typically follows patterns familiar in time series analysis, but the spat…

    Submitted 20 April, 2013; originally announced April 2013.

  37. arXiv:1210.4850  [pdf]

    cs.LG cs.IR stat.ML

    Markov Determinantal Point Processes

    Authors: Raja Hafiz Affandi, Alex Kulesza, Emily B. Fox

    Abstract: A determinantal point process (DPP) is a random process useful for modeling the combinatorial problem of subset selection. In particular, DPPs encourage a random subset Y to contain a diverse set of items selected from a base set Y. For example, we might use a DPP to display a set of news headlines that are relevant to a user's interests while covering a variety of topics. Suppose, however, that w…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-26-35

  38. arXiv:1209.0833  [pdf, ps, other]

    stat.ME stat.ML

    Multiresolution Gaussian Processes

    Authors: Emily B. Fox, David B. Dunson

    Abstract: We propose a multiresolution Gaussian process to capture long-range, non-Markovian dependencies while allowing for abrupt changes. The multiresolution GP hierarchically couples a collection of smooth GPs, each defined over an element of a random nested partition. Long-range dependencies are captured by the top-level GP while the partition points define the abrupt changes. Due to the inherent conju…

    Submitted 4 September, 2012; originally announced September 2012.

  39. arXiv:1204.2523  [pdf, other]

    stat.ML cs.CL cs.IR cs.LG

    Concept Modeling with Superwords

    Authors: Khalid El-Arini, Emily B. Fox, Carlos Guestrin

    Abstract: In information retrieval, a fundamental goal is to transform a document into concepts that are representative of its content. The term "representative" is in itself challenging to define, and various tasks require different granularities of concepts. In this paper, we aim to model concepts that are sparse over the vocabulary, and that flexibly adapt their content based on other relevant semantic i…

    Submitted 11 April, 2012; originally announced April 2012.

  40. arXiv:1111.4226  [pdf, other]

    stat.ME stat.ML

    Joint Modeling of Multiple Related Time Series via the Beta Process

    Authors: Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

    Abstract: We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our approach is based on the discovery of a set of latent, shared dynamical behaviors. Using a beta process prior, the size of the set and the sharing pattern are both inferred from data. We develop efficient Markov chain Monte Carlo methods based on the Indian buffet process representatio…

    Submitted 17 November, 2011; originally announced November 2011.

    Comments: 33 pages, 8 figures

  41. arXiv:1107.5239  [pdf, other]

    stat.ME

    Autoregressive Models for Variance Matrices: Stationary Inverse Wishart Processes

    Authors: Emily B. Fox, Mike West

    Abstract: We introduce and explore a new class of stationary time series models for variance matrices based on a constructive definition exploiting inverse Wishart distribution theory. The main class of models explored is a novel class of stationary, first-order autoregressive (AR) processes on the cone of positive semi-definite matrices. Aspects of the theory and structure of these new models for multivari…

    Submitted 26 July, 2011; originally announced July 2011.

  42. Bayesian Nonparametric Inference of Switching Linear Dynamical Systems

    Authors: Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

    Abstract: Many complex dynamical phenomena can be effectively modeled by a system that switches among a set of conditionally linear dynamical modes. We consider two such models: the switching linear dynamical system (SLDS) and the switching vector autoregressive (VAR) process. Our Bayesian nonparametric approach utilizes a hierarchical Dirichlet process prior to learn an unknown number of persistent, smoot…

    Submitted 19 March, 2010; originally announced March 2010.

    Comments: 50 pages, 7 figures

  43. arXiv:0905.2592  [pdf, ps, other]

    stat.ME stat.AP stat.ML

    A sticky HDP-HMM with application to speaker diarization

    Authors: Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

    Abstract: We consider the problem of speaker diarization, the problem of segmenting an audio recording of a meeting into temporal segments corresponding to individual speakers. The problem is rendered particularly difficult by the fact that we are not allowed to assume knowledge of the number of people participating in the meeting. To address this problem, we take a Bayesian nonparametric approach to speake…

    Submitted 16 August, 2011; v1 submitted 15 May, 2009; originally announced May 2009.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) at http://dx.doi.org/10.1214/10-AOAS395 by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS395

    Journal ref: Annals of Applied Statistics 2011, Vol. 5, No. 2A, 1020-1056