Skip to main content

Showing 1–25 of 25 results for author: Fox, E B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.17879  [pdf, other

    cs.LG cs.CL

    Automated Statistical Model Discovery with Language Models

    Authors: Michael Y. Li, Emily B. Fox, Noah D. Goodman

    Abstract: Statistical model discovery is a challenging search over a vast space of models subject to domain-specific constraints. Efficiently searching over this space requires expertise in modeling and the problem domain. Motivated by the domain knowledge and programming capabilities of large language models (LMs), we introduce a method for language model driven automated statistical model discovery. We ca… ▽ More

    Submitted 22 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  2. arXiv:2402.17233  [pdf, other

    cs.LG stat.AP stat.ME

    Hybrid$^2$ Neural ODE Causal Modeling and an Application to Glycemic Response

    Authors: Bob Junyi Zou, Matthew E. Levine, Dessi P. Zaharieva, Ramesh Johari, Emily B. Fox

    Abstract: Hybrid models composing mechanistic ODE-based dynamics with flexible and expressive neural network components have grown rapidly in popularity, especially in scientific domains where such ODE-based modeling offers important interpretability and validated causal grounding (e.g., for counterfactual reasoning). The incorporation of mechanistic models also provides inductive bias in standard blackbox… ▽ More

    Submitted 11 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2312.03344  [pdf, other

    cs.LG math.DS stat.AP stat.ML

    Interpretable Mechanistic Representations for Meal-level Glycemic Control in the Wild

    Authors: Ke Alexander Wang, Emily B. Fox

    Abstract: Diabetes encompasses a complex landscape of glycemic control that varies widely among individuals. However, current methods do not faithfully capture this variability at the meal level. On the one hand, expert-crafted features lack the flexibility of data-driven methods; on the other hand, learned representations tend to be uninterpretable which hampers clinical adoption. In this paper, we propose… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Proceedings of Machine Learning for Health (ML4H) 2023. Code available at: https://github.com/KeAWang/interpretable-cgm-representations

  4. arXiv:2305.01638  [pdf, other

    cs.LG cs.CV stat.ML

    Sequence Modeling with Multiresolution Convolutional Memory

    Authors: Jiaxin Shi, Ke Alexander Wang, Emily B. Fox

    Abstract: Efficiently capturing the long-range patterns in sequential data sources salient to a given task -- such as classification and generative modeling -- poses a fundamental challenge. Popular approaches in the space tradeoff between the memory burden of brute-force enumeration and comparison, as in transformers, the computational burden of complicated sequential dependencies, as in recurrent neural n… ▽ More

    Submitted 1 November, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ICML 2023, Source code: https://github.com/thjashin/multires-conv

  5. arXiv:2304.14300  [pdf, other

    cs.LG math.DS q-bio.QM

    Learning Absorption Rates in Glucose-Insulin Dynamics from Meal Covariates

    Authors: Ke Alexander Wang, Matthew E. Levine, Jiaxin Shi, Emily B. Fox

    Abstract: Traditional models of glucose-insulin dynamics rely on heuristic parameterizations chosen to fit observations within a laboratory setting. However, these models cannot describe glucose dynamics in daily life. One source of failure is in their descriptions of glucose absorption rates after meal events. A meal's macronutritional content has nuanced effects on the absorption profile, which is difficu… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: Work presented at NeurIPS 2022 Workshop on Learning from Time Series for Health (TS4H). arXiv admin note: substantial text overlap with arXiv:2302.11939

  6. arXiv:2105.02675  [pdf, other

    stat.ME cs.LG stat.ML

    Granger Causality: A Review and Recent Advances

    Authors: Ali Shojaie, Emily B. Fox

    Abstract: Introduced more than a half century ago, Granger causality has become a popular tool for analyzing time series data in many application domains, from economics and finance to genomics and neuroscience. Despite this popularity, the validity of this notion for inferring causal relationships among time series has remained the topic of continuous debate. Moreover, while the original definition was gen… ▽ More

    Submitted 6 May, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: 40 pages, 12 figures

  7. arXiv:2104.12231  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Model-based metrics: Sample-efficient estimates of predictive model subpopulation performance

    Authors: Andrew C. Miller, Leon A. Gatys, Joseph Futoma, Emily B. Fox

    Abstract: Machine learning models $-$ now commonly developed to screen, diagnose, or predict health conditions $-$ are evaluated with a variety of performance metrics. An important first step in assessing the practical utility of a model is to evaluate its average performance over an entire population of interest. In many settings, it is also critical that the model makes good predictions within predefined… ▽ More

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: 27 pages, 8 figures

  8. arXiv:2104.12219  [pdf, other

    stat.ML cs.LG stat.ME

    Breiman's two cultures: You don't have to choose sides

    Authors: Andrew C. Miller, Nicholas J. Foti, Emily B. Fox

    Abstract: Breiman's classic paper casts data analysis as a choice between two cultures: data modelers and algorithmic modelers. Stated broadly, data modelers use simple, interpretable models with well-understood theoretical properties to analyze data. Algorithmic modelers prioritize predictive accuracy and use more flexible function approximations to analyze data. This dichotomy overlooks a third set of mod… ▽ More

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: Commentary to appear in a special issue of Observational Studies, discussing Leo Breiman's paper "Statistical Modeling: The Two Cultures" (https://doi.org/10.1214/ss/1009213726)

  9. arXiv:2012.00110  [pdf, other

    stat.ML cs.LG stat.AP

    Representing and Denoising Wearable ECG Recordings

    Authors: Jeffrey Chan, Andrew C. Miller, Emily B. Fox

    Abstract: Modern wearable devices are embedded with a range of noninvasive biomarker sensors that hold promise for improving detection and treatment of disease. One such sensor is the single-lead electrocardiogram (ECG) which measures electrical signals in the heart. The benefits of the sheer volume of ECG measurements with rich longitudinal structure made possible by wearables come at the price of potentia… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: ML for Mobile Health Workshop, NeurIPS 2020

  10. arXiv:1911.05683  [pdf, other

    cs.LG cs.HC stat.ML

    Modeling patterns of smartphone usage and their relationship to cognitive health

    Authors: Jonas Rauber, Emily B. Fox, Leon A. Gatys

    Abstract: The ubiquity of smartphone usage in many people's lives make it a rich source of information about a person's mental and cognitive state. In this work we analyze 12 weeks of phone usage data from 113 older adults, 31 with diagnosed cognitive impairment and 82 without. We develop structured models of users' smartphone interactions to reveal differences in phone usage patterns between people with an… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  11. arXiv:1905.07473  [pdf, other

    cs.LG math.OC stat.ML

    Adaptively Truncating Backpropagation Through Time to Control Gradient Bias

    Authors: Christopher Aicher, Nicholas J. Foti, Emily B. Fox

    Abstract: Truncated backpropagation through time (TBPTT) is a popular method for learning in recurrent neural networks (RNNs) that saves computation and memory at the cost of bias by truncating backpropagation after a fixed number of lags. In practice, choosing the optimal truncation length is difficult: TBPTT will not converge if the truncation length is too small, or will converge slowly if it is too larg… ▽ More

    Submitted 1 July, 2019; v1 submitted 17 May, 2019; originally announced May 2019.

  12. arXiv:1901.10568  [pdf, other

    stat.ML cs.LG stat.CO

    Stochastic Gradient MCMC for Nonlinear State Space Models

    Authors: Christopher Aicher, Srshti Putcha, Christopher Nemeth, Paul Fearnhead, Emily B. Fox

    Abstract: State space models (SSMs) provide a flexible framework for modeling complex time series via a latent stochastic process. Inference for nonlinear, non-Gaussian SSMs is often tackled with particle methods that do not scale well to long time series. The challenge is two-fold: not only do computations scale linearly with time, as in the linear case, but particle filters additionally suffer from increa… ▽ More

    Submitted 16 July, 2023; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: To appear in Bayesian Analysis

  13. arXiv:1810.09098  [pdf, other

    stat.ML cs.LG stat.CO

    Stochastic Gradient MCMC for State Space Models

    Authors: Christopher Aicher, Yi-An Ma, Nicholas J. Foti, Emily B. Fox

    Abstract: State space models (SSMs) are a flexible approach to modeling complex time series. However, inference in SSMs is often computationally prohibitive for long time series. Stochastic gradient MCMC (SGMCMC) is a popular method for scalable Bayesian inference for large independent data. Unfortunately when applied to dependent data, such as in SSMs, SGMCMC's stochastic gradient estimates are biased as t… ▽ More

    Submitted 9 July, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

  14. arXiv:1807.07621  [pdf, other

    stat.ML cs.LG stat.CO

    Approximate Collapsed Gibbs Clustering with Expectation Propagation

    Authors: Christopher Aicher, Emily B. Fox

    Abstract: We develop a framework for approximating collapsed Gibbs sampling in generative latent variable cluster models. Collapsed Gibbs is a popular MCMC method, which integrates out variables in the posterior to improve mixing. Unfortunately for many complex models, integrating out these variables is either analytically or computationally intractable. We efficiently approximate the necessary collapsed Gi… ▽ More

    Submitted 19 July, 2018; originally announced July 2018.

  15. arXiv:1806.09060  [pdf, other

    cs.LG stat.ML

    Disentangled VAE Representations for Multi-Aspect and Missing Data

    Authors: Samuel K. Ainsworth, Nicholas J. Foti, Emily B. Fox

    Abstract: Many problems in machine learning and related application areas are fundamentally variants of conditional modeling and sampling across multi-aspect data, either multi-view, multi-modal, or simply multi-group. For example, sampling from the distribution of English sentences conditioned on a given French sentence or sampling audio waveforms conditioned on a given piece of text. Central to many of th… ▽ More

    Submitted 23 June, 2018; originally announced June 2018.

  16. arXiv:1806.07137  [pdf, other

    stat.CO cs.LG stat.ML

    Large-Scale Stochastic Sampling from the Probability Simplex

    Authors: Jack Baker, Paul Fearnhead, Emily B Fox, Christopher Nemeth

    Abstract: Stochastic gradient Markov chain Monte Carlo (SGMCMC) has become a popular method for scalable Bayesian inference. These methods are based on sampling a discrete-time approximation to a continuous time process, such as the Langevin diffusion. When applied to distributions defined on a constrained space the time-discretization error can dominate when we are near the boundary of the space. We demons… ▽ More

    Submitted 26 October, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted to Advances in Neural Information Processing Systems (2018)

  17. arXiv:1706.05439  [pdf, other

    stat.CO cs.LG stat.ML

    Control Variates for Stochastic Gradient MCMC

    Authors: Jack Baker, Paul Fearnhead, Emily B. Fox, Christopher Nemeth

    Abstract: It is well known that Markov chain Monte Carlo (MCMC) methods scale poorly with dataset size. A popular class of methods for solving this issue is stochastic gradient MCMC. These methods use a noisy estimate of the gradient of the log posterior, which reduces the per iteration computational cost of the algorithm. Despite this, there are a number of results suggesting that stochastic gradient Lange… ▽ More

    Submitted 14 December, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

  18. arXiv:1402.4862  [pdf, other

    stat.ML cs.LG

    Learning the Parameters of Determinantal Point Process Kernels

    Authors: Raja Hafiz Affandi, Emily B. Fox, Ryan P. Adams, Ben Taskar

    Abstract: Determinantal point processes (DPPs) are well-suited for modeling repulsion and have proven useful in many applications where diversity is desired. While DPPs have many appealing properties, such as efficient sampling, learning the parameters of a DPP is still considered a difficult problem due to the non-convex nature of the likelihood function. In this paper, we propose using Bayesian methods to… ▽ More

    Submitted 19 February, 2014; originally announced February 2014.

  19. arXiv:1402.4102  [pdf, other

    stat.ME cs.LG stat.ML

    Stochastic Gradient Hamiltonian Monte Carlo

    Authors: Tianqi Chen, Emily B. Fox, Carlos Guestrin

    Abstract: Hamiltonian Monte Carlo (HMC) sampling methods provide a mechanism for defining distant proposals with high acceptance probabilities in a Metropolis-Hastings framework, enabling more efficient exploration of the state space than standard random-walk proposals. The popularity of such methods has grown significantly in recent years. However, a limitation of HMC methods is the required gradient compu… ▽ More

    Submitted 12 May, 2014; v1 submitted 17 February, 2014; originally announced February 2014.

    Comments: ICML 2014 version

  20. arXiv:1401.1137  [pdf, other

    stat.ME cs.SI math.ST stat.ML

    Sparse graphs using exchangeable random measures

    Authors: François Caron, Emily B. Fox

    Abstract: Statistical network modeling has focused on representing the graph as a discrete structure, namely the adjacency matrix, and considering the exchangeability of this array. In such cases, the Aldous-Hoover representation theorem (Aldous, 1981;Hoover, 1979} applies and informs us that the graph is necessarily either dense or empty. In this paper, we instead consider representing the graph as a mea… ▽ More

    Submitted 27 March, 2015; v1 submitted 6 January, 2014; originally announced January 2014.

    Comments: New title. Extended version

  21. arXiv:1311.2971  [pdf, other

    stat.ML cs.LG stat.ME

    Approximate Inference in Continuous Determinantal Point Processes

    Authors: Raja Hafiz Affandi, Emily B. Fox, Ben Taskar

    Abstract: Determinantal point processes (DPPs) are random point processes well-suited for modeling repulsion. In machine learning, the focus of DPP-based models has been on diverse subset selection from a discrete and finite base set. This discrete setting admits an efficient sampling algorithm based on the eigendecomposition of the defining kernel matrix. Recently, there has been growing interest in using… ▽ More

    Submitted 12 November, 2013; originally announced November 2013.

  22. arXiv:1309.3533  [pdf, ps, other

    stat.ME cs.LG stat.ML

    Mixed Membership Models for Time Series

    Authors: Emily B. Fox, Michael I. Jordan

    Abstract: In this article we discuss some of the consequences of the mixed membership perspective on time series analysis. In its most abstract form, a mixed membership model aims to associate an individual entity with some set of attributes based on a collection of observed data. Although much of the literature on mixed membership models considers the setting in which exchangeable collections of data are a… ▽ More

    Submitted 13 September, 2013; originally announced September 2013.

  23. arXiv:1304.6777  [pdf, ps, other

    cs.SI physics.soc-ph stat.AP

    A Bayesian approach for predicting the popularity of tweets

    Authors: Tauhid Zaman, Emily B. Fox, Eric T. Bradlow

    Abstract: We predict the popularity of short messages called tweets created in the micro-blogging site known as Twitter. We measure the popularity of a tweet by the time-series path of its retweets, which is when people forward the tweet to others. We develop a probabilistic model for the evolution of the retweets using a Bayesian approach, and form predictions using only observations on the retweet times a… ▽ More

    Submitted 24 November, 2014; v1 submitted 24 April, 2013; originally announced April 2013.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOAS741 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS741

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 3, 1583-1611

  24. arXiv:1210.4850  [pdf

    cs.LG cs.IR stat.ML

    Markov Determinantal Point Processes

    Authors: Raja Hafiz Affandi, Alex Kulesza, Emily B. Fox

    Abstract: A determinantal point process (DPP) is a random process useful for modeling the combinatorial problem of subset selection. In particular, DPPs encourage a random subset Y to contain a diverse set of items selected from a base set Y. For example, we might use a DPP to display a set of news headlines that are relevant to a user's interests while covering a variety of topics. Suppose, however, that w… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-26-35

  25. arXiv:1204.2523  [pdf, other

    stat.ML cs.CL cs.IR cs.LG

    Concept Modeling with Superwords

    Authors: Khalid El-Arini, Emily B. Fox, Carlos Guestrin

    Abstract: In information retrieval, a fundamental goal is to transform a document into concepts that are representative of its content. The term "representative" is in itself challenging to define, and various tasks require different granularities of concepts. In this paper, we aim to model concepts that are sparse over the vocabulary, and that flexibly adapt their content based on other relevant semantic i… ▽ More

    Submitted 11 April, 2012; originally announced April 2012.