
Showing 1–28 of 28 results for author: Smyth, P

Searching in archive stat.
  1. arXiv:2312.15045  [pdf, other]

    cs.LG stat.ML

    Probabilistic Modeling for Sequences of Sets in Continuous-Time

    Authors: Yuxin Chang, Alex Boyd, Padhraic Smyth

    Abstract: Neural marked temporal point processes have been a valuable addition to the existing toolbox of statistical parametric models for continuous-time event data. These models are useful for sequences where each event is associated with a single item (a single type of event or a "mark") -- but such models are not suited for the practical situation where each event is associated with a set of items. In…

    Submitted 18 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Oral presentation at AISTATS 2024

  2. arXiv:2312.07679  [pdf, other]

    cs.LG stat.ML

    Bayesian Online Learning for Consensus Prediction

    Authors: Sam Showalter, Alex Boyd, Padhraic Smyth, Mark Steyvers

    Abstract: Given a pre-trained classifier and multiple human experts, we investigate the task of online classification where model predictions are provided for free but querying humans incurs a cost. In this practical but under-explored setting, oracle ground truth is not available. Instead, the prediction target is defined as the consensus vote of all experts. Given that querying full consensus can be costl…

    Submitted 12 December, 2023; originally announced December 2023.

  3. arXiv:2305.17209  [pdf, other]

    cs.LG stat.ML

    Functional Flow Matching

    Authors: Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth

    Abstract: We propose Functional Flow Matching (FFM), a function-space generative model that generalizes the recently-introduced Flow Matching model to operate in infinite-dimensional spaces. Our approach works by first defining a path of probability measures that interpolates between a fixed Gaussian measure and the data distribution, followed by learning a vector field on the underlying space of functions…

    Submitted 5 December, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  4. arXiv:2302.07849  [pdf, other]

    cs.LG cs.AI stat.ML

    Zero-Shot Anomaly Detection via Batch Normalization

    Authors: Aodong Li, Chen Qiu, Marius Kloft, Padhraic Smyth, Maja Rudolph, Stephan Mandt

    Abstract: Anomaly detection (AD) plays a crucial role in many safety-critical application domains. The challenge of adapting an anomaly detector to drift in the normal data distribution, especially when no training data is available for the "new normal," has led to the development of zero-shot AD techniques. In this paper, we propose a simple yet effective method called Adaptive Centered Representations (AC…

    Submitted 7 November, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: accepted at NeurIPS 2023

  5. arXiv:2212.00886  [pdf, other]

    cs.LG stat.ML

    Diffusion Generative Models in Infinite Dimensions

    Authors: Gavin Kerrigan, Justin Ley, Padhraic Smyth

    Abstract: Diffusion generative models have recently been applied to domains where the available data can be seen as a discretization of an underlying function, such as audio signals or time series. However, these models operate directly on the discretized data, and there are no semantics in the modeling process that relate the observed data to the underlying functional forms. We generalize diffusion models…

    Submitted 24 February, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  6. arXiv:2211.08499  [pdf, other]

    stat.ML cs.LG

    Probabilistic Querying of Continuous-Time Event Sequences

    Authors: Alex Boyd, Yuxin Chang, Stephan Mandt, Padhraic Smyth

    Abstract: Continuous-time event sequences, i.e., sequences consisting of continuous time stamps and associated event types ("marks"), are an important type of sequential data with many applications, e.g., in clinical medicine or user behavior modeling. Since these data are typically modeled autoregressively (e.g., using neural Hawkes processes or their classical counterparts), it is natural to ask questions…

    Submitted 15 November, 2022; originally announced November 2022.

  7. arXiv:2206.09076  [pdf, other]

    stat.ML cs.LG stat.ME

    Fair Generalized Linear Models with a Convex Penalty

    Authors: Hyungrok Do, Preston Putzel, Axel Martin, Padhraic Smyth, Judy Zhong

    Abstract: Despite recent advances in algorithmic fairness, methodologies for achieving fairness with generalized linear models (GLMs) have yet to be explored in general, despite GLMs being widely used in practice. In this paper we introduce two fairness criteria for GLMs based on equalizing expected outcomes or log-likelihoods. We prove that for GLMs both criteria can be achieved via a convex penalty term b…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in ICML 2022

  8. arXiv:2109.14591  [pdf, other]

    cs.LG stat.ML

    Combining Human Predictions with Model Probabilities via Confusion Matrices and Calibration

    Authors: Gavin Kerrigan, Padhraic Smyth, Mark Steyvers

    Abstract: An increasingly common use case for machine learning models is augmenting the abilities of human decision makers. For classification tasks where neither the human nor the model is perfectly accurate, a key step in obtaining high performance is combining their individual predictions in a manner that leverages their relative strengths. In this work, we develop a set of algorithms that combine the probab…

    Submitted 1 October, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: NeurIPS 2021

  9. arXiv:2105.04648  [pdf, other]

    stat.AP stat.ME

    Joint Fairness Model with Applications to Risk Predictions for Under-represented Populations

    Authors: Hyungrok Do, Shinjini Nandi, Preston Putzel, Padhraic Smyth, Judy Zhong

    Abstract: In data collection for predictive modeling, under-representation of certain groups, based on gender, race/ethnicity, or age, may yield less-accurate predictions for these groups. Recently, this issue of fairness in predictions has attracted significant attention, as data-driven models are increasingly utilized to perform crucial decision-making tasks. Existing methods to achieve fairness in the ma…

    Submitted 23 February, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 34 pages, 4 figures, 1 table

  10. arXiv:2012.08101  [pdf, other]

    stat.ML cs.LG

    Detecting and Adapting to Irregular Distribution Shifts in Bayesian Online Learning

    Authors: Aodong Li, Alex Boyd, Padhraic Smyth, Stephan Mandt

    Abstract: We consider the problem of online learning in the presence of distribution shifts that occur at an unknown rate and of unknown intensity. We derive a new Bayesian online inference approach to simultaneously infer these distribution shifts and adapt the model to the detected changes by integrating ideas from change point detection, switching dynamical systems, and Bayesian online learning. Using a…

    Submitted 26 October, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: Published version, Neural Information Processing Systems 2021

  11. arXiv:2011.03231  [pdf, other]

    stat.ML cs.LG

    User-Dependent Neural Sequence Models for Continuous-Time Event Data

    Authors: Alex Boyd, Robert Bamler, Stephan Mandt, Padhraic Smyth

    Abstract: Continuous-time event data are common in applications such as individual behavior data, financial transactions, and medical health records. Modeling such data can be very challenging, in particular for applications with many different types of events, since it requires a model to predict the event types as well as the time of occurrence. Recurrent neural networks that parameterize time-varying int…

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted at NeurIPS 2020

  12. arXiv:2010.09851  [pdf, other]

    stat.ML cs.AI cs.LG

    Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference

    Authors: Disi Ji, Padhraic Smyth, Mark Steyvers

    Abstract: We investigate the problem of reliably assessing group fairness when labeled examples are few but unlabeled examples are plentiful. We propose a general Bayesian framework that can augment labeled data with unlabeled data to produce more accurate and lower-variance estimates compared to methods based on labeled data alone. Our approach estimates calibrated scores for unlabeled examples in each gro…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: 27 pages

  13. arXiv:2002.06532  [pdf, other]

    stat.ML cs.LG

    Active Bayesian Assessment for Black-Box Classifiers

    Authors: Disi Ji, Robert L. Logan IV, Padhraic Smyth, Mark Steyvers

    Abstract: Recent advances in machine learning have led to increased deployment of black-box classifiers across a wide variety of applications. In many such situations there is a critical need to both reliably assess the performance of these pre-trained models and to perform this assessment in a label-efficient manner (given that labels may be scarce and costly to collect). In this paper, we introduce an act…

    Submitted 15 March, 2021; v1 submitted 16 February, 2020; originally announced February 2020.

  14. arXiv:1810.04045  [pdf, other]

    stat.ML cs.LG

    Dropout as a Structured Shrinkage Prior

    Authors: Eric Nalisnick, José Miguel Hernández-Lobato, Padhraic Smyth

    Abstract: Dropout regularization of deep neural networks has been a mysterious yet effective tool to prevent overfitting. Explanations for its success range from the prevention of "co-adapted" weights to it being a form of cheap Bayesian inference. We propose a novel framework for understanding multiplicative noise in neural networks, considering continuous distributions as well as Bernoulli noise (i.e. dro…

    Submitted 29 May, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: ICML 2019

  15. arXiv:1711.07673  [pdf, other]

    stat.ML q-bio.QM

    Mondrian Processes for Flow Cytometry Analysis

    Authors: Disi Ji, Eric Nalisnick, Padhraic Smyth

    Abstract: Analysis of flow cytometry data is an essential tool for clinical diagnosis of hematological and immunological conditions. Current clinical workflows rely on a manual process called gating to classify cells into their canonical types. This dependence on human annotation limits the rate, reproducibility, and complexity of flow cytometry analysis. In this paper, we propose using Mondrian processes t…

    Submitted 28 November, 2017; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: 7 pages, 4 figures, NIPS workshop ML4H: Machine Learning for Health 2017, Long Beach, CA, USA

  16. arXiv:1704.01168  [pdf, other]

    stat.ML stat.CO

    Learning Approximately Objective Priors

    Authors: Eric Nalisnick, Padhraic Smyth

    Abstract: Informative Bayesian priors are often difficult to elicit, and when this is the case, modelers usually turn to noninformative or objective priors. However, objective priors such as the Jeffreys and reference priors are not tractable to derive for many models of interest. We address this issue by proposing techniques for learning reference prior approximations: we select a parametric family and opt…

    Submitted 4 August, 2017; v1 submitted 4 April, 2017; originally announced April 2017.

    Comments: UAI 2017

  17. arXiv:1701.02856  [pdf, other]

    stat.AP stat.ML

    Bayesian Non-Homogeneous Markov Models via Polya-Gamma Data Augmentation with Applications to Rainfall Modeling

    Authors: Tracy Holsclaw, Arthur M. Greene, Andrew W. Robertson, Padhraic Smyth

    Abstract: Discrete-time hidden Markov models are a broadly useful class of latent-variable models with applications in areas such as speech recognition, bioinformatics, and climate data analysis. It is common in practice to introduce temporal non-homogeneity into such models by making the transition probabilities dependent on time-varying exogenous input variables via a multinomial logistic parametrization.…

    Submitted 12 January, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

    Comments: 40 pages, 26 figures

  18. arXiv:1605.06197  [pdf, other]

    stat.ML

    Stick-Breaking Variational Autoencoders

    Authors: Eric Nalisnick, Padhraic Smyth

    Abstract: We extend Stochastic Gradient Variational Bayes to perform posterior inference for the weights of Stick-Breaking processes. This development allows us to define a Stick-Breaking Variational Autoencoder (SB-VAE), a Bayesian nonparametric version of the variational autoencoder that has a latent representation with stochastic dimensionality. We experimentally demonstrate that the SB-VAE, and a semi-s…

    Submitted 3 April, 2017; v1 submitted 19 May, 2016; originally announced May 2016.

    Comments: ICLR 2017, Conference Track

  19. arXiv:1506.03208  [pdf, other]

    stat.ML

    A Scale Mixture Perspective of Multiplicative Noise in Neural Networks

    Authors: Eric Nalisnick, Anima Anandkumar, Padhraic Smyth

    Abstract: Corrupting the input and hidden layers of deep neural networks (DNNs) with multiplicative noise, often drawn from the Bernoulli distribution (or 'dropout'), provides regularization that has significantly contributed to deep learning's success. However, understanding how multiplicative corruptions prevent overfitting has been difficult due to the complexity of a DNN's functional form. In this paper…

    Submitted 10 June, 2015; originally announced June 2015.

  20. arXiv:1504.00860  [pdf, ps, other]

    stat.ME

    Bayesian Detection of Changepoints in Finite-State Markov Chains for Multiple Sequences

    Authors: Petter Arnesen, Tracy Holsclaw, Padhraic Smyth

    Abstract: We consider the analysis of sets of categorical sequences consisting of piecewise homogeneous Markov segments. The sequences are assumed to be governed by a common underlying process with segments occurring in the same order for each sequence. Segments are defined by a set of unobserved changepoints where the positions and number of changepoints can vary from sequence to sequence. We propose a Bay…

    Submitted 7 April, 2015; v1 submitted 3 April, 2015; originally announced April 2015.

  21. arXiv:1212.2467  [pdf]

    stat.AP

    Probabilistic models for joint clustering and time-warping of multidimensional curves

    Authors: Darya Chudova, Scott Gaffney, Padhraic Smyth

    Abstract: In this paper we present a family of algorithms that can simultaneously align and cluster sets of multidimensional curves measured on a discrete time grid. Our approach is based on a generative mixture model that allows non-linear time warping of the observed curves relative to the mean curves within the clusters. We also allow for arbitrary discrete-valued translation of the time…

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-134-141

  22. arXiv:1207.7306  [pdf, ps, other]

    stat.ME

    Hierarchical Models for Relational Event Sequences

    Authors: Christopher DuBois, Carter T. Butts, Daniel McFarland, Padhraic Smyth

    Abstract: Interaction within small groups can often be represented as a sequence of events, where each event involves a sender and a recipient. Recent methods for modeling network data in continuous time model the rate at which individuals interact conditioned on the previous history of events as well as actor covariates. We present a hierarchical extension for modeling multiple such sequences, facilitating…

    Submitted 31 July, 2012; originally announced July 2012.

  23. arXiv:1207.4169  [pdf]

    cs.IR cs.LG stat.ML

    The Author-Topic Model for Authors and Documents

    Authors: Michal Rosen-Zvi, Thomas Griffiths, Mark Steyvers, Padhraic Smyth

    Abstract: We introduce the author-topic model, a generative model for documents that extends Latent Dirichlet Allocation (LDA; Blei, Ng, & Jordan, 2003) to include authorship information. Each author is associated with a multinomial distribution over topics and each topic is associated with a multinomial distribution over words. A document with multiple authors is modeled as a distribution over topics that…

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-487-494

  24. arXiv:1207.4143  [pdf]

    stat.AP cs.CE

    Modeling Waveform Shapes with Random Effects Segmental Hidden Markov Models

    Authors: Seyoung Kim, Padhraic Smyth, Stefan Luther

    Abstract: In this paper we describe a general probabilistic framework for modeling waveforms such as heartbeats from ECG data. The model is based on segmental hidden Markov models (as used in speech recognition) with the addition of random effects to the generative model. The random effects component of the model handles shape variability across different waveforms within a general class of waveforms of sim…

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-309-316

  25. arXiv:1207.4142  [pdf]

    cs.LG stat.ML

    Conditional Chow-Liu Tree Structures for Modeling Discrete-Valued Vector Time Series

    Authors: Sergey Kirshner, Padhraic Smyth, Andrew Robertson

    Abstract: We consider the problem of modeling discrete-valued vector time series data using extensions of Chow-Liu tree models to capture both dependencies across time and dependencies across variables. Conditional Chow-Liu tree models are introduced, as an extension to standard Chow-Liu trees, for modeling conditional rather than joint densities. We describe learning algorithms for such models and show how…

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-317-324

  26. arXiv:1206.6845  [pdf]

    stat.ME cs.LG stat.ML

    Gibbs Sampling for (Coupled) Infinite Mixture Models in the Stick Breaking Representation

    Authors: Ian Porteous, Alexander T. Ihler, Padhraic Smyth, Max Welling

    Abstract: Nonparametric Bayesian approaches to clustering, information retrieval, language modeling and object recognition have recently shown great promise as a new paradigm for unsupervised data analysis. Most contributions have focused on the Dirichlet process mixture models or extensions thereof for which efficient Gibbs samplers exist. In this paper we explore Gibbs samplers for infinite complexity mix…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-385-392

  27. arXiv:1205.2662  [pdf]

    cs.LG stat.ML

    On Smoothing and Inference for Topic Models

    Authors: Arthur Asuncion, Max Welling, Padhraic Smyth, Yee Whye Teh

    Abstract: Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling high-dimensional sparse count data. Various learning algorithms have been developed in recent years, including collapsed Gibbs sampling, variational inference, and maximum a posteriori estimation, and this variety motivates the need for careful empirical comparisons. In this paper, we highlight the c…

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-27-34

  28. arXiv:1107.2462  [pdf, other]

    stat.ML cs.LG

    Statistical Topic Models for Multi-Label Document Classification

    Authors: Timothy N. Rubin, America Chambers, Padhraic Smyth, Mark Steyvers

    Abstract: Machine learning approaches to multi-label document classification have to date largely relied on discriminative modeling techniques such as support vector machines. A drawback of these approaches is that performance rapidly drops off as the total number of labels and the number of labels per document increase. This problem is amplified when the label frequencies exhibit the type of highly skewed…

    Submitted 9 November, 2011; v1 submitted 13 July, 2011; originally announced July 2011.

    Comments: 44 Pages (Including Appendices). To be published in: The Machine Learning Journal, special issue on Learning from Multi-Label Data. Version 2 corrects some typos, updates some of the notation used in the paper for clarification of some equations, and incorporates several relatively minor changes to the text throughout the paper