Skip to main content

Showing 1–50 of 66 results for author: Smyth, P

.
  1. arXiv:2406.16308  [pdf, other

    cs.LG cs.AI cs.CL

    Anomaly Detection of Tabular Data Using LLMs

    Authors: Aodong Li, Yunhan Zhao, Chen Qiu, Marius Kloft, Padhraic Smyth, Maja Rudolph, Stephan Mandt

    Abstract: Large language models (LLMs) have shown their potential in long-context understanding and mathematical reasoning. In this paper, we study the problem of using LLMs to detect tabular anomalies and show that pre-trained LLMs are zero-shot batch-level anomaly detectors. That is, without extra distribution-specific model fitting, they can discover hidden outliers in a batch of data, demonstrating thei… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: accepted at the Anomaly Detection with Foundation Models workshop

  2. arXiv:2405.06729  [pdf, other

    q-bio.GN cs.LG

    Fine-tuning Protein Language Models with Deep Mutational Scanning improves Variant Effect Prediction

    Authors: Aleix Lafita, Ferran Gonzalez, Mahmoud Hossam, Paul Smyth, Jacob Deasy, Ari Allyn-Feuer, Daniel Seaton, Stephen Young

    Abstract: Protein Language Models (PLMs) have emerged as performant and scalable tools for predicting the functional impact and clinical significance of protein-coding variants, but they still lag experimental accuracy. Here, we present a novel fine-tuning approach to improve the performance of PLMs with experimental maps of variant effects from Deep Mutational Scanning (DMS) assays using a Normalised Log-o… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Machine Learning for Genomics Explorations workshop at ICLR 2024

  3. arXiv:2404.04240  [pdf, other

    cs.LG

    Dynamic Conditional Optimal Transport through Simulation-Free Flows

    Authors: Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth

    Abstract: We study the geometry of conditional optimal transport (COT) and prove a dynamical formulation which generalizes the Benamou-Brenier Theorem. Equipped with these tools, we propose a simulation-free flow-based method for conditional generative modeling. Our method couples an arbitrary source distribution to a specified target distribution through a triangular COT plan, and a conditional generative… ▽ More

    Submitted 31 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2401.13835  [pdf, other

    cs.LG cs.AI cs.CL cs.HC

    The Calibration Gap between Model and Human Confidence in Large Language Models

    Authors: Mark Steyvers, Heliodoro Tejeda, Aakriti Kumar, Catarina Belem, Sheer Karny, Xinyue Hu, Lukas Mayer, Padhraic Smyth

    Abstract: For large language models (LLMs) to be trusted by humans they need to be well-calibrated in the sense that they can accurately assess and communicate how likely it is that their predictions are correct. Recent work has focused on the quality of internal LLM confidence assessments, but the question remains of how well LLMs can communicate this internal model confidence to human users. This paper ex… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 27 pages, 10 figures

  5. arXiv:2312.15045  [pdf, other

    cs.LG stat.ML

    Probabilistic Modeling for Sequences of Sets in Continuous-Time

    Authors: Yuxin Chang, Alex Boyd, Padhraic Smyth

    Abstract: Neural marked temporal point processes have been a valuable addition to the existing toolbox of statistical parametric models for continuous-time event data. These models are useful for sequences where each event is associated with a single item (a single type of event or a "mark") -- but such models are not suited for the practical situation where each event is associated with a set of items. In… ▽ More

    Submitted 18 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Oral presentation at AISTATS 2024

  6. arXiv:2312.07679  [pdf, other

    cs.LG stat.ML

    Bayesian Online Learning for Consensus Prediction

    Authors: Sam Showalter, Alex Boyd, Padhraic Smyth, Mark Steyvers

    Abstract: Given a pre-trained classifier and multiple human experts, we investigate the task of online classification where model predictions are provided for free but querying humans incurs a cost. In this practical but under-explored setting, oracle ground truth is not available. Instead, the prediction target is defined as the consensus vote of all experts. Given that querying full consensus can be costl… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  7. arXiv:2305.17209  [pdf, other

    cs.LG stat.ML

    Functional Flow Matching

    Authors: Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth

    Abstract: We propose Functional Flow Matching (FFM), a function-space generative model that generalizes the recently-introduced Flow Matching model to operate in infinite-dimensional spaces. Our approach works by first defining a path of probability measures that interpolates between a fixed Gaussian measure and the data distribution, followed by learning a vector field on the underlying space of functions… ▽ More

    Submitted 5 December, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  8. arXiv:2305.09064  [pdf, other

    cs.LG cs.AI cs.HC

    Capturing Humans' Mental Models of AI: An Item Response Theory Approach

    Authors: Markelle Kelly, Aakriti Kumar, Padhraic Smyth, Mark Steyvers

    Abstract: Improving our understanding of how humans perceive AI teammates is an important foundation for our general understanding of human-AI teams. Extending relevant work from cognitive science, we propose a framework based on item response theory for modeling these perceptions. We apply this framework to real-world experiments, in which each participant works alongside another person or an AI agent in a… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: FAccT 2023

  9. arXiv:2302.07849  [pdf, other

    cs.LG cs.AI stat.ML

    Zero-Shot Anomaly Detection via Batch Normalization

    Authors: Aodong Li, Chen Qiu, Marius Kloft, Padhraic Smyth, Maja Rudolph, Stephan Mandt

    Abstract: Anomaly detection (AD) plays a crucial role in many safety-critical application domains. The challenge of adapting an anomaly detector to drift in the normal data distribution, especially when no training data is available for the "new normal," has led to the development of zero-shot AD techniques. In this paper, we propose a simple yet effective method called Adaptive Centered Representations (AC… ▽ More

    Submitted 7 November, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: accepted at NeurIPS 2023

  10. arXiv:2302.07832  [pdf, other

    cs.LG cs.AI

    Deep Anomaly Detection under Labeling Budget Constraints

    Authors: Aodong Li, Chen Qiu, Marius Kloft, Padhraic Smyth, Stephan Mandt, Maja Rudolph

    Abstract: Selecting informative data points for expert feedback can significantly improve the performance of anomaly detection (AD) in various contexts, such as medical diagnostics or fraud detection. In this paper, we determine a set of theoretical conditions under which anomaly scores generalize from labeled queries to unlabeled data. Motivated by these results, we propose a data labeling strategy with op… ▽ More

    Submitted 4 July, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  11. arXiv:2212.00886  [pdf, other

    cs.LG stat.ML

    Diffusion Generative Models in Infinite Dimensions

    Authors: Gavin Kerrigan, Justin Ley, Padhraic Smyth

    Abstract: Diffusion generative models have recently been applied to domains where the available data can be seen as a discretization of an underlying function, such as audio signals or time series. However, these models operate directly on the discretized data, and there are no semantics in the modeling process that relate the observed data to the underlying functional forms. We generalize diffusion models… ▽ More

    Submitted 24 February, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  12. arXiv:2211.08499  [pdf, other

    stat.ML cs.LG

    Probabilistic Querying of Continuous-Time Event Sequences

    Authors: Alex Boyd, Yuxin Chang, Stephan Mandt, Padhraic Smyth

    Abstract: Continuous-time event sequences, i.e., sequences consisting of continuous time stamps and associated event types ("marks"), are an important type of sequential data with many applications, e.g., in clinical medicine or user behavior modeling. Since these data are typically modeled autoregressively (e.g., using neural Hawkes processes or their classical counterparts), it is natural to ask questions… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  13. arXiv:2210.06464  [pdf, other

    cs.LG cs.AI

    Predictive Querying for Autoregressive Neural Sequence Models

    Authors: Alex Boyd, Sam Showalter, Stephan Mandt, Padhraic Smyth

    Abstract: In reasoning about sequential events it is natural to pose probabilistic queries such as "when will event A occur next" or "what is the probability of A occurring before B", with applications in areas such as user modeling, medicine, and finance. However, with machine learning shifting towards neural autoregressive models such as RNNs and transformers, probabilistic querying has been largely restr… ▽ More

    Submitted 4 November, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Oral Presentation at the Intl. Conference on Neural Information Processing Systems (NeurIPS 2022)

  14. arXiv:2209.15154  [pdf, other

    cs.LG

    Variable-Based Calibration for Machine Learning Classifiers

    Authors: Markelle Kelly, Padhraic Smyth

    Abstract: The deployment of machine learning classifiers in high-stakes domains requires well-calibrated confidence scores for model predictions. In this paper we introduce the notion of variable-based calibration to characterize calibration properties of a model with respect to a variable of interest, generalizing traditional score-based metrics such as expected calibration error (ECE). In particular, we f… ▽ More

    Submitted 5 April, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  15. arXiv:2206.09076  [pdf, other

    stat.ML cs.LG stat.ME

    Fair Generalized Linear Models with a Convex Penalty

    Authors: Hyungrok Do, Preston Putzel, Axel Martin, Padhraic Smyth, Judy Zhong

    Abstract: Despite recent advances in algorithmic fairness, methodologies for achieving fairness with generalized linear models (GLMs) have yet to be explored in general, despite GLMs being widely used in practice. In this paper we introduce two fairness criteria for GLMs based on equalizing expected outcomes or log-likelihoods. We prove that for GLMs both criteria can be achieved via a convex penalty term b… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in ICML 2022

  16. arXiv:2109.14591  [pdf, other

    cs.LG stat.ML

    Combining Human Predictions with Model Probabilities via Confusion Matrices and Calibration

    Authors: Gavin Kerrigan, Padhraic Smyth, Mark Steyvers

    Abstract: An increasingly common use case for machine learning models is augmenting the abilities of human decision makers. For classification tasks where neither the human or model are perfectly accurate, a key step in obtaining high performance is combining their individual predictions in a manner that leverages their relative strengths. In this work, we develop a set of algorithms that combine the probab… ▽ More

    Submitted 1 October, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: NeurIPS 2021

  17. arXiv:2105.05699  [pdf, other

    cs.DB cs.LG

    Automating Data Science: Prospects and Challenges

    Authors: Tijl De Bie, Luc De Raedt, José Hernández-Orallo, Holger H. Hoos, Padhraic Smyth, Christopher K. I. Williams

    Abstract: Given the complexity of typical data science projects and the associated demand for human expertise, automation has the potential to transform the data science process. Key insights: * Automation in data science aims to facilitate and transform the work of data scientists, not to replace them. * Important parts of data science are already being automated, especially in the modeling stages, w… ▽ More

    Submitted 28 February, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: 19 pages, 3 figures. v1 accepted for publication (April 2021) in Communications of the ACM

    Journal ref: Communications of the ACM 65(3) 76-87 (2022)

  18. arXiv:2105.04648  [pdf, other

    stat.AP stat.ME

    Joint Fairness Model with Applications to Risk Predictions for Under-represented Populations

    Authors: Hyungrok Do, Shin**i Nandi, Preston Putzel, Padhraic Smyth, Judy Zhong

    Abstract: In data collection for predictive modeling, under-representation of certain groups, based on gender, race/ethnicity, or age, may yield less-accurate predictions for these groups. Recently, this issue of fairness in predictions has attracted significant attention, as data-driven models are increasingly utilized to perform crucial decision-making tasks. Existing methods to achieve fairness in the ma… ▽ More

    Submitted 23 February, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 34 pages, 4 figures, 1 table

  19. arXiv:2103.05337  [pdf, other

    cs.LG cs.CV q-bio.QM

    A Mask R-CNN approach to counting bacterial colony forming units in pharmaceutical development

    Authors: Tanguy Naets, Maarten Huijsmans, Paul Smyth, Laurent Sorber, Gaël de Lannoy

    Abstract: We present an application of the well-known Mask R-CNN approach to the counting of different types of bacterial colony forming units that were cultured in Petri dishes. Our model was made available to lab technicians in a modern SPA (Single-Page Application). Users can upload images of dishes, after which the Mask R-CNN model that was trained and tuned specifically for this task detects the number… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 9 pages, 3 pdf figures. Extended version of poster presented at ESANN 2020 (European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning)

  20. arXiv:2012.08101  [pdf, other

    stat.ML cs.LG

    Detecting and Adapting to Irregular Distribution Shifts in Bayesian Online Learning

    Authors: Aodong Li, Alex Boyd, Padhraic Smyth, Stephan Mandt

    Abstract: We consider the problem of online learning in the presence of distribution shifts that occur at an unknown rate and of unknown intensity. We derive a new Bayesian online inference approach to simultaneously infer these distribution shifts and adapt the model to the detected changes by integrating ideas from change point detection, switching dynamical systems, and Bayesian online learning. Using a… ▽ More

    Submitted 26 October, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: Published version, Neural Information Processing Systems 2021

  21. arXiv:2011.03231  [pdf, other

    stat.ML cs.LG

    User-Dependent Neural Sequence Models for Continuous-Time Event Data

    Authors: Alex Boyd, Robert Bamler, Stephan Mandt, Padhraic Smyth

    Abstract: Continuous-time event data are common in applications such as individual behavior data, financial transactions, and medical health records. Modeling such data can be very challenging, in particular for applications with many different types of events, since it requires a model to predict the event types as well as the time of occurrence. Recurrent neural networks that parameterize time-varying int… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted at NeurIPS 2020

  22. arXiv:2010.09851  [pdf, other

    stat.ML cs.AI cs.LG

    Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference

    Authors: Disi Ji, Padhraic Smyth, Mark Steyvers

    Abstract: We investigate the problem of reliably assessing group fairness when labeled examples are few but unlabeled examples are plentiful. We propose a general Bayesian framework that can augment labeled data with unlabeled data to produce more accurate and lower-variance estimates compared to methods based on labeled data alone. Our approach estimates calibrated scores for unlabeled examples in each gro… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: 27 pages

  23. arXiv:2009.00926  [pdf, other

    cs.CV cs.LG q-bio.QM

    Deep Learning to Detect Bacterial Colonies for the Production of Vaccines

    Authors: Thomas Beznik, Paul Smyth, Gaël de Lannoy, John A. Lee

    Abstract: During the development of vaccines, bacterial colony forming units (CFUs) are counted in order to quantify the yield in the fermentation process. This manual task is time-consuming and error-prone. In this work we test multiple segmentation algorithms based on the U-Net CNN architecture and show that these offer robust, automated CFU counting. We show that the multiclass generalisation with a besp… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: 6 pages, 2 figures, accepted at ESANN 2020 (European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning)

  24. arXiv:2007.00239  [pdf

    physics.ao-ph physics.geo-ph

    Zonally opposing shifts of the intertropical convergence zone in response to climate change

    Authors: Antonios Mamalakis, James T. Randerson, **-Yi Yu, Michael S. Pritchard, Gudrun Magnusdottir, Padhraic Smyth, Paul A. Levine, Sungduk Yu, Efi Foufoula-Georgiou

    Abstract: Future changes in the location of the intertropical convergence zone (ITCZ) due to climate change are of high interest since they could substantially alter precipitation patterns in the tropics and subtropics. Although models predict a future narrowing of the ITCZ during the 21st century in response to climate warming, uncertainties remain large regarding its future position, with most past work f… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Journal ref: Nature Climate Change 2021

  25. arXiv:2002.06532  [pdf, other

    stat.ML cs.LG

    Active Bayesian Assessment for Black-Box Classifiers

    Authors: Disi Ji, Robert L. Logan IV, Padhraic Smyth, Mark Steyvers

    Abstract: Recent advances in machine learning have led to increased deployment of black-box classifiers across a wide variety of applications. In many such situations there is a critical need to both reliably assess the performance of these pre-trained models and to perform this assessment in a label-efficient manner (given that labels may be scarce and costly to collect). In this paper, we introduce an act… ▽ More

    Submitted 15 March, 2021; v1 submitted 16 February, 2020; originally announced February 2020.

  26. arXiv:1810.04045  [pdf, other

    stat.ML cs.LG

    Dropout as a Structured Shrinkage Prior

    Authors: Eric Nalisnick, José Miguel Hernández-Lobato, Padhraic Smyth

    Abstract: Dropout regularization of deep neural networks has been a mysterious yet effective tool to prevent overfitting. Explanations for its success range from the prevention of "co-adapted" weights to it being a form of cheap Bayesian inference. We propose a novel framework for understanding multiplicative noise in neural networks, considering continuous distributions as well as Bernoulli noise (i.e. dro… ▽ More

    Submitted 29 May, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: ICML 2019

  27. arXiv:1711.07673  [pdf, other

    stat.ML q-bio.QM

    Mondrian Processes for Flow Cytometry Analysis

    Authors: Disi Ji, Eric Nalisnick, Padhraic Smyth

    Abstract: Analysis of flow cytometry data is an essential tool for clinical diagnosis of hematological and immunological conditions. Current clinical workflows rely on a manual process called gating to classify cells into their canonical types. This dependence on human annotation limits the rate, reproducibility, and complexity of flow cytometry analysis. In this paper, we propose using Mondrian processes t… ▽ More

    Submitted 28 November, 2017; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: 7 pages, 4 figures, NIPS workshop ML4H: Machine Learning for Health 2017, Long Beach, CA, USA

  28. arXiv:1704.01168  [pdf, other

    stat.ML stat.CO

    Learning Approximately Objective Priors

    Authors: Eric Nalisnick, Padhraic Smyth

    Abstract: Informative Bayesian priors are often difficult to elicit, and when this is the case, modelers usually turn to noninformative or objective priors. However, objective priors such as the Jeffreys and reference priors are not tractable to derive for many models of interest. We address this issue by proposing techniques for learning reference prior approximations: we select a parametric family and opt… ▽ More

    Submitted 4 August, 2017; v1 submitted 4 April, 2017; originally announced April 2017.

    Comments: UAI 2017

  29. arXiv:1701.02856  [pdf, other

    stat.AP stat.ML

    Bayesian Non-Homogeneous Markov Models via Polya-Gamma Data Augmentation with Applications to Rainfall Modeling

    Authors: Tracy Holsclaw, Arthur M. Greene, Andrew W. Robertson, Padhraic Smyth

    Abstract: Discrete-time hidden Markov models are a broadly useful class of latent-variable models with applications in areas such as speech recognition, bioinformatics, and climate data analysis. It is common in practice to introduce temporal non-homogeneity into such models by making the transition probabilities dependent on time-varying exogenous input variables via a multinomial logistic parametrization.… ▽ More

    Submitted 12 January, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

    Comments: 40 pages, 26 figures

  30. arXiv:1605.06197  [pdf, other

    stat.ML

    Stick-Breaking Variational Autoencoders

    Authors: Eric Nalisnick, Padhraic Smyth

    Abstract: We extend Stochastic Gradient Variational Bayes to perform posterior inference for the weights of Stick-Breaking processes. This development allows us to define a Stick-Breaking Variational Autoencoder (SB-VAE), a Bayesian nonparametric version of the variational autoencoder that has a latent representation with stochastic dimensionality. We experimentally demonstrate that the SB-VAE, and a semi-s… ▽ More

    Submitted 3 April, 2017; v1 submitted 19 May, 2016; originally announced May 2016.

    Comments: ICLR 2017, Conference Track

  31. arXiv:1506.03208  [pdf, other

    stat.ML

    A Scale Mixture Perspective of Multiplicative Noise in Neural Networks

    Authors: Eric Nalisnick, Anima Anandkumar, Padhraic Smyth

    Abstract: Corrupting the input and hidden layers of deep neural networks (DNNs) with multiplicative noise, often drawn from the Bernoulli distribution (or 'dropout'), provides regularization that has significantly contributed to deep learning's success. However, understanding how multiplicative corruptions prevent overfitting has been difficult due to the complexity of a DNN's functional form. In this paper… ▽ More

    Submitted 10 June, 2015; originally announced June 2015.

  32. arXiv:1504.00860  [pdf, ps, other

    stat.ME

    Bayesian Detection of Changepoints in Finite-State Markov Chains for Multiple Sequences

    Authors: Petter Arnesen, Tracy Holsclaw, Padhraic Smyth

    Abstract: We consider the analysis of sets of categorical sequences consisting of piecewise homogeneous Markov segments. The sequences are assumed to be governed by a common underlying process with segments occurring in the same order for each sequence. Segments are defined by a set of unobserved changepoints where the positions and number of changepoints can vary from sequence to sequence. We propose a Bay… ▽ More

    Submitted 7 April, 2015; v1 submitted 3 April, 2015; originally announced April 2015.

  33. arXiv:1412.6599  [pdf, other

    cs.LG

    Hot Swap** for Online Adaptation of Optimization Hyperparameters

    Authors: Kevin Bache, Dennis DeCoste, Padhraic Smyth

    Abstract: We describe a general framework for online adaptation of optimization hyperparameters by `hot swap**' their values during learning. We investigate this approach in the context of adaptive learning rate selection using an explore-exploit strategy from the multi-armed bandit literature. Experiments on a benchmark neural network show that the hot swap** approach leads to consistently better solut… ▽ More

    Submitted 13 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: Submission to ICLR 2015

    MSC Class: 62L20 ACM Class: G.1.6; I.2.6

  34. arXiv:1309.7971   

    cs.AI

    Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (2013)

    Authors: Ann Nicholson, Padhriac Smyth

    Abstract: This is the Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, which was held in Bellevue, WA, August 11-15, 2013

    Submitted 27 August, 2014; v1 submitted 30 September, 2013; originally announced September 2013.

    Report number: UAI2013

  35. Scalar masses in general N=2 gauged supergravity theories

    Authors: Francesca Catino, Claudio A. Scrucca, Paul Smyth

    Abstract: We readdress the question of whether any universal upper bound exists on the square mass m^2 of the lightest scalar around a supersymmetry breaking vacuum in generic N=2 gauged supergravity theories for a given gravitino mass m_3/2 and cosmological constant V. We review the known bounds which apply to theories with restricted matter content from a new perspective. We then extend these results to t… ▽ More

    Submitted 18 January, 2014; v1 submitted 6 September, 2013; originally announced September 2013.

    Comments: 19 pages, 1 figure; v2 minor corrections and additions

    Journal ref: JHEP 1401 (2014) 029

  36. arXiv:1305.2452  [pdf, ps, other

    cs.LG

    Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation

    Authors: James Foulds, Levi Boyles, Christopher Dubois, Padhraic Smyth, Max Welling

    Abstract: In the internet era there has been an explosion in the amount of digital text information available, leading to difficulties of scale for traditional inference algorithms for topic models. Recent advances in stochastic variational inference algorithms for latent Dirichlet allocation (LDA) have made it feasible to learn topic models on large-scale corpora, but these methods do not currently take fu… ▽ More

    Submitted 10 May, 2013; originally announced May 2013.

  37. arXiv:1305.1903  [pdf, ps, other

    hep-th

    The rigid limit of N=2 supergravity

    Authors: Bobby E. Gunara, Jan Louis, Paul Smyth, Luca Tripodi, Roberto Valandro

    Abstract: In this paper we review the rigid limit of N=2 supergravity coupled to vector and hypermultiplets. In particular we show how the respective scalar field spaces reduce to their global counterparts. In the hypermultiplet sector we focus on the relation between the local and rigid c-map.

    Submitted 8 May, 2013; originally announced May 2013.

    Comments: 12 pages

  38. Simple metastable de Sitter vacua in N=2 gauged supergravity

    Authors: Francesca Catino, Claudio A. Scrucca, Paul Smyth

    Abstract: We construct a simple class of N=2 gauged supergravity theories that admit metastable de Sitter vacua, generalizing the recent work done in the context of rigid supersymmetry. The setup involves one hypermultiplet and one vector multiplet spanning suitably curved quaternionic-Kahler and special-Kahler geometries, with an Abelian gauging based on a single triholomorphic isometry, but neither Fayet-… ▽ More

    Submitted 26 April, 2013; v1 submitted 7 February, 2013; originally announced February 2013.

    Comments: 26 pages, 2 figures; v2 minor corrections, some additional comments and one reference added

    Journal ref: JHEP 1304 (2013) 056

  39. arXiv:1301.3884  [pdf

    cs.AI cs.DB

    Probabilistic Models for Query Approximation with Large Sparse Binary Datasets

    Authors: Dmitry Y. Pavlov, Heikki Mannila, Padhraic Smyth

    Abstract: Large sparse sets of binary transaction data with millions of records and thousands of attributes occur in various domains: customers purchasing products, users visiting web pages, and documents containing words are just three typical examples. Real-time query selectivity estimation (the problem of estimating the number of rows in the data satisfying a given predicate) is an important practical pr… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-465-472

  40. Electrically gauged N=4 supergravities in D=4 with N=2 vacua

    Authors: Christoph Horst, Jan Louis, Paul Smyth

    Abstract: We study N=2 vacua in spontaneously broken N=4 electrically gauged supergravities in four space-time dimensions. We argue that the classification of all such solutions amounts to solving a system of purely algebraic equations. We then explicitly construct a special class of consistent N=2 solutions and study their properties. In particular we find that the spectrum assembles in N=2 massless or BPS… ▽ More

    Submitted 15 January, 2013; v1 submitted 19 December, 2012; originally announced December 2012.

    Comments: 48 pages; v2: one reference added

    Report number: ZMP-HH/12-27

  41. arXiv:1212.2467  [pdf

    stat.AP

    Probabilistic models for joint clustering and time-war** of multidimensional curves

    Authors: Darya Chudova, Scott Gaffney, Padhraic Smyth

    Abstract: In this paper we present a family of algorithms that can simultaneously align and cluster sets of multidimensional curves measured on a discrete time grid. Our approach is based on a generative mixture model that allows non-linear time war** of the observed curves relative to the mean curves within the clusters. We also allow for arbitrary discrete-valued translation of the time… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-134-141

  42. Metastable spontaneous breaking of N=2 supersymmetry

    Authors: Benoit Legeret, Claudio A. Scrucca, Paul Smyth

    Abstract: We show that contrary to the common lore it is possible to spontaneously break N=2 supersymmetry even in simple theories without constant Fayet-Iliopoulos terms. We consider the most general N=2 supersymmetric theory with one hypermultiplet and one vector multiplet without Fayet-Iliopoulos terms, and show that metastable supersymmetry breaking vacua can arise if both the hyper-Kahler and the speci… ▽ More

    Submitted 26 April, 2013; v1 submitted 30 November, 2012; originally announced November 2012.

    Comments: 16 pages, no figures; v2 improved introduction and conclusions; v3 minor corrections

  43. arXiv:1209.5791  [pdf, other

    cs.DS

    Windows into Relational Events: Data Structures for Contiguous Subsequences of Edges

    Authors: Michael J. Bannister, Christopher DuBois, David Eppstein, Padhraic Smyth

    Abstract: We consider the problem of analyzing social network data sets in which the edges of the network have timestamps, and we wish to analyze the subgraphs formed from edges in contiguous subintervals of these timestamps. We provide data structures for these problems that use near-linear preprocessing time, linear space, and sublogarithmic query time to handle queries that ask for the number of connecte… ▽ More

    Submitted 25 September, 2012; originally announced September 2012.

  44. Metastable de Sitter vacua in N=2 to N=1 truncated supergravity

    Authors: Francesca Catino, Claudio A. Scrucca, Paul Smyth

    Abstract: We study the possibility of achieving metastable de Sitter vacua in general N=2 to N=1 truncated supergravities without vector multiplets, and compare with the situations arising in N=2 theories with only hypermultiplets and N=1 theories with only chiral multiplets. In N=2 theories based on a quaternionic manifold and a graviphoton gauging, de Sitter vacua are necessarily unstable, as a result of… ▽ More

    Submitted 5 September, 2012; originally announced September 2012.

    Comments: 40 pages, no figures

  45. arXiv:1207.7306  [pdf, ps, other

    stat.ME

    Hierarchical Models for Relational Event Sequences

    Authors: Christopher DuBois, Carter T. Butts, Daniel McFarland, Padhraic Smyth

    Abstract: Interaction within small groups can often be represented as a sequence of events, where each event involves a sender and a recipient. Recent methods for modeling network data in continuous time model the rate at which individuals interact conditioned on the previous history of events as well as actor covariates. We present a hierarchical extension for modeling multiple such sequences, facilitating… ▽ More

    Submitted 31 July, 2012; originally announced July 2012.

  46. arXiv:1207.4169  [pdf

    cs.IR cs.LG stat.ML

    The Author-Topic Model for Authors and Documents

    Authors: Michal Rosen-Zvi, Thomas Griffiths, Mark Steyvers, Padhraic Smyth

    Abstract: We introduce the author-topic model, a generative model for documents that extends Latent Dirichlet Allocation (LDA; Blei, Ng, & Jordan, 2003) to include authorship information. Each author is associated with a multinomial distribution over topics and each topic is associated with a multinomial distribution over words. A document with multiple authors is modeled as a distribution over topics that… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-487-494

  47. arXiv:1207.4143  [pdf

    stat.AP cs.CE

    Modeling Waveform Shapes with Random Eects Segmental Hidden Markov Models

    Authors: Seyoung Kim, Padhraic Smyth, Stefan Luther

    Abstract: In this paper we describe a general probabilistic framework for modeling waveforms such as heartbeats from ECG data. The model is based on segmental hidden Markov models (as used in speech recognition) with the addition of random effects to the generative model. The random effects component of the model handles shape variability across different waveforms within a general class of waveforms of sim… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-309-316

  48. arXiv:1207.4142  [pdf

    cs.LG stat.ML

    Conditional Chow-Liu Tree Structures for Modeling Discrete-Valued Vector Time Series

    Authors: Sergey Kirshner, Padhraic Smyth, Andrew Robertson

    Abstract: We consider the problem of modeling discrete-valued vector time series data using extensions of Chow-Liu tree models to capture both dependencies across time and dependencies across variables. Conditional Chow-Liu tree models are introduced, as an extension to standard Chow-Liu trees, for modeling conditional rather than joint densities. We describe learning algorithms for such models and show how… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-317-324

  49. arXiv:1206.6845  [pdf

    stat.ME cs.LG stat.ML

    Gibbs Sampling for (Coupled) Infinite Mixture Models in the Stick Breaking Representation

    Authors: Ian Porteous, Alexander T. Ihler, Padhraic Smyth, Max Welling

    Abstract: Nonparametric Bayesian approaches to clustering, information retrieval, language modeling and object recognition have recently shown great promise as a new paradigm for unsupervised data analysis. Most contributions have focused on the Dirichlet process mixture models or extensions thereof for which efficient Gibbs samplers exist. In this paper we explore Gibbs samplers for infinite complexity mix… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-385-392

  50. arXiv:1205.2662  [pdf

    cs.LG stat.ML

    On Smoothing and Inference for Topic Models

    Authors: Arthur Asuncion, Max Welling, Padhraic Smyth, Yee Whye Teh

    Abstract: Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling high-dimensional sparse count data. Various learning algorithms have been developed in recent years, including collapsed Gibbs sampling, variational inference, and maximum a posteriori estimation, and this variety motivates the need for careful empirical comparisons. In this paper, we highlight the c… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-27-34