Skip to main content

Showing 1–23 of 23 results for author: Gray, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2305.15022  [pdf, other

    stat.ML cs.LG

    Hierarchical clustering with dot products recovers hidden tree structure

    Authors: Annie Gray, Alexander Modell, Patrick Rubin-Delanchy, Nick Whiteley

    Abstract: In this paper we offer a new perspective on the well established agglomerative clustering algorithm, focusing on recovery of hierarchical structure. We recommend a simple variant of the standard algorithm, in which clusters are merged by maximum average dot product and not, for example, by minimum distance or within-cluster variance. We demonstrate that the tree output by this algorithm provides a… ▽ More

    Submitted 1 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  2. arXiv:2301.05507  [pdf, ps, other

    stat.OT math.LO math.PR stat.AP stat.CO

    Correlation-Based And-Operations Can Be Copulas: A Proof

    Authors: Enrique Miralles-Dolz, Ander Gray, Edoardo Patelli, Scott Ferson, Vladik Kreinovich, Olga Kosheleva

    Abstract: In many practical situations, we know the probabilities $a$ and $b$ of two events $A$ and $B$, and we want to estimate the joint probability ${\rm Prob}(A\,\&\,B)$. The algorithm that estimates the joint probability based on the known values $a$ and $b$ is called an and-operation. An important case when such a reconstruction is possible is when we know the correlation between $A$ and $B$; we call… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  3. arXiv:2208.11665  [pdf, other

    stat.ME cs.LG stat.ML

    Statistical exploration of the Manifold Hypothesis

    Authors: Nick Whiteley, Annie Gray, Patrick Rubin-Delanchy

    Abstract: The Manifold Hypothesis is a widely accepted tenet of Machine Learning which asserts that nominally high-dimensional data are in fact concentrated near a low-dimensional manifold, embedded in high-dimensional space. This phenomenon is observed empirically in many real world situations, has led to development of a wide range of statistical methods in the last few decades, and has been suggested as… ▽ More

    Submitted 9 February, 2024; v1 submitted 24 August, 2022; originally announced August 2022.

    MSC Class: 62R20; 62R40; 62G05; 62G20; 62R07; 62-08; 62H25; 62H30

  4. arXiv:2207.10583  [pdf, other

    math.PR math.LO stat.AP stat.CO

    Correlated Boolean Operators for Uncertainty Logic

    Authors: Enrique Miralles-Dolz, Ander Gray, Edoardo Patelli, Scott Ferson

    Abstract: We present a correlated \textit{and} gate which may be used to propagate uncertainty and dependence through Boolean functions, since any Boolean function may be expressed as a combination of \textit{and} and \textit{not} operations. We argue that the \textit{and} gate is a bivariate copula family, which has the interpretation of constructing bivariate Bernoulli random variables following a given P… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Journal ref: Information Processing and Management of Uncertainty in Knowledge-Based Systems. IPMU 2022. Communications in Computer and Information Science, vol 1601, pages 798--811

  5. arXiv:2106.01260  [pdf, other

    stat.ML cs.LG

    Matrix factorisation and the interpretation of geodesic distance

    Authors: Nick Whiteley, Annie Gray, Patrick Rubin-Delanchy

    Abstract: Given a graph or similarity matrix, we consider the problem of recovering a notion of true distance between the nodes, and so their true positions. We show that this can be accomplished in two steps: matrix factorisation, followed by nonlinear dimension reduction. This combination is effective because the point cloud obtained in the first step lives close to a manifold in which latent distance is… ▽ More

    Submitted 22 September, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    MSC Class: 62G05; 62H20; 62H12; 62H30

  6. arXiv:2105.09707  [pdf, other

    stat.AP physics.ao-ph

    Spatio-temporal Local Interpolation of Global Ocean Heat Transport using Argo Floats: A Debiased Latent Gaussian Process Approach

    Authors: Beomjo Park, Mikael Kuusela, Donata Giglio, Alison Gray

    Abstract: The world ocean plays a key role in redistributing heat in the climate system and hence in regulating Earth's climate. Yet statistical analysis of ocean heat transport suffers from partially incomplete large-scale data intertwined with complex spatio-temporal dynamics, as well as from potential model misspecification. We present a comprehensive spatio-temporal statistical framework tailored to int… ▽ More

    Submitted 18 July, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: 30 pages, 10 figures with supplementary material 9 pages, 10 figures

  7. arXiv:2006.09635  [pdf, other

    cs.LG math.OC stat.ML

    Solving Constrained CASH Problems with ADMM

    Authors: Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

    Abstract: The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available. However, CASH solvers do not directly handle black-box constraints such as fairness, robustness or other domain-specific custom constraints. We present our recent approach [Liu, et al., 2020] that leverages the ADMM optimization fram… ▽ More

    Submitted 10 July, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 7th ICML Workshop on Automated Machine Learning (2020)

  8. arXiv:1912.06723  [pdf, other

    cs.LG cs.HC stat.ML

    AutoAIViz: Opening the Blackbox of Automated Artificial Intelligence with Conditional Parallel Coordinates

    Authors: Daniel Karl I. Weidele, Justin D. Weisz, Eno Oduor, Michael Muller, Josh Andres, Alexander Gray, Dakuo Wang

    Abstract: Artificial Intelligence (AI) can now automate the algorithm selection, feature engineering, and hyperparameter tuning steps in a machine learning workflow. Commonly known as AutoML or AutoAI, these technologies aim to relieve data scientists from the tedious manual work. However, today's AutoAI systems often present only limited to no information about the process of how they select and generate m… ▽ More

    Submitted 17 January, 2020; v1 submitted 13 December, 2019; originally announced December 2019.

    Comments: 5 pages, 1 figure, IUI2020

  9. arXiv:1905.00424  [pdf, other

    cs.LG stat.ML

    An ADMM Based Framework for AutoML Pipeline Configuration

    Authors: Sijia Liu, Parikshit Ram, Deepak Vijaykeerthy, Djallel Bouneffouf, Gregory Bramble, Horst Samulowitz, Dakuo Wang, Andrew Conn, Alexander Gray

    Abstract: We study the AutoML problem of automatically configuring machine learning pipelines by jointly selecting algorithms and their appropriate hyper-parameters for all steps in supervised learning pipelines. This black-box (gradient-free) optimization with mixed integer & continuous variables is a challenging problem. We propose a novel AutoML scheme by leveraging the alternating direction method of mu… ▽ More

    Submitted 6 December, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Journal ref: published at AAAI 2020

  10. arXiv:1507.07260  [pdf, other

    stat.ML cs.LG

    Reduced-Set Kernel Principal Components Analysis for Improving the Training and Execution Speed of Kernel Machines

    Authors: Hassan A. Kingravi, Patricio A. Vela, Alexandar Gray

    Abstract: This paper presents a practical, and theoretically well-founded, approach to improve the speed of kernel manifold learning algorithms relying on spectral decomposition. Utilizing recent insights in kernel smoothing and learning with integral operators, we propose Reduced Set KPCA (RSKPCA), which also suggests an easy-to-implement method to remove or replace samples with minimal effect on the empir… ▽ More

    Submitted 26 July, 2015; originally announced July 2015.

  11. arXiv:1403.4890  [pdf, other

    stat.ME stat.CO

    Modeling an Augmented Lagrangian for Blackbox Constrained Optimization

    Authors: Robert B. Gramacy, Genetha A. Gray, Sebastien Le Digabel, Herbert K. H. Lee, Pritam Ranjan, Garth Wells, Stefan M. Wild

    Abstract: Constrained blackbox optimization is a difficult problem, with most approaches coming from the mathematical programming literature. The statistical literature is sparse, especially in addressing problems with nontrivial constraints. This situation is unfortunate because statistical methods have many attractive properties: global scope, handling noisy objectives, sensitivity analysis, and so forth.… ▽ More

    Submitted 3 March, 2015; v1 submitted 19 March, 2014; originally announced March 2014.

    Comments: 22 Pages, 2 additional supplementary, 5 figures

  12. arXiv:1309.6830  [pdf

    cs.LG stat.ML

    Building Bridges: Viewing Active Learning from the Multi-Armed Bandit Lens

    Authors: Ravi Ganti, Alexander G. Gray

    Abstract: In this paper we propose a multi-armed bandit inspired, pool based active learning algorithm for the problem of binary classification. By carefully constructing an analogy between active learning and multi-armed bandits, we utilize ideas such as lower confidence bounds, and self-concordant regularization from the multi-armed bandit literature to design our proposed algorithm. Our algorithm is a se… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-232-241

  13. arXiv:1309.3699  [pdf, other

    stat.ML cs.AI cs.LG

    Local Support Vector Machines:Formulation and Analysis

    Authors: Ravi Ganti, Alexander Gray

    Abstract: We provide a formulation for Local Support Vector Machines (LSVMs) that generalizes previous formulations, and brings out the explicit connections to local polynomial learning used in nonparametric estimation literature. We investigate the simplest type of LSVMs called Local Linear Support Vector Machines (LLSVMs). For the first time we establish conditions under which LLSVMs make Bayes consistent… ▽ More

    Submitted 14 September, 2013; originally announced September 2013.

    Comments: 12 pages, 1 figure

  14. arXiv:1211.0632  [pdf, ps, other

    cs.LG math.OC stat.ML

    Stochastic ADMM for Nonsmooth Optimization

    Authors: Hua Ouyang, Niao He, Alexander Gray

    Abstract: We present a stochastic setting for optimization problems with nonsmooth convex separable objective functions over linear equality constraints. To solve such problems, we propose a stochastic Alternating Direction Method of Multipliers (ADMM) algorithm. Our algorithm applies to a more general class of nonsmooth convex functions that does not necessarily have a closed-form solution by minimizing th… ▽ More

    Submitted 22 January, 2013; v1 submitted 3 November, 2012; originally announced November 2012.

    Comments: A short version of this paper appears in the 5th NIPS Workshop on Optimization for Machine Learning, Lake Tahoe, Nevada, USA, 2012

  15. arXiv:1209.2784  [pdf, other

    cs.LG stat.ML

    Minimax Multi-Task Learning and a Generalized Loss-Compositional Paradigm for MTL

    Authors: Nishant A. Mehta, Dongryeol Lee, Alexander G. Gray

    Abstract: Since its inception, the modus operandi of multi-task learning (MTL) has been to minimize the task-wise mean of the empirical risks. We introduce a generalized loss-compositional paradigm for MTL that includes a spectrum of formulations as a subfamily. One endpoint of this spectrum is minimax MTL: a new MTL formulation that minimizes the maximum of the tasks' empirical risks. Via a certain relaxat… ▽ More

    Submitted 13 September, 2012; originally announced September 2012.

    Comments: appearing at NIPS 2012

  16. arXiv:1206.6857  [pdf

    cs.LG math.NA stat.ML

    Faster Gaussian Summation: Theory and Experiment

    Authors: Dongryeol Lee, Alexander G. Gray

    Abstract: We provide faster algorithms for the problem of Gaussian summation, which occurs in many machine learning methods. We develop two new extensions - an O(Dp) Taylor expansion for the Gaussian kernel with rigorous error bounds and a new error control scheme integrating any arbitrary approximation method - within the best discretealgorithmic framework using adaptive hierarchical data structures. We ri… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-281-288

  17. arXiv:1206.5278  [pdf

    stat.ME cs.LG stat.ML

    Fast Nonparametric Conditional Density Estimation

    Authors: Michael P. Holmes, Alexander G. Gray, Charles Lee Isbell

    Abstract: Conditional density estimation generalizes regression by modeling a full density f(yjx) rather than only the expected value E(yjx). This is important for many tasks, including handling multi-modality and generating prediction intervals. Though fundamental and widely applicable, nonparametric conditional density estimators have received relatively little attention from statisticians and little or n… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-175-182

  18. arXiv:1205.4481  [pdf, ps, other

    cs.LG stat.CO stat.ML

    Stochastic Smoothing for Nonsmooth Minimizations: Accelerating SGD by Exploiting Structure

    Authors: Hua Ouyang, Alexander Gray

    Abstract: In this work we consider the stochastic minimization of nonsmooth convex loss functions, a central problem in machine learning. We propose a novel algorithm called Accelerated Nonsmooth Stochastic Gradient Descent (ANSGD), which exploits the structure of common nonsmooth loss functions to achieve optimal convergence rates for a class of problems including SVMs. It is the first stochastic algorithm… ▽ More

    Submitted 1 October, 2012; v1 submitted 20 May, 2012; originally announced May 2012.

    Comments: Full length version of ICML'12 with all proofs. In this version, a bug in proving Theorem 6 is fixed. We'd like to thank Dr. Francesco Orabona for pointing it out

  19. arXiv:1202.4050  [pdf, other

    cs.LG stat.ML

    On the Sample Complexity of Predictive Sparse Coding

    Authors: Nishant A. Mehta, Alexander G. Gray

    Abstract: The goal of predictive sparse coding is to learn a representation of examples as sparse linear combinations of elements from a dictionary, such that a learned hypothesis linear in the new representation performs well on a predictive task. Predictive sparse coding algorithms recently have demonstrated impressive performance on a variety of supervised tasks, but their generalization properties have… ▽ More

    Submitted 7 October, 2012; v1 submitted 17 February, 2012; originally announced February 2012.

    Comments: Sparse Coding Stability Theorem from version 1 has been relaxed considerably using a new notion of coding margin. Old Sparse Coding Stability Theorem still in new version, now as Theorem 2. Presentation of all proofs simplified/improved considerably. Paper reorganized. Empirical analysis showing new coding margin is non-trivial on real datasets

  20. arXiv:1111.1784  [pdf, other

    stat.ML cs.AI cs.LG

    UPAL: Unbiased Pool Based Active Learning

    Authors: Ravi Ganti, Alexander Gray

    Abstract: In this paper we address the problem of pool based active learning, and provide an algorithm, called UPAL, that works by minimizing the unbiased estimator of the risk of a hypothesis in a given hypothesis space. For the space of linear classifiers and the squared loss we show that UPAL is equivalent to an exponentially weighted average forecaster. Exploiting some recent results regarding the spect… ▽ More

    Submitted 13 November, 2011; v1 submitted 7 November, 2011; originally announced November 2011.

    Comments: 20 pages, 4 figures, 2 tables, a few minor typos were corrected

  21. arXiv:1102.2878  [pdf, ps, other

    stat.CO cs.DS stat.ML

    Dual-Tree Fast Gauss Transforms

    Authors: Dongryeol Lee, Alexander G. Gray, Andrew W. Moore

    Abstract: Kernel density estimation (KDE) is a popular statistical technique for estimating the underlying density distribution with minimal assumptions. Although they can be shown to achieve asymptotic estimation optimality for any input distribution, cross-validating for an optimal parameter requires significant computation dominated by kernel summations. In this paper we present an improvement to the dua… ▽ More

    Submitted 14 February, 2011; originally announced February 2011.

    Comments: Extended version of a conference paper. Submitted to a journal

  22. arXiv:1005.0188  [pdf, other

    cs.LG stat.ML

    Generative and Latent Mean Map Kernels

    Authors: Nishant A. Mehta, Alexander G. Gray

    Abstract: We introduce two kernels that extend the mean map, which embeds probability measures in Hilbert spaces. The generative mean map kernel (GMMK) is a smooth similarity measure between probabilistic models. The latent mean map kernel (LMMK) generalizes the non-iid formulation of Hilbert space embeddings of empirical distributions in order to incorporate latent variable models. When comparing certain c… ▽ More

    Submitted 3 May, 2010; originally announced May 2010.

    Comments: 16 pages, 1 figure, 1 table

  23. Sequential category aggregation and partitioning approaches for multi-way contingency tables based on survey and census data

    Authors: L. Fraser Jackson, Alistair G. Gray, Stephen E. Fienberg

    Abstract: Large contingency tables arise in many contexts but especially in the collection of survey and census data by government statistical agencies. Because the vast majority of the variables in this context have a large number of categories, agencies and users need a systematic way of constructing tables which are summaries of such contingency tables. We propose such an approach in this paper by find… ▽ More

    Submitted 11 November, 2008; originally announced November 2008.

    Comments: Published in at http://dx.doi.org/10.1214/08-AOAS175 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS175

    Journal ref: Annals of Applied Statistics 2008, Vol. 2, No. 3, 955-981