Skip to main content

Showing 1–26 of 26 results for author: Gray, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.02350  [pdf, ps, other

    cs.LG cs.AI

    What makes Models Compositional? A Theoretical View: With Supplement

    Authors: Parikshit Ram, Tim Klinger, Alexander G. Gray

    Abstract: Compositionality is thought to be a key component of language, and various compositional benchmarks have been developed to empirically probe the compositional generalization of existing sequence processing models. These benchmarks often highlight failures of existing models, but it is not clear why these models fail in this way. In this paper, we seek to theoretically understand the role the compo… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Extended version of the original IJCAI 2024 paper with detailed supplementary materials (27 pages, 7 figures)

  2. arXiv:2309.05137  [pdf, other

    cs.PL cs.HC

    Debugging Trait Errors as Logic Programs

    Authors: Gavin Gray, Will Crichton

    Abstract: Rust uses traits to define units of shared behavior. Trait constraints build up an implicit set of first-order hereditary Harrop clauses which is executed by a powerful logic programming engine in the trait system. But that power comes at a cost: the number of traits in Rust libraries is increasing, which puts a growing burden on the trait system to help programmers diagnose errors. Beyond a certa… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 9 pages, 2 figures

  3. arXiv:2309.04134  [pdf, other

    cs.PL cs.HC

    A Grounded Conceptual Model for Ownership Types in Rust

    Authors: Will Crichton, Gavin Gray, Shriram Krishnamurthi

    Abstract: Programmers learning Rust struggle to understand ownership types, Rust's core mechanism for ensuring memory safety without garbage collection. This paper describes our attempt to systematically design a pedagogy for ownership types. First, we studied Rust developers' misconceptions of ownership to create the Ownership Inventory, a new instrument for measuring a person's knowledge of ownership. We… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: Published at OOPSLA 2023

  4. arXiv:2301.05131  [pdf, other

    cs.LG

    Toward Theoretical Guidance for Two Common Questions in Practical Cross-Validation based Hyperparameter Selection

    Authors: Parikshit Ram, Alexander G. Gray, Horst C. Samulowitz, Gregory Bramble

    Abstract: We show, to our knowledge, the first theoretical treatments of two common questions in cross-validation based hyperparameter selection: (1) After selecting the best hyperparameter using a held-out set, we train the final model using {\em all} of the training data -- since this may or may not improve future generalization error, should one do this? (2) During optimization such as via SGD (stochasti… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Extended version of the paper appearing at the SIAM International Conference on Data Mining 2023 (SDM23)

  5. arXiv:2106.13367  [pdf, other

    cs.AI cs.DB

    SeaNet -- Towards A Knowledge Graph Based Autonomic Management of Software Defined Networks

    Authors: Qianru Zhou, Alasdair J. G. Gray, Stephen McLaughlin

    Abstract: Automatic network management driven by Artificial Intelligent technologies has been heatedly discussed over decades. However, current reports mainly focus on theoretic proposals and architecture designs, works on practical implementations on real-life networks are yet to appear. This paper proposes our effort toward the implementation of knowledge graph driven approach for autonomic network manage… ▽ More

    Submitted 27 May, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

  6. SARA -- A Semantic Access Point Resource Allocation Service for Heterogenous Wireless Networks

    Authors: Qianru Zhou, Alasdair J. G. Gray, Dimitrios Pezaros, Stephen McLaughlin

    Abstract: In this paper, we present SARA, a Semantic Access point Resource Allocation service for heterogenous wireless networks with various wireless access technologies existing together. By automatically reasoning on the knowledge base of the full system provided by a knowledge based autonomic network management system -- SEANET, SARA selects the access point providing the best quality of service among t… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 2019 IEEE Wireless Day

  7. arXiv:2006.09635  [pdf, other

    cs.LG math.OC stat.ML

    Solving Constrained CASH Problems with ADMM

    Authors: Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

    Abstract: The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available. However, CASH solvers do not directly handle black-box constraints such as fairness, robustness or other domain-specific custom constraints. We present our recent approach [Liu, et al., 2020] that leverages the ADMM optimization fram… ▽ More

    Submitted 10 July, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 7th ICML Workshop on Automated Machine Learning (2020)

  8. arXiv:1906.04113  [pdf, other

    cs.LG stat.ML

    BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget

    Authors: Jack Turner, Elliot J. Crowley, Michael O'Boyle, Amos Storkey, Gavin Gray

    Abstract: The desire to map neural networks to varying-capacity devices has led to the development of a wealth of compression techniques, many of which involve replacing standard convolutional blocks in a large network with cheap alternative blocks. However, not all blocks are created equally; for a required compute budget there may exist a potent combination of many different cheap blocks, though exhaustiv… ▽ More

    Submitted 23 January, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: ICLR 2020

  9. arXiv:1906.00859  [pdf, other

    stat.ML cs.LG

    Separable Layers Enable Structured Efficient Linear Substitutions

    Authors: Gavin Gray, Elliot J. Crowley, Amos Storkey

    Abstract: In response to the development of recent efficient dense layers, this paper shows that something as simple as replacing linear components in pointwise convolutions with structured linear decompositions also produces substantial gains in the efficiency/accuracy tradeoff. Pointwise convolutions are fully connected layers and are thus prepared for replacement by structured transforms. Networks using… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

  10. arXiv:1903.05372  [pdf, other

    cs.CY cs.NI

    Lost Silence: An emergency response early detection service through continuous processing of telecommunication data streams

    Authors: Qianru Zhou, Stephen McLaughlin, Alasdair J. G. Gray, Shangbin Wu, Chengxiang Wang

    Abstract: Early detection of significant traumatic events, e.g. a terrorist attack or a ship capsizing, is important to ensure that a prompt emergency response can occur. In the modern world telecommunication systems could play a key role in ensuring a successful emergency response by detecting such incidents through significant changes in calls and access to the networks. In this paper a methodology is ill… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: 15 pages, 4 figures, WSP ISWC 2017 conference

    Journal ref: ISWC WSP 2017, pp. 33--47

  11. arXiv:1711.02613  [pdf, other

    stat.ML cs.CV cs.LG

    Moonshine: Distilling with Cheap Convolutions

    Authors: Elliot J. Crowley, Gavin Gray, Amos Storkey

    Abstract: Many engineers wish to deploy modern neural networks in memory-limited settings; but the development of flexible methods for reducing memory use is in its infancy, and there is little knowledge of the resulting cost-benefit. We propose structural model distillation for memory reduction using a strategy that produces a student architecture that is a simple transformation of the teacher architecture… ▽ More

    Submitted 17 January, 2019; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018)

  12. arXiv:1309.6830  [pdf

    cs.LG stat.ML

    Building Bridges: Viewing Active Learning from the Multi-Armed Bandit Lens

    Authors: Ravi Ganti, Alexander G. Gray

    Abstract: In this paper we propose a multi-armed bandit inspired, pool based active learning algorithm for the problem of binary classification. By carefully constructing an analogy between active learning and multi-armed bandits, we utilize ideas such as lower confidence bounds, and self-concordant regularization from the multi-armed bandit literature to design our proposed algorithm. Our algorithm is a se… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-232-241

  13. PAV ontology: Provenance, Authoring and Versioning

    Authors: Paolo Ciccarese, Stian Soiland-Reyes, Khalid Belhajjame, Alasdair J G Gray, Carole Goble, Tim Clark

    Abstract: Provenance is a critical ingredient for establishing trust of published scientific content. This is true whether we are considering a data set, a computational workflow, a peer-reviewed publication or a simple scientific claim with supportive evidence. Existing vocabularies such as DC Terms and the W3C PROV-O are domain-independent and general-purpose and they allow and encourage for extensions to… ▽ More

    Submitted 6 December, 2013; v1 submitted 26 April, 2013; originally announced April 2013.

    Comments: 22 pages (incl 5 tables and 19 figures). Submitted to Journal of Biomedical Semantics 2013-04-26 (#1858276535979415). Revised article submitted 2013-08-30. Second revised article submitted 2013-10-06. Accepted 2013-10-07. Author proofs sent 2013-10-09 and 2013-10-16. Published 2013-11-22. Final version 2013-12-06. http://www.jbiomedsem.com/content/4/1/37

    Report number: University of Manchester eScholar: uk-ac-man-scw:193385 ACM Class: I.2.4; H.2.1; H.3.7; I.7.4

    Journal ref: Journal of Biomedical Semantics 2013, 4:37

  14. arXiv:1304.4327  [pdf, ps, other

    cs.DS

    Tree-Independent Dual-Tree Algorithms

    Authors: Ryan R. Curtin, William B. March, Parikshit Ram, David V. Anderson, Alexander G. Gray, Charles L. Isbell Jr

    Abstract: Dual-tree algorithms are a widely used class of branch-and-bound algorithms. Unfortunately, develo** dual-tree algorithms for use with different trees and problems is often complex and burdensome. We introduce a four-part logical split: the tree, the traversal, the point-to-point base case, and the pruning rule. We provide a meta-algorithm which allows development of dual-tree algorithms in a tr… ▽ More

    Submitted 16 April, 2013; originally announced April 2013.

    Comments: accepted in ICML 2013

  15. arXiv:1210.6293  [pdf, ps, other

    cs.MS cs.CV cs.LG

    MLPACK: A Scalable C++ Machine Learning Library

    Authors: Ryan R. Curtin, James R. Cline, N. P. Slagle, William B. March, Parikshit Ram, Nishant A. Mehta, Alexander G. Gray

    Abstract: MLPACK is a state-of-the-art, scalable, multi-platform C++ machine learning library released in late 2011 offering both a simple, consistent API accessible to novice users and high performance and flexibility to expert users by leveraging modern features of C++. MLPACK provides cutting-edge algorithms whose benchmarks exhibit far better performance than other leading machine learning libraries. ML… ▽ More

    Submitted 23 October, 2012; originally announced October 2012.

    Comments: Submitted to JMLR MLOSS (http://jmlr.csail.mit.edu/mloss/)

    Journal ref: Journal of Machine Learning Research 14 (2013) 801-805

  16. arXiv:1210.6287  [pdf, ps, other

    cs.DS cs.IR cs.LG

    Fast Exact Max-Kernel Search

    Authors: Ryan R. Curtin, Parikshit Ram, Alexander G. Gray

    Abstract: The wide applicability of kernels makes the problem of max-kernel search ubiquitous and more general than the usual similarity search in metric spaces. We focus on solving this problem efficiently. We begin by characterizing the inherent hardness of the max-kernel search problem with a novel notion of directional concentration. Following that, we present a method to use an $O(n \log n)$ algorithm… ▽ More

    Submitted 26 October, 2012; v1 submitted 23 October, 2012; originally announced October 2012.

    Comments: Under submission in SIAM Data Mining conference

  17. arXiv:1209.2784  [pdf, other

    cs.LG stat.ML

    Minimax Multi-Task Learning and a Generalized Loss-Compositional Paradigm for MTL

    Authors: Nishant A. Mehta, Dongryeol Lee, Alexander G. Gray

    Abstract: Since its inception, the modus operandi of multi-task learning (MTL) has been to minimize the task-wise mean of the empirical risks. We introduce a generalized loss-compositional paradigm for MTL that includes a spectrum of formulations as a subfamily. One endpoint of this spectrum is minimax MTL: a new MTL formulation that minimizes the maximum of the tasks' empirical risks. Via a certain relaxat… ▽ More

    Submitted 13 September, 2012; originally announced September 2012.

    Comments: appearing at NIPS 2012

  18. arXiv:1206.6857  [pdf

    cs.LG math.NA stat.ML

    Faster Gaussian Summation: Theory and Experiment

    Authors: Dongryeol Lee, Alexander G. Gray

    Abstract: We provide faster algorithms for the problem of Gaussian summation, which occurs in many machine learning methods. We develop two new extensions - an O(Dp) Taylor expansion for the Gaussian kernel with rigorous error bounds and a new error control scheme integrating any arbitrary approximation method - within the best discretealgorithmic framework using adaptive hierarchical data structures. We ri… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-281-288

  19. arXiv:1206.5278  [pdf

    stat.ME cs.LG stat.ML

    Fast Nonparametric Conditional Density Estimation

    Authors: Michael P. Holmes, Alexander G. Gray, Charles Lee Isbell

    Abstract: Conditional density estimation generalizes regression by modeling a full density f(yjx) rather than only the expected value E(yjx). This is important for many tasks, including handling multi-modality and generating prediction intervals. Though fundamental and widely applicable, nonparametric conditional density estimators have received relatively little attention from statisticians and little or n… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-175-182

  20. arXiv:1202.6101  [pdf, ps, other

    cs.CG cs.DS cs.IR

    Maximum Inner-Product Search using Tree Data-structures

    Authors: Parikshit Ram, Alexander G. Gray

    Abstract: The problem of {\em efficiently} finding the best match for a query in a given set with respect to the Euclidean distance or the cosine similarity has been extensively studied in literature. However, a closely related problem of efficiently finding the best match with respect to the inner product has never been explored in the general setting to the best of our knowledge. In this paper we consider… ▽ More

    Submitted 27 February, 2012; originally announced February 2012.

    Comments: Under submission in KDD 2012

  21. arXiv:1202.4050  [pdf, other

    cs.LG stat.ML

    On the Sample Complexity of Predictive Sparse Coding

    Authors: Nishant A. Mehta, Alexander G. Gray

    Abstract: The goal of predictive sparse coding is to learn a representation of examples as sparse linear combinations of elements from a dictionary, such that a learned hypothesis linear in the new representation performs well on a predictive task. Predictive sparse coding algorithms recently have demonstrated impressive performance on a variety of supervised tasks, but their generalization properties have… ▽ More

    Submitted 7 October, 2012; v1 submitted 17 February, 2012; originally announced February 2012.

    Comments: Sparse Coding Stability Theorem from version 1 has been relaxed considerably using a new notion of coding margin. Old Sparse Coding Stability Theorem still in new version, now as Theorem 2. Presentation of all proofs simplified/improved considerably. Paper reorganized. Empirical analysis showing new coding margin is non-trivial on real datasets

  22. arXiv:1105.2769  [pdf, ps, other

    physics.comp-ph cs.DS

    Multibody Multipole Methods

    Authors: Dongryeol Lee, Arkadas Ozakin, Alexander G. Gray

    Abstract: A three-body potential function can account for interactions among triples of particles which are uncaptured by pairwise interaction functions such as Coulombic or Lennard-Jones potentials. Likewise, a multibody potential of order $n$ can account for interactions among $n$-tuples of particles uncaptured by interaction functions of lower orders. To date, the computation of multibody potential funct… ▽ More

    Submitted 30 June, 2012; v1 submitted 13 May, 2011; originally announced May 2011.

    Comments: To appear in Journal of Computational Physics

    MSC Class: 68U01 ACM Class: J.2

  23. arXiv:1102.2878  [pdf, ps, other

    stat.CO cs.DS stat.ML

    Dual-Tree Fast Gauss Transforms

    Authors: Dongryeol Lee, Alexander G. Gray, Andrew W. Moore

    Abstract: Kernel density estimation (KDE) is a popular statistical technique for estimating the underlying density distribution with minimal assumptions. Although they can be shown to achieve asymptotic estimation optimality for any input distribution, cross-validating for an optimal parameter requires significant computation dominated by kernel summations. In this paper we present an improvement to the dua… ▽ More

    Submitted 14 February, 2011; originally announced February 2011.

    Comments: Extended version of a conference paper. Submitted to a journal

  24. arXiv:1005.0188  [pdf, other

    cs.LG stat.ML

    Generative and Latent Mean Map Kernels

    Authors: Nishant A. Mehta, Alexander G. Gray

    Abstract: We introduce two kernels that extend the mean map, which embeds probability measures in Hilbert spaces. The generative mean map kernel (GMMK) is a smooth similarity measure between probabilistic models. The latent mean map kernel (LMMK) generalizes the non-iid formulation of Hilbert space embeddings of empirical distributions in order to incorporate latent variable models. When comparing certain c… ▽ More

    Submitted 3 May, 2010; originally announced May 2010.

    Comments: 16 pages, 1 figure, 1 table

  25. arXiv:0810.4611  [pdf, ps, other

    cs.LG

    Learning Isometric Separation Maps

    Authors: Nikolaos Vasiloglou, Alexander G. Gray, David V. Anderson

    Abstract: Maximum Variance Unfolding (MVU) and its variants have been very successful in embedding data-manifolds in lower dimensional spaces, often revealing the true intrinsic dimension. In this paper we show how to also incorporate supervised class information into an MVU-like method without breaking its convexity. We call this method the Isometric Separation Map and we show that the resulting kernel m… ▽ More

    Submitted 15 April, 2009; v1 submitted 25 October, 2008; originally announced October 2008.

    Comments: Submitted to the NIPS workshop on Kernel Learning:Automatic Selection Of Kernels and now presented in MLSP 2009

  26. arXiv:0810.2311  [pdf, ps, other

    cs.AI cs.CV

    Non-Negative Matrix Factorization, Convexity and Isometry

    Authors: Nikolaos Vasiloglou, Alexander G. Gray, David V. Anderson

    Abstract: In this paper we explore avenues for improving the reliability of dimensionality reduction methods such as Non-Negative Matrix Factorization (NMF) as interpretive exploratory data analysis tools. We first explore the difficulties of the optimization problem underlying NMF, showing for the first time that non-trivial NMF solutions always exist and that the optimization problem is actually convex,… ▽ More

    Submitted 22 April, 2009; v1 submitted 13 October, 2008; originally announced October 2008.

    Comments: accpepted in SIAM Data Mining 2009, 12 pages