
Showing 1–10 of 10 results for author: Bondell, H

Searching in archive cs.
  1. arXiv:2406.18806  [pdf, other]

    stat.ML cs.LG

    Density Ratio Estimation via Sampling along Generalized Geodesics on Statistical Manifolds

    Authors: Masanari Kimura, Howard Bondell

    Abstract: The density ratio of two probability distributions is one of the fundamental tools in mathematical and computational statistics and machine learning, and it has a variety of known applications. Therefore, density ratio estimation from finite samples is a very important task, but it is known to be unstable when the distributions are distant from each other. One approach to address this problem is d…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2403.04125  [pdf, other]

    cs.CV

    ComFe: Interpretable Image Classifiers With Foundation Models, Transformers and Component Features

    Authors: Evelyn Mannix, Howard Bondell

    Abstract: Interpretable computer vision models are able to explain their reasoning through comparing the distances between the image patch embeddings and prototypes within a latent space. However, many of these approaches introduce additional complexity, can require multiple training steps and often have a performance cost in comparison to black-box approaches. In this work, we introduce Component Features…

    Submitted 24 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2311.17093  [pdf, other]

    cs.CV cs.LG

    PAWS-VMK: A Unified Approach To Semi-Supervised Learning And Out-of-Distribution Detection

    Authors: Evelyn Mannix, Howard Bondell

    Abstract: This paper describes PAWS-VMK, a prototypical deep learning approach that obtains state-of-the-art results for image classification tasks in both a semi-supervised learning (SSL) and out-of-distribution (OOD) detection context. We consider developments in the fields of SSL, OOD detection, and computer vision foundation models to introduce a number of innovations that connect the key ideas within t…

    Submitted 24 May, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  4. arXiv:2305.10071  [pdf, other]

    cs.CV cs.AI cs.LG

    Cold PAWS: Unsupervised class discovery and addressing the cold-start problem for semi-supervised learning

    Authors: Evelyn J. Mannix, Howard D. Bondell

    Abstract: In many machine learning applications, labeling datasets can be an arduous and time-consuming task. Although research has shown that semi-supervised learning techniques can achieve high accuracy with very few labels within the field of computer vision, little attention has been given to how images within a dataset should be selected for labeling. In this paper, we propose a novel approach based on…

    Submitted 6 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  5. arXiv:2205.13869  [pdf, other]

    cs.LG stat.ML

    MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models

    Authors: Erdun Gao, Ignavier Ng, Mingming Gong, Li Shen, Wei Huang, Tongliang Liu, Kun Zhang, Howard Bondell

    Abstract: State-of-the-art causal discovery methods usually assume that the observational data is complete. However, the missing data problem is pervasive in many practical scenarios such as clinical trials, economics, and biology. One straightforward way to address the missing data problem is first to impute the data using off-the-shelf imputation methods and then apply existing causal discovery methods. H…

    Submitted 16 January, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS22

  6. arXiv:2112.03555  [pdf, other]

    cs.LG stat.ML

    FedDAG: Federated DAG Structure Learning

    Authors: Erdun Gao, Junjia Chen, Li Shen, Tongliang Liu, Mingming Gong, Howard Bondell

    Abstract: To date, most directed acyclic graphs (DAGs) structure learning approaches require data to be stored in a central server. However, due to the consideration of privacy protection, data owners gradually refuse to share their personalized raw data to avoid private information leakage, making this task more troublesome by cutting off the first step. Thus, a puzzle arises: \textit{how do we discover th…

    Submitted 16 January, 2023; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to Transactions on Machine Learning Research

  7. arXiv:2109.14171  [pdf, other]

    stat.ML cs.LG stat.CO

    Non-stationary Gaussian process discriminant analysis with variable selection for high-dimensional functional data

    Authors: W Yu, S Wade, H D Bondell, L Azizi

    Abstract: High-dimensional classification and feature selection tasks are ubiquitous with the recent advancement in data acquisition technology. In several application areas such as biology, genomics and proteomics, the data are often functional in their nature and exhibit a degree of roughness and non-stationarity. These structures pose additional challenges to commonly used methods that rely mainly on a t…

    Submitted 28 September, 2021; originally announced September 2021.

  8. arXiv:2008.07653  [pdf, other]

    stat.ML cs.LG stat.AP

    Nonparametric Conditional Density Estimation In A Deep Learning Framework For Short-Term Forecasting

    Authors: David B. Huberman, Brian J. Reich, Howard D. Bondell

    Abstract: Short-term forecasting is an important tool in understanding environmental processes. In this paper, we incorporate machine learning algorithms into a conditional distribution estimator for the purposes of forecasting tropical cyclone intensity. Many machine learning techniques give a single-point prediction of the conditional distribution of the target variable, which does not give a full account…

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: 44 pages, 5 figures

  9. arXiv:1905.05284  [pdf, ps, other]

    stat.ML cs.LG stat.CO stat.ME

    Variational approximations using Fisher divergence

    Authors: Yue Yang, Ryan Martin, Howard Bondell

    Abstract: Modern applications of Bayesian inference involve models that are sufficiently complex that the corresponding posterior distributions are intractable and must be approximated. The most common approximation is based on Markov chain Monte Carlo, but these can be expensive when the data set is large and/or the model is complex, so more efficient variational approximations have recently received consi…

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: 13 pages, 5 figures, 2 tables

  10. arXiv:1903.06023  [pdf, other]

    stat.ML cs.LG stat.ME

    Deep Distribution Regression

    Authors: Rui Li, Howard D. Bondell, Brian J. Reich

    Abstract: Due to their flexibility and predictive performance, machine-learning based regression methods have become an important tool for predictive modeling and forecasting. However, most methods focus on estimating the conditional mean or specific quantiles of the target quantity and do not provide the full conditional distribution, which contains uncertainty information that might be crucial for decisio…

    Submitted 14 March, 2019; originally announced March 2019.

    Comments: 19 pages, 4 figures