Showing 1–9 of 9 results for author: Aminian, G

  1. arXiv:2405.00454  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    Robust Semi-supervised Learning via $f$-Divergence and $α$-Rényi Divergence

    Authors: Gholamali Aminian, Amirhossien Bagheri, Mahyar JafariNodeh, Radmehr Karimian, Mohammad-Hossein Yassaee

    Abstract: This paper investigates a range of empirical risk functions and regularization methods suitable for self-training methods in semi-supervised learning. These approaches draw inspiration from various divergence measures, such as $f$-divergences and $α$-Rényi divergences. Inspired by the theoretical foundations rooted in divergences, i.e., $f$-divergences and $α$-Rényi divergence, we also provide val…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted in ISIT 2024

  2. arXiv:2402.07025  [pdf, other]

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen…

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  3. arXiv:2306.11623  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Mean-field Analysis of Generalization Errors

    Authors: Gholamali Aminian, Samuel N. Cohen, Łukasz Szpruch

    Abstract: We propose a novel framework for exploring weak and $L_2$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$, is…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 49 pages

    MSC Class: 62B10; 60F99; 49N80; 46N30

  4. arXiv:2210.00483  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    Learning Algorithm Generalization Error Bounds via Auxiliary Distributions

    Authors: Gholamali Aminian, Saeed Masiha, Laura Toni, Miguel R. D. Rodrigues

    Abstract: Generalization error bounds are essential for comprehending how well machine learning models work. In this work, we suggest a novel method, i.e., the Auxiliary Distribution Method, that leads to new upper bounds on expected generalization errors that are appropriate for supervised learning scenarios. We show that our general upper bounds can be specialized under some conditions to new bounds invol…

    Submitted 16 April, 2024; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE Journal on Selected Areas in Information Theory

  5. arXiv:2202.12123  [pdf, ps, other]

    cs.IT stat.ML

    An Information-theoretical Approach to Semi-supervised Learning under Covariate-shift

    Authors: Gholamali Aminian, Mahed Abroshan, Mohammad Mahdi Khalili, Laura Toni, Miguel R. D. Rodrigues

    Abstract: A common assumption in semi-supervised learning is that the labeled, unlabeled, and test data are drawn from the same distribution. However, this assumption is not satisfied in many applications. In many scenarios, the data is collected sequentially (e.g., healthcare) and the distribution of the data may change over time, often exhibiting so-called covariate shifts. In this paper, we propose an app…

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted at AISTATS 2022

  6. arXiv:2111.01635  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

    Authors: Yuheng Bu, Gholamali Aminian, Laura Toni, Miguel Rodrigues, Gregory Wornell

    Abstract: We provide an information-theoretic analysis of the generalization ability of Gibbs-based transfer learning algorithms by focusing on two popular transfer learning approaches, $α$-weighted-ERM and two-stage-ERM. Our key result is an exact characterization of the generalization behaviour using the conditional symmetrized KL information between the output hypothesis and the target training samples g…

    Submitted 2 November, 2021; originally announced November 2021.

  7. arXiv:2107.13656  [pdf, ps, other]

    cs.LG cs.IT math.ST stat.ML

    Characterizing the Generalization Error of Gibbs Algorithm with Symmetrized KL information

    Authors: Gholamali Aminian, Yuheng Bu, Laura Toni, Miguel R. D. Rodrigues, Gregory Wornell

    Abstract: Bounding the generalization error of a supervised learning algorithm is one of the most important problems in learning theory, and various approaches have been developed. However, existing bounds are often loose and lack guarantees. As a result, they may fail to characterize the exact generalization ability of a learning algorithm. Our main contribution is an exact characterization of the expec…

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: The first and second authors contributed equally to the paper. This paper was accepted at the ICML-21 Workshop on Information-Theoretic Methods for Rigorous, Responsible, and Reliable Machine Learning: https://sites.google.com/view/itr3/schedule

  8. arXiv:2102.02016  [pdf, ps, other]

    cs.IT cs.LG stat.ML

    Information-Theoretic Bounds on the Moments of the Generalization Error of Learning Algorithms

    Authors: Gholamali Aminian, Laura Toni, Miguel R. D. Rodrigues

    Abstract: Generalization error bounds are critical to understanding the performance of machine learning models. In this work, building upon a new bound of the expected value of an arbitrary function of the population and empirical risk of a learning algorithm, we offer a more refined analysis of the generalization behaviour of machine learning models based on a characterization of (bounds) to their genera…

    Submitted 5 May, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: 7 pages, 3 figures, to be published in ISIT 2021. Some typos are fixed in the new version. The Rényi divergence results are added in the new version

  9. arXiv:2010.12664  [pdf, ps, other]

    cs.IT math.ST stat.ML

    Jensen-Shannon Information Based Characterization of the Generalization Error of Learning Algorithms

    Authors: Gholamali Aminian, Laura Toni, Miguel R. D. Rodrigues

    Abstract: Generalization error bounds are critical to understanding the performance of machine learning models. In this work, we propose a new information-theoretic generalization error upper bound applicable to supervised learning scenarios. We show that our general bound can be specialized to various previous bounds. We also show that our general bound can be specialized under some conditions to a new b…

    Submitted 8 January, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted in ITW 2020 conference