Search | arXiv e-print repository

Understanding Entropic Regularization in GANs

Authors: Daria Reshetova, Yikun Bai, Xiugang Wu, Ayfer Ozgur

Abstract: Generative Adversarial Networks are a popular method for learning distributions from data by modeling the target distribution as a function of a known distribution. The function, often referred to as the generator, is optimized to minimize a chosen distance measure between the generated and target distributions. One commonly used measure for this purpose is the Wasserstein distance. However, Wasse… ▽ More Generative Adversarial Networks are a popular method for learning distributions from data by modeling the target distribution as a function of a known distribution. The function, often referred to as the generator, is optimized to minimize a chosen distance measure between the generated and target distributions. One commonly used measure for this purpose is the Wasserstein distance. However, Wasserstein distance is hard to compute and optimize, and in practice entropic regularization techniques are used to improve numerical convergence. The influence of regularization on the learned solution, however, remains not well-understood. In this paper, we study how several popular entropic regularizations of Wasserstein distance impact the solution in a simple benchmark setting where the generator is linear and the target distribution is high-dimensional Gaussian. We show that entropy regularization promotes the solution sparsification, while replacing the Wasserstein distance with the Sinkhorn divergence recovers the unregularized solution. Both regularization techniques remove the curse of dimensionality suffered by Wasserstein distance. We show that the optimal generator can be learned to accuracy $ε$ with $O(1/ε^2)$ samples from the target distribution. We thus conclude that these regularization techniques can improve the quality of the generator learned from empirical data for a large class of distributions. △ Less

Submitted 2 November, 2021; originally announced November 2021.

Comments: 29 pages, 7 figures

arXiv:1607.00076 [pdf, ps, other]

Multi-class classification: mirror descent approach

Authors: Daria Reshetova

Abstract: We consider the problem of multi-class classification and a stochastic opti- mization approach to it. We derive risk bounds for stochastic mirror descent algorithm and provide examples of set geometries that make the use of the algorithm efficient in terms of error in k. We consider the problem of multi-class classification and a stochastic opti- mization approach to it. We derive risk bounds for stochastic mirror descent algorithm and provide examples of set geometries that make the use of the algorithm efficient in terms of error in k. △ Less

Submitted 8 December, 2016; v1 submitted 30 June, 2016; originally announced July 2016.

arXiv:1507.03040 [pdf, other]

doi 10.1134/S105466181604009X

Tight Risk Bounds for Multi-Class Margin Classifiers

Authors: Yury Maximov, Daria Reshetova

Abstract: We consider a problem of risk estimation for large-margin multi-class classifiers. We propose a novel risk bound for the multi-class classification problem. The bound involves the marginal distribution of the classifier and the Rademacher complexity of the hypothesis class. We prove that our bound is tight in the number of classes. Finally, we compare our bound with the related ones and provide a… ▽ More We consider a problem of risk estimation for large-margin multi-class classifiers. We propose a novel risk bound for the multi-class classification problem. The bound involves the marginal distribution of the classifier and the Rademacher complexity of the hypothesis class. We prove that our bound is tight in the number of classes. Finally, we compare our bound with the related ones and provide a simplified version of the bound for the multi-class classification with kernel based hypotheses. △ Less

Submitted 2 July, 2016; v1 submitted 10 July, 2015; originally announced July 2015.

Comments: 11 pages

Journal ref: Pattern Recognition and Image Analysis 26, 673 - 680 (2016)

Showing 1–3 of 3 results for author: Reshetova, D