Skip to main content

Showing 1–39 of 39 results for author: Sriperumbudur, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.08401  [pdf, other

    stat.ML cs.LG math.ST

    Nyström Kernel Stein Discrepancy

    Authors: Florian Kalinke, Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Kernel methods underpin many of the most successful approaches in data science and statistics, and they allow representing probability measures as elements of a reproducing kernel Hilbert space without loss of information. Recently, the kernel Stein discrepancy (KSD), which combines Stein's method with kernel techniques, gained considerable attention. Through the Stein operator, KSD allows the con… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    MSC Class: 46E22 (Primary) 62G10 (Secondary) ACM Class: G.3; I.2.6

  2. arXiv:2404.08278  [pdf, other

    math.ST stat.ML

    Minimax Optimal Goodness-of-Fit Testing with Kernel Stein Discrepancy

    Authors: Omar Hagrass, Bharath Sriperumbudur, Krishnakumar Balasubramanian

    Abstract: We explore the minimax optimality of goodness-of-fit tests on general domains using the kernelized Stein discrepancy (KSD). The KSD framework offers a flexible approach for goodness-of-fit testing, avoiding strong distributional assumptions, accommodating diverse data structures beyond Euclidean spaces, and relying only on partial knowledge of the reference distribution, while maintaining computat… ▽ More

    Submitted 20 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 54 pages

    MSC Class: Primary: 62G10; Secondary: 65J20; 65J22; 46E22; 47A52

  3. arXiv:2308.04561  [pdf, other

    math.ST stat.ML

    Spectral Regularized Kernel Goodness-of-Fit Tests

    Authors: Omar Hagrass, Bharath K. Sriperumbudur, Bing Li

    Abstract: Maximum mean discrepancy (MMD) has enjoyed a lot of success in many machine learning and statistical applications, including non-parametric hypothesis testing, because of its ability to handle non-Euclidean data. Recently, it has been demonstrated in Balasubramanian et al.(2021) that the goodness-of-fit test based on MMD is not minimax optimal while a Tikhonov regularized version of it is, for an… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 44 pages. arXiv admin note: text overlap with arXiv:2212.09201

    MSC Class: 62G10 (Primary); 65J20; 65J22; 46E22; 47A52 (Secondary)

  4. arXiv:2306.17329  [pdf, other

    stat.ML cs.LG math.ST

    Kernel $ε$-Greedy for Contextual Bandits

    Authors: Sakshi Arya, Bharath K. Sriperumbudur

    Abstract: We consider a kernelized version of the $ε$-greedy strategy for contextual bandits. More precisely, in a setting with finitely many arms, we consider that the mean reward functions lie in a reproducing kernel Hilbert space (RKHS). We propose an online weighted kernel ridge regression estimator for the reward functions. Under some conditions on the exploration probability sequence, $\{ε_t\}_t$, and… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    MSC Class: 62L10; 62G05; 68T05

  5. arXiv:2212.09201  [pdf, other

    math.ST cs.LG stat.ML

    Spectral Regularized Kernel Two-Sample Tests

    Authors: Omar Hagrass, Bharath K. Sriperumbudur, Bing Li

    Abstract: Over the last decade, an approach that has gained a lot of popularity to tackle nonparametric testing problems on general (i.e., non-Euclidean) domains is based on the notion of reproducing kernel Hilbert space (RKHS) embedding of probability distributions. The main goal of our work is to understand the optimality of two-sample tests constructed based on this approach. First, we show the popular M… ▽ More

    Submitted 1 May, 2024; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: 75 pages, to be published in the Annals of Statistics

    MSC Class: Primary: 62G10; Secondary: 65J20; 65J22; 46E22; 47A52

  6. arXiv:2211.07861  [pdf, other

    stat.ML cs.LG math.AP math.NA math.ST stat.CO

    Regularized Stein Variational Gradient Flow

    Authors: Ye He, Krishnakumar Balasubramanian, Bharath K. Sriperumbudur, Jianfeng Lu

    Abstract: The Stein Variational Gradient Descent (SVGD) algorithm is a deterministic particle method for sampling. However, a mean-field analysis reveals that the gradient flow corresponding to the SVGD algorithm (i.e., the Stein Variational Gradient Flow) only provides a constant-order approximation to the Wasserstein Gradient Flow corresponding to the KL-divergence minimization. In this work, we propose t… ▽ More

    Submitted 8 May, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

  7. arXiv:2207.06357  [pdf, ps, other

    math.ST stat.ME stat.ML

    Shrinkage Estimation of Higher Order Bochner Integrals

    Authors: Saiteja Utpala, Bharath K. Sriperumbudur

    Abstract: We consider shrinkage estimation of higher order Hilbert space valued Bochner integrals in a non-parametric setting. We propose estimators that shrink the $U$-statistic estimator of the Bochner integral towards a pre-specified target element in the Hilbert space. Depending on the degeneracy of the kernel of the $U$-statistic, we construct consistent shrinkage estimators with fast rates of converge… ▽ More

    Submitted 21 July, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: 33 pages; Under Review

    MSC Class: 62G05(Primary); 62F10; 62J07(Secondary)

  8. arXiv:2206.01795  [pdf, other

    math.ST cs.CG cs.LG math.AT stat.ML

    Robust Topological Inference in the Presence of Outliers

    Authors: Siddharth Vishwanath, Bharath K. Sriperumbudur, Kenji Fukumizu, Satoshi Kuriki

    Abstract: The distance function to a compact set plays a crucial role in the paradigm of topological data analysis. In particular, the sublevel sets of the distance function are used in the computation of persistent homology -- a backbone of the topological data analysis pipeline. Despite its stability to perturbations in the Hausdorff distance, persistent homology is highly sensitive to outliers. In this w… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: 50 pages, 10 figures

    MSC Class: 62R40; 55N31; 68T09

  9. arXiv:2111.11328  [pdf, other

    cs.LG stat.ML

    Cycle Consistent Probability Divergences Across Different Spaces

    Authors: Zhengxin Zhang, Youssef Mroueh, Ziv Goldfeld, Bharath K. Sriperumbudur

    Abstract: Discrepancy measures between probability distributions are at the core of statistical inference and machine learning. In many applications, distributions of interest are supported on different spaces, and yet a meaningful correspondence between data points is desired. Motivated to explicitly encode consistent bidirectional maps into the discrepancy measure, this work proposes a novel unbalanced Mo… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 35 pages

  10. arXiv:2105.08875  [pdf, ps, other

    stat.ML cs.LG math.ST

    Statistical Optimality and Computational Efficiency of Nyström Kernel PCA

    Authors: Nicholas Sterge, Bharath Sriperumbudur

    Abstract: Kernel methods provide an elegant framework for develo** nonlinear learning algorithms from simple linear methods. Though these methods have superior empirical performance in several real data applications, their usefulness is inhibited by the significant computational burden incurred in large sample situations. Various approximation schemes have been proposed in the literature to alleviate thes… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: 26 pages

    MSC Class: Primary: 65R15; Secondary: 62H25; 46E22; 65F55

  11. arXiv:2006.10012  [pdf, other

    math.ST cs.CG cs.LG math.AT stat.ML

    Robust Persistence Diagrams using Reproducing Kernels

    Authors: Siddharth Vishwanath, Kenji Fukumizu, Satoshi Kuriki, Bharath Sriperumbudur

    Abstract: Persistent homology has become an important tool for extracting geometric and topological features from data, whose multi-scale features are summarized in a persistence diagram. From a statistical perspective, however, persistence diagrams are very sensitive to perturbations in the input space. In this work, we develop a framework for constructing robust persistence diagrams from superlevel filtra… ▽ More

    Submitted 3 June, 2022; v1 submitted 17 June, 2020; originally announced June 2020.

    MSC Class: 55N31; 62R40; 62G07; 46E22

  12. arXiv:1912.01103  [pdf, ps, other

    math.ST stat.ML

    On Distance and Kernel Measures of Conditional Independence

    Authors: Tianhong Sheng, Bharath K. Sriperumbudur

    Abstract: Measuring conditional independence is one of the important tasks in statistical inference and is fundamental in causal discovery, feature selection, dimensionality reduction, Bayesian network learning, and others. In this work, we explore the connection between conditional independence measures induced by distances on a metric space and reproducing kernels associated with a reproducing kernel Hilb… ▽ More

    Submitted 17 August, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

  13. arXiv:1908.05818  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Gaussian Sketching yields a J-L Lemma in RKHS

    Authors: Samory Kpotufe, Bharath K. Sriperumbudur

    Abstract: The main contribution of the paper is to show that Gaussian sketching of a kernel-Gram matrix $\boldsymbol K$ yields an operator whose counterpart in an RKHS $\mathcal H$, is a \emph{random projection} operator---in the spirit of Johnson-Lindenstrauss (J-L) lemma. To be precise, given a random matrix $Z$ with i.i.d. Gaussian entries, we show that a sketch $Z\boldsymbol{K}$ corresponds to a particu… ▽ More

    Submitted 11 March, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

    Comments: 16 pages

  14. arXiv:1907.05226  [pdf, other

    stat.ML cs.LG math.ST

    Gain with no Pain: Efficient Kernel-PCA by Nyström Sampling

    Authors: Nicholas Sterge, Bharath Sriperumbudur, Lorenzo Rosasco, Alessandro Rudi

    Abstract: In this paper, we propose and study a Nyström based approach to efficient large scale kernel principal component analysis (PCA). The latter is a natural nonlinear extension of classical PCA based on considering a nonlinear feature map or the corresponding kernel. Like other kernel approaches, kernel PCA enjoys good mathematical and statistical properties but, numerically, it scales poorly with the… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: 19 pages, 2 figures

    MSC Class: 62H25; 62H12; 46E22

  15. arXiv:1902.01219  [pdf, ps, other

    math.ST cs.IT cs.LG stat.ML

    Local minimax rates for closeness testing of discrete distributions

    Authors: Joseph Lam-Weil, Alexandra Carpentier, Bharath K. Sriperumbudur

    Abstract: We consider the closeness testing problem for discrete distributions. The goal is to distinguish whether two samples are drawn from the same unspecified distribution, or whether their respective distributions are separated in $L_1$-norm. In this paper, we focus on adapting the rate to the shape of the underlying distributions, i.e. we consider \textit{a local minimax setting}. We provide, to the b… ▽ More

    Submitted 19 January, 2021; v1 submitted 1 February, 2019; originally announced February 2019.

    MSC Class: 62F03; 62G10; 62F35 ACM Class: G.3; I.2.6

  16. arXiv:1810.05207  [pdf, ps, other

    stat.ML cs.LG math.PR

    On Kernel Derivative Approximation with Random Fourier Features

    Authors: Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Random Fourier features (RFF) represent one of the most popular and wide-spread techniques in machine learning to scale up kernel algorithms. Despite the numerous successful applications of RFFs, unfortunately, quite little is understood theoretically on their optimality and limitations of their performance. Only recently, precise statistical-computational trade-offs have been established for RFFs… ▽ More

    Submitted 9 February, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: AISTATS-2019

    MSC Class: 60E10; 42Bxx; 46E22 ACM Class: G.3; I.2.6

  17. arXiv:1807.02582  [pdf, other

    stat.ML cs.LG

    Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences

    Authors: Motonobu Kanagawa, Philipp Hennig, Dino Sejdinovic, Bharath K Sriperumbudur

    Abstract: This paper is an attempt to bridge the conceptual gaps between researchers working on the two widely used approaches based on positive definite kernels: Bayesian learning or inference using Gaussian processes on the one side, and frequentist kernel methods based on reproducing kernel Hilbert spaces on the other. It is widely known in machine learning that these two formalisms are closely related;… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: 64 pages

  18. arXiv:1803.11451  [pdf, ps, other

    math.ST cs.IT stat.ML

    Minimax Estimation of Quadratic Fourier Functionals

    Authors: Shashank Singh, Bharath K. Sriperumbudur, Barnabás Póczos

    Abstract: We study estimation of (semi-)inner products between two nonparametric probability distributions, given IID samples from each distribution. These products include relatively well-studied classical $\mathcal{L}^2$ and Sobolev inner products, as well as those induced by translation-invariant reproducing kernels, for which we believe our results are the first. We first propose estimators for these qu… ▽ More

    Submitted 1 September, 2018; v1 submitted 30 March, 2018; originally announced March 2018.

  19. arXiv:1709.00147  [pdf, other

    math.NA stat.ML

    Convergence Analysis of Deterministic Kernel-Based Quadrature Rules in Misspecified Settings

    Authors: Motonobu Kanagawa, Bharath K. Sriperumbudur, Kenji Fukumizu

    Abstract: This paper presents a convergence analysis of kernel-based quadrature rules in misspecified settings, focusing on deterministic quadrature in Sobolev spaces. In particular, we deal with misspecified settings where a test integrand is less smooth than a Sobolev RKHS based on which a quadrature rule is constructed. We provide convergence guarantees based on two different assumptions on a quadrature… ▽ More

    Submitted 30 October, 2018; v1 submitted 1 September, 2017; originally announced September 2017.

    Comments: 36 pages

    MSC Class: 65D30 (Primary); 65D32; 65D05; 46E35; 46E22 (Secondary)

  20. arXiv:1708.08157  [pdf, ps, other

    stat.ML cs.IT stat.ME

    Characteristic and Universal Tensor Product Kernels

    Authors: Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Maximum mean discrepancy (MMD), also called energy distance or N-distance in statistics and Hilbert-Schmidt independence criterion (HSIC), specifically distance covariance in statistics, are among the most popular and successful approaches to quantify the difference and independence of random variables, respectively. Thanks to their kernel-based foundations, MMD and HSIC are applicable on a wide v… ▽ More

    Submitted 2 August, 2018; v1 submitted 27 August, 2017; originally announced August 2017.

    Comments: final version appeared in JMLR

    MSC Class: 46E22; 94A15; 62G10; 47B32 ACM Class: G.3; H.1.1; I.2.6

    Journal ref: Journal of Machine Learning Research 18(233):1-29, 2018

  21. arXiv:1708.05254  [pdf, other

    stat.ML stat.ME

    Adaptive Clustering Using Kernel Density Estimators

    Authors: Ingo Steinwart, Bharath K. Sriperumbudur, Philipp Thomann

    Abstract: We derive and analyze a generic, recursive algorithm for estimating all splits in a finite cluster tree as well as the corresponding clusters. We further investigate statistical properties of this generic clustering algorithm when it receives level set estimates from a kernel density estimator. In particular, we derive finite sample guarantees, consistency, rates of convergence, and an adaptive da… ▽ More

    Submitted 1 November, 2021; v1 submitted 17 August, 2017; originally announced August 2017.

  22. arXiv:1706.06296  [pdf, ps, other

    stat.ML math.ST

    Approximate Kernel PCA Using Random Features: Computational vs. Statistical Trade-off

    Authors: Bharath Sriperumbudur, Nicholas Sterge

    Abstract: Kernel methods are powerful learning methodologies that allow to perform non-linear data analysis. Despite their popularity, they suffer from poor scalability in big data scenarios. Various approximation methods, including random feature approximation, have been proposed to alleviate the problem. However, the statistical consistency of most of these approximate kernel methods is not well understoo… ▽ More

    Submitted 11 June, 2022; v1 submitted 20 June, 2017; originally announced June 2017.

    Comments: 65 pages

    MSC Class: 62H25; 62G05

  23. arXiv:1605.09522  [pdf, ps, other

    stat.ML cs.LG

    Kernel Mean Embedding of Distributions: A Review and Beyond

    Authors: Krikamol Muandet, Kenji Fukumizu, Bharath Sriperumbudur, Bernhard Schölkopf

    Abstract: A Hilbert space embedding of a distribution---in short, a kernel mean embedding---has recently emerged as a powerful tool for machine learning and inference. The basic idea behind this framework is to map distributions into a reproducing kernel Hilbert space (RKHS) in which the whole arsenal of kernel methods can be extended to probability measures. It can be viewed as a generalization of the orig… ▽ More

    Submitted 13 December, 2020; v1 submitted 31 May, 2016; originally announced May 2016.

    Comments: 147 pages; this is the final version

    Journal ref: Foundations and Trends in Machine Learning: Vol. 10: No. 1-2, pp 1-141 (2017)

  24. arXiv:1605.07254  [pdf, other

    stat.ML

    Convergence guarantees for kernel-based quadrature rules in misspecified settings

    Authors: Motonobu Kanagawa, Bharath K. Sriperumbudur, Kenji Fukumizu

    Abstract: Kernel-based quadrature rules are becoming important in machine learning and statistics, as they achieve super-$\sqrt{n}$ convergence rates in numerical integration, and thus provide alternatives to Monte Carlo integration in challenging settings where integrands are expensive to evaluate or where integrands are high dimensional. These rules are based on the assumption that the integrand has a cer… ▽ More

    Submitted 28 October, 2016; v1 submitted 23 May, 2016; originally announced May 2016.

    Comments: To appear at NIPS2016

  25. arXiv:1506.02155  [pdf, ps, other

    math.ST cs.LG math.FA stat.ML

    Optimal Rates for Random Fourier Features

    Authors: Bharath K. Sriperumbudur, Zoltan Szabo

    Abstract: Kernel methods represent one of the most powerful tools in machine learning to tackle problems expressed in terms of function values and derivatives due to their capability to represent and model complex relations. While these methods show good versatility, they are computationally intensive and have poor scalability to large data as they require operations on Gram matrices. In order to mitigate t… ▽ More

    Submitted 4 November, 2015; v1 submitted 6 June, 2015; originally announced June 2015.

    Comments: To appear at NIPS-2015

    MSC Class: 60E10; 62Gxx; 62Exx; 62H12; 42Bxx; 46E22 ACM Class: G.3; I.2.6; F.2

  26. arXiv:1411.2066  [pdf, ps, other

    math.ST cs.LG math.FA stat.ML

    Learning Theory for Distribution Regression

    Authors: Zoltan Szabo, Bharath Sriperumbudur, Barnabas Poczos, Arthur Gretton

    Abstract: We focus on the distribution regression problem: regressing to vector-valued outputs from probability measures. Many important machine learning and statistical tasks fit into this framework, including multi-instance learning and point estimation problems without analytical solution (such as hyperparameter or entropy estimation). Despite the large number of available heuristics in the literature, t… ▽ More

    Submitted 21 October, 2016; v1 submitted 7 November, 2014; originally announced November 2014.

    Comments: Final version appeared at JMLR, with supplement. Code: https://bitbucket.org/szzoli/ite/. arXiv admin note: text overlap with arXiv:1402.1754

    MSC Class: 62G08; 46E22; 47B32 ACM Class: G.3; I.2.6

    Journal ref: Journal of Machine Learning Research, 17(152):1-40, 2016

  27. arXiv:1411.0900  [pdf, ps, other

    stat.ML math.ST

    Kernel Mean Estimation via Spectral Filtering

    Authors: Krikamol Muandet, Bharath Sriperumbudur, Bernhard Schölkopf

    Abstract: The problem of estimating the kernel mean in a reproducing kernel Hilbert space (RKHS) is central to kernel methods in that it is used by classical approaches (e.g., when centering a kernel PCA matrix), and it also forms the core inference step of modern kernel methods (e.g., kernel-based non-parametric tests) that rely on embedding probability distributions in RKHSs. Muandet et al. (2014) has sho… ▽ More

    Submitted 4 November, 2014; originally announced November 2014.

    Comments: To appear at the 28th Annual Conference on Neural Information Processing Systems (NIPS 2014). 16 pages

  28. arXiv:1405.5505  [pdf, ps, other

    stat.ML cs.LG

    Kernel Mean Shrinkage Estimators

    Authors: Krikamol Muandet, Bharath Sriperumbudur, Kenji Fukumizu, Arthur Gretton, Bernhard Schölkopf

    Abstract: A mean function in a reproducing kernel Hilbert space (RKHS), or a kernel mean, is central to kernel methods in that it is used by many classical algorithms such as kernel principal component analysis, and it also forms the core inference step of modern kernel methods that rely on embedding probability distributions in RKHSs. Given a finite sample, an empirical average has been used commonly as a… ▽ More

    Submitted 25 February, 2016; v1 submitted 21 May, 2014; originally announced May 2014.

    Comments: 41 pages

  29. arXiv:1402.1754  [pdf, ps, other

    math.ST cs.LG math.FA stat.ML

    Two-stage Sampled Learning Theory on Distributions

    Authors: Zoltan Szabo, Arthur Gretton, Barnabas Poczos, Bharath Sriperumbudur

    Abstract: We focus on the distribution regression problem: regressing to a real-valued response from a probability distribution. Although there exist a large number of similarity measures between distributions, very little is known about their generalization performance in specific learning tasks. Learning problems formulated on distributions have an inherent two-stage sampled difficulty: in practice only s… ▽ More

    Submitted 26 January, 2015; v1 submitted 7 February, 2014; originally announced February 2014.

    Comments: v6: accepted at AISTATS-2015 for oral presentation; final version; code: https://bitbucket.org/szzoli/ite/; extension to the misspecified and vector-valued case: http://arxiv.longhoe.net/abs/1411.2066

    MSC Class: 62G08; 46E22; 47B32 ACM Class: G.3; I.2.6

  30. arXiv:1312.3516  [pdf, ps, other

    math.ST stat.ME stat.ML

    Density Estimation in Infinite Dimensional Exponential Families

    Authors: Bharath Sriperumbudur, Kenji Fukumizu, Arthur Gretton, Aapo Hyvärinen, Revant Kumar

    Abstract: In this paper, we consider an infinite dimensional exponential family, $\mathcal{P}$ of probability densities, which are parametrized by functions in a reproducing kernel Hilbert space, $H$ and show it to be quite rich in the sense that a broad class of densities on $\mathbb{R}^d$ can be approximated arbitrarily well in Kullback-Leibler (KL) divergence by elements in $\mathcal{P}$. The main goal o… ▽ More

    Submitted 26 May, 2017; v1 submitted 12 December, 2013; originally announced December 2013.

    Comments: 58 pages, 8 figures; Fixed some errors and typos

  31. arXiv:1306.0842  [pdf, ps, other

    stat.ML cs.LG math.ST

    Kernel Mean Estimation and Stein's Effect

    Authors: Krikamol Muandet, Kenji Fukumizu, Bharath Sriperumbudur, Arthur Gretton, Bernhard Schölkopf

    Abstract: A mean function in reproducing kernel Hilbert space, or a kernel mean, is an important part of many applications ranging from kernel principal component analysis to Hilbert-space embedding of distributions. Given finite samples, an empirical average is the standard estimate for the true kernel mean. We show that this estimator can be improved via a well-known phenomenon in statistics called Stein'… ▽ More

    Submitted 6 June, 2013; v1 submitted 4 June, 2013; originally announced June 2013.

    Comments: first draft

  32. arXiv:1305.2505  [pdf, other

    cs.LG stat.ML

    On the Generalization Ability of Online Learning Algorithms for Pairwise Loss Functions

    Authors: Purushottam Kar, Bharath K Sriperumbudur, Prateek Jain, Harish C Karnick

    Abstract: In this paper, we study the generalization properties of online learning based stochastic methods for supervised learning problems where the loss function is dependent on more than one training sample (e.g., metric learning, ranking). We present a generic decoupling technique that enables us to provide Rademacher complexity-based generalization error bounds. Our bounds are in general tighter than… ▽ More

    Submitted 11 May, 2013; originally announced May 2013.

    Comments: To appear in proceedings of the 30th International Conference on Machine Learning (ICML 2013)

    Journal ref: Journal of Machine Learning Research, W&CP 28(3) (2013)

  33. arXiv:1207.6076  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    Equivalence of distance-based and RKHS-based statistics in hypothesis testing

    Authors: Dino Sejdinovic, Bharath Sriperumbudur, Arthur Gretton, Kenji Fukumizu

    Abstract: We provide a unifying framework linking two classes of statistics used in two-sample and independence testing: on the one hand, the energy distances and distance covariances from the statistics literature; on the other, maximum mean discrepancies (MMD), that is, distances between embeddings of distributions to reproducing kernel Hilbert spaces (RKHS), as established in machine learning. In the cas… ▽ More

    Submitted 12 November, 2013; v1 submitted 25 July, 2012; originally announced July 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1140 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1140

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 5, 2263-2291

  34. arXiv:1205.0411  [pdf, ps, other

    cs.LG stat.ME stat.ML

    Hypothesis testing using pairwise distances and associated kernels (with Appendix)

    Authors: Dino Sejdinovic, Arthur Gretton, Bharath Sriperumbudur, Kenji Fukumizu

    Abstract: We provide a unifying framework linking two classes of statistics used in two-sample and independence testing: on the one hand, the energy distances and distance covariances from the statistics literature; on the other, distances between embeddings of distributions to reproducing kernel Hilbert spaces (RKHS), as established in machine learning. The equivalence holds when energy distances are compu… ▽ More

    Submitted 21 May, 2012; v1 submitted 2 May, 2012; originally announced May 2012.

    Comments: Appearing in Proceedings of the 29th International Conference on Machine Learning, Edinburgh, Scotland, UK, 2012

  35. Discussion of: Brownian distance covariance

    Authors: Arthur Gretton, Kenji Fukumizu, Bharath K. Sriperumbudur

    Abstract: Discussion on "Brownian distance covariance" by Gábor J. Székely and Maria L. Rizzo [arXiv:1010.0297]

    Submitted 5 October, 2010; originally announced October 2010.

    Comments: Published in at http://dx.doi.org/10.1214/09-AOAS312E the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS312E

    Journal ref: Annals of Applied Statistics 2009, Vol. 3, No. 4, 1285-1294

  36. arXiv:1003.0887  [pdf, ps, other

    stat.ML math.ST

    Universality, Characteristic Kernels and RKHS Embedding of Measures

    Authors: Bharath K. Sriperumbudur, Kenji Fukumizu, Gert R. G. Lanckriet

    Abstract: A Hilbert space embedding for probability measures has recently been proposed, wherein any probability measure is represented as a mean element in a reproducing kernel Hilbert space (RKHS). Such an embedding has found applications in homogeneity testing, independence testing, dimensionality reduction, etc., with the requirement that the reproducing kernel is characteristic, i.e., the embedding i… ▽ More

    Submitted 3 March, 2010; originally announced March 2010.

    Comments: 30 pages, 1 figure

  37. arXiv:0907.5309  [pdf, ps, other

    stat.ML math.ST

    Hilbert space embeddings and metrics on probability measures

    Authors: Bharath K. Sriperumbudur, Arthur Gretton, Kenji Fukumizu, Bernhard Schölkopf, Gert R. G. Lanckriet

    Abstract: A Hilbert space embedding for probability measures has recently been proposed, with applications including dimensionality reduction, homogeneity testing, and independence testing. This embedding represents any probability measure as a mean element in a reproducing kernel Hilbert space (RKHS). A pseudometric on the space of probability measures can be defined as the distance between distribution… ▽ More

    Submitted 29 January, 2010; v1 submitted 30 July, 2009; originally announced July 2009.

    Comments: 48 pages

  38. arXiv:0901.1504  [pdf, ps, other

    stat.ML stat.ME

    A D.C. Programming Approach to the Sparse Generalized Eigenvalue Problem

    Authors: Bharath Sriperumbudur, David Torres, Gert Lanckriet

    Abstract: In this paper, we consider the sparse eigenvalue problem wherein the goal is to obtain a sparse solution to the generalized eigenvalue problem. We achieve this by constraining the cardinality of the solution to the generalized eigenvalue problem and obtain sparse principal component analysis (PCA), sparse canonical correlation analysis (CCA) and sparse Fisher discriminant analysis (FDA) as speci… ▽ More

    Submitted 12 October, 2009; v1 submitted 12 January, 2009; originally announced January 2009.

    Comments: 40 pages

  39. arXiv:0706.3499  [pdf, ps, other

    stat.ML

    Metric Embedding for Nearest Neighbor Classification

    Authors: Bharath K. Sriperumbudur, Gert R. G. Lanckriet

    Abstract: The distance metric plays an important role in nearest neighbor (NN) classification. Usually the Euclidean distance metric is assumed or a Mahalanobis distance metric is optimized to improve the NN performance. In this paper, we study the problem of embedding arbitrary metric spaces into a Euclidean space with the goal to improve the accuracy of the NN classifier. We propose a solution by appeal… ▽ More

    Submitted 24 June, 2007; originally announced June 2007.

    Comments: 9 pages, 1 table