Skip to main content

Showing 1–37 of 37 results for author: Sriperumbudur, B

Searching in archive math. Search in all archives.
.
  1. arXiv:2406.10005  [pdf, ps, other

    math.ST

    Optimal Rates for Functional Linear Regression with General Regularization

    Authors: Naveen Gupta, S. Sivananthan, Bharath K. Sriperumbudur

    Abstract: Functional linear regression is one of the fundamental and well-studied methods in functional data analysis. In this work, we investigate the functional linear regression model within the context of reproducing kernel Hilbert space by employing general spectral regularization to approximate the slope function with certain smoothness assumptions. We establish optimal convergence rates for estimatio… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.08401  [pdf, other

    stat.ML cs.LG math.ST

    Nyström Kernel Stein Discrepancy

    Authors: Florian Kalinke, Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Kernel methods underpin many of the most successful approaches in data science and statistics, and they allow representing probability measures as elements of a reproducing kernel Hilbert space without loss of information. Recently, the kernel Stein discrepancy (KSD), which combines Stein's method with kernel techniques, gained considerable attention. Through the Stein operator, KSD allows the con… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    MSC Class: 46E22 (Primary) 62G10 (Secondary) ACM Class: G.3; I.2.6

  3. arXiv:2404.08278  [pdf, other

    math.ST stat.ML

    Minimax Optimal Goodness-of-Fit Testing with Kernel Stein Discrepancy

    Authors: Omar Hagrass, Bharath Sriperumbudur, Krishnakumar Balasubramanian

    Abstract: We explore the minimax optimality of goodness-of-fit tests on general domains using the kernelized Stein discrepancy (KSD). The KSD framework offers a flexible approach for goodness-of-fit testing, avoiding strong distributional assumptions, accommodating diverse data structures beyond Euclidean spaces, and relying only on partial knowledge of the reference distribution, while maintaining computat… ▽ More

    Submitted 20 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 54 pages

    MSC Class: Primary: 62G10; Secondary: 65J20; 65J22; 46E22; 47A52

  4. arXiv:2310.02607  [pdf, ps, other

    math.ST

    Convergence Analysis of Kernel Conjugate Gradient for Functional Linear Regression

    Authors: Naveen Gupta, S. Sivananthan, Bharath K. Sriperumbudur

    Abstract: In this paper, we discuss the convergence analysis of the conjugate gradient-based algorithm for the functional linear model in the reproducing kernel Hilbert space framework, utilizing early stop** results in regularization against over-fitting. We establish the convergence rates depending on the regularity condition of the slope function and the decay rate of the eigenvalues of the operator co… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    MSC Class: 62R10; 62G20; 65F22

  5. arXiv:2308.04561  [pdf, other

    math.ST stat.ML

    Spectral Regularized Kernel Goodness-of-Fit Tests

    Authors: Omar Hagrass, Bharath K. Sriperumbudur, Bing Li

    Abstract: Maximum mean discrepancy (MMD) has enjoyed a lot of success in many machine learning and statistical applications, including non-parametric hypothesis testing, because of its ability to handle non-Euclidean data. Recently, it has been demonstrated in Balasubramanian et al.(2021) that the goodness-of-fit test based on MMD is not minimax optimal while a Tikhonov regularized version of it is, for an… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 44 pages. arXiv admin note: text overlap with arXiv:2212.09201

    MSC Class: 62G10 (Primary); 65J20; 65J22; 46E22; 47A52 (Secondary)

  6. arXiv:2306.17329  [pdf, other

    stat.ML cs.LG math.ST

    Kernel $ε$-Greedy for Contextual Bandits

    Authors: Sakshi Arya, Bharath K. Sriperumbudur

    Abstract: We consider a kernelized version of the $ε$-greedy strategy for contextual bandits. More precisely, in a setting with finitely many arms, we consider that the mean reward functions lie in a reproducing kernel Hilbert space (RKHS). We propose an online weighted kernel ridge regression estimator for the reward functions. Under some conditions on the exploration probability sequence, $\{ε_t\}_t$, and… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    MSC Class: 62L10; 62G05; 68T05

  7. arXiv:2212.12848  [pdf, other

    math.ST

    Gromov-Wasserstein Distances: Entropic Regularization, Duality, and Sample Complexity

    Authors: Zhengxin Zhang, Ziv Goldfeld, Youssef Mroueh, Bharath K. Sriperumbudur

    Abstract: The Gromov-Wasserstein (GW) distance, rooted in optimal transport (OT) theory, quantifies dissimilarity between metric measure spaces and provides a framework for aligning heterogeneous datasets. While computational aspects of the GW problem have been widely studied, a duality theory and fundamental statistical questions concerning empirical convergence rates remained obscure. This work closes the… ▽ More

    Submitted 28 September, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    Comments: 47 pages

  8. arXiv:2212.09201  [pdf, other

    math.ST cs.LG stat.ML

    Spectral Regularized Kernel Two-Sample Tests

    Authors: Omar Hagrass, Bharath K. Sriperumbudur, Bing Li

    Abstract: Over the last decade, an approach that has gained a lot of popularity to tackle nonparametric testing problems on general (i.e., non-Euclidean) domains is based on the notion of reproducing kernel Hilbert space (RKHS) embedding of probability distributions. The main goal of our work is to understand the optimality of two-sample tests constructed based on this approach. First, we show the popular M… ▽ More

    Submitted 1 May, 2024; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: 75 pages, to be published in the Annals of Statistics

    MSC Class: Primary: 62G10; Secondary: 65J20; 65J22; 46E22; 47A52

  9. arXiv:2211.07861  [pdf, other

    stat.ML cs.LG math.AP math.NA math.ST stat.CO

    Regularized Stein Variational Gradient Flow

    Authors: Ye He, Krishnakumar Balasubramanian, Bharath K. Sriperumbudur, Jianfeng Lu

    Abstract: The Stein Variational Gradient Descent (SVGD) algorithm is a deterministic particle method for sampling. However, a mean-field analysis reveals that the gradient flow corresponding to the SVGD algorithm (i.e., the Stein Variational Gradient Flow) only provides a constant-order approximation to the Wasserstein Gradient Flow corresponding to the KL-divergence minimization. In this work, we propose t… ▽ More

    Submitted 8 May, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

  10. arXiv:2207.06357  [pdf, ps, other

    math.ST stat.ME stat.ML

    Shrinkage Estimation of Higher Order Bochner Integrals

    Authors: Saiteja Utpala, Bharath K. Sriperumbudur

    Abstract: We consider shrinkage estimation of higher order Hilbert space valued Bochner integrals in a non-parametric setting. We propose estimators that shrink the $U$-statistic estimator of the Bochner integral towards a pre-specified target element in the Hilbert space. Depending on the degeneracy of the kernel of the $U$-statistic, we construct consistent shrinkage estimators with fast rates of converge… ▽ More

    Submitted 21 July, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: 33 pages; Under Review

    MSC Class: 62G05(Primary); 62F10; 62J07(Secondary)

  11. arXiv:2206.03975  [pdf, other

    math.ST

    Functional linear and single-index models: A unified approach via Gaussian Stein identity

    Authors: Krishnakumar Balasubramanian, Hans-Georg Müller, Bharath K. Sriperumbudur

    Abstract: Functional linear and single-index models are core regression methods in functional data analysis and are widely used for performing regression in a wide range of applications when the covariates are random functions coupled with scalar responses. In the existing literature, however, the construction of associated estimators and the study of their theoretical properties is invariably carried out o… ▽ More

    Submitted 26 March, 2024; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: To appear in Bernoulli Journal

  12. arXiv:2206.01795  [pdf, other

    math.ST cs.CG cs.LG math.AT stat.ML

    Robust Topological Inference in the Presence of Outliers

    Authors: Siddharth Vishwanath, Bharath K. Sriperumbudur, Kenji Fukumizu, Satoshi Kuriki

    Abstract: The distance function to a compact set plays a crucial role in the paradigm of topological data analysis. In particular, the sublevel sets of the distance function are used in the computation of persistent homology -- a backbone of the topological data analysis pipeline. Despite its stability to perturbations in the Hausdorff distance, persistent homology is highly sensitive to outliers. In this w… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: 50 pages, 10 figures

    MSC Class: 62R40; 55N31; 68T09

  13. arXiv:2105.08875  [pdf, ps, other

    stat.ML cs.LG math.ST

    Statistical Optimality and Computational Efficiency of Nyström Kernel PCA

    Authors: Nicholas Sterge, Bharath Sriperumbudur

    Abstract: Kernel methods provide an elegant framework for develo** nonlinear learning algorithms from simple linear methods. Though these methods have superior empirical performance in several real data applications, their usefulness is inhibited by the significant computational burden incurred in large sample situations. Various approximation schemes have been proposed in the literature to alleviate thes… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: 26 pages

    MSC Class: Primary: 65R15; Secondary: 62H25; 46E22; 65F55

  14. arXiv:2010.08071  [pdf, other

    math.ST

    Shrinkage Estimation for the Diagonal Multivariate Exponential Families

    Authors: Nikolas Siapoutis, Donald Richards, Bharath K. Sriperumbudur

    Abstract: We study shrinkage estimation of the mean parameters of a class of multivariate distributions for which the diagonal entries of the corresponding covariance matrix are certain quadratic functions of the mean parameter. This class of distributions includes the diagonal multivariate natural exponential families. We propose two classes of semi-parametric shrinkage estimators for the mean and construc… ▽ More

    Submitted 1 July, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 36 pages, 2 figures

    MSC Class: 62F12; 62H05 (Primary) 62J07; 62G05 (Secondary)

  15. arXiv:2006.10012  [pdf, other

    math.ST cs.CG cs.LG math.AT stat.ML

    Robust Persistence Diagrams using Reproducing Kernels

    Authors: Siddharth Vishwanath, Kenji Fukumizu, Satoshi Kuriki, Bharath Sriperumbudur

    Abstract: Persistent homology has become an important tool for extracting geometric and topological features from data, whose multi-scale features are summarized in a persistence diagram. From a statistical perspective, however, persistence diagrams are very sensitive to perturbations in the input space. In this work, we develop a framework for constructing robust persistence diagrams from superlevel filtra… ▽ More

    Submitted 3 June, 2022; v1 submitted 17 June, 2020; originally announced June 2020.

    MSC Class: 55N31; 62R40; 62G07; 46E22

  16. arXiv:2001.00220  [pdf, other

    math.PR math.AT math.ST

    On the Limits of Topological Data Analysis for Statistical Inference

    Authors: Siddharth Vishwanath, Kenji Fukumizu, Satoshi Kuriki, Bharath Sriperumbudur

    Abstract: Topological data analysis has emerged as a powerful tool for extracting the metric, geometric and topological features underlying the data as a multi-resolution summary statistic, and has found applications in several areas where data arises from complex sources. In this paper, we examine the use of topological summary statistics through the lens of statistical inference. We investigate necessary… ▽ More

    Submitted 15 February, 2024; v1 submitted 1 January, 2020; originally announced January 2020.

    Comments: 36 pages, 9 figures

    MSC Class: 62F30; 55N31; 62R40

  17. arXiv:1912.01103  [pdf, ps, other

    math.ST stat.ML

    On Distance and Kernel Measures of Conditional Independence

    Authors: Tianhong Sheng, Bharath K. Sriperumbudur

    Abstract: Measuring conditional independence is one of the important tasks in statistical inference and is fundamental in causal discovery, feature selection, dimensionality reduction, Bayesian network learning, and others. In this work, we explore the connection between conditional independence measures induced by distances on a metric space and reproducing kernels associated with a reproducing kernel Hilb… ▽ More

    Submitted 17 August, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

  18. arXiv:1908.05818  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Gaussian Sketching yields a J-L Lemma in RKHS

    Authors: Samory Kpotufe, Bharath K. Sriperumbudur

    Abstract: The main contribution of the paper is to show that Gaussian sketching of a kernel-Gram matrix $\boldsymbol K$ yields an operator whose counterpart in an RKHS $\mathcal H$, is a \emph{random projection} operator---in the spirit of Johnson-Lindenstrauss (J-L) lemma. To be precise, given a random matrix $Z$ with i.i.d. Gaussian entries, we show that a sketch $Z\boldsymbol{K}$ corresponds to a particu… ▽ More

    Submitted 11 March, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

    Comments: 16 pages

  19. arXiv:1907.05226  [pdf, other

    stat.ML cs.LG math.ST

    Gain with no Pain: Efficient Kernel-PCA by Nyström Sampling

    Authors: Nicholas Sterge, Bharath Sriperumbudur, Lorenzo Rosasco, Alessandro Rudi

    Abstract: In this paper, we propose and study a Nyström based approach to efficient large scale kernel principal component analysis (PCA). The latter is a natural nonlinear extension of classical PCA based on considering a nonlinear feature map or the corresponding kernel. Like other kernel approaches, kernel PCA enjoys good mathematical and statistical properties but, numerically, it scales poorly with the… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: 19 pages, 2 figures

    MSC Class: 62H25; 62H12; 46E22

  20. arXiv:1902.07284  [pdf, other

    math.ST

    Optimal Function-on-Scalar Regression over Complex Domains

    Authors: Matthew Reimherr, Bharath Sriperumbudur, Hyun Bin Kang

    Abstract: In this work we consider the problem of estimating function-on-scalar regression models when the functions are observed over multi-dimensional or manifold domains and with potentially multivariate output. We establish the minimax rates of convergence and present an estimator based on reproducing kernel Hilbert spaces that achieves the minimax rate. To better interpret the derived rates, we extend… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

  21. arXiv:1902.01219  [pdf, ps, other

    math.ST cs.IT cs.LG stat.ML

    Local minimax rates for closeness testing of discrete distributions

    Authors: Joseph Lam-Weil, Alexandra Carpentier, Bharath K. Sriperumbudur

    Abstract: We consider the closeness testing problem for discrete distributions. The goal is to distinguish whether two samples are drawn from the same unspecified distribution, or whether their respective distributions are separated in $L_1$-norm. In this paper, we focus on adapting the rate to the shape of the underlying distributions, i.e. we consider \textit{a local minimax setting}. We provide, to the b… ▽ More

    Submitted 19 January, 2021; v1 submitted 1 February, 2019; originally announced February 2019.

    MSC Class: 62F03; 62G10; 62F35 ACM Class: G.3; I.2.6

  22. arXiv:1810.05207  [pdf, ps, other

    stat.ML cs.LG math.PR

    On Kernel Derivative Approximation with Random Fourier Features

    Authors: Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Random Fourier features (RFF) represent one of the most popular and wide-spread techniques in machine learning to scale up kernel algorithms. Despite the numerous successful applications of RFFs, unfortunately, quite little is understood theoretically on their optimality and limitations of their performance. Only recently, precise statistical-computational trade-offs have been established for RFFs… ▽ More

    Submitted 9 February, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: AISTATS-2019

    MSC Class: 60E10; 42Bxx; 46E22 ACM Class: G.3; I.2.6

  23. arXiv:1803.11451  [pdf, ps, other

    math.ST cs.IT stat.ML

    Minimax Estimation of Quadratic Fourier Functionals

    Authors: Shashank Singh, Bharath K. Sriperumbudur, Barnabás Póczos

    Abstract: We study estimation of (semi-)inner products between two nonparametric probability distributions, given IID samples from each distribution. These products include relatively well-studied classical $\mathcal{L}^2$ and Sobolev inner products, as well as those induced by translation-invariant reproducing kernels, for which we believe our results are the first. We first propose estimators for these qu… ▽ More

    Submitted 1 September, 2018; v1 submitted 30 March, 2018; originally announced March 2018.

  24. arXiv:1709.00147  [pdf, other

    math.NA stat.ML

    Convergence Analysis of Deterministic Kernel-Based Quadrature Rules in Misspecified Settings

    Authors: Motonobu Kanagawa, Bharath K. Sriperumbudur, Kenji Fukumizu

    Abstract: This paper presents a convergence analysis of kernel-based quadrature rules in misspecified settings, focusing on deterministic quadrature in Sobolev spaces. In particular, we deal with misspecified settings where a test integrand is less smooth than a Sobolev RKHS based on which a quadrature rule is constructed. We provide convergence guarantees based on two different assumptions on a quadrature… ▽ More

    Submitted 30 October, 2018; v1 submitted 1 September, 2017; originally announced September 2017.

    Comments: 36 pages

    MSC Class: 65D30 (Primary); 65D32; 65D05; 46E35; 46E22 (Secondary)

  25. arXiv:1708.03372  [pdf, other

    math.ST

    Optimal Prediction for Additive Function-on-Function Regression

    Authors: Matthew Reimherr, Bharath Sriperumbudur, Bahaeddine Taoufik

    Abstract: As with classic statistics, functional regression models are invaluable in the analysis of functional data. While there are now extensive tools with accompanying theory available for linear models, there is still a great deal of work to be done concerning nonlinear models for functional data. In this work we consider the Additive Function-on-Function Regression model, a type of nonlinear model tha… ▽ More

    Submitted 22 June, 2018; v1 submitted 10 August, 2017; originally announced August 2017.

  26. arXiv:1706.06296  [pdf, ps, other

    stat.ML math.ST

    Approximate Kernel PCA Using Random Features: Computational vs. Statistical Trade-off

    Authors: Bharath Sriperumbudur, Nicholas Sterge

    Abstract: Kernel methods are powerful learning methodologies that allow to perform non-linear data analysis. Despite their popularity, they suffer from poor scalability in big data scenarios. Various approximation methods, including random feature approximation, have been proposed to alleviate the problem. However, the statistical consistency of most of these approximate kernel methods is not well understoo… ▽ More

    Submitted 11 June, 2022; v1 submitted 20 June, 2017; originally announced June 2017.

    Comments: 65 pages

    MSC Class: 62H25; 62G05

  27. arXiv:1602.04361  [pdf, ps, other

    math.ST

    Minimax Estimation of Kernel Mean Embeddings

    Authors: Ilya Tolstikhin, Bharath Sriperumbudur, Krikamol Muandet

    Abstract: In this paper, we study the minimax estimation of the Bochner integral $$μ_k(P):=\int_{\mathcal{X}} k(\cdot,x)\,dP(x),$$ also called as the kernel mean embedding, based on random samples drawn i.i.d.~from $P$, where $k:\mathcal{X}\times\mathcal{X}\rightarrow\mathbb{R}$ is a positive definite kernel. Various estimators (including the empirical estimator), $\hatθ_n$ of $μ_k(P)$ are studied in the li… ▽ More

    Submitted 31 July, 2017; v1 submitted 13 February, 2016; originally announced February 2016.

    MSC Class: 62G05; 62G07

  28. arXiv:1506.02155  [pdf, ps, other

    math.ST cs.LG math.FA stat.ML

    Optimal Rates for Random Fourier Features

    Authors: Bharath K. Sriperumbudur, Zoltan Szabo

    Abstract: Kernel methods represent one of the most powerful tools in machine learning to tackle problems expressed in terms of function values and derivatives due to their capability to represent and model complex relations. While these methods show good versatility, they are computationally intensive and have poor scalability to large data as they require operations on Gram matrices. In order to mitigate t… ▽ More

    Submitted 4 November, 2015; v1 submitted 6 June, 2015; originally announced June 2015.

    Comments: To appear at NIPS-2015

    MSC Class: 60E10; 62Gxx; 62Exx; 62H12; 42Bxx; 46E22 ACM Class: G.3; I.2.6; F.2

  29. arXiv:1411.2066  [pdf, ps, other

    math.ST cs.LG math.FA stat.ML

    Learning Theory for Distribution Regression

    Authors: Zoltan Szabo, Bharath Sriperumbudur, Barnabas Poczos, Arthur Gretton

    Abstract: We focus on the distribution regression problem: regressing to vector-valued outputs from probability measures. Many important machine learning and statistical tasks fit into this framework, including multi-instance learning and point estimation problems without analytical solution (such as hyperparameter or entropy estimation). Despite the large number of available heuristics in the literature, t… ▽ More

    Submitted 21 October, 2016; v1 submitted 7 November, 2014; originally announced November 2014.

    Comments: Final version appeared at JMLR, with supplement. Code: https://bitbucket.org/szzoli/ite/. arXiv admin note: text overlap with arXiv:1402.1754

    MSC Class: 62G08; 46E22; 47B32 ACM Class: G.3; I.2.6

    Journal ref: Journal of Machine Learning Research, 17(152):1-40, 2016

  30. arXiv:1411.0900  [pdf, ps, other

    stat.ML math.ST

    Kernel Mean Estimation via Spectral Filtering

    Authors: Krikamol Muandet, Bharath Sriperumbudur, Bernhard Schölkopf

    Abstract: The problem of estimating the kernel mean in a reproducing kernel Hilbert space (RKHS) is central to kernel methods in that it is used by classical approaches (e.g., when centering a kernel PCA matrix), and it also forms the core inference step of modern kernel methods (e.g., kernel-based non-parametric tests) that rely on embedding probability distributions in RKHSs. Muandet et al. (2014) has sho… ▽ More

    Submitted 4 November, 2014; originally announced November 2014.

    Comments: To appear at the 28th Annual Conference on Neural Information Processing Systems (NIPS 2014). 16 pages

  31. arXiv:1402.1754  [pdf, ps, other

    math.ST cs.LG math.FA stat.ML

    Two-stage Sampled Learning Theory on Distributions

    Authors: Zoltan Szabo, Arthur Gretton, Barnabas Poczos, Bharath Sriperumbudur

    Abstract: We focus on the distribution regression problem: regressing to a real-valued response from a probability distribution. Although there exist a large number of similarity measures between distributions, very little is known about their generalization performance in specific learning tasks. Learning problems formulated on distributions have an inherent two-stage sampled difficulty: in practice only s… ▽ More

    Submitted 26 January, 2015; v1 submitted 7 February, 2014; originally announced February 2014.

    Comments: v6: accepted at AISTATS-2015 for oral presentation; final version; code: https://bitbucket.org/szzoli/ite/; extension to the misspecified and vector-valued case: http://arxiv.longhoe.net/abs/1411.2066

    MSC Class: 62G08; 46E22; 47B32 ACM Class: G.3; I.2.6

  32. arXiv:1312.3516  [pdf, ps, other

    math.ST stat.ME stat.ML

    Density Estimation in Infinite Dimensional Exponential Families

    Authors: Bharath Sriperumbudur, Kenji Fukumizu, Arthur Gretton, Aapo Hyvärinen, Revant Kumar

    Abstract: In this paper, we consider an infinite dimensional exponential family, $\mathcal{P}$ of probability densities, which are parametrized by functions in a reproducing kernel Hilbert space, $H$ and show it to be quite rich in the sense that a broad class of densities on $\mathbb{R}^d$ can be approximated arbitrarily well in Kullback-Leibler (KL) divergence by elements in $\mathcal{P}$. The main goal o… ▽ More

    Submitted 26 May, 2017; v1 submitted 12 December, 2013; originally announced December 2013.

    Comments: 58 pages, 8 figures; Fixed some errors and typos

  33. arXiv:1310.8240  [pdf, ps, other

    math.ST math.PR

    On the optimal estimation of probability measures in weak and strong topologies

    Authors: Bharath Sriperumbudur

    Abstract: Given random samples drawn i.i.d. from a probability measure $\mathbb{P}$ (defined on say, $\mathbb{R}^d$), it is well-known that the empirical estimator is an optimal estimator of $\mathbb{P}$ in weak topology but not even a consistent estimator of its density (if it exists) in the strong topology (induced by the total variation distance). On the other hand, various popular density estimators suc… ▽ More

    Submitted 30 March, 2016; v1 submitted 30 October, 2013; originally announced October 2013.

    Comments: Published at http://dx.doi.org/10.3150/15-BEJ713 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ713

    Journal ref: Bernoulli 2016, Vol. 22, No. 3, 1839-1893

  34. arXiv:1306.0842  [pdf, ps, other

    stat.ML cs.LG math.ST

    Kernel Mean Estimation and Stein's Effect

    Authors: Krikamol Muandet, Kenji Fukumizu, Bharath Sriperumbudur, Arthur Gretton, Bernhard Schölkopf

    Abstract: A mean function in reproducing kernel Hilbert space, or a kernel mean, is an important part of many applications ranging from kernel principal component analysis to Hilbert-space embedding of distributions. Given finite samples, an empirical average is the standard estimate for the true kernel mean. We show that this estimator can be improved via a well-known phenomenon in statistics called Stein'… ▽ More

    Submitted 6 June, 2013; v1 submitted 4 June, 2013; originally announced June 2013.

    Comments: first draft

  35. arXiv:1207.6076  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    Equivalence of distance-based and RKHS-based statistics in hypothesis testing

    Authors: Dino Sejdinovic, Bharath Sriperumbudur, Arthur Gretton, Kenji Fukumizu

    Abstract: We provide a unifying framework linking two classes of statistics used in two-sample and independence testing: on the one hand, the energy distances and distance covariances from the statistics literature; on the other, maximum mean discrepancies (MMD), that is, distances between embeddings of distributions to reproducing kernel Hilbert spaces (RKHS), as established in machine learning. In the cas… ▽ More

    Submitted 12 November, 2013; v1 submitted 25 July, 2012; originally announced July 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1140 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1140

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 5, 2263-2291

  36. arXiv:1003.0887  [pdf, ps, other

    stat.ML math.ST

    Universality, Characteristic Kernels and RKHS Embedding of Measures

    Authors: Bharath K. Sriperumbudur, Kenji Fukumizu, Gert R. G. Lanckriet

    Abstract: A Hilbert space embedding for probability measures has recently been proposed, wherein any probability measure is represented as a mean element in a reproducing kernel Hilbert space (RKHS). Such an embedding has found applications in homogeneity testing, independence testing, dimensionality reduction, etc., with the requirement that the reproducing kernel is characteristic, i.e., the embedding i… ▽ More

    Submitted 3 March, 2010; originally announced March 2010.

    Comments: 30 pages, 1 figure

  37. arXiv:0907.5309  [pdf, ps, other

    stat.ML math.ST

    Hilbert space embeddings and metrics on probability measures

    Authors: Bharath K. Sriperumbudur, Arthur Gretton, Kenji Fukumizu, Bernhard Schölkopf, Gert R. G. Lanckriet

    Abstract: A Hilbert space embedding for probability measures has recently been proposed, with applications including dimensionality reduction, homogeneity testing, and independence testing. This embedding represents any probability measure as a mean element in a reproducing kernel Hilbert space (RKHS). A pseudometric on the space of probability measures can be defined as the distance between distribution… ▽ More

    Submitted 29 January, 2010; v1 submitted 30 July, 2009; originally announced July 2009.

    Comments: 48 pages