Showing 1–25 of 25 results for author: Reimherr, M

  1. arXiv:2402.14966  [pdf, other]

    stat.ML cs.LG stat.ME

    Smoothness Adaptive Hypothesis Transfer Learning

    Authors: Haotian Lin, Matthew Reimherr

    Abstract: Many existing two-phase kernel-based hypothesis transfer learning algorithms employ the same kernel regularization across phases and rely on the known smoothness of functions to obtain optimality. Therefore, they fail to adapt to the varying and unknown smoothness between the target/source and their offset in practice. In this paper, we address these problems by proposing Smoothness Adaptive Trans…

    Submitted 22 February, 2024; originally announced February 2024.

  2. arXiv:2309.02416  [pdf, other]

    stat.AP

    Differentially Private Synthetic Heavy-tailed Data

    Authors: Tran Tran, Matthew Reimherr, Aleksandra Slavković

    Abstract: The U.S. Census Longitudinal Business Database (LBD) product contains employment and payroll information of all U.S. establishments and firms dating back to 1976 and is an invaluable resource for economic research. However, the sensitive information in LBD requires confidentiality measures that the U.S. Census in part addressed by releasing a synthetic version (SynLBD) of the data to protect firms…

    Submitted 14 October, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 26 pages, LaTeX; corrected typos, added references, and clarified unclear wording

  3. arXiv:2309.00125  [pdf, other]

    stat.ML cs.CR cs.LG

    Pure Differential Privacy for Functional Summaries via a Laplace-like Process

    Authors: Haotian Lin, Matthew Reimherr

    Abstract: Many existing mechanisms to achieve differential privacy (DP) on infinite-dimensional functional summaries often involve embedding these summaries into finite-dimensional subspaces and applying traditional DP techniques. Such mechanisms generally treat each dimension uniformly and struggle with complex, structured summaries. This work introduces a novel mechanism for DP functional summary release:…

    Submitted 3 March, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

  4. arXiv:2303.14801  [pdf, other]

    stat.ME stat.CO stat.ML

    FAStEN: an efficient adaptive method for feature selection and estimation in high-dimensional functional regressions

    Authors: Tobia Boschi, Lorenzo Testa, Francesca Chiaromonte, Matthew Reimherr

    Abstract: Functional regression analysis is an established tool for many contemporary scientific applications. Regression problems involving large and complex data sets are ubiquitous, and feature selection is crucial for avoiding overfitting and achieving accurate predictions. We propose a new, flexible and ultra-efficient approach to perform feature selection in a sparse high dimensional function-on-funct…

    Submitted 4 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  5. arXiv:2209.12667  [pdf, other]

    stat.ML cs.LG math.DG math.ST

    Shape And Structure Preserving Differential Privacy

    Authors: Carlos Soto, Karthik Bharath, Matthew Reimherr, Aleksandra Slavkovic

    Abstract: It is common for data structures such as images and shapes of 2D objects to be represented as points on a manifold. The utility of a mechanism to produce sanitized differentially private estimates from such data is intimately linked to how compatible it is with the underlying structure and geometry of the space. In particular, as recently shown, utility of the Laplace mechanism on a positively cur…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 15 pages (including supplementary material and references), 3 figures (including supplementary material), to be published in NeurIPS 2022

  6. arXiv:2206.04277  [pdf, other]

    stat.ML cs.LG

    On Hypothesis Transfer Learning of Functional Linear Models

    Authors: Haotian Lin, Matthew Reimherr

    Abstract: We study transfer learning (TL) for functional linear regression (FLR) under the Reproducing Kernel Hilbert Space (RKHS) framework, observing that the TL techniques in existing high-dimensional linear regression are not compatible with the truncation-based FLR methods, as functional data are intrinsically infinite-dimensional and generated by smooth underlying processes. We measure the similarity…

    Submitted 22 February, 2024; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: The results are extended to functional GLM

  7. arXiv:2204.01132  [pdf, other]

    cs.CR stat.CO

    Exact Privacy Guarantees for Markov Chain Implementations of the Exponential Mechanism with Artificial Atoms

    Authors: Jeremy Seeman, Matthew Reimherr, Aleksandra Slavkovic

    Abstract: Implementations of the exponential mechanism in differential privacy often require sampling from intractable distributions. When approximate procedures like Markov chain Monte Carlo (MCMC) are used, the end result incurs costs to both privacy and accuracy. Existing work has examined these effects asymptotically, but implementable finite sample results are needed in practice so that users can speci…

    Submitted 3 April, 2022; originally announced April 2022.

    Comments: 16 pages, 3 figures

    Journal ref: Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  8. arXiv:2204.01102  [pdf, other]

    cs.CR stat.ME

    Formal Privacy for Partially Private Data

    Authors: Jeremy Seeman, Matthew Reimherr, Aleksandra Slavkovic

    Abstract: Differential privacy (DP) quantifies privacy loss by analyzing noise injected into output statistics. For non-trivial statistics, this noise is necessary to ensure finite privacy loss. However, data curators frequently release collections of statistics where some use DP mechanisms and others are released as-is, i.e., without additional randomized noise. Consequently, DP alone cannot characterize t…

    Submitted 14 December, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: 34 pages, 4 figures; submitted to JMLR

  9. arXiv:2111.02516  [pdf, other]

    math.ST math.DG stat.ML

    Differential Privacy Over Riemannian Manifolds

    Authors: Matthew Reimherr, Karthik Bharath, Carlos Soto

    Abstract: In this work we consider the problem of releasing a differentially private statistical summary that resides on a Riemannian manifold. We present an extension of the Laplace or K-norm mechanism that utilizes intrinsic distances and volumes on the manifold. We also consider in detail the specific case where the summary is the Fréchet mean of data residing on a manifold. We demonstrate that our mecha…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 15 pages (including supplementary material and references), 2 figures (including supplementary material), published in NeurIPS

  10. arXiv:2107.14151  [pdf, other]

    stat.ME cs.AI cs.LG stat.ML

    Modern Non-Linear Function-on-Function Regression

    Authors: Aniruddha Rajendra Rao, Matthew Reimherr

    Abstract: We introduce a new class of non-linear function-on-function regression models for functional data using neural networks. We propose a framework using a hidden layer consisting of continuous neurons, called a continuous hidden layer, for functional response modeling and give two model fitting strategies, Functional Direct Neural Network (FDNN) and Functional Basis Neural Network (FBNN). Both are de…

    Submitted 7 October, 2023; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: 6 figures, 6 tables (including supplementary material), 16 pages (including supplementary material). arXiv admin note: text overlap with arXiv:2104.09371

    Journal ref: Statistics and Computing 2023

  11. arXiv:2104.09371  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    Non-linear Functional Modeling using Neural Networks

    Authors: Aniruddha Rajendra Rao, Matthew Reimherr

    Abstract: We introduce a new class of non-linear models for functional data based on neural networks. Deep learning has been very successful in non-linear modeling, but there has been little work done in the functional data setting. We propose two variations of our framework: a functional neural network with continuous hidden layers, called the Functional Direct Neural Network (FDNN), and a second version t…

    Submitted 3 May, 2023; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: 3 figures, 10 tables (including supplementary material), 14 pages (including supplementary material)

    Journal ref: Journal of Computational and Graphical Statistics, 2023

  12. arXiv:2011.12509  [pdf, other]

    stat.ME cs.LG stat.ML

    Modern Multiple Imputation with Functional Data

    Authors: Aniruddha Rajendra Rao, Matthew Reimherr

    Abstract: This work considers the problem of fitting functional models with sparsely and irregularly sampled functional data. It overcomes the limitations of the state-of-the-art methods, which face major challenges in the fitting of more complex non-linear models. Currently, many of these models cannot be consistently estimated unless the number of observed points per curve grows sufficiently quickly with…

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: 7 figures (including supplementary material), 8 tables (including supplementary material), 14 pages (including supplementary material)

    Journal ref: Stat, 2021

  13. arXiv:2006.03970  [pdf, other]

    stat.ML cs.LG stat.CO

    An Efficient Semi-smooth Newton Augmented Lagrangian Method for Elastic Net

    Authors: Tobia Boschi, Matthew Reimherr, Francesca Chiaromonte

    Abstract: Feature selection is an important and active research area in statistics and machine learning. The Elastic Net is often used to perform selection when the features present non-negligible collinearity or practitioners wish to incorporate additional known structure. In this article, we propose a new Semi-smooth Newton Augmented Lagrangian Method to efficiently solve the Elastic Net in ultra-high dim…

    Submitted 6 June, 2020; originally announced June 2020.

    MSC Class: 62J07 ACM Class: G.3

  14. arXiv:1910.00131  [pdf, other]

    stat.ME math.ST

    Fast and Fair Simultaneous Confidence Bands for Functional Parameters

    Authors: Dominik Liebl, Matthew Reimherr

    Abstract: Quantifying uncertainty using confidence regions is a central goal of statistical inference. Despite this, methodologies for confidence bands in Functional Data Analysis are still underdeveloped compared to estimation and hypothesis testing. In this work, we present a new methodology for constructing simultaneous confidence bands for functional parameter estimates. Our bands possess a number of po…

    Submitted 11 November, 2022; v1 submitted 30 September, 2019; originally announced October 2019.

  15. arXiv:1905.09881  [pdf, other]

    stat.ME math.ST

    Adaptive Function-on-Scalar Regression with a Smoothing Elastic Net

    Authors: Ardalan Mirshani, Matthew Reimherr

    Abstract: This paper presents a new methodology, called AFSSEN, to simultaneously select significant predictors and produce smooth estimates in a high-dimensional function-on-scalar linear model with sub-Gaussian errors. Outcomes are assumed to lie in a general real separable Hilbert space, H, while parameters lie in a subspace known as a Cameron Martin space, K, which is closely related to Reproducing K…

    Submitted 23 May, 2019; originally announced May 2019.

  16. arXiv:1905.09436  [pdf, other]

    cs.CR stat.ML

    KNG: The K-Norm Gradient Mechanism

    Authors: Matthew Reimherr, Jordan Awan

    Abstract: This paper presents a new mechanism for producing sanitized statistical summaries that achieve \emph{differential privacy}, called the \emph{K-Norm Gradient} Mechanism, or KNG. This new approach maintains the strong flexibility of the exponential mechanism, while achieving the powerful utility performance of objective perturbation. KNG starts with an inherent objective function (often an empirical…

    Submitted 2 August, 2021; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: 14 pages, 2 figures, published in NeurIPS 33

  17. arXiv:1901.10864  [pdf, other]

    cs.CR cs.LG stat.ML

    Benefits and Pitfalls of the Exponential Mechanism with Applications to Hilbert Spaces and Functional PCA

    Authors: Jordan Awan, Ana Kenney, Matthew Reimherr, Aleksandra Slavković

    Abstract: The exponential mechanism is a fundamental tool of Differential Privacy (DP) due to its strong privacy guarantees and flexibility. We study its extension to settings with summaries based on infinite dimensional outputs such as with functional data analysis, shape analysis, and nonparametric statistics. We show that one can design the mechanism with respect to a specific base measure over the outpu…

    Submitted 30 January, 2019; originally announced January 2019.

    Comments: 13 pages, 5 images, 2 tables

    MSC Class: 46E22; 46S50; 60G15; 62H25

  18. Highly Irregular Functional Generalized Linear Regression with Electronic Health Records

    Authors: Justin Petrovich, Matthew Reimherr, Carrie Daymont

    Abstract: This work presents a new approach, called MISFIT, for fitting generalized functional linear regression models with sparsely and irregularly sampled data. Current methods do not allow for consistent estimation unless one assumes that the number of observed points per curve grows sufficiently quickly with the sample size. In contrast, MISFIT is based on a multiple imputation framework, which has the…

    Submitted 4 October, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: 5 figures, 17 tables (including supplementary material), 34 pages (including supplementary material)

    Journal ref: J.R.Stat.Soc.Series.C (2022) 1-28

  19. arXiv:1710.01619  [pdf, other]

    stat.ME

    Manifold Data Analysis with Applications to High-Frequency 3D Imaging

    Authors: Hyun Bin Kang, Matthew Reimherr, Mark Shriver, Peter Claes

    Abstract: Many scientific areas are faced with the challenge of extracting information from large, complex, and highly structured data sets. A great deal of modern statistical work focuses on developing tools for handling such data. This paper presents a new subfield of functional data analysis, FDA, which we call Manifold Data Analysis, or MDA. MDA is concerned with the statistical analysis of samples wher…

    Submitted 4 October, 2017; originally announced October 2017.

  20. arXiv:1607.07771  [pdf, ps, other]

    stat.ME math.ST

    A Geometric Approach to Confidence Regions and Bands for Functional Parameters

    Authors: Hyunphil Choi, Matthew Reimherr

    Abstract: Functional data analysis, FDA, is now a well established discipline of statistics, with its core concepts and perspectives in place. Despite this, there are still fundamental statistical questions which have received relatively little attention. One of these is the systematic construction of confidence regions for functional parameters. This work is concerned with developing, understanding, and vi…

    Submitted 10 August, 2016; v1 submitted 26 July, 2016; originally announced July 2016.

  21. arXiv:1510.02594  [pdf, other]

    stat.ME

    A randomness test for functional panels

    Authors: Piotr Kokoszka, Matthew Reimherr, Nikolas Wölfing

    Abstract: Functional panels are collections of functional time series, and arise often in the study of high frequency multivariate data. We develop a portmanteau style test to determine if the cross-sections of such a panel are independent and identically distributed. Our framework allows the number of functional projections and/or the number of time series to grow with the sample size. A large sample justi…

    Submitted 10 July, 2016; v1 submitted 9 October, 2015; originally announced October 2015.

    Comments: Supplemental material from the authors' homepage or upon request

  22. arXiv:1509.07017  [pdf, other]

    stat.ME

    Testing separability of space--time functional processes

    Authors: Panayiotis Constantinou, Piotr Kokoszka, Matthew Reimherr

    Abstract: We present a new methodology and accompanying theory to test for separability of spatio-temporal functional data. In spatio-temporal statistics, separability is a common simplifying assumption concerning the covariance structure which, if true, can greatly increase estimation accuracy and inferential power. While our focus is on testing for the separation of space and time in spatio-temporal data,…

    Submitted 23 September, 2015; originally announced September 2015.

  23. arXiv:1406.5958  [pdf, other]

    stat.ME

    Prior sample size extensions for assessing prior impact and prior--likelihood discordance

    Authors: Matthew Reimherr, Xiao-Li Meng, Dan L. Nicolae

    Abstract: This paper outlines a framework for quantifying the prior's contribution to posterior inference in the presence of prior-likelihood discordance, a broader concept than the usual notion of prior-likelihood conflict. We achieve this dual purpose by extending the classic notion of \textit{prior sample size}, $M$, in three directions: (I) estimating $M$ beyond conjugate families; (II) formulating $M$…

    Submitted 7 January, 2021; v1 submitted 23 June, 2014; originally announced June 2014.

    MSC Class: 62F15

  24. A functional data analysis approach for genetic association studies

    Authors: Matthew Reimherr, Dan Nicolae

    Abstract: We present a new method based on Functional Data Analysis (FDA) for detecting associations between one or more scalar covariates and a longitudinal response, while correcting for other variables. Our methods exploit the temporal structure of longitudinal data in ways that are otherwise difficult with a multivariate approach. Our procedure, from an FDA perspective, is a departure from more establis…

    Submitted 29 April, 2014; originally announced April 2014.

    Comments: Published at http://dx.doi.org/10.1214/13-AOAS692 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS692

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 1, 406-429

  25. On Quantifying Dependence: A Framework for Developing Interpretable Measures

    Authors: Matthew Reimherr, Dan L. Nicolae

    Abstract: We present a framework for selecting and developing measures of dependence when the goal is the quantification of a relationship between two variables, not simply the establishment of its existence. Much of the literature on dependence measures is focused, at least implicitly, on detection or revolves around the inclusion/exclusion of particular axioms and discussing which measures satisfy said ax…

    Submitted 21 February, 2013; originally announced February 2013.

    Comments: Published at http://dx.doi.org/10.1214/12-STS405 in Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS405

    Journal ref: Statistical Science 2013, Vol. 28, No. 1, 116-130