Skip to main content

Showing 1–23 of 23 results for author: Fujisawa, H

.
  1. arXiv:2308.15838  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Adaptive Lasso, Transfer Lasso, and Beyond: An Asymptotic Perspective

    Authors: Masaaki Takada, Hironori Fujisawa

    Abstract: This paper presents a comprehensive exploration of the theoretical properties inherent in the Adaptive Lasso and the Transfer Lasso. The Adaptive Lasso, a well-established method, employs regularization divided by initial estimators and is characterized by asymptotic normality and variable selection consistency. In contrast, the recently proposed Transfer Lasso employs regularization subtracted by… ▽ More

    Submitted 17 April, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

  2. arXiv:2208.11592  [pdf, ps, other

    math.ST stat.ML

    Outlier Robust and Sparse Estimation of Linear Regression Coefficients

    Authors: Takeyuki Sasai, Hironori Fujisawa

    Abstract: We consider outlier-robust and sparse estimation of linear regression coefficients, when the covariates and the noises are contaminated by adversarial outliers and noises are sampled from a heavy-tailed distribution. Our results present sharper error bounds under weaker assumptions than prior studies that share similar interests with this study. Our analysis relies on some sharp concentration ineq… ▽ More

    Submitted 24 May, 2024; v1 submitted 24 August, 2022; originally announced August 2022.

    MSC Class: 62J07; 62F35

  3. arXiv:2106.13946  [pdf, other

    stat.ME

    Outlier-Resistant Estimators for Average Treatment Effect in Causal Inference

    Authors: Kazuharu Harada, Hironori Fujisawa

    Abstract: The inverse probability (IPW) and doubly robust (DR) estimators are often used to estimate the average causal effect (ATE), but are vulnerable to outliers. The IPW/DR median can be used for outlier-resistant estimation of the ATE, but the outlier resistance of the median is limited and it is not resistant enough for heavy contamination. We propose extensions of the IPW/DR estimators with density p… ▽ More

    Submitted 15 April, 2022; v1 submitted 26 June, 2021; originally announced June 2021.

  4. arXiv:2102.11120  [pdf, ps, other

    math.ST stat.ML

    Adversarial robust weighted Huber regression

    Authors: Takeyuki Sasai, Hironori Fujisawa

    Abstract: We consider a robust estimation of linear regression coefficients. In this note, we focus on the case where the covariates are sampled from an $L$-subGaussian distribution with unknown covariance, the noises are sampled from a distribution with a bounded absolute moment and both covariates and noises may be contaminated by an adversary. We derive an estimation error bound, which depends on the sta… ▽ More

    Submitted 24 May, 2024; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: The case of sparse coefficients is investigated in arXiv:2208.11592. This manuscript will not be submitted for publications

    MSC Class: 62G35; 62G05

  5. arXiv:2010.13018  [pdf, ps, other

    stat.ML cs.LG math.ST

    Adversarial Robust Low Rank Matrix Estimation: Compressed Sensing and Matrix Completion

    Authors: Takeyuki Sasai, Hironori Fujisawa

    Abstract: We consider robust low rank matrix estimation as a trace regression when outputs are contaminated by adversaries. The adversaries are allowed to add arbitrary values to arbitrary outputs. Such values can depend on any samples. We deal with matrix compressed sensing, including lasso as a partial problem, and matrix completion, and then we obtain sharp estimation error bounds. To obtain the error bo… ▽ More

    Submitted 24 May, 2024; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: The lasso part of this manuscript with contaminated input as well as output is investigated in arXiv:2208.11592. This manuscript will not be submitted for publications

    MSC Class: 62G35; 62G05

  6. arXiv:2009.03077  [pdf, other

    stat.ML cs.LG

    Estimation of Structural Causal Model via Sparsely Mixing Independent Component Analysis

    Authors: Kazuharu Harada, Hironori Fujisawa

    Abstract: We consider the problem of inferring the causal structure from observational data, especially when the structure is sparse. This type of problem is usually formulated as an inference of a directed acyclic graph (DAG) model. The linear non-Gaussian acyclic model (LiNGAM) is one of the most successful DAG models, and various estimation methods have been developed. However, existing methods are not e… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 9 pages, 6 figures

  7. arXiv:2006.14845  [pdf, other

    stat.ML cs.LG

    Transfer Learning via $\ell_1$ Regularization

    Authors: Masaaki Takada, Hironori Fujisawa

    Abstract: Machine learning algorithms typically require abundant data under a stationary environment. However, environments are nonstationary in many real-world applications. Critical issues lie in how to effectively adapt models under an ever-changing environment. We propose a method for transferring knowledge from a source domain to a target domain via $\ell_1$ regularization. We incorporate $\ell_1$ regu… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

  8. arXiv:2004.05990  [pdf, ps, other

    math.ST cs.LG stat.ML

    Robust estimation with Lasso when outputs are adversarially contaminated

    Authors: Takeyuki Sasai, Hironori Fujisawa

    Abstract: We consider robust estimation when outputs are adversarially contaminated. Nguyen and Tran (2012) proposed an extended Lasso for robust parameter estimation and then they showed the convergence rate of the estimation error. Recently, Dalalyan and Thompson (2019) gave some useful inequalities and then they showed a faster convergence rate than Nguyen and Tran (2012). They focused on the fact that t… ▽ More

    Submitted 24 May, 2024; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: The case of contaminated inputs as well as outputs is investigated in arXiv:2208.11592. This manuscript will not be submitted for publications

  9. arXiv:1811.00255  [pdf, other

    stat.ML cs.LG

    HMLasso: Lasso with High Missing Rate

    Authors: Masaaki Takada, Hironori Fujisawa, Takeichiro Nishikawa

    Abstract: Sparse regression such as the Lasso has achieved great success in handling high-dimensional data. However, one of the biggest practical problems is that high-dimensional data often contain large amounts of missing values. Convex Conditioned Lasso (CoCoLasso) has been proposed for dealing with high-dimensional data with missing values, but it performs poorly when there are many missing values, so t… ▽ More

    Submitted 19 June, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

  10. arXiv:1805.07960   

    stat.ML cs.LG math.OC

    Stochastic Gradient Descent for Stochastic Doubly-Nonconvex Composite Optimization

    Authors: Takayuki Kawashima, Hironori Fujisawa

    Abstract: The stochastic gradient descent has been widely used for solving composite optimization problems in big data analyses. Many algorithms and convergence properties have been developed. The composite functions were convex primarily and gradually nonconvex composite functions have been adopted to obtain more desirable properties. The convergence properties have been investigated, but only when either… ▽ More

    Submitted 1 March, 2020; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: There is a mistake in the proof of Proposition 3.2. related to the Euclidean projection with stochastic gradients

  11. arXiv:1805.06144  [pdf, ps, other

    math.ST

    On Difference Between Two Types of $γ$-divergence for Regression

    Authors: Takayuki Kawashima, Hironori Fujisawa

    Abstract: The $γ$-divergence is well-known for having strong robustness against heavy contamination. By virtue of this property, many applications via the $γ$-divergence have been proposed. There are two types of \gd\ for regression problem, in which the treatments of base measure are different. In this paper, we compare them and pointed out a distinct difference between these two divergences under heteroge… ▽ More

    Submitted 16 May, 2018; originally announced May 2018.

  12. arXiv:1802.05475  [pdf, ps, other

    stat.ME

    Robust and sparse Gaussian graphical modeling under cell-wise contamination

    Authors: Shota Katayama, Hironori Fujisawa, Mathias Drton

    Abstract: Graphical modeling explores dependences among a collection of variables by inferring a graph that encodes pairwise conditional independences. For jointly Gaussian variables, this translates into detecting the support of the precision matrix. Many modern applications feature high-dimensional and contaminated data that complicate this task. In particular, traditional robust methods that down-weight… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

  13. arXiv:1802.03127  [pdf, ps, other

    stat.ML stat.ME

    Robust and Sparse Regression in GLM by Stochastic Optimization

    Authors: Takayuki Kawashima, Hironori Fujisawa

    Abstract: The generalized linear model (GLM) plays a key role in regression analyses. In high-dimensional data, the sparse GLM has been used but it is not robust against outliers. Recently, the robust methods have been proposed for the specific example of the sparse GLM. Among them, we focus on the robust and sparse linear regression based on the $γ$-divergence. The estimator of the $γ$-divergence has stron… ▽ More

    Submitted 8 February, 2018; originally announced February 2018.

    Comments: 28 pages

  14. arXiv:1711.01796  [pdf, ps, other

    stat.ML

    Independently Interpretable Lasso: A New Regularizer for Sparse Regression with Uncorrelated Variables

    Authors: Masaaki Takada, Taiji Suzuki, Hironori Fujisawa

    Abstract: Sparse regularization such as $\ell_1$ regularization is a quite powerful and widely used strategy for high dimensional learning problems. The effectiveness of sparse regularization has been supported practically and theoretically by several studies. However, one of the biggest issues in sparse regularization is that its performance is quite sensitive to correlations between features. Ordinary… ▽ More

    Submitted 22 February, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

  15. Sparse principal component regression for generalized linear models

    Authors: Shuichi Kawano, Hironori Fujisawa, Toyoyuki Takada, Toshihiko Shiroishi

    Abstract: Principal component regression (PCR) is a widely used two-stage procedure: principal component analysis (PCA), followed by regression in which the selected principal components are regarded as new explanatory variables in the model. Note that PCA is based only on the explanatory variables, so the principal components are not selected using the information on the response variable. In this paper, w… ▽ More

    Submitted 12 October, 2016; v1 submitted 28 September, 2016; originally announced September 2016.

    Comments: 29 pages

    Journal ref: Computational Statistics & Data Analysis 124 (2018) 180-196

  16. arXiv:1604.06637  [pdf, ps, other

    stat.ME stat.ML

    Robust and Sparse Regression via $γ$-divergence

    Authors: Takayuki Kawashima, Hironori Fujisawa

    Abstract: In high-dimensional data, many sparse regression methods have been proposed. However, they may not be robust against outliers. Recently, the use of density power weight has been studied for robust parameter estimation and the corresponding divergences have been discussed. One of such divergences is the $γ$-divergence and the robust estimator using the $γ$-divergence is known for having a strong ro… ▽ More

    Submitted 29 August, 2016; v1 submitted 22 April, 2016; originally announced April 2016.

    Comments: 25 pages

  17. arXiv:1508.05571  [pdf, other

    stat.ME

    Robust sparse Gaussian graphical modeling

    Authors: Kei Hirose, Hironori Fujisawa, Jun Sese

    Abstract: Gaussian graphical modeling has been widely used to explore various network structures, such as gene regulatory networks and social networks. We often use a penalized maximum likelihood approach with the $L_1$ penalty for learning a high-dimensional graphical model. However, the penalized maximum likelihood procedure is sensitive to outliers. To overcome this problem, we introduce a robust estimat… ▽ More

    Submitted 12 June, 2017; v1 submitted 23 August, 2015; originally announced August 2015.

    Comments: 27 pages

  18. arXiv:1505.05257  [pdf, other

    math.ST stat.ME

    Sparse and Robust Linear Regression: An Optimization Algorithm and Its Statistical Properties

    Authors: Shota Katayama, Hironori Fujisawa

    Abstract: This paper studies sparse linear regression analysis with outliers in the responses. A parameter vector for modeling outliers is added to the standard linear regression model and then the sparse estimation problem for both coefficients and outliers is considered. The $\ell_{1}$ penalty is imposed for the coefficients, while various penalties including redescending type penalties are for the outlie… ▽ More

    Submitted 20 May, 2015; originally announced May 2015.

    Comments: 23 pages, 2 figures

    MSC Class: 62J05 (Primary); 62F35 (Secondary)

  19. arXiv:1412.1411  [pdf, other

    math.ST math.PR

    On the Weak Convergence and Central Limit Theorem of Blurring and Nonblurring Processes with Application to Robust Location Estimation

    Authors: Ting-Li Chen, Hironori Fujisawa, Su-Yun Huang, Chii-Ruey Hwang

    Abstract: This article studies the weak convergence and associated Central Limit Theorem for blurring and nonblurring processes. Then, they are applied to the estimation of location parameter. Simulation studies show that the location estimation based on the convergence point of blurring process is more robust and often more efficient than that of nonblurring process.

    Submitted 27 January, 2015; v1 submitted 3 December, 2014; originally announced December 2014.

  20. Sparse principal component regression with adaptive loading

    Authors: Shuichi Kawano, Hironori Fujisawa, Toyoyuki Takada, Toshihiko Shiroishi

    Abstract: Principal component regression (PCR) is a two-stage procedure that selects some principal components and then constructs a regression model regarding them as new explanatory variables. Note that the principal components are obtained from only explanatory variables and not considered with the response variable. To address this problem, we propose the sparse principal component regression (SPCR) tha… ▽ More

    Submitted 31 October, 2014; v1 submitted 26 February, 2014; originally announced February 2014.

    Comments: 24 pages

    MSC Class: 62H25; 62J07

    Journal ref: Computational Statistics & Data Analysis 89 (2015) 192-203

  21. arXiv:1311.5301  [pdf, other

    math.ST

    Robust Estimation under Heavy Contamination using Enlarged Models

    Authors: Takafumi Kanamori, Hironori Fujisawa

    Abstract: In data analysis, contamination caused by outliers is inevitable, and robust statistical methods are strongly demanded. In this paper, our concern is to develop a new approach for robust data analysis based on scoring rules. The scoring rule is a discrepancy measure to assess the quality of probabilistic forecasts. We propose a simple way of estimating not only the parameter in the statistical mod… ▽ More

    Submitted 20 November, 2013; originally announced November 2013.

    Comments: 32 pages, 3 figures, 3 tables

  22. arXiv:1305.2473  [pdf, ps, other

    math.ST stat.ML

    Affine Invariant Divergences associated with Composite Scores and its Applications

    Authors: Takafumi Kanamori, Hironori Fujisawa

    Abstract: In statistical analysis, measuring a score of predictive performance is an important task. In many scientific fields, appropriate scores were tailored to tackle the problems at hand. A proper score is a popular tool to obtain statistically consistent forecasts. Furthermore, a mathematical characterization of the proper score was studied. As a result, it was revealed that the proper score correspon… ▽ More

    Submitted 11 May, 2013; originally announced May 2013.

    Comments: 24 pages

  23. arXiv:1012.4921  [pdf, ps, other

    stat.ME

    Approximate tail probabilities of the maximum of a chi-square field on multi-dimensional lattice points and their applications to detection of loci interactions

    Authors: Satoshi Kuriki, Yoshiaki Harushima, Hironori Fujisawa, Nori Kurata

    Abstract: Define a chi-square random field on a multi-dimensional lattice points index set with a direct-product covariance structure, and consider the distribution of the maximum of this random field. We provide two approximate formulas for the upper tail probability of the distribution based on nonlinear renewal theory and an integral-geometric approach called the volume-of-tube method. This study is moti… ▽ More

    Submitted 30 March, 2013; v1 submitted 22 December, 2010; originally announced December 2010.

    Comments: 33 pages, 5 figures, 2 tables