Skip to main content

Showing 1–17 of 17 results for author: Kanamori, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.12450  [pdf, other

    stat.ML cs.LG

    A Convex Framework for Confounding Robust Inference

    Authors: Kei Ishikawa, Niao He, Takafumi Kanamori

    Abstract: We study policy evaluation of offline contextual bandits subject to unobserved confounders. Sensitivity analysis methods are commonly used to estimate the policy value under the worst-case confounding over a given uncertainty set. However, existing work often resorts to some coarse relaxation of the uncertainty set for the sake of tractability, leading to overly conservative estimation of the poli… ▽ More

    Submitted 1 November, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This is an extended version of the following work https://proceedings.mlr.press/v206/ishikawa23a.html. arXiv admin note: text overlap with arXiv:2302.13348

  2. arXiv:2304.09552  [pdf, other

    stat.ML cs.LG

    Denoising Cosine Similarity: A Theory-Driven Approach for Efficient Representation Learning

    Authors: Takumi Nakagawa, Yutaro Sanada, Hiroki Waida, Yuhui Zhang, Yuichiro Wada, Kōsaku Takanashi, Tomonori Yamada, Takafumi Kanamori

    Abstract: Representation learning has been increasing its impact on the research and practice of machine learning, since it enables to learn representations that can apply to various downstream tasks efficiently. However, recent works pay little attention to the fact that real-world datasets used during the stage of representation learning are commonly contaminated by noise, which can degrade the quality of… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  3. arXiv:2304.00395  [pdf, other

    cs.LG stat.ML

    Towards Understanding the Mechanism of Contrastive Learning via Similarity Structure: A Theoretical Analysis

    Authors: Hiroki Waida, Yuichiro Wada, Léo Andéol, Takumi Nakagawa, Yuhui Zhang, Takafumi Kanamori

    Abstract: Contrastive learning is an efficient approach to self-supervised representation learning. Although recent studies have made progress in the theoretical understanding of contrastive learning, the investigation of how to characterize the clusters of the learned representations is still limited. In this paper, we aim to elucidate the characterization from theoretical perspectives. To this end, we con… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

  4. arXiv:2303.03036  [pdf, other

    stat.ML cs.LG

    Deep Clustering with a Constraint for Topological Invariance based on Symmetric InfoNCE

    Authors: Yuhui Zhang, Yuichiro Wada, Hiroki Waida, Kaito Goto, Yusaku Hino, Takafumi Kanamori

    Abstract: We consider the scenario of deep clustering, in which the available prior knowledge is limited. In this scenario, few existing state-of-the-art deep clustering methods can perform well for both non-complex topology and complex topology datasets. To address the problem, we propose a constraint utilizing symmetric InfoNCE, which helps an objective of deep clustering method in the scenario train the… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 48 pages, 6 figures

  5. Learning Domain Invariant Representations by Joint Wasserstein Distance Minimization

    Authors: Léo Andeol, Yusei Kawakami, Yuichiro Wada, Takafumi Kanamori, Klaus-Robert Müller, Grégoire Montavon

    Abstract: Domain shifts in the training data are common in practical applications of machine learning; they occur for instance when the data is coming from different sources. Ideally, a ML model should work well independently of these shifts, for example, by learning a domain-invariant representation. However, common ML losses do not give strong guarantees on how consistently the ML model performs for diffe… ▽ More

    Submitted 21 August, 2023; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 23 pages + supplement

  6. arXiv:2011.11246  [pdf, other

    cs.AR

    RVCoreP-32IC: A high-performance RISC-V soft processor with an efficient fetch unit supporting the compressed instructions

    Authors: Takuto Kanamori, Hiromu Miyazaki, Kenji Kise

    Abstract: In this paper, we propose a high-performance RISC-V soft processor with an efficient fetch unit supporting the compressed instructions targeting on FPGA. The compressed instruction extension in RISC-V can reduce the program size by about 25%. But it needs a complicated logic for the instruction fetch unit and has a significant impact on performance. We propose an instruction fetch unit that suppor… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: 8 pages, 10 figures

  7. RVCoreP : An optimized RISC-V soft processor of five-stage pipelining

    Authors: Hiromu Miyazaki, Takuto Kanamori, Md Ashraful Islam, Kenji Kise

    Abstract: RISC-V is a RISC based open and loyalty free instruction set architecture which has been developed since 2010, and can be used for cost-effective soft processors on FPGAs. The basic 32-bit integer instruction set in RISC-V is defined as RV32I, which is sufficient to support the operating system environment and suits for embedded systems. In this paper, we propose an optimized RV32I soft processor… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 9 pages, 9 figures, this paper is submitted to the Institute of Electronics, Information and Communication Engineers (IEICE)

    ACM Class: C.1.1

  8. arXiv:1910.08280  [pdf, other

    stat.ML cs.LG

    Robust modal regression with direct log-density derivative estimation

    Authors: Hiroaki Sasaki, Tomoya Sakai, Takafumi Kanamori

    Abstract: Modal regression is aimed at estimating the global mode (i.e., global maximum) of the conditional density function of the output variable given input variables, and has led to regression methods robust against heavy-tailed or skewed noises. The conditional mode is often estimated through maximization of the modal regression risk (MRR). In order to apply a gradient method for the maximization, the… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

  9. arXiv:1910.03834  [pdf, other

    stat.ML cs.LG math.ST

    Estimating Density Models with Truncation Boundaries using Score Matching

    Authors: Song Liu, Takafumi Kanamori, Daniel J. Williams

    Abstract: Truncated densities are probability density functions defined on truncated domains. They share the same parametric form with their non-truncated counterparts up to a normalizing constant. Since the computation of their normalizing constants is usually infeasible, Maximum Likelihood Estimation cannot be easily applied to estimate truncated density models. Score Matching (SM) is a powerful tool for… ▽ More

    Submitted 20 April, 2022; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: to be published in the Journal of Machine Learning Research

  10. arXiv:1901.07710  [pdf, other

    stat.ML cs.LG

    Unified estimation framework for unnormalized models with statistical efficiency

    Authors: Masatoshi Uehara, Takafumi Kanamori, Takashi Takenouchi, Takeru Matsuda

    Abstract: The parameter estimation of unnormalized models is a challenging problem. The maximum likelihood estimation (MLE) is computationally infeasible for these models since normalizing constants are not explicitly calculated. Although some consistent estimators have been proposed earlier, the problem of statistical efficiency remains. In this study, we propose a unified, statistically efficient estimati… ▽ More

    Submitted 5 June, 2020; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: To appear at AISTATS 2020

  11. arXiv:1806.00569  [pdf, other

    stat.ML cs.LG

    Variable Selection for Nonparametric Learning with Power Series Kernels

    Authors: Kota Matsui, Wataru Kumagai, Kenta Kanamori, Mitsuaki Nishikimi, Takafumi Kanamori

    Abstract: In this paper, we propose a variable selection method for general nonparametric kernel-based estimation. The proposed method consists of two-stage estimation: (1) construct a consistent estimator of the target function, (2) approximate the estimator using a few variables by l1-type penalized estimation. We see that the proposed method can be applied to various kernel nonparametric estimation such… ▽ More

    Submitted 4 December, 2018; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: 24 pages, 3 tables, 2 figures

  12. arXiv:1805.07454  [pdf, other

    stat.ML cs.LG

    Fisher Efficient Inference of Intractable Models

    Authors: Song Liu, Takafumi Kanamori, Wittawat Jitkrittum, Yu Chen

    Abstract: Maximum Likelihood Estimators (MLE) has many good properties. For example, the asymptotic variance of MLE solution attains equality of the asymptotic Cram{é}r-Rao lower bound (efficiency bound), which is the minimum possible variance for an unbiased estimator. However, obtaining such MLE solution requires calculating the likelihood function which may not be tractable due to the normalization term… ▽ More

    Submitted 1 November, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: Fixed typos in the text. To appear in Neural Information Process 2019

  13. arXiv:1409.3912  [pdf, other

    stat.ML cs.LG

    Parallel Distributed Block Coordinate Descent Methods based on Pairwise Comparison Oracle

    Authors: Kota Matsui, Wataru Kumagai, Takafumi Kanamori

    Abstract: This paper provides a block coordinate descent algorithm to solve unconstrained optimization problems. In our algorithm, computation of function values or gradients is not required. Instead, pairwise comparison of function values is used. Our algorithm consists of two steps; one is the direction estimate step and the other is the search step. Both steps require only pairwise comparison of function… ▽ More

    Submitted 13 September, 2014; originally announced September 2014.

  14. arXiv:1409.0934  [pdf, other

    stat.ML cs.LG

    Breakdown Point of Robust Support Vector Machine

    Authors: Takafumi Kanamori, Shuhei Fujiwara, Akiko Takeda

    Abstract: The support vector machine (SVM) is one of the most successful learning methods for solving classification problems. Despite its popularity, SVM has a serious drawback, that is sensitivity to outliers in training samples. The penalty on misclassification is defined by a convex loss called the hinge loss, and the unboundedness of the convex loss causes the sensitivity to outliers. To deal wit… ▽ More

    Submitted 2 September, 2014; originally announced September 2014.

    Comments: 27 pages

  15. arXiv:1207.0099  [pdf, ps, other

    cs.LG stat.ML

    Density-Difference Estimation

    Authors: Masashi Sugiyama, Takafumi Kanamori, Taiji Suzuki, Marthinus Christoffel du Plessis, Song Liu, Ichiro Takeuchi

    Abstract: We address the problem of estimating the difference between two probability densities. A naive approach is a two-step procedure of first estimating two densities separately and then computing their difference. However, such a two-step procedure does not necessarily work well because the first step is performed without regard to the second step and thus a small error incurred in the first stage can… ▽ More

    Submitted 30 June, 2012; originally announced July 2012.

  16. arXiv:1206.4599  [pdf

    cs.LG stat.ML

    A Unified Robust Classification Model

    Authors: Akiko Takeda, Hiroyuki Mitsugi, Takafumi Kanamori

    Abstract: A wide variety of machine learning algorithms such as support vector machine (SVM), minimax probability machine (MPM), and Fisher discriminant analysis (FDA), exist for binary classification. The purpose of this paper is to provide a unified classification model that includes the above models through a robust optimization approach. This unified model has several benefits. One is that the extension… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  17. arXiv:1204.6583  [pdf, ps, other

    stat.ML cs.LG

    A Conjugate Property between Loss Functions and Uncertainty Sets in Classification Problems

    Authors: Takafumi Kanamori, Akiko Takeda, Taiji Suzuki

    Abstract: In binary classification problems, mainly two approaches have been proposed; one is loss function approach and the other is uncertainty set approach. The loss function approach is applied to major learning algorithms such as support vector machine (SVM) and boosting methods. The loss function represents the penalty of the decision function on the training samples. In the learning algorithm, the em… ▽ More

    Submitted 30 April, 2012; originally announced April 2012.

    Comments: 41 pages, 4 figures. The shorter version is accepted by COLT2012