Skip to main content

Showing 1–15 of 15 results for author: Yamaguchi, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.17423  [pdf, other

    cs.CV stat.ML

    Test-time Adaptation Meets Image Enhancement: Improving Accuracy via Uncertainty-aware Logit Switching

    Authors: Shohei Enomoto, Naoya Hasegawa, Kazuki Adachi, Taku Sasaki, Shin'ya Yamaguchi, Satoshi Suzuki, Takeharu Eda

    Abstract: Deep neural networks have achieved remarkable success in a variety of computer vision applications. However, there is a problem of degrading accuracy when the data distribution shifts between training and testing. As a solution of this problem, Test-time Adaptation~(TTA) has been well studied because of its practicality. Although TTA methods increase accuracy under distribution shift by updating t… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to IJCNN2024

  2. arXiv:2309.16143  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples

    Authors: Shin'ya Yamaguchi

    Abstract: Semi-supervised learning (SSL) is a promising approach for training deep classification models using labeled and unlabeled datasets. However, existing SSL methods rely on a large unlabeled dataset, which may not always be available in many real-world applications due to legal constraints (e.g., GDPR). In this paper, we investigate the research question: Can we train SSL models without real unlabel… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML2023); a preprint of the camera-ready version

  3. arXiv:2306.10656  [pdf, other

    cs.LG cs.AI stat.ML

    Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics

    Authors: Kenta Oono, Nontawat Charoenphakdee, Kotatsu Bito, Zhengyan Gao, Yoshiaki Ota, Shoichiro Yamaguchi, Yohei Sugawara, Shin-ichi Maeda, Kunihiko Miyoshi, Yuki Saito, Koki Tsuda, Hiroshi Maruyama, Kohei Hayashi

    Abstract: Identifying the relationship between healthcare attributes, lifestyles, and personality is vital for understanding and improving physical and mental conditions. Machine learning approaches are promising for modeling their relationships and offering actionable suggestions. In this paper, we propose Virtual Human Generative Model (VHGM), a machine learning model for estimating attributes about healt… ▽ More

    Submitted 14 August, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 14 pages, 4 figures

  4. arXiv:2207.10283  [pdf, other

    cs.LG cs.AI stat.ML

    One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training

    Authors: Sekitoshi Kanai, Shin'ya Yamaguchi, Masanori Yamada, Hiroshi Takahashi, Kentaro Ohno, Yasutoshi Ida

    Abstract: This paper proposes a new loss function for adversarial training. Since adversarial training has difficulties, e.g., necessity of high model capacity, focusing on important data points by weighting cross-entropy loss has attracted much attention. However, they are vulnerable to sophisticated attacks, e.g., Auto-Attack. This paper experimentally reveals that the cause of their vulnerability is thei… ▽ More

    Submitted 26 April, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ICML2023, 26 pages, 19 figures

  5. arXiv:2205.15619  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Atsutoshi Kumagai, Yasutoshi Ida

    Abstract: Few-shot learning for neural networks (NNs) is an important problem that aims to train NNs with a few data. The main challenge is how to avoid overfitting since over-parameterized NNs can easily overfit to such small dataset. Previous work (e.g. MAML by Finn et al. 2017) tackles this challenge by meta-learning, which learns how to learn from a few data by using various tasks. On the other hand, on… ▽ More

    Submitted 9 February, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  6. arXiv:2204.12833  [pdf, other

    cs.LG cs.AI stat.ML

    Transfer Learning with Pre-trained Conditional Generative Models

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Atsutoshi Kumagai, Daiki Chijiwa, Hisashi Kashima

    Abstract: Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and (iii) target network architectures are consistent with source ones. However, holding these assumptions is difficult in practical settings because the target task ra… ▽ More

    Submitted 29 September, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: 24 pages, 6 figures

  7. arXiv:2106.09269  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Pruning Randomly Initialized Neural Networks with Iterative Randomization

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Yasutoshi Ida, Kenji Umakoshi, Tomohiro Inoue

    Abstract: Pruning the weights of randomly initialized neural networks plays an important role in the context of lottery ticket hypothesis. Ramanujan et al. (2020) empirically showed that only pruning the weights can achieve remarkable performance instead of optimizing the weight values. However, to achieve the same level of performance as the weight optimization, the pruning approach requires more parameter… ▽ More

    Submitted 5 April, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021); Selected for a spotlight presentation

  8. arXiv:2010.02558  [pdf, other

    stat.ML cs.AI cs.LG

    Constraining Logits by Bounded Function for Adversarial Robustness

    Authors: Sekitoshi Kanai, Masanori Yamada, Shin'ya Yamaguchi, Hiroshi Takahashi, Yasutoshi Ida

    Abstract: We propose a method for improving adversarial robustness by addition of a new bounded function just before softmax. Recent studies hypothesize that small logits (inputs of softmax) by logit regularization can improve adversarial robustness of deep learning. Following this hypothesis, we analyze norms of logit vectors at the optimal point under the assumption of universal approximation and explore… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 19 pages, 16 figures

  9. arXiv:2008.01883  [pdf, other

    stat.ML cs.LG

    When is invariance useful in an Out-of-Distribution Generalization problem ?

    Authors: Masanori Koyama, Shoichiro Yamaguchi

    Abstract: The goal of Out-of-Distribution (OOD) generalization problem is to train a predictor that generalizes on all environments. Popular approaches in this field use the hypothesis that such a predictor shall be an \textit{invariant predictor} that captures the mechanism that remains constant across environments. While these approaches have been experimentally successful in various case studies, there i… ▽ More

    Submitted 25 November, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

  10. arXiv:1912.11603  [pdf, other

    stat.ML cs.CV cs.LG

    Image Enhanced Rotation Prediction for Self-Supervised Learning

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Tetsuya Shioda, Shoichiro Takeda

    Abstract: The rotation prediction (Rotation) is a simple pretext-task for self-supervised learning (SSL), where models learn useful representations for target vision tasks by solving pretext-tasks. Although Rotation captures information of object shapes, it hardly captures information of textures. To tackle this problem, we introduce a novel pretext-task called image enhanced rotation prediction (IE-Rot) fo… ▽ More

    Submitted 4 June, 2021; v1 submitted 25 December, 2019; originally announced December 2019.

    Comments: Accepted to IEEE ICIP 2021. The title has been changed from "Multiple Pretext-Task for Self-Supervised Learning via Mixing Multiple Image Transformations"

  11. arXiv:1912.11597  [pdf, other

    stat.ML cs.CV cs.LG

    Effective Data Augmentation with Multi-Domain Learning GANs

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Takeharu Eda

    Abstract: For deep learning applications, the massive data development (e.g., collecting, labeling), which is an essential process in building practical applications, still incurs seriously high costs. In this work, we propose an effective data augmentation method based on generative adversarial networks (GANs), called Domain Fusion. Our key idea is to import the knowledge contained in an outer dataset to a… ▽ More

    Submitted 25 December, 2019; originally announced December 2019.

    Comments: AAAI-2020

  12. arXiv:1911.08444  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    MANGA: Method Agnostic Neural-policy Generalization and Adaptation

    Authors: Homanga Bharadhwaj, Shoichiro Yamaguchi, Shin-ichi Maeda

    Abstract: In this paper we target the problem of transferring policies across multiple environments with different dynamics parameters and motor noise variations, by introducing a framework that decouples the processes of policy learning and system identification. Efficiently transferring learned policies to an unknown environment with changes in dynamics configurations in the presence of motor noise is ver… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: Under Review. Video available at https://drive.google.com/file/d/12GsDq3iQDXEutE-xpzXxqrEfD6dYhKjs/view?usp=sharing Other details will be made available in the author's webpage www.homangabharadhwaj.com

  13. arXiv:1906.08412  [pdf, other

    cs.LG stat.ML

    Data Interpolating Prediction: Alternative Interpretation of Mixup

    Authors: Takuya Shimada, Shoichiro Yamaguchi, Kohei Hayashi, Sosuke Kobayashi

    Abstract: Data augmentation by mixing samples, such as Mixup, has widely been used typically for classification tasks. However, this strategy is not always effective due to the gap between augmented samples for training and original samples for testing. This gap may prevent a classifier from learning the optimal decision boundary and increase the generalization error. To overcome this problem, we propose an… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

    Comments: Presented at the 2nd Learning from Limited Labeled Data (LLD) Workshop at ICLR 2019

  14. arXiv:1906.04868  [pdf, other

    cs.LG stat.ML

    Semi-flat minima and saddle points by embedding neural networks to overparameterization

    Authors: Kenji Fukumizu, Shoichiro Yamaguchi, Yoh-ichi Mototake, Mirai Tanaka

    Abstract: We theoretically study the landscape of the training error for neural networks in overparameterized cases. We consider three basic methods for embedding a network into a wider one with more hidden units, and discuss whether a minimum point of the narrower network gives a minimum or saddle point of the wider one. Our results show that the networks with smooth and ReLU activation have different part… ▽ More

    Submitted 14 June, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 38 pages, 4 figures

  15. arXiv:1902.02992  [pdf, other

    stat.ML cs.LG

    A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning

    Authors: Yoshihiro Nagano, Shoichiro Yamaguchi, Yasuhiro Fujita, Masanori Koyama

    Abstract: Hyperbolic space is a geometry that is known to be well-suited for representation learning of data with an underlying hierarchical structure. In this paper, we present a novel hyperbolic distribution called \textit{pseudo-hyperbolic Gaussian}, a Gaussian-like distribution on hyperbolic space whose density can be evaluated analytically and differentiated with respect to the parameters. Our distribu… ▽ More

    Submitted 9 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: 20 pages, 12 figures