Skip to main content

Showing 1–14 of 14 results for author: Ida, Y

.
  1. arXiv:2406.00315  [pdf, other

    cs.RO cs.CV

    Precision and Adaptability of YOLOv5 and YOLOv8 in Dynamic Robotic Environments

    Authors: Victor A. Kich, Muhammad A. Muttaqien, Junya Toyama, Ryutaro Miyoshi, Yosuke Ida, Akihisa Ohya, Hisashi Date

    Abstract: Recent advancements in real-time object detection frameworks have spurred extensive research into their application in robotic systems. This study provides a comparative analysis of YOLOv5 and YOLOv8 models, challenging the prevailing assumption of the latter's superiority in performance metrics. Contrary to initial expectations, YOLOv5 models demonstrated comparable, and in some cases superior, p… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 11th IEEE International Conference on Cybernetics and Intelligent Systems (CIS)

  2. arXiv:2303.07597  [pdf, other

    cs.LG stat.ML

    Fast Regularized Discrete Optimal Transport with Group-Sparse Regularizers

    Authors: Yasutoshi Ida, Sekitoshi Kanai, Kazuki Adachi, Atsutoshi Kumagai, Yasuhiro Fujiwara

    Abstract: Regularized discrete optimal transport (OT) is a powerful tool to measure the distance between two discrete distributions that have been constructed from data samples on two different domains. While it has a wide range of applications in machine learning, in some cases the sampled data from only one of the domains will have class labels such as unsupervised domain adaptation. In this kind of probl… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: This is an extended version of the paper accepted by the 37th AAAI Conference on Artificial Intelligence (AAAI 2023)

  3. arXiv:2210.01348  [pdf, other

    cs.LG cs.NE

    Fast Saturating Gate for Learning Long Time Scales with Recurrent Neural Networks

    Authors: Kentaro Ohno, Sekitoshi Kanai, Yasutoshi Ida

    Abstract: Gate functions in recurrent models, such as an LSTM and GRU, play a central role in learning various time scales in modeling time series data by using a bounded activation function. However, it is difficult to train gates to capture extremely long time scales due to gradient vanishing of the bounded function for large inputs, which is known as the saturation problem. We closely analyze the relatio… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: 9 pages of main texts with 4 pages appendices, 12 figures

  4. arXiv:2207.10283  [pdf, other

    cs.LG cs.AI stat.ML

    One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training

    Authors: Sekitoshi Kanai, Shin'ya Yamaguchi, Masanori Yamada, Hiroshi Takahashi, Kentaro Ohno, Yasutoshi Ida

    Abstract: This paper proposes a new loss function for adversarial training. Since adversarial training has difficulties, e.g., necessity of high model capacity, focusing on important data points by weighting cross-entropy loss has attracted much attention. However, they are vulnerable to sophisticated attacks, e.g., Auto-Attack. This paper experimentally reveals that the cause of their vulnerability is thei… ▽ More

    Submitted 26 April, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ICML2023, 26 pages, 19 figures

  5. arXiv:2205.15619  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Atsutoshi Kumagai, Yasutoshi Ida

    Abstract: Few-shot learning for neural networks (NNs) is an important problem that aims to train NNs with a few data. The main challenge is how to avoid overfitting since over-parameterized NNs can easily overfit to such small dataset. Previous work (e.g. MAML by Finn et al. 2017) tackles this challenge by meta-learning, which learns how to learn from a few data by using various tasks. On the other hand, on… ▽ More

    Submitted 9 February, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  6. arXiv:2106.09269  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Pruning Randomly Initialized Neural Networks with Iterative Randomization

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Yasutoshi Ida, Kenji Umakoshi, Tomohiro Inoue

    Abstract: Pruning the weights of randomly initialized neural networks plays an important role in the context of lottery ticket hypothesis. Ramanujan et al. (2020) empirically showed that only pruning the weights can achieve remarkable performance instead of optimizing the weight values. However, to achieve the same level of performance as the weight optimization, the pruning approach requires more parameter… ▽ More

    Submitted 5 April, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021); Selected for a spotlight presentation

  7. arXiv:2103.01400  [pdf, other

    cs.LG cs.AI stat.ML

    Smoothness Analysis of Adversarial Training

    Authors: Sekitoshi Kanai, Masanori Yamada, Hiroshi Takahashi, Yuki Yamanaka, Yasutoshi Ida

    Abstract: Deep neural networks are vulnerable to adversarial attacks. Recent studies about adversarial robustness focus on the loss landscape in the parameter space since it is related to optimization and generalization performance. These studies conclude that the difficulty of adversarial training is caused by the non-smoothness of the loss function: i.e., its gradient is not Lipschitz continuous. However,… ▽ More

    Submitted 15 June, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: 22 pages, 7 figures. In V3, we add the results of EntropySGD for adversarial training

  8. arXiv:2012.09606  [pdf, ps, other

    q-fin.GN math.PR

    The Thermodynamic Approach to Whole-Life Insurance: A Method for Evaluation of Surrender Risk

    Authors: JirĂ´ Akahori, Yuuki Ida, Maho Nishida, Shuji Tamada

    Abstract: We introduce a collective model for life insurance where the heterogeneity of each insured, including the health state, is modeled by a diffusion process. This model is influenced by concepts in statistical mechanics. Using the proposed framework, one can describe the total pay-off as a functional of the diffusion process, which can be used to derive a level premium that evaluates the risk of laps… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

  9. arXiv:2010.02558  [pdf, other

    stat.ML cs.AI cs.LG

    Constraining Logits by Bounded Function for Adversarial Robustness

    Authors: Sekitoshi Kanai, Masanori Yamada, Shin'ya Yamaguchi, Hiroshi Takahashi, Yasutoshi Ida

    Abstract: We propose a method for improving adversarial robustness by addition of a new bounded function just before softmax. Recent studies hypothesize that small logits (inputs of softmax) by logit regularization can improve adversarial robustness of deep learning. Following this hypothesis, we analyze norms of logit vectors at the optimal point under the assumption of universal approximation and explore… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 19 pages, 16 figures

  10. arXiv:1909.08830  [pdf, other

    stat.ML cs.CV cs.LG

    Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks

    Authors: Sekitoshi Kanai, Yasutoshi Ida, Yasuhiro Fujiwara, Masanori Yamada, Shuichi Adachi

    Abstract: We propose Absum, which is a regularization method for improving adversarial robustness of convolutional neural networks (CNNs). Although CNNs can accurately recognize images, recent studies have shown that the convolution operations in CNNs commonly have structural sensitivity to specific noise composed of Fourier basis functions. By exploiting this sensitivity, they proposed a simple black-box a… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: 16 pages, 39 figures

  11. arXiv:1906.03826  [pdf, other

    cs.LG cs.CV stat.ML

    Network Implosion: Effective Model Compression for ResNets via Static Layer Pruning and Retraining

    Authors: Yasutoshi Ida, Yasuhiro Fujiwara

    Abstract: Residual Networks with convolutional layers are widely used in the field of machine learning. Since they effectively extract features from input data by stacking multiple layers, they can achieve high accuracy in many applications. However, the stacking of many layers raises their computation costs. To address this problem, we propose Network Implosion, it erases multiple layers from Residual Netw… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Preprint of International Joint Conference on Neural Networks (IJCNN) 2019

  12. arXiv:1705.00864  [pdf, ps, other

    q-fin.CP math.PR

    Towards the Exact Simulation Using Hyperbolic Brownian Motion

    Authors: Yuuki Ida, Yuri Imamura

    Abstract: In the present paper, an expansion of the transition density of Hyperbolic Brownian motion with drift is given, which is potentially useful for pricing and hedging of options under stochastic volatility models. We work on a condition on the drift which dramatically simplifies the proof.

    Submitted 2 May, 2017; originally announced May 2017.

  13. arXiv:1605.09593  [pdf, other

    cs.LG cs.AI stat.ML

    Adaptive Learning Rate via Covariance Matrix Based Preconditioning for Deep Neural Networks

    Authors: Yasutoshi Ida, Yasuhiro Fujiwara, Sotetsu Iwamura

    Abstract: Adaptive learning rate algorithms such as RMSProp are widely used for training deep neural networks. RMSProp offers efficient training since it uses first order gradients to approximate Hessian-based preconditioning. However, since the first order gradients include noise caused by stochastic optimization, the approximation may be inaccurate. In this paper, we propose a novel adaptive learning rate… ▽ More

    Submitted 28 September, 2017; v1 submitted 31 May, 2016; originally announced May 2016.

    Comments: Accepted at IJCAI 2017

  14. Polar Antiferromagnets Produced with Orbital-Order

    Authors: Naoki Ogawa, Yasushi Ogimoto, Yoshiaki Ida, Yusuke Nomura, Ryotaro Arita, Kenjiro Miyano

    Abstract: Polar magnetic states are realized in pseudocubic manganite thin films fabricated on high-index substrates, in which a Jahn-Teller (JT) distortion remains an active variable. Several types of orbital-orders were found to develop large optical second harmonic generation, signaling broken-inversion-symmetry distinct from their bulk forms and films on (100) substrates. The observed symmetry-lifting a… ▽ More

    Submitted 5 December, 2011; v1 submitted 5 December, 2011; originally announced December 2011.

    Comments: 5 pages, 4 figures