Showing 1–34 of 34 results for author: Yamaguchi, S

Searching in archive cs.

  1. arXiv:2403.17423  [pdf, other]

    cs.CV stat.ML

    Test-time Adaptation Meets Image Enhancement: Improving Accuracy via Uncertainty-aware Logit Switching

    Authors: Shohei Enomoto, Naoya Hasegawa, Kazuki Adachi, Taku Sasaki, Shin'ya Yamaguchi, Satoshi Suzuki, Takeharu Eda

    Abstract: Deep neural networks have achieved remarkable success in a variety of computer vision applications. However, there is a problem of degrading accuracy when the data distribution shifts between training and testing. As a solution of this problem, Test-time Adaptation (TTA) has been well studied because of its practicality. Although TTA methods increase accuracy under distribution shift by updating t…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to IJCNN2024

  2. arXiv:2403.14114  [pdf, other]

    cs.CV

    Test-time Similarity Modification for Person Re-identification toward Temporal Distribution Shift

    Authors: Kazuki Adachi, Shohei Enomoto, Taku Sasaki, Shin'ya Yamaguchi

    Abstract: Person re-identification (re-id), which aims to retrieve images of the same person in a given image from a database, is one of the most practical image recognition applications. In the real world, however, the environments that the images are taken from change over time. This causes a distribution shift between training and testing and degrades the performance of re-id. To maintain re-id performan…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted to IJCNN2024

  3. arXiv:2403.10097  [pdf, other]

    cs.LG cs.AI cs.CV

    Adaptive Random Feature Regularization on Fine-tuning Deep Neural Networks

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Kazuki Adachi, Daiki Chijiwa

    Abstract: While fine-tuning is a de facto standard method for training deep neural networks, it still suffers from overfitting when using small target datasets. Previous methods improve fine-tuning performance by maintaining knowledge of the source datasets or introducing regularization terms such as contrastive loss. However, these methods require auxiliary source information (e.g., source labels or datase…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  4. arXiv:2311.13090  [pdf, other]

    cs.AI cs.CV

    On the Limitation of Diffusion Models for Synthesizing Training Datasets

    Authors: Shin'ya Yamaguchi, Takuma Fukuda

    Abstract: Synthetic samples from diffusion models are promising for leveraging in training discriminative models as replications of real training datasets. However, we found that the synthetic datasets degrade classification performance over real datasets even when using state-of-the-art diffusion models. This means that modern diffusion models do not perfectly represent the data distribution for the purpos…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023 SyntheticData4ML Workshop

  5. arXiv:2310.03913  [pdf, other]

    cs.RO

    TRAIL Team Description Paper for RoboCup@Home 2023

    Authors: Chikaha Tsuji, Dai Komukai, Mimo Shirasaka, Hikaru Wada, Tsunekazu Omija, Aoi Horo, Daiki Furuta, Saki Yamaguchi, So Ikoma, Soshi Tsunashima, Masato Kobayashi, Koki Ishimoto, Yuya Ikeda, Tatsuya Matsushima, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Our team, TRAIL, consists of AI/ML laboratory members from The University of Tokyo. We leverage our extensive research experience in state-of-the-art machine learning to build general-purpose in-home service robots. We previously participated in two competitions using Human Support Robot (HSR): RoboCup@Home Japan Open 2020 (DSPL) and World Robot Summit 2020, equivalent to RoboCup World Tournament.…

    Submitted 5 October, 2023; originally announced October 2023.

  6. arXiv:2309.16143  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples

    Authors: Shin'ya Yamaguchi

    Abstract: Semi-supervised learning (SSL) is a promising approach for training deep classification models using labeled and unlabeled datasets. However, existing SSL methods rely on a large unlabeled dataset, which may not always be available in many real-world applications due to legal constraints (e.g., GDPR). In this paper, we investigate the research question: Can we train SSL models without real unlabel…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML2023); a preprint of the camera-ready version

  7. arXiv:2308.16454  [pdf, other]

    cs.CV cs.LG

    Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff

    Authors: Satoshi Suzuki, Shin'ya Yamaguchi, Shoichiro Takeda, Sekitoshi Kanai, Naoki Makishima, Atsushi Ando, Ryo Masumura

    Abstract: This paper addresses the tradeoff between standard accuracy on clean examples and robustness against adversarial examples in deep neural networks (DNNs). Although adversarial training (AT) improves robustness, it degrades the standard accuracy, thus yielding the tradeoff. To mitigate this tradeoff, we propose a novel AT method called ARREST, which comprises three components: (i) adversarial finetu…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by International Conference on Computer Vision (ICCV) 2023

  8. arXiv:2307.13899  [pdf, other]

    cs.LG cs.AI cs.CV

    Regularizing Neural Networks with Meta-Learning Generative Models

    Authors: Shin'ya Yamaguchi, Daiki Chijiwa, Sekitoshi Kanai, Atsutoshi Kumagai, Hisashi Kashima

    Abstract: This paper investigates methods for improving generative data augmentation for deep learning. Generative data augmentation leverages the synthetic samples produced by generative models as an additional dataset for classification with small dataset settings. A key challenge of generative data augmentation is that the synthetic data contain uninformative samples that degrade accuracy. This is becaus…

    Submitted 23 October, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2023

  9. arXiv:2306.10656  [pdf, other]

    cs.LG cs.AI stat.ML

    Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics

    Authors: Kenta Oono, Nontawat Charoenphakdee, Kotatsu Bito, Zhengyan Gao, Yoshiaki Ota, Shoichiro Yamaguchi, Yohei Sugawara, Shin-ichi Maeda, Kunihiko Miyoshi, Yuki Saito, Koki Tsuda, Hiroshi Maruyama, Kohei Hayashi

    Abstract: Identifying the relationship between healthcare attributes, lifestyles, and personality is vital for understanding and improving physical and mental conditions. Machine learning approaches are promising for modeling their relationships and offering actionable suggestions. In this paper, we propose Virtual Human Generative Model (VHGM), a machine learning model for estimating attributes about healt…

    Submitted 14 August, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 14 pages, 4 figures

  10. arXiv:2306.05641  [pdf, other]

    cs.LG cs.AI

    Revisiting Permutation Symmetry for Merging Models between Different Datasets

    Authors: Masanori Yamada, Tomoya Yamashita, Shin'ya Yamaguchi, Daiki Chijiwa

    Abstract: Model merging is a new approach to creating a new model by combining the weights of different trained models. Previous studies report that model merging works well for models trained on a single dataset with different random seeds, while model merging between different datasets is difficult. Merging knowledge from different datasets has practical significance, but it has not been well investigated…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 18 pages; comments are welcome

  11. arXiv:2304.12304  [pdf]

    cs.AI cs.CY cs.LG

    A Survey on Multi-Resident Activity Recognition in Smart Environments

    Authors: Farhad MortezaPour Shiri, Thinagaran Perumal, Norwati Mustapha, Raihani Mohamed, Mohd Anuaruddin Bin Ahmadon, Shingo Yamaguchi

    Abstract: Human activity recognition (HAR) is a rapidly growing field that utilizes smart devices, sensors, and algorithms to automatically classify and identify the actions of individuals within a given environment. These systems have a wide range of applications, including assisting with caring tasks, increasing security, and improving energy efficiency. However, there are several challenges that must be…

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: 16 pages, to appear in Evolution of Information, Communication and Computing Systems (EICCS) Book Series

  12. arXiv:2210.05268  [pdf, other]

    cs.LG

    Component-Wise Natural Gradient Descent -- An Efficient Neural Network Optimization

    Authors: Tran Van Sang, Mhd Irvan, Rie Shigetomi Yamaguchi, Toshiyuki Nakata

    Abstract: Natural Gradient Descent (NGD) is a second-order neural network training that preconditions the gradient descent with the inverse of the Fisher Information Matrix (FIM). Although NGD provides an efficient preconditioner, it is not practicable due to the expensive computation required when inverting the FIM. This paper proposes a new NGD variant algorithm named Component-Wise Natural Gradient Desce…

    Submitted 11 October, 2022; originally announced October 2022.

  13. arXiv:2207.10283  [pdf, other]

    cs.LG cs.AI stat.ML

    One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training

    Authors: Sekitoshi Kanai, Shin'ya Yamaguchi, Masanori Yamada, Hiroshi Takahashi, Kentaro Ohno, Yasutoshi Ida

    Abstract: This paper proposes a new loss function for adversarial training. Since adversarial training has difficulties, e.g., necessity of high model capacity, focusing on important data points by weighting cross-entropy loss has attracted much attention. However, they are vulnerable to sophisticated attacks, e.g., Auto-Attack. This paper experimentally reveals that the cause of their vulnerability is thei…

    Submitted 26 April, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ICML2023, 26 pages, 19 figures

  14. arXiv:2205.15619  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Atsutoshi Kumagai, Yasutoshi Ida

    Abstract: Few-shot learning for neural networks (NNs) is an important problem that aims to train NNs with a few data. The main challenge is how to avoid overfitting since over-parameterized NNs can easily overfit to such small dataset. Previous work (e.g. MAML by Finn et al. 2017) tackles this challenge by meta-learning, which learns how to learn from a few data by using various tasks. On the other hand, on…

    Submitted 9 February, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  15. arXiv:2204.13263  [pdf, other]

    cs.LG

    Covariance-aware Feature Alignment with Pre-computed Source Statistics for Test-time Adaptation to Multiple Image Corruptions

    Authors: Kazuki Adachi, Shin'ya Yamaguchi, Atsutoshi Kumagai

    Abstract: Real-world image recognition systems often face corrupted input images, which cause distribution shifts and degrade the performance of models. These systems often use a single prediction model in a central server and process images sent from various environments, such as cameras distributed in cities or cars. Such single models face images corrupted in heterogeneous ways in test time. Thus, they r…

    Submitted 29 June, 2023; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Extended version of the paper accepted to ICIP 2023

  16. arXiv:2204.12833  [pdf, other]

    cs.LG cs.AI stat.ML

    Transfer Learning with Pre-trained Conditional Generative Models

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Atsutoshi Kumagai, Daiki Chijiwa, Hisashi Kashima

    Abstract: Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and (iii) target network architectures are consistent with source ones. However, holding these assumptions is difficult in practical settings because the target task ra…

    Submitted 29 September, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: 24 pages, 6 figures

  17. arXiv:2202.13062  [pdf, other]

    cs.RO

    Learning-based Collision-free Planning on Arbitrary Optimization Criteria in the Latent Space through cGANs

    Authors: Tomoki Ando, Hiroto Iino, Hiroki Mori, Ryota Torishima, Kuniyuki Takahashi, Shoichiro Yamaguchi, Daisuke Okanohara, Tetsuya Ogata

    Abstract: We propose a new method for collision-free planning using Conditional Generative Adversarial Networks (cGANs) to transform between the robot's joint space and a latent space that captures only collision-free areas of the joint space, conditioned by an obstacle map. Generating multiple plausible trajectories is convenient in applications such as the manipulation of a robot arm by enabling the selec…

    Submitted 5 February, 2023; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: 19 pages, 7 figures. An accompanying video is available at https://www.youtube.com/watch?v=IJUxdmaSwy0. arXiv admin note: text overlap with arXiv:2202.07203

  18. arXiv:2202.07203  [pdf, other]

    cs.RO

    Collision-free Path Planning in the Latent Space through cGANs

    Authors: Tomoki Ando, Hiroki Mori, Ryota Torishima, Kuniyuki Takahashi, Shoichiro Yamaguchi, Daisuke Okanohara, Tetsuya Ogata

    Abstract: We show a new method for collision-free path planning by cGANs by mapping its latent space to only the collision-free areas of the robot joint space. Our method simply provides this collision-free latent space after which any planner, using any optimization conditions, can be used to generate the most suitable paths on the fly. We successfully verified this method with a simulated two-link robot a…

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 10 pages, 9 figures

  19. arXiv:2202.04237  [pdf, other]

    cs.CV cs.AI cs.LG

    Learning Robust Convolutional Neural Networks with Relevant Feature Focusing via Explanations

    Authors: Kazuki Adachi, Shin'ya Yamaguchi

    Abstract: Existing image recognition techniques based on convolutional neural networks (CNNs) basically assume that the training and test datasets are sampled from i.i.d distributions. However, this assumption is easily broken in the real world because of the distribution shift that occurs when the co-occurrence relations between objects and backgrounds in input images change. Under this type of distributio…

    Submitted 23 March, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted by ICME 2022

  20. arXiv:2106.09269  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Pruning Randomly Initialized Neural Networks with Iterative Randomization

    Authors: Daiki Chijiwa, Shin'ya Yamaguchi, Yasutoshi Ida, Kenji Umakoshi, Tomohiro Inoue

    Abstract: Pruning the weights of randomly initialized neural networks plays an important role in the context of lottery ticket hypothesis. Ramanujan et al. (2020) empirically showed that only pruning the weights can achieve remarkable performance instead of optimizing the weight values. However, to achieve the same level of performance as the weight optimization, the pruning approach requires more parameter…

    Submitted 5 April, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021); Selected for a spotlight presentation

  21. arXiv:2106.02343  [pdf, other]

    cs.CV cs.LG eess.IV

    F-Drop&Match: GANs with a Dead Zone in the High-Frequency Domain

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai

    Abstract: Generative adversarial networks built from deep convolutional neural networks (GANs) lack the ability to exactly replicate the high-frequency components of natural images. To alleviate this issue, we introduce two novel training techniques called frequency dropping (F-Drop) and frequency matching (F-Match). The key idea of F-Drop is to filter out unnecessary high-frequency components from the inpu…

    Submitted 18 August, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Accepted to ICCV 2021; Added experiments on StyleGAN2-ADA

  22. arXiv:2010.02558  [pdf, other]

    stat.ML cs.AI cs.LG

    Constraining Logits by Bounded Function for Adversarial Robustness

    Authors: Sekitoshi Kanai, Masanori Yamada, Shin'ya Yamaguchi, Hiroshi Takahashi, Yasutoshi Ida

    Abstract: We propose a method for improving adversarial robustness by addition of a new bounded function just before softmax. Recent studies hypothesize that small logits (inputs of softmax) by logit regularization can improve adversarial robustness of deep learning. Following this hypothesis, we analyze norms of logit vectors at the optimal point under the assumption of universal approximation and explore…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 19 pages, 16 figures

  23. arXiv:2008.01883  [pdf, other]

    stat.ML cs.LG

    When is invariance useful in an Out-of-Distribution Generalization problem ?

    Authors: Masanori Koyama, Shoichiro Yamaguchi

    Abstract: The goal of Out-of-Distribution (OOD) generalization problem is to train a predictor that generalizes on all environments. Popular approaches in this field use the hypothesis that such a predictor shall be an invariant predictor that captures the mechanism that remains constant across environments. While these approaches have been experimentally successful in various case studies, there i…

    Submitted 25 November, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

  24. arXiv:2007.05722  [pdf, other]

    cs.CV cs.SD eess.AS

    Do We Need Sound for Sound Source Localization?

    Authors: Takashi Oya, Shohei Iwase, Ryota Natsume, Takahiro Itazuri, Shugo Yamaguchi, Shigeo Morishima

    Abstract: During the performance of sound source localization which uses both visual and aural information, it presently remains unclear how much either image or sound modalities contribute to the result, i.e. do we need both image and sound for sound source localization? To address this question, we develop an unsupervised learning system that solves sound source localization by decomposing this task into…

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: Paper: 14 pages, 6 figures. Supplementary Material: 6 pages, 3 figures. Videos and Codes will be released later

  25. arXiv:1912.11603  [pdf, other]

    stat.ML cs.CV cs.LG

    Image Enhanced Rotation Prediction for Self-Supervised Learning

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Tetsuya Shioda, Shoichiro Takeda

    Abstract: The rotation prediction (Rotation) is a simple pretext-task for self-supervised learning (SSL), where models learn useful representations for target vision tasks by solving pretext-tasks. Although Rotation captures information of object shapes, it hardly captures information of textures. To tackle this problem, we introduce a novel pretext-task called image enhanced rotation prediction (IE-Rot) fo…

    Submitted 4 June, 2021; v1 submitted 25 December, 2019; originally announced December 2019.

    Comments: Accepted to IEEE ICIP 2021. The title has been changed from "Multiple Pretext-Task for Self-Supervised Learning via Mixing Multiple Image Transformations"

  26. arXiv:1912.11597  [pdf, other]

    stat.ML cs.CV cs.LG

    Effective Data Augmentation with Multi-Domain Learning GANs

    Authors: Shin'ya Yamaguchi, Sekitoshi Kanai, Takeharu Eda

    Abstract: For deep learning applications, the massive data development (e.g., collecting, labeling), which is an essential process in building practical applications, still incurs seriously high costs. In this work, we propose an effective data augmentation method based on generative adversarial networks (GANs), called Domain Fusion. Our key idea is to import the knowledge contained in an outer dataset to a…

    Submitted 25 December, 2019; originally announced December 2019.

    Comments: AAAI-2020

  27. arXiv:1911.08444  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    MANGA: Method Agnostic Neural-policy Generalization and Adaptation

    Authors: Homanga Bharadhwaj, Shoichiro Yamaguchi, Shin-ichi Maeda

    Abstract: In this paper we target the problem of transferring policies across multiple environments with different dynamics parameters and motor noise variations, by introducing a framework that decouples the processes of policy learning and system identification. Efficiently transferring learned policies to an unknown environment with changes in dynamics configurations in the presence of motor noise is ver…

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: Under Review. Video available at https://drive.google.com/file/d/12GsDq3iQDXEutE-xpzXxqrEfD6dYhKjs/view?usp=sharing Other details will be made available in the author's webpage www.homangabharadhwaj.com

  28. arXiv:1910.03253  [pdf, other]

    cs.RO cs.LG

    Motion Generation Considering Situation with Conditional Generative Adversarial Networks for Throwing Robots

    Authors: Kyo Kutsuzawa, Hitoshi Kusano, Ayaka Kume, Shoichiro Yamaguchi

    Abstract: When robots work in a cluttered environment, the constraints for motions change frequently and the required action can change even for the same task. However, planning complex motions from direct calculation has the risk of resulting in poor performance local optima. In addition, machine learning approaches often require relearning for novel situations. In this paper, we propose a method of search…

    Submitted 8 October, 2019; originally announced October 2019.

  29. arXiv:1906.08412  [pdf, other]

    cs.LG stat.ML

    Data Interpolating Prediction: Alternative Interpretation of Mixup

    Authors: Takuya Shimada, Shoichiro Yamaguchi, Kohei Hayashi, Sosuke Kobayashi

    Abstract: Data augmentation by mixing samples, such as Mixup, has widely been used typically for classification tasks. However, this strategy is not always effective due to the gap between augmented samples for training and original samples for testing. This gap may prevent a classifier from learning the optimal decision boundary and increase the generalization error. To overcome this problem, we propose an…

    Submitted 19 June, 2019; originally announced June 2019.

    Comments: Presented at the 2nd Learning from Limited Labeled Data (LLD) Workshop at ICLR 2019

  30. arXiv:1906.04868  [pdf, other]

    cs.LG stat.ML

    Semi-flat minima and saddle points by embedding neural networks to overparameterization

    Authors: Kenji Fukumizu, Shoichiro Yamaguchi, Yoh-ichi Mototake, Mirai Tanaka

    Abstract: We theoretically study the landscape of the training error for neural networks in overparameterized cases. We consider three basic methods for embedding a network into a wider one with more hidden units, and discuss whether a minimum point of the narrower network gives a minimum or saddle point of the wider one. Our results show that the networks with smooth and ReLU activation have different part…

    Submitted 14 June, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 38 pages, 4 figures

  31. arXiv:1904.10595  [pdf, other]

    cs.CR

    Influences of Human Demographics, Brand Familiarity and Security Backgrounds on Homograph Recognition

    Authors: Tran Phuong Thao, Yukiko Sawaya, Hoang-Quoc Nguyen-Son, Akira Yamada, Ayumu Kubota, Tran Van Sang, Rie Shigetomi Yamaguchi

    Abstract: Homograph attack is a way that attackers deceive victims about which website domain name they are communicating with by exploiting the fact that many characters look alike. The attack becomes serious and is raising broad attention when recently many brand domains have been attacked such as Apple Inc., Adobe Inc., Lloyds Bank, etc. We first design a survey of human demographics, brand familiarity,…

    Submitted 26 January, 2020; v1 submitted 23 April, 2019; originally announced April 2019.

  32. arXiv:1903.09538  [pdf]

    q-bio.QM cs.LG eess.IV

    Use of Ghost Cytometry to Differentiate Cells with Similar Gross Morphologic Characteristics

    Authors: Hiroaki Adachi, Yoko Kawamura, Keiji Nakagawa, Ryoichi Horisaki, Issei Sato, Satoko Yamaguchi, Katsuhito Fujiu, Kayo Waki, Hiroyuki Noji, Sadao Ota

    Abstract: Imaging flow cytometry shows significant potential for increasing our understanding of heterogeneous and complex life systems and is useful for biomedical applications. Ghost cytometry is a recently proposed approach for directly analyzing compressively measured signals, thereby relieving the computational bottleneck observed in high-throughput cytometry based on morphological information. While t…

    Submitted 22 March, 2019; originally announced March 2019.

  33. arXiv:1902.02992  [pdf, other]

    stat.ML cs.LG

    A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning

    Authors: Yoshihiro Nagano, Shoichiro Yamaguchi, Yasuhiro Fujita, Masanori Koyama

    Abstract: Hyperbolic space is a geometry that is known to be well-suited for representation learning of data with an underlying hierarchical structure. In this paper, we present a novel hyperbolic distribution called pseudo-hyperbolic Gaussian, a Gaussian-like distribution on hyperbolic space whose density can be evaluated analytically and differentiated with respect to the parameters. Our distribu…

    Submitted 9 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: 20 pages, 12 figures

  34. arXiv:1812.05296  [pdf]

    cs.MA

    Aerial Robot Model based design and verification of the single and multi-agent inspection application development

    Authors: Seiko P. Yamaguchi, Masaru Sakuma, Takaki Ueno, Filip Karolonek, Tadeusz Uhl, Ankit A. Ravankar, Takanori Emaru, Yukinori Kobayashi

    Abstract: In recent decade, potential application of Unmanned Aerial Vehicles (UAV) has enabled replacement of various operations in hard-to-access areas, such as, inspection, surveillance or search and rescue applications in challenging and complex environments. Furthermore, aerial robotics application with multi-agent systems are anticipated to further extend its potential. However, one of the major diffi…

    Submitted 13 December, 2018; originally announced December 2018.

    Comments: 3 pages, 5 figures, conference paper