Showing 1–16 of 16 results for author: Miyato, T

Searching in archive cs.
  1. arXiv:2310.10375  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers

    Authors: Takeru Miyato, Bernhard Jaeger, Max Welling, Andreas Geiger

    Abstract: As transformers are equivariant to the permutation of input tokens, encoding the positional information of tokens is necessary for many tasks. However, since existing positional encoding schemes have been initially designed for NLP tasks, their suitability for vision tasks, which typically exhibit different structural properties in their data, is questionable. We argue that existing positional enc…

    Submitted 7 June, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at ICLR 2024

  2. arXiv:2305.18484  [pdf, other]

    stat.ML cs.LG

    Neural Fourier Transform: A General Approach to Equivariant Representation Learning

    Authors: Masanori Koyama, Kenji Fukumizu, Kohei Hayashi, Takeru Miyato

    Abstract: Symmetry learning has proven to be an effective approach for extracting the hidden structure of data, with the concept of equivariance relation playing the central role. However, most of the current studies are built on architectural theory and corresponding assumptions on the form of data. We propose Neural Fourier Transform (NFT), a general framework of learning the latent linear action of the g…

    Submitted 14 February, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  3. arXiv:2210.07413  [pdf, other]

    stat.ML cs.LG

    Invariance-adapted decomposition and Lasso-type contrastive learning

    Authors: Masanori Koyama, Takeru Miyato, Kenji Fukumizu

    Abstract: Recent years have witnessed the effectiveness of contrastive learning in obtaining the representation of dataset that is useful in interpretation and downstream tasks. However, the mechanism that describes this effectiveness has not been thoroughly analyzed, and many studies have been conducted to investigate the data structures captured by contrastive learning. In particular, the recent study of…

    Submitted 13 October, 2022; originally announced October 2022.

    Journal ref: 2022 ICML workshop of Topology, Algebra and Geometry in Machine Learning (spotlight)

  4. arXiv:2210.05972  [pdf, other]

    cs.LG stat.ML

    Unsupervised Learning of Equivariant Structure from Sequences

    Authors: Takeru Miyato, Masanori Koyama, Kenji Fukumizu

    Abstract: In this study, we present meta-sequential prediction (MSP), an unsupervised framework to learn the symmetry from the time sequence of length at least three. Our method leverages the stationary property (e.g. constant velocity, constant acceleration) of the time sequence to learn the underlying equivariant structure of the dataset by simply training the encoder-decoder model to be able to predict t…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  5. arXiv:2111.07679  [pdf, other]

    stat.ML cs.LG

    Contrastive Representation Learning with Trainable Augmentation Channel

    Authors: Masanori Koyama, Kentaro Minami, Takeru Miyato, Yarin Gal

    Abstract: In contrastive representation learning, data representation is trained so that it can classify the image instances even when the images are altered by augmentations. However, depending on the datasets, some augmentations can damage the information of the images beyond recognition, and such augmentations can result in collapsed representations. We present a partial solution to this problem by forma…

    Submitted 15 November, 2021; originally announced November 2021.

  6. arXiv:1905.13021  [pdf, other]

    stat.ML cs.IT cs.LG

    Robustness to Adversarial Perturbations in Learning from Incomplete Data

    Authors: Amir Najafi, Shin-ichi Maeda, Masanori Koyama, Takeru Miyato

    Abstract: What is the role of unlabeled data in an inference problem, when the presumed underlying distribution is adversarially perturbed? To provide a concrete answer to this question, this paper unifies two major learning frameworks: Semi-Supervised Learning (SSL) and Distributionally Robust Learning (DRL). We develop a generalization theory for our framework based on a number of novel complexity measure…

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: 41 pages, 9 figures

  7. arXiv:1811.10153  [pdf, other]

    cs.CV cs.LG

    Spatially Controllable Image Synthesis with Internal Representation Collaging

    Authors: Ryohei Suzuki, Masanori Koyama, Takeru Miyato, Taizan Yonetsuji, Huachun Zhu

    Abstract: We present a novel CNN-based image editing strategy that allows the user to change the semantic information of an image over an arbitrary region by manipulating the feature-space representation of the image in a trained GAN model. We will present two variants of our strategy: (1) spatial conditional batch normalization (sCBN), a type of conditional batch normalization with user-specifiable spatial…

    Submitted 9 April, 2019; v1 submitted 25 November, 2018; originally announced November 2018.

  8. arXiv:1805.06386  [pdf, other]

    stat.ML cs.CV cs.LG

    Neural Multi-scale Image Compression

    Authors: Ken Nakanishi, Shin-ichi Maeda, Takeru Miyato, Daisuke Okanohara

    Abstract: This study presents a new lossy image compression method that utilizes the multi-scale features of natural images. Our model consists of two networks: multi-scale lossy autoencoder and parallel multi-scale lossless coder. The multi-scale lossy autoencoder extracts the multi-scale image features to quantized variables and the parallel multi-scale lossless coder enables rapid and accurate lossless c…

    Submitted 16 May, 2018; originally announced May 2018.

    Comments: 15 pages, 15 figures

  9. arXiv:1802.05957  [pdf, other]

    cs.LG cs.CV stat.ML

    Spectral Normalization for Generative Adversarial Networks

    Authors: Takeru Miyato, Toshiki Kataoka, Masanori Koyama, Yuichi Yoshida

    Abstract: One of the challenges in the study of generative adversarial networks is the instability of its training. In this paper, we propose a novel weight normalization technique called spectral normalization to stabilize the training of the discriminator. Our new normalization technique is computationally light and easy to incorporate into existing implementations. We tested the efficacy of spectral norm…

    Submitted 16 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

  10. arXiv:1802.05637  [pdf, other]

    cs.LG cs.CV stat.ML

    cGANs with Projection Discriminator

    Authors: Takeru Miyato, Masanori Koyama

    Abstract: We propose a novel, projection based way to incorporate the conditional information into the discriminator of GANs that respects the role of the conditional information in the underlying probabilistic model. This approach is in contrast with most frameworks of conditional GANs used in application today, which use the conditional information by concatenating the (embedded) conditional vector to th…

    Submitted 14 August, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

  11. arXiv:1711.07170  [pdf, other]

    cs.CV cs.AI

    Parameter Reference Loss for Unsupervised Domain Adaptation

    Authors: Jiren **, Richard G. Calland, Takeru Miyato, Brian K. Vogel, Hideki Nakayama

    Abstract: The success of deep learning in computer vision is mainly attributed to an abundance of data. However, collecting large-scale data is not always possible, especially for the supervised labels. Unsupervised domain adaptation (UDA) aims to utilize labeled data from a source domain to learn a model that generalizes to a target domain of unlabeled data. A large amount of existing work uses Siamese net…

    Submitted 5 December, 2017; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: Add experiments that compare parameter reference loss with existing methods using the same architecture

  12. arXiv:1705.10941  [pdf, other]

    stat.ML cs.LG

    Spectral Norm Regularization for Improving the Generalizability of Deep Learning

    Authors: Yuichi Yoshida, Takeru Miyato

    Abstract: We investigate the generalizability of deep learning based on the sensitivity to input perturbation. We hypothesize that the high sensitivity to the perturbation of data degrades the performance on it. To reduce the sensitivity to perturbation, we propose a simple and effective regularization method, referred to as spectral norm regularization, which penalizes the high spectral norm of weight matr…

    Submitted 31 May, 2017; originally announced May 2017.

  13. arXiv:1704.03976  [pdf, other]

    stat.ML cs.LG

    Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning

    Authors: Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Shin Ishii

    Abstract: We propose a new regularization method based on virtual adversarial loss: a new measure of local smoothness of the conditional label distribution given input. Virtual adversarial loss is defined as the robustness of the conditional label distribution around each input data point against local perturbation. Unlike adversarial training, our method defines the adversarial direction without label info…

    Submitted 27 June, 2018; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

  14. arXiv:1702.08720  [pdf, other]

    stat.ML cs.LG

    Learning Discrete Representations via Information Maximizing Self-Augmented Training

    Authors: Weihua Hu, Takeru Miyato, Seiya Tokui, Eiichi Matsumoto, Masashi Sugiyama

    Abstract: Learning discrete representations of data is a central machine learning task because of the compactness of the representations and ease of interpretation. The task includes clustering and hash learning as special cases. Deep neural networks are promising to be used because they can model the non-linearity of data and scale to large datasets. However, their model complexity is huge, and therefore,…

    Submitted 14 June, 2017; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: To appear at ICML 2017

  15. arXiv:1605.07725  [pdf, ps, other]

    stat.ML cs.LG

    Adversarial Training Methods for Semi-Supervised Text Classification

    Authors: Takeru Miyato, Andrew M. Dai, Ian Goodfellow

    Abstract: Adversarial training provides a means of regularizing supervised learning algorithms while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting. However, both methods require making small perturbations to numerous entries of the input vector, which is inappropriate for sparse high-dimensional inputs such as one-hot word representations. We ex…

    Submitted 16 November, 2021; v1 submitted 25 May, 2016; originally announced May 2016.

    Comments: Published as a conference paper at ICLR 2017

  16. arXiv:1507.00677  [pdf, other]

    stat.ML cs.LG

    Distributional Smoothing with Virtual Adversarial Training

    Authors: Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Ken Nakae, Shin Ishii

    Abstract: We propose local distributional smoothness (LDS), a new notion of smoothness for statistical model that can be used as a regularization term to promote the smoothness of the model distribution. We named the LDS based regularization as virtual adversarial training (VAT). The LDS of a model at an input datapoint is defined as the KL-divergence based robustness of the model distribution against local…

    Submitted 11 June, 2016; v1 submitted 2 July, 2015; originally announced July 2015.

    Comments: Under review as a conference paper at ICLR 2016