Skip to main content

Showing 1–12 of 12 results for author: Taki, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12220  [pdf, other

    cs.LG cond-mat.dis-nn cs.CV cs.NE stat.ML

    Hierarchical Associative Memory, Parallelized MLP-Mixer, and Symmetry Breaking

    Authors: Ryo Karakida, Toshihiro Ota, Masato Taki

    Abstract: Transformers have established themselves as the leading neural network model in natural language processing and are increasingly foundational in various domains. In vision, the MLP-Mixer model has demonstrated competitive performance, suggesting that attention mechanisms might not be indispensable. Inspired by this, recent research has explored replacing attention modules with other mechanisms, in… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 18 pages

  2. arXiv:2403.06676  [pdf, other

    cs.CV

    CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective

    Authors: Shunsuke Yasuki, Masato Taki

    Abstract: Recently, convolutional neural networks (CNNs) with large size kernels have attracted much attention in the computer vision field, following the success of the Vision Transformers. Large kernel CNNs have been reported to perform well in downstream vision tasks as well as in classification performance. The reason for the high-performance of large kernel CNNs in downstream tasks has been attributed… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024. Code: https://github.com/snskysk/CAM-Back-Again

  3. arXiv:2304.13061  [pdf, other

    cs.LG cond-mat.dis-nn cs.CV cs.NE

    iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer

    Authors: Toshihiro Ota, Masato Taki

    Abstract: In the last few years, the success of Transformers in computer vision has stimulated the discovery of many alternative models that compete with Transformers, such as the MLP-Mixer. Despite their weak inductive bias, these models have achieved performance comparable to well-studied convolutional neural networks. Recent studies on modern Hopfield networks suggest the correspondence between certain e… ▽ More

    Submitted 1 April, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 19 pages. v2: minor improvements

  4. arXiv:2303.03932  [pdf, other

    cs.CV cs.AI cs.LG

    FFT-based Dynamic Token Mixer for Vision

    Authors: Yuki Tatsunami, Masato Taki

    Abstract: Multi-head-self-attention (MHSA)-equipped models have achieved notable performance in computer vision. Their computational complexity is proportional to quadratic numbers of pixels in input feature maps, resulting in slow processing, especially when dealing with high-resolution images. New types of token-mixer are proposed as an alternative to MHSA to circumvent this problem: an FFT-based token-mi… ▽ More

    Submitted 17 December, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: The 38th Annual AAAI Conference on Artificial Intelligence (AAAI'24)

  5. arXiv:2302.01526  [pdf, other

    cs.AI cs.CV cs.LG physics.geo-ph

    Example-Based Explainable AI and its Application for Remote Sensing Image Classification

    Authors: Shin-nosuke Ishikawa, Masato Todo, Masato Taki, Yasunobu Uchiyama, Kazunari Matsunaga, Peihsuan Lin, Taiki Ogihara, Masao Yasui

    Abstract: We present a method of explainable artificial intelligence (XAI), "What I Know (WIK)", to provide additional information to verify the reliability of a deep learning model by showing an example of an instance in a training dataset that is similar to the input data to be inferred and demonstrate it in a remote sensing image classification task. One of the expected roles of XAI methods is verifying… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 10 pages, 4 figures, accepted for publication in International Journal of Applied Earth Observation and Geoinformation

  6. arXiv:2205.01972  [pdf, other

    cs.CV cs.AI cs.LG

    Sequencer: Deep LSTM for Image Classification

    Authors: Yuki Tatsunami, Masato Taki

    Abstract: In recent computer vision research, the advent of the Vision Transformer (ViT) has rapidly revolutionized various architectural design efforts: ViT achieved state-of-the-art image classification performance using self-attention found in natural language processing, and MLP-Mixer achieved competitive performance using simple multi-layer perceptrons. In contrast, several studies have also suggested… ▽ More

    Submitted 12 January, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted in NeurIPS 2022; camera ready edition

  7. arXiv:2204.04906  [pdf, other

    cond-mat.mes-hall cs.CE cs.LG quant-ph

    Application of QUBO solver using black-box optimization to structural design for resonance avoidance

    Authors: Tadayoshi Matsumori, Masato Taki, Tadashi Kadowaki

    Abstract: Quadratic unconstrained binary optimization (QUBO) solvers can be applied to design an optimal structure to avoid resonance. QUBO algorithms that work on a classical or quantum device have succeeded in some industrial applications. However, their applications are still limited due to the difficulty of transforming from the original optimization problem to QUBO. Recently, black-box optimization (BB… ▽ More

    Submitted 19 July, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: This is a preprint of an article published in Scientific Reports. The final authenticated version is available online at: 10.1038/s41598-022-16149-8

    Journal ref: Sci Rep 12, 12143 (2022)

  8. arXiv:2110.14927  [pdf

    q-bio.NC cs.CV cs.LG

    Counterfactual Explanation of Brain Activity Classifiers using Image-to-Image Transfer by Generative Adversarial Network

    Authors: Teppei Matsui, Masato Taki, Trung Quang Pham, Junichi Chikazoe, Koji Jimura

    Abstract: Deep neural networks (DNNs) can accurately decode task-related information from brain activations. However, because of the nonlinearity of the DNN, the decisions made by DNNs are hardly interpretable. One of the promising approaches for explaining such a black-box system is counterfactual explanation. In this framework, the behavior of a black-box system is explained by comparing real data and rea… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: 28 pages, 6 figures, 3 tables, 2 supplementary figures, 1 supplementary table

  9. arXiv:2108.04384  [pdf, other

    cs.CV cs.AI cs.LG

    RaftMLP: How Much Can Be Done Without Attention and with Less Spatial Locality?

    Authors: Yuki Tatsunami, Masato Taki

    Abstract: For the past ten years, CNN has reigned supreme in the world of computer vision, but recently, Transformer has been on the rise. However, the quadratic computational cost of self-attention has become a serious problem in practice applications. There has been much research on architectures without CNN and self-attention in this context. In particular, MLP-Mixer is a simple architecture designed usi… ▽ More

    Submitted 12 January, 2023; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: ACCV2022 accepted

  10. arXiv:1709.02956  [pdf, other

    cs.LG stat.ML

    Deep Residual Networks and Weight Initialization

    Authors: Masato Taki

    Abstract: Residual Network (ResNet) is the state-of-the-art architecture that realizes successful training of really deep neural network. It is also known that good weight initialization of neural network avoids problem of vanishing/exploding gradients. In this paper, simplified models of ResNets are analyzed. We argue that goodness of ResNet is correlated with the fact that ResNets are relatively insensiti… ▽ More

    Submitted 9 September, 2017; originally announced September 2017.

    Comments: 10 pages, 4 figures

    Report number: RIKEN-iTHEMS-Report-17

  11. arXiv:0912.1333  [pdf

    cs.IT

    Spectral Efficiency Optimization for an Interfering Cognitive Radio with Adaptive Modulation and Coding

    Authors: Mehrdad Taki, Farshad Lahouti

    Abstract: In this paper, we consider a primary and a cognitive user transmitting over a wireless fading interference channel. The primary user transmits with a constant power and utilizes an adaptive modulation and coding (AMC) scheme satisfying a bit error rate requirement. We propose a link adaptation scheme to maximize the average spectral efficiency of the cognitive radio, while a minimum required spe… ▽ More

    Submitted 7 December, 2009; originally announced December 2009.

    Comments: 26 pages, 7 figures submitted to IEEE journal on selected areas in communication

  12. arXiv:0903.0099  [pdf, other

    cs.IT

    Spectral Efficiency Optimized Adaptive Transmission for Cognitive Radios in an Interference Channel

    Authors: Mehrdad Taki, Farshad Lahouti

    Abstract: In this paper, we consider a primary and a cognitive user transmitting over a wireless fading interference channel. The primary user transmits with a constant power and utilizes an adaptive modulation and coding (AMC) scheme satisfying a bit error rate requirement. We propose a link adaptation scheme to maximize the average spectral efficiency of the cognitive radio, while a minimum required spe… ▽ More

    Submitted 28 February, 2009; originally announced March 2009.

    Comments: accepted in ICC 2009