Skip to main content

Showing 1–9 of 9 results for author: Kihara, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18610  [pdf, other

    cs.CV

    Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo Labeling

    Authors: Haoran Li, Xingjian Li, Jiahua Shi, Huaming Chen, Bo Du, Daisuke Kihara, Johan Barthelemy, Jun Shen, Min Xu

    Abstract: Cryo-Electron Tomography (cryo-ET) is a 3D imaging technology facilitating the study of macromolecular structures at near-atomic resolution. Recent volumetric segmentation approaches on cryo-ET images have drawn widespread interest in biological sector. However, existing methods heavily rely on manually labeled data, which requires highly professional skills, thereby hindering the adoption of full… ▽ More

    Submitted 30 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 11 pages

  2. arXiv:2406.08859  [pdf, other

    cs.CV

    Fusion of regional and sparse attention in Vision Transformers

    Authors: Nabil Ibtehaz, Ning Yan, Masood Mortazavi, Daisuke Kihara

    Abstract: Modern vision transformers leverage visually inspired local interaction between pixels through attention computed within window or grid regions, in contrast to the global attention employed in the original ViT. Regional attention restricts pixel interactions within specific regions, while sparse attention disperses them across sparse grids. These differing approaches pose a challenge between maint… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted as a Workshop Paper at T4V@CVPR2024. arXiv admin note: substantial text overlap with arXiv:2403.04200

  3. arXiv:2403.04200  [pdf, other

    cs.CV

    ACC-ViT : Atrous Convolution's Comeback in Vision Transformers

    Authors: Nabil Ibtehaz, Ning Yan, Masood Mortazavi, Daisuke Kihara

    Abstract: Transformers have elevated to the state-of-the-art vision architectures through innovations in attention mechanism inspired from visual perception. At present two classes of attentions prevail in vision transformers, regional and sparse attention. The former bounds the pixel interactions within a region; the latter spreads them across sparse grids. The opposing natures of them have resulted in a d… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  4. arXiv:2308.13680  [pdf, other

    cs.CV

    ACC-UNet: A Completely Convolutional UNet model for the 2020s

    Authors: Nabil Ibtehaz, Daisuke Kihara

    Abstract: This decade is marked by the introduction of Vision Transformer, a radical paradigm shift in broad computer vision. A similar trend is followed in medical imaging, UNet, one of the most influential architectures, has been redesigned with transformers. Recently, the efficacy of convolutional models in vision is being reinvestigated by seminal works such as ConvNext, which elevates a ResNet to Swin… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  5. arXiv:2204.00613  [pdf, other

    cs.CV cs.LG

    On the Importance of Asymmetry for Siamese Representation Learning

    Authors: Xiao Wang, Haoqi Fan, Yuandong Tian, Daisuke Kihara, Xinlei Chen

    Abstract: Many recent self-supervised frameworks for visual representation learning are based on certain forms of Siamese networks. Such networks are conceptually symmetric with two parallel encoders, but often practically asymmetric as numerous mechanisms are devised to break the symmetry. In this work, we conduct a formal study on the importance of asymmetry by explicitly distinguishing the two encoders w… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: 11 pages, CVPR 2022

  6. SHREC 2021: Classification in cryo-electron tomograms

    Authors: Ilja Gubins, Marten L. Chaillet, Gijs van der Schot, M. Cristina Trueba, Remco C. Veltkamp, Friedrich Förster, Xiao Wang, Daisuke Kihara, Emmanuel Moebel, Nguyen P. Nguyen, Tommi White, Filiz Bunyak, Giorgos Papoulias, Stavros Gerolymatos, Evangelia I. Zacharaki, Konstantinos Moustakas, Xiangrui Zeng, Sinuo Liu, Min Xu, Yaoyu Wang, Cheng Chen, Xuefeng Cui, Fa Zhang

    Abstract: Cryo-electron tomography (cryo-ET) is an imaging technique that allows three-dimensional visualization of macro-molecular assemblies under near-native conditions. Cryo-ET comes with a number of challenges, mainly low signal-to-noise and inability to obtain images from all angles. Computational methods are key to analyze cryo-electron tomograms. To promote innovation in computational methods, we… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Workshop version of the paper can be found here: https://diglib.eg.org/handle/10.2312/3dor20211307

  7. arXiv:1911.09265  [pdf, other

    cs.CV

    EnAET: A Self-Trained framework for Semi-Supervised and Supervised Learning with Ensemble Transformations

    Authors: Xiao Wang, Daisuke Kihara, Jiebo Luo, Guo-Jun Qi

    Abstract: Deep neural networks have been successfully applied to many real-world applications. However, such successes rely heavily on large amounts of labeled data that is expensive to obtain. Recently, many methods for semi-supervised learning have been proposed and achieved excellent performance. In this study, we propose a new EnAET framework to further improve existing semi-supervised methods with self… ▽ More

    Submitted 1 February, 2021; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: 10 pages, 3 figures, conference

  8. arXiv:1910.12995  [pdf, other

    cs.CL cs.LG

    A Simple but Effective BERT Model for Dialog State Tracking on Resource-Limited Systems

    Authors: Tuan Manh Lai, Quan Hung Tran, Trung Bui, Daisuke Kihara

    Abstract: In a task-oriented dialog system, the goal of dialog state tracking (DST) is to monitor the state of the conversation from the dialog history. Recently, many deep learning based methods have been proposed for the task. Despite their impressive performance, current neural architectures for DST are typically heavily-engineered and conceptually complex, making it difficult to implement, debug, and ma… ▽ More

    Submitted 8 February, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Accepted to ICASSP 2020

  9. arXiv:1909.09696  [pdf, other

    cs.CL cs.AI

    A Gated Self-attention Memory Network for Answer Selection

    Authors: Tuan Lai, Quan Hung Tran, Trung Bui, Daisuke Kihara

    Abstract: Answer selection is an important research problem, with applications in many areas. Previous deep learning based approaches for the task mainly adopt the Compare-Aggregate architecture that performs word-level comparison followed by aggregation. In this work, we take a departure from the popular Compare-Aggregate architecture, and instead, propose a new gated self-attention memory network for the… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: Accepted at the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)