Showing 1–4 of 4 results for author: Naruse, A

  1. arXiv:2308.15136  [pdf, other]

    cs.DS cs.CV cs.DB cs.DC cs.IR

    CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs

    Authors: Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, Yong Wang

    Abstract: Approximate Nearest Neighbor Search (ANNS) plays a critical role in various disciplines spanning data mining and artificial intelligence, from information retrieval and computer vision to natural language processing and recommender systems. Data volumes have soared in recent years and the computational cost of an exhaustive exact nearest neighbor search is often prohibitive, necessitating the adop…

    Submitted 29 August, 2023; originally announced August 2023.

  2. arXiv:2301.06672  [pdf, other]

    cs.DC

    Custom 8-bit floating point value format for reducing shared memory bank conflict in approximate nearest neighbor search

    Authors: Hiroyuki Ootomo, Akira Naruse

    Abstract: The k-nearest neighbor search is used in various applications such as machine learning, computer vision, database search, and information retrieval. While the computational cost of the exact nearest neighbor search is enormous, an approximate nearest neighbor search (ANNS) has been attracting much attention. IVFPQ is one of the ANNS methods. Although we can leverage the high bandwidth and low late…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: Extended "extended abstract of the SC22 research poster"

  3. arXiv:2002.06015  [pdf, other]

    cs.LG stat.ML

    Scalable and Practical Natural Gradient for Large-Scale Deep Learning

    Authors: Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Chuan-Sheng Foo, Rio Yokota

    Abstract: Large-scale distributed training of deep neural networks results in models with worse generalization performance as a result of the increase in the effective mini-batch size. Previous approaches attempt to address this problem by varying the learning rate and batch size over epochs and layers, or ad hoc modifications of batch normalization. We propose Scalable and Practical Natural Gradient Descen…

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:1811.12019

  4. arXiv:1811.12019  [pdf, other]

    cs.LG cs.CV stat.ML

    Large-Scale Distributed Second-Order Optimization Using Kronecker-Factored Approximate Curvature for Deep Convolutional Neural Networks

    Authors: Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Rio Yokota, Satoshi Matsuoka

    Abstract: Large-scale distributed training of deep neural networks suffers from the generalization gap caused by the increase in the effective mini-batch size. Previous approaches try to solve this problem by varying the learning rate and batch size over epochs and layers, or some ad hoc modification of the batch normalization. We propose an alternative approach using a second-order optimization method that…

    Submitted 30 March, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: 10 pages, 7 figures. Accepted at CVPR 2019, Long Beach, CA