Skip to main content

Showing 1–4 of 4 results for author: Fujiki, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.14029  [pdf, other

    cs.LG cs.AI stat.ML

    Partial Search in a Frozen Network is Enough to Find a Strong Lottery Ticket

    Authors: Hikari Otsuka, Daiki Chijiwa, Ángel López García-Arias, Yasuyuki Okoshi, Kazushi Kawamura, Thiem Van Chu, Daichi Fujiki, Susumu Takeuchi, Masato Motomura

    Abstract: Randomly initialized dense networks contain subnetworks that achieve high accuracy without weight learning -- strong lottery tickets (SLTs). Recently, Gadhikar et al. (2023) demonstrated that SLTs can also be found within a randomly pruned source network, thus reducing the SLT search space. However, this limits the search to SLTs that are even sparser than the source, leading to worse accuracy due… ▽ More

    Submitted 3 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: v2: Updates include additional experiments and revisions of some experiments

  2. arXiv:2312.06086  [pdf, other

    cs.AR

    HALO-CAT: A Hidden Network Processor with Activation-Localized CIM Architecture and Layer-Penetrative Tiling

    Authors: Yung-Chin Chen, Shimpei Ando, Daichi Fujiki, Shinya Takamaeda-Yamazaki, Kentaro Yoshioka

    Abstract: To address the 'memory wall' problem in NN hardware acceleration, we introduce HALO-CAT, a software-hardware co-design optimized for Hidden Neural Network (HNN) processing. HALO-CAT integrates Layer-Penetrative Tiling (LPT) for algorithmic efficiency, reducing intermediate result sizes. Furthermore, the architecture employs an activation-localized computing-in-memory approach to minimize data move… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  3. arXiv:2309.02680  [pdf, other

    cs.AR

    Vector-Processing for Mobile Devices: Benchmark and Analysis

    Authors: Alireza Khadem, Daichi Fujiki, Nishil Talati, Scott Mahlke, Reetuparna Das

    Abstract: Vector processing has become commonplace in today's CPU microarchitectures. Vector instructions improve performance and energy which is crucial for resource-constraint mobile devices. The research community currently lacks a comprehensive benchmark suite to study the benefits of vector processing for mobile devices. This paper presents Swan-an extensive vector processing benchmark suite for mobile… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 2023 IEEE International Symposium on Workload Characterization (IISWC)

  4. arXiv:2308.15040  [pdf, other

    cs.AR

    OSA-HCIM: On-The-Fly Saliency-Aware Hybrid SRAM CIM with Dynamic Precision Configuration

    Authors: Yung-Chin Chen, Shimpei Ando, Daichi Fujiki, Shinya Takamaeda-Yamazaki, Kentaro Yoshioka

    Abstract: Computing-in-Memory (CIM) has shown great potential for enhancing efficiency and performance for deep neural networks (DNNs). However, the lack of flexibility in CIM leads to an unnecessary expenditure of computational resources on less critical operations, and a diminished Signal-to-Noise Ratio (SNR) when handling more complex tasks, significantly hindering the overall performance. Hence, we focu… ▽ More

    Submitted 21 November, 2023; v1 submitted 29 August, 2023; originally announced August 2023.