Skip to main content

Showing 1–4 of 4 results for author: Chang, M F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.07991  [pdf, other

    cs.AR cs.DC

    TopSort: A High-Performance Two-Phase Sorting Accelerator Optimized on HBM-based FPGAs

    Authors: Weikang Qiao, Licheng Guo, Zhenman Fang, Mau-Chung Frank Chang, Jason Cong

    Abstract: The emergence of high-bandwidth memory (HBM) brings new opportunities to boost the performance of sorting acceleration on FPGAs, which was conventionally bounded by the available off-chip memory bandwidth. However, it is nontrivial for designers to fully utilize this immense bandwidth. First, the existing sorter designs cannot be directly scaled at the increasing rate of available off-chip bandwid… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

  2. arXiv:1709.06614  [pdf

    cs.ET cs.AR cs.LG

    An Analog Neural Network Computing Engine using CMOS-Compatible Charge-Trap-Transistor (CTT)

    Authors: Yuan Du, Li Du, Xuefeng Gu, Jieqiong Du, X. Shawn Wang, Boyu Hu, Mingzhe Jiang, Xiaoliang Chen, Junjie Su, Subramanian S. Iyer, Mau-Chung Frank Chang

    Abstract: An analog neural network computing engine based on CMOS-compatible charge-trap transistor (CTT) is proposed in this paper. CTT devices are used as analog multipliers. Compared to digital multipliers, CTT-based analog multiplier shows significant area and power reduction. The proposed computing engine is composed of a scalable CTT multiplier array and energy efficient analog-digital interfaces. Thr… ▽ More

    Submitted 9 August, 2018; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: 9 pages, 11 figures

  3. arXiv:1709.05116  [pdf

    cs.AR cs.AI

    A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications

    Authors: Yuan Du, Li Du, Yilei Li, Junjie Su, Mau-Chung Frank Chang

    Abstract: Deep convolutional neural networks (CNN) are widely used in modern artificial intelligence (AI) and smart vision systems but also limited by computation latency, throughput, and energy efficiency on a resource-limited scenario, such as mobile devices, internet of things (IoT), unmanned aerial vehicles (UAV), and so on. A hardware streaming architecture is proposed to accelerate convolution and poo… ▽ More

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: 5 pages, 8 figures

  4. arXiv:1707.02973  [pdf

    cs.CV cs.AR

    A Reconfigurable Streaming Deep Convolutional Neural Network Accelerator for Internet of Things

    Authors: Li Du, Yuan Du, Yilei Li, Mau-Chung Frank Chang

    Abstract: Convolutional neural network (CNN) offers significant accuracy in image detection. To implement image detection using CNN in the internet of things (IoT) devices, a streaming hardware accelerator is proposed. The proposed accelerator optimizes the energy efficiency by avoiding unnecessary data movement. With unique filter decomposition technique, the accelerator can support arbitrary convolution w… ▽ More

    Submitted 8 July, 2017; originally announced July 2017.