Skip to main content

Showing 1–11 of 11 results for author: Cheung, P Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15045  [pdf, other

    cs.CL

    Harnessing Knowledge Retrieval with Large Language Models for Clinical Report Error Correction

    Authors: **ge Wu, Zhaolong Wu, Abul Hasan, Yunsoo Kim, Jason P. Y. Cheung, Teng Zhang, Honghan Wu

    Abstract: This study proposes an approach for error correction in clinical radiology reports, leveraging large language models (LLMs) and retrieval-augmented generation (RAG) techniques. The proposed framework employs internal and external retrieval mechanisms to extract relevant medical entities and relations from the report and external knowledge sources. A three-stage inference process is introduced, dec… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.09103  [pdf, other

    cs.CL

    Chain-of-Though (CoT) prompting strategies for medical error detection and correction

    Authors: Zhaolong Wu, Abul Hasan, **ge Wu, Yunsoo Kim, Jason P. Y. Cheung, Teng Zhang, Honghan Wu

    Abstract: This paper describes our submission to the MEDIQA-CORR 2024 shared task for automatically detecting and correcting medical errors in clinical notes. We report results for three methods of few-shot In-Context Learning (ICL) augmented with Chain-of-Thought (CoT) and reason prompts using a large language model (LLM). In the first method, we manually analyse a subset of train and validation dataset to… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: accepted as NAACL workshop

  3. arXiv:2207.12943  [pdf, other

    cs.CY

    Unique in what sense? Heterogeneous relationships between multiple types of uniqueness and popularity in music

    Authors: Yulin Yu, Pui Yin Cheung, Yong-Yeol Ahn, Paramveer Dhillon

    Abstract: How does our society appreciate the uniqueness of cultural products? This fundamental puzzle has intrigued scholars in many fields, including psychology, sociology, anthropology, and marketing. It has been theorized that cultural products that balance familiarity and novelty are more likely to become popular. However, a cultural product's novelty is typically multifaceted. This paper uses songs as… ▽ More

    Submitted 9 April, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted at the International AAAI Conference on Web and Social Media (ICWSM, 2023). Special Recognition Award at 7th International Conference on Computational Social Science (IC2S2, 2021)

  4. Logic Shrinkage: Learned FPGA Netlist Sparsity for Efficient Neural Network Inference

    Authors: Erwei Wang, James J. Davis, Georgios-Ilias Stavrou, Peter Y. K. Cheung, George A. Constantinides, Mohamed S. Abdelfattah

    Abstract: FPGA-specific DNN architectures using the native LUTs as independently trainable inference operators have been shown to achieve favorable area-accuracy and energy-accuracy tradeoffs. The first work in this area, LUTNet, exhibited state-of-the-art performance for standard DNN benchmarks. In this paper, we propose the learned optimization of such LUT-based topologies, resulting in higher-efficiency… ▽ More

    Submitted 2 January, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

    Comments: Accepted manuscript uploaded 04/12/21. DOA 22/11/21

  5. arXiv:2106.14089  [pdf, other

    cs.LG cs.AR physics.ins-det

    Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

    Authors: Zhiqiang Que, Erwei Wang, Umar Marikar, Eric Moreno, Jennifer Ngadiuba, Hamza Javed, Bartłomiej Borzyszkowski, Thea Aarrestad, Vladimir Loncar, Sioni Summers, Maurizio Pierini, Peter Y Cheung, Wayne Luk

    Abstract: This paper presents novel reconfigurable architectures for reducing the latency of recurrent neural networks (RNNs) that are used for detecting gravitational waves. Gravitational interferometers such as the LIGO detectors capture cosmic events such as black hole mergers which happen at unknown times and of varying durations, producing time-series data. We have developed a new architecture capable… ▽ More

    Submitted 26 June, 2021; originally announced June 2021.

    Comments: Accepted at the 2021 32nd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP)

  6. arXiv:2102.04270  [pdf, other

    cs.LG cs.AR

    Enabling Binary Neural Network Training on the Edge

    Authors: Erwei Wang, James J. Davis, Daniele Moro, Piotr Zielinski, Jia Jie Lim, Claudionor Coelho, Satrajit Chatterjee, Peter Y. K. Cheung, George A. Constantinides

    Abstract: The ever-growing computational demands of increasingly complex machine learning models frequently necessitate the use of powerful cloud-based infrastructure for their training. Binary neural networks are known to be promising candidates for on-device inference due to their extreme compute and memory savings over higher-precision alternatives. However, their existing training methods require the co… ▽ More

    Submitted 24 September, 2023; v1 submitted 8 February, 2021; originally announced February 2021.

  7. arXiv:1910.12625  [pdf, other

    cs.LG cs.CV eess.SP stat.ML

    LUTNet: Learning FPGA Configurations for Highly Efficient Neural Network Inference

    Authors: Erwei Wang, James J. Davis, Peter Y. K. Cheung, George A. Constantinides

    Abstract: Research has shown that deep neural networks contain significant redundancy, and thus that high classification accuracy can be achieved even when weights and activations are quantized down to binary values. Network binarization on FPGAs greatly increases area efficiency by replacing resource-hungry multipliers with lightweight XNOR gates. However, an FPGA's fundamental building block, the K-LUT, i… ▽ More

    Submitted 2 March, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.00938. Accepted manuscript uploaded 02/03/20. DOA 01/03/20

  8. arXiv:1910.10075  [pdf, other

    eess.SP cs.AR cs.LG

    Automatic Generation of Multi-precision Multi-arithmetic CNN Accelerators for FPGAs

    Authors: Yiren Zhao, Xitong Gao, Xuan Guo, Junyi Liu, Erwei Wang, Robert Mullins, Peter Y. K. Cheung, George Constantinides, Cheng-Zhong Xu

    Abstract: Modern deep Convolutional Neural Networks (CNNs) are computationally demanding, yet real applications often require high throughput and low latency. To help tackle these problems, we propose Tomato, a framework designed to automate the process of generating efficient CNN accelerators. The generated design is pipelined and each convolution layer uses different arithmetics at various precisions. Usi… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: To be published in International Conference on Field Programmable Technology 2019

  9. arXiv:1904.00938  [pdf, other

    cs.LG stat.ML

    LUTNet: Rethinking Inference in FPGA Soft Logic

    Authors: Erwei Wang, James J. Davis, Peter Y. K. Cheung, George A. Constantinides

    Abstract: Research has shown that deep neural networks contain significant redundancy, and that high classification accuracies can be achieved even when weights and activations are quantised down to binary values. Network binarisation on FPGAs greatly increases area efficiency by replacing resource-hungry multipliers with lightweight XNOR gates. However, an FPGA's fundamental building block, the K-LUT, is c… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: Accepted manuscript uploaded 01/04/19. DOA 03/03/19

  10. Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going

    Authors: Erwei Wang, James J. Davis, Ruizhe Zhao, Ho-Cheung Ng, Xinyu Niu, Wayne Luk, Peter Y. K. Cheung, George A. Constantinides

    Abstract: Deep neural networks have proven to be particularly effective in visual and audio recognition tasks. Existing models tend to be computationally expensive and memory intensive, however, and so methods for hardware-oriented approximation have become a hot topic. Research has shown that custom hardware-based neural network accelerators can surpass their general-purpose processor equivalents in terms… ▽ More

    Submitted 8 July, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

    Comments: Accepted manuscript uploaded 21/01/19. DOA 15/01/19

    Journal ref: ACM Comput. Surv. 52, 2, Article 40 (May 2019), 39 pages

  11. arXiv:1807.10577  [pdf, other

    cs.CV

    Accuracy to Throughput Trade-offs for Reduced Precision Neural Networks on Reconfigurable Logic

    Authors: Jiang Su, Nicholas J. Fraser, Giulio Gambardella, Michaela Blott, Gianluca Durelli, David B. Thomas, Philip Leong, Peter Y. K. Cheung

    Abstract: Modern CNN are typically based on floating point linear algebra based implementations. Recently, reduced precision NN have been gaining popularity as they require significantly less memory and computational resources compared to floating point. This is particularly important in power constrained compute environments. However, in many cases a reduction in precision comes at a small cost to the accu… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: Accepted by ARC 2018