Skip to main content

Showing 1–15 of 15 results for author: Chuang, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.08017  [pdf, other

    cs.CV cs.CL cs.LG

    Lumos : Empowering Multimodal LLMs with Scene Text Recognition

    Authors: Ashish Shenoy, Yichao Lu, Srihari Jayakumar, Debojeet Chatterjee, Mohsen Moslehpour, Pierce Chuang, Abhay Harpale, Vikas Bhardwaj, Di Xu, Shicong Zhao, Longfang Zhao, Ankit Ramchandani, Xin Luna Dong, Anuj Kumar

    Abstract: We introduce Lumos, the first end-to-end multimodal question-answering system with text understanding capabilities. At the core of Lumos is a Scene Text Recognition (STR) component that extracts text from first person point-of-view images, the output of which is used to augment input to a Multimodal Large Language Model (MM-LLM). While building Lumos, we encountered numerous challenges related to… ▽ More

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to KDD 2024 (ADS Track)

  2. arXiv:2308.09612  [pdf, other

    cs.LG eess.SY

    Constrained Bayesian Optimization Using a Lagrange Multiplier Applied to Power Transistor Design

    Authors: **-Ju Chuang, Ali Saadat, Sara Ghazvini, Hal Edwards, William G. Vandenberghe

    Abstract: We propose a novel constrained Bayesian Optimization (BO) algorithm optimizing the design process of Laterally-Diffused Metal-Oxide-Semiconductor (LDMOS) transistors while realizing a target Breakdown Voltage (BV). We convert the constrained BO problem into a conventional BO problem using a Lagrange multiplier. Instead of directly optimizing the traditional Figure-of-Merit (FOM), we set the Lagran… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 7 pages, 5 figures

  3. arXiv:2306.00230  [pdf, other

    cs.CE cs.LG

    Predictive Limitations of Physics-Informed Neural Networks in Vortex Shedding

    Authors: Pi-Yueh Chuang, Lorena A. Barba

    Abstract: The recent surge of interest in physics-informed neural network (PINN) methods has led to a wave of studies that attest to their potential for solving partial differential equations (PDEs) and predicting the dynamics of physical systems. However, the predictive limitations of PINNs have not been thoroughly investigated. We look at the flow around a 2D cylinder and find that data-free PINNs are una… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  4. arXiv:2305.03584  [pdf, other

    cs.CL cs.AI

    Now It Sounds Like You: Learning Personalized Vocabulary On Device

    Authors: Sid Wang, Ashish Shenoy, Pierce Chuang, John Nguyen

    Abstract: In recent years, Federated Learning (FL) has shown significant advancements in its ability to perform various natural language processing (NLP) tasks. This work focuses on applying personalized FL for on-device language modeling. Due to limitations of memory and latency, these models cannot support the complexity of sub-word tokenization or beam search decoding, resulting in the decision to deploy… ▽ More

    Submitted 13 February, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Federated Learning, Personalization, On-device NLP, Accepted at AAAI Spring Symposium 2024

  5. arXiv:2205.14249  [pdf, other

    physics.flu-dyn cs.AI cs.LG

    Experience report of physics-informed neural networks in fluid simulations: pitfalls and frustration

    Authors: Pi-Yueh Chuang, Lorena A. Barba

    Abstract: Though PINNs (physics-informed neural networks) are now deemed as a complement to traditional CFD (computational fluid dynamics) solvers rather than a replacement, their ability to solve the Navier-Stokes equations without given data is still of great interest. This report presents our not-so-successful experiments of solving the Navier-Stokes equations with PINN as a replacement for traditional s… ▽ More

    Submitted 22 July, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: 8 pages, 9 figures

  6. arXiv:2110.08352  [pdf, other

    cs.SD cs.CL eess.AS

    Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet

    Authors: Haichuan Yang, Yuan Shangguan, Dilin Wang, Meng Li, Pierce Chuang, Xiaohui Zhang, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra

    Abstract: From wearables to powerful smart devices, modern automatic speech recognition (ASR) models run on a variety of edge devices with different computational budgets. To navigate the Pareto front of model accuracy vs model size, researchers are trapped in a dilemma of optimizing model accuracy by training and fine-tuning models for each individual edge device while kee** the training GPU-hours tracta… ▽ More

    Submitted 20 July, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  7. arXiv:2107.04677  [pdf, other

    cs.CL

    Noisy Training Improves E2E ASR for the Edge

    Authors: Dilin Wang, Yuan Shangguan, Haichuan Yang, Pierce Chuang, Jiatong Zhou, Meng Li, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra

    Abstract: Automatic speech recognition (ASR) has become increasingly ubiquitous on modern edge devices. Past work developed streaming End-to-End (E2E) all-neural speech recognizers that can run compactly on edge devices. However, E2E ASR models are prone to overfitting and have difficulties in generalizing to unseen testing data. Various techniques have been proposed to regularize the training of ASR models… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

  8. arXiv:2106.11890  [pdf, other

    cs.LG

    Latency-Aware Neural Architecture Search with Multi-Objective Bayesian Optimization

    Authors: David Eriksson, Pierce I-Jen Chuang, Samuel Daulton, Peng Xia, Akshat Shrivastava, Arun Babu, Shicong Zhao, Ahmed Aly, Ganesh Venkatesh, Maximilian Balandat

    Abstract: When tuning the architecture and hyperparameters of large machine learning models for on-device deployment, it is desirable to understand the optimal trade-offs between on-device latency and model accuracy. In this work, we leverage recent methodological advances in Bayesian optimization over high-dimensional search spaces and multi-objective Bayesian optimization to efficiently explore these trad… ▽ More

    Submitted 25 June, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: To Appear at the 8th ICML Workshop on Automated Machine Learning, ICML 2021

  9. arXiv:2104.07275  [pdf, other

    cs.CL

    Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing

    Authors: Akshat Shrivastava, Pierce Chuang, Arun Babu, Shrey Desai, Abhinav Arora, Alexander Zotov, Ahmed Aly

    Abstract: An effective recipe for building seq2seq, non-autoregressive, task-oriented parsers to map utterances to semantic frames proceeds in three steps: encoding an utterance $x$, predicting a frame's length |y|, and decoding a |y|-sized frame with utterance and ontology tokens. Though empirically strong, these models are typically bottlenecked by length prediction, as even small inaccuracies change the… ▽ More

    Submitted 14 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

  10. arXiv:2103.04958  [pdf, other

    cs.AR cs.CV

    F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding

    Authors: Xiaofan Zhang, Dawei Wang, Pierce Chuang, Shugao Ma, Deming Chen, Yuecheng Li

    Abstract: Creating virtual avatars with realistic rendering is one of the most essential and challenging tasks to provide highly immersive virtual reality (VR) experiences. It requires not only sophisticated deep neural network (DNN) based codec avatar decoders to ensure high visual quality and precise motion expression, but also efficient hardware accelerators to guarantee smooth real-time rendering using… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Published as a conference paper at Design Automation Conference 2021 (DAC'21)

  11. arXiv:2008.09916  [pdf, other

    cs.LG cs.CV eess.IV

    One Weight Bitwidth to Rule Them All

    Authors: Ting-Wu Chin, Pierce I-Jen Chuang, Vikas Chandra, Diana Marculescu

    Abstract: Weight quantization for deep ConvNets has shown promising results for applications such as image classification and semantic segmentation and is especially important for applications where memory storage is limited. However, when aiming for quantization without accuracy degradation, different tasks may end up with different bitwidths. This creates complexity for software and hardware support and t… ▽ More

    Submitted 28 August, 2020; v1 submitted 22 August, 2020; originally announced August 2020.

    Comments: Accepted at ECCV 2020 Embedded Vision Workshop (Best paper)

  12. arXiv:2003.06310  [pdf, other

    eess.SP cs.AR cs.CV cs.NE

    A Power-Efficient Binary-Weight Spiking Neural Network Architecture for Real-Time Object Classification

    Authors: Pai-Yu Tan, Po-Yao Chuang, Yen-Ting Lin, Cheng-Wen Wu, Juin-Ming Lu

    Abstract: Neural network hardware is considered an essential part of future edge devices. In this paper, we propose a binary-weight spiking neural network (BW-SNN) hardware architecture for low-power real-time object classification on edge platforms. This design stores a full neural network on-chip, and hence requires no off-chip bandwidth. The proposed systolic array maximizes data reuse for a typical conv… ▽ More

    Submitted 12 March, 2020; originally announced March 2020.

  13. arXiv:2002.05293  [pdf, other

    cs.CV cs.LG

    Improving Efficiency in Neural Network Accelerator Using Operands Hamming Distance optimization

    Authors: Meng Li, Yilei Li, Pierce Chuang, Liangzhen Lai, Vikas Chandra

    Abstract: Neural network accelerator is a key enabler for the on-device AI inference, for which energy efficiency is an important metric. The data-path energy, including the computation energy and the data movement energy among the arithmetic units, claims a significant part of the total accelerator energy. By revisiting the basic physics of the arithmetic logic circuits, we show that the data-path energy i… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 12 pages, 10 figures, 6 tables

  14. arXiv:1807.06964  [pdf, other

    cs.CV

    Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)

    Authors: Jungwook Choi, Pierce I-Jen Chuang, Zhuo Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

    Abstract: Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. In order to reduce this cost, several quantization schemes have gained attention recently with some focusing on weight quantization, and others focusing on quantizing activations. This paper proposes novel techniques that target weight and activation quantizations separately resulting in a… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1805.06085

  15. arXiv:1805.06085  [pdf, other

    cs.CV cs.AI

    PACT: Parameterized Clip** Activation for Quantized Neural Networks

    Authors: Jungwook Choi, Zhuo Wang, Swagath Venkataramani, Pierce I-Jen Chuang, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

    Abstract: Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. To address this cost, a number of quantization schemes have been proposed - but most of these techniques focused on quantizing weights, which are relatively smaller in size compared to activations. This paper proposes a novel quantization scheme for activations during training - that enabl… ▽ More

    Submitted 17 July, 2018; v1 submitted 15 May, 2018; originally announced May 2018.