Skip to main content

Showing 1–10 of 10 results for author: Chang, P

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.17976  [pdf, other

    eess.SY

    The Role of Electric Grid Research in Addressing Climate Change

    Authors: Le Xie, Subir Majumder, Tong Huang, Qian Zhang, ** Chang, David J. Hill, Mohammad Shahidehpour

    Abstract: Addressing the urgency of climate change necessitates a coordinated and inclusive effort from all relevant stakeholders. Critical to this effort is the modeling, analysis, control, and integration of technological innovations within the electric energy system, which plays a crucial role in scaling up climate change solutions. This perspective article presents a set of research challenges and oppor… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 17 pages, 2 figures

  2. arXiv:2305.04929  [pdf, other

    physics.ao-ph eess.SY

    Impact of Climate Simulation Resolutions on Future Energy System Reliability Assessment: A Texas Case Study

    Authors: Xiangtian Zheng, Le Xie, Kiyeob Lee, Dan Fu, Jiahan Wu, ** Chang

    Abstract: The reliability of energy systems is strongly influenced by the prevailing climate conditions. With the increasing prevalence of renewable energy sources, the interdependence between energy and climate systems has become even stronger. This study examines the impact of different spatial resolutions in climate modeling on energy grid reliability assessment, with the Texas interconnection between 20… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  3. A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition

    Authors: Ruchao Fan, Wei Chu, Peng Chang, Abeer Alwan

    Abstract: Recently, end-to-end models have been widely used in automatic speech recognition (ASR) systems. Two of the most representative approaches are connectionist temporal classification (CTC) and attention-based encoder-decoder (AED) models. Autoregressive transformers, variants of AED, adopt an autoregressive mechanism for token generation and thus are relatively slow during inference. In this paper,… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: Published in IEEE Transactions on Audio, Speech, and Language Processing

  4. arXiv:2207.01011  [pdf, other

    eess.IV cs.CV cs.LG

    Facial Image Reconstruction from Functional Magnetic Resonance Imaging via GAN Inversion with Improved Attribute Consistency

    Authors: Pei-Chun Chang, Yan-Yu Tien, Chia-Lin Chen, Li-Fen Chen, Yong-Sheng Chen, Hui-Ling Chan

    Abstract: Neuroscience studies have revealed that the brain encodes visual content and embeds information in neural activity. Recently, deep learning techniques have facilitated attempts to address visual reconstructions by map** brain activity to image stimuli using generative adversarial networks (GANs). However, none of these studies have considered the semantic meaning of latent code in image space. O… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: Accepted at the 2022 International Joint Conference on Neural Networks (IJCNN 2022)

  5. Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment

    Authors: Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James Glass

    Abstract: Automatic pronunciation assessment is an important technology to help self-directed language learners. While pronunciation quality has multiple aspects including accuracy, fluency, completeness, and prosody, previous efforts typically only model one aspect (e.g., accuracy) at one granularity (e.g., at the phoneme-level). In this work, we explore modeling multi-aspect pronunciation assessment at mu… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted at ICASSP 2022. Code at https://github.com/YuanGongND/gopt Interactive Colab demo at https://colab.research.google.com/github/YuanGongND/gopt/blob/master/colab/GOPT_GPU.ipynb . ICASSP 2022

  6. arXiv:2201.08221  [pdf

    physics.ins-det eess.SP

    A 1.5GS/s 8b Pipelined-SAR ADC with Output Level Shifting Settling Technique in 14nm CMOS

    Authors: Yuanming Zhu, Shengchang Cai, Shiva Kiran, Yang-Hang Fan, Po-Hsuan Chang, Sebastian Hoyos, Samuel Palermo

    Abstract: A single channel 1.5GS/s 8-bit pipelined-SAR ADC utilizes a novel output level shifting (OLS) settling technique to reduce the power and enable low-voltage operation of the dynamic residue amplifier. The ADC consists of a 4-bit first stage and a 5-bit second stage, with 1-bit redundancy to relax the offset, gain, and settling requirements of the first stage. Employing the OLS technique allows for… ▽ More

    Submitted 20 August, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Comments: it is a 4 page and 9 figure IEEE Custom Integrated Circuit Conference paper

    Journal ref: IEEE Custom Integrated Circuit Conference 2020

  7. arXiv:2109.08910  [pdf, other

    cs.SD cs.AI eess.AS

    MS-SincResNet: Joint learning of 1D and 2D kernels using multi-scale SincNet and ResNet for music genre classification

    Authors: Pei-Chun Chang, Yong-Sheng Chen, Chang-Hsing Lee

    Abstract: In this study, we proposed a new end-to-end convolutional neural network, called MS-SincResNet, for music genre classification. MS-SincResNet appends 1D multi-scale SincNet (MS-SincNet) to 2D ResNet as the first convolutional layer in an attempt to jointly learn 1D kernels and 2D kernels during the training stage. First, an input music signal is divided into a number of fixed-duration (3 seconds i… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

  8. arXiv:2106.09885  [pdf, other

    eess.AS cs.AI

    An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition

    Authors: Ruchao Fan, Wei Chu, Peng Chang, **g Xiao, Abeer Alwan

    Abstract: Non-autoregressive mechanisms can significantly decrease inference time for speech transformers, especially when the single step variant is applied. Previous work on CTC alignment-based single step non-autoregressive transformer (CASS-NAT) has shown a large real time factor (RTF) improvement over autoregressive transformers (AT). In this work, we propose several methods to improve the accuracy of… ▽ More

    Submitted 21 July, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted to Interspeech2021

  9. arXiv:2010.14725  [pdf, other

    eess.AS cs.CL cs.SD

    CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition

    Authors: Ruchao Fan, Wei Chu, Peng Chang, **g Xiao

    Abstract: We propose a CTC alignment-based single step non-autoregressive transformer (CASS-NAT) for speech recognition. Specifically, the CTC alignment contains the information of (a) the number of tokens for decoder input, and (b) the time span of acoustics for each token. The information are used to extract acoustic representation for each token in parallel, referred to as token-level acoustic embedding… ▽ More

    Submitted 11 February, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Accepted to ICASSP2021, camera ready version

  10. arXiv:1903.03474  [pdf, other

    physics.app-ph eess.SP physics.optics

    Demonstration of multivariate photonics: blind dimensionality reduction with analog integrated photonics

    Authors: Alexander N. Tait, Philip Y. Ma, Thomas Ferreira de Lima, Eric C. Blow, Matthew P. Chang, Mitchell A. Nahmias, Bhavin J. Shastri, Paul R. Prucnal

    Abstract: Multi-antenna radio front-ends generate a multi-dimensional flood of information, most of which is partially redundant. Redundancy is eliminated by dimensionality reduction, but contemporary digital processing techniques face harsh fundamental tradeoffs when implementing this class of functions. These tradeoffs can be broken in the analog domain, in which the performance of optical technologies gr… ▽ More

    Submitted 10 February, 2019; originally announced March 2019.

    Comments: 24 pages, 7 figures