Skip to main content

Showing 1–50 of 52 results for author: Chang, C

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.00637  [pdf, ps, other

    eess.SY

    A Distributed Model Identification Algorithm for Multi-Agent Systems

    Authors: Vivek Khatana, Chin-Yao Chang, Wenbo Wang

    Abstract: In this study, we investigate agent-based approach for system model identification with an emphasis on power distribution system applications. Departing from conventional practices of relying on historical data for offline model identification, we adopt an online update approach utilizing real-time data by employing the latest data points for gradient computation. This methodology offers advantage… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  2. arXiv:2402.09846  [pdf

    physics.ao-ph cs.LG eess.SP

    A Deep Learning Approach to Radar-based QPE

    Authors: Ting-Shuo Yo, Shih-Hao Su, Jung-Lien Chu, Chiao-Wei Chang, Hung-Chi Kuo

    Abstract: In this study, we propose a volume-to-point framework for quantitative precipitation estimation (QPE) based on the Quantitative Precipitation Estimation and Segregation Using Multiple Sensor (QPESUMS) Mosaic Radar data set. With a data volume consisting of the time series of gridded radar reflectivities over the Taiwan area, we used machine learning algorithms to establish a statistical model for… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 22 pages, 11 figures. Published in Earth and Space Science

    Journal ref: Earth Space Sci. 2021, 8, e2020EA001340

  3. arXiv:2401.11445  [pdf, other

    cs.RO eess.SY

    Towards Non-Robocentric Dynamic Landing of Quadrotor UAVs

    Authors: Li-Yu Lo, Boyang Li, Chih-Yung Wen, Ching-Wei Chang

    Abstract: In this work, we propose a dynamic landing solution without the need for onboard exteroceptive sensors and an expensive computation unit, where all localization and control modules are carried out on the ground in a non-inertial frame. Our system starts with a relative state estimator of the aerial robot from the perspective of the landing platform, where the state tracking of the UAV is done thro… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  4. arXiv:2312.17156  [pdf, other

    cs.SD eess.AS

    BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer

    Authors: Chih-Cheng Chang, Li Su

    Abstract: Many deep learning models have achieved dominant performance on the offline beat tracking task. However, online beat tracking, in which only the past and present input features are available, still remains challenging. In this paper, we propose BEAt tracking Streaming Transformer (BEAST), an online joint beat and downbeat tracking system based on the streaming Transformer. To deal with online scen… ▽ More

    Submitted 23 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  5. arXiv:2312.14453  [pdf, other

    cs.RO eess.SY

    Hybrid Aerodynamics-Based Model Predictive Control for a Tail-Sitter UAV

    Authors: Bailun Jiang, Boyang Li, Ching-Wei Chang, Chih-Yung Wen

    Abstract: It is challenging to model and control a tail-sitter unmanned aerial vehicle (UAV) because its blended wing body generates complicated nonlinear aerodynamic effects, such as wing lift, fuselage drag, and propeller-wing interactions. We therefore devised a hybrid aerodynamic modeling method and model predictive control (MPC) design for a quadrotor tail-sitter UAV. The hybrid model consists of the N… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  6. All Attention U-NET for Semantic Segmentation of Intracranial Hemorrhages In Head CT Images

    Authors: Chia Shuo Chang, Tian Sheuan Chang, Jiun Lin Yan, Li Ko

    Abstract: Intracranial hemorrhages in head CT scans serve as a first line tool to help specialists diagnose different types. However, their types have diverse shapes in the same type but similar confusing shape, size and location between types. To solve this problem, this paper proposes an all attention U-Net. It uses channel attentions in the U-Net encoder side to enhance class specific feature extraction,… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 2022 IEEE Biomedical Circuits and Systems Conference (BioCAS)

  7. arXiv:2311.12666  [pdf, other

    cs.LG eess.SP

    SSVEP-DAN: A Data Alignment Network for SSVEP-based Brain Computer Interfaces

    Authors: Sung-Yu Chen, Chi-Min Chang, Kuan-Jung Chiang, Chun-Shu Wei

    Abstract: Steady-state visual-evoked potential (SSVEP)-based brain-computer interfaces (BCIs) offer a non-invasive means of communication through high-speed speller systems. However, their efficiency heavily relies on individual training data obtained during time-consuming calibration sessions. To address the challenge of data insufficiency in SSVEP-based BCIs, we present SSVEP-DAN, the first dedicated neur… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  8. arXiv:2311.10641  [pdf

    physics.med-ph eess.IV

    Image-Domain Material Decomposition for Dual-energy CT using Unsupervised Learning with Data-fidelity Loss

    Authors: Junbo Peng, Chih-Wei Chang, Huiqiao Xie, Richard L. J. Qiu, Justin Roper, Tonghe Wang, Beth Bradshaw, Xiangyang Tang, Xiaofeng Yang

    Abstract: Background: Dual-energy CT (DECT) and material decomposition play vital roles in quantitative medical imaging. However, the decomposition process may suffer from significant noise amplification, leading to severely degraded image signal-to-noise ratios (SNRs). While existing iterative algorithms perform noise suppression using different image priors, these heuristic image priors cannot accurately… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  9. arXiv:2311.04241  [pdf, ps, other

    eess.SP cs.AI cs.LG

    AI-Enabled Unmanned Vehicle-Assisted Reconfigurable Intelligent Surfaces: Deployment, Prototy**, Experiments, and Opportunities

    Authors: Li-Hsiang Shen, Kai-Ten Feng, Ta-Sung Lee, Yuan-Chun Lin, Shih-Cheng Lin, Chia-Chan Chang, Sheng-Fuh Chang

    Abstract: The requirement of wireless data demands is increasingly high as the sixth-generation (6G) technology evolves. Reconfigurable intelligent surface (RIS) is promisingly deemed to be one of 6G techniques for extending service coverage, reducing power consumption, and enhancing spectral efficiency. In this article, we have provided some fundamentals of RIS deployment in theory and hardware perspective… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  10. arXiv:2311.04234  [pdf

    eess.SP cs.CV cs.LG

    Leveraging sinusoidal representation networks to predict fMRI signals from EEG

    Authors: Yamin Li, Ange Lou, Ziyuan Xu, Shiyu Wang, Catie Chang

    Abstract: In modern neuroscience, functional magnetic resonance imaging (fMRI) has been a crucial and irreplaceable tool that provides a non-invasive window into the dynamics of whole-brain activity. Nevertheless, fMRI is limited by hemodynamic blurring as well as high cost, immobility, and incompatibility with metal implants. Electroencephalography (EEG) is complementary to fMRI and can directly record the… ▽ More

    Submitted 24 January, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

  11. arXiv:2308.13072  [pdf

    eess.IV cs.CV

    Full-dose Whole-body PET Synthesis from Low-dose PET Using High-efficiency Denoising Diffusion Probabilistic Model: PET Consistency Model

    Authors: Shaoyan Pan, Elham Abouei, Junbo Peng, Joshua Qian, Jacob F Wynne, Tonghe Wang, Chih-Wei Chang, Justin Roper, Jonathon A Nye, Hui Mao, Xiaofeng Yang

    Abstract: Objective: Positron Emission Tomography (PET) has been a commonly used imaging modality in broad clinical applications. One of the most important tradeoffs in PET imaging is between image quality and radiation dose: high image quality comes with high radiation exposure. Improving image quality is desirable for all clinical applications while minimizing radiation exposure is needed to reduce risk t… ▽ More

    Submitted 16 April, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  12. arXiv:2307.07650  [pdf, ps, other

    cs.LG cs.AI eess.SP

    SALC: Skeleton-Assisted Learning-Based Clustering for Time-Varying Indoor Localization

    Authors: An-Hung Hsiao, Li-Hsiang Shen, Chen-Yi Chang, Chun-Jie Chiu, Kai-Ten Feng

    Abstract: Wireless indoor localization has attracted significant amount of attention in recent years. Using received signal strength (RSS) obtained from WiFi access points (APs) for establishing fingerprinting database is a widely utilized method in indoor localization. However, the time-variant problem for indoor positioning systems is not well-investigated in existing literature. Compared to conventional… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  13. arXiv:2306.15808  [pdf, other

    cs.MM cs.SD eess.AS eess.SP

    Classification of Infant Sleep/Wake States: Cross-Attention among Large Scale Pretrained Transformer Networks using Audio, ECG, and IMU Data

    Authors: Kai Chieh Chang, Mark Hasegawa-Johnson, Nancy L. McElwain, Bashima Islam

    Abstract: Infant sleep is critical to brain and behavioral development. Prior studies on infant sleep/wake classification have been largely limited to reliance on expensive and burdensome polysomnography (PSG) tests in the laboratory or wearable devices that collect single-modality data. To facilitate data collection and accuracy of detection, we aimed to advance this field of study by using a multi-modal w… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Preprint for APSIPA2023

  14. arXiv:2306.06982  [pdf

    eess.IV cs.CV cs.LG

    Weakly Supervised Lesion Detection and Diagnosis for Breast Cancers with Partially Annotated Ultrasound Images

    Authors: Jian Wang, Liang Qiao, Shichong Zhou, ** Zhou, Jun Wang, Juncheng Li, Shihui Ying, Cai Chang, Jun Shi

    Abstract: Deep learning (DL) has proven highly effective for ultrasound-based computer-aided diagnosis (CAD) of breast cancers. In an automaticCAD system, lesion detection is critical for the following diagnosis. However, existing DL-based methods generally require voluminous manually-annotated region of interest (ROI) labels and class labels to train both the lesion detection and diagnosis models. In clini… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  15. arXiv:2306.01952  [pdf, other

    math.OC eess.SY

    Learning in Domain Randomization via Continuous Time Non-Stochastic Control

    Authors: **gwei Li, **g Dong, Can Chang, Baoxiang Wang, **gzhao Zhang

    Abstract: Domain randomization is a popular method for robustly training agents to adapt to diverse environments and real-world tasks. In this paper, we examine how to train an agent in domain randomization environments from a nonstochastic control perspective. We first theoretically study online control of continuous-time linear systems under nonstochastic noises. We present a novel two-level online algori… ▽ More

    Submitted 14 December, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  16. arXiv:2305.19467  [pdf

    eess.IV cs.CV

    Synthetic CT Generation from MRI using 3D Transformer-based Denoising Diffusion Model

    Authors: Shaoyan Pan, Elham Abouei, Jacob Wynne, Tonghe Wang, Richard L. J. Qiu, Yuheng Li, Chih-Wei Chang, Junbo Peng, Justin Roper, Pretesh Patel, David S. Yu, Hui Mao, Xiaofeng Yang

    Abstract: Magnetic resonance imaging (MRI)-based synthetic computed tomography (sCT) simplifies radiation therapy treatment planning by eliminating the need for CT simulation and error-prone image registration, ultimately reducing patient radiation dose and setup uncertainty. We propose an MRI-to-CT transformer-based denoising diffusion probabilistic model (MC-DDPM) to transform MRI into high-quality sCT to… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  17. arXiv:2305.00042  [pdf

    eess.IV cs.CV

    Cycle-guided Denoising Diffusion Probability Model for 3D Cross-modality MRI Synthesis

    Authors: Shaoyan Pan, Chih-Wei Chang, Junbo Peng, Jiahan Zhang, Richard L. J. Qiu, Tonghe Wang, Justin Roper, Tian Liu, Hui Mao, Xiaofeng Yang

    Abstract: This study aims to develop a novel Cycle-guided Denoising Diffusion Probability Model (CG-DDPM) for cross-modality MRI synthesis. The CG-DDPM deploys two DDPMs that condition each other to generate synthetic images from two different MRI pulse sequences. The two DDPMs exchange random latent noise in the reverse processes, which helps to regularize both DDPMs and generate matching images in two mod… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

  18. arXiv:2304.14688  [pdf, other

    eess.IV

    An Efficient Hash-based Data Structure for Dynamic Vision Sensors and its Application to Low-energy Low-memory Noise Filtering

    Authors: Pradeep Kumar Gopalakrishnan, Chip-Hong Chang, Arindam Basu

    Abstract: Events generated by the Dynamic Vision Sensor (DVS) are generally stored and processed in two-dimensional data structures whose memory complexity and energy-per-event scale proportionately with increasing sensor dimensions. In this paper, we propose a new two-dimensional data structure (BF_2) that takes advantage of the sparsity of events and enables compact storage of data using hash functions. I… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Supplementary material can be accessed at the link provided at the end of the manuscript

  19. arXiv:2304.11267  [pdf, other

    cs.CV cs.LG eess.IV

    Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations

    Authors: Yu-Hui Chen, Raman Sarokin, Juhyun Lee, Jiuqiang Tang, Chuo-Ling Chang, Andrei Kulik, Matthias Grundmann

    Abstract: The rapid development and application of foundation models have revolutionized the field of artificial intelligence. Large diffusion models have gained significant attention for their ability to generate photorealistic images and support various tasks. On-device deployment of these models provides benefits such as lower server costs, offline functionality, and improved user privacy. However, commo… ▽ More

    Submitted 16 June, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: 4 pages (not including references), 2 figures, 2 tables. Accepted to Efficient Deep Learning for Computer Vision workshop 2023

  20. arXiv:2304.06474  [pdf, ps, other

    eess.SP cs.LG

    Attention-based Learning for Sleep Apnea and Limb Movement Detection using Wi-Fi CSI Signals

    Authors: Chi-Che Chang, An-Hung Hsiao, Li-Hsiang Shen, Kai-Ten Feng, Chia-Yu Chen

    Abstract: Wi-Fi channel state information (CSI) has become a promising solution for non-invasive breathing and body motion monitoring during sleep. Sleep disorders of apnea and periodic limb movement disorder (PLMD) are often unconscious and fatal. The existing researches detect abnormal sleep disorders in impractically controlled environments. Moreover, it leads to compelling challenges to classify complex… ▽ More

    Submitted 26 March, 2023; originally announced April 2023.

  21. arXiv:2304.03172  [pdf, other

    eess.SY math.OC

    A Privacy Preserving Distributed Model Identification Algorithm for Power Distribution Systems

    Authors: Chin-Yao Chang

    Abstract: Distributed control/optimization is a promising approach for network systems due to its advantages over centralized schemes, such as robustness, cost-effectiveness, and improved privacy. However, distributed methods can have drawbacks, such as slower convergence rates due to limited knowledge of the overall network model. Additionally, ensuring privacy in the communication of sensitive information… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  22. arXiv:2211.09949  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Compressing Transformer-based self-supervised models for speech processing

    Authors: Tzu-Quan Lin, Tsung-Huan Yang, Chun-Yao Chang, Kuang-Ming Chen, Tzu-hsun Feng, Hung-yi Lee, Hao Tang

    Abstract: Despite the success of Transformers in self- supervised learning with applications to various downstream tasks, the computational cost of training and inference remains a major challenge for applying these models to a wide spectrum of devices. Several isolated attempts have been made to compress Transformers, but the settings and metrics are different across studies. Trade-off at various compressi… ▽ More

    Submitted 26 January, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Submitted to IEEE Transactions on Audio, Speech and Language Processing (TASLP)

  23. arXiv:2211.07357  [pdf, other

    cs.LG cs.AI eess.SY

    Controlling Commercial Cooling Systems Using Reinforcement Learning

    Authors: Jerry Luo, Cosmin Paduraru, Octavian Voicu, Yuri Chervonyi, Scott Munns, Jerry Li, Crystal Qian, Praneet Dutta, Jared Quincy Davis, Ningjia Wu, Xingwei Yang, Chu-Ming Chang, Ted Li, Rob Rose, Mingyan Fan, Hootan Nakhost, Tinglin Liu, Brian Kirkman, Frank Altamura, Lee Cline, Patrick Tonker, Joel Gouker, Dave Uden, Warren Buddy Bryan, Jason Law , et al. (11 additional authors not shown)

    Abstract: This paper is a technical overview of DeepMind and Google's recent work on reinforcement learning for controlling commercial cooling systems. Building on expertise that began with cooling Google's data centers more efficiently, we recently conducted live experiments on two real-world facilities in partnership with Trane Technologies, a building management system provider. These live experiments ha… ▽ More

    Submitted 14 December, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 27 pages, 11 figures

  24. arXiv:2210.08225  [pdf, other

    eess.IV cs.CV cs.LG

    Learned Video Compression for YUV 4:2:0 Content Using Flow-based Conditional Inter-frame Coding

    Authors: Yung-Han Ho, Chih-Hsuan Lin, Peng-Yu Chen, Mu-Jung Chen, Chih-Peng Chang, Wen-Hsiao Peng, Hsueh-Ming Hang

    Abstract: This paper proposes a learning-based video compression framework for variable-rate coding on YUV 4:2:0 content. Most existing learning-based video compression models adopt the traditional hybrid-based coding architecture, which involves temporal prediction followed by residual coding. However, recent studies have shown that residual coding is sub-optimal from the information-theoretic perspective.… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: Accepted by ISCAS 2022

  25. arXiv:2207.07931  [pdf

    eess.IV cs.LG

    Learnable Mixed-precision and Dimension Reduction Co-design for Low-storage Activation

    Authors: Yu-Shan Tai, Cheng-Yang Chang, Chieh-Fang Teng, AnYeu, Wu

    Abstract: Recently, deep convolutional neural networks (CNNs) have achieved many eye-catching results. However, deploying CNNs on resource-constrained edge devices is constrained by limited memory bandwidth for transmitting large intermediated data during inference, i.e., activation. Existing research utilizes mixed-precision and dimension reduction to reduce computational complexity but pays less attention… ▽ More

    Submitted 18 July, 2022; v1 submitted 16 July, 2022; originally announced July 2022.

  26. arXiv:2207.05315  [pdf, other

    cs.CV cs.LG eess.IV

    CANF-VC: Conditional Augmented Normalizing Flows for Video Compression

    Authors: Yung-Han Ho, Chih-Peng Chang, Peng-Yu Chen, Alessandro Gnutti, Wen-Hsiao Peng

    Abstract: This paper presents an end-to-end learning-based video compression system, termed CANF-VC, based on conditional augmented normalizing flows (CANF). Most learned video compression systems adopt the same hybrid-based coding architecture as the traditional codecs. Recent research on conditional coding has shown the sub-optimality of the hybrid-based coding and opens up opportunities for deep generati… ▽ More

    Submitted 14 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

  27. Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech Translation

    Authors: Chih-Chiang Chang, Hung-yi Lee

    Abstract: Simultaneous speech translation (SimulST) is a challenging task aiming to translate streaming speech before the complete input is observed. A SimulST system generally includes two components: the pre-decision that aggregates the speech information and the policy that decides to read or write. While recent works had proposed various strategies to improve the pre-decision, they mainly adopt the fixe… ▽ More

    Submitted 3 October, 2022; v1 submitted 22 March, 2022; originally announced April 2022.

    Comments: INTERSPEECH 2022 camera ready

    Journal ref: Proc. Interspeech 2022, 5175-5179

  28. Wi-Fi and Bluetooth Contact Tracing Without User Intervention

    Authors: Brosnan Yuen, Yifeng Bie, Duncan Cairns, Geoffrey Harper, Jason Xu, Charles Chang, Xiaodai Dong, Tao Lu

    Abstract: Previous contact tracing systems required the users to perform many manual actions, such as installing smartphone applications, joining wireless networks, or carrying custom user devices. This increases the barrier to entry and lowers the user adoption rate. As a result, the contact tracing effectiveness is reduced. Unlike the systems above, we propose a new privacy preserving Wi-Fi and Bluetooth… ▽ More

    Submitted 23 July, 2022; v1 submitted 30 March, 2022; originally announced April 2022.

    Report number: 2169-3536

    Journal ref: IEEE Access Volume 11 (2022) 91027-91044

  29. arXiv:2203.10597  [pdf, other

    cs.CR cs.LG eess.SY

    The Dark Side: Security Concerns in Machine Learning for EDA

    Authors: Zhiyao Xie, **gyu Pan, Chen-Chia Chang, Yiran Chen

    Abstract: The growing IC complexity has led to a compelling need for design efficiency improvement through new electronic design automation (EDA) methodologies. In recent years, many unprecedented efficient EDA methods have been enabled by machine learning (ML) techniques. While ML demonstrates its great potential in circuit design, however, the dark side about security problems, is seldomly discussed. This… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

  30. arXiv:2202.02518  [pdf, other

    cs.CV cs.MM eess.IV

    On the predictability in reversible steganography

    Authors: Ching-Chun Chang, Xu Wang, Sisheng Chen, Hitoshi Kiya, Isao Echizen

    Abstract: Artificial neural networks have advanced the frontiers of reversible steganography. The core strength of neural networks is the ability to render accurate predictions for a bewildering variety of data. Residual modulation is recognised as the most advanced reversible steganographic algorithm for digital images. The pivot of this algorithm is predictive analytics in which pixel intensities are pred… ▽ More

    Submitted 7 March, 2023; v1 submitted 5 February, 2022; originally announced February 2022.

    Journal ref: Telecommunication Systems (2023), vol. 82, no. 2, pp. 301-313

  31. Instrumented shoulder functional assessment using inertial measurement units for frozen shoulder

    Authors: Ting-Yang Lu, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao, Chia-Tai Chan

    Abstract: Frozen shoulder (FS) is a shoulder condition that leads to pain and loss of shoulder range of motion. FS patients have difficulties in independently performing daily activities. Inertial measurement units (IMUs) have been developed to objectively measure upper limb range of motion (ROM) and shoulder function. In this work, we propose an IMU-based shoulder functional task assessment with kinematic… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: 4 pages, 6 tables, 2 figures, To appear in 2021 IEEE BHI

  32. arXiv:2111.06046  [pdf, other

    cs.SD cs.AI eess.AS

    Music Score Expansion with Variable-Length Infilling

    Authors: Chih-Pin Tan, Chin-Jui Chang, Alvin W. Y. Su, Yi-Hsuan Yang

    Abstract: In this paper, we investigate using the variable-length infilling (VLI) model, which is originally proposed to infill missing segments, to "prolong" existing musical segments at musical boundaries. Specifically, as a case study, we expand 20 musical segments from 12 bars to 16 bars, and examine the degree to which the VLI model preserves musical boundaries in the expanded results using a few objec… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: Going to published as a late-breaking demo paper at ISMIR 2021

  33. arXiv:2110.08828  [pdf

    cs.CV cs.LG eess.SP

    Compression-aware Projection with Greedy Dimension Reduction for Convolutional Neural Network Activations

    Authors: Yu-Shan Tai, Chieh-Fang Teng, Cheng-Yang Chang, An-Yeu Wu

    Abstract: Convolutional neural networks (CNNs) achieve remarkable performance in a wide range of fields. However, intensive memory access of activations introduces considerable energy consumption, impeding deployment of CNNs on resourceconstrained edge devices. Existing works in activation compression propose to transform feature maps for higher compressibility, thus enabling dimension reduction. Neverthele… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: 5 pages, 5 figures, submitted to 2022 ICASSP

  34. arXiv:2109.10181  [pdf, other

    eess.SY

    Intelligent Traffic Control System by Using Image Information

    Authors: Zong-Ming Lin, Cheng-Yang Chang, Chin-Yu Hu, Yung-Yuan Chen

    Abstract: This paper implements a traffic signal control system by using real-time traffic flow feedback. This system is designed to deal with two-lane intersections. We construct an experiment field similar to the roads and drivers in Taiwan using an autonomous simulation software called Virtual Test Drive (VTD) released by MSC Software. We erect four cameras on the side of the roads to get the image of th… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 7 pages, 16 figures

  35. arXiv:2107.05223  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    BERT-like Pre-training for Symbolic Piano Music Classification Tasks

    Authors: Yi-Hui Chou, I-Chun Chen, Chin-Jui Chang, Joann Ching, Yi-Hsuan Yang

    Abstract: This article presents a benchmark study of symbolic piano music classification using the masked language modelling approach of the Bidirectional Encoder Representations from Transformers (BERT). Specifically, we consider two types of MIDI data: MIDI scores, which are musical scores rendered directly into MIDI with no dynamics and precisely aligned with the metrical grid notated by its composer and… ▽ More

    Submitted 13 April, 2024; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted to Journal of Creative Music Systems

  36. arXiv:2106.06924  [pdf, other

    cs.MM cs.CV eess.IV

    Deep Learning for Predictive Analytics in Reversible Steganography

    Authors: Ching-Chun Chang, Xu Wang, Sisheng Chen, Isao Echizen, Victor Sanchez, Chang-Tsun Li

    Abstract: Deep learning is regarded as a promising solution for reversible steganography. There is an accelerating trend of representing a reversible steo-system by monolithic neural networks, which bypass intermediate operations in traditional pipelines of reversible steganography. This end-to-end paradigm, however, suffers from imperfect reversibility. By contrast, the modular paradigm that incorporates n… ▽ More

    Submitted 7 March, 2023; v1 submitted 13 June, 2021; originally announced June 2021.

    Journal ref: IEEE Access (2023), vol. 11, pp. 3494-3510

  37. arXiv:2104.13895  [pdf, other

    eess.SY

    Closed-loop Control Design and Motor Allocation for a Lower-limb Cable-driven Exoskeleton: A Switched Systems Approach

    Authors: Chen-Hao Chang, Jonathan Casas, Victor H. Duenas

    Abstract: Powered lower-limb exoskeletons provide assistive torques to coordinate limb motion during walking in individuals with movement disorders. Advances in sensing and actuation have improved the wearability and portability of state-of-the-art exoskeletons for walking. Cable-driven exoskeletons offload the actuators away from the user, thus rendering light-weight devices to facilitate locomotion traini… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

  38. arXiv:2011.07442  [pdf, other

    cs.SD cs.LG eess.AS

    Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information

    Authors: Yen-Ju Lu, Chia-Yu Chang, Cheng Yu, Ching-Feng Liu, Jeih-weih Hung, Shinji Watanabe, Yu Tsao

    Abstract: Previous studies have confirmed that by augmenting acoustic features with the place/manner of articulatory features, the speech enhancement (SE) process can be guided to consider the broad phonetic properties of the input speech when performing enhancement to attain performance improvements. In this paper, we explore the contextual information of articulatory attributes as additional information t… ▽ More

    Submitted 18 June, 2023; v1 submitted 14 November, 2020; originally announced November 2020.

    Comments: To appear in IEEE Transactions on Audio, Speech and Language Processing (TASLP)

  39. arXiv:2011.07406  [pdf, other

    cs.LG eess.SP

    Using Convolutional Variational Autoencoders to Predict Post-Trauma Health Outcomes from Actigraphy Data

    Authors: Ayse S. Cakmak, Nina Thigpen, Garrett Honke, Erick Perez Alday, Ali Bahrami Rad, Rebecca Adaimi, Chia Jung Chang, Qiao Li, Pramod Gupta, Thomas Neylan, Samuel A. McLean, Gari D. Clifford

    Abstract: Depression and post-traumatic stress disorder (PTSD) are psychiatric conditions commonly associated with experiencing a traumatic event. Estimating mental health status through non-invasive techniques such as activity-based algorithms can help to identify successful early interventions. In this work, we used locomotor activity captured from 1113 individuals who wore a research grade smartwatch pos… ▽ More

    Submitted 19 November, 2020; v1 submitted 14 November, 2020; originally announced November 2020.

    Comments: Fixed typo in author affiliations

  40. arXiv:2011.04101  [pdf, other

    eess.SY

    Enabling DER Participation in Frequency Regulation Markets

    Authors: Priyank Srivastava, Chin-Yao Chang, Jorge Cortes

    Abstract: Distributed energy resources (DERs) are playing an increasing role in ancillary services for the bulk grid, particularly in frequency regulation. In this paper, we propose a framework for collections of DERs, combined to form microgrids and controlled by aggregators, to participate in frequency regulation markets. Our approach covers both the identification of bids for the market clearing stage an… ▽ More

    Submitted 29 January, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

    Comments: 14 pages, 8 figures

  41. arXiv:2009.14668  [pdf

    eess.AS cs.LG cs.SD

    Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion

    Authors: Che-Jui Chang

    Abstract: Cross-lingual voice conversion (VC) is a task that aims to synthesize target voices with the same content while source and target speakers speak in different languages. Its challenge lies in the fact that the source and target data are naturally non-parallel, and it is even difficult to bridge the gaps between languages with no transcriptions provided. In this paper, we focus on knowledge transfer… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  42. arXiv:2009.01759  [pdf, other

    eess.AS cs.SD

    Intra-Utterance Similarity Preserving Knowledge Distillation for Audio Tagging

    Authors: Chun-Chieh Chang, Chieh-Chi Kao, Ming Sun, Chao Wang

    Abstract: Knowledge Distillation (KD) is a popular area of research for reducing the size of large models while still maintaining good performance. The outputs of larger teacher models are used to guide the training of smaller student models. Given the repetitive nature of acoustic events, we propose to leverage this information to regulate the KD training for Audio Tagging. This novel KD method, "Intra-Utt… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Accepted to Interspeech 2020

  43. arXiv:2004.14252  [pdf

    eess.SP

    Task-Projected Hyperdimensional Computing for Multi-Task Learning

    Authors: Cheng-Yang Chang, Yu-Chuan Chuang, An-Yeu Wu

    Abstract: Brain-inspired Hyperdimensional (HD) computing is an emerging technique for cognitive tasks in the field of low-power design. As a fast-learning and energy-efficient computational paradigm, HD computing has shown great success in many real-world applications. However, an HD model incrementally trained on multiple tasks suffers from the negative impacts of catastrophic forgetting. The model forgets… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: To be published in 16th International Conference on Artificial Intelligence Applications and Innovations

  44. arXiv:2004.07980  [pdf, other

    eess.SY cs.RO cs.SE eess.SP

    Co-simulation Platform for Develo** InfoRich Energy-Efficient Connected and Automated Vehicles

    Authors: Shunsuke Aoki, Lung En Jan, Junfeng Zhao, Anand Bhat, Ragunathan, Rajkumar, Chen-Fang Chang

    Abstract: With advances in sensing, computing, and communication technologies, Connected and Automated Vehicles (CAVs) are becoming feasible. The advent of CAVs presents new opportunities to improve the energy efficiency of individual vehicles. However, testing and verifying energy-efficient autonomous driving systems are difficult due to safety considerations and repeatability. In this paper, we present a… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

  45. arXiv:2001.04489  [pdf, other

    quant-ph cs.ET eess.SY math.OC

    On the Computational Viability of Quantum Optimization for PMU Placement

    Authors: Eric B. Jones, Eliot Kapit, Chin-Yao Chang, David Biagioni, Deepthi Vaidhynathan, Peter Graf, Wesley Jones

    Abstract: Using optimal phasor measurement unit placement as a prototypical problem, we assess the computational viability of the current generation D-Wave Systems 2000Q quantum annealer for power systems design problems. We reformulate minimum dominating set for the annealer hardware, solve the reformulation for a standard set of IEEE test systems, and benchmark solution quality and time to solution agains… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

  46. arXiv:2001.01052  [pdf, ps, other

    eess.SP

    Joint Beamforming and Computation Offloading for Multi-user Mobile-Edge Computing

    Authors: Changfeng Ding, Jun-Bo Wang, Ming Cheng, Chuanwen Chang, **-Yuan Wang, Min Lin

    Abstract: Mobile edge computing (MEC) is considered as an efficient method to relieve the computation burden of mobile devices. In order to reduce the energy consumption and time delay of mobile devices (MDs) in MEC, multiple users multiple input and multiple output (MU-MIMO) communications is considered to be applied to the MEC system. The purpose of this paper is to minimize the weighted sum of energy con… ▽ More

    Submitted 4 January, 2020; originally announced January 2020.

  47. A Reinforcement Learning Approach for the Multichannel Rendezvous Problem

    Authors: Jen-Hung Wang, **-En Lu, Cheng-Shang Chang, Duan-Shin Lee

    Abstract: In this paper, we consider the multichannel rendezvous problem in cognitive radio networks (CRNs) where the probability that two users hop** on the same channel have a successful rendezvous is a function of channel states. The channel states are modelled by two-state Markov chains that have a good state and a bad state. These channel states are not observable by the users. For such a multichanne… ▽ More

    Submitted 5 July, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: 5 pages, 9 figures. arXiv admin note: text overlap with arXiv:1906.10424

    Journal ref: 2019 IEEE Globecom Workshops (GC Wkshps), Waikoloa, HI, USA, 2019, pp. 1-5

  48. arXiv:1905.00190  [pdf, other

    eess.IV cs.MM

    Learned Image Compression with Soft Bit-based Rate-Distortion Optimization

    Authors: David Alexandre, Chih-Peng Chang, Wen-Hsiao Peng, Hsueh-Ming Hang

    Abstract: This paper introduces the notion of soft bits to address the rate-distortion optimization for learning-based image compression. Recent methods for such compression train an autoencoder end-to-end with an objective to strike a balance between distortion and rate. They are faced with the zero gradient issue due to quantization and the difficulty of estimating the rate accurately. Inspired by soft qu… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

  49. Receiver Operating Characteristics for a Prototype Quantum Two-Mode Squeezing Radar

    Authors: David Luong, C. W. Sandbo Chang, A. M. Vadiraj, Anthony Damini, C. M. Wilson, Bhashyam Balaji

    Abstract: We have built and evaluated a prototype quantum radar, which we call a quantum two-mode squeezing radar (QTMS radar), in the laboratory. It operates solely at microwave frequencies; there is no downconversion from optical frequencies. Because the signal generation process relies on quantum mechanical principles, the system is considered to contain a quantum-enhanced radar transmitter. This transmi… ▽ More

    Submitted 28 February, 2019; originally announced March 2019.

    Comments: 17 pages, 17 figures; submitted to IEEE Transactions on Aerospace and Electronic Systems

  50. arXiv:1811.12214  [pdf, other

    cs.SD eess.AS

    Play as You Like: Timbre-enhanced Multi-modal Music Style Transfer

    Authors: Chien-Yu Lu, Min-Xin Xue, Chia-Che Chang, Che-Rung Lee, Li Su

    Abstract: Style transfer of polyphonic music recordings is a challenging task when considering the modeling of diverse, imaginative, and reasonable music pieces in the style different from their original one. To achieve this, learning stable multi-modal representations for both domain-variant (i.e., style) and domain-invariant (i.e., content) information of music in an unsupervised manner is critical. In th… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.