Skip to main content

Showing 1–23 of 23 results for author: Chang, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2401.11095  [pdf, other

    cs.HC cs.SD eess.AS

    SoundShift: Exploring Sound Manipulations for Accessible Mixed-Reality Awareness

    Authors: Ruei-Che Chang, Chia-Sheng Hung, Bing-Yu Chen, Dhruv Jain, Anhong Guo

    Abstract: Mixed-reality (MR) soundscapes blend real-world sound with virtual audio from hearing devices, presenting intricate auditory information that is hard to discern and differentiate. This is particularly challenging for blind or visually impaired individuals, who rely on sounds and descriptions in their everyday lives. To understand how complex audio information is consumed, we analyzed online forum… ▽ More

    Submitted 26 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: DIS 2024

  2. arXiv:2311.18168  [pdf, other

    cs.CV cs.LG eess.AS

    Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications

    Authors: Karren D. Yang, Anurag Ranjan, Jen-Hao Rick Chang, Raviteja Vemulapalli, Oncel Tuzel

    Abstract: We consider the task of animating 3D facial geometry from speech signal. Existing works are primarily deterministic, focusing on learning a one-to-one map** from speech signal to 3D face meshes on small datasets with limited speakers. While these models can achieve high-quality lip articulation for speakers in the training set, they are unable to capture the full and diverse distribution of 3D f… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  3. arXiv:2310.15130  [pdf, other

    cs.SD cs.CV eess.AS

    Novel-View Acoustic Synthesis from 3D Reconstructed Rooms

    Authors: Byeongjoo Ahn, Karren Yang, Brian Hamilton, Jonathan Sheaffer, Anurag Ranjan, Miguel Sarabia, Oncel Tuzel, Jen-Hao Rick Chang

    Abstract: We investigate the benefit of combining blind audio recordings with 3D scene information for novel-view acoustic synthesis. Given audio recordings from 2-4 microphones and the 3D geometry and material of a scene containing multiple unknown sound sources, we estimate the sound anywhere in the scene. We identify the main challenges of novel-view acoustic synthesis as sound source localization, separ… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  4. arXiv:2309.10707  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models

    Authors: Hsuan Su, Ting-Yao Hu, Hema Swetha Koppula, Raviteja Vemulapalli, Jen-Hao Rick Chang, Karren Yang, Gautam Varma Mantena, Oncel Tuzel

    Abstract: While Automatic Speech Recognition (ASR) systems are widely used in many real-world applications, they often do not generalize well to new domains and need to be finetuned on data from these domains. However, target-domain data usually are not readily available in many scenarios. In this paper, we propose a new strategy for adapting ASR models to new target domains without any text or speech from… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  5. arXiv:2308.10790  [pdf

    eess.IV cs.CV

    Extraction of Text from Optic Nerve Optical Coherence Tomography Reports

    Authors: Iyad Majid, Youchen Victor Zhang, Robert Chang, Sophia Y. Wang

    Abstract: Purpose: The purpose of this study was to develop and evaluate rule-based algorithms to enhance the extraction of text data, including retinal nerve fiber layer (RNFL) values and other ganglion cell count (GCC) data, from Zeiss Cirrus optical coherence tomography (OCT) scan reports. Methods: DICOM files that contained encapsulated PDF reports with RNFL or Ganglion Cell in their document titles wer… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  6. arXiv:2308.03027  [pdf, other

    cs.LG cs.CV eess.SP

    Causal Disentanglement Hidden Markov Model for Fault Diagnosis

    Authors: Rihao Chang, Yongtao Ma, Weizhi Nie, Jie Nie, An-an Liu

    Abstract: In modern industries, fault diagnosis has been widely applied with the goal of realizing predictive maintenance. The key issue for the fault diagnosis system is to extract representative characteristics of the fault signal and then accurately predict the fault type. In this paper, we propose a Causal Disentanglement Hidden Markov model (CDHM) to learn the causality in the bearing fault mechanism a… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  7. arXiv:2303.14885  [pdf, other

    eess.AS cs.LG cs.SD

    Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis

    Authors: Karren Yang, Ting-Yao Hu, Jen-Hao Rick Chang, Hema Swetha Koppula, Oncel Tuzel

    Abstract: Adapting generic speech recognition models to specific individuals is a challenging problem due to the scarcity of personalized data. Recent works have proposed boosting the amount of training data using personalized text-to-speech synthesis. Here, we ask two fundamental questions about this strategy: when is synthetic data effective for personalization, and why is it effective in those cases? To… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023

  8. arXiv:2303.05745  [pdf, other

    eess.IV cs.CV

    Multi-site, Multi-domain Airway Tree Modeling (ATM'22): A Public Benchmark for Pulmonary Airway Segmentation

    Authors: Minghui Zhang, Yangqian Wu, Hanxiao Zhang, Yulei Qin, Hao Zheng, Wen Tang, Corey Arnold, Chenhao Pei, Pengxin Yu, Yang Nan, Guang Yang, Simon Walsh, Dominic C. Marshall, Matthieu Komorowski, Puyang Wang, Dazhou Guo, Dakai **, Ya'nan Wu, Shuiqing Zhao, Runsheng Chang, Boyu Zhang, Xing Lv, Abdul Qayyum, Moona Mazher, Qi Su , et al. (11 additional authors not shown)

    Abstract: Open international challenges are becoming the de facto standard for assessing computer vision and image analysis algorithms. In recent years, new methods have extended the reach of pulmonary airway segmentation that is closer to the limit of image resolution. Since EXACT'09 pulmonary airway segmentation, limited effort has been directed to quantitative comparison of newly emerged algorithms drive… ▽ More

    Submitted 27 June, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: 32 pages, 16 figures. Homepage: https://atm22.grand-challenge.org/. Submitted

  9. arXiv:2212.07651  [pdf, other

    eess.IV cs.CV cs.LG

    Two-stage Contextual Transformer-based Convolutional Neural Network for Airway Extraction from CT Images

    Authors: Yanan Wu, Shuiqing Zhao, Shouliang Qi, Jie Feng, Haowen Pang, Runsheng Chang, Long Bai, Mengqi Li, Shuyue Xia, Wei Qian, Hongliang Ren

    Abstract: Accurate airway extraction from computed tomography (CT) images is a critical step for planning navigation bronchoscopy and quantitative assessment of airway-related chronic obstructive pulmonary disease (COPD). The existing methods are challenging to sufficiently segment the airway, especially the high-generation airway, with the constraint of the limited label and cannot meet the clinical use in… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  10. arXiv:2212.02057  [pdf, other

    cs.CV cs.AI eess.IV

    DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection

    Authors: Ziyuan Zhao, Mingxi Xu, Peisheng Qian, Ramanpreet Singh Pahwa, Richard Chang

    Abstract: Deep learning has achieved notable success in 3D object detection with the advent of large-scale point cloud datasets. However, severe performance degradation in the past trained classes, i.e., catastrophic forgetting, still remains a critical issue for real-world deployment when the number of classes is unknown or may vary. Moreover, existing 3D class-incremental detection methods are developed f… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted by the 33rd British Machine Vision Conference (BMVC 2022)

    Journal ref: 33rd British Machine Vision Conference 2022, BMVC 2022, London, UK, November 21-24, 2022. BMVA Press, 2022. URL https://bmvc2022.mpi-inf.mpg.de/0916.pdf

  11. arXiv:2208.08894  [pdf

    eess.SP

    EEG Machine Learning for Analysis of Mild Traumatic Brain Injury: A survey

    Authors: Weiqing Gu, Ryan Chang, Bohan Yang

    Abstract: Mild Traumatic Brain Injury (mTBI) is a common brain injury and affects a diverse group of people: soldiers, constructors, athletes, drivers, children, elders, and nearly everyone. Thus, having a well-established, fast, cheap, and accurate classification method is crucial for the well-being of people around the globe. Luckily, using Machine Learning (ML) on electroencephalography (EEG) data shows… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 27 pages

  12. arXiv:2208.01632  [pdf, ps, other

    eess.SP cs.IT eess.SY

    Sensor Deployment and Link Analysis in Satellite IoT Systems for Wildfire Detection

    Authors: How-Hang Liu, Ronald Y. Chang, Yi-Ying Chen, I-Kang Fu, H. Vincent Poor

    Abstract: Climate change has been identified as one of the most critical threats to human civilization and sustainability. Wildfires, which produce huge amounts of carbon emission, are both drivers and results of climate change. An early and timely wildfire detection system can constrain fires to short and small ones and yield significant carbon reduction. In this paper, we propose to use ground sensor depl… ▽ More

    Submitted 5 August, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: IEEE Global Communications Conference (GLOBECOM) 2022

  13. arXiv:2202.01946  [pdf, ps, other

    eess.SP cs.IT cs.LG

    Unsupervised Learning Based Hybrid Beamforming with Low-Resolution Phase Shifters for MU-MIMO Systems

    Authors: Chia-Ho Kuo, Hsin-Yuan Chang, Ronald Y. Chang, Wei-Ho Chung

    Abstract: Millimeter wave (mmWave) is a key technology for fifth-generation (5G) and beyond communications. Hybrid beamforming has been proposed for large-scale antenna systems in mmWave communications. Existing hybrid beamforming designs based on infinite-resolution phase shifters (PSs) are impractical due to hardware cost and power consumption. In this paper, we propose an unsupervised-learning-based sche… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: IEEE International Conference on Communications (ICC) 2022

  14. arXiv:2201.12656  [pdf, ps, other

    eess.SP cs.LG

    Few-Shot Transfer Learning for Device-Free Fingerprinting Indoor Localization

    Authors: Bing-Jia Chen, Ronald Y. Chang

    Abstract: Device-free wireless indoor localization is an essential technology for the Internet of Things (IoT), and fingerprint-based methods are widely used. A common challenge to fingerprint-based methods is data collection and labeling. This paper proposes a few-shot transfer learning system that uses only a small amount of labeled data from the current environment and reuses a large amount of existing l… ▽ More

    Submitted 29 January, 2022; originally announced January 2022.

    Comments: IEEE International Conference on Communications (ICC) 2022

  15. arXiv:2110.11479  [pdf, other

    eess.AS cs.LG cs.SD

    Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition

    Authors: Ting-Yao Hu, Mohammadreza Armandpour, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Oncel Tuzel

    Abstract: With recent advances in speech synthesis, synthetic data is becoming a viable alternative to real data for training speech recognition models. However, machine learning with synthetic data is not trivial due to the gap between the synthetic and the real data distributions. Synthetic datasets may contain artifacts that do not exist in real data such as structured noise, content errors, or unrealist… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  16. arXiv:2110.02891  [pdf, other

    cs.LG cs.SD eess.AS

    Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models

    Authors: Jen-Hao Rick Chang, Ashish Shrivastava, Hema Swetha Koppula, Xiaoshuai Zhang, Oncel Tuzel

    Abstract: Controllable generative sequence models with the capability to extract and replicate the style of specific examples enable many applications, including narrating audiobooks in different voices, auto-completing and auto-correcting written handwriting, and generating missing training samples for downstream recognition tasks. However, under an unsupervised-style setting, typical training algorithms f… ▽ More

    Submitted 30 June, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: ICML 2022

  17. arXiv:2109.10505  [pdf, ps, other

    eess.SP cs.IT eess.SY

    Sensor-Based Satellite IoT for Early Wildfire Detection

    Authors: How-Hang Liu, Ronald Y. Chang, Yi-Ying Chen, I-Kang Fu

    Abstract: Frequent and severe wildfires have been observed lately on a global scale. Wildfires not only threaten lives and properties, but also pose negative environmental impacts that transcend national boundaries (e.g., greenhouse gas emission and global warming). Thus, early wildfire detection with timely feedback is much needed. We propose to use the emerging beyond fifth-generation (B5G) and sixth-gene… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: To appear in IEEE GLOBECOM 2021 Workshops

  18. arXiv:2109.09267  [pdf, ps, other

    cs.IT eess.SP

    Intelligent Reflecting Surfaces and Classical Relays: Coexistence and Co-Design

    Authors: Te-Yi Kan, Ronald Y. Chang, Feng-Tsun Chien

    Abstract: This paper investigates a multiuser downlink communication system with coexisting intelligent reflecting surface (IRS) and classical half-duplex decode-and-forward (DF) relay. In this system, the IRS and the DF relay interact with each other and assist transmission simultaneously. In particular, active beamforming at the base station (BS) and at the DF relay, and passive beamforming at the IRS, ar… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: To appear in IEEE GLOBECOM 2021 Workshops

  19. arXiv:2102.00178  [pdf, other

    eess.SP cs.IT cs.LG

    Deep Reinforcement Learning Aided Monte Carlo Tree Search for MIMO Detection

    Authors: Tz-Wei Mo, Ronald Y. Chang, Te-Yi Kan

    Abstract: This paper proposes a novel multiple-input multiple-output (MIMO) symbol detector that incorporates a deep reinforcement learning (DRL) agent into the Monte Carlo tree search (MCTS) detection algorithm. We first describe how the MCTS algorithm, used in many decision-making problems, is applied to the MIMO detection problem. Then, we introduce a self-designed deep reinforcement learning agent, cons… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

  20. arXiv:2008.07111  [pdf, other

    eess.SP cs.LG

    Semi-Supervised Learning with GANs for Device-Free Fingerprinting Indoor Localization

    Authors: Kevin M. Chen, Ronald Y. Chang

    Abstract: Device-free wireless indoor localization is a key enabling technology for the Internet of Things (IoT). Fingerprint-based indoor localization techniques are a commonly used solution. This paper proposes a semi-supervised, generative adversarial network (GAN)-based device-free fingerprinting indoor localization system. The proposed system uses a small amount of labeled data and a large amount of un… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted at IEEE GLOBECOM 2020

  21. arXiv:2005.00946  [pdf, other

    eess.IV cs.CV physics.optics

    Towards Occlusion-Aware Multifocal Displays

    Authors: Jen-Hao Rick Chang, Anat Levin, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan

    Abstract: The human visual system uses numerous cues for depth perception, including disparity, accommodation, motion parallax and occlusion. It is incumbent upon virtual-reality displays to satisfy these cues to provide an immersive user experience. Multifocal displays, one of the classic approaches to satisfy the accommodation cue, place virtual content at multiple focal planes, each at a di erent depth.… ▽ More

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: SIGGRAPH 2020

  22. arXiv:1910.06302  [pdf, other

    eess.IV cs.CV cs.LG

    Finding New Diagnostic Information for Detecting Glaucoma using Neural Networks

    Authors: Erfan Noury, Suria S. Mannil, Robert T. Chang, An Ran Ran, Carol Y. Cheung, Suman S. Thapa, Harsha L. Rao, Srilakshmi Dasari, Mohammed Riyazuddin, Dolly Chang, Sriharsha Nagaraj, Clement C. Tham, Reza Zadeh

    Abstract: We describe a new approach to automated Glaucoma detection in 3D Spectral Domain Optical Coherence Tomography (OCT) optic nerve scans. First, we gathered a unique and diverse multi-ethnic dataset of OCT scans consisting of glaucoma and non-glaucomatous cases obtained from four tertiary care eye hospitals located in four different countries. Using this longitudinal data, we achieved state-of-the-ar… ▽ More

    Submitted 2 September, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: 28 pages, 12 figures, 15 tables, title changed, new authors added

  23. arXiv:1812.11031  [pdf, other

    eess.SP cs.IT

    Distributed Multi-Stream Beamforming in MIMO Multi-Relay Interference Networks

    Authors: Cenk M. Yetis, Ronald Y. Chang

    Abstract: In this paper, multi-stream transmission in interference networks aided by multiple amplify-and-forward (AF) relays in the presence of direct links is considered. The objective is to minimize the sum power of transmitters and relays by beamforming optimization under the stream signal-to-interference-plus-noise-ratio (SINR) constraints. For transmit beamforming optimization, the problem is a well-k… ▽ More

    Submitted 14 December, 2018; originally announced December 2018.

    Comments: 18 pages, 10 figures, and 4 tables. This paper is to appear in IEEE Access