Skip to main content

Showing 1–50 of 72 results for author: Gan, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.14158  [pdf, other

    eess.SP

    Computation-efficient Virtual Sensing Approach with Multichannel Adjoint Least Mean Square Algorithm

    Authors: Boxiang Wang, Junwei Ji, Xiaoyi Shen, Dongyuan Shi, Woon-Seng Gan

    Abstract: Multichannel active noise control (ANC) systems are designed to create a large zone of quietness (ZoQ) around the error microphones, however, the placement of these microphones often presents challenges due to physical limitations. Virtual sensing technique that effectively suppresses the noise far from the physical error microphones is one of the most promising solutions. Nevertheless, the conven… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2405.12996  [pdf, other

    eess.IV

    Dose-aware Diffusion Model for 3D Low-dose PET: Multi-institutional Validation with Reader Study and Real Low-dose Data

    Authors: Huidong Xie, Weijie Gan, Bo Zhou, Ming-Kai Chen, Michal Kulon, Annemarie Boustani, Benjamin A. Spencer, Reimund Bayerlein, Xiongchao Chen, Qiong Liu, Xueqi Guo, Menghua Xia, Yinchi Zhou, Hui Liu, Liang Guo, Hongyu An, Ulugbek S. Kamilov, Hanzhong Wang, Biao Li, Axel Rominger, Kuangyu Shi, Ge Wang, Ramsey D. Badawi, Chi Liu

    Abstract: As PET imaging is accompanied by radiation exposure and potentially increased cancer risk, reducing radiation dose in PET scans without compromising the image quality is an important topic. Deep learning (DL) techniques have been investigated for low-dose PET imaging. However, existing models have often resulted in compromised image quality when achieving low-dose PET and have limited generalizabi… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 16 Pages, 15 Figures, 4 Tables. Paper under review. arXiv admin note: substantial text overlap with arXiv:2311.04248

  3. arXiv:2405.12496  [pdf, other

    eess.AS cs.NI cs.SD eess.SP

    A Survey of Integrating Wireless Technology into Active Noise Control

    Authors: Xiaoyi Shen, Dongyuan Shi, Zhengding Luo, Junwei Ji, Woon-Seng Gan

    Abstract: Active Noise Control (ANC) is a widely adopted technology for reducing environmental noise across various scenarios. This paper focuses on enhancing noise reduction performance, particularly through the refinement of signal quality fed into ANC systems. We discuss the main wireless technique integrated into the ANC system, equipped with some innovative algorithms, in diverse environments. Instead… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  4. arXiv:2405.07536  [pdf, other

    cs.RO eess.SY

    Multi-AUV Kinematic Task Assignment based on Self-organizing Map Neural Network and Dubins Path Generator

    Authors: Xin Li, Wenyang Gan, Pang Wen, Daqi Zhu

    Abstract: To deal with the task assignment problem of multi-AUV systems under kinematic constraints, which means steering capability constraints for underactuated AUVs or other vehicles likely, an improved task assignment algorithm is proposed combining the Dubins Path algorithm with improved SOM neural network algorithm. At first, the aimed tasks are assigned to the AUVs by improved SOM neural network meth… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  5. arXiv:2403.18139  [pdf, other

    eess.IV cs.CV

    Pseudo-MRI-Guided PET Image Reconstruction Method Based on a Diffusion Probabilistic Model

    Authors: Weijie Gan, Huidong Xie, Carl von Gall, Günther Platsch, Michael T. Jurkiewicz, Andrea Andrade, Udunna C. Anazodo, Ulugbek S. Kamilov, Hongyu An, Jorge Cabello

    Abstract: Anatomically guided PET reconstruction using MRI information has been shown to have the potential to improve PET image quality. However, these improvements are limited to PET scans with paired MRI information. In this work we employed a diffusion probabilistic model (DPM) to infer T1-weighted-MRI (deep-MRI) images from FDG-PET brain images. We then use the DPM-generated T1w-MRI to guide the PET re… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  6. Unsupervised learning based end-to-end delayless generative fixed-filter active noise control

    Authors: Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Woon-Seng Gan

    Abstract: Delayless noise control is achieved by our earlier generative fixed-filter active noise control (GFANC) framework through efficient coordination between the co-processor and real-time controller. However, the one-dimensional convolutional neural network (1D CNN) in the co-processor requires initial training using labelled noise datasets. Labelling noise data can be resource-intensive and may intro… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  7. arXiv:2402.02694  [pdf, other

    eess.AS cs.LG cs.SD

    Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

    Authors: Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Dongyuan Shi, Woon-Seng Gan, Mark D. Plumbley, Susanto Rahardja, Bin Xiang, Jianfeng Chen

    Abstract: Acoustic scene classification (ASC) is a crucial research problem in computational auditory scene analysis, and it aims to recognize the unique acoustic characteristics of an environment. One of the challenges of the ASC task is the domain shift between training and testing data. Since 2018, ASC challenges have focused on the generalization of ASC models across different recording devices. Althoug… ▽ More

    Submitted 28 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  8. arXiv:2401.13998  [pdf, other

    eess.IV cs.CV

    WAL-Net: Weakly supervised auxiliary task learning network for carotid plaques classification

    Authors: Haitao Gan, Lingchao Fu, Ran Zhou, Weiyan Gan, Furong Wang, Xiaoyan Wu, Zhi Yang, Zhongwei Huang

    Abstract: The classification of carotid artery ultrasound images is a crucial means for diagnosing carotid plaques, holding significant clinical relevance for predicting the risk of stroke. Recent research suggests that utilizing plaque segmentation as an auxiliary task for classification can enhance performance by leveraging the correlation between segmentation and classification tasks. However, this appro… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  9. arXiv:2401.08678  [pdf, other

    eess.AS cs.SD

    Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music

    Authors: Han Yin, Mou Wang, Jisheng Bai, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen

    Abstract: This paper presents a detailed description of our proposed methods for the ICASSP 2024 Cadenza Challenge. Experimental results show that the proposed system can achieve better performance than official baselines.

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Submitted to ICASSP 2024

  10. arXiv:2311.18073  [pdf, other

    eess.IV

    DiffGEPCI: 3D MRI Synthesis from mGRE Signals using 2.5D Diffusion Model

    Authors: Yuyang Hu, Satya V. V. N. Kothapalli, Weijie Gan, Alexander L. Sukstanskii, Gregory F. Wu, Manu Goyal, Dmitriy A. Yablonskiy, Ulugbek S. Kamilov

    Abstract: We introduce a new framework called DiffGEPCI for cross-modality generation in magnetic resonance imaging (MRI) using a 2.5D conditional diffusion model. DiffGEPCI can synthesize high-quality Fluid Attenuated Inversion Recovery (FLAIR) and Magnetization Prepared-Rapid Gradient Echo (MPRAGE) images, without acquiring corresponding measurements, by leveraging multi-Gradient-Recalled Echo (mGRE) MRI… ▽ More

    Submitted 18 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  11. arXiv:2311.15445  [pdf, other

    cs.CV eess.IV

    FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration

    Authors: Zihao Zou, Jiaming Liu, Shirin Shoushtari, Yubo Wang, Weijie Gan, Ulugbek S. Kamilov

    Abstract: Face video restoration (FVR) is a challenging but important problem where one seeks to recover a perceptually realistic face videos from a low-quality input. While diffusion probabilistic models (DPMs) have been shown to achieve remarkable performance for face image restoration, they often fail to preserve temporally coherent, high-quality videos, compromising the fidelity of reconstructed faces.… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 32 pages, 27 figures

  12. arXiv:2311.14068  [pdf, other

    eess.AS

    Interactive Dual-Conformer with Scene-Inspired Mask for Soft Sound Event Detection

    Authors: Han Yin, Jisheng Bai, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen

    Abstract: Traditional binary hard labels for sound event detection (SED) lack details about the complexity and variability of sound event distributions. Recently, a novel annotation workflow is proposed to generate fine-grained non-binary soft labels, resulting in a new real-life dataset named MAESTRO Real for SED. In this paper, we first propose an interactive dual-conformer (IDC) module, in which a cross-… ▽ More

    Submitted 7 December, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: to be improved (unfinished)

  13. arXiv:2311.12371  [pdf, other

    eess.AS

    AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning

    Authors: Jisheng Bai, Han Yin, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen, Susanto Rahardja

    Abstract: Previous studies in automated audio captioning have faced difficulties in accurately capturing the complete temporal details of acoustic scenes and events within long audio sequences. This paper presents AudioLog, a large language models (LLMs)-powered audio logging system with hybrid token-semantic contrastive learning. Specifically, we propose to fine-tune the pre-trained hierarchical token-sema… ▽ More

    Submitted 4 January, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  14. arXiv:2311.04248  [pdf, other

    eess.IV

    DDPET-3D: Dose-aware Diffusion Model for 3D Ultra Low-dose PET Imaging

    Authors: Huidong Xie, Weijie Gan, Bo Zhou, Xiongchao Chen, Qiong Liu, Xueqi Guo, Liang Guo, Hongyu An, Ulugbek S. Kamilov, Ge Wang, Chi Liu

    Abstract: As PET imaging is accompanied by substantial radiation exposure and cancer risk, reducing radiation dose in PET scans is an important topic. Recently, diffusion models have emerged as the new state-of-the-art generative model to generate high-quality samples and have demonstrated strong potential for various tasks in medical imaging. However, it is difficult to extend diffusion models for 3D image… ▽ More

    Submitted 28 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Paper under review. 16 pages, 11 figures, 4 tables

  15. arXiv:2311.02003  [pdf, other

    eess.IV cs.CV

    A Structured Pruning Algorithm for Model-based Deep Learning

    Authors: Chicago Park, Weijie Gan, Zihao Zou, Yuyang Hu, Zhixin Sun, Ulugbek S. Kamilov

    Abstract: There is a growing interest in model-based deep learning (MBDL) for solving imaging inverse problems. MBDL networks can be seen as iterative algorithms that estimate the desired image using a physical measurement model and a learned image prior specified using a convolutional neural net (CNNs). The iterative nature of MBDL networks increases the test-time computational complexity, which limits the… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  16. arXiv:2310.07504  [pdf, other

    eess.IV cs.CV

    PtychoDV: Vision Transformer-Based Deep Unrolling Network for Ptychographic Image Reconstruction

    Authors: Weijie Gan, Qiuchen Zhai, Michael Thompson McCann, Cristina Garcia Cardona, Ulugbek S. Kamilov, Brendt Wohlberg

    Abstract: Ptychography is an imaging technique that captures multiple overlap** snapshots of a sample, illuminated coherently by a moving localized probe. The image recovery from ptychographic data is generally achieved via an iterative algorithm that solves a nonlinear phase retrieval problem derived from measured diffraction patterns. However, these iterative approaches have high computational cost. In… ▽ More

    Submitted 6 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  17. arXiv:2310.04297  [pdf, other

    eess.IV

    A Plug-and-Play Image Registration Network

    Authors: Junhao Hu, Weijie Gan, Zhixin Sun, Hongyu An, Ulugbek S. Kamilov

    Abstract: Deformable image registration (DIR) is an active research topic in biomedical imaging. There is a growing interest in develo** DIR methods based on deep learning (DL). A traditional DL approach to DIR is based on training a convolutional neural network (CNN) to estimate the registration field between two input images. While conceptually simple, this approach comes with a limitation that it exclu… ▽ More

    Submitted 19 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  18. arXiv:2308.07767  [pdf, other

    eess.AS cs.SD

    Preliminary investigation of the short-term in situ performance of an automatic masker selection system

    Authors: Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Woon-Seng Gan

    Abstract: Soundscape augmentation or "masking" introduces wanted sounds into the acoustic environment to improve acoustic comfort. Usually, the masker selection and playback strategies are either arbitrary or based on simple rules (e.g. -3 dBA), which may lead to sub-optimal increment or even reduction in acoustic comfort for dynamic acoustic environments. To reduce ambiguity in the selection of maskers, an… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: paper submitted to the 52nd International Congress and Exposition on Noise Control Engineering held in Chiba, Greater Tokyo, Japan, on 20-23 August 2023 (Inter-Noise 2023)

    ACM Class: J.2; J.4

  19. arXiv:2308.03684  [pdf, other

    eess.AS cs.SD

    Active Noise Control based on the Momentum Multichannel Normalized Filtered-x Least Mean Square Algorithm

    Authors: Dongyuan Shi, Woon-Seng Gan, Bhan Lam, Shulin Wen, Xiaoyi Shen

    Abstract: Multichannel active noise control (MCANC) is widely utilized to achieve significant noise cancellation area in the complicated acoustic field. Meanwhile, the filter-x least mean square (FxLMS) algorithm gradually becomes the benchmark solution for the implementation of MCANC due to its low computational complexity. However, its slow convergence speed more or less undermines the performance of deal… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Conference: INTER-NOISE and NOISE-CON Congress and Conference Proceedings 2020 At Korea Volume: 261

  20. arXiv:2307.10913  [pdf, other

    eess.SP

    Practical Active Noise Control: Restriction of Maximum Output Power

    Authors: Woon-Seng Gan, Dongyuan Shi, Xiaoyi Shen

    Abstract: This paper presents some recent algorithms developed by the authors for real-time adaptive active noise (AANC) control systems. These algorithms address some of the common challenges faced by AANC systems, such as speaker saturation, system divergence, and disturbance rejection. Speaker saturation can introduce nonlinearity into the adaptive system and degrade the noise reduction performance. Syst… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

  21. Anti-noise window: Subjective perception of active noise reduction and effect of informational masking

    Authors: Bhan Lam, Kelvin Chee Quan Lim, Kenneth Ooi, Zhen-Ting Ong, Dongyuan Shi, Woon-Seng Gan

    Abstract: Reviving natural ventilation (NV) for urban sustainability presents challenges for indoor acoustic comfort. Active control and interference-based noise mitigation strategies, such as the use of loudspeakers, offer potential solutions to achieve acoustic comfort while maintaining NV. However, these approaches are not commonly integrated or evaluated from a perceptual standpoint. This study examines… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted manuscript submitted to Sustainable Cities and Society

    Journal ref: Sustain. Cities Soc., 104763, 2023

  22. arXiv:2306.11408  [pdf, other

    eess.AS

    A Computation-efficient Online Secondary Path Modeling Technique for Modified FXLMS Algorithm

    Authors: Junwei Ji, Dongyuan Shi, Woon-Seng Gan, Xiaoyi Shen, Zhengding Luo

    Abstract: This paper proposes an online secondary path modelling (SPM) technique to improve the performance of the modified filtered reference Least Mean Square (FXLMS) algorithm. It can effectively respond to a time-varying secondary path, which refers to the path from a secondary source to an error sensor. Unlike traditional methods, the proposed approach switches modes between adaptive ANC and online SPM… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  23. MOV-Modified-FxLMS algorithm with Variable Penalty Factor in a Practical Power Output Constrained Active Control System

    Authors: Chung Kwan Lai, Dongyuan Shi, Bhan Lam, Woon-Seng Gan

    Abstract: Practical Active Noise Control (ANC) systems typically require a restriction in their maximum output power, to prevent overdriving the loudspeaker and causing system instability. Recently, the minimum output variance filtered-reference least mean square (MOV-FxLMS) algorithm was shown to have optimal control under output constraint with an analytically formulated penalty factor, but it needs offli… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted article in IEEE Signal Processing Letters

    Journal ref: IEEE Signal Process. Lett., vol. 30, pp. 723-727, 2023

  24. arXiv:2306.01425  [pdf, other

    eess.AS eess.SP eess.SY

    Active Noise Control in The New Century: The Role and Prospect of Signal Processing

    Authors: Dongyuan Shi, Bhan Lam, Woon-Seng Gan, Jordan Cheer, Stephen J. Elliott

    Abstract: Since Paul Leug's 1933 patent application for a system for the active control of sound, the field of active noise control (ANC) has not flourished until the advent of digital signal processors forty years ago. Early theoretical advancements in digital signal processing and processors laid the groundwork for the phenomenal growth of the field, particularly over the past quarter-century. The widespr… ▽ More

    Submitted 6 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Submitted to inter.noise 2023, Chiba, Japan

  25. arXiv:2305.12672  [pdf, other

    eess.IV cs.CV cs.LG

    Block Coordinate Plug-and-Play Methods for Blind Inverse Problems

    Authors: Weijie Gan, Shirin Shoushtari, Yuyang Hu, Jiaming Liu, Hongyu An, Ulugbek S. Kamilov

    Abstract: Plug-and-play (PnP) prior is a well-known class of methods for solving imaging inverse problems by computing fixed-points of operators combining physical measurement models and learned image denoisers. While PnP methods have been extensively used for image recovery with known measurement operators, there is little work on PnP for solving blind inverse problems. We address this gap by presenting a… ▽ More

    Submitted 26 October, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  26. Real-time modelling of observation filter in the Remote Microphone Technique for an Active Noise Control application

    Authors: Chung Kwan Lai, Bhan Lam, Dongyuan Shi, Woon-Seng Gan

    Abstract: The remote microphone technique (RMT) is often used in active noise control (ANC) applications to overcome design constraints in microphone placements by estimating the acoustic pressure at inconvenient locations using a pre-calibrated observation filter (OF), albeit limited to stationary primary acoustic fields. While the OF estimation in varying primary fields can be significantly improved throu… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 5 pages, 5 figures. Submitted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023)

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun. 2023, pp. 1-5

  27. arXiv:2303.08411  [pdf, other

    eess.AS

    A practical distributed active noise control algorithm overcoming communication restrictions

    Authors: Junwei Ji, Dongyuan Shi, Zhengding Luo, Xiaoyi Shen, Woon-Seng Gan

    Abstract: By assigning the massive computing tasks of the traditional multichannel active noise control (MCANC) system to several distributed control nodes, distributed multichannel active noise control (DMCANC) techniques have become effective global noise reduction solutions with low computational costs. However, existing DMCANC algorithms simply complete the distribution of traditional centralized algori… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  28. arXiv:2303.08397  [pdf, other

    eess.AS eess.SP

    A Momentum Two-gradient Direction Algorithm with Variable Step Size Applied to Solve Practical Output Constraint Issue for Active Noise Control

    Authors: Xiaoyi Shen, Dongyuan Shi, Zhengding Luo, Junwei Ji, Woon-Seng Gan

    Abstract: Active noise control (ANC) has been widely utilized to reduce unwanted environmental noise. The primary objective of ANC is to generate an anti-noise with the same amplitude but the opposite phase of the primary noise using the secondary source. However, the effectiveness of the ANC application is impacted by the speaker's output saturation. This paper proposes a two-gradient direction ANC algorit… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: Paper is submitted and accepted by ICASSP2023

  29. Implementing Continuous HRTF Measurement in Near-Field

    Authors: Ee-Leng Tan, Santi Peksi, Woon-Seng Gan

    Abstract: Head-related transfer function (HRTF) is an essential component to create an immersive listening experience over headphones for virtual reality (VR) and augmented reality (AR) applications. Metaverse combines VR and AR to create immersive digital experiences, and users are very likely to interact with virtual objects in the near-field (NF). The HRTFs of such objects are highly individualized and d… ▽ More

    Submitted 15 June, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 5 pages, 9 figures, Submitted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing

  30. Autonomous Soundscape Augmentation with Multimodal Fusion of Visual and Participant-linked Inputs

    Authors: Kenneth Ooi, Karn N. Watcharasupat, Bhan Lam, Zhen-Ting Ong, Woon-Seng Gan

    Abstract: Autonomous soundscape augmentation systems typically use trained models to pick optimal maskers to effect a desired perceptual change. While acoustic information is paramount to such systems, contextual information, including participant demographics and the visual environment, also influences acoustic perception. Hence, we propose modular modifications to an existing attention-based deep neural n… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 5 pages, 2 figures. Submitted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun. 2023, pp. 1-5

  31. Deep Generative Fixed-filter Active Noise Control

    Authors: Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Junwei Ji, Woon-Seng Gan

    Abstract: Due to the slow convergence and poor tracking ability, conventional LMS-based adaptive algorithms are less capable of handling dynamic noises. Selective fixed-filter active noise control (SFANC) can significantly reduce response time by selecting appropriate pre-trained control filters for different noises. Nonetheless, the limited number of pre-trained control filters may affect noise reduction p… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023. Code will be available after publication

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  32. arXiv:2211.12798  [pdf, other

    cs.CY cs.AI eess.SY

    An Open Case-based Reasoning Framework for Personalized On-board Driving Assistance in Risk Scenarios

    Authors: Wenbin Gan, Minh-Son Dao, Koji Zettsu

    Abstract: Driver reaction is of vital importance in risk scenarios. Drivers can take correct evasive maneuver at proper cushion time to avoid the potential traffic crashes, but this reaction process is highly experience-dependent and requires various levels of driving skills. To improve driving safety and avoid the traffic accidents, it is necessary to provide all road drivers with on-board driving assistan… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 10 pahes, 8 figures, 4 tables, accepted by IEEE BigData 2022

  33. arXiv:2210.14974  [pdf, other

    eess.IV cs.CV

    SINCO: A Novel structural regularizer for image compression using implicit neural representations

    Authors: Harry Gao, Weijie Gan, Zhixin Sun, Ulugbek S. Kamilov

    Abstract: Implicit neural representations (INR) have been recently proposed as deep learning (DL) based solutions for image compression. An image can be compressed by training an INR model with fewer weights than the number of image pixels to map the coordinates of the image to corresponding pixel values. While traditional training approaches for INRs are based on enforcing pixel-wise image consistency, we… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  34. arXiv:2210.06330  [pdf, other

    eess.IV cs.CV cs.LG

    CoRRECT: A Deep Unfolding Framework for Motion-Corrected Quantitative R2* Map**

    Authors: Xiaojian Xu, Weijie Gan, Satya V. V. N. Kothapalli, Dmitriy A. Yablonskiy, Ulugbek S. Kamilov

    Abstract: Quantitative MRI (qMRI) refers to a class of MRI methods for quantifying the spatial distribution of biological tissue parameters. Traditional qMRI methods usually deal separately with artifacts arising from accelerated data acquisition, involuntary physical motion, and magnetic-field inhomogeneities, leading to suboptimal end-to-end performance. This paper presents CoRRECT, a unified deep unfoldi… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  35. arXiv:2210.03837  [pdf, other

    eess.IV cs.CV

    Self-Supervised Deep Equilibrium Models for Inverse Problems with Theoretical Guarantees

    Authors: Weijie Gan, Chunwei Ying, Parna Eshraghi, Tongyao Wang, Cihat Eldeniz, Yuyang Hu, Jiaming Liu, Yasheng Chen, Hongyu An, Ulugbek S. Kamilov

    Abstract: Deep equilibrium models (DEQ) have emerged as a powerful alternative to deep unfolding (DU) for image reconstruction. DEQ models-implicit neural networks with effectively infinite number of layers-were shown to achieve state-of-the-art image reconstruction without the memory complexity associated with DU. While the performance of DEQ has been widely investigated, the existing work has primarily fo… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  36. arXiv:2210.02584  [pdf, other

    eess.IV

    SPICER: Self-Supervised Learning for MRI with Automatic Coil Sensitivity Estimation and Reconstruction

    Authors: Yuyang Hu, Weijie Gan, Chunwei Ying, Tongyao Wang, Cihat Eldeniz, Jiaming Liu, Yasheng Chen, Hongyu An, Ulugbek S. Kamilov

    Abstract: Deep model-based architectures (DMBAs) integrating physical measurement models and learned image regularizers are widely used in parallel magnetic resonance imaging (PMRI). Traditional DMBAs for PMRI rely on pre-estimated coil sensitivity maps (CSMs) as a component of the measurement model. However, estimation of accurate CSMs is a challenging problem when measurements are highly undersampled. Add… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 October, 2022; originally announced October 2022.

  37. arXiv:2208.08440  [pdf, other

    cs.LG cs.AI eess.SP

    Performance Evaluation of Selective Fixed-filter Active Noise Control based on Different Convolutional Neural Networks

    Authors: Zhengding Luo, Dongyuan Shi, Woon-Seng Gan

    Abstract: Due to its rapid response time and a high degree of robustness, the selective fixed-filter active noise control (SFANC) method appears to be a viable candidate for widespread use in a variety of practical active noise control (ANC) systems. In comparison to conventional fixed-filter ANC methods, SFANC can select the pre-trained control filters for different types of noise. Deep learning technologi… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:2208.08082

  38. arXiv:2208.08086  [pdf, other

    eess.SY

    Implementation of Multi-channel Active Noise Control based on Back-propagation Mechanism

    Authors: Zhengding Luo, Dongyuan Shi, Junwei Ji, Woon-seng Gan

    Abstract: Active noise control (ANC) systems can efficiently attenuate low-frequency noises by introducing anti-noises to combine with the unwanted noises. In ANC systems, the filtered-x least mean square (FxLMS) and filtered-X normalized least-mean-square (FxNLMS) algorithm are well-known algorithms for adaptively adjusting control filters. Multi-channel ANC systems are typically required to attenuate unwa… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

  39. arXiv:2208.08082  [pdf, other

    eess.SY cs.LG cs.SD eess.AS

    A Hybrid SFANC-FxNLMS Algorithm for Active Noise Control based on Deep Learning

    Authors: Zhengding Luo, Dongyuan Shi, Woon-Seng Gan

    Abstract: The selective fixed-filter active noise control (SFANC) method selecting the best pre-trained control filters for various types of noise can achieve a fast response time. However, it may lead to large steady-state errors due to inaccurate filter selection and the lack of adaptability. In comparison, the filtered-X normalized least-mean-square (FxNLMS) algorithm can obtain lower steady-state errors… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    Report number: Vol.29, p.1102-1106

    Journal ref: IEEE Signal Processing Letters, 2022

  40. arXiv:2207.12899  [pdf, other

    eess.AS cs.SD

    Assessment of a cost-effective headphone calibration procedure for soundscape evaluations

    Authors: Bhan Lam, Kenneth Ooi, Zhen-Ting Ong, Karn N. Watcharasupat, Trevor Wong, Woon-Seng Gan

    Abstract: To increase the availability and adoption of the soundscape standard, a low-cost calibration procedure for reproduction of audio stimuli over headphones was proposed as part of the global ``Soundscape Attributes Translation Project'' (SATP) for validating ISO/TS~12913-2:2018 perceived affective quality (PAQ) attribute translations. A previous preliminary study revealed significant deviations from… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: For 24th International Congress on Acoustics

    Journal ref: in Proc. 24th Int. Congr. Acoust., 2022, pp. 1-8

  41. arXiv:2207.09221  [pdf, other

    eess.AS stat.AP

    Do uHear? Validation of uHear App for Preliminary Screening of Hearing Ability in Soundscape Studies

    Authors: Zhen-Ting Ong, Bhan Lam, Kenneth Ooi, Karn N. Watcharasupat, Trevor Wong, Woon-Seng Gan

    Abstract: Studies involving soundscape perception often exclude participants with hearing loss to prevent impaired perception from affecting experimental results. Participants are typically screened with pure tone audiometry, the "gold standard" for identifying and quantifying hearing loss at specific frequencies, and excluded if a study-dependent threshold is not met. However, procuring professional audiom… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: Full paper submitted to 24th International Congress on Acoustics

  42. ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes

    Authors: Kenneth Ooi, Zhen-Ting Ong, Karn N. Watcharasupat, Bhan Lam, Joo Young Hong, Woon-Seng Gan

    Abstract: Choosing optimal maskers for existing soundscapes to effect a desired perceptual change via soundscape augmentation is non-trivial due to extensive varieties of maskers and a dearth of benchmark datasets with which to compare and develop soundscape augmentation models. To address this problem, we make publicly available the ARAUS (Affective Responses to Augmented Urban Soundscapes) dataset, which… ▽ More

    Submitted 5 March, 2023; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: [v1, v2] 25 pages, 11 figures. [v3] 33 pages, 18 figures. v3 updated with changes made after peer review. in IEEE Transactions on Affective Computing, 2023

    Journal ref: IEEE Trans. Affect. Comput., pp. 1-17, 2023

  43. arXiv:2206.07293  [pdf, other

    cs.SD eess.AS

    FRCRN: Boosting Feature Representation using Frequency Recurrence for Monaural Speech Enhancement

    Authors: Shengkui Zhao, Bin Ma, Karn N. Watcharasupat, Woon-Seng Gan

    Abstract: Convolutional recurrent networks (CRN) integrating a convolutional encoder-decoder (CED) structure and a recurrent structure have achieved promising performance for monaural speech enhancement. However, feature representation across frequency context is highly constrained due to limited receptive fields in the convolutions of CED. In this paper, we propose a convolutional recurrent encoder-decoder… ▽ More

    Submitted 24 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: The paper has been accepted by ICASSP 2022. 5 pages, 2 figures, 5 tables

  44. arXiv:2206.04264  [pdf, other

    eess.SY

    Formation Tracking for a Multi-Auv System Based on an Adaptive Sliding Mode Method in the Water Flow Environment

    Authors: Xin Li, Daqi Zhu, Bing Sun, Qi Chen, Wenyang Gan, Zhigang Li

    Abstract: In this paper, formation tracking for a multi-AUV system (MAS) using an improved adaptive sliding mode control method is studied in the Three Dimensional (3-D) underwater environment. Firstly, the kinematics model and the dynamic model of the AUVs are given as the Six Dimensions of Freedom (6-DOF) considered. Then, control law based on the mathematical model of the AUVs is proposed based on the im… ▽ More

    Submitted 17 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

  45. arXiv:2206.03112  [pdf

    cs.LG cs.SD eess.AS

    Singapore Soundscape Site Selection Survey (S5): Identification of Characteristic Soundscapes of Singapore via Weighted k-means Clustering

    Authors: Kenneth Ooi, Bhan Lam, Joo Young Hong, Karn N. Watcharasupat, Zhen-Ting Ong, Woon-Seng Gan

    Abstract: The ecological validity of soundscape studies usually rests on a choice of soundscapes that are representative of the perceptual space under investigation. For example, a soundscape pleasantness study might investigate locations with soundscapes ranging from "pleasant" to "annoying". The choice of soundscapes is typically researcher-led, but a participant-led process can reduce selection bias and… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: 23 pages, 8 figures. Submitted to Sustainability

    Journal ref: MDPI Sustainability. 2022; 14(12):7485

  46. arXiv:2205.13051  [pdf, other

    eess.IV cs.CV cs.LG

    Online Deep Equilibrium Learning for Regularization by Denoising

    Authors: Jiaming Liu, Xiaojian Xu, Weijie Gan, Shirin Shoushtari, Ulugbek S. Kamilov

    Abstract: Plug-and-Play Priors (PnP) and Regularization by Denoising (RED) are widely-used frameworks for solving imaging inverse problems by computing fixed-points of operators combining physical measurement models and learned image priors. While traditional PnP/RED formulations have focused on priors specified using image denoisers, there is a growing interest in learning PnP/RED priors that are end-to-en… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: 28 pages, 8 figures

  47. arXiv:2205.04728  [pdf, other

    eess.AS cs.SD

    Preliminary assessment of a cost-effective headphone calibration procedure for soundscape evaluations

    Authors: Bhan Lam, Kenneth Ooi, Karn N. Watcharasupat, Zhen-Ting Ong, Yun-Ting Lau, Trevor Wong, Woon-Seng Gan

    Abstract: The introduction of ISO 12913-2:2018 has provided a framework for standardized data collection and reporting procedures for soundscape practitioners. A strong emphasis was placed on the use of calibrated head and torso simulators (HATS) for binaural audio capture to obtain an accurate subjective impression and acoustic measure of the soundscape under evaluation. To auralise the binaural recordings… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Submitted to the 28th International Congress on Sound and Vibration

  48. arXiv:2204.13890  [pdf, other

    eess.AS cs.SD eess.SY

    Deployment of an IoT System for Adaptive In-Situ Soundscape Augmentation

    Authors: Trevor Wong, Karn N. Watcharasupat, Bhan Lam, Kenneth Ooi, Zhen-Ting Ong, Furi Andi Karnapi, Woon-Seng Gan

    Abstract: Soundscape augmentation is an emerging approach for noise mitigation by introducing additional sounds known as "maskers" to increase acoustic comfort. Traditionally, the choice of maskers is often predicated on expert guidance or post-hoc analysis which can be time-consuming and sometimes arbitrary. Moreover, this often results in a static set of maskers that are inflexible to the dynamic nature o… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: To be presented at the 51st International Congress and Exposition on Noise Control Engineering

    Journal ref: INTER-NOISE and NOISE-CON Congress and Conference Proceedings, Feb. 2022, vol. 265, no. 5, pp. 2013-2021

  49. arXiv:2204.13883  [pdf, other

    eess.AS cs.LG cs.SD

    Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain

    Authors: Karn N. Watcharasupat, Kenneth Ooi, Bhan Lam, Trevor Wong, Zhen-Ting Ong, Woon-Seng Gan

    Abstract: The selection of maskers and playback gain levels in a soundscape augmentation system is crucial to its effectiveness in improving the overall acoustic comfort of a given environment. Traditionally, the selection of appropriate maskers and gain levels has been informed by expert opinion, which may not representative of the target population, or by listening tests, which can be time-consuming and l… ▽ More

    Submitted 23 July, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted to IEEE Signal Processing Letters. (c) 2022 IEEE

    Journal ref: IEEE Signal Processing Letters, Vol. 29, pp. 1749 - 1753, 2022

  50. arXiv:2204.04771  [pdf, other

    eess.IV

    Image Reconstruction for MRI using Deep CNN Priors Trained without Groundtruth

    Authors: Weijie Gan, Cihat Eldeniz, Jiaming Liu, Sihao Chen, Hongyu An, Ulugbek S. Kamilov

    Abstract: We propose a new plug-and-play priors (PnP) based MR image reconstruction method that systematically enforces data consistency while also exploiting deep-learning priors. Our prior is specified through a convolutional neural network (CNN) trained without any artifact-free ground truth to remove undersampling artifacts from MR images. The results on reconstructing free-breathing MRI data into ten r… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.