Skip to main content

Showing 1–50 of 99 results for author: Dong, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.16658  [pdf, other

    eess.IV cs.CV math.ST

    Sampling Strategies in Bayesian Inversion: A Study of RTO and Langevin Methods

    Authors: Remi Laumont, Yiqiu Dong, Martin Skovgaard Andersen

    Abstract: This paper studies two classes of sampling methods for the solution of inverse problems, namely Randomize-Then-Optimize (RTO), which is rooted in sensitivity analysis, and Langevin methods, which are rooted in the Bayesian framework. The two classes of methods correspond to different assumptions and yield samples from different target distributions. We highlight the main conceptual and theoretical… ▽ More

    Submitted 25 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    MSC Class: 65K10; 65K05; 65D18; 62F15; 62C10; 68Q25; 68U10; 90C25; 65C05

  2. arXiv:2406.15160  [pdf, other

    eess.AS eess.SP

    Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios

    Authors: Ya Jiang, Qing Wang, Jun Du, Maocheng Hu, Pengfei Hu, Zeyan Liu, Shi Cheng, Zhaoxu Nian, Yuxuan Dong, Mingqi Cai, Xin Fang, Chin-Hui Lee

    Abstract: This study presents an audio-visual information fusion approach to sound event localization and detection (SELD) in low-resource scenarios. We aim at utilizing audio and video modality information through cross-modal learning and multi-modal fusion. First, we propose a cross-modal teacher-student learning (TSL) framework to transfer information from an audio-only teacher model, trained on a rich c… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: accepted by icme2024

  3. arXiv:2406.06295  [pdf, other

    cs.SD eess.AS

    Zero-Shot Audio Captioning Using Soft and Hard Prompts

    Authors: Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma

    Abstract: In traditional audio captioning methods, a model is usually trained in a fully supervised manner using a human-annotated dataset containing audio-text pairs and then evaluated on the test sets from the same dataset. Such methods have two limitations. First, these methods are often data-hungry and require time-consuming and expensive human annotations to obtain audio-text pairs. Second, these model… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing

  4. arXiv:2405.19373  [pdf, other

    eess.SP cs.LG

    Multi-modal Mood Reader: Pre-trained Model Empowers Cross-Subject Emotion Recognition

    Authors: Yihang Dong, Xuhang Chen, Yanyan Shen, Michael Kwok-Po Ng, Tao Qian, Shuqiang Wang

    Abstract: Emotion recognition based on Electroencephalography (EEG) has gained significant attention and diversified development in fields such as neural signal processing and affective computing. However, the unique brain anatomy of individuals leads to non-negligible natural differences in EEG signals across subjects, posing challenges for cross-subject emotion recognition. While recent studies have attem… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by International Conference on Neural Computing for Advanced Applications, 2024

  5. arXiv:2405.13996  [pdf, other

    eess.SP cs.HC

    Detecting Gait Abnormalities in Foot-Floor Contacts During Walking Through FootstepInduced Structural Vibrations

    Authors: Yiwen Dong, Yuyan Wu, Hae Young Noh

    Abstract: Gait abnormality detection is critical for the early discovery and progressive tracking of musculoskeletal and neurological disorders, such as Parkinson's and Cerebral Palsy. Especially, analyzing the foot-floor contacts during walking provides important insights into gait patterns, such as contact area, contact force, and contact time, enabling gait abnormality detection through these measurement… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: The 14th International Workshop on Structural Health Monitoring (IWSHM)

  6. arXiv:2405.01119  [pdf

    cs.SI eess.SY

    Towards Understanding Worldwide Cross-cultural Differences in Implicit Driving Cues: Review, Comparative Analysis, and Research Roadmap

    Authors: Yongqi Dong, Chang Liu, Yiyun Wang, Zhe Fu

    Abstract: Recognizing and understanding implicit driving cues across diverse cultures is imperative for fostering safe and efficient global transportation systems, particularly when training new immigrants holding driving licenses from culturally disparate countries. Additionally, it is essential to consider cross-cultural differences in the development of Automated Driving features tailored to different co… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 7 pages, 1 figure, under review by the 27th IEEE International Conference on Intelligent Transportation Systems (IEEE ITSC 2024)

  7. arXiv:2403.19943  [pdf, other

    cs.LG cs.AI eess.SP

    TDANet: A Novel Temporal Denoise Convolutional Neural Network With Attention for Fault Diagnosis

    Authors: Zhongzhi Li, Rong Fan, **gqi Tu, **yi Ma, Jianliang Ai, Yiqun Dong

    Abstract: Fault diagnosis plays a crucial role in maintaining the operational integrity of mechanical systems, preventing significant losses due to unexpected failures. As intelligent manufacturing and data-driven approaches evolve, Deep Learning (DL) has emerged as a pivotal technique in fault diagnosis research, recognized for its ability to autonomously extract complex features. However, the practical ap… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  8. arXiv:2403.07988  [pdf, other

    eess.SY

    Configuration and EMT Simulation of the 240-bus MiniWECC System Integrating Offshore Wind Farms (OWFs)

    Authors: Buxin She, Hisham Mahmood, Marcelo Elizondo, Veronica Adetola, Yuqing Dong

    Abstract: As offshore wind farms (OWFs) become increasingly prevalent in Northern California and Southern Oregon, they introduce faster dynamics into the Western Electricity Coordinating Council (WECC) system, resha** its dynamic behavior. Accordingly, electromagnetic transient (EMT) simulation is essential to assess high frequency dynamics of the WECC system with integrated OWFs. Against this background,… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 5 pages

  9. arXiv:2403.07317  [pdf, other

    eess.SY

    GMPC: Geometric Model Predictive Control for Wheeled Mobile Robot Trajectory Tracking

    Authors: Jiawei Tang, Shuang Wu, Bo Lan, Yahui Dong, Yuqiang **, Guangjian Tian, Wen-An Zhang, Ling Shi

    Abstract: The configuration of most robotic systems lies in continuous transformation groups. However, in mobile robot trajectory tracking, many recent works still naively utilize optimization methods for elements in vector space without considering the manifold constraint of the robot configuration. In this letter, we propose a geometric model predictive control (MPC) framework for wheeled mobile robot tra… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  10. arXiv:2402.01546  [pdf, other

    cs.LG cs.AI cs.CR cs.DC cs.MA eess.SY

    Privacy-Preserving Distributed Learning for Residential Short-Term Load Forecasting

    Authors: Yi Dong, Yingjie Wang, Mariana Gama, Mustafa A. Mustafa, Geert Deconinck, Xiaowei Huang

    Abstract: In the realm of power systems, the increasing involvement of residential users in load forecasting applications has heightened concerns about data privacy. Specifically, the load data can inadvertently reveal the daily routines of residential users, thereby posing a risk to their property security. While federated learning (FL) has been employed to safeguard user privacy by enabling model training… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  11. arXiv:2401.17593  [pdf, other

    eess.IV cs.CV physics.med-ph

    Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model

    Authors: Yafei Dong, Kuang Gong

    Abstract: Head and neck (H&N) cancers are among the most prevalent types of cancer worldwide, and [18F]F-FDG PET/CT is widely used for H&N cancer management. Recently, the diffusion model has demonstrated remarkable performance in various image-generation tasks. In this work, we proposed a 3D diffusion model to accurately perform H&N tumor segmentation from 3D PET and CT volumes. The 3D diffusion model was… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 28 pages, 5 figures

  12. Localization of Dummy Data Injection Attacks in Power Systems Considering Incomplete Topological Information: A Spatio-Temporal Graph Wavelet Convolutional Neural Network Approach

    Authors: Zhaoyang Qu, Yunchang Dong, Yang Li, Siqi Song, Tao Jiang, Min Li, Qiming Wang, Lei Wang, Xiaoyong Bo, Jiye Zang, Qi Xu

    Abstract: The emergence of novel the dummy data injection attack (DDIA) poses a severe threat to the secure and stable operation of power systems. These attacks are particularly perilous due to the minimal Euclidean spatial separation between the injected malicious data and legitimate data, rendering their precise detection challenging using conventional distance-based methods. Furthermore, existing researc… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted by Applied Energy

    Journal ref: Applied Energy 360 (2024) 122736

  13. arXiv:2401.05083  [pdf, other

    cs.RO cs.MA eess.SY

    Discrete-Time Stress Matrix-Based Formation Control of General Linear Multi-Agent Systems

    Authors: Okechi Onuoha, Suleiman Kurawa, Zezhi Tang, Yi Dong

    Abstract: This paper considers the distributed leader-follower stress-matrix-based affine formation control problem of discrete-time linear multi-agent systems with static and dynamic leaders. In leader-follower multi-agent formation control, the aim is to drive a set of agents comprising leaders and followers to form any desired geometric pattern and simultaneously execute any required manoeuvre by control… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  14. arXiv:2401.01496  [pdf, other

    eess.IV cs.AI cs.CV

    From Pixel to Slide image: Polarization Modality-based Pathological Diagnosis Using Representation Learning

    Authors: Jia Dong, Yao Yao, Yang Dong, Hui Ma

    Abstract: Thyroid cancer is the most common endocrine malignancy, and accurately distinguishing between benign and malignant thyroid tumors is crucial for develo** effective treatment plans in clinical practice. Pathologically, thyroid tumors pose diagnostic challenges due to improper specimen sampling. In this study, we have designed a three-stage model using representation learning to integrate pixel-le… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  15. arXiv:2312.16607  [pdf, other

    eess.IV cs.CV stat.ML

    A Polarization and Radiomics Feature Fusion Network for the Classification of Hepatocellular Carcinoma and Intrahepatic Cholangiocarcinoma

    Authors: Jia Dong, Yao Yao, Liyan Lin, Yang Dong, Jiachen Wan, Ran Peng, Chao Li, Hui Ma

    Abstract: Classifying hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (ICC) is a critical step in treatment selection and prognosis evaluation for patients with liver diseases. Traditional histopathological diagnosis poses challenges in this context. In this study, we introduce a novel polarization and radiomics feature fusion network, which combines polarization features obtained from Mu… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  16. arXiv:2312.04610  [pdf

    cs.LG cs.AI eess.SP stat.OT

    Data-driven Semi-supervised Machine Learning with Surrogate Safety Measures for Abnormal Driving Behavior Detection

    Authors: Yongqi Dong, Lanxin Zhang, Haneen Farah, Arkady Zgonnikov, Bart van Arem

    Abstract: Detecting abnormal driving behavior is critical for road traffic safety and the evaluation of drivers' behavior. With the advancement of machine learning (ML) algorithms and the accumulation of naturalistic driving data, many ML models have been adopted for abnormal driving behavior detection. Most existing ML-based detectors rely on (fully) supervised ML methods, which require substantial labeled… ▽ More

    Submitted 24 May, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 22 pages, 10 figures, accepted by the 103rd Transportation Research Board (TRB) Annual Meeting, under third round review by Transportation Research Record: Journal of the Transportation Research Board

  17. arXiv:2312.04398  [pdf

    cs.CV cs.AI cs.LG eess.IV stat.ML

    Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning

    Authors: Yongqi Dong, Xingmin Lu, Ruohan Li, Wei Song, Bart van Arem, Haneen Farah

    Abstract: The burgeoning navigation services using digital maps provide great convenience to drivers. Nevertheless, the presence of anomalies in lane rendering map images occasionally introduces potential hazards, as such anomalies can be misleading to human drivers and consequently contribute to unsafe driving conditions. In response to this concern and to accurately and effectively detect the anomalies, t… ▽ More

    Submitted 29 May, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 22 pages, 6 figures, accepted by the 103rd Transportation Research Board (TRB) Annual Meeting, under review by Transportation Research Record: Journal of the Transportation Research Board

  18. arXiv:2311.15168  [pdf, other

    eess.SY cs.LG

    A Data-Driven Approach for High-Impedance Fault Localization in Distribution Systems

    Authors: Yuqi Zhou, Yuqing Dong, Rui Yang

    Abstract: Accurate and quick identification of high-impedance faults is critical for the reliable operation of distribution systems. Unlike other faults in power grids, HIFs are very difficult to detect by conventional overcurrent relays due to the low fault current. Although HIFs can be affected by various factors, the voltage current characteristics can substantially imply how the system responds to the d… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  19. arXiv:2311.13361  [pdf, other

    cs.AI cs.HC eess.SY

    Applying Large Language Models to Power Systems: Potential Security Threats

    Authors: Jiaqi Ruan, Gaoqi Liang, Huan Zhao, Guolong Liu, Xianzhuo Sun, **g Qiu, Zhao Xu, Fushuan Wen, Zhao Yang Dong

    Abstract: Applying large language models (LLMs) to modern power systems presents a promising avenue for enhancing decision-making and operational efficiency. However, this action may also incur potential security threats, which have not been fully recognized so far. To this end, this article analyzes potential threats incurred by applying LLMs to power systems, emphasizing the need for urgent research and d… ▽ More

    Submitted 24 January, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  20. arXiv:2311.08816  [pdf, other

    eess.IV cs.CV

    Target-oriented Domain Adaptation for Infrared Image Super-Resolution

    Authors: Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Yafei Dong, Shinichiro Omachi

    Abstract: Recent efforts have explored leveraging visible light images to enrich texture details in infrared (IR) super-resolution. However, this direct adaptation approach often becomes a double-edged sword, as it improves texture at the cost of introducing noise and blurring artifacts. To address these challenges, we propose the Target-oriented Domain Adaptation SRGAN (DASRGAN), an innovative framework sp… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 11 pages, 9 figures

  21. arXiv:2311.08808  [pdf, other

    eess.IV

    Degradation Estimation Recurrent Neural Network with Local and Non-Local Priors for Compressive Spectral Imaging

    Authors: Yubo Dong, Dahua Gao, Yuyan Li, Guangming Shi, Danhua Liu

    Abstract: In the Coded Aperture Snapshot Spectral Imaging (CASSI) system, deep unfolding networks (DUNs) have demonstrated excellent performance in recovering 3D hyperspectral images (HSIs) from 2D measurements. However, some noticeable gaps exist between the imaging model used in DUNs and the real CASSI imaging process, such as the sensing error as well as photon and dark current noise, compromising the ac… ▽ More

    Submitted 14 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  22. arXiv:2310.15767  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Unpaired MRI Super Resolution with Contrastive Learning

    Authors: Hao Li, Quanwei Liu, Jianan Liu, Xiling Liu, Yanni Dong, Tao Huang, Zhihan Lv

    Abstract: Magnetic resonance imaging (MRI) is crucial for enhancing diagnostic accuracy in clinical settings. However, the inherent long scan time of MRI restricts its widespread applicability. Deep learning-based image super-resolution (SR) methods exhibit promise in improving MRI resolution without additional cost. Due to lacking of aligned high-resolution (HR) and low-resolution (LR) MRI image pairs, uns… ▽ More

    Submitted 16 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  23. Deep learning based on Transformer architecture for power system short-term voltage stability assessment with class imbalance

    Authors: Yang Li, Jiting Cao, Yan Xu, Lipeng Zhu, Zhao Yang Dong

    Abstract: Most existing data-driven power system short-term voltage stability assessment (STVSA) approaches presume class-balanced input data. However, in practical applications, the occurrence of short-term voltage instability following a disturbance is minimal, leading to a significant class imbalance problem and a consequent decline in classifier performance. This work proposes a Transformer-based STVSA… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by Renewable and Sustainable Energy Reviews

    Journal ref: Renewable and Sustainable Energy Reviews 189 (2024) 113913

  24. arXiv:2310.10376  [pdf, other

    eess.SY

    Research on Train Shunting Impedance Based on Transmission Line Theory

    Authors: Yinchao Dong, Linhai Zhao

    Abstract: At present, the shunting process of train to track circuit is usually studied by taking the shunting resistance of the first wheel set of train as the equivalent model, which ignores the shunting effect of other wheel sets and cannot study the fault conditions such as "pool shunting". Especially for the jointless track circuit (JTC), the compensation capacitors connected in parallel on the rail li… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  25. arXiv:2310.06678  [pdf, other

    cs.IT eess.SP eess.SY

    Modelling and Performance Analysis of the Over-the-Air Computing in Cellular IoT Networks

    Authors: Ying Dong, Haonan Hu, Qiaoshou Liu, Tingwei Lv, Qianbin Chen, Jie Zhang

    Abstract: Ultra-fast wireless data aggregation (WDA) of distributed data has emerged as a critical design challenge in the ultra-densely deployed cellular internet of things network (CITN) due to limited spectral resources. Over-the-air computing (AirComp) has been proposed as an effective solution for ultra-fast WDA by exploiting the superposition property of wireless channels. However, the effect of acces… ▽ More

    Submitted 11 August, 2023; originally announced October 2023.

  26. arXiv:2309.16813  [pdf, other

    cs.NI eess.SP

    Wi-Fi 8: Embracing the Millimeter-Wave Era

    Authors: Xiaoqian Liu, Tingwei Chen, Yuhan Dong, Zhi Mao, Ming Gan, Xun Yang, Jianmin Lu

    Abstract: With the increasing demands in communication, Wi-Fi technology is advancing towards its next generation. Building on the foundation of Wi-Fi 7, millimeter-wave technology is anticipated to converge with Wi-Fi 8 in the near future. In this paper, we look into the millimeter-wave technology and other potential feasible features, providing a comprehensive perspective on the future of Wi-Fi 8. Our sim… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 7 pages, 4 figures

  27. arXiv:2309.15951  [pdf, other

    cs.NI eess.SP

    IEEE 802.11be Wi-Fi 7: Feature Summary and Performance Evaluation

    Authors: Xiaoqian Liu, Yuhan Dong, Yiqing Li, Yousi Lin, Xun Yang, Ming Gan

    Abstract: While the pace of commercial scale application of Wi-Fi 6 accelerates, the IEEE 802.11 Working Group is about to complete the development of a new amendment standard IEEE 802.11be -- Extremely High Throughput (EHT), also known as Wi-Fi 7, which can be used to meet the demand for the throughput of 4K/8K videos up to tens of Gbps and low-latency video applications such as virtual reality (VR) and au… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 6 pages, 4 figures

  28. arXiv:2309.01958  [pdf, other

    cs.CV eess.IV

    Empowering Low-Light Image Enhancer through Customized Learnable Priors

    Authors: Naishan Zheng, Man Zhou, Yanmeng Dong, Xiangyu Rui, Jie Huang, Chongyi Li, Feng Zhao

    Abstract: Deep neural networks have achieved remarkable progress in enhancing low-light images by improving their brightness and eliminating noise. However, most existing methods construct end-to-end map** networks heuristically, neglecting the intrinsic prior of image enhancement task and lacking transparency and interpretability. Although some unfolding solutions have been proposed to relieve these issu… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023

  29. arXiv:2308.06979  [pdf, other

    eess.AS cs.SD

    The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

    Authors: Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco Martínez-Ramírez, Weihsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada Mohanty, Roman Solovyev, Alexander Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang , et al. (2 additional authors not shown)

    Abstract: This paper summarizes the music demixing (MDX) track of the Sound Demixing Challenge (SDX'23). We provide a summary of the challenge setup and introduce the task of robust music source separation (MSS), i.e., training MSS models in the presence of errors in the training data. We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce t… ▽ More

    Submitted 19 April, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: Published in Transactions of the International Society for Music Information Retrieval (https://transactions.ismir.net/articles/10.5334/tismir.171)

    Journal ref: Transactions of the International Society for Music Information Retrieval, 7(1), pp.63-84, 2024

  30. arXiv:2307.09729  [pdf, other

    cs.CV cs.MM eess.IV

    NTIRE 2023 Quality Assessment of Video Enhancement Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu , et al. (47 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  31. arXiv:2306.11984  [pdf, ps, other

    eess.IV cs.AI cs.CV

    TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models

    Authors: Se-In Jang, Cristina Lois, Emma Thibault, J. Alex Becker, Yafei Dong, Marc D. Normandin, Julie C. Price, Keith A. Johnson, Georges El Fakhri, Kuang Gong

    Abstract: In this work, we developed a novel text-guided image synthesis technique which could generate realistic tau PET images from textual descriptions and the subject's MR image. The generated tau PET images have the potential to be used in examining relations between different measures and also increasing the public availability of tau PET datasets. The method was based on latent diffusion models. Both… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  32. arXiv:2306.11466  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers

    Authors: Yongqi Dong, Tobias Datema, Vincent Wassenaar, Joris van de Weg, Cahit Tolga Kopar, Harim Suleman

    Abstract: Develo** and testing automated driving models in the real world might be challenging and even dangerous, while simulation can help with this, especially for challenging maneuvers. Deep reinforcement learning (DRL) has the potential to tackle complex decision-making and controlling tasks through learning and interacting with the environment, thus it is suitable for develo** automated driving wh… ▽ More

    Submitted 18 August, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 6 pages, 3 figures, accepted by the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

  33. arXiv:2306.11465  [pdf

    cs.RO cs.AI cs.LG eess.SY

    Safe, Efficient, Comfort, and Energy-saving Automated Driving through Roundabout Based on Deep Reinforcement Learning

    Authors: Henan Yuan, Penghui Li, Bart van Arem, Liujiang Kang, Yongqi Dong

    Abstract: Traffic scenarios in roundabouts pose substantial complexity for automated driving. Manually map** all possible scenarios into a state space is labor-intensive and challenging. Deep reinforcement learning (DRL) with its ability to learn from interacting with the environment emerges as a promising solution for training such automated driving models. This study explores, employs, and implements va… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 6 pages, 3 figures, under review by the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

  34. arXiv:2306.10494  [pdf, other

    eess.SP cs.AI

    Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction:A Multi-Dataset Study

    Authors: Rushuang Zhou, Lei Lu, Zijun Liu, Ting Xiang, Zhen Liang, David A. Clifton, Yining Dong, Yuan-Ting Zhang

    Abstract: Electrocardiography (ECG) is a non-invasive tool for predicting cardiovascular diseases (CVDs). Current ECG-based diagnosis systems show promising performance owing to the rapid development of deep learning techniques. However, the label scarcity problem, the co-occurrence of multiple CVDs and the poor performance on unseen datasets greatly hinder the widespread application of deep learning-based… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

  35. arXiv:2305.18807  [pdf

    eess.SY math.OC

    Design of the Reverse Logistics System for Medical Waste Recycling Part II: Route Optimization with Case Study under COVID-19 Pandemic

    Authors: Chaozhong Xue, Yongqi Dong, Jiaqi Liu, Yijun Liao, Lingbo Li

    Abstract: Medical waste recycling and treatment has gradually drawn concerns from the whole society, as the amount of medical waste generated is increasing dramatically, especially during the pandemic of COVID-19. To tackle the emerging challenges, this study designs a reverse logistics system architecture with three modules, i.e., medical waste classification & monitoring module, temporary storage & dispos… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 6 pages, 4 figures, under review by the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

  36. arXiv:2305.17271  [pdf

    cs.CV cs.AI cs.LG eess.IV

    Robust Lane Detection through Self Pre-training with Masked Sequential Autoencoders and Fine-tuning with Customized PolyLoss

    Authors: Ruohan Li, Yongqi Dong

    Abstract: Lane detection is crucial for vehicle localization which makes it the foundation for automated driving and many intelligent and advanced driving assistant systems. Available vision-based lane detection methods do not make full use of the valuable features and aggregate contextual information, especially the interrelationships between lane lines and other regions of the images in continuous frames.… ▽ More

    Submitted 11 August, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 12 pages, 8 figures, accepted by journal of IEEE Transactions on Intelligent Transportation Systems

  37. arXiv:2305.10666  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    A unified front-end framework for English text-to-speech synthesis

    Authors: Zelin Ying, Chen Li, Yu Dong, Qiuqiang Kong, Qiao Tian, Yuanyuan Huo, Yuxuan Wang

    Abstract: The front-end is a critical component of English text-to-speech (TTS) systems, responsible for extracting linguistic features that are essential for a text-to-speech model to synthesize speech, such as prosodies and phonemes. The English TTS front-end typically consists of a text normalization (TN) module, a prosody word prosody phrase (PWPP) module, and a grapheme-to-phoneme (G2P) module. However… ▽ More

    Submitted 25 March, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted in ICASSP 2024

  38. arXiv:2305.09512  [pdf, other

    cs.CV eess.IV

    Light-VQA: A Multi-Dimensional Quality Assessment Model for Low-Light Video Enhancement

    Authors: Yunlong Dong, Xiaohong Liu, Yixuan Gao, Xunchu Zhou, Tao Tan, Guangtao Zhai

    Abstract: Recently, Users Generated Content (UGC) videos becomes ubiquitous in our daily lives. However, due to the limitations of photographic equipments and techniques, UGC videos often contain various degradations, in which one of the most visually unfavorable effects is the underexposure. Therefore, corresponding video enhancement algorithms such as Low-Light Video Enhancement (LLVE) have been proposed… ▽ More

    Submitted 6 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  39. arXiv:2305.03172  [pdf, other

    eess.SY

    TelecomTM: A Fine-Grained and Ubiquitous Traffic Monitoring System Using Pre-Existing Telecommunication Fiber-Optic Cables as Sensors

    Authors: **gxiao Liu, Siyuan Yuan, Yiwen Dong, Biondo Biondi, Hae Young Noh

    Abstract: We introduce the TelecomTM system that uses pre-existing telecommunication fiber-optic cables as virtual strain sensors to sense vehicle-induced ground vibrations for fine-grained and ubiquitous traffic monitoring and characterization. Here we call it a virtual sensor because it is a software-based representation of a physical sensor. Due to the extensively installed telecommunication fiber-optic… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  40. arXiv:2305.00179  [pdf, other

    cs.IT eess.SP

    Integrated Sensing and Communications: Recent Advances and Ten Open Challenges

    Authors: Shihang Lu, Fan Liu, Yunxin Li, Kecheng Zhang, Hongjia Huang, Jiaqi Zou, Xinyu Li, Yuxiang Dong, Fuwang Dong, Jia Zhu, Yifeng Xiong, Weijie Yuan, Yuanhao Cui, Lajos Hanzo

    Abstract: It is anticipated that integrated sensing and communications (ISAC) would be one of the key enablers of next-generation wireless networks (such as beyond 5G (B5G) and 6G) for supporting a variety of emerging applications. In this paper, we provide a comprehensive review of the recent advances in ISAC systems, with a particular focus on their foundations, system design, networking aspects and ISAC… ▽ More

    Submitted 17 December, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: 26 pages, 22 figures, resubmitted to IEEE Journal. Appreciation for the outstanding contributions of coauthors in the paper!

  41. arXiv:2304.06496  [pdf, other

    eess.SP cs.HC cs.LG

    EEGMatch: Learning with Incomplete Labels for Semi-Supervised EEG-based Cross-Subject Emotion Recognition

    Authors: Rushuang Zhou, Weishan Ye, Zhiguo Zhang, Yanyang Luo, Li Zhang, Linling Li, Gan Huang, Yining Dong, Yuan-Ting Zhang, Zhen Liang

    Abstract: Electroencephalography (EEG) is an objective tool for emotion recognition and shows promising performance. However, the label scarcity problem is a main challenge in this field, which limits the wide application of EEG-based emotion recognition. In this paper, we propose a novel semi-supervised learning framework (EEGMatch) to leverage both labeled and unlabeled EEG data. First, an EEG-Mixup based… ▽ More

    Submitted 27 March, 2023; originally announced April 2023.

  42. arXiv:2303.09290  [pdf, other

    eess.IV

    VDPVE: VQA Dataset for Perceptual Video Enhancement

    Authors: Yixuan Gao, Yuqin Cao, Tengchuan Kou, Wei Sun, Yunlong Dong, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: Recently, many video enhancement methods have been proposed to improve video quality from different aspects such as color, brightness, contrast, and stability. Therefore, how to evaluate the quality of the enhanced video in a way consistent with human visual perception is an important research topic. However, most video quality assessment methods mainly calculate video quality by estimating the di… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  43. arXiv:2302.12428  [pdf

    eess.SY

    A holistically 3D-printed flexible millimeter-wave Doppler radar: Towards fully printed high-frequency multilayer flexible hybrid electronics systems

    Authors: Hong Tang, Yingjie Zhang, Bowen Zheng, Sensong An, Mohammad Haerinia, Yunxi Dong, Yi Huang, Wei Guo, Hualiang Zhang

    Abstract: Flexible hybrid electronics (FHE) is an emerging technology enabled through the integration of advanced semiconductor devices and 3D printing technology. It unlocks tremendous market potential by realizing low-cost flexible circuits and systems that can be conformally integrated into various applications. However, the operating frequencies of most reported FHE systems are relatively low. It is als… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    MSC Class: 78-05

  44. arXiv:2302.04961  [pdf

    eess.SY

    Design of the Reverse Logistics System for Medical Waste Recycling Part I: System Architecture and Disposal Site Selection Algorithm

    Authors: Chaozhong Xue, Yongqi Dong, Jiaqi Liu, Yijun Liao, Lingbo Li

    Abstract: With social progress and the development of modern medical technology, the amount of medical waste generated is increasing dramatically. The problem of medical waste recycling and treatment has gradually drawn concerns from the whole society. The sudden outbreak of the COVID-19 epidemic further brought new challenges. To tackle the challenges, this study proposes a reverse logistics system archite… ▽ More

    Submitted 27 May, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: 6 pages, 6 figures, submitted to and under review by the IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

  45. arXiv:2302.01728  [pdf, other

    eess.SY cs.DC

    Decentralised and Cooperative Control of Multi-Robot Systems through Distributed Optimisation

    Authors: Yi Dong, Zhongguo Li, Xingyu Zhao, Zhengtao Ding, Xiaowei Huang

    Abstract: Multi-robot cooperative control has gained extensive research interest due to its wide applications in civil, security, and military domains. This paper proposes a cooperative control algorithm for multi-robot systems with general linear dynamics. The algorithm is based on distributed cooperative optimisation and output regulation, and it achieves global optimum by utilising only information share… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted by AAMAS'23

  46. arXiv:2302.00810  [pdf, other

    eess.SP

    Beyond KNN: Deep Neighborhood Learning for WiFi-based Indoor Positioning Systems

    Authors: Yinhuan Dong, Francisco Zampella, Firas Alsehly

    Abstract: K-Neares Neighbors (KNN) and its variant weighted KNN (WKNN) have been explored for years in both academy and industry to provide stable and reliable performance in WiFi-based indoor positioning systems. Such algorithms estimate the location of a given point based on the locality information from the selected nearest WiFi neighbors according to some distance metrics calculated from the combination… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  47. arXiv:2212.03378  [pdf, other

    eess.SP physics.app-ph

    PigV$^2$: Monitoring Pig Vital Signs through Ground Vibrations Induced by Heartbeat and Respiration

    Authors: Yiwen Dong, Jesse R Codling, Gary Rohrer, Jeremy Miles, Sudhendu Sharma, Tami Brown-Brandl, Pei Zhang, Hae Young Noh

    Abstract: Pig vital sign monitoring (e.g., estimating the heart rate (HR) and respiratory rate (RR)) is essential to understand the stress level of the sow and detect the onset of parturition. It helps to maximize peri-natal survival and improve animal well-being in swine production. The existing approach mainly relies on manual measurement, which is labor-intensive and only provides a few points of informa… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: 7 pages, 9 figures

  48. GaitVibe+: Enhancing Structural Vibration-based Footstep Localization Using Temporary Cameras for In-home Gait Analysis

    Authors: Yiwen Dong, **gxiao Liu, Hae Young Noh

    Abstract: In-home gait analysis is important for providing early diagnosis and adaptive treatments for individuals with gait disorders. Existing systems include wearables and pressure mats, but they have limited scalability. Recent studies have developed vision-based systems to enable scalable, accurate in-home gait analysis, but it faces privacy concerns due to the exposure of people's appearances. Our pri… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: 7 pages, 7 figures

    ACM Class: J.3

  49. arXiv:2211.06891  [pdf, other

    eess.IV cs.CV

    Residual Degradation Learning Unfolding Framework with Mixing Priors across Spectral and Spatial for Compressive Spectral Imaging

    Authors: Yubo Dong, Dahua Gao, Tian Qiu, Yuyan Li, Minxi Yang, Guangming Shi

    Abstract: To acquire a snapshot spectral image, coded aperture snapshot spectral imaging (CASSI) is proposed. A core problem of the CASSI system is to recover the reliable and fine underlying 3D spectral cube from the 2D measurement. By alternately solving a data subproblem and a prior subproblem, deep unfolding methods achieve good performance. However, in the data subproblem, the used sensing matrix is il… ▽ More

    Submitted 15 November, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: CVPR 2023

  50. Joint Receiver Design for Integrated Sensing and Communications

    Authors: Yuxiang Dong, Fan Liu, Yifeng Xiong

    Abstract: In this letter, we investigate the joint receiver design for integrated sensing and communication (ISAC) systems, where the communication signal and the target echo signal are simultaneously received and processed to achieve a balanced performance between both functionalities. In particular, we proposed two design schemes to solve the joint sensing and communication problem of receive signal proce… ▽ More

    Submitted 5 May, 2023; v1 submitted 10 November, 2022; originally announced November 2022.