
Showing 1–50 of 130 results for author: Liang, Y

Searching in archive eess.
  1. arXiv:2406.18549  [pdf]

    eess.IV cs.CV

    Advancements in Feature Extraction Recognition of Medical Imaging Systems Through Deep Learning Technique

    Authors: Qishi Zhan, Dan Sun, Erdi Gao, Yuhan Ma, Yaxin Liang, Haowei Yang

    Abstract: This study introduces a novel unsupervised medical image feature extraction method that employs spatial stratification techniques. An objective function based on weight is proposed to achieve the purpose of fast image recognition. The algorithm divides the pixels of the image into multiple subdomains and uses a quadtree to access the image. A technique for threshold optimization utilizing a simple…

    Submitted 23 May, 2024; originally announced June 2024.

    Comments: conference

  2. arXiv:2406.16381  [pdf, other]

    eess.SP

    Polar-Coded Tensor-Based Unsourced Random Access with Soft Decoding

    Authors: Jiaqi Fang, Yan Liang, Gangle Sun, Hongwei Hou, Yafei Wang, Li You, Wenjin Wang

    Abstract: The unsourced random access (URA) has emerged as a viable scheme for supporting the massive machine-type communications (mMTC) in the sixth generation (6G) wireless networks. Notably, the tensor-based URA (TURA), with its inherent tensor structure, stands out by simultaneously enhancing performance and reducing computational complexity for the multi-user separation, especially in mMTC networks wit…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2406.13358  [pdf, other]

    cs.CV eess.IV

    Multi-scale Restoration of Missing Data in Optical Time-series Images with Masked Spatial-Temporal Attention Network

    Authors: Zaiyan Zhang, Jining Yan, Yuanqi Liang, Jiaxin Feng, Haixu He, Wei Han

    Abstract: Due to factors such as thick cloud cover and sensor limitations, remote sensing images often suffer from significant missing data, resulting in incomplete time-series information. Existing methods for imputing missing values in remote sensing images do not fully exploit spatio-temporal auxiliary information, leading to limited accuracy in restoration. Therefore, this paper proposes a novel deep le…

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.11163  [pdf, other]

    eess.SP

    Explainable Bayesian Recurrent Neural Smoother to Capture Global State Evolutionary Correlations

    Authors: Shi Yan, Yan Liang, Huayu Zhang, Le Zheng, Difan Zou, Binglu Wang

    Abstract: Through integrating the evolutionary correlations across global states in the bidirectional recursion, an explainable Bayesian recurrent neural smoother (EBRNS) is proposed for offline data-assisted fixed-interval state smoothing. At first, the proposed model, containing global states in the evolutionary interval, is transformed into an equivalent model with bidirectional memory. This transformati…

    Submitted 16 June, 2024; originally announced June 2024.

  5. arXiv:2406.08837  [pdf]

    eess.IV cs.CV cs.LG

    Research on Deep Learning Model of Feature Extraction Based on Convolutional Neural Network

    Authors: Houze Liu, Iris Li, Yaxin Liang, Dan Sun, Yining Yang, Haowei Yang

    Abstract: Neural networks with relatively shallow layers and simple structures may have limited ability in accurately identifying pneumonia. In addition, deep neural networks also have a large demand for computing resources, which may cause convolutional neural networks to be unable to be implemented on terminals. Therefore, this paper will carry out the optimal classification of convolutional neural networ…

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2404.17175  [pdf, ps, other]

    cs.IT eess.SP

    Over-the-Air Modulation for RIS-assisted Symbiotic Radios: Design, Analysis, and Optimization

    Authors: Hu Zhou, Ying-Chang Liang, Chau Yuen

    Abstract: In reconfigurable intelligent surface (RIS)-assisted symbiotic radio (SR), an RIS is exploited to assist the primary system and to simultaneously operate as a secondary transmitter by modulating its own information over the incident primary signal from the air. Such an operation is called over-the-air modulation. The existing modulation schemes such as on-off keying and binary phase-shift keying s…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 13 pages, 9 figures

  7. arXiv:2404.16611  [pdf, ps, other]

    cs.IT eess.SP

    Towards Symbiotic SAGIN Through Inter-operator Resource and Service Sharing: Joint Orchestration of User Association and Radio Resources

    Authors: Shizhao He, Jungang Ge, Ying-Chang Liang, Dusit Niyato

    Abstract: The space-air-ground integrated network (SAGIN) is a pivotal architecture to support ubiquitous connectivity in the upcoming 6G era. Inter-operator resource and service sharing is a promising way to realize such a huge network, utilizing resources efficiently and reducing construction costs. Given the rationality of operators, the configuration of resources and services in SAGIN should focus on bo…

    Submitted 25 April, 2024; originally announced April 2024.

  8. arXiv:2404.06393  [pdf, other]

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal…

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  9. arXiv:2403.20058  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Revolutionizing Disease Diagnosis with simultaneous functional PET/MR and Deeply Integrated Brain Metabolic, Hemodynamic, and Perfusion Networks

    Authors: Luoyu Wang, Yitian Tao, Qing Yang, Yan Liang, Siwei Liu, Hongcheng Shi, Dinggang Shen, Han Zhang

    Abstract: Simultaneous functional PET/MR (sf-PET/MR) presents a cutting-edge multimodal neuroimaging technique. It provides an unprecedented opportunity for concurrently monitoring and integrating multifaceted brain networks built by spatiotemporally covaried metabolic activity, neural activity, and cerebral blood flow (perfusion). Albeit high scientific/clinical values, short in hardware accessibility of P…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 11 pages

  10. arXiv:2403.16472  [pdf, ps, other]

    cs.IT eess.SP

    Power-Aware Sparse Reflect Beamforming in Active RIS-aided Interference Channels

    Authors: Ruizhe Long, Hu Zhou, Ying-Chang Liang

    Abstract: Active reconfigurable intelligent surface (RIS) has attracted significant attention in wireless communications, due to its reflecting elements (REs) capable of reflecting incident signals with not only phase shifts but also amplitude amplifications. In this paper, we are interested in active RIS-aided interference channels in which $K$ user pairs share the same time and frequency resources with th…

    Submitted 29 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  11. arXiv:2403.15139  [pdf, other]

    cs.CV eess.IV

    Deep Generative Model based Rate-Distortion for Image Downscaling Assessment

    Authors: Yuanbang Liang, Bhavesh Garg, Paul L Rosin, Yipeng Qin

    Abstract: In this paper, we propose Image Downscaling Assessment by Rate-Distortion (IDA-RD), a novel measure to quantitatively evaluate image downscaling algorithms. In contrast to image-based methods that measure the quality of downscaled images, ours is process-based that draws ideas from rate-distortion theory to measure the distortion incurred during downscaling. Our main idea is that downscaling and s…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  12. arXiv:2403.06700  [pdf, other]

    eess.IV

    Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression

    Authors: Zhi Cao, Youneng Bao, Fanyang Meng, Chao Li, Wen Tan, Genhong Wang, Yongsheng Liang

    Abstract: Deep neural network-based image compression (NIC) has achieved excellent performance, but NIC method models have been shown to be susceptible to backdoor attacks. Adversarial training has been validated in image compression models as a common method to enhance model robustness. However, the improvement effect of adversarial training on model robustness is limited. In this paper, we propose a prior…

    Submitted 15 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  13. arXiv:2403.04116  [pdf, other]

    eess.IV cs.CV

    Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis

    Authors: Yuanhao Cai, Yixun Liang, Jiahao Wang, Angtian Wang, Yulun Zhang, Xiaokang Yang, Zongwei Zhou, Alan Yuille

    Abstract: X-ray is widely applied for transmission imaging due to its stronger penetration than natural light. When rendering novel view X-ray projections, existing methods mainly based on NeRF suffer from long training time and slow inference speed. In this paper, we propose a 3D Gaussian splatting-based framework, namely X-Gaussian, for X-ray novel view synthesis. Firstly, we redesign a radiative Gaussian…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: The first 3D Gaussian Splatting-based method for X-ray 3D reconstruction

  14. arXiv:2402.16153  [pdf, other]

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    ChatMusician: Understanding and Generating Music Intrinsically with LLM

    Authors: Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, Jingcheng Wu, Chenghua Lin, Qifeng Liu, et al. (10 additional authors not shown)

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities in text generation, we find that their ability has yet to be generalized to music, humanity's creative language. We introduce ChatMusician, an open-source LLM that integrates intrinsic musical abilities. It is based on continual pre-training and finetuning LLaMA2 on a text-compatible music representation, ABC notation, and the…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: GitHub: https://shanghaicannon.github.io/ChatMusician/

  15. arXiv:2402.13901  [pdf, other]

    cs.LG eess.SP stat.ML

    Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate

    Authors: Yuchen Liang, Peizhong Ju, Yingbin Liang, Ness Shroff

    Abstract: The denoising diffusion model has recently emerged as a powerful generative technique that converts noise into data. While there are many studies providing theoretical guarantees for diffusion processes based on discretized stochastic differential equation (D-SDE), many generative samplers in real applications directly employ a discrete-time (DT) diffusion process. However, there are very few stud…

    Submitted 30 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  16. arXiv:2402.13776  [pdf, other]

    eess.IV cs.CV cs.LG

    Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion

    Authors: Lianghu Guo, Tianli Tao, Xinyi Cai, Zihao Zhu, Jiawei Huang, Lixuan Zhu, Zhuoyang Gu, Haifeng Tang, Rui Zhou, Siyan Han, Yan Liang, Qing Yang, Dinggang Shen, Han Zhang

    Abstract: Early infancy is a rapid and dynamic neurodevelopmental period for behavior and neurocognition. Longitudinal magnetic resonance imaging (MRI) is an effective tool to investigate such a crucial stage by capturing the developmental trajectories of the brain structures. However, longitudinal MRI acquisition always meets a serious data-missing problem due to participant dropout and failed scans, makin…

    Submitted 21 February, 2024; originally announced February 2024.

  17. arXiv:2402.04865  [pdf, other]

    eess.SP

    Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach

    Authors: Yang Cao, Shao-Yu Lien, Ying-Chang Liang, Dusit Niyato, Xuemin Shen

    Abstract: Constructing earth-fixed cells with low-earth orbit (LEO) satellites in non-terrestrial networks (NTNs) has been the most promising paradigm to enable global coverage. The limited computing capabilities on LEO satellites however render tackling resource optimization within a short duration a critical challenge. Although the sufficient computing capabilities of the ground infrastructures can be uti…

    Submitted 7 February, 2024; originally announced February 2024.

  18. arXiv:2402.04584  [pdf, other]

    eess.IV cs.CV

    Troublemaker Learning for Low-Light Image Enhancement

    Authors: Yinghao Song, Zhiyuan Cao, Wanhong Xiang, Sifan Long, Bo Yang, Hongwei Ge, Yanchun Liang, Chunguo Wu

    Abstract: Low-light image enhancement (LLIE) restores the color and brightness of underexposed images. Supervised methods suffer from high costs in collecting low/normal-light image pairs. Unsupervised methods invest substantial effort in crafting complex loss functions. We address these two challenges through the proposed TroubleMaker Learning (TML) strategy, which employs normal-light images as inputs for…

    Submitted 2 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  19. arXiv:2402.04056  [pdf, other]

    eess.SP

    Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

    Authors: Yang Cao, Shao-Yu Lien, Ying-Chang Liang, Dusit Niyato, Xuemin Shen

    Abstract: Non-terrestrial networks (NTNs) with low-earth orbit (LEO) satellites have been regarded as promising remedies to support global ubiquitous wireless services. Due to the rapid mobility of LEO satellite, inter-beam/satellite handovers happen frequently for a specific user equipment (UE). To tackle this issue, earth-fixed cell scenarios have been under studied, in which the LEO satellite adjusts its…

    Submitted 6 February, 2024; originally announced February 2024.

  20. arXiv:2402.03302  [pdf, other]

    eess.IV cs.CV cs.LG

    Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining

    Authors: Jiarun Liu, Hao Yang, Hong-Yu Zhou, Yan Xi, Lequan Yu, Yizhou Yu, Yong Liang, Guangming Shi, Shaoting Zhang, Hairong Zheng, Shanshan Wang

    Abstract: Accurate medical image segmentation demands the integration of multi-scale information, spanning from local features to global dependencies. However, it is challenging for existing methods to model long-range global information, where convolutional neural networks (CNNs) are constrained by their local receptive fields, and vision transformers (ViTs) suffer from high quadratic complexity of their a…

    Submitted 6 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Code and models of Swin-UMamba are publicly available at: https://github.com/JiarunLiu/Swin-UMamba

  21. arXiv:2402.02775  [pdf]

    physics.optics eess.IV physics.bio-ph

    Instant square lattice structured illumination microscopy: an optimal strategy towards photon-saving and real-time super-resolution observation

    Authors: Tianyu Zhao, Zhaojun Wang, Manming Shu, Jingxiang Zhang, Yansheng Liang, Shaowei Wang, Ming Lei

    Abstract: Over the past decade, structured illumination microscopy (SIM) has found its niche in super-resolution (SR) microscopy due to its fast imaging speed and low excitation intensity. However, due to the significantly higher light dose compared to wide-field microscopy and the time-consuming post-processing procedures, long-term, real-time, super-resolution observation of living cells is still out of r…

    Submitted 5 February, 2024; originally announced February 2024.

  22. arXiv:2401.10544  [pdf, other]

    cs.SD cs.AI eess.AS

    AAT: Adapting Audio Transformer for Various Acoustics Recognition Tasks

    Authors: Yun Liang, Hai Lin, Shaojian Qiu, Yihang Zhang

    Abstract: Recently, Transformers have been introduced into the field of acoustics recognition. They are pre-trained on large-scale datasets using methods such as supervised learning and semi-supervised learning, demonstrating robust generality--It fine-tunes easily to downstream tasks and shows more robust performance. However, the predominant fine-tuning method currently used is still full fine-tuning, whi…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: Preprint version for ICASSP 2024, Korea

  23. arXiv:2401.03497  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

    Authors: Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen

    Abstract: Audio self-supervised learning (SSL) pre-training, which aims to learn good representations from unlabeled audio, has made remarkable progress. However, the extensive computational demands during pre-training pose a significant barrier to the potential application and optimization of audio SSL models. In this paper, inspired by the success of data2vec 2.0 in image modality and Audio-MAE in audio m…

    Submitted 7 January, 2024; originally announced January 2024.

  24. arXiv:2401.00475  [pdf, other]

    cs.SD eess.AS

    E-chat: Emotion-sensitive Spoken Dialogue System with Large Language Models

    Authors: Hongfei Xue, Yuhao Liang, Bingshen Mu, Shiliang Zhang, Mengzhe Chen, Qian Chen, Lei Xie

    Abstract: This study focuses on emotion-sensitive spoken dialogue in human-machine speech interaction. With the advancement of Large Language Models (LLMs), dialogue systems can handle multimodal data, including audio. Recent models have enhanced the understanding of complex audio signals through the integration of various audio events. However, they are unable to generate appropriate responses based on emo…

    Submitted 6 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: 6 pages, 3 figures

  25. arXiv:2312.08097  [pdf, ps, other]

    eess.SP

    Hierarchical Cognitive Spectrum Sharing in Space-Air-Ground Integrated Networks

    Authors: Zizhen Zhou, Qianqian Zhang, Jungang Ge, Ying-Chang Liang

    Abstract: In space-air-ground integrated networks (SAGINs), cognitive spectrum sharing has been regarded as a promising solution to improve spectrum efficiency by enabling a secondary network to access the spectrum of a primary network. However, different networks in SAGIN may have different quality of service (QoS) requirements, which can not be well satisfied with the traditional cognitive spectrum sharin…

    Submitted 13 December, 2023; originally announced December 2023.

  26. arXiv:2311.16568  [pdf, ps, other]

    cs.IT eess.SP

    Active Reconfigurable Intelligent Surface Enhanced Spectrum Sensing for Cognitive Radio Networks

    Authors: Jungang Ge, Ying-Chang Liang, Sumei Sun, Yonghong Zeng, Zhidong Bai

    Abstract: In opportunistic cognitive radio networks, when the primary signal is very weak compared to the background noise, the secondary user requires long sensing time to achieve a reliable spectrum sensing performance, leading to little remaining time for the secondary transmission. To tackle this issue, we propose an active reconfigurable intelligent surface (RIS) assisted spectrum sensing system, where…

    Submitted 26 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  27. arXiv:2311.15128  [pdf, other]

    math.ST eess.SP

    Quickest Change Detection with Post-Change Density Estimation

    Authors: Yuchen Liang, Venugopal V. Veeravalli

    Abstract: The problem of quickest change detection in a sequence of independent observations is considered. The pre-change distribution is assumed to be known, while the post-change distribution is unknown. Two tests based on post-change density estimation are developed for this problem, the window-limited non-parametric generalized likelihood ratio (NGLR) CuSum test and the non-parametric window-limited ad…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2211.00223

  28. arXiv:2311.04791  [pdf, other]

    eess.SP

    Integrated Distributed Semantic Communication and Over-the-air Computation for Cooperative Spectrum Sensing

    Authors: Peng Yi, Yang Cao, Xin Kang, Ying-Chang Liang

    Abstract: Cooperative spectrum sensing (CSS) is a promising approach to improve the detection of primary users (PUs) using multiple sensors. However, there are several challenges for existing combination methods, i.e., performance degradation and ceiling effect for hard-decision fusion (HDF), as well as significant uploading latency and non-robustness to noise in the reporting channel for soft-data fusion (…

    Submitted 25 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 13 pages, 10 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  29. Channel Estimation and Training Design for Active RIS Aided Wireless Communications

    Authors: Hao Chen, Nanxi Li, Ruizhe Long, Ying-Chang Liang

    Abstract: Active reconfigurable intelligent surface (ARIS) is a newly emerging RIS technique that leverages radio frequency (RF) reflection amplifiers to empower phase-configurable reflection elements (REs) in amplifying the incident signal. Thereby, ARIS can enhance wireless communications with the strengthened ARIS-aided links. In this letter, we propose exploiting the signal amplification capability of A…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted for publication in IEEE Wireless Communications Letters

    Journal ref: IEEE Wireless Communications Letters, early access, 2023

  30. Pilot Design and Signal Detection for Symbiotic Radio over OFDM Carriers

    Authors: Hao Chen, Qianqian Zhang, Ruizhe Long, Yiyang Pei, Ying-Chang Liang

    Abstract: Symbiotic radio (SR) is a promising solution to achieve high spectrum- and energy-efficiency due to its spectrum sharing and low-power consumption properties, in which the secondary system achieves data transmissions by backscattering the signal originating from the primary system. In this paper, we are interested in the pilot design and signal detection when the primary transmission adopts orthog…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted for publication in IEEE Transactions on Wireless Communications

    Journal ref: IEEE Transactions on Wireless Communications, early access, 2023

  31. arXiv:2311.02837  [pdf, ps, other]

    cs.IT eess.SP

    Multi-User Multi-IoT-Device Symbiotic Radio: A Novel Massive Access Scheme for Cellular IoT

    Authors: Jun Wang, Ying-Chang Liang, Sumei Sun

    Abstract: Symbiotic radio (SR) is a promising technique to support cellular Internet-of-Things (IoT) by forming a mutualistic relationship between IoT and cellular transmissions. In this paper, we propose a novel multi-user multi-IoT-device SR system to enable massive access in cellular IoT. In the considered system, the base station (BS) transmits information to multiple cellular users, and a number of IoT…

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 13 pages, 12 figures, Conference J. Wang and Y.-C. Liang, Transmit beamforming design for multiuser multi-IoT-device symbiotic radios, in Proc. IEEE ICC, Rome, Italy, May 2023, pp. 1-6

  32. arXiv:2311.01167  [pdf, ps, other]

    cs.IT eess.SP

    Modulation Design and Optimization for RIS-Assisted Symbiotic Radios

    Authors: Hu Zhou, Bowen Cai, Qianqian Zhang, Ruizhe Long, Yiyang Pei, Ying-Chang Liang

    Abstract: In reconfigurable intelligent surface (RIS)-assisted symbiotic radio (SR), the RIS acts as a secondary transmitter by modulating its information bits over the incident primary signal and simultaneously assists the primary transmission, then a cooperative receiver is used to jointly decode the primary and secondary signals. Most existing works of SR focus on using RIS to enhance the reflecting link…

    Submitted 26 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 16 pages, 16 figures

  33. arXiv:2310.17187  [pdf, other]

    eess.SP

    Explainable Gated Bayesian Recurrent Neural Network for Non-Markov State Estimation

    Authors: Shi Yan, Yan Liang, Le Zheng, Mingyang Fan, Xiaoxu Wang, Binglu Wang

    Abstract: The optimality of Bayesian filtering relies on the completeness of prior models, while deep learning holds a distinct advantage in learning models from offline data. Nevertheless, the current fusion of these two methodologies remains largely ad hoc, lacking a theoretical foundation. This paper presents a novel solution, namely an explainable gated Bayesian recurrent neural network specifically des…

    Submitted 7 March, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  34. arXiv:2310.04863  [pdf, other]

    cs.SD eess.AS

    SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR

    Authors: Yangze Li, Fan Yu, Yuhao Liang, Pengcheng Guo, Mohan Shi, Zhihao Du, Shiliang Zhang, Lei Xie

    Abstract: Joint modeling of multi-speaker ASR and speaker diarization has recently shown promising results in speaker-attributed automatic speech recognition (SA-ASR). Although being able to obtain state-of-the-art (SOTA) performance, most of the studies are based on an autoregressive (AR) decoder which generates tokens one-by-one and results in a large real-time factor (RTF). To speed up inference, we intro…

    Submitted 7 October, 2023; originally announced October 2023.

  35. arXiv:2309.13573  [pdf, other]

    cs.SD eess.AS

    The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR

    Authors: Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu

    Abstract: With the success of the first Multi-channel Multi-party Meeting Transcription challenge (M2MeT), the second M2MeT challenge (M2MeT 2.0) held in ASRU2023 particularly aims to tackle the complex task of speaker-attributed ASR (SA-ASR), which directly addresses the practical and challenging problem of "who spoke what at when" at typical meeting scenario. We particularly established two sub-tr…

    Submitted 5 October, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: 8 pages, Accepted by ASRU2023

  36. arXiv:2309.02855  [pdf, other]

    cs.CV eess.IV

    Bandwidth-efficient Inference for Neural Image Compression

    Authors: Shanzhi Yin, Tongda Xu, Yongsheng Liang, Yuanyuan Wang, Yanghao Li, Yan Wang, Jingjing Liu

    Abstract: With neural networks growing deeper and feature maps growing larger, limited communication bandwidth with external memory (or DRAM) and power constraints become a bottleneck in implementing network inference on mobile and edge devices. In this paper, we propose an end-to-end differentiable bandwidth efficient neural inference method with the activation compressed by neural data compression method.…

    Submitted 6 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 9 pages, 6 figures, submitted to ICASSP 2024

    MSC Class: 68U10 (primary); 94A08, 68T07 (secondary)

    ACM Class: I.2.6; I.4.2

  37. Structure-Aware Parametric Representations for Time-Resolved Light Transport

    Authors: Diego Royo, Zesheng Huang, Yun Liang, Boyan Song, Adolfo Muñoz, Diego Gutierrez, Julio Marco

    Abstract: Time-resolved illumination provides rich spatio-temporal information for applications such as accurate depth sensing or hidden geometry reconstruction, becoming a useful asset for prototyping and as input for data-driven approaches. However, time-resolved illumination measurements are high-dimensional and have a low signal-to-noise ratio, hampering their applicability in real scenarios. We propose…

    Submitted 30 August, 2023; originally announced August 2023.

  38. Distributed Extended Object Tracking Using Coupled Velocity Model from WLS Perspective

    Authors: Zhifei Li, Yan Liang, Linfeng Xu

    Abstract: This study proposes a coupled velocity model (CVM) that establishes the relation between the orientation and velocity using their correlation, avoiding that the existing extended object tracking (EOT) models treat them as two independent quantities. As a result, CVM detects the mismatch between the prior dynamic model and actual motion pattern to correct the filtering gain, and simultaneously beco…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Corrected Version

    Journal ref: Published by IEEE Transactions on Signal and Information Processing over Networks, 2022

  39. arXiv:2308.04743  [pdf]

    eess.SY cs.RO math.DS

    Missile guidance law design based on free-time convergent error dynamics

    Authors: Yuanhe Liu, Nianhao Xie, Kebo Li, Yangang Liang

    Abstract: The design of guidance law can be considered a kind of finite-time error-tracking problem. A unified free-time convergent guidance law design approach based on the error dynamics and the free-time convergence method is proposed in this paper. Firstly, the desired free-time convergent error dynamics approach is proposed, and its convergent time can be set freely, which is independent of the initial…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 13 pages, 6 figures, accepted by Journal of Systems Engineering and Electronics

  40. arXiv:2307.12255  [pdf, other]

    eess.IV cs.CV cs.LG

    ResWCAE: Biometric Pattern Image Denoising Using Residual Wavelet-Conditioned Autoencoder

    Authors: Youzhi Liang, Wen Liang

    Abstract: The utilization of biometric authentication with pattern images is increasingly popular in compact Internet of Things (IoT) devices. However, the reliability of such systems can be compromised by image quality issues, particularly in the presence of high levels of noise. While state-of-the-art deep learning algorithms designed for generic image denoising have shown promise, their large number of p…

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: 8 pages, 2 figures

  41. arXiv:2306.15433  [pdf, other]

    eess.SP

    Recursive LMMSE-Based Iterative Soft Interference Cancellation for MIMO Systems to Save Computations and Memories

    Authors: Hufei Zhu, Fuqin Deng, Yikui Zhai, Jiaming Zhong, Yanyang Liang

    Abstract: Firstly, a reordered description is given for the linear minimum mean square error (LMMSE)-based iterative soft interference cancellation (ISIC) detection process for multiple-input multiple-output (MIMO) wireless communication systems, which is based on the equivalent channel matrix. Then the above reordered description is applied to compare the detection process for LMMSE-ISIC with that for the ha…

    Submitted 5 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  42. arXiv:2306.10125  [pdf, other]

    cs.LG cs.AI eess.SP stat.AP

    Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects

    Authors: Kexin Zhang, Qingsong Wen, Chaoli Zhang, Rongyao Cai, Ming Jin, Yong Liu, James Zhang, Yuxuan Liang, Guansong Pang, Dongjin Song, Shirui Pan

    Abstract: Self-supervised learning (SSL) has recently achieved impressive performance on various time series tasks. The most prominent advantage of SSL is that it reduces the dependence on labeled data. Based on the pre-training and fine-tuning strategy, even a small amount of labeled data can achieve high performance. Compared with many published self-supervised surveys on computer vision and natural langu…

    Submitted 8 April, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI); 26 pages, 200+ references; the first work to comprehensively and systematically summarize self-supervised learning for time series analysis (SSL4TS). The GitHub repository is https://github.com/qingsongedu/Awesome-SSL4TS

  43. arXiv:2306.07505  [pdf]

    q-bio.TO eess.IV

    Deep learning radiomics for assessment of gastroesophageal varices in people with compensated advanced chronic liver disease

    Authors: Lan Wang, Ruiling He, Lili Zhao, Jia Wang, Zhengzi Geng, Tao Ren, Guo Zhang, Peng Zhang, Kaiqiang Tang, Chaofei Gao, Fei Chen, Liting Zhang, Yonghe Zhou, Xin Li, Fanbin He, Hui Huan, Wenjuan Wang, Yunxiao Liang, Juan Tang, Fang Ai, Tingyu Wang, Liyun Zheng, Zhongwei Zhao, Jiansong Ji, Wei Liu, et al. (22 additional authors not shown)

    Abstract: Objective: Bleeding from gastroesophageal varices (GEV) is a medical emergency associated with high mortality. We aim to construct an artificial intelligence-based model of two-dimensional shear wave elastography (2D-SWE) of the liver and spleen to precisely assess the risk of GEV and high-risk gastroesophageal varices (HRV). Design: A prospective multicenter study was conducted in patients with…

    Submitted 12 June, 2023; originally announced June 2023.

  44. arXiv:2306.02682  [pdf, other]

    cs.CL eess.AS

    End-to-End Word-Level Pronunciation Assessment with MASK Pre-training

    Authors: Yukang Liang, Kaitao Song, Shaoguang Mao, Huiqiang Jiang, Luna Qiu, Yuqing Yang, Dongsheng Li, Linli Xu, Lili Qiu

    Abstract: Pronunciation assessment is a major challenge in the computer-aided pronunciation training system, especially at the word (phoneme)-level. To obtain word (phoneme)-level scores, current methods usually rely on aligning components to obtain acoustic features of each word (phoneme), which limits the performance of assessment to the accuracy of alignments. Therefore, to address this problem, we propo…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by InterSpeech 2023

  45. arXiv:2305.13716  [pdf, other]

    cs.SD cs.CL eess.AS

    BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR

    Authors: Yuhao Liang, Fan Yu, Yangze Li, Pengcheng Guo, Shiliang Zhang, Qian Chen, Lei Xie

    Abstract: The recently proposed serialized output training (SOT) simplifies multi-talker automatic speech recognition (ASR) by generating speaker transcriptions separated by a special token. However, frequent speaker changes can make speaker change prediction difficult. To address this, we propose boundary-aware serialized output training (BA-SOT), which explicitly incorporates boundary knowledge into the d…

    Submitted 5 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by INTERSPEECH 2023

  46. arXiv:2304.02398  [pdf, ps, other]

    cs.IT eess.SP

    Robust Secure Transmission for Active RIS Enabled Symbiotic Radio Multicast Communications

    Authors: Bin Lyu, Chao Zhou, Shimin Gong, Dinh Thai Hoang, Ying-chang Liang

    Abstract: In this paper, we propose a robust secure transmission scheme for an active reconfigurable intelligent surface (RIS) enabled symbiotic radio (SR) system in the presence of multiple eavesdroppers (Eves). In the considered system, the active RIS is adopted to enable the secure transmission of primary signals from the primary transmitter to multiple primary users in a multicasting manner, and simulta…

    Submitted 13 April, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: 32 Pages, 12 figures, accepted to IEEE Transactions on Wireless Communications

  47. arXiv:2303.15299  [pdf, other]

    eess.SY cs.AI

    Resilient Output Consensus Control of Heterogeneous Multi-agent Systems against Byzantine Attacks: A Twin Layer Approach

    Authors: Xin Gong, Yiwen Liang, Yukang Cui, Shi Liang, Tingwen Huang

    Abstract: This paper studies the problem of cooperative control of heterogeneous multi-agent systems (MASs) against Byzantine attacks. The agent affected by Byzantine attacks sends different wrong values to all neighbors while applying wrong input signals for itself, which is aggressive and difficult to be defended. Inspired by the concept of Digital Twin, a new hierarchical protocol equipped with a virtual…

    Submitted 22 March, 2023; originally announced March 2023.

  48. arXiv:2303.14082  [pdf, ps, other]

    cs.IT eess.SP

    Deep Reinforcement Learning for Distributed Dynamic Coordinated Beamforming in Massive MIMO Cellular Networks

    Authors: Jungang Ge, Ying-Chang Liang, Liao Zhang, Ruizhe Long, Sumei Sun

    Abstract: To accommodate the explosive wireless traffics, massive multiple-input multiple-output (MIMO) is regarded as one of the key enabling technologies for next-generation communication systems. In massive MIMO cellular networks, coordinated beamforming (CBF), which jointly designs the beamformers of multiple base stations (BSs), is an efficient method to enhance the network performance. In this paper,…

    Submitted 24 March, 2023; originally announced March 2023.

  49. arXiv:2303.13760  [pdf, ps, other]

    cs.IT eess.SY

    Multiple Access Design for Symbiotic Radios: Facilitating Massive IoT Connections with Cellular Networks

    Authors: Jun Wang, Xiangyu Ding, Qianqian Zhang, Ying-Chang Liang

    Abstract: Symbiotic radio (SR) has emerged as a spectrum- and energy-efficient paradigm to support massive Internet of Things (IoT) connections. Two multiple access schemes are proposed in this paper to facilitate the massive IoT connections using the cellular network based on the SR technique, namely, the simultaneous access (SA) scheme and the selection diversity access (SDA) scheme. In the SA scheme, the…

    Submitted 23 March, 2023; originally announced March 2023.

  50. arXiv:2303.11413  [pdf, other]

    eess.SP cs.AI cs.LG

    Structural Vibration Signal Denoising Using Stacking Ensemble of Hybrid CNN-RNN

    Authors: Youzhi Liang, Wen Liang, Jianguo Jia

    Abstract: Vibration signals have been increasingly utilized in various engineering fields for analysis and monitoring purposes, including structural health monitoring, fault diagnosis and damage detection, where vibration signals can provide valuable information about the condition and integrity of structures. In recent years, there has been a growing trend towards the use of vibration signals in the field…

    Submitted 22 July, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: 10 pages, 4 figures