Skip to main content

Showing 1–50 of 141 results for author: Yuan, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.18592  [pdf, ps, other

    eess.SP

    On the Coexistence of OTFS Modulation with OFDM-based Communication Systems

    Authors: Akram Shafie, **hong Yuan, Paul Fitzpatrick, Taka Sakurai, Yuting Fang

    Abstract: We investigate the coexistence of orthogonal time-frequency space (OTFS) modulation with current fourth- and fifth-generation (4G/5G) communication systems that primarily use orthogonal frequency-division multiplexing (OFDM) waveforms. We first derive the input-output-relation of OTFS in the considered coexisting system. In this derivation, we consider (i) the inclusion of multiple cyclic prefixes… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: This paper has been submitted for publication in an IEEE Journal. Copyright may be transferred without notice, after which this version may no longer be accessible. arXiv admin note: text overlap with arXiv:2311.06850

  2. arXiv:2406.18548  [pdf

    eess.IV cs.CV

    Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis

    Authors: Yuxiang Hu, Haowei Yang, Ting Xu, Shuyao He, Jiajie Yuan, Haozhang Deng

    Abstract: The diagnosis of brain cancer relies heavily on medical imaging techniques, with MRI being the most commonly used. It is necessary to perform automatic segmentation of brain tumors on MRI images. This project intends to build an MRI algorithm based on U-Net. The residual network and the module used to enhance the context information are combined, and the void space convolution pooling pyramid is a… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

  3. arXiv:2406.07410  [pdf, other

    eess.AS

    Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

    Authors: Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

    Abstract: We uncover an underlying bias present in the audio recordings produced from the picture description task of the Pitt corpus, the largest publicly accessible database for Alzheimer's Disease (AD) detection research. Even by solely utilizing the silent segments of these audio recordings, we achieve nearly 100% accuracy in AD detection. However, employing the same methods to other datasets and prepro… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  4. arXiv:2406.04776  [pdf, ps, other

    eess.SP cs.AI

    OFDM-Standard Compatible SC-NOFS Waveforms for Low-Latency and Jitter-Tolerance Industrial IoT Communications

    Authors: Tongyang Xu, Shuangyang Li, **hong Yuan

    Abstract: Traditional communications focus on regular and orthogonal signal waveforms for simplified signal processing and improved spectral efficiency. In contrast, the next-generation communications would aim for irregular and non-orthogonal signal waveforms to introduce new capabilities. This work proposes a spectrally efficient irregular Sinc (irSinc) sha** technique, revisiting the traditional Sinc b… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  5. arXiv:2406.02126  [pdf, other

    eess.SY cs.AI cs.LG cs.MA

    CityLight: A Universal Model Towards Real-world City-scale Traffic Signal Control Coordination

    Authors: **wei Zeng, Chao Yu, Xinyi Yang, Wenxuan Ao, Jian Yuan, Yong Li, Yu Wang, Huazhong Yang

    Abstract: Traffic signal control (TSC) is a promising low-cost measure to enhance transportation efficiency without affecting existing road infrastructure. While various reinforcement learning-based TSC methods have been proposed and experimentally outperform conventional rule-based methods, none of them has been deployed in the real world. An essential gap lies in the oversimplification of the scenarios in… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2405.08295  [pdf, other

    cs.CL cs.SD eess.AS

    SpeechVerse: A Large-scale Generalizable Audio Language Model

    Authors: Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Large language models (LLMs) have shown incredible proficiency in performing tasks that require semantic understanding of natural language instructions. Recently, many works have further expanded this capability to perceive multimodal audio and text inputs, but their capabilities are often limited to specific fine-tuned tasks such as automatic speech recognition and translation. We therefore devel… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Single Column, 13 page

  7. arXiv:2405.08288  [pdf, other

    eess.SP

    Orthogonal Delay-Doppler Division Multiplexing Modulation with Tomlinson-Harashima Precoding

    Authors: Yiyan Ma, Akram Shafie, **hong Yuan, Guoyu Ma, Zhangdui Zhong, Bo Ai

    Abstract: The orthogonal delay-Doppler (DD) division multiplexing(ODDM) modulation has been recently proposed as a promising modulation scheme for next-generation communication systems with high mobility. Despite its benefits, ODDM modulation and other DD domain modulation schemes face the challenge of excessive equalization complexity. To address this challenge, we propose time domain Tomlinson-Harashima p… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  8. arXiv:2405.07547  [pdf, other

    cs.IT eess.SP

    Channel Coding Toward 6G: Technical Overview and Outlook

    Authors: Mohammad Rowshan, Min Qiu, Yixuan Xie, Xinyi Gu, **hong Yuan

    Abstract: Channel coding plays a pivotal role in ensuring reliable communication over wireless channels. With the growing need for ultra-reliable communication in emerging wireless use cases, the significance of channel coding has amplified. Furthermore, minimizing decoding latency is crucial for critical-mission applications, while optimizing energy efficiency is paramount for mobile and the Internet of Th… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 102 pages, 87 figures, IEEE Open Journal of the Communications Society (invited paper)

  9. arXiv:2404.16253  [pdf, other

    eess.SP

    Mitigating Automotive Radar Interference using Onboard Intelligent Reflective Surface

    Authors: Shree Prasad Maruthi, Karrthik G. K., Vijaya Krishna A., Mahbub Hassan, **hong Yuan

    Abstract: The use of automotive radars is gaining popularity as a means to enhance a vehicle's sensing capabilities. However, these radars can suffer from interference caused by transmissions from other radars mounted on nearby vehicles. To address this issue, we investigate the use of an onboard intelligent reflective surface (IRS) to artificially increase a vehicle's effective radar cross section (RCS), o… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 7 pages, 9 Figures

  10. arXiv:2403.14192  [pdf, ps, other

    cs.IT eess.SP

    Fundamentals of Delay-Doppler Communications: Practical Implementation and Extensions to OTFS

    Authors: Shuangyang Li, Peter Jung, Weijie Yuan, Zhiqiang Wei, **hong Yuan, Baoming Bai, Giuseppe Caire

    Abstract: The recently proposed orthogonal time frequency space (OTFS) modulation, which is a typical Delay-Doppler (DD) communication scheme, has attracted significant attention thanks to its appealing performance over doubly-selective channels. In this paper, we present the fundamentals of general DD communications from the viewpoint of the Zak transform. We start our study by constructing DD domain basis… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  11. arXiv:2403.10323  [pdf, ps, other

    eess.SP

    Joint Optimization for Achieving Covertness in MIMO Over-the-Air Computation Networks

    Authors: Junteng Yao, Tuo Wu, Ming **, Cunhua Pan, Quanzhong Li, **hong Yuan

    Abstract: This paper investigates covert data transmission within a multiple-input multiple-output (MIMO) over-the-air computation (AirComp) network, where sensors transmit data to the access point (AP) while guaranteeing covertness to the warden (Willie). Simultaneously, the AP introduces artificial noise (AN) to confuse Willie, meeting the covert requirement. We address the challenge of minimizing mean-sq… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  12. arXiv:2403.02012  [pdf, other

    cs.IT eess.SP

    OTFS vs OFDM: Which is Superior in Multiuser LEO Satellite Communications

    Authors: Yu Liu, Ming Chen, Cunhua Pan, Tantao Gong, **hong Yuan, Jiangzhou Wang

    Abstract: Orthogonal time frequency space (OTFS) modulation, a delay-Doppler (DD) domain communication scheme exhibiting strong robustness against the Doppler shifts, has the potentials to be employed in LEO satellite communications. However, the performance comparison with the orthogonal frequency division multiplexing (OFDM) modulation and the resource allocation scheme for multiuser OTFS-based LEO satell… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures

  13. arXiv:2402.12127  [pdf, other

    cs.IT eess.SP

    Rate-Splitting Multiple Access for Transmissive Reconfigurable Intelligent Surface Transceiver Empowered ISAC System

    Authors: Ziwei Liu, Wen Chen, Qingqing Wu, **hong Yuan, Shanshan Zhang, Zhendong Li, Jun Li

    Abstract: In this paper, a novel transmissive reconfigurable intelligent surface (TRIS) transceiver empowered integrated sensing and communications (ISAC) system is proposed for future multi-demand terminals. To address interference management, we implement rate-splitting multiple access (RSMA), where the common stream is independently designed for the sensing service. We introduce the sensing quality of se… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  14. arXiv:2401.15164  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations

    Authors: Naresh Kumar Devulapally, Sidharth Anand, Sreyasee Das Bhattacharjee, Junsong Yuan, Yu-** Chang

    Abstract: Analyzing individual emotions during group conversation is crucial in develo** intelligent agents capable of natural human-machine interaction. While reliable emotion recognition techniques depend on different modalities (text, audio, video), the inherent heterogeneity between these modalities and the dynamic cross-modal interactions influenced by an individual's unique behavioral patterns make… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  15. arXiv:2401.11058  [pdf, ps, other

    cs.IT eess.SP

    Low Complexity Turbo SIC-MMSE Detection for Orthogonal Time Frequency Space Modulation

    Authors: Qi Li, **hong Yuan, Min Qiu, Shuangyang Li, Yixuan Xie

    Abstract: Recently, orthogonal time frequency space (OTFS) modulation has garnered considerable attention due to its robustness against doubly-selective wireless channels. In this paper, we propose a low-complexity iterative successive interference cancellation based minimum mean squared error (SIC-MMSE) detection algorithm for zero-padded OTFS (ZP-OTFS) modulation. In the proposed algorithm, signals are de… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 15 pages, 12 figures, accepted by IEEE Transactions on Communications

  16. arXiv:2401.01433  [pdf, other

    cs.IT eess.SP

    Multiple Access Techniques for Intelligent and Multi-Functional 6G: Tutorial, Survey, and Outlook

    Authors: Bruno Clerckx, Yijie Mao, Zhaohui Yang, Mingzhe Chen, Ahmed Alkhateeb, Liang Liu, Min Qiu, **hong Yuan, Vincent W. S. Wong, Juan Montojo

    Abstract: Multiple access (MA) is a crucial part of any wireless system and refers to techniques that make use of the resource dimensions to serve multiple users/devices/machines/services, ideally in the most efficient way. Given the needs of multi-functional wireless networks for integrated communications, sensing, localization, computing, coupled with the surge of machine learning / artificial intelligenc… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: submitted for publication in Proceedings of the IEEE

  17. arXiv:2311.15556  [pdf, other

    cs.CV eess.IV

    PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images

    Authors: Jiquan Yuan, Xinyan Cao, Chang** Li, Fanyi Yang, **long Lin, Xixin Cao

    Abstract: As image generation technology advances, AI-based image generation has been applied in various fields and Artificial Intelligence Generated Content (AIGC) has garnered widespread attention. However, the development of AI-based image generative models also brings new problems and challenges. A significant challenge is that AI-generated images (AIGI) may exhibit unique distortions compared to natura… ▽ More

    Submitted 29 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 18 pages

  18. arXiv:2311.13787  [pdf, other

    eess.SP

    A Fast Power Spectrum Sensing Solution for Generalized Coprime Sampling

    Authors: Kaili Jiang, Dechang Wang, Kailun Tian, Hancong Feng, Yuxin Zhao, Junyu Yuan, Bin Tang

    Abstract: The growing scarcity of spectrum resources, wideband spectrum sensing is required to process a prohibitive volume of data at a high sampling rate. For some applications, spectrum estimation only requires second-order statistics. In this case, a fast power spectrum sensing solution is proposed based on the generalized coprime sampling. By exploring the sensing vector inherent structure, the autocor… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  19. arXiv:2311.07238  [pdf, ps, other

    eess.SP

    Time-Frequency Localization Characteristics of the Delay-Doppler Plane Orthogonal Pulse

    Authors: Akram Shafie, **hong Yuan, Nan Yang, Hai Lin

    Abstract: The orthogonal delay-Doppler (DD) division multiplexing (ODDM) modulation has recently been proposed as a promising solution for ensuring reliable communications in high mobility scenarios. In this work, we investigate the time-frequency (TF) localization characteristics of the DD plane orthogonal pulse (DDOP), which is the prototype pulse of ODDM modulation. The TF localization characteristics ex… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: This paper has been submitted for publication in an IEEE Conference. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. arXiv:2311.06850  [pdf, ps, other

    eess.SP

    Coexistence of OTFS Modulation With OFDM-based Communication Systems

    Authors: Akram Shafie, **hong Yuan, Yuting Fang, Paul Fitzpatrick, Taka Sakurai

    Abstract: This study examines the coexistence of orthogonal time-frequency space (OTFS) modulation with current fourth- and fifth-generation (4G/5G) wireless communication systems that primarily use orthogonal frequency-division multiplexing (OFDM) waveforms. We first derive the input-output-relation (IOR) of OTFS when it coexists with an OFDM system while considering the impact of unequal lengths of the cy… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted for publication in IEEE Global Communications Conferences (GLOBECOM) 2023. Copyright may be transferred without notice, after which this version may no longer be accessible

  21. arXiv:2310.19087  [pdf, other

    eess.IV physics.ins-det physics.med-ph physics.optics q-bio.TO

    Transport-of-Intensity Model for Single-Mask X-ray Differential Phase Contrast Imaging

    Authors: **gcheng Yuan, Mini Das

    Abstract: X-ray phase contrast imaging holds great promise for improving the visibility of light-element materials such as soft tissues and tumors. Single-mask differential phase contrastnimaging method stands out as a simple and effective approach to yield differential phase contrast. In this work, we introduce a novel model for a single-mask phase imaging system based on the transport-of-intensity equatio… ▽ More

    Submitted 31 January, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 figures

  22. arXiv:2309.01823  [pdf

    eess.IV cs.CV

    Multi-dimension unified Swin Transformer for 3D Lesion Segmentation in Multiple Anatomical Locations

    Authors: Shaoyan Pan, Yiqiao Liu, Sarah Halek, Michal Tomaszewski, Shubing Wang, Richard Baumgartner, Jianda Yuan, Gregory Goldmacher, Antong Chen

    Abstract: In oncology research, accurate 3D segmentation of lesions from CT scans is essential for the modeling of lesion growth kinetics. However, following the RECIST criteria, radiologists routinely only delineate each lesion on the axial slice showing the largest transverse area, and delineate a small number of lesions in 3D for research purposes. As a result, we have plenty of unlabeled 3D volumes and… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  23. arXiv:2309.00928  [pdf, other

    cs.CV cs.RO eess.IV

    S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

    Authors: Xuan He, Kailun Yang, Junwei Zheng, ** Yuan, Luis M. Bergasa, Hui Zhang, Zhiyong Li

    Abstract: Recently, transformer-based methods have shown exceptional performance in monocular 3D object detection, which can predict 3D attributes from a single 2D image. These methods typically use visual and depth representations to generate query points on objects, whose quality plays a decisive role in the detection accuracy. However, current unsupervised attention mechanisms without any geometry appear… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: The source code will be made publicly available at https://github.com/mikasa3lili/S3-MonoDETR

  24. arXiv:2308.08883  [pdf, other

    cs.IT eess.SP

    Coexistence of Heterogeneous Services in the Uplink with Discrete Signaling and Treating Interference as Noise

    Authors: Min Qiu, Yu-Chih Huang, **hong Yuan

    Abstract: The problem of enabling the coexistence of heterogeneous services, e.g., different ultra-reliable low-latency communications (URLLC) services and/or enhanced mobile broadband (eMBB) services, in the uplink is studied. Each service has its own error probability and blocklength constraints and the longer transmission block suffers from heterogeneous interference. Due to the latency concern, the deco… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 7 pages, accepted for presentation at IEEE Global Communications Conference (GLOBECOM) 2023

  25. arXiv:2308.08172  [pdf, other

    eess.IV cs.CV cs.LG

    AATCT-IDS: A Benchmark Abdominal Adipose Tissue CT Image Dataset for Image Denoising, Semantic Segmentation, and Radiomics Evaluation

    Authors: Zhiyu Ma, Chen Li, Tianming Du, Le Zhang, Dechao Tang, Deguo Ma, Shanchuan Huang, Yan Liu, Yihao Sun, Zhihao Chen, ** Yuan, Qianqing Nie, Marcin Grzegorzek, Hongzan Sun

    Abstract: Methods: In this study, a benchmark \emph{Abdominal Adipose Tissue CT Image Dataset} (AATTCT-IDS) containing 300 subjects is prepared and published. AATTCT-IDS publics 13,732 raw CT slices, and the researchers individually annotate the subcutaneous and visceral adipose tissue regions of 3,213 of those slices that have the same slice distance to validate denoising methods, train semantic segmentati… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 17 pages, 7 figures

  26. arXiv:2308.07079  [pdf

    eess.SP

    Wideband Spectrum Acquisition for UAV Swarm Using the Sparse Coding Fourier Transform

    Authors: Kaili Jiang, Kailun Tian, Hancong Feng, Junyu Yuan, Bin Tang

    Abstract: As the trend towards small, safe, smart, speedy and swarm development grows, unmanned aerial vehicles (UAVs) are becoming increasingly popular for a wide range of applications. In this letter, the challenge of wideband spectrum acquisition for the UAV swarms is studied by proposing a processing method that features lower power consumption, higher compression rates, and a lower signal-to-noise rati… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  27. arXiv:2308.07077  [pdf

    eess.SP

    Distributed UAV Swarm Augmented Wideband Spectrum Sensing Using Nyquist Folding Receiver

    Authors: Kaili Jiang, Kailun Tian, Hancong Feng, Yuxin Zhao, Dechang Wang, Sen Cao, Jian Gao, Xuying Zhang, Yanfei Li, Junyu Yuan, Ying Xiong, Bin Tang

    Abstract: Distributed unmanned aerial vehicle (UAV) swarms are formed by multiple UAVs with increased portability, higher levels of sensing capabilities, and more powerful autonomy. These features make them attractive for many recent applica-tions, potentially increasing the shortage of spectrum resources. In this paper, wideband spectrum sensing augmented technology is discussed for distributed UAV swarms… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  28. arXiv:2308.07075  [pdf, other

    eess.SP

    Wideband Power Spectrum Sensing: a Fast Practical Solution for Nyquist Folding Receiver

    Authors: Kaili Jiang, Dechang Wang, Kailun Tian, Hancong Feng, Yuxin Zhao, Sen Cao, Jian Gao, Xuying Zhang, Yanfei Li, Junyu Yuan, Ying Xiong, Bin Tang

    Abstract: The limited availability of spectrum resources has been growing into a critical problem in wireless communications, remote sensing, and electronic surveillance, etc. To address the high-speed sampling bottleneck of wideband spectrum sensing, a fast and practical solution of power spectrum estimation for Nyquist folding receiver (NYFR) is proposed in this paper. The NYFR architectures is can theore… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  29. arXiv:2308.01802  [pdf, ps, other

    cs.IT eess.SP

    Multi-Carrier Modulation: An Evolution from Time-Frequency Domain to Delay-Doppler Domain

    Authors: Hai Lin, **hong Yuan, Wei Yu, **gxian Wu, Lajos Hanzo

    Abstract: The recently proposed orthogonal delay-Doppler division multiplexing (ODDM) modulation, which is based on the new delay-Doppler (DD) domain orthogonal pulse (DDOP), is studied. A substantial benefit of the DDOP-based ODDM or general delay-Doppler domain multi-carrier (DDMC) modulation is that it achieves orthogonality with respect to the fine time and frequency resolutions of the DD domain. We fir… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: This paper has been submitted to the IEEE for possible publication. The supplementary material of this work will be posted at https://www.omu.ac.jp/eng/ees-sic/oddm/

  30. arXiv:2308.01147  [pdf, other

    cs.CV cs.MM eess.IV

    Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation

    Authors: Guo** Zhong, ** Yuan, Pan Wang, Kailun Yang, Weili Guan, Zhiyong Li

    Abstract: The recently rising markup-to-image generation poses greater challenges as compared to natural image generation, due to its low tolerance for errors as well as the complex sequence and context correlations between markup and rendered image. This paper proposes a novel model named "Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment" (FSA-CDM), which introduces contrastive posit… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM MM 2023. The code will be released at https://github.com/zgj77/FSACDM

  31. arXiv:2307.06605  [pdf, ps, other

    eess.SP cs.IT

    Intelligent Omni Surfaces assisted Integrated Multi Target Sensing and Multi User MIMO Communications

    Authors: Ziheng Zhang, Wen Chen, Qingqing Wu, Zhendong Li, Xusheng Zhu, **hong Yuan

    Abstract: Drawing inspiration from the advantages of intelligent reflecting surfaces (IRS) in wireless networks,this paper presents a novel design for intelligent omni surface (IOS) enabled integrated sensing and communications (ISAC). By harnessing the power of multi antennas and a multitude of elements, the dual-function base station (BS) and IOS collaborate to realize joint active and passive beamforming… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 30 pages, 7 figures

  32. arXiv:2306.17451  [pdf, ps, other

    cs.IT eess.SP

    Self-Connected Spatially Coupled LDPC Codes with Improved Termination

    Authors: Yihuan Liao, Min Qiu, **hong Yuan

    Abstract: This paper investigates the design of self-connected spatially coupled low-density parity-check (SC-LDPC) codes. First, a termination method is proposed to reduce rate loss. Particularly, a single-side open SC-LDPC ensemble is introduced, which halves the rate loss of a conventional terminated SC-LDPC by reducing the number of check nodes. We further propose a self-connection method that allows re… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 6 pages, 8 figures, accepted for publication in IEEE Communications Letters

  33. arXiv:2306.08704  [pdf, ps, other

    cs.IT eess.SP

    On the Pulse Sha** for Delay-Doppler Communications

    Authors: Shuangyang Li, Weijie Yuan, Zhiqiang Wei, **hong Yuan, Baoming Bai, Giuseppe Caire

    Abstract: In this paper, we study the pulse sha** for delay-Doppler (DD) communications. We start with constructing a basis function in the DD domain following the properties of the Zak transform. Particularly, we show that the constructed basis functions are globally quasi-periodic while locally twisted-shifted, and their significance in time and frequency domains are then revealed. We further analyze th… ▽ More

    Submitted 21 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

  34. arXiv:2306.06656  [pdf, other

    cs.CV cs.RO eess.IV

    VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation

    Authors: Xu Zhang, Kailun Yang, Jiacheng Lin, ** Yuan, Zhiyong Li, Shutao Li

    Abstract: The integration of diverse visual prompts like clicks, scribbles, and boxes in interactive image segmentation could significantly facilitate user interaction as well as improve interaction efficiency. Most existing studies focus on a single type of visual prompt by simply concatenating prompts and images as input for segmentation prediction, which suffers from low-efficiency prompt representation… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Code will be made publicly available at https://github.com/XuZhang1211/VPUFormer

  35. arXiv:2306.01304  [pdf, other

    cs.SD cs.IR cs.MM eess.AS

    JEPOO: Highly Accurate Joint Estimation of Pitch, Onset and Offset for Music Information Retrieval

    Authors: Haojie Wei, Jun Yuan, Rui Zhang, Yueguo Chen, Gang Wang

    Abstract: Melody extraction is a core task in music information retrieval, and the estimation of pitch, onset and offset are key sub-tasks in melody extraction. Existing methods have limited accuracy, and work for only one type of data, either single-pitch or multipitch. In this paper, we propose a highly accurate method for joint estimation of pitch, onset and offset, named JEPOO. We address the challenges… ▽ More

    Submitted 7 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted by IJCAI 2023; 11 pages, 6 figures

  36. arXiv:2305.14892  [pdf, ps, other

    cs.IT eess.SP

    Segmented GRAND: Combining Sub-patterns in Near-ML Order

    Authors: Mohammad Rowshan, **hong Yuan

    Abstract: The recently introduced maximum-likelihood (ML) decoding scheme called guessing random additive noise decoding (GRAND) has demonstrated a remarkably low time complexity in high signal-to-noise ratio (SNR) regimes. However, the complexity is not as low at low SNR regimes and low code rates. To mitigate this concern, we propose a scheme for a near-ML variant of GRAND called ordered reliability bits… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  37. arXiv:2305.07270  [pdf, other

    cs.CV cs.RO eess.IV

    SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection

    Authors: Xuan He, Fan Yang, Kailun Yang, Jiacheng Lin, Haolong Fu, Meng Wang, ** Yuan, Zhiyong Li

    Abstract: Transformer-based methods have demonstrated superior performance for monocular 3D object detection recently, which aims at predicting 3D attributes from a single 2D image. Most existing transformer-based methods leverage both visual and depth representations to explore valuable query points on objects, and the quality of the learned query points has a great impact on detection accuracy. Unfortunat… ▽ More

    Submitted 1 September, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Intelligent Vehicles (T-IV). Code will be made publicly available at https://github.com/mikasa3lili/SSD-MonoDETR

  38. arXiv:2305.02599  [pdf, ps, other

    eess.SP

    Transmissive Reconfigurable Intelligent Surface Transmitter Empowered Cognitive RSMA Networks

    Authors: Ziwei Liu, Wen Chen, Zhendong Li, **hong Yuan, Qingqing Wu, Kunlun Wang

    Abstract: In this paper, we investigated the downlink transmission problem of a cognitive radio network (CRN) equipped with a novel transmissive reconfigurable intelligent surface (TRIS) transmitter. In order to achieve low power consumption and high-rate multi-streams communication, time-modulated arrays (TMA) is implemented and users access the network using rate splitting multiple access (RSMA). With suc… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: IEEE Communications Letters

  39. arXiv:2305.01360  [pdf, other

    eess.IV cs.CV

    Self-supervised arbitrary scale super-resolution framework for anisotropic MRI

    Authors: Haonan Zhang, Yuhan Zhang, Qing Wu, Jiangjie Wu, Zhiming Zhen, Feng Shi, Jianmin Yuan, Hongjiang Wei, Chen Liu, Yuyao Zhang

    Abstract: In this paper, we propose an efficient self-supervised arbitrary-scale super-resolution (SR) framework to reconstruct isotropic magnetic resonance (MR) images from anisotropic MRI inputs without involving external training data. The proposed framework builds a training dataset using in-the-wild anisotropic MR volumes with arbitrary image resolution. We then formulate the 3D volume SR task as a SR… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 10 pages, 5 figures

  40. arXiv:2304.07802  [pdf, other

    eess.SP

    Reconfigurable Intelligent Surface-Enabled Gridless DoA Estimation System for NLoS Scenarios

    Authors: Jiawen Yuan, Shaodan Ma, Gong Zhang, Henry Leung

    Abstract: The conventional direction-of-arrival (DoA) estimation approaches only be effective when the line-of-sight (LoS) link exists, while in the case of the non-line-of-sight (NLoS) situation, the spatial angle can not be captured and thus the DoA estimation performance would be significantly degraded. To address this challenge, a novel reconfigurable intelligent surface (RIS)- enabled gridless DoA esti… ▽ More

    Submitted 7 November, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

  41. arXiv:2303.12360  [pdf

    cs.CV eess.IV

    Automatically Predict Material Properties with Microscopic Image Example Polymer Compatibility

    Authors: Zhilong Liang, Zhenzhi Tan, Ruixin Hong, Wanli Ouyang, **ying Yuan, Changshui Zhang

    Abstract: Many material properties are manifested in the morphological appearance and characterized with microscopic image, such as scanning electron microscopy (SEM). Polymer miscibility is a key physical quantity of polymer material and commonly and intuitively judged by SEM images. However, human observation and judgement for the images is time-consuming, labor-intensive and hard to be quantified. Comput… ▽ More

    Submitted 3 August, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  42. arXiv:2303.10727  [pdf, other

    cs.LG cs.SD eess.AS

    ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement

    Authors: Chaojian Li, Wenwan Chen, Jiayi Yuan, Yingyan Lin, Ashutosh Sabharwal

    Abstract: Social ambiance describes the context in which social interactions happen, and can be measured using speech audio by counting the number of concurrent speakers. This measurement has enabled various mental health tracking and human-centric IoT applications. While on-device Socal Ambiance Measure (SAM) is highly desirable to ensure user privacy and thus facilitate wide adoption of the aforementioned… ▽ More

    Submitted 24 March, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP'23

  43. arXiv:2303.07626  [pdf, other

    cs.SD cs.MM eess.AS

    CAT: Causal Audio Transformer for Audio Classification

    Authors: Xiaoyu Liu, Hanlin Lu, Jianbo Yuan, Xinyu Li

    Abstract: The attention-based Transformers have been increasingly applied to audio classification because of their global receptive field and ability to handle long-term dependency. However, the existing frameworks which are mainly extended from the Vision Transformers are not perfectly compatible with audio signals. In this paper, we introduce a Causal Audio Transformer (CAT) consisting of a Multi-Resoluti… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023

  44. arXiv:2303.06324  [pdf, other

    cs.DC eess.SY

    OCCL: a Deadlock-free Library for GPU Collective Communication

    Authors: Lichen Pan, Juncheng Liu, **hui Yuan, Rongkai Zhang, Pengze Li, Zhen Xiao

    Abstract: Various distributed deep neural network (DNN) training technologies lead to increasingly complicated use of collective communications on GPU. The deadlock-prone collectives on GPU force researchers to guarantee that collectives are enqueued in a consistent order on each GPU to prevent deadlocks. In complex distributed DNN training scenarios, manual hardcoding is the only practical way for deadlock… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  45. EDMAE: An Efficient Decoupled Masked Autoencoder for Standard View Identification in Pediatric Echocardiography

    Authors: Yiman Liu, Xiaoxiang Han, Tongtong Liang, Bin Dong, Jiajun Yuan, Menghan Hu, Qiaohong Liu, Jiangang Chen, Qingli Li, Yuqi Zhang

    Abstract: This paper introduces the Efficient Decoupled Masked Autoencoder (EDMAE), a novel self-supervised method for recognizing standard views in pediatric echocardiography. EDMAE introduces a new proxy task based on the encoder-decoder structure. The EDMAE encoder is composed of a teacher and a student encoder. The teacher encoder extracts the potential representation of the masked image blocks, while t… ▽ More

    Submitted 3 August, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 15 pages, 5 figures, 8 tables, Published in Biomedical Signal Processing and Control

    Journal ref: Biomedical Signal Processing and Control 86 (2023) 105280

  46. arXiv:2301.09303  [pdf, other

    cs.IT eess.SP

    Downlink Transmission under Heterogeneous Blocklength Constraints: Discrete Signaling with Single-User Decoding

    Authors: Min Qiu, Yu-Chih Huang, **hong Yuan

    Abstract: In this paper, we consider the downlink broadcast channel under heterogenous blocklength constraints, where each user experiences different interference statistics across its received symbols. Different from the homogeneous blocklength case, the strong users with short blocklength transmitted symbol blocks usually cannot wait to receive the entire transmission frame and perform successive interfer… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: 7 pages, 1 figure, accepted for presentation at IEEE ICC 2023. arXiv admin note: substantial text overlap with arXiv:2212.01736

  47. On Delay-Doppler Plane Orthogonal Pulse

    Authors: Hai Lin, **hong Yuan

    Abstract: In this paper, we analyze the recently discovered delay-Doppler plane orthogonal pulse (DDOP), which is essential for delay-Doppler plane multi-carrier modulation waveform. In particular, we introduce a local orthogonality property of pulses corresponding to Weyl-Heisenberg (WH) subset and justify the DDOP's existence, in contrast to global orthogonality corresponding to WH set governed by the WH… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: This paper was presented at the IEEE GLOBECOM 2022

  48. arXiv:2212.13059  [pdf

    eess.IV cs.CV

    OMSN and FAROS: OCTA Microstructure Segmentation Network and Fully Annotated Retinal OCTA Segmentation Dataset

    Authors: Peng Xiao, Xiaodong Hu, Ke Ma, Gengyuan Wang, Ziqing Feng, Yuancong Huang, ** Yuan

    Abstract: The lack of efficient segmentation methods and fully-labeled datasets limits the comprehensive assessment of optical coherence tomography angiography (OCTA) microstructures like retinal vessel network (RVN) and foveal avascular zone (FAZ), which are of great value in ophthalmic and systematic diseases evaluation. Here, we introduce an innovative OCTA microstructure segmentation network (OMSN) by c… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: 10 pages, 6 figures, submitted to IEEE Transactions on Medical Imaging (TMI)

  49. arXiv:2212.11594  [pdf, other

    eess.SP

    Electromagnetic Based Communication Model for Dynamic Metasurface Antennas

    Authors: Robin Jess Williams, Pablo Ramirez-Espinosa, Jide Yuan, Elisabeth De Carvalho

    Abstract: Dynamic metasurface antennas (DMAs) arise as a promising technology in the field of massive multiple-input multiple-output (mMIMO) systems, offering the possibility of integrating a large number of antennas in a limited -- and potentially large -- aperture while kee** the required number of radio-frequency (RF) chains under control. Although envisioned as practical realizations of mMIMO systems,… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  50. arXiv:2212.10390  [pdf, other

    cs.CV cs.LG eess.IV

    UniDA3D: Unified Domain Adaptive 3D Semantic Segmentation Pipeline

    Authors: Ben Fei, Siyuan Huang, Jiakang Yuan, Botian Shi, Bo Zhang, Weidong Yang, Min Dou, Yikang Li

    Abstract: State-of-the-art 3D semantic segmentation models are trained on off-the-shelf public benchmarks, but they will inevitably face the challenge of recognition accuracy drop when these well-trained models are deployed to a new domain. In this paper, we introduce a Unified Domain Adaptive 3D semantic segmentation pipeline (UniDA3D) to enhance the weak generalization ability, and bridge the point distri… ▽ More

    Submitted 12 March, 2023; v1 submitted 20 December, 2022; originally announced December 2022.