Skip to main content

Showing 1–50 of 69 results for author: Guo, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19246  [pdf, other

    eess.SP

    An Interpretable and Efficient Sleep Staging Algorithm: DetectsleepNet

    Authors: Shengwei Guo

    Abstract: Sleep quality directly impacts human health and quality of life, so accurate sleep staging is essential for assessing sleep quality. However, most traditional methods are inefficient and time-consuming due to segmenting different sleep cycles by manual labeling. In contrast, automated sleep staging technology not only directly assesses sleep quality but also helps sleep specialists analyze sleep s… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 25 pages, 11 figures

  2. arXiv:2406.16878  [pdf, ps, other

    eess.SP cs.AI cs.IT

    Benchmarking Semantic Communications for Image Transmission Over MIMO Interference Channels

    Authors: Yanhu Wang, Shuaishuai Guo, Anming Dong, Hui Zhao

    Abstract: Semantic communications offer promising prospects for enhancing data transmission efficiency. However, existing schemes have predominantly concentrated on point-to-point transmissions. In this paper, we aim to investigate the validity of this claim in interference scenarios compared to baseline approaches. Specifically, our focus is on general multiple-input multiple-output (MIMO) interference cha… ▽ More

    Submitted 10 April, 2024; originally announced June 2024.

  3. arXiv:2406.14067  [pdf

    physics.optics eess.SP

    A microwave photonic prototype for concurrent radar detection and spectrum sensing over an 8 to 40 GHz bandwidth

    Authors: Taixia Shi, Dingding Liang, Lu Wang, Lin Li, Shaogang Guo, Jiawei Gao, Xiaowei Li, Chulun Lin, Lei Shi, Baogang Ding, Shiyang Liu, Fangyi Yang, Chi Jiang, Yang Chen

    Abstract: In this work, a microwave photonic prototype for concurrent radar detection and spectrum sensing is proposed, designed, built, and investigated. A direct digital synthesizer and an analog electronic circuit are integrated to generate an intermediate frequency (IF) linearly frequency-modulated (LFM) signal with a tunable center frequency from 2.5 to 9.5 GHz and an instantaneous bandwidth of 1 GHz.… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 18 pages, 12 figures, 1 table

  4. arXiv:2406.06937  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation

    Authors: Zhengrui Ma, Qingkai Fang, Shaolei Zhang, Shoutao Guo, Yang Feng, Min Zhang

    Abstract: Simultaneous translation models play a crucial role in facilitating communication. However, existing research primarily focuses on text-to-text or speech-to-text models, necessitating additional cascade components to achieve speech-to-speech translation. These pipeline methods suffer from error propagation and accumulate delays in each cascade component, resulting in reduced synchronization betwee… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL 2024; Codes and demos are at https://github.com/ictnlp/NAST-S2x

  5. arXiv:2406.03049  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning

    Authors: Shaolei Zhang, Qingkai Fang, Shoutao Guo, Zhengrui Ma, Min Zhang, Yang Feng

    Abstract: Simultaneous speech-to-speech translation (Simul-S2ST, a.k.a streaming speech translation) outputs target speech while receiving streaming speech inputs, which is critical for real-time communication. Beyond accomplishing translation between speech, Simul-S2ST requires a policy to control the model to generate corresponding target speech at the opportune moment within speech inputs, thereby posing… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main conference, Project Page: https://ictnlp.github.io/StreamSpeech-site/

  6. arXiv:2405.16011  [pdf, ps, other

    eess.SP

    Semantic Importance-Aware Communications with Semantic Correction Using Large Language Models

    Authors: Shuaishuai Guo, Yanhu Wang, Jia Ye, Anbang Zhang, Kun Xu

    Abstract: Semantic communications, a promising approach for agent-human and agent-agent interactions, typically operate at a feature level, lacking true semantic understanding. This paper explores understanding-level semantic communications (ULSC), transforming visual data into human-intelligible semantic content. We employ an image caption neural network (ICNN) to derive semantic representations from visua… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  7. arXiv:2405.09552  [pdf, other

    eess.IV cs.AI cs.CV

    ODFormer: Semantic Fundus Image Segmentation Using Transformer for Optic Nerve Head Detection

    Authors: Jiayi Wang, Yi-An Mao, Xiaoyu Ma, Sicen Guo, Yuting Shao, Xiao Lv, Wenting Han, Mark Christopher, Linda M. Zangwill, Yanlong Bi, Rui Fan

    Abstract: Optic nerve head (ONH) detection has been a crucial area of study in ophthalmology for years. However, the significant discrepancy between fundus image datasets, each generated using a single type of fundus camera, poses challenges to the generalizability of ONH detection approaches developed based on semantic segmentation networks. Despite the numerous recent advancements in general-purpose seman… ▽ More

    Submitted 2 June, 2024; v1 submitted 15 April, 2024; originally announced May 2024.

  8. arXiv:2404.13905  [pdf, other

    eess.IV

    SI-FID: Only One Objective Indicator for Evaluating Stitched Images

    Authors: Xinrui Zhang, Shengwei Guo, Guobing Sun

    Abstract: Image quality evaluation accurately is vital in develo** image stitching algorithms as it directly reflects the algorithms progress. However, commonly used objective indicators always produce inconsistent and even conflicting results with subjective indicators. To enhance the consistency between objective and subjective evaluations, this paper introduces a novel indicator the Frechet Distance fo… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 17 pages, 9 figures

  9. arXiv:2404.06393  [pdf, other

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan , et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  10. arXiv:2403.16468  [pdf, ps, other

    eess.SP

    Unified Integrated Sensing and Communication Signal Design: A Sphere Packing Perspective

    Authors: Shuaishuai Guo, Kaiqian Qu

    Abstract: The design of communication signal sets is fundamentally a sphere packing problem. It aims to identify a set of M points in an N -dimensional space, with the objective of maximizing the separability of points that represent different bits.In contrast, signals used for sensing targets should ideally be asdeterministic as possible. This paper explores the inherent conflict and trade-off between comm… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: submitted to IEEE TCOM

  11. arXiv:2401.05446  [pdf, other

    eess.SP cs.AI cs.LG

    Self-supervised Learning for Electroencephalogram: A Systematic Survey

    Authors: Weining Weng, Yang Gu, Shuai Guo, Yuan Ma, Zhaohua Yang, Yuchen Liu, Yiqiang Chen

    Abstract: Electroencephalogram (EEG) is a non-invasive technique to record bioelectrical signals. Integrating supervised deep learning techniques with EEG signals has recently facilitated automatic analysis across diverse EEG-based tasks. However, the label issues of EEG signals have constrained the development of EEG-based deep models. Obtaining EEG annotations is difficult that requires domain experts to… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 35 pages, 12 figures

    MSC Class: 68-02 (Primarily); 68T01 (Secondary) ACM Class: I.2; J.3; I.5.4

  12. arXiv:2312.16247  [pdf, other

    cs.CV eess.IV

    Toward Accurate and Temporally Consistent Video Restoration from Raw Data

    Authors: Shi Guo, Jianqi Ma, Xi Yang, Zhengqiang Zhang, Lei Zhang

    Abstract: Denoising and demosaicking are two fundamental steps in reconstructing a clean full-color video from raw data, while performing video denoising and demosaicking jointly, namely VJDD, could lead to better video restoration performance than performing them separately. In addition to restoration accuracy, another key challenge to VJDD lies in the temporal consistency of consecutive frames. This issue… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  13. arXiv:2311.01812  [pdf, ps, other

    eess.SP

    Carrier Frequency Offset Estimation for OCDM with Null Subchirps

    Authors: Sidong Guo, Yiyin Wang, Xiaoli Ma

    Abstract: In this paper, we investigate the carrier frequency offset (CFO) identifiability problem in orthogonal chirp division multiplexing (OCDM) systems. We propose a transmission scheme by inserting consecutive null subchirps. A CFO estimator is accordingly developed to achieve a full acquisition range. We further demonstrate that the proposed transmission scheme not only help to resolve CFO identifia… ▽ More

    Submitted 8 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 2 fig

  14. arXiv:2310.20242  [pdf, other

    cs.NI eess.SP

    Intelligent-Reflecting-Surface-Assisted UAV Communications for 6G Networks

    Authors: Zhaolong Ning, Tengfeng Li, Yu Wu, Xiaojie Wang, Qingqing Wu, Fei Richard Yu, Song Guo

    Abstract: In 6th-Generation (6G) mobile networks, Intelligent Reflective Surfaces (IRSs) and Unmanned Aerial Vehicles (UAVs) have emerged as promising technologies to address the coverage difficulties and resource constraints faced by terrestrial networks. UAVs, with their mobility and low costs, offer diverse connectivity options for mobile users and a novel deployment paradigm for 6G networks. However, th… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  15. arXiv:2309.12688  [pdf, ps, other

    cs.IT eess.SP

    Green Holographic MIMO Communications With A Few Transmit Radio Frequency Chains

    Authors: Shuaishuai Guo, Jia Ye, Kaiqian Qu, Shu** Dang

    Abstract: Holographic multiple-input multiple-output (MIMO) communications are widely recognized as a promising candidate for the next-generation air interface. With holographic MIMO surface, the number of the spatial degrees-of-freedom (DoFs) considerably increases and also significantly varies as the user moves. To fully employ the large and varying number of spatial DoFs, the number of equipped RF chains… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 10 figures; has been accepted by TGCN

  16. arXiv:2308.10428  [pdf, other

    eess.AS cs.SD

    Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models

    Authors: Heyang Xue, Shuai Guo, Pengcheng Zhu, Mengxiao Bi

    Abstract: Despite imperfect score-matching causing drift in training and sampling distributions of diffusion models, recent advances in diffusion-based acoustic models have revolutionized data-sufficient single-speaker Text-to-Speech (TTS) approaches, with Grad-TTS being a prime example. However, the sampling drift problem leads to these approaches struggling in multi-speaker scenarios in practice due to mo… ▽ More

    Submitted 31 August, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

  17. Reconfigurable Intelligent Surface Enabled Joint Backscattering and Communication

    Authors: **qiu Zhao, Jia Ye Shuaishuai Guo, Zhiquan Bai, Di Zhou, Abeer Mohamed

    Abstract: Reconfigurable intelligent surface (RIS) as an essential topic in the sixth-generation (6G) communications aims to enhance communication performance or mitigate undesired transmission. However, the controllability of each reflecting element on RIS also enables it to act as a passive backscatter device (BD) and transmit its information to reader devices. In this paper, we propose a RIS-enabled join… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 11 pages, 8 figures, published to IEEE TVT

    Journal ref: IEEE Transactions on Vehicular Technology, 2023

  18. arXiv:2308.07342  [pdf, other

    eess.SP cs.CL

    Emergent communication for AR

    Authors: Ruxiao Chen, Shuaishuai Guo

    Abstract: Mobile augmented reality (MAR) is widely acknowledged as one of the ubiquitous interfaces to the digital twin and Metaverse, demanding unparalleled levels of latency, computational power, and energy efficiency. The existing solutions for realizing MAR combine multiple technologies like edge, cloud computing, and fifth-generation (5G) networks. However, the inherent communication latency of visual… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

  19. arXiv:2308.06455  [pdf, ps, other

    eess.SP

    Near-Field Integrated Sensing and Communication: Performance Analysis and Beamforming Design

    Authors: Kaiqian Qu, Shuaishuai Guo, Nasir Saeed

    Abstract: This paper explores the potential of near-field beamforming (NFBF) in integrated sensing and communication (ISAC) systems with extremely large-scale arrays (XL-arrays). The large-scale antenna arrays increase the possibility of having communication users and targets of interest in the near field of the base station (BS). The paper first establishes the models of electromagnetic (EM) near-field sph… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: under review

  20. arXiv:2308.00253  [pdf, ps, other

    eess.SP

    Privacy and Security in Ubiquitous Integrated Sensing and Communication: Threats, Challenges and Future Directions

    Authors: Kaiqian Qu, Jia Ye, Xuran Li, Shuaishuai Guo

    Abstract: Integrated sensing and communication (ISAC) technology is one of the featuring technologies of the next-generation communication systems. When sensing capability becomes ubiquitous, more information can be collected, which can facilitate many applications in intelligent transportation, unmanned aerial vehicle (UAV) surveillance and healthcare. However, it also faces many information privacy leakag… ▽ More

    Submitted 13 May, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: to appear in IOTMAG

  21. arXiv:2308.00252  [pdf, ps, other

    eess.SP

    Near-Field Integrated Sensing and Communications: Unlocking Potentials and Sha** the Future

    Authors: Kaiqian Qu, Shuaishuai Guo, Jia Ye, Nasir Saeed

    Abstract: The sixth generation (6G) communication networks are featured by integrated sensing and communications (ISAC), revolutionizing base stations (BSs) and terminals. Additionally, in the unfolding 6G landscape, a pivotal physical layer technology, the Extremely Large-Scale Antenna Array (ELAA), assumes center stage. With its expansive coverage of the near-field region, ELAA's electromagnetic (EM) wave… ▽ More

    Submitted 5 August, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: under review

  22. Precheck Sequence Based False Base Station Detection During Handover: A Physical Layer Security Scheme

    Authors: Xiangyu Li, Kaiwen Zheng, Sidong Guo, Xiaoli Ma

    Abstract: False Base Station (FBS) attack has been a severe security problem for the cellular network since 2G era. During handover, the user equipment (UE) periodically receives state information from surrounding base stations (BSs) and uploads it to the source BS. The source BS compares the uploaded signal power and shifts UE to another BS that can provide the strongest signal. An FBS can transmit signal… ▽ More

    Submitted 3 November, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

  23. Sensing-Aided Peer-to-Peer Millimeter-Wave Communication

    Authors: Xiangyu Li, Sidong Guo, Shez Malik

    Abstract: One of the bottlenecks of modern communications is to enable sensing and mutual communication simultaneously without causing scheduling conflicts, and how sensing may be leveraged to help directional communication accuracy. To this end, we propose and implement a novel peer-to-peer (P2P) millimeter-wave communication system to jointly achieve beamforming and sensing. A radar and IMU-assisted track… ▽ More

    Submitted 26 January, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

  24. arXiv:2306.00714  [pdf, other

    cs.CV cs.LG eess.IV

    Dissecting Arbitrary-scale Super-resolution Capability from Pre-trained Diffusion Generative Models

    Authors: Ruibin Li, Qihua Zhou, Song Guo, Jie Zhang, **gcai Guo, Xinyang Jiang, Yifei Shen, Zhenhua Han

    Abstract: Diffusion-based Generative Models (DGMs) have achieved unparalleled performance in synthesizing high-quality visual content, opening up the opportunity to improve image super-resolution (SR) tasks. Recent solutions for these tasks often train architecture-specific DGMs from scratch, or require iterative fine-tuning and distillation on pre-trained DGMs, both of which take considerable time and hard… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  25. arXiv:2305.19558  [pdf, other

    eess.SP cs.AI

    Look-Ahead Task Offloading for Multi-User Mobile Augmented Reality in Edge-Cloud Computing

    Authors: Ruxiao Chen, Shuaishuai Guo

    Abstract: Mobile augmented reality (MAR) blends a real scenario with overlaid virtual content, which has been envisioned as one of the ubiquitous interfaces to the Metaverse. Due to the limited computing power and battery life of MAR devices, it is common to offload the computation tasks to edge or cloud servers in close proximity. However, existing offloading solutions developed for MAR tasks suffer from h… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted by IEEE Network

  26. Beamspace Modulation for Near Field Capacity Improvement in XL-MIMO Communications

    Authors: Shuaishuai Guo, Kaiqian Qu

    Abstract: The spatial degrees of freedom (DoFs) greatly increase in the near-field region of millimeter wave or terahertz multiple-input multiple-output communications with extremely large antenna arrays (XL-MIMO). To employ the increased spatial DoFs, a beamspace modulation (BM) strategy is introduced to the near field of XL-MIMO. BM can work with a fixed small number of RF chains. It exploits the increase… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 5 pages, 4 figures, accepted by IEEE Wireless Communications Letters

  27. arXiv:2303.09694  [pdf, other

    cs.RO eess.SP

    Drone Formation for Efficient Swarm Energy Consumption

    Authors: Shilong Guo, Balsam Alkouz, Babar Shahzaad, Abdallah Lakhdari, Athman Bouguettaya

    Abstract: We demonstrate formation flying for drone swarm services. A set of drones fly in four different swarm formations. A dataset is collected to study the effect of formation flying on energy consumption. We conduct a set of experiments to study the effect of wind on formation flying. We examine the forces the drones exert on each other when flying in a formation. We finally identify and classify the f… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 3 pages, 7 figures. This is an accepted demo paper and it will appear in The 21st International Conference on Pervasive Computing and Communications (PerCom 2023)

  28. arXiv:2302.14763  [pdf, other

    eess.SP cs.CE cs.IT

    Vehicular Behavior-Aware Beamforming Design for Integrated Sensing and Communication Systems

    Authors: Dingyan Cong, Shuaishuai Guo, Shu** Dang, Haixia Zhang

    Abstract: Communication and sensing are two important features of connected and autonomous vehicles (CAVs). In traditional vehicle-mounted devices, communication and sensing modules exist but in an isolated way, resulting in a waste of hardware resources and wireless spectrum. In this paper, to cope with the above inefficiency, we propose a vehicular behavior-aware integrated sensing and communication (VBA-… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

  29. arXiv:2302.07142  [pdf, ps, other

    eess.SP

    Semantic Importance-Aware Communications Using Pre-trained Language Models

    Authors: Shuaishuai Guo, Yanhu Wang, Shu**g Li, Nasir Saeed

    Abstract: This letter proposes a semantic importance-aware communication (SIAC) scheme using pre-trained language models (e.g., ChatGPT, BERT, etc.). Specifically, we propose a cross-layer design with a pre-trained language model embedded in/connected by the cross-layer manager. The pre-trained language model is utilized to quantify the semantic importance of data frames. Based on the quantified semantic im… ▽ More

    Submitted 7 July, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE Communications Letters, Semantic communications, pre-trained language model, ChatGPT, BERT, data importance

  30. arXiv:2302.01972  [pdf, other

    cs.CR eess.SY math.DS math.OC physics.soc-ph

    DCA: Delayed Charging Attack on the Electric Shared Mobility System

    Authors: Shuocheng Guo, Hanlin Chen, Mizanur Rahman, Xinwu Qian

    Abstract: An efficient operation of the electric shared mobility system (ESMS) relies heavily on seamless interconnections among shared electric vehicles (SEV), electric vehicle supply equipment (EVSE), and the grid. Nevertheless, this interconnectivity also makes the ESMS vulnerable to cyberattacks that may cause short-term breakdowns or long-term degradation of the ESMS. This study focuses on one such att… ▽ More

    Submitted 13 June, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2023

  31. arXiv:2301.03133  [pdf, ps, other

    cs.NI cs.AI eess.SP

    Transceiver Cooperative Learning-aided Semantic Communications Against Mismatched Background Knowledge Bases

    Authors: Yanhu Wang, Shuaishuai Guo

    Abstract: Semantic communications learned on background knowledge bases (KBs) have been identified as a promising technology for communications between intelligent agents. Existing works assume that transceivers of semantic communications share the same KB. However, intelligent transceivers may suffer from the communication burden or worry about privacy leakage to exchange data in KBs. Besides, the transcei… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

  32. arXiv:2212.01756  [pdf, other

    eess.SY cs.RO math.DS

    Connected Cruise and Traffic Control for Pairs of Connected Automated Vehicles

    Authors: Sicong Guo, Gabor Orosz, Tamas G. Molnar

    Abstract: This paper considers mixed traffic consisting of connected automated vehicles equipped with vehicle-to-everything (V2X) connectivity and human-driven vehicles. A control strategy is proposed for communicating pairs of connected automated vehicles, where the two vehicles regulate their longitudinal motion by responding to each other, and, at the same time, stabilize the human-driven traffic between… ▽ More

    Submitted 12 June, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: Accepted to the IEEE Transactions on Intelligent Transportation Systems. 11 pages, 10 figures

  33. arXiv:2211.08428  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    CaDM: Codec-aware Diffusion Modeling for Neural-enhanced Video Streaming

    Authors: Qihua Zhou, Ruibin Li, Song Guo, Peiran Dong, Yi Liu, **gcai Guo, Zhenda Xu

    Abstract: Recent years have witnessed the dramatic growth of Internet video traffic, where the video bitstreams are often compressed and delivered in low quality to fit the streamer's uplink bandwidth. To alleviate the quality degradation, it comes the rise of Neural-enhanced Video Streaming (NVS), which shows great prospects for recovering low-quality videos by mostly deploying neural super-resolution (SR)… ▽ More

    Submitted 8 March, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  34. arXiv:2209.15156  [pdf, ps, other

    cs.IT eess.SP

    Cooperative Beamforming Design for Multiple RIS-Assisted Communication Systems

    Authors: Xiaoyan Ma, Yuguang Fang, Haixia Zhang, Shuaishuai Guo, Dongfeng Yuan

    Abstract: Reconfigurable intelligent surface (RIS) provides a promising way to build programmable wireless transmission environments. Owing to the massive number of controllable reflecting elements on the surface, RIS is capable of providing considerable passive beamforming gains. At present, most related works mainly consider the modeling, design, performance analysis and optimization of single-RIS-assiste… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  35. arXiv:2209.09138  [pdf, ps, other

    cs.IT eess.SP

    Robust Beamforming and Rate-Splitting Design for Next Generation Ultra-Reliable and Low-Latency Communications

    Authors: Tiantian Li, Haixia Zhang, Shuaishuai Guo, Dongfeng Yuan

    Abstract: The next generation ultra-reliable and low-latency communications (xURLLC) need novel design to provide satisfactory services to the emerging mission-critical applications. To improve the spectrum efficiency and enhance the robustness of xURLLC, this paper proposes a robust beamforming and rate-splitting design in the finite blocklength (FBL) regime for downlink multi-user multi-antenna xURLLC sys… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 12 pages, 9 figures

  36. arXiv:2205.13412  [pdf, other

    cs.CV cs.CR eess.IV

    Physical-World Optical Adversarial Attacks on 3D Face Recognition

    Authors: Yanjie Li, Yiquan Li, Xuelong Dai, Songtao Guo, Bin Xiao

    Abstract: 2D face recognition has been proven insecure for physical adversarial attacks. However, few studies have investigated the possibility of attacking real-world 3D face recognition systems. 3D-printed attacks recently proposed cannot generate adversarial points in the air. In this paper, we attack 3D face recognition systems through elaborate optical noises. We took structured light 3D scanners as ou… ▽ More

    Submitted 13 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Submitted to CVPR 2023

  37. arXiv:2205.04029  [pdf, other

    cs.SD cs.MM eess.AS

    Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis

    Authors: Jiatong Shi, Shuai Guo, Tao Qian, Nan Huo, Tomoki Hayashi, Yuning Wu, Frank Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe, Qin **

    Abstract: This paper introduces a new open-source platform named Muskits for end-to-end music processing, which mainly focuses on end-to-end singing voice synthesis (E2E-SVS). Muskits supports state-of-the-art SVS models, including RNN SVS, transformer SVS, and XiaoiceSing. The design of Muskits follows the style of widely-used speech processing toolkits, ESPnet and Kaldi, for data prepossessing, training,… ▽ More

    Submitted 2 July, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted by Interspeech

  38. arXiv:2203.17001  [pdf, other

    eess.AS cs.LG cs.SD

    SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy

    Authors: Shuai Guo, Jiatong Shi, Tao Qian, Shinji Watanabe, Qin **

    Abstract: Deep learning based singing voice synthesis (SVS) systems have been demonstrated to flexibly generate singing with better qualities, compared to conventional statistical parametric based methods. However, neural systems are generally data-hungry and have difficulty to reach reasonable singing quality with limited public available training data. In this work, we explore different data augmentation… ▽ More

    Submitted 6 July, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted by INTERSPEECH 2022

  39. arXiv:2203.09294  [pdf, other

    cs.CV eess.IV

    A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

    Authors: Shi Guo, Xi Yang, Jianqi Ma, Gaofeng Ren, Lei Zhang

    Abstract: Denoising and demosaicking are two essential steps to reconstruct a clean full-color image from the raw data. Recently, joint denoising and demosaicking (JDD) for burst images, namely JDD-B, has attracted much attention by using multiple raw images captured in a short time to reconstruct a single high-quality image. One key challenge of JDD-B lies in the robust alignment of image frames. State-of-… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition 2022

  40. arXiv:2202.11703  [pdf, other

    cs.CV cs.GR eess.IV

    U-Attention to Textures: Hierarchical Hourglass Vision Transformer for Universal Texture Synthesis

    Authors: Shouchang Guo, Valentin Deschaintre, Douglas Noll, Arthur Roullier

    Abstract: We present a novel U-Attention vision Transformer for universal texture synthesis. We exploit the natural long-range dependencies enabled by the attention mechanism to allow our approach to synthesize diverse textures while preserving their structures in a single inference. We propose a hierarchical hourglass backbone that attends to the global structure and performs patch map** at varying scale… ▽ More

    Submitted 30 June, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

  41. arXiv:2202.02072  [pdf, ps, other

    eess.SP

    Signal Sha** for Semantic Communication Systems with A Few Message Candidates

    Authors: Shuaishuai Guo, Yanhu Wang, Peng Zhang

    Abstract: Semantic communications target to reliably convey the semantic meaning of messages. It is different from existing communication systems focusing on reliable bit transmission. To achieve the goal of semantic communications, we propose a signal sha** method by minimizing the semantic loss, which is measured by the pretrained bidirectional encoder representation from transformers (BERT) model. The… ▽ More

    Submitted 18 August, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

  42. arXiv:2111.14916  [pdf

    eess.IV cs.NE

    High-Speed Light Focusing through Scattering Medium by Cooperatively Accelerated Genetic Algorithm

    Authors: Shu Guo, Lin Pang

    Abstract: We develop an accelerated Genetic Algorithm (GA) system constructed by the cooperation of field-programmable gate array (FPGA) and optimized parameters of the GA. We found the enhanced decay of mutation rate makes convergence of the GA much faster, enabling the parameter-induced acceleration of the GA. Furthermore, the accelerated configuration of the GA is programmed in FPGA to boost processing s… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 17 pages, 10 figures

  43. arXiv:2109.12543  [pdf, ps, other

    eess.SY

    SDN-based Resource Allocation in Edge and Cloud Computing Systems: An Evolutionary Stackelberg Differential Game Approach

    Authors: Jun Du, Chunxiao Jiang, Abderrahim Benslimane, Song Guo, Yong Ren

    Abstract: Recently, the boosting growth of computation-heavy applications raises great challenges for the Fifth Generation (5G) and future wireless networks. As responding, the hybrid edge and cloud computing (ECC) system has been expected as a promising solution to handle the increasing computational applications with low-latency and on-demand services of computation offloading, which requires new computin… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

  44. arXiv:2107.05464  [pdf, other

    cs.AI eess.SY

    IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control

    Authors: Xiaoyan Cao, Yao Yao, Lanqing Li, Wanpeng Zhang, Zhicheng An, Zhong Zhang, Li Xiao, Shihui Guo, Xiaoyu Cao, Meihong Wu, Dijun Luo

    Abstract: Agriculture is the foundation of human civilization. However, the rapid increase of the global population poses a challenge on this cornerstone by demanding more food. Modern autonomous greenhouses, equipped with sensors and actuators, provide a promising solution to the problem by empowering precise control for high-efficient food production. However, the optimal control of autonomous greenhouses… ▽ More

    Submitted 14 March, 2022; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 9 pages, 5 figures, 2 tables, accepted by AAAI 2022

  45. arXiv:2106.05458  [pdf, other

    eess.IV cs.CV

    Joint Landmark and Structure Learning for Automatic Evaluation of Developmental Dysplasia of the Hip

    Authors: Xindi Hu, Limin Wang, Xin Yang, Xu Zhou, Wufeng Xue, Yan Cao, Shengfeng Liu, Yuhao Huang, Shuang** Guo, Ning Shang, Dong Ni, Ning Gu

    Abstract: The ultrasound (US) screening of the infant hip is vital for the early diagnosis of developmental dysplasia of the hip (DDH). The US diagnosis of DDH refers to measuring alpha and beta angles that quantify hip joint development. These two angles are calculated from key anatomical landmarks and structures of the hip. However, this measurement process is not trivial for sonographers and usually requ… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted by IEEE Journal of Biomedical and Health Informatics. 14 pages, 10 figures and 10 tables

  46. arXiv:2105.08350  [pdf, other

    cs.MM cs.CR eess.IV

    Generic Reversible Visible Watermarking Via Regularized Graph Fourier Transform Coding

    Authors: Wenfa Qi, Sirui Guo, Wei Hu

    Abstract: Reversible visible watermarking (RVW) is an active copyright protection mechanism. It not only transparently superimposes copyright patterns on specific positions of digital images or video frames to declare the copyright ownership information, but also completely erases the visible watermark image and thus enables restoring the original host image without any distortion. However, existing RVW alg… ▽ More

    Submitted 26 November, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: This manuscript is accepted to IEEE Transactions on Image Processing on November 21th 2021. It has 15 pages, 12 figures and 4 tables

  47. arXiv:2104.08395  [pdf, other

    eess.IV eess.SP physics.med-ph

    Manifold Model for High-Resolution fMRI Joint Reconstruction and Dynamic Quantification

    Authors: Shouchang Guo, Jeffrey A. Fessler, Douglas C. Noll

    Abstract: Oscillating Steady-State Imaging (OSSI) is a recent fMRI acquisition method that exploits a large and oscillating signal, and can provide high SNR fMRI. However, the oscillatory nature of the signal leads to an increased number of acquisitions. To improve temporal resolution and accurately model the nonlinearity of OSSI signals, we build the MR physics for OSSI signal generation as a regularizer f… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  48. arXiv:2103.10651  [pdf, other

    cs.CR cs.LG cs.SD eess.AS

    SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems

    Authors: Yuxuan Chen, Jiangshan Zhang, Xue**g Yuan, Shengzhi Zhang, Kai Chen, Xiaofeng Wang, Shanqing Guo

    Abstract: With the wide use of Automatic Speech Recognition (ASR) in applications such as human machine interaction, simultaneous interpretation, audio transcription, etc., its security protection becomes increasingly important. Although recent studies have brought to light the weaknesses of popular ASR systems that enable out-of-band signal attack, adversarial attack, etc., and further proposed various rem… ▽ More

    Submitted 30 July, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: 17 pages

  49. arXiv:2103.02813  [pdf, other

    eess.IV cs.CV

    PET Image Reconstruction with Multiple Kernels and Multiple Kernel Space Regularizers

    Authors: Shiyao Guo, Yuxia Sheng, Shenpeng Li, Li Chai, **gxin Zhang

    Abstract: Kernelized maximum-likelihood (ML) expectation maximization (EM) methods have recently gained prominence in PET image reconstruction, outperforming many previous state-of-the-art methods. But they are not immune to the problems of non-kernelized MLEM methods in potentially large reconstruction error and high sensitivity to iteration number. This paper demonstrates these problems by theoretical rea… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 21 pages, 9 figures

  50. Joint Denoising and Demosaicking with Green Channel Prior for Real-world Burst Images

    Authors: Shi Guo, Zhetong Liang, Lei Zhang

    Abstract: Denoising and demosaicking are essential yet correlated steps to reconstruct a full color image from the raw color filter array (CFA) data. By learning a deep convolutional neural network (CNN), significant progress has been achieved to perform denoising and demosaicking jointly. However, most existing CNN-based joint denoising and demosaicking (JDD) methods work on a single image while assuming a… ▽ More

    Submitted 24 January, 2021; originally announced January 2021.