Showing 1–50 of 81 results for author: Xiong, Z

Searching in archive eess.
  1. arXiv:2406.16083  [pdf, other]

    eess.IV cs.CV

    Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning

    Authors: Ruisheng Gao, Zeyu Xiao, Zhiwei Xiong

    Abstract: Transformer-based methods have demonstrated impressive performance in 4D light field (LF) super-resolution by effectively modeling long-range spatial-angular correlations, but their quadratic complexity hinders the efficient processing of high resolution 4D inputs, resulting in slow inference speed and high memory cost. As a compromise, most prior work adopts a patch-based strategy, which fails to…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 17 pages, 7 figures

  2. arXiv:2406.12300  [pdf]

    eess.IV cs.CV q-bio.NC

    IR2QSM: Quantitative Susceptibility Mapping via Deep Neural Networks with Iterative Reverse Concatenations and Recurrent Modules

    Authors: Min Li, Chen Chen, Zhuang Xiong, Ying Liu, Pengfei Rong, Shanshan Shan, Feng Liu, Hongfu Sun, Yang Gao

    Abstract: Quantitative susceptibility mapping (QSM) is an MRI phase-based post-processing technique to extract the distribution of tissue susceptibilities, demonstrating significant potential in studying neurological diseases. However, the ill-conditioned nature of dipole inversion makes QSM reconstruction from the tissue field prone to noise and artifacts. In this work, we propose a novel deep learning-bas…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 9 figures

  3. arXiv:2406.04111  [pdf, other]

    cs.CV eess.IV

    UrbanSARFloods: Sentinel-1 SLC-Based Benchmark Dataset for Urban and Open-Area Flood Mapping

    Authors: Jie Zhao, Zhitong Xiong, Xiao Xiang Zhu

    Abstract: Due to its cloud-penetrating capability and independence from solar illumination, satellite Synthetic Aperture Radar (SAR) is the preferred data source for large-scale flood mapping, providing global coverage and including various land cover classes. However, most studies on large-scale SAR-derived flood mapping using deep learning algorithms have primarily focused on flooded open areas, utilizing…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR 2024 EarthVision Workshop

  4. arXiv:2405.16850  [pdf, other]

    eess.IV cs.CV cs.LG

    UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation

    Authors: Runzhao Yang, Yinda Chen, Zhihong Zhang, Xiaoyu Liu, Zongren Li, Kunlun He, Zhiwei Xiong, **li Suo, Qionghai Dai

    Abstract: In the field of medical image compression, Implicit Neural Representation (INR) networks have shown remarkable versatility due to their flexible compression ratios, yet they are constrained by a one-to-one fitting approach that results in lengthy encoding times. Our novel method, "UniCompress", innovatively extends the compression capabilities of INR by being the first to compress multi…

    Submitted 27 May, 2024; originally announced May 2024.

  5. arXiv:2405.04285  [pdf, other]

    cs.AI eess.SP

    On the Foundations of Earth and Climate Foundation Models

    Authors: Xiao Xiang Zhu, Zhitong Xiong, Yi Wang, Adam J. Stewart, Konrad Heidler, Yuanyuan Wang, Zhenghang Yuan, Thomas Dujardin, Qingsong Xu, Yilei Shi

    Abstract: Foundation models have enormous potential in advancing Earth and climate sciences; however, current approaches may not be optimal as they focus on a few basic features of a desirable Earth and climate foundation model. Crafting the ideal Earth foundation model, we define eleven features which would allow such a foundation model to be beneficial for any geoscientific downstream application in an en…

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2405.04012  [pdf, other]

    eess.SY

    Latency and Energy Minimization in NOMA-Assisted MEC Network: A Federated Deep Reinforcement Learning Approach

    Authors: Arian Ahmadi, Anders Høst-Madsen, Zixiang Xiong

    Abstract: Multi-access edge computing (MEC) is seen as a vital component of forthcoming 6G wireless networks, aiming to support emerging applications that demand high service reliability and low latency. However, ensuring the ultra-reliable and low-latency performance of MEC networks poses a significant challenge due to uncertainties associated with wireless links, constraints imposed by communication and c…

    Submitted 7 May, 2024; originally announced May 2024.

  7. arXiv:2404.19415  [pdf, other]

    eess.SY math.OC

    Two-Stage Robust Planning Model for Park-Level Integrated Energy System Considering Uncertain Equipment Contingency

    Authors: Zuxun Xiong, Xinwei Shen, Hongbin Sun

    Abstract: In this paper, we propose a two-stage robust planning model for an Integrated Energy System (IES) that serves an industrial park. The term 'Park-level IES' is used to refer to an IES of a smaller scale but with high demands for various forms of energy. The proposed planning model considers uncertainties like load demand fluctuations and equipment contingencies, and provides a reliable scheme of equi…

    Submitted 30 April, 2024; originally announced April 2024.

  8. arXiv:2404.14140  [pdf, other]

    eess.SP

    Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments

    Authors: Jiacheng Wang, Hongyang Du, Dusit Niyato, Zehui Xiong, Jiawen Kang, Bo Ai, Zhu Han, Dong In Kim

    Abstract: Groundbreaking applications such as ChatGPT have heightened research interest in generative artificial intelligence (GAI). Essentially, GAI excels not only in content generation but also in signal processing, offering support for wireless sensing. Hence, we introduce a novel GAI-assisted human flow detection system (G-HFD). Rigorously, G-HFD first uses channel state information (CSI) to estimate t…

    Submitted 22 April, 2024; originally announced April 2024.

  9. arXiv:2404.06765  [pdf, other]

    eess.SP

    Harnessing the Power of AI-Generated Content for Semantic Communication

    Authors: Yiru Wang, Wanting Yang, Zehui Xiong, Yu** Zhao, Tony Q. S. Quek, Zhu Han

    Abstract: Semantic Communication (SemCom) is envisaged as the next-generation paradigm to address challenges stemming from the conflicts between the increasing volume of transmission data and the scarcity of spectrum resources. However, existing SemCom systems face drawbacks, such as low explainability, modality rigidity, and inadequate reconstruction functionality. Recognizing the transformative capabiliti…

    Submitted 10 April, 2024; originally announced April 2024.

  10. arXiv:2403.14070  [pdf]

    eess.IV cs.CV physics.med-ph

    QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping

    Authors: Zhuang Xiong, Wei Jiang, Yang Gao, Feng Liu, Hongfu Sun

    Abstract: Quantitative Susceptibility Mapping (QSM) dipole inversion is an ill-posed inverse problem for quantifying magnetic susceptibility distributions from MRI tissue phases. While supervised deep learning methods have shown success in specific QSM tasks, their generalizability across different acquisition scenarios remains constrained. Recent developments in diffusion models have demonstrated potential…

    Submitted 20 March, 2024; originally announced March 2024.

  11. arXiv:2403.05826  [pdf, other]

    cs.NI eess.SP

    Cached Model-as-a-Resource: Provisioning Large Language Model Agents for Edge Intelligence in Space-air-ground Integrated Networks

    Authors: Minrui Xu, Dusit Niyato, Hongliang Zhang, Jiawen Kang, Zehui Xiong, Shiwen Mao, Zhu Han

    Abstract: Edge intelligence in space-air-ground integrated networks (SAGINs) can enable worldwide network coverage beyond geographical limitations for users to access ubiquitous and low-latency intelligence services. Facing global coverage and complex environments in SAGINs, edge intelligence can provision approximate large language model (LLM) agents for users via edge servers at ground base stations (BS…

    Submitted 31 May, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  12. arXiv:2402.19470  [pdf, other]

    eess.IV cs.CV

    Towards Generalizable Tumor Synthesis

    Authors: Qi Chen, Xiaoxi Chen, Haorui Song, Zhiwei Xiong, Alan Yuille, Chen Wei, Zongwei Zhou

    Abstract: Tumor synthesis enables the creation of artificial tumors in medical images, facilitating the training of AI models for tumor detection and segmentation. However, success in tumor synthesis hinges on creating visually realistic tumors that are generalizable across multiple organs and, furthermore, the resulting AI models being capable of detecting real tumors in images sourced from different domai…

    Submitted 28 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2024)

  13. arXiv:2402.09756  [pdf, other]

    cs.NI eess.SP

    Mixture of Experts for Network Optimization: A Large Language Model-enabled Approach

    Authors: Hongyang Du, Guangyuan Liu, Yijing Lin, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim

    Abstract: Optimizing various wireless user tasks poses a significant challenge for networking systems because of the expanding range of user requirements. Despite advancements in Deep Reinforcement Learning (DRL), the need for customized optimization tasks for individual users complicates developing and applying numerous DRL models, leading to substantial computation resource and energy consumption and can…

    Submitted 15 February, 2024; originally announced February 2024.

  14. arXiv:2402.02411  [pdf, other]

    eess.IV cs.CV

    Physics-Inspired Degradation Models for Hyperspectral Image Fusion

    Authors: Jie Lian, Lizhi Wang, Lin Zhu, Renwei Dian, Zhiwei Xiong, Hua Huang

    Abstract: The fusion of a low-spatial-resolution hyperspectral image (LR-HSI) with a high-spatial-resolution multispectral image (HR-MSI) has garnered increasing research interest. However, most fusion methods solely focus on the fusion algorithm itself and overlook the degradation models, which results in unsatisfactory performance in practical scenarios. To fill this gap, we propose physics-inspired degra…

    Submitted 4 February, 2024; originally announced February 2024.

  15. arXiv:2401.07120  [pdf, other]

    cs.NI eess.SP quant-ph

    Generative AI-enabled Quantum Computing Networks and Intelligent Resource Allocation

    Authors: Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Yuan Cao, Yulan Gao, Chao Ren, Han Yu

    Abstract: Quantum computing networks enable scalable collaboration and secure information exchange among multiple classical and quantum computing nodes while executing large-scale generative AI computation tasks and advanced quantum algorithms. Quantum computing networks overcome limitations such as the number of qubits and coherence time of entangled pairs and offer advantages for generative AI infrastruct…

    Submitted 13 January, 2024; originally announced January 2024.

  16. arXiv:2311.12078  [pdf, other]

    eess.IV cs.LG

    Fast Controllable Diffusion Models for Undersampled MRI Reconstruction

    Authors: Wei Jiang, Zhuang Xiong, Feng Liu, Nan Ye, Hongfu Sun

    Abstract: Supervised deep learning methods have shown promise in undersampled Magnetic Resonance Imaging (MRI) reconstruction, but their requirement for paired data limits their generalizability to the diverse MRI acquisition parameters. Recently, unsupervised controllable generative diffusion models have been applied to undersampled MRI reconstruction, without paired data or model retraining for different…

    Submitted 11 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

  17. Plug-and-Play Latent Feature Editing for Orientation-Adaptive Quantitative Susceptibility Mapping Neural Networks

    Authors: Yang Gao, Zhuang Xiong, Shanshan Shan, Yin Liu, Pengfei Rong, Min Li, Alan H Wilman, G. Bruce Pike, Feng Liu, Hongfu Sun

    Abstract: Quantitative susceptibility mapping (QSM) is a post-processing technique for deriving tissue magnetic susceptibility distribution from MRI phase measurements. Deep learning (DL) algorithms hold great potential for solving the ill-posed QSM reconstruction problem. However, a significant challenge facing current DL-QSM approaches is their limited adaptability to magnetic dipole field orientation var…

    Submitted 26 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 13 pages, 9 figures

  18. arXiv:2311.06523  [pdf, other]

    cs.NI eess.SP

    Generative AI for Space-Air-Ground Integrated Networks (SAGIN)

    Authors: Ruichen Zhang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, ** Zhang, Dong In Kim

    Abstract: Recently, generative AI technologies have emerged as a significant advancement in the artificial intelligence field, renowned for their language and image generation capabilities. Meanwhile, the space-air-ground integrated network (SAGIN) is an integral part of future B5G/6G for achieving ubiquitous connectivity. Inspired by this, this article explores an integration of generative AI in SAGIN, focusing on…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures

  19. arXiv:2309.02616  [pdf, other]

    eess.IV cs.LG cs.NI

    Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts

    Authors: Hongyang Du, Guangyuan Liu, Dusit Niyato, Jiayi Zhang, Jiawen Kang, Zehui Xiong, Bo Ai, Dong In Kim

    Abstract: Semantic communication (SemCom) holds promise for reducing network resource consumption while achieving the communications goal. However, the computational overheads in jointly training semantic encoders and decoders, and the subsequent deployment in network devices, are overlooked. Recent advances in generative artificial intelligence (GAI) offer a potential solution. The robust learning abilities…

    Submitted 5 September, 2023; originally announced September 2023.

  20. arXiv:2309.01426  [pdf, other]

    cs.NI eess.SP

    A Unified Framework for Guiding Generative AI with Wireless Perception in Resource Constrained Mobile Edge Networks

    Authors: Jiacheng Wang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Deepu Rajan, Shiwen Mao, Xuemin Shen

    Abstract: With the significant advancements in artificial intelligence (AI) technologies and powerful computational capabilities, generative AI (GAI) has become a pivotal digital content generation technique for offering superior digital services. However, directing GAI towards desired outputs still suffers from the inherent instability of the AI model. In this paper, we design a novel framework that utilizes wir…

    Submitted 4 September, 2023; originally announced September 2023.

  21. arXiv:2309.01297  [pdf, other]

    cs.LG eess.SP

    Communication-Efficient Design of Learning System for Energy Demand Forecasting of Electrical Vehicles

    Authors: Jiacong Xu, Riley Kilfoyle, Zixiang Xiong, Ligang Lu

    Abstract: Machine learning (ML) applications to time series energy utilization forecasting problems are a challenging assignment due to a variety of factors. Chief among these is the non-homogeneity of the energy utilization datasets and the geographical dispersion of energy consumers. Furthermore, these ML models require vast amounts of training data and communications overhead in order to develop an effec…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 7 pages, 6 figures

  22. arXiv:2308.15394  [pdf, other]

    cs.AI cs.LG eess.SY

    Decentralized Multi-agent Reinforcement Learning based State-of-Charge Balancing Strategy for Distributed Energy Storage System

    Authors: Zheng Xiong, Biao Luo, Bing-Chuan Wang, Xiaodong Xu, Xiaodong Liu, Tingwen Huang

    Abstract: This paper develops a Decentralized Multi-Agent Reinforcement Learning (Dec-MARL) method to solve the SoC balancing problem in the distributed energy storage system (DESS). First, the SoC balancing problem is formulated into a finite Markov decision process with action constraints derived from demand balance, which can be solved by Dec-MARL. Specifically, the first-order average consensus algorith…

    Submitted 29 August, 2023; originally announced August 2023.

  23. arXiv:2308.13736  [pdf, other]

    cs.SD cs.AI cs.HC eess.AS

    A Comprehensive Survey for Evaluation Methodologies of AI-Generated Music

    Authors: Zeyu Xiong, Weitao Wang, Jing Yu, Yue Lin, Ziyan Wang

    Abstract: In recent years, AI-generated music has made significant progress, with several models performing well in multimodal and complex musical genres and scenes. While objective metrics can be used to evaluate generative music, they often lack interpretability for musical evaluation. Therefore, researchers often resort to subjective user studies to assess the quality of the generated works, which can be…

    Submitted 25 August, 2023; originally announced August 2023.

  24. arXiv:2308.09467  [pdf]

    eess.IV cs.CV

    Quantitative Susceptibility Mapping through Model-based Deep Image Prior (MoDIP)

    Authors: Zhuang Xiong, Yang Gao, Yin Liu, Amir Fazlollahi, Peter Nestor, Feng Liu, Hongfu Sun

    Abstract: The data-driven approach of supervised learning methods has limited applicability in solving dipole inversion in Quantitative Susceptibility Mapping (QSM) with varying scan parameters across different objects. To address this generalization issue in supervised QSM methods, we propose a novel training-free model-based unsupervised method called MoDIP (Model-based Deep Image Prior). MoDIP comprises…

    Submitted 18 August, 2023; originally announced August 2023.

  25. arXiv:2308.07618  [pdf, other]

    cs.GT cs.AI cs.NI eess.SP

    Vision-based Semantic Communications for Metaverse Services: A Contest Theoretic Approach

    Authors: Guangyuan Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Boon Hee Soong

    Abstract: The popularity of Metaverse as an entertainment, social, and work platform has led to a great need for seamless avatar integration in the virtual world. In Metaverse, avatars must be updated and rendered to reflect users' behaviour. Achieving real-time synchronization between the virtual bilocation and the user is complex, placing high demands on the Metaverse Service Provider (MSP)'s rendering re…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 6 pages, 7 figures

  26. arXiv:2308.05384  [pdf, other]

    cs.NI eess.SP

    Enhancing Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization

    Authors: Hongyang Du, Ruichen Zhang, Yinqiu Liu, Jiacheng Wang, Yijing Lin, Zonghang Li, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shuguang Cui, Bo Ai, Haibo Zhou, Dong In Kim

    Abstract: Generative Diffusion Models (GDMs) have emerged as a transformative force in the realm of Generative Artificial Intelligence (GenAI), demonstrating their versatility and efficacy across various applications. The ability to model complex data distributions and generate high-quality samples has made GDMs particularly effective in tasks such as image generation and reinforcement learning. Furthermore…

    Submitted 8 May, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted by IEEE Communications Surveys & Tutorials (COMST)

  27. arXiv:2307.10974  [pdf, other]

    cs.NE cs.CV eess.IV

    Deep Multi-Threshold Spiking-UNet for Image Processing

    Authors: Hebei Li, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

    Abstract: U-Net, known for its simple yet efficient architecture, is widely utilized for image processing tasks and is particularly suitable for deployment on neuromorphic chips. This paper introduces the novel concept of Spiking-UNet for image processing, which combines the power of Spiking Neural Networks (SNNs) with the U-Net architecture. To achieve an efficient Spiking-UNet, we face two primary challen…

    Submitted 11 April, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted in NeuroComputing

  28. arXiv:2306.14683  [pdf, other]

    cs.AI cs.LG eess.SP

    Multi-Agent Deep Reinforcement Learning for Dynamic Avatar Migration in AIoT-enabled Vehicular Metaverses with Trajectory Prediction

    Authors: Junlong Chen, Jiawen Kang, Minrui Xu, Zehui Xiong, Dusit Niyato, Chuan Chen, Abbas Jamalipour, Shengli Xie

    Abstract: Avatars, as promising digital assistants in Vehicular Metaverses, can enable drivers and passengers to immerse in 3D virtual spaces, serving as a practical emerging example of Artificial Intelligence of Things (AIoT) in intelligent vehicular environments. The immersive experience is achieved through seamless human-avatar interaction, e.g., augmented reality navigation, which requires intensive res…

    Submitted 26 June, 2023; originally announced June 2023.

  29. arXiv:2306.12675  [pdf, other]

    eess.SP

    STAR-RIS-Assisted Privacy Protection in Semantic Communication System

    Authors: Yiru Wang, Wanting Yang, Pengxin Guan, Yu** Zhao, Zehui Xiong

    Abstract: Semantic communication (SemCom) has emerged as a promising architecture in the realm of intelligent communication paradigms. SemCom involves extracting and compressing the core information at the transmitter while enabling the receiver to interpret it based on established knowledge bases (KBs). This approach enhances communication efficiency greatly. However, the open nature of wireless transmissi…

    Submitted 22 June, 2023; originally announced June 2023.

  30. arXiv:2305.18994  [pdf, other]

    cs.CV eess.IV

    Toward Real-World Light Field Super-Resolution

    Authors: Zeyu Xiao, Ruisheng Gao, Yutong Liu, Yueyi Zhang, Zhiwei Xiong

    Abstract: Deep learning has opened up new possibilities for light field super-resolution (SR), but existing methods trained on synthetic datasets with simple degradations (e.g., bicubic downsampling) suffer from poor performance when applied to complex real-world scenarios. To address this problem, we introduce LytroZoom, the first real-world light field SR dataset capturing paired low- and high-resolution…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: CVPRW 2023

  31. Toward DNN of LUTs: Learning Efficient Image Restoration with Multiple Look-Up Tables

    Authors: Jiacheng Li, Chang Chen, Zhen Cheng, Zhiwei Xiong

    Abstract: The widespread usage of high-definition screens on edge devices stimulates a strong demand for efficient image restoration algorithms. Caching deep learning models in a look-up table (LUT) was recently introduced to respond to this demand. However, the size of a single LUT grows exponentially with the increase of its indexing capacity, which restricts its receptive field and thus the per…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: Project Page: https://mulut.pages.dev/

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2024, early access

  32. arXiv:2303.01896  [pdf, other]

    eess.SP

    AI-Generated Incentive Mechanism and Full-Duplex Semantic Communications for Information Sharing

    Authors: Hongyang Du, Jiacheng Wang, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim

    Abstract: The next generation of Internet services, such as Metaverse, rely on mixed reality (MR) technology to provide immersive user experiences. However, the limited computation power of MR headset-mounted devices (HMDs) hinders the deployment of such services. Therefore, we propose an efficient information sharing scheme based on full-duplex device-to-device (D2D) semantic communications to address this…

    Submitted 28 June, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE JSAC

  33. arXiv:2303.01346  [pdf, other]

    cs.RO cs.LG eess.SY

    Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications

    Authors: Zikang Xiong, Daniel Lawson, Joe Eappen, Ahmed H. Qureshi, Suresh Jagannathan

    Abstract: Synthesizing planning and control policies in robotics is a fundamental task, further complicated by factors such as complex logic specifications and high-dimensional robot dynamics. This paper presents a novel reinforcement learning approach to solving high-dimensional robot navigation tasks with complex logic specifications by co-learning planning and control policies. Notably, this approach sig…

    Submitted 1 October, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  34. arXiv:2301.03220  [pdf, other]

    cs.AI eess.SY

    Enabling AI-Generated Content (AIGC) Services in Wireless Edge Networks

    Authors: Hongyang Du, Zonghang Li, Dusit Niyato, Jiawen Kang, Zehui Xiong, Xuemin Shen, Dong In Kim

    Abstract: Artificial Intelligence-Generated Content (AIGC) refers to the use of AI to automate the information creation process while fulfilling the personalized requirements of users. However, due to the instability of AIGC models, e.g., the stochastic nature of diffusion models, the quality and accuracy of the generated content can vary significantly. In wireless edge networks, the transmission of incorre…

    Submitted 9 January, 2023; originally announced January 2023.

  35. arXiv:2211.14771  [pdf, other]

    eess.SP

    Performance Analysis of Free-Space Information Sharing in Full-Duplex Semantic Communications

    Authors: Hongyang Du, Jiacheng Wang, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim, Boon Hee Soong

    Abstract: In next-generation Internet services, such as Metaverse, the mixed reality (MR) technique plays a vital role. Yet the limited computing capacity of the user-side MR headset-mounted device (HMD) prevents its further application, especially in scenarios that require a lot of computation. One way out of this dilemma is to design an efficient information sharing scheme among users to replace the heavy…

    Submitted 27 November, 2022; originally announced November 2022.

  36. arXiv:2211.12727  [pdf, other]

    eess.SP

    Semantic Communications for Wireless Sensing: RIS-aided Encoding and Self-supervised Decoding

    Authors: Hongyang Du, Jiacheng Wang, Dusit Niyato, Jiawen Kang, Zehui Xiong, Junshan Zhang, Xuemin Shen

    Abstract: Semantic communications can reduce resource consumption by transmitting task-related semantic information extracted from source messages. However, when the source messages are utilized for various tasks, e.g., wireless sensing data for localization and activity detection, semantic communication techniques are difficult to implement because of the increased processing complexity. In this p…

    Submitted 26 March, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  37. arXiv:2211.03058  [pdf, other]

    cs.CV eess.IV

    Towards Real World HDRTV Reconstruction: A Data Synthesis-based Approach

    Authors: Zhen Cheng, Tao Wang, Yong Li, Fenglong Song, Chang Chen, Zhiwei Xiong

    Abstract: Existing deep learning based HDRTV reconstruction methods assume one kind of tone mapping operator (TMO) as the degradation procedure to synthesize SDRTV-HDRTV pairs for supervised training. In this paper, we argue that, although traditional TMOs exploit efficient dynamic range compression priors, they have several drawbacks in modeling the realistic degradation: information over-preservation, c…

    Submitted 6 November, 2022; originally announced November 2022.

  38. arXiv:2211.00481  [pdf, other]

    eess.SY cs.LG

    Multi-Resource Allocation for On-Device Distributed Federated Learning Systems

    Authors: Yulan Gao, Ziqiang Ye, Han Yu, Zehui Xiong, Yue Xiao, Dusit Niyato

    Abstract: This work poses a distributed multi-resource allocation scheme for minimizing the weighted sum of latency and energy consumption in the on-device distributed federated learning (FL) system. Each mobile device in the system engages the model training process within the specified area and allocates its computation and communication resources for deriving and uploading parameters, respectively, to mi…

    Submitted 1 November, 2022; originally announced November 2022.

  39. arXiv:2210.12995  [pdf, other]

    eess.AS cs.SD

    TridentSE: Guiding Speech Enhancement with 32 Global Tokens

    Authors: Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang, Zhiwei Xiong, Chong Luo

    Abstract: In this paper, we present TridentSE, a novel architecture for speech enhancement, which is capable of efficiently capturing both global information and local details. TridentSE maintains T-F bin level representation to capture details, and uses a small number of global tokens to process the global information. Information is propagated between the local and the global representations through cross…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 5 pages, 2 figures, 3 tables

  40. Privacy-preserving Intelligent Resource Allocation for Federated Edge Learning in Quantum Internet

    Authors: Minrui Xu, Dusit Niyato, Zhaohui Yang, Zehui Xiong, Jiawen Kang, Dong In Kim, Xuemin Shen

    Abstract: Federated edge learning (FEL) is a promising paradigm of distributed machine learning that can preserve data privacy while training the global model collaboratively. However, FEL is still facing model confidentiality issues due to eavesdropping risks of exchanging cryptographic keys through traditional encryption schemes. Therefore, in this paper, we propose a hierarchical architecture for quantum…

    Submitted 9 October, 2022; originally announced October 2022.

  41. arXiv:2209.12274  [pdf, ps, other]

    eess.IV

    Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis

    Authors: Jiawen Kang, Hongyang Du, Zonghang Li, Zehui Xiong, Shiyao Ma, Dusit Niyato, Yuan Li

    Abstract: Semantic communication, as a promising technology, has emerged to break through the Shannon limit, which is envisioned as the key enabler and fundamental paradigm for future 6G networks and applications, e.g., smart healthcare. In this paper, we focus on UAV image-sensing-driven task-oriented semantic communications scenarios. The majority of existing work has focused on designing advanced algorit…

    Submitted 25 September, 2022; originally announced September 2022.

  42. Model-Guided Multi-Contrast Deep Unfolding Network for MRI Super-resolution Reconstruction

    Authors: Gang Yang, Li Zhang, Man Zhou, Ai** Liu, Xun Chen, Zhiwei Xiong, Feng Wu

    Abstract: Magnetic resonance imaging (MRI) with high resolution (HR) provides more detailed information for accurate diagnosis and quantitative image analysis. Despite the significant advances, most existing super-resolution (SR) reconstruction networks for medical images have two flaws: 1) All of them are designed in a black-box principle, thus lacking sufficient interpretability and further limiting their p…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted to ACMMM 2022, 9 pages

  43. arXiv:2207.00427  [pdf, other]

    cs.NI eess.SP

    Semantic Communications for Future Internet: Fundamentals, Applications, and Challenges

    Authors: Wanting Yang, Hongyang Du, Ziqin Liew, Wei Yang Bryan Lim, Zehui Xiong, Dusit Niyato, Xuefen Chi, Xuemin Sherman Shen, Chunyan Miao

    Abstract: With the increasing demand for intelligent services, the sixth-generation (6G) wireless networks will shift from a traditional architecture that focuses solely on high transmission rate to a new architecture that is based on the intelligent connection of everything. Semantic communication (SemCom), a revolutionary architecture that integrates user as well as application requirements and meaning of…

    Submitted 13 November, 2022; v1 submitted 10 June, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2103.05391 by other authors

  44. arXiv:2206.13865  [pdf, other]

    eess.AS cs.SD

    RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion

    Authors: Dacheng Yin, Chuanxin Tang, Yanqing Liu, Xiaoqiang Wang, Zhiyuan Zhao, Yucheng Zhao, Zhiwei Xiong, Sheng Zhao, Chong Luo

    Abstract: This paper proposes a new "decompose-and-edit" paradigm for the text-based speech insertion task that facilitates arbitrary-length speech insertion and even full sentence generation. In the proposed paradigm, global and local factors in speech are explicitly decomposed and separately manipulated to achieve high speaker similarity and continuous prosody. Specifically, we proposed to represent the g…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 5 pages, 1 figure, 3 tables. Accepted by Interspeech 2022

  45. arXiv:2206.07188  [pdf, other]

    cs.LG cs.RO eess.SY

    Defending Observation Attacks in Deep Reinforcement Learning via Detection and Denoising

    Authors: Zikang Xiong, Joe Eappen, He Zhu, Suresh Jagannathan

    Abstract: Neural network policies trained using Deep Reinforcement Learning (DRL) are well-known to be susceptible to adversarial attacks. In this paper, we consider attacks manifesting as perturbations in the observation space managed by the external environment. These attacks have been shown to downgrade policy performance significantly. We focus our attention on well-trained deterministic and stochastic…

    Submitted 14 June, 2022; originally announced June 2022.

  46. arXiv:2203.15610  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

    Authors: Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li

    Abstract: Self-supervised speech representation learning has shown promising results in various speech processing tasks. However, the pre-trained models, e.g., HuBERT, are storage-intensive Transformers, limiting their scope of applications under low-resource settings. To this end, we propose LightHuBERT, a once-for-all Transformer compression framework, to find the desired architectures automatically by pr…

    Submitted 18 June, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: 5 pages, 2 figures, accepted to Interspeech 2022

  47. arXiv:2203.10799  [pdf]

    eess.SY

    A Stochastic Planning Method for Low-carbon Building-level Integrated Energy System Considering Electric-Heat-V2G Coupling

    Authors: Zuxun Xiong, Xinwei Shen, Qinglai Guo, Hongbin Sun

    Abstract: The concept of low-carbon building is proposed to ameliorate the climate change caused by environmental problems and realize carbon neutrality at the building level in urban areas. In addition, renewable energy curtailment in the power distribution system, as well as low efficiency due to independent operation of traditional energy systems, has been addressed by the application of integrated energ…

    Submitted 21 March, 2022; originally announced March 2022.

  48. arXiv:2203.05822  [pdf, other]

    eess.IV cs.CV

    aiWave: Volumetric Image Compression with 3-D Trained Affine Wavelet-like Transform

    Authors: Dongmei Xue, Haichuan Ma, Li Li, Dong Liu, Zhiwei Xiong

    Abstract: Volumetric image compression has become an urgent task to effectively transmit and store images produced in biological research and clinical practice. At present, the most commonly used volumetric image compression methods are based on wavelet transform, such as JP3D. However, JP3D employs an ideal, separable, global, and fixed wavelet basis to convert input images from pixel domain to frequency d…

    Submitted 18 October, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

  49. arXiv:2202.12307  [pdf, other]

    cs.LG cs.AI cs.CV cs.SD eess.AS

    Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph

    Authors: Dacheng Yin, Xuanchi Ren, Chong Luo, Yuwang Wang, Zhiwei Xiong, Wenjun Zeng

    Abstract: This paper addresses the unsupervised learning of content-style decomposed representation. We first give a definition of style and then model the content-style representation as a token-level bipartite graph. An unsupervised framework, named Retriever, is proposed to learn such representations. First, a cross-attention module is employed to retrieve permutation invariant (P.I.) information, define…

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted to ICLR 2022. Project page at https://ydcustc.github.io/retriever-demo/

  50. arXiv:2202.09635  [pdf, other]

    cs.CV eess.IV

    Deep Single Image Deraining using An Asymetric Cycle Generative and Adversarial Framework

    Authors: Wei Liu, Rui Jiang, Cheng Chen, Tao Lu, Zixiang Xiong

    Abstract: In reality, rain and fog are often present at the same time, which can greatly reduce the clarity and quality of the scene image. However, most unsupervised single image deraining methods mainly focus on rain streak removal by disregarding the fog, which leads to low-quality deraining performance. In addition, the samples generated by these methods are rather homogeneous and lack diversity, result…

    Submitted 18 May, 2023; v1 submitted 19 February, 2022; originally announced February 2022.