Skip to main content

Showing 1–50 of 99 results for author: Fang, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19959  [pdf, other

    cs.SD eess.AS

    RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

    Authors: Bing Yang, Changsheng Quan, Yabo Wang, Pengyu Wang, Yujie Yang, Ying Fang, Nian Shao, Hui Bu, Xin Xu, Xiaofei Li

    Abstract: The training of deep learning-based multichannel speech enhancement and source localization systems relies heavily on the simulation of room impulse response and multichannel diffuse noise, due to the lack of large-scale real-recorded datasets. However, the acoustic mismatch between simulated and real-world data could degrade the model performance when applying in real-world scenarios. To bridge t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.18592  [pdf, ps, other

    eess.SP

    On the Coexistence of OTFS Modulation with OFDM-based Communication Systems

    Authors: Akram Shafie, **hong Yuan, Paul Fitzpatrick, Taka Sakurai, Yuting Fang

    Abstract: We investigate the coexistence of orthogonal time-frequency space (OTFS) modulation with current fourth- and fifth-generation (4G/5G) communication systems that primarily use orthogonal frequency-division multiplexing (OFDM) waveforms. We first derive the input-output-relation of OTFS in the considered coexisting system. In this derivation, we consider (i) the inclusion of multiple cyclic prefixes… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: This paper has been submitted for publication in an IEEE Journal. Copyright may be transferred without notice, after which this version may no longer be accessible. arXiv admin note: text overlap with arXiv:2311.06850

  3. arXiv:2406.17173  [pdf, other

    eess.IV cs.CV cs.LG

    Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks

    Authors: Zihao **, Yingying Fang, Jiahao Huang, Caiwen Xu, Simon Walsh, Guang Yang

    Abstract: The manifestation of symptoms associated with lung diseases can vary in different depths for individual patients, highlighting the significance of 3D information in CT scans for medical image classification. While Vision Transformer has shown superior performance over convolutional neural networks in image classification tasks, their effectiveness is often demonstrated on sufficiently large 2D dat… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: conference

  4. arXiv:2406.16189  [pdf, other

    eess.IV cs.CV

    Fuzzy Attention-based Border Rendering Network for Lung Organ Segmentation

    Authors: Sheng Zhang, Yang Nan, Yingying Fang, Shiyi Wang, Xiaodan Xing, Zhifan Gao, Guang Yang

    Abstract: Automatic lung organ segmentation on CT images is crucial for lung disease diagnosis. However, the unlimited voxel values and class imbalance of lung organs can lead to false-negative/positive and leakage issues in advanced methods. Additionally, some slender lung organs are easily lost during the recycled down/up-sample procedure, e.g., bronchioles & arterioles, causing severe discontinuity issue… ▽ More

    Submitted 1 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024

  5. arXiv:2406.12426  [pdf, other

    cs.IT eess.SP

    Multi-Active-IRS-Assisted Cooperative Sensing: Cramér-Rao Bound and Joint Beamforming Design

    Authors: Yuan Fang, Xianghao Yu, Jie Xu, Ying-Jun Angela Zhang

    Abstract: This paper studies the multi-intelligent reflecting surface (IRS)-assisted cooperative sensing, in which multiple active IRSs are deployed in a distributed manner to facilitate multi-view target sensing at the non-line-of-sight (NLoS) area of the base station (BS). Different from prior works employing passive IRSs, we leverage active IRSs with the capability of amplifying the reflected signals to… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2404.13536

  6. arXiv:2406.09389  [pdf, other

    eess.IV cs.CV

    Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior

    Authors: Baiang Li, Sizhuo Ma, Yanhong Zeng, Xiaogang Xu, Youqing Fang, Zhao Zhang, Jian Wang, Kai Chen

    Abstract: Capturing High Dynamic Range (HDR) scenery using 8-bit cameras often suffers from over-/underexposure, loss of fine details due to low bit-depth compression, skewed color distributions, and strong noise in dark areas. Traditional LDR image enhancement methods primarily focus on color map**, which enhances the visual representation by expanding the image's color range and adjusting the brightness… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: https://sagiri0208.github.io

  7. arXiv:2406.07773  [pdf

    eess.IV

    Development of Focused X-ray Luminescence Compute Tomography Imaging

    Authors: Yile Fang, Yibing Zhang, Changqing Li

    Abstract: X-ray luminescence is produced when contrast agents absorb energy from X-ray photons and release a portion of that energy by emitting photons in the visible and near-infrared range. X-ray luminescence computed tomography (XLCT) was introduced in the past decade as a hybrid molecular imaging modality combining the merits of both X-ray imaging (high spatial resolution) and optical imaging (high sens… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  8. arXiv:2405.19298  [pdf, other

    cs.CV eess.IV

    Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare

    Authors: Hanwei Zhu, Haoning Wu, Yixuan Li, Zicheng Zhang, Baoliang Chen, Lingyu Zhu, Yuming Fang, Guangtao Zhai, Weisi Lin, Shiqi Wang

    Abstract: While recent advancements in large multimodal models (LMMs) have significantly improved their abilities in image quality assessment (IQA) relying on absolute quality rating, how to transfer reliable relative quality comparison outputs to continuous perceptual quality scores remains largely unexplored. To address this gap, we introduce Compare2Score-an all-around LMM-based no-reference IQA (NR-IQA)… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  9. arXiv:2405.06178  [pdf, other

    eess.IV cs.LG q-bio.NC

    ACTION: Augmentation and Computation Toolbox for Brain Network Analysis with Functional MRI

    Authors: Yuqi Fang, Junhao Zhang, Linmin Wang, Qianqian Wang, Mingxia Liu

    Abstract: Functional magnetic resonance imaging (fMRI) has been increasingly employed to investigate functional brain activity. Many fMRI-related software/toolboxes have been developed, providing specialized algorithms for fMRI analysis. However, existing toolboxes seldom consider fMRI data augmentation, which is quite useful, especially in studies with limited or imbalanced data. Moreover, current studies… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures, 5 tables

  10. arXiv:2404.13536  [pdf, other

    cs.IT eess.SP

    Joint Transmit and Reflective Beamforming for Multi-Active-IRS-Assisted Cooperative Sensing

    Authors: Yuan Fang, Xianghao Yu, Jie Xu

    Abstract: This paper studies multi-active intelligent-reflecting-surface (IRS) cooperative sensing, in which multiple active IRSs are deployed in a distributed manner to help the base station (BS) provide multi-view sensing. We focus on the scenario where the sensing target is located in the non-line-of-sight (NLoS) area of the BS. Based on the received echo signal, the BS aims to estimate the target's dire… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  11. arXiv:2404.08566  [pdf, other

    eess.SP cs.LG

    Mitigating Receiver Impact on Radio Frequency Fingerprint Identification via Domain Adaptation

    Authors: Liu Yang, Qiang Li, Xiaoyang Ren, Yi Fang, Shafei Wang

    Abstract: Radio Frequency Fingerprint Identification (RFFI), which exploits non-ideal hardware-induced unique distortion resident in the transmit signals to identify an emitter, is emerging as a means to enhance the security of communication systems. Recently, machine learning has achieved great success in develo** state-of-the-art RFFI models. However, few works consider cross-receiver RFFI problems, whe… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Internet of Things Journal

  12. arXiv:2403.16353  [pdf, other

    cs.IT eess.SP

    Energy-Efficient Hybrid Beamforming with Dynamic On-off Control for Integrated Sensing, Communications, and Powering

    Authors: Zeyu Hao, Yuan Fang, Xianghao Yu, Jie Xu, Ling Qiu, Lexi Xu, Shuguang Cui

    Abstract: This paper investigates the energy-efficient hybrid beamforming design for a multi-functional integrated sensing, communications, and powering (ISCAP) system. In this system, a base station (BS) with a hybrid analog-digital (HAD) architecture sends unified wireless signals to communicate with multiple information receivers (IRs), sense multiple point targets, and wirelessly charge multiple energy… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 13 pages, 6 figures, submitted to IEEE Transactions on Communications

  13. arXiv:2403.09096  [pdf, other

    eess.IV cs.CV

    Deep unfolding Network for Hyperspectral Image Super-Resolution with Automatic Exposure Correction

    Authors: Yuan Fang, Yipeng Liu, Jie Chen, Zhen Long, Ao Li, Chong-Yung Chi, Ce Zhu

    Abstract: In recent years, the fusion of high spatial resolution multispectral image (HR-MSI) and low spatial resolution hyperspectral image (LR-HSI) has been recognized as an effective method for HSI super-resolution (HSI-SR). However, both HSI and MSI may be acquired under extreme conditions such as night or poorly illuminating scenarios, which may cause different exposure levels, thereby seriously downgr… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  14. arXiv:2402.13511  [pdf, other

    eess.AS

    Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

    Authors: Rui Zhou, Xian Li, Ying Fang, Xiaofei Li

    Abstract: In this work, we propose Mel-FullSubNet, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance. Mel-FullSubNet takes as input the noisy and reverberant Mel-spectrogram and predicts the corresponding clean Mel-spectrogram. The enhanced Mel-spectrogram can be either transformed to speech waveform wi… ▽ More

    Submitted 21 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  15. arXiv:2402.09752  [pdf

    physics.optics eess.SY physics.app-ph quant-ph

    Vector spectrometer with Hertz-level resolution and super-recognition capability

    Authors: Ting Qing, Shupeng Li, Huashan Yang, Lihan Wang, Yijie Fang, Xiaohu Tang, Meihui Cao, Jianming Lu, Jijun He, Junqiu Liu, Yueguang Lyu, Shilong Pan

    Abstract: High-resolution optical spectrometers are crucial in revealing intricate characteristics of signals, determining laser frequencies, measuring physical constants, identifying substances, and advancing biosensing applications. Conventional spectrometers, however, often grapple with inherent trade-offs among spectral resolution, wavelength range, and accuracy. Furthermore, even at high resolution, re… ▽ More

    Submitted 6 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 21 pages, 6 figures

  16. arXiv:2402.05378  [pdf, other

    eess.SP cs.AI cs.CR cs.LG

    Graph Neural Networks for Physical-Layer Security in Multi-User Flexible-Duplex Networks

    Authors: Tharaka Perera, Saman Atapattu, Yuting Fang, Jamie Evans

    Abstract: This paper explores Physical-Layer Security (PLS) in Flexible Duplex (FlexD) networks, considering scenarios involving eavesdroppers. Our investigation revolves around the intricacies of the sum secrecy rate maximization problem, particularly when faced with coordinated and distributed eavesdroppers employing a Minimum Mean Square Error (MMSE) receiver. Our contributions include an iterative class… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  17. arXiv:2402.03473  [pdf, other

    eess.IV cs.CV

    Assessing the Efficacy of Invisible Watermarks in AI-Generated Medical Images

    Authors: Xiaodan Xing, Huiyu Zhou, Yingying Fang, Guang Yang

    Abstract: AI-generated medical images are gaining growing popularity due to their potential to address the data scarcity challenge in the real world. However, the issue of accurate identification of these synthetic images, particularly when they exhibit remarkable realism with their real copies, remains a concern. To mitigate this challenge, image generators such as DALLE and Imagen, have integrated digital… ▽ More

    Submitted 21 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 5 pages

    Journal ref: ISBI 2024

  18. arXiv:2401.16564  [pdf

    eess.SP

    Data and Physics driven Deep Learning Models for Fast MRI Reconstruction: Fundamentals and Methodologies

    Authors: Jiahao Huang, Yinzhe Wu, Fanwen Wang, Yingying Fang, Yang Nan, Cagan Alkan, Lei Xu, Zhifan Gao, Weiwen Wu, Lei Zhu, Zhaolin Chen, Peter Lally, Neal Bangerter, Kawin Setsompop, Yike Guo, Daniel Rueckert, Ge Wang, Guang Yang

    Abstract: Magnetic Resonance Imaging (MRI) is a pivotal clinical diagnostic tool, yet its extended scanning times often compromise patient comfort and image quality, especially in volumetric, temporal and quantitative scans. This review elucidates recent advances in MRI acceleration via data and physics-driven models, leveraging techniques from algorithm unrolling models, enhancement-based models, and plug-… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  19. arXiv:2401.13442  [pdf, other

    cs.IT eess.SP

    Finite-Precision Arithmetic Transceiver for Massive MIMO Systems

    Authors: Yiming Fang, Li Chen, Yunfei Chen, Huarui Yin

    Abstract: Efficient implementation of massive multiple-input-multiple-output (MIMO) transceivers is essential for the next-generation wireless networks. To reduce the high computational complexity of the massive MIMO transceiver, in this paper, we propose a new massive MIMO architecture using finite-precision arithmetic. First, we conduct the rounding error analysis and derive the lower bound of the achieva… ▽ More

    Submitted 25 March, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 16 pages, 8 figures. Submitted to IEEE JSAC for possible publication

  20. arXiv:2401.05431  [pdf, other

    eess.SP cs.AI cs.LG

    TRLS: A Time Series Representation Learning Framework via Spectrogram for Medical Signal Processing

    Authors: Luyuan Xie, Cong Li, Xin Zhang, Shengfang Zhai, Yuejian Fang, Qingni Shen, Zhonghai Wu

    Abstract: Representation learning frameworks in unlabeled time series have been proposed for medical signal processing. Despite the numerous excellent progresses have been made in previous works, we observe the representation extracted for the time series still does not generalize well. In this paper, we present a Time series (medical signal) Representation Learning framework via Spectrogram (TRLS) to get m… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: This paper is accept by ICASSP 2024. This is a more detailed version

  21. arXiv:2401.01544  [pdf, other

    cs.CV eess.SP

    Collaborative Perception for Connected and Autonomous Driving: Challenges, Possible Solutions and Opportunities

    Authors: Senkang Hu, Zhengru Fang, Yiqin Deng, Xianhao Chen, Yuguang Fang

    Abstract: Autonomous driving has attracted significant attention from both academia and industries, which is expected to offer a safer and more efficient driving system. However, current autonomous driving systems are mostly based on a single vehicle, which has significant limitations which still poses threats to driving safety. Collaborative perception with connected and autonomous vehicles (CAVs) shows a… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  22. arXiv:2312.14383  [pdf, other

    cs.MM cs.CV eess.IV

    Removing Interference and Recovering Content Imaginatively for Visible Watermark Removal

    Authors: Yicheng Leng, Chaowei Fang, Gen Li, Yixiang Fang, Guanbin Li

    Abstract: Visible watermarks, while instrumental in protecting image copyrights, frequently distort the underlying content, complicating tasks like scene interpretation and image editing. Visible watermark removal aims to eliminate the interference of watermarks and restore the background content. However, existing methods often implement watermark component removal and background restoration tasks within a… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  23. arXiv:2312.05557  [pdf, ps, other

    cs.IT eess.SP

    Long-Term Rate-Fairness-Aware Beamforming Based Massive MIMO Systems

    Authors: W. Zhu, H. D. Tuan, E. Dutkiewicz, Y. Fang, H. V. Poor, L. Hanzo

    Abstract: This is the first treatise on multi-user (MU) beamforming designed for achieving long-term rate-fairness in fulldimensional MU massive multi-input multi-output (m-MIMO) systems. Explicitly, based on the channel covariances, which can be assumed to be known beforehand, we address this problem by optimizing the following objective functions: the users' signal-toleakage-noise ratios (SLNRs) using SLN… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  24. arXiv:2311.06850  [pdf, ps, other

    eess.SP

    Coexistence of OTFS Modulation With OFDM-based Communication Systems

    Authors: Akram Shafie, **hong Yuan, Yuting Fang, Paul Fitzpatrick, Taka Sakurai

    Abstract: This study examines the coexistence of orthogonal time-frequency space (OTFS) modulation with current fourth- and fifth-generation (4G/5G) wireless communication systems that primarily use orthogonal frequency-division multiplexing (OFDM) waveforms. We first derive the input-output-relation (IOR) of OTFS when it coexists with an OFDM system while considering the impact of unequal lengths of the cy… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted for publication in IEEE Global Communications Conferences (GLOBECOM) 2023. Copyright may be transferred without notice, after which this version may no longer be accessible

  25. arXiv:2311.05372  [pdf, other

    cs.IT eess.SP

    Joint Angle and Delay Cramér-Rao Bound Optimization for Integrated Sensing and Communications

    Authors: Chao Hu, Yuan Fang, Ling Qiu

    Abstract: In this paper, we study a multi-input multi-output (MIMO) beamforming design in an integrated sensing and communication (ISAC) system, in which an ISAC base station (BS) is used to communicate with multiple downlink users and simultaneously the communication signals are reused for sensing multiple targets. Our interested sensing parameters are the angle and delay information of the targets, which… ▽ More

    Submitted 13 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: This paper has been submitted to IEEE TVT

  26. arXiv:2311.01066  [pdf, other

    eess.IV cs.CV

    Dynamic Multimodal Information Bottleneck for Multimodality Classification

    Authors: Yingying Fang, Shuang Wu, Sheng Zhang, Chaoyan Huang, Tieyong Zeng, Xiaodan Xing, Simon Walsh, Guang Yang

    Abstract: Effectively leveraging multimodal data such as various images, laboratory tests and clinical information is gaining traction in a variety of AI-based medical diagnosis and prognosis tasks. Most existing multi-modal techniques only focus on enhancing their performance by leveraging the differences or shared features from various modalities and fusing feature across different modalities. These appro… ▽ More

    Submitted 25 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: WACV 2024

  27. arXiv:2311.01003  [pdf, other

    eess.SY cs.RO

    Minimum Snap Trajectory Generation and Control for an Under-actuated Flap** Wing Aerial Vehicle

    Authors: Chen Qian, Rui Chen, Peiyao Shen, Yongchun Fang, Jifu Yan, Tiefeng Li

    Abstract: Minimum Snap Trajectory Generation and Control for an Under-actuated Flap** Wing Aerial VehicleThis paper presents both the trajectory generation and tracking control strategies for an underactuated flap** wing aerial vehicle (FWAV). First, the FWAV dynamics is analyzed in a practical perspective. Then, based on these analyses, we demonstrate the differential flatness of the FWAV system, and d… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  28. arXiv:2309.08150  [pdf, other

    cs.CL cs.SD eess.AS

    Unimodal Aggregation for CTC-based Speech Recognition

    Authors: Ying Fang, Xiaofei Li

    Abstract: This paper works on non-autoregressive automatic speech recognition. A unimodal aggregation (UMA) is proposed to segment and integrate the feature frames that belong to the same text token, and thus to learn better feature representations for text tokens. The frame-wise features and weights are both derived from an encoder. Then, the feature frames with unimodal weights are integrated and further… ▽ More

    Submitted 19 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted by ICASSP 2024

  29. arXiv:2309.03472  [pdf, other

    cs.CV eess.IV

    Perceptual Quality Assessment of 360$^\circ$ Images Based on Generative Scanpath Representation

    Authors: Xiangjie Sui, Hanwei Zhu, Xuelin Liu, Yuming Fang, Shiqi Wang, Zhou Wang

    Abstract: Despite substantial efforts dedicated to the design of heuristic models for omnidirectional (i.e., 360$^\circ$) image quality assessment (OIQA), a conspicuous gap remains due to the lack of consideration for the diversity of viewing behaviors that leads to the varying perceptual quality of 360$^\circ$ images. Two critical aspects underline this oversight: the neglect of viewing conditions that sig… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 12 pages, 5 figures

  30. arXiv:2309.02259  [pdf, ps, other

    cs.IT eess.SP

    Design of a New CIM-DCSK-Based Ambient Backscatter Communication System

    Authors: Ruipeng Yang, Yi Fang, **** Chen, Huan Ma

    Abstract: To improve the data rate in differential chaos shift keying (DCSK) based ambient backscatter communication (AmBC) system, we propose a new AmBC system based on code index modulation (CIM), referred to as CIM-DCSK-AmBC system. In the proposed system, the CIM-DCSK signal transmitted in the direct link is used as the radio frequency source of the backscatter link. The signal format in the backscatter… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  31. arXiv:2308.16551  [pdf

    eess.IV cs.CV

    Object Detection for Caries or Pit and Fissure Sealing Requirement in Children's First Permanent Molars

    Authors: Chenyao Jiang, Shiyao Zhai, Hengrui Song, Yuqing Ma, Yachen Fan, Yancheng Fang, Dongmei Yu, Canyang Zhang, Sanyang Han, Runming Wang, Yong Liu, Jianbo Li, Peiwu Qin

    Abstract: Dental caries is one of the most common oral diseases that, if left untreated, can lead to a variety of oral problems. It mainly occurs inside the pits and fissures on the occlusal/buccal/palatal surfaces of molars and children are a high-risk group for pit and fissure caries in permanent molars. Pit and fissure sealing is one of the most effective methods that is widely used in prevention of pit… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  32. arXiv:2308.09359  [pdf, other

    cs.IT eess.SP

    Training-Free Energy Beamforming Assisted by Wireless Sensing

    Authors: Li Zhang, Yuan Fang, Zixiang Ren, Ling Qiu, Jie Xu

    Abstract: This paper studies the transmit energy beamforming in a multi-antenna wireless power transfer (WPT) system, in which an access point (AP) equipped with a uniform linear array (ULA) sends radio signals to wirelessly charge multiple single-antenna energy receivers (ERs). Different from conventional energy beamforming designs that require the AP to acquire the channel state information (CSI) via trai… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 6 pages, 5 figures, submitted to globecom workshop

  33. arXiv:2308.04126  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation

    Authors: Dongyang Yu, Shihao Wang, Yuan Fang, Wangpeng An

    Abstract: This paper presents OmniDataComposer, an innovative approach for multimodal data fusion and unlimited data generation with an intent to refine and uncomplicate interplay among diverse data modalities. Coming to the core breakthrough, it introduces a cohesive data structure proficient in processing and merging multimodal data inputs, which include video, audio, and text. Our crafted algorithm lev… ▽ More

    Submitted 17 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

  34. arXiv:2307.12845  [pdf, other

    eess.IV cs.CV

    Multi-View Vertebra Localization and Identification from CT Images

    Authors: Han Wu, Jiadong Zhang, Yu Fang, Zhentao Liu, Nizhuan Wang, Zhiming Cui, Dinggang Shen

    Abstract: Accurately localizing and identifying vertebrae from CT images is crucial for various clinical applications. However, most existing efforts are performed on 3D with crop** patch operation, suffering from the large computation costs and limited global information. In this paper, we propose a multi-view vertebra localization and identification from CT images, converting the 3D problem into a 2D lo… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: MICCAI 2023

  35. arXiv:2307.11337  [pdf, other

    cs.IT eess.SP

    Fundamental CRB-Rate Tradeoff in Multi-Antenna ISAC Systems with Information Multicasting and Multi-Target Sensing

    Authors: Zixiang Ren, Yunfei Peng, Xianxin Song, Yuan Fang, Ling Qiu, Liang Liu, Derrick Wing Kwan Ng, Jie Xu

    Abstract: This paper investigates the performance tradeoff for a multi-antenna integrated sensing and communication (ISAC) system with simultaneous information multicasting and multi-target sensing, in which a multi-antenna base station (BS) sends the common information messages to a set of single-antenna communication users (CUs) and estimates the parameters of multiple sensing targets based on the echo si… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 32 pages

  36. arXiv:2307.05382  [pdf, other

    eess.SP cs.AI cs.LG

    Protecting the Future: Neonatal Seizure Detection with Spatial-Temporal Modeling

    Authors: Ziyue Li, Yuchen Fang, You Li, Kan Ren, Yansen Wang, Xufang Luo, Juanyong Duan, Congrui Huang, Dongsheng Li, Lili Qiu

    Abstract: A timely detection of seizures for newborn infants with electroencephalogram (EEG) has been a common yet life-saving practice in the Neonatal Intensive Care Unit (NICU). However, it requires great human efforts for real-time monitoring, which calls for automated solutions to neonatal seizure detection. Moreover, the current automated methods focusing on adult epilepsy monitoring often fail due to… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2023

  37. arXiv:2307.05127  [pdf, ps, other

    cs.IT eess.SP

    Optimal Coordinated Transmit Beamforming for Networked Integrated Sensing and Communications

    Authors: Gaoyuan Cheng, Yuan Fang, Jie Xu, Derrick Wing Kwan Ng

    Abstract: This paper studies a multi-antenna networked integrated sensing and communications (ISAC) system, in which a set of multi-antenna base stations (BSs) employ the coordinated transmit beamforming to serve multiple single-antenna communication users (CUs) and perform joint target detection by exploiting the reflected signals simultaneously. To facilitate target sensing, the BSs transmit dedicated sen… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2211.01085

  38. arXiv:2307.02242  [pdf, other

    cs.IT eess.SP

    Multi-IRS-Enabled Integrated Sensing and Communications

    Authors: Yuan Fang, Siyao Zhang, Xinmin Li, Xianghao Yu, Jie Xu, Shuguang Cui

    Abstract: This paper studies a multi-intelligent-reflecting-surface-(IRS)-enabled integrated sensing and communications (ISAC) system, in which multiple IRSs are installed to help the base station (BS) provide ISAC services at separate line-of-sight (LoS) blocked areas. We focus on the scenario with semi-passive uniform linear array (ULA) IRSs for sensing, in which each IRS is integrated with dedicated sens… ▽ More

    Submitted 17 December, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  39. arXiv:2307.00327  [pdf

    cs.CV eess.IV

    SDRCNN: A single-scale dense residual connected convolutional neural network for pansharpening

    Authors: Yuan Fang, Yuanzhi Cai, Lei Fan

    Abstract: Pansharpening is a process of fusing a high spatial resolution panchromatic image and a low spatial resolution multispectral image to create a high-resolution multispectral image. A novel single-branch, single-scale lightweight convolutional neural network, named SDRCNN, is developed in this study. By using a novel dense residual connected structure and convolution block, SDRCNN achieved a better… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted for publication in the IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

  40. arXiv:2306.15303  [pdf, other

    cs.IT eess.SP

    Energy-Efficient MIMO Integrated Sensing and Communications with On-off Non-transmission Power

    Authors: Guanlin Wu, Yuan Fang, Jie Xu, Zhiyong Feng, Shuguang Cui

    Abstract: This paper investigates the energy efficiency of a multiple-input multiple-output (MIMO) integrated sensing and communications (ISAC) system, in which one multi-antenna base station (BS) transmits unified ISAC signals to a multi-antenna communication user (CU) and at the same time use the echo signals to estimate an extended target. We focus on one particular ISAC transmission block and take into… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 9 figures

  41. arXiv:2306.11977  [pdf

    eess.IV cs.CV

    Encoding Enhanced Complex CNN for Accurate and Highly Accelerated MRI

    Authors: Zimeng Li, Sa Xiao, Cheng Wang, Haidong Li, Xiuchao Zhao, Caohui Duan, Qian Zhou, Qiuchen Rao, Yuan Fang, Junshuai Xie, Lei Shi, Fumin Guo, Chaohui Ye, Xin Zhou

    Abstract: Magnetic resonance imaging (MRI) using hyperpolarized noble gases provides a way to visualize the structure and function of human lung, but the long imaging time limits its broad research and clinical applications. Deep learning has demonstrated great potential for accelerating MRI by reconstructing images from undersampled data. However, most existing deep conventional neural networks (CNN) direc… ▽ More

    Submitted 13 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  42. arXiv:2305.12311  [pdf, other

    cs.CL cs.AI cs.CV cs.LG eess.AS

    i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data

    Authors: Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang

    Abstract: The convergence of text, visual, and audio data is a key step towards human-like artificial intelligence, however the current Vision-Language-Speech landscape is dominated by encoder-only models which lack generative abilities. We propose closing this gap with i-Code V2, the first model capable of generating natural language from any combination of Vision, Language, and Speech data. i-Code V2 is a… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

  43. arXiv:2305.09331  [pdf, other

    cs.NI eess.SP

    Energy-Efficient WiFi Backscatter Communication for Green IoTs

    Authors: Yimeng Huang, Lijie Liu, Jihong Yu, Yuguang Fang, Wei Gong

    Abstract: The boom of the Internet of Things has revolutionized people's lives, but it has also resulted in massive resource consumption and environmental pollution. Recently, Green IoT (GIoT) has become a worldwide consensus to address this issue. In this paper, we propose EEWScatter, an energy-efficient WiFi backscatter communication system to pursue the goal of GIoT. Unlike previous backscatter systems t… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  44. arXiv:2303.14739  [pdf, other

    eess.IV cs.CV

    Geometry-Aware Attenuation Field Learning for Sparse-View CBCT Reconstruction

    Authors: Zhentao Liu, Yu Fang, Changjian Li, Han Wu, Yuan Liu, Zhiming Cui, Dinggang Shen

    Abstract: Cone Beam Computed Tomography (CBCT) is the most widely used imaging method in dentistry. As hundreds of X-ray projections are needed to reconstruct a high-quality CBCT image (i.e., the attenuation field) in traditional algorithms, sparse-view CBCT reconstruction has become a main focus to reduce radiation dose. Several attempts have been made to solve it while still suffering from insufficient da… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

  45. arXiv:2303.08015  [pdf, ps, other

    q-bio.QM cs.IT eess.SP

    Molecular Communication for Quorum Sensing Inspired Cooperative Drug Delivery

    Authors: Yuting Fang, Stuart T. Johnston, Matt Faria, Xinyu Huang, Andrew W. Eckford, Jamie Evans

    Abstract: A cooperative drug delivery system is proposed, where quorum sensing (QS), a density-dependent bacterial behavior coordination mechanism, is employed by synthetic bacterium-based nanomachines (B-NMs) for controllable drug delivery. In our proposed system, drug delivery is only triggered when there are enough QS molecules, which in turn only happens when there are enough B-NMs. This makes the propo… ▽ More

    Submitted 14 February, 2023; originally announced March 2023.

    Comments: 9 pages; 9 figures

  46. Low-Complexity Pareto-Optimal 3D Beamforming for the Full-Dimensional Multi-User Massive MIMO Downlink

    Authors: W. Zhu, H. D. Tuan, E. Dutkiewicz, Y. Fang, L. Hanzo

    Abstract: Full-dimensional (FD) multi-user massive multiple input multiple output (m-MIMO) systems employ large two-dimensional (2D) rectangular antenna arrays to control both the azimuth and elevation angles of signal transmission. We introduce the sum of two outer products of the azimuth and elevation beamforming vectors having moderate dimensions as a new class of FD beamforming. We show that this low-co… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

  47. arXiv:2301.11166  [pdf, ps, other

    cs.NI cs.LG eess.SP

    Flex-Net: A Graph Neural Network Approach to Resource Management in Flexible Duplex Networks

    Authors: Tharaka Perera, Saman Atapattu, Yuting Fang, Prathapasinghe Dharmawansa, Jamie Evans

    Abstract: Flexible duplex networks allow users to dynamically employ uplink and downlink channels without static time scheduling, thereby utilizing the network resources efficiently. This work investigates the sum-rate maximization of flexible duplex networks. In particular, we consider a network with pairwise-fixed communication links. Corresponding combinatorial optimization is a non-deterministic polynom… ▽ More

    Submitted 15 March, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  48. arXiv:2211.17048  [pdf, other

    eess.IV cs.CV

    SNAF: Sparse-view CBCT Reconstruction with Neural Attenuation Fields

    Authors: Yu Fang, Lanzhuju Mei, Changjian Li, Yuan Liu, Wen** Wang, Zhiming Cui, Dinggang Shen

    Abstract: Cone beam computed tomography (CBCT) has been widely used in clinical practice, especially in dental clinics, while the radiation dose of X-rays when capturing has been a long concern in CBCT imaging. Several research works have been proposed to reconstruct high-quality CBCT images from sparse-view 2D projections, but the current state-of-the-arts suffer from artifacts and the lack of fine details… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

  49. arXiv:2211.14576  [pdf, other

    eess.IV cs.CV

    CFNet: Conditional Filter Learning with Dynamic Noise Estimation for Real Image Denoising

    Authors: Yifan Zuo, Jiacheng Xie, Yuming Fang, Yan Huang, Wenhui Jiang

    Abstract: A mainstream type of the state of the arts (SOTAs) based on convolutional neural network (CNN) for real image denoising contains two sub-problems, i.e., noise estimation and non-blind denoising. This paper considers real noise approximated by heteroscedastic Gaussian/Poisson Gaussian distributions with in-camera signal processing pipelines. The related works always exploit the estimated noise prio… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

  50. arXiv:2210.02416  [pdf, other

    eess.IV cs.CV cs.LG

    A deep learning model for brain vessel segmentation in 3DRA with arteriovenous malformations

    Authors: Camila García, Yibin Fang, Jianmin Liu, Ana Paula Narata, José Ignacio Orlando, Ignacio Larrabide

    Abstract: Segmentation of brain arterio-venous malformations (bAVMs) in 3D rotational angiographies (3DRA) is still an open problem in the literature, with high relevance for clinical practice. While deep learning models have been applied for segmenting the brain vasculature in these images, they have never been used in cases with bAVMs. This is likely caused by the difficulty to obtain sufficiently annotat… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: 9 pages, 4 figures, submitted to SIPAIM 2022, to be published in the SPIE Digital Library