Skip to main content

Showing 1–50 of 150 results for author: Zhou, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.15093  [pdf, other

    cs.CR cs.CV eess.IV

    ECLIPSE: Expunging Clean-label Indiscriminate Poisons via Sparse Diffusion Purification

    Authors: Xianlong Wang, Shengshan Hu, Yechao Zhang, Ziqi Zhou, Leo Yu Zhang, Peng Xu, Wei Wan, Hai **

    Abstract: Clean-label indiscriminate poisoning attacks add invisible perturbations to correctly labeled training images, thus dramatically reducing the generalization capability of the victim models. Recently, some defense mechanisms have been proposed such as adversarial training, image transformation techniques, and image purification. However, these schemes are either susceptible to adaptive attacks, bui… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted by ESORICS 2024

  2. arXiv:2406.07952  [pdf, other

    eess.IV cs.CV

    Spatial-Frequency Dual Progressive Attention Network For Medical Image Segmentation

    Authors: Zhenhuan Zhou, Along He, Yanlin Wu, Rui Yao, Xueshuo Xie, Tao Li

    Abstract: In medical images, various types of lesions often manifest significant differences in their shape and texture. Accurate medical image segmentation demands deep learning models with robust capabilities in multi-scale and boundary feature learning. However, previous networks still have limitations in addressing the above issues. Firstly, previous networks simultaneously fuse multi-level features or… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 8 pages

  3. arXiv:2406.07421  [pdf, other

    cs.SD eess.AS

    A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition

    Authors: Zhenyu Zhou, Shibiao Xu, Shi Yin, Lantian Li, Dong Wang

    Abstract: Data augmentation (DA) has played a pivotal role in the success of deep speaker recognition. Current DA techniques primarily focus on speaker-preserving augmentation, which does not change the speaker trait of the speech and does not create new speakers. Recent research has shed light on the potential of speaker augmentation, which generates new speakers to enrich the training dataset. In this stu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: to be published in INTERSPEECH 2024

  4. arXiv:2406.06626  [pdf, other

    cs.LG cs.AI cs.HC eess.SP

    Benchmarking Neural Decoding Backbones towards Enhanced On-edge iBCI Applications

    Authors: Zhou Zhou, Guohang He, Zheng Zhang, Luziwei Leng, Qinghai Guo, Jianxing Liao, Xuan Song, Ran Cheng

    Abstract: Traditional invasive Brain-Computer Interfaces (iBCIs) typically depend on neural decoding processes conducted on workstations within laboratory settings, which prevents their everyday usage. Implementing these decoding processes on edge devices, such as the wearables, introduces considerable challenges related to computational demands, processing speed, and maintaining accuracy. This study seeks… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  5. arXiv:2406.03912  [pdf, other

    cs.AI cs.LG cs.RO eess.SY

    GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model

    Authors: Zhehua Zhou, Xuan Xie, Jiayang Song, Zhan Shu, Lei Ma

    Abstract: Although deep reinforcement learning has demonstrated impressive achievements in controlling various autonomous systems, e.g., autonomous vehicles or humanoid robots, its inherent reliance on random exploration raises safety concerns in their real-world applications. To improve system safety during the learning process, a variety of Safe Reinforcement Learning (SRL) algorithms have been proposed,… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2405.18558  [pdf, other

    cs.RO eess.SY

    "Golden Ratio Yoshimura" for Meta-Stable and Massively Reconfigurable Deployment

    Authors: Vishrut Deshpande, Yogesh Phalak, Ziyang Zhou, Ian Walker, Suyi Li

    Abstract: Yoshimura origami is a classical folding pattern that has inspired many deployable structure designs. Its applications span from space exploration, kinetic architectures, and soft robots to even everyday household items. However, despite its wide usage, Yoshimura has been fixated on a set of design constraints to ensure its flat-foldability. Through extensive kinematic analysis and prototype tests… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.18356  [pdf, other

    eess.IV cs.CV

    Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography

    Authors: Jie Liu, Yixiao Zhang, Kang Wang, Mehmet Can Yavuz, Xiaoxi Chen, Yixuan Yuan, Haoliang Li, Yang Yang, Alan Yuille, Yucheng Tang, Zongwei Zhou

    Abstract: The advancement of artificial intelligence (AI) for organ segmentation and tumor detection is propelled by the growing availability of computed tomography (CT) datasets with detailed, per-voxel annotations. However, these AI models often struggle with flexibility for partially annotated datasets and extensibility for new classes due to limitations in the one-hot encoding, architectural design, and… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to Medical Image Analysis

  8. arXiv:2405.11263  [pdf, other

    eess.SP

    MAMCA -- Optimal on Accuracy and Efficiency for Automatic Modulation Classification with Extended Signal Length

    Authors: Yezhuo Zhang, Zinan Zhou, Yichao Cao, Guangyu Li, Xuanpeng Li

    Abstract: With the rapid growth of the Internet of Things ecosystem, Automatic Modulation Classification (AMC) has become increasingly paramount. However, extended signal lengths offer a bounty of information, yet impede the model's adaptability, introduce more noise interference, extend the training and inference time, and increase storage overhead. To bridge the gap between these requisites, we propose a… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 5 pages, 5 figures

  9. arXiv:2405.10705  [pdf, other

    eess.IV cs.CV

    3D Vessel Reconstruction from Sparse-View Dynamic DSA Images via Vessel Probability Guided Attenuation Learning

    Authors: Zhentao Liu, Huangxuan Zhao, Wenhui Qin, Zhenghong Zhou, Xinggang Wang, Wen** Wang, Xiaochun Lai, Chuansheng Zheng, Dinggang Shen, Zhiming Cui

    Abstract: Digital Subtraction Angiography (DSA) is one of the gold standards in vascular disease diagnosing. With the help of contrast agent, time-resolved 2D DSA images deliver comprehensive insights into blood flow information and can be utilized to reconstruct 3D vessel structures. Current commercial DSA systems typically demand hundreds of scanning views to perform reconstruction, resulting in substanti… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 12 pages, 13 figures, 5 tables

  10. arXiv:2404.06393  [pdf, other

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan , et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  11. arXiv:2404.05832  [pdf, other

    cs.HC eess.SY

    Human-Machine Interaction in Automated Vehicles: Reducing Voluntary Driver Intervention

    Authors: Xinzhi Zhong, Yang Zhou, Varshini Kamaraj, Zhenhao Zhou, Wissam Kontar, Dan Negrut, John D. Lee, Soyoung Ahn

    Abstract: This paper develops a novel car-following control method to reduce voluntary driver interventions and improve traffic stability in Automated Vehicles (AVs). Through a combination of experimental and empirical analysis, we show how voluntary driver interventions can instigate substantial traffic disturbances that are amplified along the traffic upstream. Motivated by these findings, we present a fr… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  12. arXiv:2404.00144  [pdf, other

    eess.IV cs.CV

    An Interpretable Cross-Attentive Multi-modal MRI Fusion Framework for Schizophrenia Diagnosis

    Authors: Ziyu Zhou, Anton Orlichenko, Gang Qu, Zening Fu, Vince D Calhoun, Zhengming Ding, Yu-** Wang

    Abstract: Both functional and structural magnetic resonance imaging (fMRI and sMRI) are widely used for the diagnosis of mental disorder. However, combining complementary information from these two modalities is challenging due to their heterogeneity. Many existing methods fall short of capturing the interaction between these modalities, frequently defaulting to a simple combination of latent features. In t… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  13. arXiv:2403.20025  [pdf, ps, other

    cs.IT eess.SP

    Secure Full-Duplex Communication via Movable Antennas

    Authors: **gze Ding, Zijian Zhou, Chenbo Wang, Wenyao Li, Lifeng Lin, Bingli Jiao

    Abstract: This paper investigates physical layer security (PLS) for a movable antenna (MA)-assisted full-duplex (FD) system. In this system, an FD base station (BS) with multiple MAs for transmission and reception provides services for an uplink (UL) user and a downlink (DL) user. Each user operates in half-duplex (HD) mode and is equipped with a single fixed-position antenna (FPA), in the presence of a sin… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: This paper has been submitted for possible publication

  14. arXiv:2403.19951  [pdf, ps, other

    eess.SP

    Fractional Delay Alignment Modulation for Spatially Sparse Wireless Communications

    Authors: Zhiwen Zhou, Zhiqiang Xiao, Yong Zeng

    Abstract: Delay alignment modulation (DAM) is a novel transmission technique for wireless systems with high spatial resolution by leveraging delay compensation and path-based beamforming, to mitigate the inter-symbol interference (ISI) without resorting to complex channel equalization or multi-carrier transmission. However, most existing studies on DAM consider a simplified scenario by assuming that the cha… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE WCNC 2024

  15. arXiv:2403.11626  [pdf, other

    cs.GR cs.AI cs.CV cs.MM cs.SD eess.AS

    QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation

    Authors: Zhizhen Zhou, Ye**g Huo, Guoheng Huang, An Zeng, Xuhang Chen, Lian Huang, Zinuo Li

    Abstract: The study of music-generated dance is a novel and challenging Image generation task. It aims to input a piece of music and seed motions, then generate natural dance movements for the subsequent music. Transformer-based methods face challenges in time series prediction tasks related to human movements and music due to their struggle in capturing the nonlinear relationship and temporal aspects. This… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by The Visual Computer Journal

  16. arXiv:2403.11531  [pdf, other

    eess.SP

    Specific Emitter Identification Handling Modulation Variation with Margin Disparity Discrepancy

    Authors: Yezhuo Zhang, Zinan Zhou, Xuanpeng Li

    Abstract: In the domain of Specific Emitter Identification (SEI), it is recognized that transmitters can be distinguished through the impairments of their radio frequency front-end, commonly referred to as Radio Frequency Fingerprint (RFF) features. However, modulation schemes can be deliberately coupled into signal-level data to confound RFF information, often resulting in high susceptibility to failure in… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures, submited to IEEE Global Communications Conference (GLOBECOM) 2024

  17. arXiv:2403.08689  [pdf, other

    eess.IV cs.CV

    Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images

    Authors: Tiange Xiang, Yixiao Zhang, Yongyi Lu, Alan Yuille, Chaoyi Zhang, Weidong Cai, Zongwei Zhou

    Abstract: Radiography imaging protocols focus on particular body regions, therefore producing images of great similarity and yielding recurrent anatomical structures across patients. Exploiting this structured information could potentially ease the detection of anomalies from radiography images. To this end, we propose a Simple Space-Aware Memory Matrix for In-painting and Detecting anomalies from radiograp… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). arXiv admin note: substantial text overlap with arXiv:2111.13495

  18. arXiv:2403.06459  [pdf, other

    eess.IV cs.CV

    From Pixel to Cancer: Cellular Automata in Computed Tomography

    Authors: Yuxiang Lai, Xiaoxi Chen, Angtian Wang, Alan Yuille, Zongwei Zhou

    Abstract: AI for cancer detection encounters the bottleneck of data scarcity, annotation difficulty, and low prevalence of early tumors. Tumor synthesis seeks to create artificial tumors in medical images, which can greatly diversify the data and annotations for AI training. However, current tumor synthesis approaches are not applicable across different organs due to their need for specific expertise and de… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  19. arXiv:2403.04116  [pdf, other

    eess.IV cs.CV

    Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis

    Authors: Yuanhao Cai, Yixun Liang, Jiahao Wang, Angtian Wang, Yulun Zhang, Xiaokang Yang, Zongwei Zhou, Alan Yuille

    Abstract: X-ray is widely applied for transmission imaging due to its stronger penetration than natural light. When rendering novel view X-ray projections, existing methods mainly based on NeRF suffer from long training time and slow inference speed. In this paper, we propose a 3D Gaussian splatting-based framework, namely X-Gaussian, for X-ray novel view synthesis. Firstly, we redesign a radiative Gaussian… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: The first 3D Gaussian Splatting-based method for X-ray 3D reconstruction

  20. arXiv:2402.19470  [pdf, other

    eess.IV cs.CV

    Towards Generalizable Tumor Synthesis

    Authors: Qi Chen, Xiaoxi Chen, Haorui Song, Zhiwei Xiong, Alan Yuille, Chen Wei, Zongwei Zhou

    Abstract: Tumor synthesis enables the creation of artificial tumors in medical images, facilitating the training of AI models for tumor detection and segmentation. However, success in tumor synthesis hinges on creating visually realistic tumors that are generalizable across multiple organs and, furthermore, the resulting AI models being capable of detecting real tumors in images sourced from different domai… ▽ More

    Submitted 28 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2024)

  21. arXiv:2402.16153  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    ChatMusician: Understanding and Generating Music Intrinsically with LLM

    Authors: Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, **gcheng Wu, Chenghua Lin, Qifeng Liu , et al. (10 additional authors not shown)

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities in text generation, we find that their ability has yet to be generalized to music, humanity's creative language. We introduce ChatMusician, an open-source LLM that integrates intrinsic musical abilities. It is based on continual pre-training and finetuning LLaMA2 on a text-compatible music representation, ABC notation, and the… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: GitHub: https://shanghaicannon.github.io/ChatMusician/

  22. arXiv:2402.02699  [pdf, other

    cs.SD cs.LG eess.AS

    Adversarial Data Augmentation for Robust Speaker Verification

    Authors: Zhenyu Zhou, Junhui Chen, Namin Wang, Lantian Li, Dong Wang

    Abstract: Data augmentation (DA) has gained widespread popularity in deep speaker models due to its ease of implementation and significant effectiveness. It enriches training data by simulating real-life acoustic variations, enabling deep neural networks to learn speaker-related representations while disregarding irrelevant acoustic variations, thereby improving robustness and generalization. However, a pot… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  23. arXiv:2401.17049  [pdf, ps, other

    cs.IT eess.SP

    Movable Antenna-Enabled Co-Frequency Co-Time Full-Duplex Wireless Communication

    Authors: **gze Ding, Zijian Zhou, Wenyao Li, Chenbo Wang, Lifeng Lin, Bingli Jiao

    Abstract: Movable antenna (MA) provides an innovative way to arrange antennas that can contribute to improved signal quality and more effective interference management. This method is especially beneficial for co-frequency co-time full-duplex (CCFD) wireless communication, which struggles with self-interference (SI) that usually overpowers the desired incoming signals. By dynamically repositioning transmit/… ▽ More

    Submitted 7 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: This paper has been submitted to IEEE Wireless Communications Letters

  24. arXiv:2312.13523  [pdf

    physics.med-ph eess.IV

    High-resolution myelin-water fraction and quantitative relaxation map** using 3D ViSTa-MR fingerprinting

    Authors: Congyu Liao, Xiaozhi Cao, Siddharth Srinivasan Iyer, Sophie Schauman, Zihan Zhou, Xiaoqian Yan, Quan Chen, Zhitao Li, Nan Wang, Ting Gong, Zhe Wu, Hongjian He, Jianhui Zhong, Yang Yang, Adam Kerr, Kalanit Grill-Spector, Kawin Setsompop

    Abstract: Purpose: This study aims to develop a high-resolution whole-brain multi-parametric quantitative MRI approach for simultaneous map** of myelin-water fraction (MWF), T1, T2, and proton-density (PD), all within a clinically feasible scan time. Methods: We developed 3D ViSTa-MRF, which combined Visualization of Short Transverse relaxation time component (ViSTa) technique with MR Fingerprinting (MR… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 38 pages, 12 figures and 1 table

    Journal ref: Magnetic Resonance in Medicine 2023

  25. arXiv:2312.09488  [pdf

    eess.IV cs.LG physics.med-ph

    Sequence adaptive field-imperfection estimation (SAFE): retrospective estimation and correction of $B_1^+$ and $B_0$ inhomogeneities for enhanced MRF quantification

    Authors: Mengze Gao, Xiaozhi Cao, Daniel Abraham, Zihan Zhou, Kawin Setsompop

    Abstract: $B_1^+$ and $B_0$ field-inhomogeneities can significantly reduce accuracy and robustness of MRF's quantitative parameter estimates. Additional $B_1^+$ and $B_0$ calibration scans can mitigate this but add scan time and cannot be applied retrospectively to previously collected data. Here, we proposed a calibration-free sequence-adaptive deep-learning framework, to estimate and correct for $B_1^+… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 12 pages, 5 figures, submitted to International Society for Magnetic Resonance in Medicine 31th Scientific Meeting, 2024

  26. arXiv:2312.08097  [pdf, ps, other

    eess.SP

    Hierarchical Cognitive Spectrum Sharing in Space-Air-Ground Integrated Networks

    Authors: Zizhen Zhou, Qianqian Zhang, Jungang Ge, Ying-Chang Liang

    Abstract: In space-air-ground integrated networks (SAGINs), cognitive spectrum sharing has been regarded as a promising solution to improve spectrum efficiency by enabling a secondary network to access the spectrum of a primary network. However, different networks in SAGIN may have different quality of service (QoS) requirements, which can not be well satisfied with the traditional cognitive spectrum sharin… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  27. arXiv:2312.01785  [pdf

    eess.SY

    Closed-Form Solutions for Grid-Forming Converters: A Design-Oriented Study

    Authors: Fangzhou Zhao, Tianhua Zhu, Lennart Harnefors, Bo Fan, Heng Wu, Zichao Zhou, Yin Sun, Xiongfei Wang

    Abstract: This paper derives closed-form solutions for grid-forming converters with power synchronization control (PSC) by subtly simplifying and factorizing the complex closed-loop models. The solutions can offer clear analytical insights into control-loop interactions, enabling guidelines for robust controller design. It is proved that 1) the proportional gains of PSC and alternating voltage control (AVC)… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  28. arXiv:2311.10959  [pdf, other

    eess.IV cs.CV

    Structure-Aware Sparse-View X-ray 3D Reconstruction

    Authors: Yuanhao Cai, Jiahao Wang, Alan Yuille, Zongwei Zhou, Angtian Wang

    Abstract: X-ray, known for its ability to reveal internal structures of objects, is expected to provide richer information for 3D reconstruction than visible light. Yet, existing neural radiance fields (NeRF) algorithms overlook this important nature of X-ray, leading to their limitations in capturing structural contents of imaged objects. In this paper, we propose a framework, Structure-Aware X-ray Neural… ▽ More

    Submitted 23 March, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: CVPR 2024; The first Transformer-based method for X-ray and CT 3D reconstruction

  29. arXiv:2311.10416  [pdf, other

    eess.SP

    Meta-DSP: A Meta-Learning Approach for Data-Driven Nonlinear Compensation in High-Speed Optical Fiber Systems

    Authors: Xinyu Xiao, Zhennan Zhou, Bin Dong, Dingjiong Ma, Li Zhou, Jie Sun

    Abstract: Non-linear effects in long-haul, high-speed optical fiber systems significantly hinder channel capacity. While the Digital Backward Propagation algorithm (DBP) with adaptive filter (ADF) can mitigate these effects, it suffers from an overwhelming computational complexity. Recent solutions have incorporated deep neural networks in a data-driven strategy to alleviate this complexity in the DBP model… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  30. arXiv:2311.09028  [pdf, other

    cs.IT eess.SP

    Integrating Sensing, Communication, and Power Transfer: Multiuser Beamforming Design

    Authors: Ziqin Zhou, Xiaoyang Li, Guangxu Zhu, Jie Xu, Kaibin Huang, Shuguang Cui

    Abstract: In the sixth-generation (6G) networks, massive low-power devices are expected to sense environment and deliver tremendous data. To enhance the radio resource efficiency, the integrated sensing and communication (ISAC) technique exploits the sensing and communication functionalities of signals, while the simultaneous wireless information and power transfer (SWIPT) techniques utilizes the same signa… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: This paper has been submitted to IEEE for possible publication

  31. arXiv:2311.09018  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    On the Foundation of Distributionally Robust Reinforcement Learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: Motivated by the need for a robust policy in the face of environment shifts between training and the deployment, we contribute to the theoretical foundation of distributionally robust reinforcement learning (DRRL). This is accomplished through a comprehensive modeling framework centered around distributionally robust Markov decision processes (DRMDPs). This framework obliges the decision maker to… ▽ More

    Submitted 19 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  32. arXiv:2311.07235  [pdf, other

    eess.IV cs.CV cs.HC

    DeepMetricEye: Metric Depth Estimation in Periocular VR Imagery

    Authors: Yitong Sun, Zijian Zhou, Cyriel Diels, Ali Asadipour

    Abstract: Despite the enhanced realism and immersion provided by VR headsets, users frequently encounter adverse effects such as digital eye strain (DES), dry eye, and potential long-term visual impairment due to excessive eye stimulation from VR displays and pressure from the mask. Recent VR headsets are increasingly equipped with eye-oriented monocular cameras to segment ocular feature maps. Yet, to compu… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  33. arXiv:2311.04223  [pdf

    eess.SP eess.SY physics.optics

    Dual band wireless transmission over 75-150GHz millimeter wave carriers using frequency-locked laser pairs

    Authors: Zichuan Zhou, Amany Kassem, James Seddon, Eric Sillekens, Izzat Darwazeh, Polina Bayvel, Zhixin Liu

    Abstract: We generate and transmit 75-GHz-bandwidth OFDM signals over the air using three mutually frequency-locked lasers, achieving minimal frequency gap between the wireless W and D bands using optical-assisted approaches, resulting in 173.5 Gb/s detected capacity.

    Submitted 27 October, 2023; originally announced November 2023.

    Comments: 3 pages, 2 figures, conference submission

  34. Self-Sustained And Coordinated Rhythmic Deformations With SMA For Controller-Free Locomotion

    Authors: Ziyang Zhou, Suyi Li

    Abstract: This study presents a modular, electronics-free, and fully onboard control and actuation approach for SMA-based soft robots to achieve locomotion tasks. This approach exploits the nonlinear mechanics of compliant curved beams and carefully designed mechanical control circuits to create and synchronize rhythmic deformation cycles, mimicking the central pattern generators (CPG) prevalent in animal l… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

  35. arXiv:2309.14158  [pdf, other

    cs.SD eess.AS

    An Investigation of Distribution Alignment in Multi-Genre Speaker Recognition

    Authors: Zhenyu Zhou, Junhui Chen, Namin Wang, Lantian Li, Dong Wang

    Abstract: Multi-genre speaker recognition is becoming increasingly popular due to its ability to better represent the complexities of real-world applications. However, a major challenge is the significant shift in the distribution of speaker vectors across different genres. While distribution alignment is a common approach to address this challenge, previous studies have mainly focused on aligning a source… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: submitted to ICASSP 2024

  36. arXiv:2309.11161  [pdf, other

    cs.IT eess.SP

    Beamforming Design for RIS-Aided THz Wideband Communication Systems

    Authors: Yihang Jiang, Ziqin Zhou, Xiaoyang Li, Yi Gong

    Abstract: Benefiting from tens of GHz of bandwidth, terahertz (THz) communications has become a promising technology for future 6G networks. However, the conventional hybrid beamforming architecture based on frequency-independent phase-shifters is not able to cope with the beam split effect (BSE) in THz massive multiple-input multiple-output (MIMO) systems. Despite some work introducing the frequency-depend… ▽ More

    Submitted 21 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  37. Multi-user passive beamforming in RIS-aided communications and experimental validations

    Authors: Zhibo Zhou, Haifan Yin, Li Tan, Ruikun Zhang, Kai Wang, Yingzhuang Liu

    Abstract: Reconfigurable intelligent surface (RIS) is a promising technology for future wireless communications due to its capability of optimizing the propagation environments. Nevertheless, in literature, there are few prototypes serving multiple users. In this paper, we propose a whole flow of channel estimation and beamforming design for RIS, and set up an RIS-aided multi-user system for experimental va… ▽ More

    Submitted 11 May, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: 11 pages, 8 figures, 2 tables. This paper has been accepted by IEEE Transactions on Communications

  38. arXiv:2309.02318  [pdf, other

    cs.CV eess.IV

    TiAVox: Time-aware Attenuation Voxels for Sparse-view 4D DSA Reconstruction

    Authors: Zhenghong Zhou, Huangxuan Zhao, Jiemin Fang, Dongqiao Xiang, Lei Chen, Lingxia Wu, Feihong Wu, Wenyu Liu, Chuansheng Zheng, Xinggang Wang

    Abstract: Four-dimensional Digital Subtraction Angiography (4D DSA) plays a critical role in the diagnosis of many medical diseases, such as Arteriovenous Malformations (AVM) and Arteriovenous Fistulas (AVF). Despite its significant application value, the reconstruction of 4D DSA demands numerous views to effectively model the intricate vessels and radiocontrast flow, thereby implying a significant radiatio… ▽ More

    Submitted 19 December, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 10 pages, 8 figures

  39. arXiv:2309.01340  [pdf, other

    cs.SD cs.CV eess.AS

    MDSC: Towards Evaluating the Style Consistency Between Music and Dance

    Authors: Zixiang Zhou, Weiyuan Li, Baoyuan Wang

    Abstract: We propose MDSC(Music-Dance-Style Consistency), the first evaluation metric that assesses to what degree the dance moves and music match. Existing metrics can only evaluate the motion fidelity and diversity and the degree of rhythmic matching between music and dance. MDSC measures how stylistically correlated the generated dance motion sequences and the conditioning music sequences are. We found t… ▽ More

    Submitted 29 November, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: 19 pages, 19 figure

  40. arXiv:2308.13933  [pdf, other

    physics.optics eess.IV

    Illumination strategies for space-bandwidth-time product improvement in Fourier ptychography

    Authors: Haibo Xu, Cheng Li, Mingzhe Wei, Ziwen Zhou, Longqian Huang

    Abstract: Fourier ptychography (FP) is a promising technique for high-throughput imaging. Reconstruction algorithms and illumination paradigm are two key aspects of FP. In this review, we mainly focus on illumination strategies in FP. We derive the space-bandwidth-time product (SBP-T) for the characterization of FP performance. Based on the analysis of SBP-T, we categorize the illumination strategy in FP ef… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

  41. arXiv:2308.03008  [pdf, other

    eess.IV cs.CV cs.LG

    Early Detection and Localization of Pancreatic Cancer by Label-Free Tumor Synthesis

    Authors: Bowen Li, Yu-Cheng Chou, Shuwen Sun, Hualin Qiao, Alan Yuille, Zongwei Zhou

    Abstract: Early detection and localization of pancreatic cancer can increase the 5-year survival rate for patients from 8.5% to 20%. Artificial intelligence (AI) can potentially assist radiologists in detecting pancreatic tumors at an early stage. Training AI models require a vast number of annotated examples, but the availability of CT scans obtaining early-stage tumors is constrained. This is because earl… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Big Task Small Data, 1001-AI, MICCAI Workshop, 2023

  42. arXiv:2308.00886  [pdf

    cs.LG cs.AI cs.SE eess.SP

    Enhancing Machine Learning Performance with Continuous In-Session Ground Truth Scores: Pilot Study on Objective Skeletal Muscle Pain Intensity Prediction

    Authors: Boluwatife E. Faremi, Jonathon Stavres, Nuno Oliveira, Zhaoxian Zhou, Andrew H. Sung

    Abstract: Machine learning (ML) models trained on subjective self-report scores struggle to objectively classify pain accurately due to the significant variance between real-time pain experiences and recorded scores afterwards. This study developed two devices for acquisition of real-time, continuous in-session pain scores and gathering of ANS-modulated endodermal activity (EDA).The experiment recruited N =… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 18 pages, 2-page Appendix, 7 figures

    ACM Class: B.7; D.2.5; D.2.9; H.2.8; H.2.1; I.2; J.2; J.6; K.6.3

  43. arXiv:2306.15530  [pdf, other

    eess.SY

    Fast and Automatic 3D Modeling of Antenna Structure Using CNN-LSTM Network for Efficient Data Generation

    Authors: Zhaohui Wei, Zhao Zhou, Peng Wang, Jian Ren, Yingzeng Yin, Gert Frølund Pedersen, Ming Shen

    Abstract: Deep learning-assisted antenna design methods such as surrogate models have gained significant popularity in recent years due to their potential to greatly increase design efficiencies by replacing the time-consuming full-wave electromagnetic (EM) simulations. However, a large number of training data with sufficiently diverse and representative samples (antenna structure parameters, scattering pro… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  44. arXiv:2306.05624  [pdf

    eess.AS cs.SD

    Domestic Activities Classification from Audio Recordings Using Multi-scale Dilated Depthwise Separable Convolutional Network

    Authors: Yufei Zeng, Yanxiong Li, Zhenfeng Zhou, Ruiqi Wang, Difeng Lu

    Abstract: Domestic activities classification (DAC) from audio recordings aims at classifying audio recordings into pre-defined categories of domestic activities, which is an effective way for estimation of daily activities performed in home environment. In this paper, we propose a method for DAC from audio recordings using a multi-scale dilated depthwise separable convolutional network (DSCN). The DSCN is a… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 5 pages, 2 figures, 4 tables. Accepted for publication in IEEE MMSP2021

  45. arXiv:2306.00988  [pdf, other

    eess.IV cs.CV cs.LG

    Continual Learning for Abdominal Multi-Organ and Tumor Segmentation

    Authors: Yixiao Zhang, Xinyi Li, Huimiao Chen, Alan Yuille, Yaoyao Liu, Zongwei Zhou

    Abstract: The ability to dynamically extend a model to new data and classes is critical for multiple organ and tumor segmentation. However, due to privacy regulations, accessing previous data and annotations can be problematic in the medical domain. This poses a significant barrier to preserving the high segmentation accuracy of the old classes when learning from new classes because of the catastrophic forg… ▽ More

    Submitted 21 July, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: MICCAI-2023

  46. arXiv:2305.16616  [pdf, other

    eess.SP

    Channel Measurement, Modeling, and Simulation for 6G: A Survey and Tutorial

    Authors: Jianhua Zhang, Jiaxin Lin, Pan Tang, Yuxiang Zhang, Huixin Xu, Tianyang Gao, Haiyang Miao, Zeyong Chai, Zhengfu Zhou, Yi Li, Huiwen Gong, Yameng Liu, Zhiqiang Yuan, Lei Tian, Shaoshi Yang, Liang Xia, Guangyi Liu, ** Zhang

    Abstract: The sixth generation (6G) mobile communications have attracted substantial attention in the global research community of information and communication technologies (ICT). 6G systems are expected to support not only extended 5G usage scenarios, but also new usage scenarios, such as integrated sensing and communication (ISAC), integrated artificial intelligence (AI) and communication, and communicat… ▽ More

    Submitted 28 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 41 pages,52 figures

  47. arXiv:2305.14684  [pdf, other

    cs.CV eess.IV

    Collaborative Auto-encoding for Blind Image Quality Assessment

    Authors: Zehong Zhou, Fei Zhou, Guo** Qiu

    Abstract: Blind image quality assessment (BIQA) is a challenging problem with important real-world applications. Recent efforts attempting to exploit powerful representations by deep neural networks (DNN) are hindered by the lack of subjectively annotated data. This paper presents a novel BIQA method which overcomes this fundamental obstacle. Specifically, we design a pair of collaborative autoencoders (COA… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  48. arXiv:2305.09666  [pdf, other

    eess.IV cs.CV cs.LG

    AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three Weeks

    Authors: Chongyu Qu, Tiezheng Zhang, Hualin Qiao, Jie Liu, Yucheng Tang, Alan Yuille, Zongwei Zhou

    Abstract: Annotating medical images, particularly for organ segmentation, is laborious and time-consuming. For example, annotating an abdominal organ requires an estimated rate of 30-60 minutes per CT volume based on the expertise of an annotator and the size, visibility, and complexity of the organ. Therefore, publicly available datasets for multi-organ segmentation are often limited in data size and organ… ▽ More

    Submitted 15 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Conference on Neural Information Processing Systems (NeurIPS 2023)

  49. arXiv:2305.05142  [pdf, other

    eess.SP

    Integrated Super-Resolution Sensing and Communication with 5G NR Waveform: Signal Processing with Uneven CPs and Experiments

    Authors: Chaoyue Zhang, Zhiwen Zhou, Huizhi Wang, Yong Zeng

    Abstract: Integrated sensing and communication (ISAC) is a promising technology to simultaneously provide high-performance wireless communication and radar sensing services in future networks. In this paper, we propose the concept of \emph{integrated super-resolution sensing and communication} (ISSAC), which uses super-resolution algorithms in ISAC systems to achieve extreme sensing performance for those cr… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 8 pages, 14 figures, 2023 21th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt)

  50. arXiv:2305.02772  [pdf, other

    cs.RO eess.SY

    Efficient and Robust Time-Optimal Trajectory Planning and Control for Agile Quadrotor Flight

    Authors: Ziyu Zhou, Gang Wang, Jian Sun, Jikai Wang, Jie Chen

    Abstract: Agile quadrotor flight relies on rapidly planning and accurately tracking time-optimal trajectories, a technology critical to their application in the wild. However, the computational burden of computing time-optimal trajectories based on the full quadrotor dynamics (typically on the order of minutes or even hours) can hinder its ability to respond quickly to changing scenarios. Additionally, mode… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: submitted to IEEE Robotics and Automation Letters, for the associated video, see https://youtu.be/E6QVHWcvB6E