Showing 1–19 of 19 results for author: Xie, C

Searching in archive eess.
  1. arXiv:2404.14693  [pdf, other]

    cs.CR cs.CV eess.IV

    Double Privacy Guard: Robust Traceable Adversarial Watermarking against Face Recognition

    Authors: Yunming Zhang, Dengpan Ye, Sipeng Shen, Caiyun Xie, Ziyi Liu, Jiacheng Deng, Long Tang

    Abstract: The wide deployment of Face Recognition (FR) systems poses risks of privacy leakage. One countermeasure to address this issue is adversarial attacks, which deceive malicious FR searches but simultaneously interfere with the normal identity verification of trusted authorizers. In this paper, we propose the first Double Privacy Guard (DPG) scheme based on traceable adversarial watermarking. DPG employs a…

    Submitted 22 April, 2024; originally announced April 2024.

  2. arXiv:2403.15735  [pdf, other]

    eess.IV cs.CV

    3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge

    Authors: Siwei Yang, Xianhang Li, Jieru Mei, Jieneng Chen, Cihang Xie, Yuyin Zhou

    Abstract: Segmenting brain tumors is complex due to their diverse appearances and scales. Brain metastases, the most common type of brain tumor, are a frequent complication of cancer. Therefore, an effective segmentation model for brain metastases must adeptly capture local intricacies to delineate small tumor regions while also integrating global context to understand broader scan features. The TransUNet m…

    Submitted 23 March, 2024; originally announced March 2024.

  3. arXiv:2306.03494  [pdf, other]

    eess.IV cs.CV

    LegoNet: Alternating Model Blocks for Medical Image Segmentation

    Authors: Ikboljon Sobirov, Cheng Xie, Muhammad Siddique, Parijat Patel, Kenneth Chan, Thomas Halborg, Christos Kotanidis, Zarqiash Fatima, Henry West, Keith Channon, Stefan Neubauer, Charalambos Antoniades, Mohammad Yaqub

    Abstract: Since the emergence of convolutional neural networks (CNNs), and later vision transformers (ViTs), the common paradigm for model development has always been using a set of identical block types with varying parameters/hyper-parameters. To leverage the benefits of different architectural designs (e.g. CNNs and ViTs), we propose to alternate structurally different types of blocks to generate a new a…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 12 pages, 5 figures, 4 tables

  4. arXiv:2304.00440  [pdf, other]

    cs.IT eess.SP

    Near-Field Channel Estimation for Extremely Large-Scale Reconfigurable Intelligent Surface (XL-RIS)-Aided Wideband mmWave Systems

    Authors: Songjie Yang, Chenfei Xie, Wanting Lyu, Boyu Ning, Zhongpei Zhang, Chau Yuen

    Abstract: Near-field communications present new opportunities over near-field channels; however, the spherical wavefront propagation makes near-field signal processing challenging. In this context, this paper proposes efficient near-field channel estimation methods for wideband MIMO mmWave systems with the aid of extremely large-scale reconfigurable intelligent surfaces (XL-RIS). For the wideband signals re…

    Submitted 1 April, 2023; originally announced April 2023.

  5. arXiv:2303.09170  [pdf, other]

    cs.CV eess.IV

    NLUT: Neural-based 3D Lookup Tables for Video Photorealistic Style Transfer

    Authors: Yaosen Chen, Han Yang, Yuexin Yang, Yuegen Liu, Wei Wang, Xuming Wen, …

    Abstract: Video photorealistic style transfer is desired to generate videos with a similar photorealistic style to the style image while maintaining temporal consistency. However, existing methods obtain stylized video sequences by performing frame-by-frame photorealistic style transfer, which is inefficient and does not ensure the temporal consistency of the stylized video. To address this issue, we use ne…

    Submitted 17 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  6. arXiv:2210.08181  [pdf, other]

    cs.CV eess.IV

    Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network

    Authors: Keyu Yan, Man Zhou, Jie Huang, Feng Zhao, Chengjun Xie, Chongyi Li, Danfeng Hong

    Abstract: Panchromatic (PAN) and multi-spectral (MS) image fusion, named Pan-sharpening, refers to super-resolving the low-resolution (LR) multi-spectral (MS) images in the spatial domain to generate the expected high-resolution (HR) MS images, conditioning on the corresponding high-resolution PAN images. In this paper, we present a simple yet effective alternating reverse filtering network for pan-s…

    Submitted 14 October, 2022; originally announced October 2022.

    Journal ref: NeurIPS2022

  7. arXiv:2207.14182  [pdf, other]

    eess.SP cs.IT

    Channel Estimation for Reconfigurable Intelligent Surface-Assisted Cell-Free Communications

    Authors: Songjie Yang, Chenfei Xie, Mingwei Wang, Zhongpei Zhang

    Abstract: Recent research has focused on reconfigurable intelligent surface (RIS)-assisted cell-free systems with the goal of enhancing coverage and lowering the cost of cell-free networks. However, current research makes the assumption that the perfect channel state information is known. Channel acquisition is, certainly, a difficulty in this case. This work is aimed at investigating RIS-assisted cell-free…

    Submitted 28 July, 2022; originally announced July 2022.

  8. arXiv:2207.14107  [pdf, other]

    eess.SP

    Fast Compressive Channel Estimation for MmWave MIMO Hybrid Beamforming Systems

    Authors: Songjie Yang, Chenfei Xie, Dongli Wang, Zhongpei Zhang

    Abstract: Given the high degree of computational complexity of the channel estimation technique based on the conventional one-dimensional (1-D) compressive sensing (CS) framework employed in the hybrid beamforming architecture, this study proposes two low-complexity channel estimation strategies. One is two-stage CS, which exploits row-group sparsity to estimate angle-of-arrival (AoA) first and uses the con…

    Submitted 28 July, 2022; originally announced July 2022.

  9. arXiv:2206.15155  [pdf, other]

    cs.SD eess.AS

    An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant Conditions

    Authors: Yeonjong Choi, Chao Xie, Tomoki Toda

    Abstract: This paper presents a new voice conversion (VC) framework capable of dealing with both additive noise and reverberation, and its performance evaluation. Some VC studies have focused on real-world circumstances where speech data are corrupted by background noise and reverberation. To deal with more practical conditions where no clean target dataset is available, one possib…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted to INTERSPEECH 2022

  10. arXiv:2206.07142  [pdf]

    eess.SP

    Experimental Comparison of PAM-8 Probabilistic Shaping with Different Gaussian Orders at 200 Gb/s Net Rate in IM/DD System with O-Band TOSA

    Authors: Md Sabbir-Bin Hossain, Georg Böcherer, Youxi Lin, Shuangxu Li, Stefano Calabrò, Andrei Nedelcu, Talha Rahman, Tom Wettlin, Jinlong Wei, Nebojša Stojanović, Changsong Xie, Maxim Kuschnerov, Stephan Pachnicke

    Abstract: For 200Gb/s net rates, cap probabilistic shaped PAM-8 with different Gaussian orders are experimentally compared against uniform PAM-8. In back-to-back and 5km measurements, cap-shaped 85-GBd PAM-8 with Gaussian order of 5 outperforms 71-GBd uniform PAM-8 by up to 2.90dB and 3.80dB in receiver sensitivity, respectively.

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: submitted to 2022 European Conference on Optical Communication (ECOC)

  11. Ultra-compact Binary Neural Networks for Human Activity Recognition on RISC-V Processors

    Authors: Francesco Daghero, Chen Xie, Daniele Jahier Pagliari, Alessio Burrello, Marco Castellano, Luca Gandolfi, Andrea Calimera, Enrico Macii, Massimo Poncino

    Abstract: Human Activity Recognition (HAR) is a relevant inference task in many mobile applications. State-of-the-art HAR at the edge is typically achieved with lightweight machine learning models such as decision trees and Random Forests (RFs), whereas deep learning is less common due to its high computational complexity. In this work, we propose a novel implementation of HAR based on deep neural networks,…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Published in: 2021 18th ACM International Conference on Computing Frontiers (CF)

    Journal ref: 18th ACM International Conference on Computing Frontiers (CF), 2021, pp. 3-11

  12. Experimental Comparison of Cap and Cup Probabilistically Shaped PAM for O-Band IM/DD Transmission System

    Authors: Md Sabbir-Bin Hossain, Georg Boecherer, Talha Rahman, Nebojsa Stojanovic, Patrick Schulte, Stefano Calabrò, Jinlong Wei, Christian Bluemm, Tom Wettlin, Changsong Xie, Maxim Kuschnerov, Stephan Pachnicke

    Abstract: For 200Gbit/s net rates, uniform PAM-4, 6 and 8 are experimentally compared against probabilistic shaped PAM-8 cap and cup variants. In back-to-back and 20km measurements, cap shaped 80GBd PAM-8 outperforms 72GBd PAM-8 and 83GBd PAM-6 by up to 3.50dB and 0.8dB in receiver sensitivity, respectively.

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Originally published in ECOC-2021. We have updated Figure 3. The change also affects the overall outcome. In contrast to the published version, compared to uniform PAM-8 72 GBd, PS-PAM-8 80 GBd performance is updated to 3.50 dB instead of 5.17 dB, while for PAM-6 83 GBd the gain becomes 0.8 dB instead of 2.17 dB. The changes are adapted in all sections except the experimental setup and DSP section

    Journal ref: 2021 European Conference on Optical Communication (ECOC)

  13. arXiv:2204.10541  [pdf, other]

    cs.LG eess.SP

    Privacy-preserving Social Distance Monitoring on Microcontrollers with Low-Resolution Infrared Sensors and CNNs

    Authors: Chen Xie, Francesco Daghero, Yukai Chen, Marco Castellano, Luca Gandolfi, Andrea Calimera, Enrico Macii, Massimo Poncino, Daniele Jahier Pagliari

    Abstract: Low-resolution infrared (IR) array sensors offer a low-cost, low-power, and privacy-preserving alternative to optical cameras and smartphones/wearables for social distance monitoring in indoor spaces, permitting the recognition of basic shapes, without revealing the personal details of individuals. In this work, we demonstrate that an accurate detection of social distance violations can be achieve…

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted as a conference paper at the 2022 IEEE International Symposium on Circuits and Systems (ISCAS)

  14. arXiv:2204.08692  [pdf, other]

    eess.AS cs.CR cs.SD

    Time Domain Adversarial Voice Conversion for ADD 2022

    Authors: Cheng Wen, Tingwei Guo, Xingjun Tan, Rui Yan, Shuran Zhou, Chuandong Xie, Wei Zou, Xiangang Li

    Abstract: In this paper, we describe our speech generation system for the first Audio Deep Synthesis Detection Challenge (ADD 2022). Firstly, we build an any-to-many voice conversion (VC) system to convert source speech with arbitrary language content into the target speaker’s fake speech. Then the converted speech generated from VC is post-processed in the time domain to improve the deception ability.…

    Submitted 19 April, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted to ICASSP 2022

  15. arXiv:2204.08686  [pdf, ps, other]

    cs.SD eess.AS

    Audio-Visual Wake Word Spotting System For MISP Challenge 2021

    Authors: Yanguang Xu, Jianwei Sun, Yang Han, Shuaijiang Zhao, Chaoyang Mei, Tingwei Guo, Shuran Zhou, Chuandong Xie, Wei Zou, Xiangang Li

    Abstract: This paper presents the details of our system designed for the Task 1 of Multimodal Information Based Speech Processing (MISP) Challenge 2021. The purpose of Task 1 is to leverage both audio and video information to improve the environmental robustness of far-field wake word spotting. In the proposed system, firstly, we take advantage of speech enhancement algorithms such as beamforming and weight…

    Submitted 19 April, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted to ICASSP 2022

  16. arXiv:2111.07116  [pdf, other]

    cs.SD eess.AS

    Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion

    Authors: Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

    Abstract: Beyond the conventional voice conversion (VC) where the speaker information is converted without altering the linguistic content, the background sounds are informative and need to be retained in some real-world scenarios, such as VC in movie/video and VC in music where the voice is entangled with background sounds. As a new VC framework, we have developed a noisy-to-noisy (N2N) VC framework to con…

    Submitted 13 November, 2021; originally announced November 2021.

  17. arXiv:2109.10608  [pdf, ps, other]

    cs.SD eess.AS

    Noisy-to-Noisy Voice Conversion Framework with Denoising Model

    Authors: Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

    Abstract: In a conventional voice conversion (VC) framework, a VC model is often trained with a clean dataset consisting of speech data carefully recorded and selected by minimizing background interference. However, collecting such a high-quality dataset is expensive and time-consuming. Leveraging crowd-sourced speech data in training is more economical. Moreover, for some real-world VC scenarios such as VC…

    Submitted 22 September, 2021; originally announced September 2021.

  18. 1.71 Tb/s Single-Channel and 56.51 Tb/s DWDM Transmission over 96.5 km Field-Deployed SSMF

    Authors: Fabio Pittala, Ralf-Peter Braun, Georg Boecherer, Patrick Schulte, Maximilian Schaedler, Stefano Bettelli, Stefano Calabro, Maxim Kuschnerov, Andreas Gladisch, Fritz-Joachim Westphal, Changsong Xie, Rongfu Chen, Qibing Wang, Bofang Zheng

    Abstract: We report an industry leading optical dense wavelength division multiplexing (DWDM) field trial with line rates per channel exceeding 1.66 Tb/s using 130 GBaud dual-polarization probabilistic constellation shaping 256-ary quadrature amplitude modulation (DP-PCS256QAM) in a high capacity data center interconnect (DCI) scenario. This research trial was performed on 96.5 km of field-deployed standard…

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: This work has been submitted to the IEEE Photonics Technology Letters (PTL) for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  19. arXiv:1912.01054  [pdf, other]

    eess.IV cs.CV cs.LG

    The state of the art in kidney and kidney tumor segmentation in contrast-enhanced CT imaging: Results of the KiTS19 Challenge

    Authors: Nicholas Heller, Fabian Isensee, Klaus H. Maier-Hein, Xiaoshuai Hou, Chunmei Xie, Fengyi Li, Yang Nan, Guangrui Mu, Zhiyong Lin, Miofei Han, Guang Yao, Yaozong Gao, Yao Zhang, Yixin Wang, Feng Hou, Jiawei Yang, Guangwei Xiong, Jiang Tian, Cheng Zhong, Jun Ma, Jack Rickman, Joshua Dean, Bethany Stai, Resha Tejpaul, Makinna Oestreich, et al. (16 additional authors not shown)

    Abstract: There is a large body of literature linking anatomic and geometric characteristics of kidney tumors to perioperative and oncologic outcomes. Semantic segmentation of these tumors and their host kidneys is a promising tool for quantitatively characterizing these lesions, but its adoption is limited due to the manual effort required to produce high-quality 3D segmentations of these structures. Recen…

    Submitted 7 August, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: 24 pages, 11 figures