Skip to main content

Showing 1–50 of 146 results for author: Zhenyu

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.07421  [pdf, other

    cs.SD eess.AS

    A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition

    Authors: Zhenyu Zhou, Shibiao Xu, Shi Yin, Lantian Li, Dong Wang

    Abstract: Data augmentation (DA) has played a pivotal role in the success of deep speaker recognition. Current DA techniques primarily focus on speaker-preserving augmentation, which does not change the speaker trait of the speech and does not create new speakers. Recent research has shed light on the potential of speaker augmentation, which generates new speakers to enrich the training dataset. In this stu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: to be published in INTERSPEECH 2024

  2. arXiv:2406.06543  [pdf, other

    cs.AR cs.LG cs.NE eess.SP

    SparrowSNN: A Hardware/software Co-design for Energy Efficient ECG Classification

    Authors: Zhanglu Yan, Zhenyu Bai, Tulika Mitra, Weng-Fai Wong

    Abstract: Heart disease is one of the leading causes of death worldwide. Given its high risk and often asymptomatic nature, real-time continuous monitoring is essential. Unlike traditional artificial neural networks (ANNs), spiking neural networks (SNNs) are well-known for their energy efficiency, making them ideal for wearable devices and energy-constrained edge computing platforms. However, current energy… ▽ More

    Submitted 6 May, 2024; originally announced June 2024.

  3. arXiv:2406.00485  [pdf

    eess.IV cs.RO

    TacShade A New 3D-printed Soft Optical Tactile Sensor Based on Light, Shadow and Greyscale for Shape Reconstruction

    Authors: Zhenyu Lu, Jialong Yang, Haoran Li, Yifan Li, Weiyong Si, Nathan Lepora, Chenguang Yang

    Abstract: In this paper, we present the TacShade a newly designed 3D-printed soft optical tactile sensor. The sensor is developed for shape reconstruction under the inspiration of sketch drawing that uses the density of sketch lines to draw light and shadow, resulting in the creation of a 3D-view effect. TacShade, building upon the strengths of the TacTip, a single-camera tactile sensor of large in-depth de… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by ICRA 2024

  4. arXiv:2404.15643  [pdf, ps, other

    cs.IT eess.SP

    Dynamic Beam Coverage for Satellite Communications Aided by Movable-Antenna Array

    Authors: Lipeng Zhu, Xiangyu Pi, Wenyan Ma, Zhenyu Xiao, Rui Zhang

    Abstract: Due to the ultra-dense constellation, efficient beam coverage and interference mitigation are crucial to low-earth orbit (LEO) satellite communication systems, while the conventional directional antennas and fixed-position antenna (FPA) arrays both have limited degrees of freedom (DoFs) in beamforming to adapt to the time-varying coverage requirement of terrestrial users. To address this challenge… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  5. arXiv:2404.13786  [pdf, other

    eess.SY cs.AI cs.DC cs.LG

    Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving

    Authors: Shuyao Shi, Neiwen Ling, Zhehao Jiang, Xuan Huang, Yuze He, Xiaoguang Zhao, Bufang Yang, Chen Bian, **gfei Xia, Zhenyu Yan, Raymond Yeung, Guoliang Xing

    Abstract: Recently,smart roadside infrastructure (SRI) has demonstrated the potential of achieving fully autonomous driving systems. To explore the potential of infrastructure-assisted autonomous driving, this paper presents the design and deployment of Soar, the first end-to-end SRI system specifically designed to support autonomous driving systems. Soar consists of both software and hardware components ca… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  6. arXiv:2404.06674  [pdf, other

    cs.SD cs.AI eess.AS

    VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing

    Authors: Philip Anastassiou, Zhenyu Tang, Kainan Peng, Dongya Jia, Jiaxin Li, Ming Tu, Yu** Wang, Yuxuan Wang, Mingbo Ma

    Abstract: We present VoiceShop, a novel speech-to-speech framework that can modify multiple attributes of speech, such as age, gender, accent, and speech style, in a single forward pass while preserving the input speaker's timbre. Previous works have been constrained to specialized models that can only edit these attributes individually and suffer from the following pitfalls: the magnitude of the conversion… ▽ More

    Submitted 11 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  7. arXiv:2404.06265  [pdf, other

    cs.CV eess.IV

    Spatial-Temporal Multi-level Association for Video Object Segmentation

    Authors: Deshui Miao, Xin Li, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang

    Abstract: Existing semi-supervised video object segmentation methods either focus on temporal feature matching or spatial-temporal feature modeling. However, they do not address the issues of sufficient target interaction and efficient parallel processing simultaneously, thereby constraining the learning of dynamic, target-aware features. To tackle these limitations, this paper proposes a spatial-temporal m… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  8. arXiv:2404.01717  [pdf, other

    cs.CV eess.IV

    AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation

    Authors: Rui Xie, Ying Tai, Chen Zhao, Kai Zhang, Zhenyu Zhang, Jun Zhou, Xiaoqian Ye, Qian Wang, Jian Yang

    Abstract: Blind super-resolution methods based on stable diffusion showcase formidable generative capabilities in reconstructing clear high-resolution images with intricate details from low-resolution inputs. However, their practical applicability is often hampered by poor efficiency, stemming from the requirement of thousands or hundreds of sampling steps. Inspired by the efficient adversarial diffusion di… ▽ More

    Submitted 23 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  9. arXiv:2403.10723  [pdf, other

    eess.SY

    Leveraging Symmetries in Gaits for Reinforcement Learning: A Case Study on Quadrupedal Gaits

    Authors: Jiayu Ding, Xulin Chen, Garret E. Katz, Zhenyu Gan

    Abstract: In this research, we address the complex task of develo** versatile and agile quadrupedal gaits for robotic platforms, a domain predominantly governed by model-based trajectory optimization methods. We propose an innovative, reference-free reinforcement learning framework that exploits the intrinsic symmetries of dynamic systems to synthesize a broad array of naturalistic quadrupedal locomotion… ▽ More

    Submitted 14 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  10. arXiv:2402.15185  [pdf, other

    cs.IT eess.SP

    Pre-Chirp-Domain Index Modulation for Affine Frequency Division Multiplexing

    Authors: Guangyao Liu, Tianqi Mao, Ruiqi Liu, Zhenyu Xiao

    Abstract: Affine frequency division multiplexing (AFDM), tailored as a novel multicarrier technique utilizing chirp signals for high-mobility communications, exhibits marked advantages compared to traditional orthogonal frequency division multiplexing (OFDM). AFDM is based on the discrete affine Fourier transform (DAFT) with two modifiable parameters of the chirp signals, termed as the pre-chirp parameter a… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  11. arXiv:2402.02699  [pdf, other

    cs.SD cs.LG eess.AS

    Adversarial Data Augmentation for Robust Speaker Verification

    Authors: Zhenyu Zhou, Junhui Chen, Namin Wang, Lantian Li, Dong Wang

    Abstract: Data augmentation (DA) has gained widespread popularity in deep speaker models due to its ease of implementation and significant effectiveness. It enriches training data by simulating real-life acoustic variations, enabling deep neural networks to learn speaker-related representations while disregarding irrelevant acoustic variations, thereby improving robustness and generalization. However, a pot… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  12. arXiv:2402.00771  [pdf, other

    cs.IT eess.SP

    Mixed Static and Reconfigurable Metasurface Deployment in Indoor Dense Spaces: How Much Reconfigurability is Needed?

    Authors: Zhenyu Li, Ozan Alp Topal, Özlem Tuğfe Demir, Emil Björnson, Cicek Cavdar

    Abstract: In this paper, we investigate how metasurfaces can be deployed to deliver high data rates in a millimeter-wave (mmWave) indoor dense space with many blocking objects. These surfaces can either be static metasurfaces (SMSs) that reflect with fixed phase-shifts or reconfigurable intelligent surfaces (RISs) that can reconfigure their phase-shifts to the currently served user. The latter comes with an… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages, 5 figures, Accepted to be presented in IEEE WCNC 2024

  13. arXiv:2401.08974  [pdf, ps, other

    cs.IT eess.SP

    Performance Analysis and Optimization for Movable Antenna Aided Wideband Communications

    Authors: Lipeng Zhu, Wenyan Ma, Zhenyu Xiao, Rui Zhang

    Abstract: Movable antenna (MA) has emerged as a promising technology to enhance wireless communication performance by enabling the local movement of antennas at the transmitter (Tx) and/or receiver (Rx) for achieving more favorable channel conditions. As the existing studies on MA-aided wireless communications have mainly considered narrow-band transmission in flat fading channels, we investigate in this pa… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  14. arXiv:2401.00806  [pdf, other

    eess.SY

    Noise-Aware and Equitable Urban Air Traffic Management: An Optimization Approach

    Authors: Zhenyu Gao, Yue Yu, Qinshuang Wei, Ufuk Topcu, John-Paul Clarke

    Abstract: Urban air mobility (UAM), a transformative concept for the transport of passengers and cargo, faces several integration challenges in complex urban environments. Community acceptance of aircraft noise is among the most noticeable of these challenges when launching or scaling up a UAM system. Properly managing community noise is fundamental to establishing a UAM system that is environmentally and s… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 30 pages, 15 figures

  15. arXiv:2312.13603  [pdf, other

    eess.AS cs.SD

    Style Modeling for Multi-Speaker Articulation-to-Speech

    Authors: Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang

    Abstract: In this paper, we propose a neural articulation-to-speech (ATS) framework that synthesizes high-quality speech from articulatory signal in a multi-speaker situation. Most conventional ATS approaches only focus on modeling contextual information of speech from a single speaker's articulatory features. To explicitly represent each speaker's speaking style as well as the contextual information, our p… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages, Accepted to ICASSP 2023

  16. arXiv:2312.13600  [pdf, other

    eess.AS cs.SD

    BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0

    Authors: Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang

    Abstract: Decoding spoken speech from neural activity in the brain is a fast-emerging research topic, as it could enable communication for people who have difficulties with producing audible speech. For this task, electrocorticography (ECoG) is a common method for recording brain activity with high temporal resolution and high spatial precision. However, due to the risky surgical procedure required for obta… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages. Accepted to BHI 2023

  17. arXiv:2312.06969  [pdf, ps, other

    cs.IT eess.SP

    Channel Estimation for Movable Antenna Communication Systems: A Framework Based on Compressed Sensing

    Authors: Zhenyu Xiao, Songqi Cao, Lipeng Zhu, Yanming Liu, Xiang-Gen Xia, Rui Zhang

    Abstract: Movable antenna (MA) is a new technology with great potential to improve communication performance by enabling local movement of antennas for pursuing better channel conditions. In particular, the acquisition of complete channel state information (CSI) between the transmitter (Tx) and receiver (Rx) regions is an essential problem for MA systems to reap performance gains. In this paper, we propose… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  18. arXiv:2312.06454  [pdf, other

    eess.IV cs.CV cs.LG

    Point Transformer with Federated Learning for Predicting Breast Cancer HER2 Status from Hematoxylin and Eosin-Stained Whole Slide Images

    Authors: Bao Li, Zhenyu Liu, Lizhi Shao, Bensheng Qiu, Hong Bu, Jie Tian

    Abstract: Directly predicting human epidermal growth factor receptor 2 (HER2) status from widely available hematoxylin and eosin (HE)-stained whole slide images (WSIs) can reduce technical costs and expedite treatment selection. Accurately predicting HER2 requires large collections of multi-site WSIs. Federated learning enables collaborative training of these WSIs without gigabyte-size WSIs transportation a… ▽ More

    Submitted 27 February, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  19. arXiv:2311.07169  [pdf, other

    eess.SP

    CASTER: A Computer-Vision-Assisted Wireless Channel Simulator for Gesture Recognition

    Authors: Zhenyu Ren, Guoliang Li, Chenqing Ji, Chao Yu, Shuai Wang, Rui Wang

    Abstract: In this paper, a computer-vision-assisted simulation method is proposed to address the issue of training dataset acquisition for wireless hand gesture recognition. In the existing literature, in order to classify gestures via the wireless channel estimation, massive training samples should be measured in a consistent environment, consuming significant efforts. In the proposed CASTER simulator, how… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 10 pages, 11 figures

  20. arXiv:2310.09299  [pdf, other

    cs.LG eess.SP eess.SY

    Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network

    Authors: Zhenyu Tao, Wei Xu, Xiaohu You

    Abstract: The proliferation of diverse wireless services in 5G and beyond has led to the emergence of network slicing technologies. Among these, admission control plays a crucial role in achieving service-oriented optimization goals through the selective acceptance of service requests. Although deep reinforcement learning (DRL) forms the foundation in many admission control approaches thanks to its effectiv… ▽ More

    Submitted 21 November, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: 13 pages, 8 figures

  21. arXiv:2310.03402  [pdf, other

    cs.CV eess.IV

    A Complementary Global and Local Knowledge Network for Ultrasound denoising with Fine-grained Refinement

    Authors: Zhenyu Bu, Kai-Ni Wang, Fuxing Zhao, Shengxiao Li, Guang-Quan Zhou

    Abstract: Ultrasound imaging serves as an effective and non-invasive diagnostic tool commonly employed in clinical examinations. However, the presence of speckle noise in ultrasound images invariably degrades image quality, impeding the performance of subsequent tasks, such as segmentation and classification. Existing methods for speckle noise reduction frequently induce excessive image smoothing or fail to… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: Submitted to ICASSP 2024

  22. arXiv:2309.14158  [pdf, other

    cs.SD eess.AS

    An Investigation of Distribution Alignment in Multi-Genre Speaker Recognition

    Authors: Zhenyu Zhou, Junhui Chen, Namin Wang, Lantian Li, Dong Wang

    Abstract: Multi-genre speaker recognition is becoming increasingly popular due to its ability to better represent the complexities of real-world applications. However, a major challenge is the significant shift in the distribution of speaker vectors across different genres. While distribution alignment is a common approach to address this challenge, previous studies have mainly focused on aligning a source… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: submitted to ICASSP 2024

  23. arXiv:2308.09512  [pdf, other

    cs.IT eess.SP

    Multiuser Communications with Movable-Antenna Base Station: Joint Antenna Positioning, Receive Combining, and Power Control

    Authors: Zhenyu Xiao, Xiangyu Pi, Lipeng Zhu, Xiang-Gen Xia, Rui Zhang

    Abstract: Movable antenna (MA) is an emerging technology which enables a local movement of the antenna in the transmitter/receiver region for improving the channel condition and communication performance. In this paper, we study the deployment of multiple MAs at the base station (BS) for enhancing the multiuser communication performance. First, we model the multiuser channel in the uplink to characterize th… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2308.05546

  24. arXiv:2308.05546  [pdf, other

    cs.IT eess.SP

    Multiuser Communications with Movable-Antenna Base Station Via Antenna Position Optimization

    Authors: Xiangyu Pi, Lipeng Zhu, Zhenyu Xiao, Rui Zhang

    Abstract: This paper studies the deployment of multiple movable antennas (MAs) at the base station (BS) for enhancing the multiuser communication performance. First, we model the multiuser channel in the uplink to characterize the wireless channel variation caused by MAs' movement at the BS. Then, an optimization problem is formulated to maximize the minimum achievable rate among multiple users for MA-aided… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  25. arXiv:2308.00393  [pdf, other

    cs.LG eess.SP

    A Survey of Time Series Anomaly Detection Methods in the AIOps Domain

    Authors: Zhenyu Zhong, Qiliang Fan, Jiacheng Zhang, Minghua Ma, Shenglin Zhang, Yongqian Sun, Qingwei Lin, Yuzhi Zhang, Dan Pei

    Abstract: Internet-based services have seen remarkable success, generating vast amounts of monitored key performance indicators (KPIs) as univariate or multivariate time series. Monitoring and analyzing these time series are crucial for researchers, service operators, and on-call engineers to detect outliers or anomalies indicating service failures or significant events. Numerous advanced anomaly detection… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  26. arXiv:2307.12286  [pdf, ps, other

    cs.IT eess.SP

    Double-Active-IRS Aided Wireless Communication: Deployment Optimization and Capacity Scaling

    Authors: Zhenyu Kang, Changsheng You, Rui Zhang

    Abstract: In this letter, we consider a double-active-intelligent reflecting surface (IRS) aided wireless communication system, where two active IRSs are properly deployed to assist the communication from a base station (BS) to multiple users located in a given zone via the double-reflection links. Under the assumption of fixed per-element amplification power for each active-IRS element, we formulate a rate… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  27. arXiv:2307.01445  [pdf, ps, other

    eess.SP

    Distributed fusion filter over lossy wireless sensor networks with the presence of non-Gaussian noise

    Authors: Jiacheng He, Bei Peng, Zhenyu Feng, Xuemei Mao, Song Gao, Gang Wang

    Abstract: The information transmission between nodes in a wireless sensor networks (WSNs) often causes packet loss due to denial-of-service (DoS) attack, energy limitations, and environmental factors, and the information that is successfully transmitted can also be contaminated by non-Gaussian noise. The presence of these two factors poses a challenge for distributed state estimation (DSE) over WSNs. In thi… ▽ More

    Submitted 6 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

  28. arXiv:2307.00511  [pdf

    eess.IV cs.CV cs.LG q-bio.NC

    SUGAR: Spherical Ultrafast Graph Attention Framework for Cortical Surface Registration

    Authors: Jianxun Ren, Ning An, Youjia Zhang, Danyang Wang, Zhenyu Sun, Cong Lin, Weigang Cui, Weiwei Wang, Ying Zhou, Wei Zhang, Qingyu Hu, ** Zhang, Dan Hu, Danhong Wang, Hesheng Liu

    Abstract: Cortical surface registration plays a crucial role in aligning cortical functional and anatomical features across individuals. However, conventional registration algorithms are computationally inefficient. Recently, learning-based registration algorithms have emerged as a promising solution, significantly improving processing efficiency. Nonetheless, there remains a gap in the development of a lea… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

  29. arXiv:2306.12898  [pdf

    cond-mat.mes-hall cs.LG eess.IV

    Machine-Learning-Assisted and Real-Time-Feedback-Controlled Growth of InAs/GaAs Quantum Dots

    Authors: Chao Shen, Wenkang Zhan, Kaiyao Xin, Manyang Li, Zhenyu Sun, Hui Cong, Chi Xu, Jian Tang, Zhaofeng Wu, Bo Xu, Zhongming Wei, Chunlai Xue, Chao Zhao, Zhanguo Wang

    Abstract: Self-assembled InAs/GaAs quantum dots (QDs) have properties highly valuable for develo** various optoelectronic devices such as QD lasers and single photon sources. The applications strongly rely on the density and quality of these dots, which has motivated studies of the growth process control to realize high-quality epi-wafers and devices. Establishing the process parameters in molecular beam… ▽ More

    Submitted 11 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: 5 figures

  30. arXiv:2306.10275  [pdf, other

    eess.SY

    Multi-Scale Simulation of Complex Systems: A Perspective of Integrating Knowledge and Data

    Authors: Huandong Wang, Huan Yan, Can Rong, Yuan Yuan, Fenyu Jiang, Zhenyu Han, Hongjie Sui, Depeng **, Yong Li

    Abstract: Complex system simulation has been playing an irreplaceable role in understanding, predicting, and controlling diverse complex systems. In the past few decades, the multi-scale simulation technique has drawn increasing attention for its remarkable ability to overcome the challenges of complex system simulation with unknown mechanisms and expensive computational costs. In this survey, we will syste… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

  31. arXiv:2306.05581  [pdf, other

    eess.SY math.OC

    Risk-aware Urban Air Mobility Network Design with Overflow Redundancy

    Authors: Qinshuang Wei, Zhenyu Gao, John-Paul Clarke, Ufuk Topcu

    Abstract: Urban air mobility (UAM), as envisioned by aviation professionals, will transport passengers and cargo at low altitudes within urban and suburban areas. To operate in urban environments, precise air traffic management, in particular the management of traffic overflows due to physical and operational disruptions will be critical to ensuring system safety and efficiency. To this end, we propose UAM… ▽ More

    Submitted 23 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 44 pages, 10 figures

  32. arXiv:2305.16445  [pdf, other

    cs.SD cs.DC eess.AS

    SoundSieve: Seconds-Long Audio Event Recognition on Intermittently-Powered Systems

    Authors: Mahathir Monjur, Yubo Luo, Zhenyu Wang, Shahriar Nirjon

    Abstract: A fundamental problem of every intermittently-powered sensing system is that signals acquired by these systems over a longer period in time are also intermittent. As a consequence, these systems fail to capture parts of a longer-duration event that spans over multiple charge-discharge cycles of the capacitor that stores the harvested energy. From an application's perspective, this is viewed as spo… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: The 21st ACM International Conference on Mobile Systems, Applications, and Services (Mobisys 2023)

  33. arXiv:2305.06806  [pdf, other

    cs.SD eess.AS

    HappyQuokka System for ICASSP 2023 Auditory EEG Challenge

    Authors: Zhenyu Piao, Miseul Kim, Hyungchan Yoon, Hong-Goo Kang

    Abstract: This report describes our submission to Task 2 of the Auditory EEG Decoding Challenge at ICASSP 2023 Signal Processing Grand Challenge (SPGC). Task 2 is a regression problem that focuses on reconstructing a speech envelope from an EEG signal. For the task, we propose a pre-layer normalized feed-forward transformer (FFT) architecture. For within-subjects generation, we additionally utilize an auxil… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: First Place in Task 2 of Auditory EEG decoding Challenge, which is part of ICASSP Signal Processing Grand Challenge (SPGC) 2023

  34. arXiv:2304.13471  [pdf, other

    eess.IV cs.CV

    OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

    Authors: Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

    Abstract: 360° omnidirectional images have gained research attention due to their immersive and interactive experience, particularly in AR/VR applications. However, they suffer from lower angular resolution due to being captured by fisheye lenses with the same sensor size for capturing planar images. To solve the above issues, we propose a two-stage framework for 360° omnidirectional image superresolution.… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPRW 2023

  35. arXiv:2304.04773  [pdf, other

    eess.IV cs.CV

    HDR Video Reconstruction with a Large Dynamic Dataset in Raw and sRGB Domains

    Authors: Huan**g Yue, Yubo Peng, Biting Yu, Xuanwu Yin, Zhenyu Zhou, **gyu Yang

    Abstract: High dynamic range (HDR) video reconstruction is attracting more and more attention due to the superior visual quality compared with those of low dynamic range (LDR) videos. The availability of LDR-HDR training pairs is essential for the HDR reconstruction quality. However, there are still no real LDR-HDR pairs for dynamic scenes due to the difficulty in capturing LDR-HDR frames simultaneously. In… ▽ More

    Submitted 12 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  36. arXiv:2304.00070  [pdf, ps, other

    eess.SP

    HybridCVLNet: A Hybrid CSI Feedback System and its Domain Adaptation

    Authors: Haozhen Li, Xinyu Gu, Boyuan Zhang, Dongliang Li, Zhenyu Liu, Lin Zhang

    Abstract: Deep Learning (DL)-based channel state information (CSI) feedback is a promising technique for the transmitter to accurately acquire the CSI of massive multiple-input multiple-output (MIMO) systems. As critical concerns about DL-based physical layer applications, the intra-domain generalizability affected by dataset bias and inter-domain robustness in data drift remain challenging. Therefore, we b… ▽ More

    Submitted 3 May, 2023; v1 submitted 30 March, 2023; originally announced April 2023.

  37. Breaking Symmetries Leads to Diverse Quadrupedal Gaits

    Authors: Jiayu Ding, Zhenyu Gan

    Abstract: Symmetry manifests itself in legged locomotion in a variety of ways. No matter where a legged system begins to move periodically, the torso and limbs coordinate with each other's movements in a similar manner. Also, in many gaits observed in nature, the legs on both sides of the torso move in exactly the same way, sometimes they are just half a period out of phase. Furthermore, when some animals m… ▽ More

    Submitted 8 April, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Please refer to the published version to cite this paper

    Journal ref: IEEE Robotics and Automation Letters, Institute of Electrical and Electronics Engineers (IEEE), 2024

  38. arXiv:2302.09257  [pdf, other

    cs.IT eess.SP

    mmWave Coverage Extension Using Reconfigurable Intelligent Surfaces in Indoor Dense Spaces

    Authors: Zhenyu Li, Ozan Alp Topal, Özlem Tuğfe Demir, Emil Björnson, Cicek Cavdar

    Abstract: In this work, we consider the deployment of reconfigurable intelligent surfaces (RISs) to extend the coverage of a millimeter-wave (mmWave) network in indoor dense spaces. We first integrate RIS into ray-tracing simulations to realistically capture the propagation characteristics, then formulate a non-convex optimization problem that minimizes the number of RISs under rate constraints. We propose… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: 6 pages 8 figures. Accepted to be presented in IEEE ICC 2023

  39. Deep Learning-Based Modeling of 5G Core Control Plane for 5G Network Digital Twin

    Authors: Zhenyu Tao, Yongliang Guo, Guanghui He, Yongming Huang, Xiaohu You

    Abstract: Digital twin serves as a crucial facilitator in the advancement and implementation of emerging technologies within 5G and beyond networks. However, the intricate structure and diverse functionalities of the existing 5G core network, especially the control plane, present challenges in constructing core network digital twins. In this paper, we propose two novel data-driven architectures for modeling… ▽ More

    Submitted 18 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Journal ref: IEEE Transactions on Cognitive Communications and Networking

  40. arXiv:2301.05351  [pdf, other

    eess.SY

    Data-driven Moving Horizon Estimation for Angular Velocity of Space Noncooperative Target in Eddy Current De-tumbling Mission

    Authors: Xiyao Liu, Haitao Chang, Zhenyu Lu, Panfeng Huang

    Abstract: Angular velocity estimation is critical for eddy current de-tumbling of noncooperative space targets. However, unknown model of the noncooperative target and few observation data make the model-based estimation methods challenged. In this paper, a Data-driven Moving Horizon Estimation method is proposed to estimate the angular velocity of the noncooperative target with de-tumbling torque. In this… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  41. arXiv:2301.04311  [pdf, other

    cs.IT eess.SP

    Active-IRS-Aided Wireless Communication: Fundamentals, Designs and Open Issues

    Authors: Zhenyu Kang, Changsheng You, Rui Zhang

    Abstract: Intelligent reflecting surface (IRS) has emerged as a promising technology to realize smart radio environment for future wireless communication systems. Existing works in this line of research have mainly considered the conventional passive IRS that reflects wireless signals without power amplification, while in this article, we give an overview of a new type of IRS, called active IRS, which enabl… ▽ More

    Submitted 25 June, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

  42. arXiv:2212.05360  [pdf, other

    eess.AS cs.AI cs.LG

    Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation

    Authors: Rohith Aralikatti, Zhenyu Tang, Dinesh Manocha

    Abstract: We present a novel approach to improve the performance of learning-based speech dereverberation using accurate synthetic datasets. Our approach is designed to recover the reverb-free signal from a reverberant speech signal. We show that accurately simulating the low-frequency components of Room Impulse Responses (RIRs) is important to achieving good dereverberation. We use the GWA dataset that con… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.

    Comments: Submitted to ICASSP 2023

  43. arXiv:2211.15995  [pdf

    eess.IV

    Shadow-Oriented Tracking Method for Multi-Target Tracking in Video-SAR

    Authors: Xiaochuan Ni, Xiaoling Zhang, Xu Zhan, Zhenyu Yang, Jun Shi, Shunjun Wei, Tianjiao Zeng

    Abstract: This work focuses on multi-target tracking in Video synthetic aperture radar. Specifically, we refer to tracking based on targets' shadows. Current methods have limited accuracy as they fail to consider shadows' characteristics and surroundings fully. Shades are low-scattering and varied, resulting in missed tracking. Surroundings can cause interferences, resulting in false tracking. To solve thes… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  44. arXiv:2211.14785  [pdf, ps, other

    cs.IT eess.SP

    Deep Learning for Efficient CSI Feedback in Massive MIMO: Adapting to New Environments and Small Datasets

    Authors: Zhenyu Liu, Li Wang, Lianming Xu, Zhi Ding

    Abstract: Deep learning (DL)-based channel state information (CSI) feedback has shown promising potential to improve spectrum efficiency in massive MIMO systems. However, practical DL approaches require a sizeable CSI dataset for each scenario, and require large storage or updating bandwidth for multiple learned models. To overcome this costly barrier, we develop a solution for efficient training and deploy… ▽ More

    Submitted 6 November, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: 13 pages, 11 figures, 6 tables

  45. arXiv:2211.12671  [pdf, other

    eess.SP

    3-D Positioning and Resource Allocation for Multi-UAV Base Stations Under Blockage-Aware Channel Model

    Authors: Pengfei Yi, Lipeng Zhu, Zhenyu Xiao, Rui Zhang, Zhu Han, Xiang-Gen Xia

    Abstract: In this paper, we propose to deploy multiple unmanned aerial vehicle (UAV) mounted base stations to serve ground users in outdoor environments with obstacles. In particular, the geographic information is employed to capture the blockage effects for air-to-ground (A2G) links caused by buildings, and a realistic blockage-aware A2G channel model is proposed to characterize the continuous variation of… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  46. arXiv:2211.09913  [pdf, other

    cs.SD cs.AI eess.AS

    Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition

    Authors: Zhenyu Wang, John H. L. Hansen

    Abstract: Adapting speaker recognition systems to new environments is a widely-used technique to improve a well-performing model learned from large-scale data towards a task-specific small-scale data scenarios. However, previous studies focus on single domain adaptation, which neglects a more practical scenario where training data are collected from multiple acoustic domains needed in forensic scenarios. Au… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING

  47. Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning

    Authors: Zhenyu Wang, John H. L. Hansen

    Abstract: Automatic speaker verification systems are vulnerable to a variety of access threats, prompting research into the formulation of effective spoofing detection systems to act as a gate to filter out such spoofing attacks. This study introduces a simple attention module to infer 3-dim attention weights for the feature map in a convolutional layer, which then optimizes an energy function to determine… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Interspeech 2022

  48. arXiv:2211.04470  [pdf, other

    cs.CV eess.IV

    Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo , et al. (14 additional authors not shown)

    Abstract: Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks. Thus, it is very crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to develop deep learning-based single image depth es… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.08630, arXiv:2211.03885; text overlap with arXiv:2105.08819, arXiv:2105.08826, arXiv:2105.08629, arXiv:2105.07809, arXiv:2105.07825

  49. arXiv:2210.08493  [pdf, other

    cs.RO cs.LG eess.SY

    Indoor Smartphone SLAM with Learned Echoic Location Features

    Authors: Wenjie Luo, Qun Song, Zhenyu Yan, Rui Tan, Guosheng Lin

    Abstract: Indoor self-localization is a highly demanded system function for smartphones. The current solutions based on inertial, radio frequency, and geomagnetic sensing may have degraded performance when their limiting factors take effect. In this paper, we present a new indoor simultaneous localization and map** (SLAM) system that utilizes the smartphone's built-in audio hardware and inertial measureme… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

  50. arXiv:2210.06512  [pdf

    q-bio.QM cs.CV eess.IV

    Quantifying U-Net Uncertainty in Multi-Parametric MRI-based Glioma Segmentation by Spherical Image Projection

    Authors: Zhenyu Yang, Kyle Lafata, Eugene Vaios, Zongsheng Hu, Trey Mullikin, Fang-Fang Yin, Chunhao Wang

    Abstract: The projection of planar MRI data onto a spherical surface is equivalent to a nonlinear image transformation that retains global anatomical information. By incorporating this image transformation process in our proposed spherical projection-based U-Net (SPU-Net) segmentation model design, multiple independent segmentation predictions can be obtained from a single MRI. The final segmentation is the… ▽ More

    Submitted 12 August, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: 31 pages, 9 figures, 1 table