Showing 1–50 of 71 results for author: ZHENG, T

  1. arXiv:2406.19706  [pdf, other]

    cs.SD eess.AS

    SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR

    Authors: Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng

    Abstract: Mixture-of-experts (MoE) models have achieved excellent results in many tasks. However, conventional MoE models are often very large, making them challenging to deploy on resource-constrained edge devices. In this paper, we propose a novel speaker adaptive mixture of LoRA experts (SAML) approach, which uses low-rank adaptation (LoRA) modules as experts to reduce the number of trainable parameters…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 5 pages, accepted by Interspeech 2024. arXiv admin note: substantial text overlap with arXiv:2309.09136

  2. arXiv:2406.17286  [pdf]

    cs.RO eess.SY

    Prioritized experience replay-based DDQN for Unmanned Vehicle Path Planning

    Authors: Liu Lipeng, Letian Xu, Jiabei Liu, Haopeng Zhao, Tongzhou Jiang, Tianyao Zheng

    Abstract: Path planning module is a key module for autonomous vehicle navigation, which directly affects its operating efficiency and safety. In complex environments with many obstacles, traditional planning algorithms often cannot meet the needs of intelligence, which may lead to problems such as dead zones in unmanned vehicles. This paper proposes a path planning algorithm based on DDQN and combines it wi…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 4 pages, 6 figures, 2024 5th International Conference on Information Science, Parallel and Distributed Systems

  3. arXiv:2406.07989  [pdf, other]

    cs.IT eess.SP

    Near-Field Wideband Beam Training Based on Distance-Dependent Beam Split

    Authors: Tianyue Zheng, Mingyao Cui, Zidong Wu, Linglong Dai

    Abstract: Near-field beam training is essential for acquiring channel state information in 6G extremely large-scale multiple input multiple output (XL-MIMO) systems. To achieve low-overhead beam training, existing method has been proposed to leverage the near-field beam split effect, which deploys true-time-delay arrays to simultaneously search multiple angles of the entire angular range in a distance ring…

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2404.06393  [pdf, other]

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal…

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  5. arXiv:2402.16153  [pdf, other]

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    ChatMusician: Understanding and Generating Music Intrinsically with LLM

    Authors: Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, Jingcheng Wu, Chenghua Lin, Qifeng Liu, et al. (10 additional authors not shown)

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities in text generation, we find that their ability has yet to be generalized to music, humanity's creative language. We introduce ChatMusician, an open-source LLM that integrates intrinsic musical abilities. It is based on continual pre-training and finetuning LLaMA2 on a text-compatible music representation, ABC notation, and the…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: GitHub: https://shanghaicannon.github.io/ChatMusician/

  6. arXiv:2401.05850  [pdf, other]

    cs.SD eess.AS

    Contrastive Loss Based Frame-wise Feature disentanglement for Polyphonic Sound Event Detection

    Authors: Yadong Guan, Jiqing Han, Hongwei Song, Wenjie Song, Guibin Zheng, Tieran Zheng, Yongjun He

    Abstract: Overlapping sound events are ubiquitous in real-world environments, but existing end-to-end sound event detection (SED) methods still struggle to detect them effectively. A critical reason is that these methods represent overlapping events using shared and entangled frame-wise features, which degrades the feature discrimination. To solve the problem, we propose a disentangled feature learning fram…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  7. arXiv:2401.01673  [pdf, other]

    cs.IT eess.SP

    Coded Beam Training

    Authors: Tianyue Zheng, Jieao Zhu, Qiumo Yu, Yongli Yan, Linglong Dai

    Abstract: In extremely large-scale multiple input multiple output (XL-MIMO) systems for future sixth-generation (6G) communications, codebook-based beam training stands out as a promising technology to acquire channel state information (CSI). Despite their effectiveness, when the pilot overhead is limited, existing beam training methods suffer from significant achievable rate degradation for remote users wi…

    Submitted 6 March, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: In this paper, we introduce channel coding theory into hierarchical beam training and propose a beam training scheme called coded beam training. By leveraging the error-correcting capability of channel codes, the proposed coded beam training method can enable reliable beam training performance for remote users with low SNR, while keeping training overhead low

  8. arXiv:2310.13090  [pdf]

    eess.SY

    Closed-Loop Motion Planning for Differentially Flat Systems: A Time-Varying Optimization Framework

    Authors: Tianqi Zheng, John W. Simpson-Porco, Enrique Mallada

    Abstract: Motion planning and control are two core components of the robotic systems autonomy stack. The standard approach to combine these methodologies comprises an offline/open-loop stage, planning, that designs a feasible and safe trajectory to follow, and an online/closed-loop stage, tracking, that corrects for unmodeled dynamics and disturbances. Such an approach generally introduces conservativeness…

    Submitted 19 October, 2023; originally announced October 2023.

  9. arXiv:2310.06336  [pdf, other]

    eess.SP eess.SY

    HoloFed: Environment-Adaptive Positioning via Multi-band Reconfigurable Holographic Surfaces and Federated Learning

    Authors: Jingzhi Hu, Zhe Chen, Tianyue Zheng, Robert Schober, Jun Luo

    Abstract: Positioning is an essential service for various applications and is expected to be integrated with existing communication infrastructures in 5G and 6G. Though current Wi-Fi and cellular base stations (BSs) can be used to support this integration, the resulting precision is unsatisfactory due to the lack of precise control of the wireless signals. Recently, BSs adopting reconfigurable holographic s…

    Submitted 10 October, 2023; originally announced October 2023.

  10. arXiv:2309.09136  [pdf, other]

    cs.SD cs.AI eess.AS

    Enhancing Quantised End-to-End ASR Models via Personalisation

    Authors: Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng

    Abstract: Recent end-to-end automatic speech recognition (ASR) models have become increasingly larger, making them particularly challenging to be deployed on resource-constrained devices. Model quantisation is an effective solution that sometimes causes the word error rate (WER) to increase. In this paper, a novel strategy of personalisation for a quantised model (PQM) is proposed, which combines speaker ad…

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: 5 pages, submitted to ICASSP 2024

  11. arXiv:2306.17434  [pdf]

    eess.IV

    A Motion Assessment Method for Reference Stack Selection in Fetal Brain MRI Reconstruction Based on Tensor Rank Approximation

    Authors: Haoan Xu, Wen Shi, Jiwei Sun, Tianshu Zheng, Cong Sun, Sun Yi, Guangbin Wang, Dan Wu

    Abstract: Purpose: Slice-to-volume registration and super-resolution reconstruction (SVR-SRR) is commonly used to generate 3D volumes of the fetal brain from 2D stacks of slices acquired in multiple orientations. A critical initial step in this pipeline is to select one stack with the minimum motion as a reference for registration. An accurate and unbiased motion assessment (MA) is thus crucial for successf…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 6 figures. Correspondence to: Dan Wu, Ph.D.

  12. arXiv:2306.07105  [pdf, ps, other]

    cs.IT eess.SP

    STAR-RIS Assisted Covert Communications in NOMA Systems

    Authors: Han Xiao, Xiaoyan Hu, Tong-Xing Zheng, Kai-Kit Wong

    Abstract: Covert communications assisted by simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) in non-orthogonal multiple access (NOMA) systems have been explored in this paper. In particular, the access point (AP) transmitter adopts NOMA to serve a downlink covert user and a public user. The minimum detection error probability (DEP) at the warden is derived considering…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.04930, arXiv:2305.03991

  13. arXiv:2305.03991  [pdf, ps, other]

    cs.IT eess.SP

    STAR-RIS Aided Covert Communication

    Authors: Han Xiao, Xiaoyan Hu, Pengcheng Mu, Wenjie Wang, Tong-Xing Zheng, Kai-Kit Wong, Kun Yang

    Abstract: This paper investigates the multi-antenna covert communications assisted by a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS). In particular, to shelter the existence of communications between transmitter and receiver from a warden, a friendly full-duplex receiver with two antennas is leveraged to make contributions to confuse the warden. Considering the wo…

    Submitted 30 August, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

  14. arXiv:2305.03328  [pdf, other]

    eess.AS cs.SD

    Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection

    Authors: Jian Guan, Youde Liu, Qiaoxi Zhu, Tieran Zheng, Jiqing Han, Wenwu Wang

    Abstract: Although deep learning is the mainstream method in unsupervised anomalous sound detection, Gaussian Mixture Model (GMM) with statistical audio frequency representation as input can achieve comparable results with much lower model complexity and fewer parameters. Existing statistical frequency representations, e.g., the log-Mel spectrogram's average or maximum over time, do not always work well for…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: To appear at ICASSP 2023

  15. arXiv:2304.07990  [pdf, other]

    eess.SY

    Novel Quality Measure and Efficient Resolution of Convex Hull Pricing for Unit Commitment

    Authors: Mikhail A. Bragin, Farhan Hyder, Bing Yan, Peter B. Luh, Jinye Zhao, Feng Zhao, Dane A. Schiro, Tongxin Zheng

    Abstract: Electricity prices determined by economic dispatch that do not consider fixed costs may lead to significant uplift payments. However, when fixed costs are included, prices become non-monotonic with respect to demand, which can adversely impact market transparency. To overcome this issue, convex hull (CH) pricing has been introduced for unit commitment with fixed costs. Several CH pricing methods h…

    Submitted 17 April, 2023; originally announced April 2023.

  16. arXiv:2303.06130  [pdf, other]

    cs.RO eess.SY

    Full State Estimation of Continuum Robots From Tip Velocities: A Cosserat-Theoretic Boundary Observer

    Authors: Tongjia Zheng, Qing Han, Hai Lin

    Abstract: State estimation of robotic systems is essential to implementing feedback controllers which usually provide better robustness to modeling uncertainties than open-loop controllers. However, state estimation of soft robots is very challenging because soft robots have theoretically infinite degrees of freedom while existing sensors only provide a limited number of discrete measurements. In this paper…

    Submitted 26 June, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  17. arXiv:2302.14752  [pdf, other]

    cs.RO eess.SY

    Multi-Robot-Guided Crowd Evacuation: Two-Scale Modeling and Control

    Authors: Tongjia Zheng, Zhenyuan Yuan, Mollik Nayyar, Alan R. Wagner, Minghui Zhu, Hai Lin

    Abstract: Emergency evacuation describes a complex situation involving time-critical decision-making by evacuees. Mobile robots are being actively explored as a potential solution to provide timely guidance. In this work, we study a robot-guided crowd evacuation problem where a small group of robots is used to guide a large human crowd to safe locations. The challenge lies in how to use micro-level human-ro…

    Submitted 11 January, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

  18. arXiv:2212.01505  [pdf, other]

    cs.LG eess.SY

    Constrained Reinforcement Learning via Dissipative Saddle Flow Dynamics

    Authors: Tianqi Zheng, Pengcheng You, Enrique Mallada

    Abstract: In constrained reinforcement learning (C-RL), an agent seeks to learn from the environment a policy that maximizes the expected cumulative reward while satisfying minimum requirements in secondary cumulative reward constraints. Several algorithms rooted in sampled-based primal-dual methods have been recently proposed to solve this problem in policy space. However, such methods are based on stochas…

    Submitted 2 December, 2022; originally announced December 2022.

  19. arXiv:2210.05066  [pdf, ps, other]

    math.OC eess.SP

    A Linearly Convergent Algorithm for Rotationally Invariant $\ell_1$-Norm Principal Component Analysis

    Authors: Taoli Zheng, Peng Wang, Anthony Man-Cho So

    Abstract: To do dimensionality reduction on the datasets with outliers, the $\ell_1$-norm principal component analysis (L1-PCA) as a typical robust alternative of the conventional PCA has enjoyed great popularity over the past years. In this work, we consider a rotationally invariant L1-PCA, which is hardly studied in the literature. To tackle it, we propose a proximal alternating linearized minimization me…

    Submitted 26 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 11 pages, 3 figures

  20. arXiv:2210.00976  [pdf, other]

    cs.RO eess.SY

    Task Space Tracking of Soft Manipulators: Inner-Outer Loop Control Based on Cosserat-Rod Models

    Authors: Tongjia Zheng, Qing Han, Hai Lin

    Abstract: Soft robots are robotic systems made of deformable materials and exhibit unique flexibility that can be exploited for complex environments and tasks. However, their control problem has been considered a challenging subject because they are of infinite degrees of freedom and highly under-actuated. Existing studies have mainly relied on simplified and approximated finite-dimensional models. In this…

    Submitted 3 October, 2022; originally announced October 2022.

  21. arXiv:2209.09795  [pdf, other]

    cs.RO eess.SY

    Multi-Robot-Assisted Human Crowd Evacuation using Navigation Velocity Fields

    Authors: Tongjia Zheng, Zhenyuan Yuan, Mollik Nayyar, Alan R. Wagner, Minghui Zhu, Hai Lin

    Abstract: This work studies a robot-assisted crowd evacuation problem where we control a small group of robots to guide a large human crowd to safe locations. The challenge lies in how to model human-robot interactions and design robot controls to indirectly control a human population that significantly outnumbers the robots. To address the challenge, we treat the crowd as a continuum and formulate the evac…

    Submitted 20 September, 2022; originally announced September 2022.

  22. arXiv:2207.05896  [pdf, other]

    cs.RO eess.SY

    Safe Human-Robot Collaborative Transportation via Trust-Driven Role Adaptation

    Authors: Tony Zheng, Monimoy Bujarbaruah, Yvonne R. Stürz, Francesco Borrelli

    Abstract: We study a human-robot collaborative transportation task in presence of obstacles. The task for each agent is to carry a rigid object to a common target position, while safely avoiding obstacles and satisfying the compliance and actuation constraints of the other agent. Human and robot do not share the local view of the environment. The human policy either assists the robot when they deem the robo…

    Submitted 12 July, 2022; originally announced July 2022.

  23. arXiv:2205.09048  [pdf, other]

    eess.IV cs.CV

    Global Contrast Masked Autoencoders Are Powerful Pathological Representation Learners

    Authors: Hao Quan, Xingyu Li, Weixing Chen, Qun Bai, Mingchen Zou, Ruijie Yang, Tingting Zheng, Ruiqun Qi, Xinghua Gao, Xiaoyu Cui

    Abstract: Based on digital pathology slice scanning technology, artificial intelligence algorithms represented by deep learning have achieved remarkable results in the field of computational pathology. Compared to other medical images, pathology images are more difficult to annotate, and thus, there is an extreme lack of available datasets for conducting supervised learning to train robust deep learning mod…

    Submitted 15 November, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

  24. arXiv:2205.06450  [pdf, other]

    eess.SP cs.CV

    A microstructure estimation Transformer inspired by sparse representation for diffusion MRI

    Authors: Tianshu Zheng, Cong Sun, Weihao Zheng, Wen Shi, Haotian Li, Yi Sun, Yi Zhang, Guangbin Wang, Chuyang Ye, Dan Wu

    Abstract: Diffusion magnetic resonance imaging (dMRI) is an important tool in characterizing tissue microstructure based on biophysical models, which are complex and highly non-linear. Resolving microstructures with optimization techniques is prone to estimation errors and requires dense sampling in the q-space. Deep learning based approaches have been proposed to overcome these limitations. Motivated by th…

    Submitted 13 May, 2022; originally announced May 2022.

  25. AFFIRM: Affinity Fusion-based Framework for Iteratively Random Motion correction of multi-slice fetal brain MRI

    Authors: Wen Shi, Haoan Xu, Cong Sun, Jiwei Sun, Yamin Li, Xinyi Xu, Tianshu Zheng, Yi Zhang, Guangbin Wang, Dan Wu

    Abstract: Multi-slice magnetic resonance images of the fetal brain are usually contaminated by severe and arbitrary fetal and maternal motion. Hence, stable and robust motion correction is necessary to reconstruct high-resolution 3D fetal brain volume for clinical diagnosis and quantitative analysis. However, the conventional registration-based correction has a limited capture range and is insufficient for…

    Submitted 11 May, 2022; originally announced May 2022.

  26. arXiv:2205.02850  [pdf]

    eess.IV cs.AI cs.CV

    A Deep Reinforcement Learning Framework for Rapid Diagnosis of Whole Slide Pathological Images

    Authors: Tingting Zheng, Weixing Chen, Shuqin Li, Hao Quan, Qun Bai, Tianhang Nan, Song Zheng, Xinghua Gao, Yue Zhao, Xiaoyu Cui

    Abstract: The deep neural network is a research hotspot for histopathological image analysis, which can improve the efficiency and accuracy of diagnosis for pathologists or be used for disease screening. The whole slide pathological image can reach one gigapixel and contains abundant tissue feature information, which needs to be divided into a lot of patches in the training and inference stages. This will l…

    Submitted 5 May, 2022; originally announced May 2022.

  27. arXiv:2204.06257  [pdf, ps, other]

    cs.IT eess.SP

    Physical layer security in large-scale random multiple access wireless sensor networks: a stochastic geometry approach

    Authors: Tong-Xing Zheng, Xin Chen, Chao Wang, Kai-Kit Wong, Jinhong Yuan

    Abstract: This paper investigates physical layer security for a large-scale WSN with random multiple access, where each fusion center in the network randomly schedules a number of sensors to upload their sensed data subject to the overhearing of randomly distributed eavesdroppers. We propose an uncoordinated random jamming scheme in which those unscheduled sensors send jamming signals with a certain probabi…

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: accepted by the IEEE Transactions on Communications

  28. arXiv:2203.13724  [pdf, other]

    cs.RO eess.SY

    PDE-based Dynamic Control and Estimation of Soft Robotic Arms

    Authors: Tongjia Zheng, Hai Lin

    Abstract: Compared with traditional rigid-body robots, soft robots not only exhibit unprecedented adaptation and flexibility but also present novel challenges in their modeling and control because of their infinite degrees of freedom. Most of the existing approaches have mainly relied on approximated models so that the well-developed finite-dimensional control theory can be exploited. However, this may brin…

    Submitted 20 September, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  29. arXiv:2201.03053  [pdf, other]

    eess.IV cs.CV cs.LG

    COVID-19 Infection Segmentation from Chest CT Images Based on Scale Uncertainty

    Authors: Masahiro Oda, Tong Zheng, Yuichiro Hayashi, Yoshito Otake, Masahiro Hashimoto, Toshiaki Akashi, Shigeki Aoki, Kensaku Mori

    Abstract: This paper proposes a segmentation method of infection regions in the lung from CT volumes of COVID-19 patients. COVID-19 spread worldwide, causing many infected patients and deaths. CT image-based diagnosis of COVID-19 can provide quick and accurate diagnosis results. An automated segmentation method of infection regions in the lung provides a quantitative criterion for diagnosis. Previous method…

    Submitted 9 January, 2022; originally announced January 2022.

    Comments: Accepted as an oral presentation at CLIP 2021, the 10th MICCAI CLIP Workshop

    Journal ref: DCL 2021, PPML 2021, LL-COVID19 2021, CLIP 2021, Lecture Notes in Computer Science (LNCS) 12969, pp.88-97

  30. arXiv:2112.10894  [pdf]

    cs.NE cs.LG eess.SP

    Subject-Independent Drowsiness Recognition from Single-Channel EEG with an Interpretable CNN-LSTM model

    Authors: Jian Cui, Zirui Lan, Tianhu Zheng, Yisi Liu, Olga Sourina, Lipo Wang, Wolfgang Müller-Wittig

    Abstract: For EEG-based drowsiness recognition, it is desirable to use subject-independent recognition since conducting calibration on each subject is time-consuming. In this paper, we propose a novel Convolutional Neural Network (CNN)-Long Short-Term Memory (LSTM) model for subject-independent drowsiness recognition from single-channel EEG signals. Different from existing deep learning models that are most…

    Submitted 21 November, 2021; originally announced December 2021.

    Journal ref: 2021 International Conference on Cyberworlds (CW), 2021, pp. 201-208

  31. arXiv:2111.12324  [pdf, other]

    cs.SD eess.AS

    How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition

    Authors: Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang

    Abstract: The way that humans encode their emotion into speech signals is complex. For instance, an angry man may increase his pitch and speaking rate, and use impolite words. In this paper, we present a preliminary study on various emotional factors and investigate how each of them impacts modern emotion recognition systems. The key tool of our study is the SpeechFlow model presented recently, by which we…

    Submitted 24 November, 2021; originally announced November 2021.

  32. MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB Radar

    Authors: Tianyue Zheng, Zhe Chen, Shujie Zhang, Chao Cai, Jun Luo

    Abstract: Crucial for healthcare and biomedical applications, respiration monitoring often employs wearable sensors in practice, causing inconvenience due to their direct contact with human bodies. Therefore, researchers have been constantly searching for contact-free alternatives. Nonetheless, existing contact-free designs mostly require human subjects to remain static, largely confining their adoptions in…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 14 pages

    Journal ref: SenSys '21: Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, November 2021

  33. RF-Net: a Unified Meta-learning Framework for RF-enabled One-shot Human Activity Recognition

    Authors: Shuya Ding, Zhe Chen, Tianyue Zheng, Jun Luo

    Abstract: Radio-Frequency (RF) based device-free Human Activity Recognition (HAR) rises as a promising solution for many applications. However, device-free (or contactless) sensing is often more sensitive to environment changes than device-based (or wearable) sensing. Also, RF datasets strictly require on-line labeling during collection, starkly different from image and text data collections where human int…

    Submitted 28 October, 2021; originally announced November 2021.

    Comments: 14 pages

    Journal ref: SenSys '20: Proceedings of the 18th Conference on Embedded Networked Sensor Systems, November 2020

  34. Enhancing RF Sensing with Deep Learning: A Layered Approach

    Authors: Tianyue Zheng, Zhe Chen, Shuya Ding, Jun Luo

    Abstract: In recent years, radio frequency (RF) sensing has gained increasing popularity due to its pervasiveness, low cost, non-intrusiveness, and privacy preservation. However, realizing the promises of RF sensing is highly nontrivial, given typical challenges such as multipath and interference. One potential solution leverages deep learning to build direct mappings from the RF domain to target domains, h…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 7 pages

    Journal ref: IEEE Communications Magazine (Volume: 59, Issue: 2, February 2021)

  35. arXiv:2110.14848  [pdf, other]

    eess.SP cs.LG cs.NI

    V2iFi: in-Vehicle Vital Sign Monitoring via Compact RF Sensing

    Authors: Tianyue Zheng, Zhe Chen, Chao Cai, Jun Luo, Xu Zhang

    Abstract: Given the significant amount of time people spend in vehicles, health issues under driving condition have become a major concern. Such issues may vary from fatigue, asthma, stroke, to even heart attack, yet they can be adequately indicated by vital signs and abnormal activities. Therefore, in-vehicle vital sign monitoring can help us predict and hence prevent these issues. Whereas existing sensor-…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 27 pages

    Journal ref: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Volume 4, Issue 2, June 2020

  36. RF-Based Human Activity Recognition Using Signal Adapted Convolutional Neural Network

    Authors: Zhe Chen, Chao Cai, Tianyue Zheng, Jun Luo, Jie Xiong, Xin Wang

    Abstract: Human Activity Recognition (HAR) plays a critical role in a wide range of real-world applications, and it is traditionally achieved via wearable sensing. Recently, to avoid the burden and discomfort caused by wearable devices, device-free approaches exploiting RF signals arise as a promising alternative for HAR. Most of the latest device-free approaches require training a large deep neural network…

    Submitted 27 October, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 13 pages

    Journal ref: IEEE Transactions on Mobile Computing, 19 April 2021

  37. SiWa: See into Walls via Deep UWB Radar

    Authors: Tianyue Zheng, Zhe Chen, Jun Luo, Lin Ke, Chaoyang Zhao, Yaowen Yang

    Abstract: Being able to see into walls is crucial for diagnostics of building health; it enables inspections of wall structure without undermining the structural integrity. However, existing sensing devices do not seem to offer a full capability in mapping the in-wall structure while identifying their status (e.g., seepage and corrosion). In this paper, we design and implement SiWa as a low-cost and portabl…

    Submitted 27 October, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 14 pages

    Journal ref: MobiCom '21: Proceedings of the 27th Annual International Conference on Mobile Computing and Networking, October 2021

  38. arXiv:2110.05087  [pdf]

    cs.SD eess.AS

    A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing

    Authors: Wei Liu, Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas Fang Zheng

    Abstract: The choice of an optimal time-frequency resolution is usually a difficult but important step in tasks involving speech signal classification, e.g., speech anti-spoofing. The variations of the performance with different choices of time-frequency resolutions can be as large as those with different model architectures, which makes it difficult to judge what the improvement actually comes from when a n…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: submitted to ICASSP 2022

  39. arXiv:2109.00605  [pdf, ps, other]

    eess.SY

    Backstepping Mean-Field Density Control for Large-Scale Heterogeneous Nonlinear Stochastic Systems

    Authors: Tongjia Zheng, Qing Han, Hai Lin

    Abstract: This work studies the problem of controlling the mean-field density of large-scale stochastic systems, which has applications in various fields such as swarm robotics. Recently, there is a growing amount of literature that employs mean-field partial differential equations (PDEs) to model the density evolution and uses density feedback to design control laws which, by acting on individual systems,…

    Submitted 24 March, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

  40. arXiv:2107.01322  [pdf, other]

    cs.IT eess.SP

    Physical Layer Security for NOMA-Enabled Multi-Access Edge Computing Wireless Networks

    Authors: Yating Wen, Tong-Xing Zheng, Yongxia Tong, Hao-Wen Liu, Xin Chen, Pengcheng Mu, Hui-Ming Wang

    Abstract: Multi-access edge computing (MEC) has been regarded as a promising technique for enhancing computation capabilities for wireless networks. In this paper, we study physical layer security in an MEC system where multiple users offload partial of their computation tasks to a base station simultaneously based on non-orthogonal multiple access (NOMA), in the presence of a malicious eavesdropper. Secrec…

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 6 pages, 3 figures; accepted for presentation at IEEE/CIC ICCC 2021

  41. Distributed Mean-Field Density Estimation for Large-Scale Systems

    Authors: Tongjia Zheng, Qing Han, Hai Lin

    Abstract: This work studies how to estimate the mean-field density of large-scale systems in a distributed manner. Such problems are motivated by the recent swarm control technique that uses mean-field approximations to represent the collective effect of the swarm, wherein the mean-field density (especially its gradient) is usually used in feedback control design. In the first part, we formulate the density…

    Submitted 10 October, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: text overlap with arXiv:2009.05366

  42. arXiv:2106.00899  [pdf, other

    eess.SY

    Feedback Interconnected Mean-Field Density Estimation and Control

    Authors: Tongjia Zheng, Qing Han, Hai Lin

    Abstract: Swarm robotic systems have foreseeable applications in the near future. Recently, there has been an increasing amount of literature that employs mean-field partial differential equations (PDEs) to model the time-evolution of the probability density of swarm robotic systems and uses density feedback to design stabilizing control laws that act on individuals such that their density converges to a ta…

    Submitted 23 March, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

  43. arXiv:2106.00895  [pdf, ps, other

    eess.SY cs.RO

    Field Estimation using Robotic Swarms through Bayesian Regression and Mean-Field Feedback

    Authors: Tongjia Zheng, Hai Lin

    Abstract: Recent years have seen an increased interest in using mean-field density based modelling and control strategy for deploying robotic swarms. In this paper, we study how to dynamically deploy the robots subject to their physical constraints to efficiently measure and reconstruct certain unknown spatial field (e.g. the air pollution index over a city). Specifically, the evolution of the robots' densi…

    Submitted 1 June, 2021; originally announced June 2021.

  44. arXiv:2105.12021  [pdf, other

    math.OC eess.SY

    Inner Approximations of the Positive-Semidefinite Cone via Grassmannian Packings

    Authors: Tianqi Zheng, James Guthrie, Enrique Mallada

    Abstract: We investigate the problem of finding inner approximations of positive semidefinite (PSD) cones. We develop a novel decomposition framework of the PSD cone by means of conical combinations of smaller dimensional sub-cones. We show that many inner approximation techniques could be summarized within this framework, including the set of (scaled) diagonally dominant matrices, Factor-width k matrices, and Cho…

    Submitted 30 September, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  45. Attack on practical speaker verification system using universal adversarial perturbations

    Authors: Weiyi Zhang, Shuning Zhao, Le Liu, Jianmin Li, Xingliang Cheng, Thomas Fang Zheng, Xiaolin Hu

    Abstract: In authentication scenarios, applications of practical speaker verification systems usually require a person to read a dynamic authentication text. Previous studies played an audio adversarial example as a digital signal to perform physical attacks, which would be easily rejected by audio replay detection modules. This work shows that by playing our crafted adversarial perturbation as a separate s…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 6 pages, 2 figures

  46. arXiv:2012.12468  [pdf, other

    cs.SD eess.AS

    CN-Celeb: multi-genre speaker recognition

    Authors: Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fan, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng, Dong Wang

    Abstract: Research on speaker recognition is extending to address the vulnerability in the wild conditions, among which genre mismatch is perhaps the most challenging, for instance, enrollment with reading speech while testing with conversational or singing audio. This mismatch leads to complex and composite inter-session variations, both intrinsic (i.e., speaking style, physiological status) and extrinsic…

    Submitted 24 November, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: submitted to Speech Communication

  47. arXiv:2010.14243  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification

    Authors: Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang

    Abstract: Domain mismatch often occurs in real applications and causes serious performance reduction on speaker verification systems. The common wisdom is to collect cross-domain data and train a multi-domain PLDA model, with the hope to learn a domain-independent speaker subspace. In this paper, we firstly present an empirical study to show that simply adding cross-domain data does not help performance in…

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: Submitted to ICASSP 2021

  48. arXiv:2010.14242  [pdf, other

    cs.SD cs.LG eess.AS

    Deep generative factorization for speech signal

    Authors: Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang

    Abstract: Various information factors are blended in speech signals, which forms the primary difficulty for most speech information processing tasks. An intuitive idea is to factorize speech signal into individual information factors (e.g., phonetic content and speaker trait), though it turns out to be highly challenging. This paper presents a speech factorization approach based on a novel factorial discrim…

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: Submitted to ICASSP 2021

  49. arXiv:2010.10207  [pdf, other

    eess.IV cs.CV cs.LG

    Micro CT Image-Assisted Cross Modality Super-Resolution of Clinical CT Images Utilizing Synthesized Training Dataset

    Authors: Tong Zheng, Hirohisa Oda, Masahiro Oda, Shota Nakamura, Masaki Mori, Hirotsugu Takabatake, Hiroshi Natori, Kensaku Mori

    Abstract: This paper proposes a novel, unsupervised super-resolution (SR) approach for performing the SR of a clinical CT into the resolution level of a micro CT ($μ$CT). The precise non-invasive diagnosis of lung cancer typically utilizes clinical CT data. Due to the resolution limitations of clinical CT (about $0.5 \times 0.5 \times 0.5$ mm$^3$), it is difficult to obtain enough pathological information s…

    Submitted 20 October, 2020; originally announced October 2020.

  50. arXiv:2009.06863  [pdf

    eess.AS cs.CR cs.SD

    When Automatic Voice Disguise Meets Automatic Speaker Verification

    Authors: Linlin Zheng, Jiakang Li, Meng Sun, Xiongwei Zhang, Thomas Fang Zheng

    Abstract: The technique of transforming voices in order to hide the real identity of a speaker is called voice disguise, among which automatic voice disguise (AVD) by modifying the spectral and temporal characteristics of voices with miscellaneous algorithms are easily conducted with softwares accessible to the public. AVD has posed great threat to both human listening and automatic speaker verification (AS…

    Submitted 15 September, 2020; originally announced September 2020.

    Comments: accepted for publication

    Journal ref: IEEE Transactions on Information Forensics and Security, 2020