Search | arXiv e-print repository

Dynamic Data Pruning for Automatic Speech Recognition

Authors: Qiao Xiao, **chuan Ma, Adriana Fernandez-Lopez, Boqian Wu, Lu Yin, Stavros Petridis, Mykola Pechenizkiy, Maja Pantic, Decebal Constantin Mocanu, Shiwei Liu

Abstract: The recent success of Automatic Speech Recognition (ASR) is largely attributed to the ever-growing amount of training data. However, this trend has made model training prohibitively costly and imposed computational demands. While data pruning has been proposed to mitigate this issue by identifying a small subset of relevant data, its application in ASR has been barely explored, and existing works… ▽ More The recent success of Automatic Speech Recognition (ASR) is largely attributed to the ever-growing amount of training data. However, this trend has made model training prohibitively costly and imposed computational demands. While data pruning has been proposed to mitigate this issue by identifying a small subset of relevant data, its application in ASR has been barely explored, and existing works often entail significant overhead to achieve meaningful results. To fill this gap, this paper presents the first investigation of dynamic data pruning for ASR, finding that we can reach the full-data performance by dynamically selecting 70% of data. Furthermore, we introduce Dynamic Data Pruning for ASR (DDP-ASR), which offers several fine-grained pruning granularities specifically tailored for speech-related datasets, going beyond the conventional pruning of entire time sequences. Our intensive experiments show that DDP-ASR can save up to 1.6x training time with negligible performance loss. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: Accepted to Interspeech 2024

arXiv:2404.16522 [pdf, other]

A Deep Learning-Driven Pipeline for Differentiating Hypertrophic Cardiomyopathy from Cardiac Amyloidosis Using 2D Multi-View Echocardiography

Authors: Bo Peng, Xiaofeng Li, Xinyu Li, Zhenghan Wang, Hui Deng, Xiaoxian Luo, Lixue Yin, Hongmei Zhang

Abstract: Hypertrophic cardiomyopathy (HCM) and cardiac amyloidosis (CA) are both heart conditions that can progress to heart failure if untreated. They exhibit similar echocardiographic characteristics, often leading to diagnostic challenges. This paper introduces a novel multi-view deep learning approach that utilizes 2D echocardiography for differentiating between HCM and CA. The method begins by classif… ▽ More Hypertrophic cardiomyopathy (HCM) and cardiac amyloidosis (CA) are both heart conditions that can progress to heart failure if untreated. They exhibit similar echocardiographic characteristics, often leading to diagnostic challenges. This paper introduces a novel multi-view deep learning approach that utilizes 2D echocardiography for differentiating between HCM and CA. The method begins by classifying 2D echocardiography data into five distinct echocardiographic views: apical 4-chamber, parasternal long axis of left ventricle, parasternal short axis at levels of the mitral valve, papillary muscle, and apex. It then extracts features of each view separately and combines five features for disease classification. A total of 212 patients diagnosed with HCM, and 30 patients diagnosed with CA, along with 200 individuals with normal cardiac function(Normal), were enrolled in this study from 2018 to 2022. This approach achieved a precision, recall of 0.905, and micro-F1 score of 0.904, demonstrating its effectiveness in accurately identifying HCM and CA using a multi-view analysis. △ Less

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2402.14276 [pdf, other]

Bispectrum Unbiasing for Dilation-Invariant Multi-reference Alignment

Authors: Li** Yin, Anna Little, Matthew Hirn

Abstract: Motivated by modern data applications such as cryo-electron microscopy, the goal of classic multi-reference alignment (MRA) is to recover an unknown signal $f: \mathbb{R} \to \mathbb{R}$ from many observations that have been randomly translated and corrupted by additive noise. We consider a generalization of classic MRA where signals are also corrupted by a random scale change, i.e. dilation. We p… ▽ More Motivated by modern data applications such as cryo-electron microscopy, the goal of classic multi-reference alignment (MRA) is to recover an unknown signal $f: \mathbb{R} \to \mathbb{R}$ from many observations that have been randomly translated and corrupted by additive noise. We consider a generalization of classic MRA where signals are also corrupted by a random scale change, i.e. dilation. We propose a novel data-driven unbiasing procedure which can recover an unbiased estimator of the bispectrum of the unknown signal, given knowledge of the dilation distribution. Lastly, we invert the recovered bispectrum to achieve full signal recovery, and validate our methodology on a set of synthetic signals. △ Less

Submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.09253 [pdf, other]

Max-Min Fair Energy-Efficient Beam Design for Quantized ISAC LEO Satellite Systems: A Rate-Splitting Approach

Authors: Ziang Liu, Longfei Yin, Wonjae Shin, Bruno Clerckx

Abstract: Low earth orbit (LEO) satellite systems with sensing functionality is envisioned to facilitate global-coverage service and emerging applications in 6G. Currently, two fundamental challenges, namely, inter-beam interference among users and power limitation at the LEO satellites, limit the full potential of the joint design of sensing and communication. To effectively control the interference, rate-… ▽ More Low earth orbit (LEO) satellite systems with sensing functionality is envisioned to facilitate global-coverage service and emerging applications in 6G. Currently, two fundamental challenges, namely, inter-beam interference among users and power limitation at the LEO satellites, limit the full potential of the joint design of sensing and communication. To effectively control the interference, rate-splitting multiple access (RSMA) scheme is employed as the interference management strategy in the system design. On the other hand, to address the limited power supply at the LEO satellites, we consider low-resolution quantization digital-to-analog converters (DACs) at the transmitter to reduce power consumption, which grows exponentially with the number of quantization bits. Additionally, optimizing the total energy efficiency (EE) of the system is a common practice to save the power. However, this metric lacks fairness among users. To ensure this fairness and further enhance EE, we investigate the max-min fairness EE of the RSMA-assisted integrated sensing and communications (ISAC)-LEO satellite system. In this system, the satellite transmits a quantized dual-functional signal serving downlink users while detecting a target. Specifically, we optimize the precoders for maximizing the minimal EE among all users, considering the power consumption of each radio frequency (RF) chain under communication and sensing constraints. To tackle this optimization problem, we proposed an iterative algorithm based on successive convex approximation (SCA) and Dinkelbach's method. Numerical results illustrate that the proposed design outperforms the strategies that aim to maximize the total EE of the system and conventional space-division multiple access (SDMA) in terms of max-min fairness EE and the communication-sensing trade-off. △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: Submitted to IEEE journal

arXiv:2402.03817 [pdf]

Improvement of Frequency Source Phase Noise Reduction Design under Vibration Condition

Authors: Liwei Yin, Yongjiang Shu, Heng Zhang, Yuefei Dai, Xiaopeng Lu, Yunlong Lian, Zhonghua Wang, Yong Ding

Abstract: Reasonable vibration reduction design is an important way to achieve low phase noise index of airborne frequency source output signal. Aiming at the problem of phase noise deterioration of an airborne frequency source under random condition, this paper proposes to improve the vibration reduction mode crystal oscillator and reduce the distance between the barycenter of frequency source and crystal… ▽ More Reasonable vibration reduction design is an important way to achieve low phase noise index of airborne frequency source output signal. Aiming at the problem of phase noise deterioration of an airborne frequency source under random condition, this paper proposes to improve the vibration reduction mode crystal oscillator and reduce the distance between the barycenter of frequency source and crystal oscillator vibration based on the analysis of the relationship between the frequency source and the phase noise of output signal. Experimental results show that the active noise control system achieves 62dB phase noise compensation under the random vibration of 0.04-0.1g*g/Hz amplitude range and 5-2000 Hz frequency range. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: 17 pages,29 figures

MSC Class: D.3.2 ACM Class: B.6.2

arXiv:2312.05764 [pdf, other]

Synthesis of Temporally-Robust Policies for Signal Temporal Logic Tasks using Reinforcement Learning

Authors: Siqi Wang, Shaoyuan Li, Li Yin, Xiang Yin

Abstract: This paper investigates the problem of designing control policies that satisfy high-level specifications described by signal temporal logic (STL) in unknown, stochastic environments. While many existing works concentrate on optimizing the spatial robustness of a system, our work takes a step further by also considering temporal robustness as a critical metric to quantify the tolerance of time unce… ▽ More This paper investigates the problem of designing control policies that satisfy high-level specifications described by signal temporal logic (STL) in unknown, stochastic environments. While many existing works concentrate on optimizing the spatial robustness of a system, our work takes a step further by also considering temporal robustness as a critical metric to quantify the tolerance of time uncertainty in STL. To this end, we formulate two relevant control objectives to enhance the temporal robustness of the synthesized policies. The first objective is to maximize the probability of being temporally robust for a given threshold. The second objective is to maximize the worst-case spatial robustness value within a bounded time shift. We use reinforcement learning to solve both control synthesis problems for unknown systems. Specifically, we approximate both control objectives in a way that enables us to apply the standard Q-learning algorithm. Theoretical bounds in terms of the approximations are also derived. We present case studies to demonstrate the feasibility of our approach. △ Less

Submitted 23 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

Comments: Accepted to ICRA 2024

arXiv:2307.07382 [pdf, other]

Distributed Rate-Splitting Multiple Access for Multilayer Satellite Communications

Authors: Yunnuo Xu, Longfei Yin, Yijie Mao, Wonjae Shin, Bruno Clerckx

Abstract: Future wireless networks, in particular, 5G and beyond, are anticipated to deploy dense Low Earth Orbit (LEO) satellites to provide global coverage and broadband connectivity. However, the limited frequency band and the coexistence of multiple constellations bring new challenges for interference management. In this paper, we propose a robust multilayer interference management scheme for spectrum s… ▽ More Future wireless networks, in particular, 5G and beyond, are anticipated to deploy dense Low Earth Orbit (LEO) satellites to provide global coverage and broadband connectivity. However, the limited frequency band and the coexistence of multiple constellations bring new challenges for interference management. In this paper, we propose a robust multilayer interference management scheme for spectrum sharing in heterogeneous satellite networks with statistical channel state information (CSI) at the transmitter (CSIT) and receivers (CSIR). In the proposed scheme, Rate-Splitting Multiple Access (RSMA), as a general and powerful framework for interference management and multiple access strategies, is implemented distributedly at GEO and LEO satellites, coined Distributed-RSMA (D-RSMA). By doing so, D-RSMA aims to mitigate the interference and boost the user fairness of the overall multilayer satellite system. Specifically, we study the problem of jointly optimizing the GEO/LEO precoders and message splits to maximize the minimum rate among User Terminals (UTs) subject to a transmit power constraint at all satellites. A robust algorithm is proposed to solve the original non-convex optimization problem. Numerical results demonstrate the effectiveness and robustness towards network load and CSI uncertainty of our proposed D-RSMA scheme. Benefiting from the interference management capability, D-RSMA provides significant max-min fairness performance gains compared to several benchmark schemes. △ Less

Submitted 2 May, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

arXiv:2306.07505 [pdf]

Deep learning radiomics for assessment of gastroesophageal varices in people with compensated advanced chronic liver disease

Authors: Lan Wang, Ruiling He, Lili Zhao, Jia Wang, Zhengzi Geng, Tao Ren, Guo Zhang, Peng Zhang, Kaiqiang Tang, Chaofei Gao, Fei Chen, Liting Zhang, Yonghe Zhou, Xin Li, Fanbin He, Hui Huan, Wenjuan Wang, Yunxiao Liang, Juan Tang, Fang Ai, Tingyu Wang, Liyun Zheng, Zhongwei Zhao, Jiansong Ji, Wei Liu , et al. (22 additional authors not shown)

Abstract: Objective: Bleeding from gastroesophageal varices (GEV) is a medical emergency associated with high mortality. We aim to construct an artificial intelligence-based model of two-dimensional shear wave elastography (2D-SWE) of the liver and spleen to precisely assess the risk of GEV and high-risk gastroesophageal varices (HRV). Design: A prospective multicenter study was conducted in patients with… ▽ More Objective: Bleeding from gastroesophageal varices (GEV) is a medical emergency associated with high mortality. We aim to construct an artificial intelligence-based model of two-dimensional shear wave elastography (2D-SWE) of the liver and spleen to precisely assess the risk of GEV and high-risk gastroesophageal varices (HRV). Design: A prospective multicenter study was conducted in patients with compensated advanced chronic liver disease. 305 patients were enrolled from 12 hospitals, and finally 265 patients were included, with 1136 liver stiffness measurement (LSM) images and 1042 spleen stiffness measurement (SSM) images generated by 2D-SWE. We leveraged deep learning methods to uncover associations between image features and patient risk, and thus conducted models to predict GEV and HRV. Results: A multi-modality Deep Learning Risk Prediction model (DLRP) was constructed to assess GEV and HRV, based on LSM and SSM images, and clinical information. Validation analysis revealed that the AUCs of DLRP were 0.91 for GEV (95% CI 0.90 to 0.93, p < 0.05) and 0.88 for HRV (95% CI 0.86 to 0.89, p < 0.01), which were significantly and robustly better than canonical risk indicators, including the value of LSM and SSM. Moreover, DLPR was better than the model using individual parameters, including LSM and SSM images. In HRV prediction, the 2D-SWE images of SSM outperform LSM (p < 0.01). Conclusion: DLRP shows excellent performance in predicting GEV and HRV over canonical risk indicators LSM and SSM. Additionally, the 2D-SWE images of SSM provided more information for better accuracy in predicting HRV than the LSM. △ Less

Submitted 12 June, 2023; originally announced June 2023.

arXiv:2306.06458 [pdf, ps, other]

Rate-Splitting Multiple Access for Simultaneous Multi-User Communication and Multi-Target Sensing

Authors: Kexin Chen, Yijie Mao, Longfei Yin, Chengcheng Xu, Yang Huang

Abstract: In this paper, we initiate the study of rate-splitting multiple access (RSMA) for a mono-static integrated sensing and communication (ISAC) system, where the dual-functional base station (BS) simultaneously communicates with multiple users and detects multiple moving targets. We aim at optimizing the ISAC waveform to jointly maximize the max-min fairness (MMF) rate of the communication users and m… ▽ More In this paper, we initiate the study of rate-splitting multiple access (RSMA) for a mono-static integrated sensing and communication (ISAC) system, where the dual-functional base station (BS) simultaneously communicates with multiple users and detects multiple moving targets. We aim at optimizing the ISAC waveform to jointly maximize the max-min fairness (MMF) rate of the communication users and minimize the largest eigenvalue of the Cramér-Rao bound (CRB) matrix for unbiased estimation. The CRB matrix considered in this work is general as it involves the estimation of angular direction, complex reflection coefficient, and Doppler frequency for multiple moving targets. Simulation results demonstrate that RSMA maintains a larger communication and sensing trade-off than conventional space-division multiple access (SDMA) and it is capable of detecting multiple targets with a high detection accuracy. The finding highlights the potential of RSMA as an effective and powerful strategy for interference management in the general multi-user multi-target ISAC systems. △ Less

Submitted 3 March, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

arXiv:2304.00941 [pdf, other]

Integrated Sensing and Communications Enabled Low Earth Orbit Satellite Systems

Authors: Longfei Yin, Ziang Liu, Bhavani Shankar M. R., Mohammad Alaee-Kerahroodi, Bruno Clerckx

Abstract: Extreme crowding of electromagnetic spectrum in recent years has led to the challenges in designing sensing and communications systems. Both systems require a broad range of bandwidth, thus resulting in competing interests in exploiting the spectrum. Efficient spectrum and hardware utilization have led to the emergence of integrated sensing and communications (ISAC) systems, which have recently em… ▽ More Extreme crowding of electromagnetic spectrum in recent years has led to the challenges in designing sensing and communications systems. Both systems require a broad range of bandwidth, thus resulting in competing interests in exploiting the spectrum. Efficient spectrum and hardware utilization have led to the emergence of integrated sensing and communications (ISAC) systems, which have recently emerged as a candidate 6G technology. In particular, we provide potential techniques, namely the opportunistic ISAC and optimized ISAC. Rate-splitting multiple access (RSMA) is highlighted as an optimized ISAC technique for LEO-ISAC systems due to its effectiveness in simultaneously managing interference and enabling better communication-sensing trade-off performance. △ Less

Submitted 14 February, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

Comments: Accepted to IEEE Network magazine

arXiv:2303.11784 [pdf, ps, other]

Energy Efficiency of Rate-Splitting Multiple Access for Multibeam Satellite System

Authors: **yuan Liu, Yong Liang Guan, Yao Ge, Longfei Yin, Bruno Clerckx

Abstract: Energy efficiency (EE) problem has become an important and major issue in satellite communications. In this paper, we study the beamforming design strategy to maximize the EE of rate-splitting multiple access (RSMA) for the multibeam satellite communications by considering imperfect channel state information at the transmitter (CSIT). We propose an expectation-based robust beamforming algorithm ag… ▽ More Energy efficiency (EE) problem has become an important and major issue in satellite communications. In this paper, we study the beamforming design strategy to maximize the EE of rate-splitting multiple access (RSMA) for the multibeam satellite communications by considering imperfect channel state information at the transmitter (CSIT). We propose an expectation-based robust beamforming algorithm against the imperfect CSIT scenario. By combining the successive convex approximation (SCA) with the penalty function transformation, the nonconvex EE maximization problem can be solved in an iterative manner. The simulation results demonstrate the effectiveness and superiority of RSMA over traditional space division multiple access (SDMA). Moreover, our proposed beamforming algorithm can achieve better EE performance than the conventional beamforming algorithm. △ Less

Submitted 19 March, 2023; originally announced March 2023.

Comments: 5 pages, 1 figure, accepted by the 2023 IEEE Vehicular Technology Conference

arXiv:2302.12713 [pdf, other]

A Transformer-based Deep Learning Algorithm to Auto-record Undocumented Clinical One-Lung Ventilation Events

Authors: Zhihua Li, Alexander Nagrebetsky, Sylvia Ranjeva, Nan Bi, Dianbo Liu, Marcos F. Vidal Melo, Timothy Houle, Lijun Yin, Hao Deng

Abstract: As a team studying the predictors of complications after lung surgery, we have encountered high missingness of data on one-lung ventilation (OLV) start and end times due to high clinical workload and cognitive overload during surgery. Such missing data limit the precision and clinical applicability of our findings. We hypothesized that available intraoperative mechanical ventilation and physiologi… ▽ More As a team studying the predictors of complications after lung surgery, we have encountered high missingness of data on one-lung ventilation (OLV) start and end times due to high clinical workload and cognitive overload during surgery. Such missing data limit the precision and clinical applicability of our findings. We hypothesized that available intraoperative mechanical ventilation and physiological time-series data combined with other clinical events could be used to accurately predict missing start and end times of OLV. Such a predictive model can recover existing miss-documented records and relieves the documentation burden by deploying it in clinical settings. To this end, we develop a deep learning model to predict the occurrence and timing of OLV based on routinely collected intraoperative data. Our approach combines the variables' spatial and frequency domain features, using Transformer encoders to model the temporal evolution and convolutional neural network to abstract frequency-of-interest from wavelet spectrum images. The performance of the proposed method is evaluated on a benchmark dataset curated from Massachusetts General Hospital (MGH) and Brigham and Women's Hospital (BWH). Experiments show our approach outperforms baseline methods significantly and produces a satisfactory accuracy for clinical use. △ Less

Submitted 16 February, 2023; originally announced February 2023.

Comments: Accepted to AAAI-2023 Workshop on Health Intelligence

arXiv:2210.16805 [pdf, other]

SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement

Authors: Zhibin Qiu, Mengfan Fu, Yinfeng Yu, LiLi Yin, Fuchun Sun, Hao Huang

Abstract: Diffusion model, as a new generative model which is very popular in image generation and audio synthesis, is rarely used in speech enhancement. In this paper, we use the diffusion model as a module for stochastic refinement. We propose SRTNet, a novel method for speech enhancement via Stochastic Refinement in complete Time domain. Specifically, we design a joint network consisting of a determinist… ▽ More Diffusion model, as a new generative model which is very popular in image generation and audio synthesis, is rarely used in speech enhancement. In this paper, we use the diffusion model as a module for stochastic refinement. We propose SRTNet, a novel method for speech enhancement via Stochastic Refinement in complete Time domain. Specifically, we design a joint network consisting of a deterministic module and a stochastic module, which makes up the ``enhance-and-refine'' paradigm. We theoretically demonstrate the feasibility of our method and experimentally prove that our method achieves faster training, faster sampling and higher quality. Our code and enhanced samples are available at https://github.com/zhibinQiu/SRTNet.git. △ Less

Submitted 30 October, 2022; originally announced October 2022.

arXiv:2205.02462 [pdf, ps, other]

Rate-Splitting Multiple Access for 6G -- Part II: Interplay with Integrated Sensing and Communications

Authors: Longfei Yin, Yijie Mao, Onur Dizdar, Bruno Clerckx

Abstract: This letter is the second part of a three-part tutorial focusing on rate-splitting multiple access (RSMA) for 6G. As Part II of the tutorial, this letter addresses the interplay between RSMA and integrated radar sensing and communications (ISAC). In particular, we introduce a general RSMAassisted ISAC architecture, where the ISAC platform has a dual capability to simultaneously communicate with do… ▽ More This letter is the second part of a three-part tutorial focusing on rate-splitting multiple access (RSMA) for 6G. As Part II of the tutorial, this letter addresses the interplay between RSMA and integrated radar sensing and communications (ISAC). In particular, we introduce a general RSMAassisted ISAC architecture, where the ISAC platform has a dual capability to simultaneously communicate with downlink users and probe detection signals to a moving target. Then, the metrics of radar sensing and communications are respectively introduced, followed by a RSMA-assisted ISAC waveform design example which jointly minimizes the Cramer-Rao bound (CRB) of target estimation and maximizes the minimum fairness rate (MFR) amongst communication users subject to the per-antenna power constraint. The superiority of RSMA-assisted ISAC is verifed through simulation results in both terrestrial and satellite scenarios. RSMA is demonstrated to be a powerful multiple access and interference management strategy for ISAC, and provides a better communication-sensing trade-off compared with the conventional benchmark strategies. Consequently, RSMA is a promising technology for next generation multiple access (NGMA) and future networks such as 6G and beyond. △ Less

Submitted 5 May, 2022; originally announced May 2022.

arXiv:2204.10704 [pdf, other]

SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite

Authors: Runzhe Zhu, Ling Yin, Mingze Yang, Fei Wu, Yuncheng Yang, Wenbo Hu

Abstract: Cross-view image matching aims to match images of the same target scene acquired from different platforms. With the rapid development of drone technology, cross-view matching by neural network models has been a widely accepted choice for drone position or navigation. However, existing public datasets do not include images obtained by drones at different heights, and the types of scenes are relativ… ▽ More Cross-view image matching aims to match images of the same target scene acquired from different platforms. With the rapid development of drone technology, cross-view matching by neural network models has been a widely accepted choice for drone position or navigation. However, existing public datasets do not include images obtained by drones at different heights, and the types of scenes are relatively homogeneous, which yields issues in assessing a model's capability to adapt to complex and changing scenes. In this end, we present a new cross-view dataset called SUES-200 to address these issues. SUES-200 contains 24120 images acquired by the drone at four different heights and corresponding satellite view images of the same target scene. To the best of our knowledge, SUES-200 is the first public dataset that considers the differences generated in aerial photography captured by drones flying at different heights. In addition, we developed an evaluation for efficient training, testing and evaluation of cross-view matching models, under which we comprehensively analyze the performance of nine architectures. Then, we propose a robust baseline model for use with SUES-200. Experimental results show that SUES-200 can help the model to learn highly discriminative features of the height of the drone. △ Less

Submitted 21 January, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

arXiv:2203.14067 [pdf, ps, other]

Rate-Splitting Multiple Access for Dual-Functional Radar-Communication Satellite Systems

Authors: Longfei Yin, Bruno Clerckx

Abstract: In this paper, we consider a multi-antenna dual-functional radar-communication (DFRC) satellite system, where the satellite has a dual capability to simultaneously communicate with downlink satellite users (SUs) and probe detection signals to a moving target. To design an appropriate DFRC waveform, we investigate the rate-splitting multiple access (RSMA)-assisted DFRC beamfoming, and employ the Cr… ▽ More In this paper, we consider a multi-antenna dual-functional radar-communication (DFRC) satellite system, where the satellite has a dual capability to simultaneously communicate with downlink satellite users (SUs) and probe detection signals to a moving target. To design an appropriate DFRC waveform, we investigate the rate-splitting multiple access (RSMA)-assisted DFRC beamfoming, and employ the Cramer-Rao bound (CRB) as a radar performance metric, which represents a lower bound on the variance of unbiased estimators. The beamforming is optimized to minimize the CRB subject to quality of service (QoS) constraints of SUs and a per-feed transmit power budget. Satellite communication and detecting ground/ sea objects in a bistatic mode are accomplished simultaneously using the DFRC waveform we designed. Simulation results demonstrate that the proposed RSMA-assisted DFRC beamforming outperforms the conventional space-division multiple access (SDMA) strategy in terms of the communication-sensing trade-off and target estimation performance in a multibeam satellite system. △ Less

Submitted 26 March, 2022; originally announced March 2022.

Comments: WCNC 2022

arXiv:2111.14074 [pdf, ps, other]

Rate-Splitting Multiple Access for Satellite-Terrestrial Integrated Networks:Benefits of Coordination and Cooperation

Authors: Longfei Yin, Bruno Clerckx

Abstract: This work studies the joint beamforming design problem of achieving max-min rate fairness in a satellite-terrestrial integrated network (STIN) where the satellite provides wide coverage to multibeam multicast satellite users (SUs), and the terrestrial base station (BS) serves multiple cellular users (CUs) in a densely populated area. Both the satellite and BS operate in the same frequency band. Si… ▽ More This work studies the joint beamforming design problem of achieving max-min rate fairness in a satellite-terrestrial integrated network (STIN) where the satellite provides wide coverage to multibeam multicast satellite users (SUs), and the terrestrial base station (BS) serves multiple cellular users (CUs) in a densely populated area. Both the satellite and BS operate in the same frequency band. Since rate-splitting multiple access (RSMA) has recently emerged as a promising strategy for non-orthogonal transmission and robust interference management in multi-antenna wireless networks, we present two RSMA-based STIN schemes, namely the coordinated scheme relying on channel state information (CSI) sharing and the cooperative scheme relying on CSI and data sharing. Our objective is to maximize the minimum fairness rate amongst all SUs and CUs subject to transmit power constraints at the satellite and the BS. A joint beamforming algorithm is proposed to reformulate the original problem into an approximately equivalent convex one which can be iteratively solved. Moreover, an expectation-based robust joint beamforming algorithm is proposed against the practical environment when satellite channel phase uncertainties are considered. Simulation results demonstrate the effectiveness and robustness of our proposed RSMA schemes for STIN, and exhibit significant performance gains compared with various traditional transmission strategies. △ Less

Submitted 28 November, 2021; originally announced November 2021.

Comments: Submitted for publication

arXiv:2109.10471 [pdf, other]

The First Vision For Vitals (V4V) Challenge for Non-Contact Video-Based Physiological Estimation

Authors: Ambareesh Revanur, Zhihua Li, Umur A. Ciftci, Lijun Yin, Laszlo A. Jeni

Abstract: Telehealth has the potential to offset the high demand for help during public health emergencies, such as the COVID-19 pandemic. Remote Photoplethysmography (rPPG) - the problem of non-invasively estimating blood volume variations in the microvascular tissue from video - would be well suited for these situations. Over the past few years a number of research groups have made rapid advances in remot… ▽ More Telehealth has the potential to offset the high demand for help during public health emergencies, such as the COVID-19 pandemic. Remote Photoplethysmography (rPPG) - the problem of non-invasively estimating blood volume variations in the microvascular tissue from video - would be well suited for these situations. Over the past few years a number of research groups have made rapid advances in remote PPG methods for estimating heart rate from digital video and obtained impressive results. How these various methods compare in naturalistic conditions, where spontaneous behavior, facial expressions, and illumination changes are present, is relatively unknown. To enable comparisons among alternative methods, the 1st Vision for Vitals Challenge (V4V) presented a novel dataset containing high-resolution videos time-locked with varied physiological signals from a diverse population. In this paper, we outline the evaluation protocol, the data used, and the results. V4V is to be held in conjunction with the 2021 International Conference on Computer Vision. △ Less

Submitted 21 September, 2021; originally announced September 2021.

Comments: ICCVw'21. V4V Dataset and Challenge: https://vision4vitals.github.io/

arXiv:2109.10273 [pdf, ps, other]

Secrecy Offloading Rate Maximization for Multi-Access Mobile Edge Computing Networks

Authors: Mingxiong Zhao, Huiqi Bao, Li Yin, Jian** Yao, Tony Q. S. Quek

Abstract: This letter considers a multi-access mobile edge computing (MEC) network consisting of multiple users, multiple base stations, and a malicious eavesdropper. Specifically, the users adopt the partial offloading strategy by partitioning the computation task into several parts. One is executed locally and the others are securely offloaded to multiple MEC servers integrated into the base stations by l… ▽ More This letter considers a multi-access mobile edge computing (MEC) network consisting of multiple users, multiple base stations, and a malicious eavesdropper. Specifically, the users adopt the partial offloading strategy by partitioning the computation task into several parts. One is executed locally and the others are securely offloaded to multiple MEC servers integrated into the base stations by leveraging the physical layer security to combat the eavesdrop**. We jointly optimize power allocation, task partition, subcarrier allocation, and computation resource to maximize the secrecy offloading rate of the users, subject to communication and computation resource constraints. Numerical results demonstrate that our proposed scheme can respectively improve the secrecy offloading rate 1.11%--1.39% and 15.05%--17.35% (versus the increase of tasks' latency requirements), and 1.30%--1.75% and 6.08%--9.22% (versus the increase of the maximum transmit power) compared with the two benchmarks. Moreover, it further emphasizes the necessity of conducting computation offloading over multiple MEC servers. △ Less

Submitted 21 September, 2021; originally announced September 2021.

Comments: Double-column, 5 pages, 3 figures, accepted for publication at the IEEE Communications Letter

arXiv:2104.00220 [pdf, ps, other]

Rate-Splitting Multiple Access for Multi-Antenna Broadcast Channels with Statistical CSIT

Authors: Longfei Yin, Bruno Clerckx, Yijie Mao

Abstract: Rate-splitting multiple access (RSMA) is a promising technique for downlink multi-antenna communications owning to its capability of enhancing the system performance in a wide range of network loads, user deployments and channel state information at the transmitter (CSIT) inaccuracies. In this paper, we investigate the achievable rate performance of RSMA in a multi-user multiple-input single-outpu… ▽ More Rate-splitting multiple access (RSMA) is a promising technique for downlink multi-antenna communications owning to its capability of enhancing the system performance in a wide range of network loads, user deployments and channel state information at the transmitter (CSIT) inaccuracies. In this paper, we investigate the achievable rate performance of RSMA in a multi-user multiple-input single-output (MU-MISO) network where only slow-varying statistical channel state information (CSI) is available at the transmitter. RSMA-based statistical beamforming and the split of the common stream is optimized with the objective of maximizing the minimum user rate subject to a sum power budget of the transmitter. Two statistical CSIT scenarios are investigated, namely the Rayleigh fading channels with only spatial correlations known at the transmitter, and the uniform linear array (ULA) deployment with only channel amplitudes and mean of phase known at the transmitter. Numerical results demonstrate the explicit max min fairness (MMF) rate gain of RSMA over space division multiple access (SDMA) in both scenarios. Moreover, we demonstrate that RSMA is more robust to the inaccuracy of statistical CSIT. △ Less

Submitted 31 March, 2021; originally announced April 2021.

Comments: conference

arXiv:2104.00206 [pdf, ps, other]

Rate-Splitting Multiple Access for Multigroup Multicast Cellular and Satellite Communications: PHY Layer Design and Link-Level Simulations

Authors: Longfei Yin, Onur Dizdar, Bruno Clerckx

Abstract: Rate-splitting multiple access (RSMA), relying on linearly precoded rate-splitting (RS) at the transmitter and successive interference cancellation (SIC) at the receivers has emerged as a powerful and flexible multiple access strategy for downlink multi-user multi-antenna systems. Through message splitting and the transmission of both common and private messages, RSMA has been demonstrated to be a… ▽ More Rate-splitting multiple access (RSMA), relying on linearly precoded rate-splitting (RS) at the transmitter and successive interference cancellation (SIC) at the receivers has emerged as a powerful and flexible multiple access strategy for downlink multi-user multi-antenna systems. Through message splitting and the transmission of both common and private messages, RSMA has been demonstrated to be a robust interference management strategy which enables partially decoding interference and partially treating interference as noise. In this work, we consider the application of RSMA in a multigroup multicast scenario, where each message is intended to a group of users. By leveraging the recent results on the max-min fair (MMF) optimization problem of RSMA-based multigroup multicast beamforming with imperfect channel state information at the transmitter (CSIT), we investigate the design of the physical (PHY) layer including finite length polar coding, finite alphabet modulation, adaptive modulation and coding (AMC) algorithm, and SIC receivers, etc. Link-level simulation (LLS) results verify the superiority of RSMA-based multigroup multicast transmission compared with space-division multiple access (SDMA)-based strategy in both cellular systems and multibeam satellite systems. △ Less

Submitted 31 March, 2021; originally announced April 2021.

Comments: conference

arXiv:2102.05792 [pdf, other]

Rate-Splitting Multiple Access for Multigateway Multibeam Satellite Systems with Feeder Link Interference

Authors: Zhi Wen Si, Longfei Yin, Bruno Clerckx

Abstract: This paper studies the precoder design problem of achieving max-min fairness (MMF) amongst users in multigateway multibeam satellite communication systems with feeder link interference. We propose a beamforming strategy based on a newly introduced transmission scheme known as rate-splitting multiple access (RSMA). RSMA relies on multi-antenna rate-splitting at the transmitter and successive interf… ▽ More This paper studies the precoder design problem of achieving max-min fairness (MMF) amongst users in multigateway multibeam satellite communication systems with feeder link interference. We propose a beamforming strategy based on a newly introduced transmission scheme known as rate-splitting multiple access (RSMA). RSMA relies on multi-antenna rate-splitting at the transmitter and successive interference cancellation (SIC) at the receivers, such that the intended message for a user is split into a common part and a private part and the interference is partially decoded and partially treated as noise. In this paper, we formulate the MMF problem subject to per-antenna power constraints at the satellite for the system with imperfect channel state information at the transmitter (CSIT). We also consider the case of two-stage precoding which is assisted by on-board processing (OBP) at the satellite. Numerical results obtained through simulations for RSMA and the conventional linear precoding method are compared. When RSMA is used, MMF rate gain is promised and this gain increases when OBP is used. RSMA is proven to be promising for multigateway multibeam satellite systems whereby there are various practical challenges such as feeder link interference, CSIT uncertainty, per-antenna power constraints, uneven user distribution per beam and frame-based processing. △ Less

Submitted 10 February, 2021; originally announced February 2021.

Comments: Submitted for publication

arXiv:2008.05091 [pdf, other]

Rate-Splitting Multiple Access for Multigroup Multicast and Multibeam Satellite Systems

Authors: Longfei Yin, Bruno Clerckx

Abstract: This work focuses on the promising Rate-Splitting Multiple Access (RSMA) and its beamforming design problem to achieve max-min fairness (MMF) among multiple co-channel multicast groups with imperfect channel state information at the transmitter (CSIT). Contrary to the conventional linear precoding (NoRS) that relies on fully treating any residual interference as noise, we consider a novel multigro… ▽ More This work focuses on the promising Rate-Splitting Multiple Access (RSMA) and its beamforming design problem to achieve max-min fairness (MMF) among multiple co-channel multicast groups with imperfect channel state information at the transmitter (CSIT). Contrary to the conventional linear precoding (NoRS) that relies on fully treating any residual interference as noise, we consider a novel multigroup multicast beamforming strategy based on RSMA. RSMA relies on linearly precoded Rate-Splitting (RS) at the transmitter and Successive Interference Cancellation (SIC) at the receivers, and has recently been shown to enable a flexible framework for non-orthogonal transmission and robust interference management in multi-antenna wireless networks. In this work, we characterize the MMF Degrees-of-Freedom (DoF) achieved by RS and NoRS in multigroup multicast with imperfect CSIT and demonstrate the benefits of RS strategies for both underloaded and overloaded scenarios. Motivated by the DoF analysis, we then formulate a generic transmit power constrained optimization problem to achieve MMF rate performance. The superiority of RS-based multigroup multicast beamforming compared with NoRS is demonstrated via simulations in both terrestrial and multibeam satellite systems. In particular, due to the characteristics and challenges of multibeam satellite communications, our proposed RS strategy is shown promising to manage its interbeam interference. △ Less

Submitted 8 November, 2020; v1 submitted 11 August, 2020; originally announced August 2020.

Comments: Submitted for publication. arXiv admin note: text overlap with arXiv:2002.01731

arXiv:2006.16503 [pdf, other]

Vehicle Re-ID for Surround-view Camera System

Authors: Zizhang Wu, Man Wang, Lingxiao Yin, Weiwei Sun, Jason Wang, Huangbin Wu

Abstract: The vehicle re-identification (ReID) plays a critical role in the perception system of autonomous driving, which attracts more and more attention in recent years. However, to our best knowledge, there is no existing complete solution for the surround-view system mounted on the vehicle. In this paper, we argue two main challenges in above scenario: i) In single camera view, it is difficult to recog… ▽ More The vehicle re-identification (ReID) plays a critical role in the perception system of autonomous driving, which attracts more and more attention in recent years. However, to our best knowledge, there is no existing complete solution for the surround-view system mounted on the vehicle. In this paper, we argue two main challenges in above scenario: i) In single camera view, it is difficult to recognize the same vehicle from the past image frames due to the fisheye distortion, occlusion, truncation, etc. ii) In multi-camera view, the appearance of the same vehicle varies greatly from different camera's viewpoints. Thus, we present an integral vehicle Re-ID solution to address these problems. Specifically, we propose a novel quality evaluation mechanism to balance the effect of tracking box's drift and target's consistency. Besides, we take advantage of the Re-ID network based on attention mechanism, then combined with a spatial constraint strategy to further boost the performance between different cameras. The experiments demonstrate that our solution achieves state-of-the-art accuracy while being real-time in practice. Besides, we will release the code and annotated fisheye dataset for the benefit of community. △ Less

Submitted 29 June, 2020; originally announced June 2020.

Comments: CVPR 2020 workshop on Scalability in Autonomous Driving

arXiv:2003.05313 [pdf, ps, other]

Ghost Imaging with the Optimal Binary Sampling

Authors: Dongyue Yang, Guohua Wu, Bin Luo, Longfei Yin

Abstract: To extract the maximum information about the object from a series of binary samples in ghost imaging applications, we propose and demonstrate a framework for optimizing the performance of ghost imaging with binary sampling to approach the results without binarization. The method is based on maximizing the information content of the signal arm detection, by formulating and solving the appropriate p… ▽ More To extract the maximum information about the object from a series of binary samples in ghost imaging applications, we propose and demonstrate a framework for optimizing the performance of ghost imaging with binary sampling to approach the results without binarization. The method is based on maximizing the information content of the signal arm detection, by formulating and solving the appropriate parameter estimation problem - finding the binarization threshold that would yield the reconstructed image with optimal Fisher information properties. Applying the 1-bit quantized Poisson statistics to a ghost-imaging model with pseudo-thermal light, we derive the fundamental limit, i.e., the Cramer-Rao lower bound, as the benchmark for the evaluation of the accuracy of the estimator. Our theoertical model and experimental results suggest that, with the optimal binarization threshold, coincident with the statistical mean of all bucket samples, and large number of measurements, the performance of binary sampling GI can approach that of the ordinary one without binarization. △ Less

Submitted 11 March, 2020; originally announced March 2020.

arXiv:2002.01731 [pdf, ps, other]

Rate-Splitting Multiple Access for Multibeam Satellite Communications

Authors: Longfei Yin, Bruno Clerckx

Abstract: This paper studies the beamforming design problem to achieve max-min fairness (MMF) in multibeam satellite communications. Contrary to the conventional linear precoding (NoRS) that relies on fully treating any residual interference as noise, we consider a novel multibeam multicast beamforming strategy based on Rate-Splitting Multiple Access (RSMA). RSMA relies on linearly precoded ratesplitting (R… ▽ More This paper studies the beamforming design problem to achieve max-min fairness (MMF) in multibeam satellite communications. Contrary to the conventional linear precoding (NoRS) that relies on fully treating any residual interference as noise, we consider a novel multibeam multicast beamforming strategy based on Rate-Splitting Multiple Access (RSMA). RSMA relies on linearly precoded ratesplitting (RS) at the transmitter and Successive Interference Cancellation (SIC) at receivers to enable a flexible framework for non-orthogonal transmission and robust interbeam interference management. Aiming at achieving MMF among multiple co-channel multicast beams, a per-feed available power constrained optimization problem is formulated with different quality of channel state information at the transmitter (CSIT). The superiority of RS for multigroup multicast and multibeam satellite communication systems compared with conventional scheme (NoRS) is demonstrated via simulations. △ Less

Submitted 13 March, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

arXiv:1912.03685 [pdf, other]

SolarNet: A Deep Learning Framework to Map Solar Power Plants In China From Satellite Imagery

Authors: Xin Hou, Biao Wang, Wanqi Hu, Lei Yin, Haishan Wu

Abstract: Renewable energy such as solar power is critical to fight the ever more serious climate change. China is the world leading installer of solar panel and numerous solar power plants were built. In this paper, we proposed a deep learning framework named SolarNet which is designed to perform semantic segmentation on large scale satellite imagery data to detect solar farms. SolarNet has successfully ma… ▽ More Renewable energy such as solar power is critical to fight the ever more serious climate change. China is the world leading installer of solar panel and numerous solar power plants were built. In this paper, we proposed a deep learning framework named SolarNet which is designed to perform semantic segmentation on large scale satellite imagery data to detect solar farms. SolarNet has successfully mapped 439 solar farms in China, covering near 2000 square kilometers, equivalent to the size of whole Shenzhen city or two and a half of New York city. To the best of our knowledge, it is the first time that we used deep learning to reveal the locations and sizes of solar farms in China, which could provide insights for solar power companies, market analysts and the government. △ Less

Submitted 10 December, 2019; v1 submitted 8 December, 2019; originally announced December 2019.

arXiv:1910.04350 [pdf, other]

Design and Performance Analysis of Multi-scale NOMA for 5G Positioning

Authors: Lu Yin, Jiameng Cao, Zhongliang Deng, Qiang Ni, Song Li, Xinyu Zheng, Hanhua Wang

Abstract: This paper presents a feasibility study for a novel positioning-communication integrated signal called Multi-Scale Non-Orthogonal Multiple Access (MS-NOMA) for 5G positioning. One of the main differences between the MS-NOMA and the traditional positioning signal is MS-NOMA supports configurable powers for different positioning users (P-Users) to obtain better ranging accuracy and signal coverage.… ▽ More This paper presents a feasibility study for a novel positioning-communication integrated signal called Multi-Scale Non-Orthogonal Multiple Access (MS-NOMA) for 5G positioning. One of the main differences between the MS-NOMA and the traditional positioning signal is MS-NOMA supports configurable powers for different positioning users (P-Users) to obtain better ranging accuracy and signal coverage. Our major contributions are: Firstly, we present the MS-NOMA signal and analyze the Bit Error Rate (BER) and ranging accuracy by deriving their simple expressions. The results show the interaction between the communication and positioning signals is rather limited, and it is feasible to use the MS-NOMA signal to achieve high positioning accuracy. Secondly, for an optimal positioning accuracy and signal coverage, we model the power allocation problem for MS-NOMA signal as a convex optimization problem by satisfying the QoS (Quality of Services) requirement and other constraints. Then, we propose a novel Positioning-Communication Joint Power Allocation (PCJPA) algorithm which allocates the powers of all P-Users iteratively. The theoretical and numerical results show our proposed MS-NOMA signal has great improvements of ranging/positioning accuracy than traditional PRS (Positioning Reference Signal) in 5G, and improves the coverage dramatically which means more P-Users could locate their positions without suffering the near-far effect. △ Less

Submitted 9 October, 2019; originally announced October 2019.

arXiv:1905.13544 [pdf]

doi 10.1109/TIM.2016.2600918

A Novel Compensation Algorithm for Thickness Measurement Immune to Lift-Off Variations Using Eddy Current Method

Authors: Mingyang Lu, Liyuan Yin, Anthony J. Peyton, Wuliang Yin

Abstract: Lift-off variation causes errors in the eddy current thickness measurements of metallic plates. In this paper, we have developed an algorithm that can compensate for this variation and produce an index that is linked to the thickness, but is virtually independent of lift-off. This index, termed as the compensated peak frequency, can be obtained from the measured multifrequency inductance spectral… ▽ More Lift-off variation causes errors in the eddy current thickness measurements of metallic plates. In this paper, we have developed an algorithm that can compensate for this variation and produce an index that is linked to the thickness, but is virtually independent of lift-off. This index, termed as the compensated peak frequency, can be obtained from the measured multifrequency inductance spectral data using the algorithm we developed in this paper. This method has been derived through mathematical manipulation and verified by both the simulation and experimental data. Accuracy in the thickness measurements at different lift-offs proved to be within 2%. △ Less

Submitted 28 May, 2019; originally announced May 2019.

Comments: arXiv admin note: text overlap with arXiv:1905.12515

arXiv:1905.12515 [pdf]

doi 10.1109/TIM.2017.2728338

Reducing the Lift-Off Effect on Permeability Measurement for Magnetic Plates From Multifrequency Induction Data

Authors: Mingyang Lu, Wenqian Zhu, Liyuan Yin, Anthony J. Peyton, Wuliang Yin, Zhigang Qu

Abstract: Liftoff variation causes errors in eddy current measurement of nonmagnetic plates as well as magnetic plates. For nonmagnetic plates, previous work has been carried out to address the issue. In this paper, we follow a similar strategy, but try to reduce the lift-off effect on another index, zerocrossing frequency for magnetic plates. This modified index, termed as the compensated zero-crossing fre… ▽ More Liftoff variation causes errors in eddy current measurement of nonmagnetic plates as well as magnetic plates. For nonmagnetic plates, previous work has been carried out to address the issue. In this paper, we follow a similar strategy, but try to reduce the lift-off effect on another index, zerocrossing frequency for magnetic plates. This modified index, termed as the compensated zero-crossing frequency, can be obtained from the measured multifrequency inductance spectral data using the algorithm we developed in this paper. Since the zero-crossing frequency can be compensated, the permeability of magnetic plates can finally be predicted by deriving the relation between the permeability and zero-crossing frequency from Dodd and Deeds method. We have derived the method through mathematical manipulation and verified it by both simulation and experimental data. The permeability error caused by liftoff can be reduced within 7.5%. △ Less

Submitted 28 May, 2019; originally announced May 2019.

arXiv:1903.01564 [pdf, other]

Life detection strategy based on infrared vision and ultra-wideband radar data fusion

Authors: Li Yin, Y. M. Zhou

Abstract: The life detection method based on a single type of information source cannot meet the requirement of post-earthquake rescue due to its limitations in different scenes and bad robustness in life detection. This paper proposes a method based on deep neural network for multi-sensor decision-level fusion which concludes Convolutional Neural Network and Long Short Term Memory neural network (CNN+LSTM)… ▽ More The life detection method based on a single type of information source cannot meet the requirement of post-earthquake rescue due to its limitations in different scenes and bad robustness in life detection. This paper proposes a method based on deep neural network for multi-sensor decision-level fusion which concludes Convolutional Neural Network and Long Short Term Memory neural network (CNN+LSTM). Firstly, we calculate the value of the life detection probability of each sensor with various methods in the same scene simultaneously, which will be gathered to make samples for inputs of the deep neural network. Then we use Convolutional Neural Network (CNN) to extract the distribution characteristics of the spatial domain from inputs which is the two-channel combination of the probability values and the smoothing probability values of each life detection sensor respectively. Furthermore, the sequence time relationship of the outputs from the last layers will be analyzed with Long Short Term Memory (LSTM) layers, then we concatenate the results from three branches of LSTM layers. Finally, two sets of LSTM neural networks that is different from the previous layers are used to integrate the three branches of the features, and the results of the two classifications are output using the fully connected network with Binary Cross Entropy (BEC) loss function. Therefore, the classification results of the life detection can be concluded accurately with the proposed algorithm. △ Less

Submitted 16 May, 2019; v1 submitted 27 February, 2019; originally announced March 2019.

Comments: 6 pages, 7 figures, conference

Showing 1–31 of 31 results for author: Yin, L