Skip to main content

Showing 1–50 of 266 results for author: Li, Q

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19311  [pdf, other

    cs.CR cs.SD eess.AS

    Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems

    Authors: Zheng Fang, Tao Wang, Lingchen Zhao, Shenyi Zhang, Bowen Li, Yunjie Ge, Qi Li, Chao Shen, Qian Wang

    Abstract: In recent years, extensive research has been conducted on the vulnerability of ASR systems, revealing that black-box adversarial example attacks pose significant threats to real-world ASR systems. However, most existing black-box attacks rely on queries to the target ASRs, which is impractical when queries are not permitted. In this paper, we propose ZQ-Attack, a transfer-based adversarial attack… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: To appear in the Proceedings of The ACM Conference on Computer and Communications Security (CCS), 2024

  2. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Ya**g Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, **g Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  3. arXiv:2406.14869  [pdf, other

    eess.SP

    Cost-Effective RF Fingerprinting Based on Hybrid CVNN-RF Classifier with Automated Multi-Dimensional Early-Exit Strategy

    Authors: Jiayan Gan, Zhixing Du, Qiang Li, Huaizong Shao, **gran Lin, Ye Pan, Zhongyi Wen, Shafei Wang

    Abstract: While the Internet of Things (IoT) technology is booming and offers huge opportunities for information exchange, it also faces unprecedented security challenges. As an important complement to the physical layer security technologies for IoT, radio frequency fingerprinting (RFF) is of great interest due to its difficulty in counterfeiting. Recently, many machine learning (ML)-based RFF algorithms h… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Internet of Things Journal

  4. arXiv:2406.12646  [pdf, other

    eess.IV cs.AI cs.CV

    An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation

    Authors: Qin Li, Yizhe Zhang, Yan Li, Jun Lyu, Meng Liu, Longyu Sun, Mengting Sun, Qirong Li, Wenyue Mao, Xinran Wu, Ya**g Zhang, Yinghua Chu, Shuo Wang, Chengyan Wang

    Abstract: The segmentation foundation model, e.g., Segment Anything Model (SAM), has attracted increasing interest in the medical image community. Early pioneering studies primarily concentrated on assessing and improving SAM's performance from the perspectives of overall accuracy and efficiency, yet little attention was given to the fairness considerations. This oversight raises questions about the potenti… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to MICCAI-2024

  5. arXiv:2406.10358  [pdf, other

    cs.CR eess.SY

    I Still See You: Why Existing IoT Traffic Resha** Fails

    Authors: Su Wang, Keyang Yu, Qi Li, Dong Chen

    Abstract: The Internet traffic data produced by the Internet of Things (IoT) devices are collected by Internet Service Providers (ISPs) and device manufacturers, and often shared with their third parties to maintain and enhance user services. Unfortunately, on-path adversaries could infer and fingerprint users' sensitive privacy information such as occupancy and user activities by analyzing these network tr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: EWSN'24 paper accepted, to appear

  6. arXiv:2406.07256  [pdf, ps, other

    cs.SD cs.AI eess.AS

    AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection

    Authors: Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, Binbin Zhang, Jun Du, Jia Bin, Ming Li

    Abstract: The rapid advancements in speech technologies over the past two decades have led to human-level performance in tasks like automatic speech recognition (ASR) for fluent speech. However, the efficacy of these models diminishes when applied to atypical speech, such as stuttering. This paper introduces AS-70, the first publicly available Mandarin stuttered speech dataset, which stands out as the large… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  7. arXiv:2405.16765  [pdf, ps, other

    cs.LG eess.SP

    Study of Robust Direction Finding Based on Joint Sparse Representation

    Authors: Y. Li, W. Xiao, L. Zhao, Z. Huang, Q. Li, L. Li, R. C. de Lamare

    Abstract: Standard Direction of Arrival (DOA) estimation methods are typically derived based on the Gaussian noise assumption, making them highly sensitive to outliers. Therefore, in the presence of impulsive noise, the performance of these methods may significantly deteriorate. In this paper, we model impulsive noise as Gaussian noise mixed with sparse outliers. By exploiting their statistical differences,… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  8. arXiv:2405.11856  [pdf, other

    cs.RO eess.SY

    Modeling and simulation of a mechanism for suppressing the flip** problem of a jum** robot

    Authors: Qi Li, Liang Peng, Zhiyuan Wu, Pengda Ye, Weitao Zhang, Yi Xu, Qing Shi

    Abstract: In order to solve the problem of stable jum** of micro robot, we design a special mechanism: elastic passive joint (EPJ). EPJ can assist in achieving smooth jum** through the opening-closing process when the robot jumps. First, we introduce the composition and operation principle of EPJ, and perform a dynamic modeling of the robot's jum** process. Then, in order to verify the effectiveness o… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  9. arXiv:2405.09753  [pdf, other

    cs.IT eess.SP

    Stacked Intelligent Metasurfaces for Holographic MIMO Aided Cell-Free Networks

    Authors: Qingchao Li, Mohammed El-Hajjar, Chao Xu, Jiancheng An, Chau Yuen, Lajos Hanzo

    Abstract: Large-scale multiple-input and multiple-output (MIMO) systems are capable of achieving high date rate. However, given the high hardware cost and excessive power consumption of massive MIMO systems, as a remedy, intelligent metasurfaces have been designed for efficient holographic MIMO (HMIMO) systems. In this paper, we propose a HMIMO architecture based on stacked intelligent metasurfaces (SIM) fo… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  10. arXiv:2405.06186  [pdf, other

    cs.IT eess.SP

    Sensing-Assisted Adaptive Channel Contention for Mobile Delay-Sensitive Communications

    Authors: Bojie Lv, Qianren Li, Rui Wang

    Abstract: This paper proposes an adaptive channel contention mechanism to optimize the queuing performance of a distributed millimeter wave (mmWave) uplink system with the capability of environment and mobility sensing. The mobile agents determine their back-off timer parameters according to their local knowledge of the uplink queue lengths, channel quality, and future channel statistics, where the channel… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  11. arXiv:2405.05787  [pdf, other

    cs.RO cs.CV eess.SY

    Autonomous Robotic Ultrasound System for Liver Follow-up Diagnosis: Pilot Phantom Study

    Authors: Tianpeng Zhang, Sekeun Kim, Jerome Charton, Haitong Ma, Kyungsang Kim, Na Li, Quanzheng Li

    Abstract: The paper introduces a novel autonomous robot ultrasound (US) system targeting liver follow-up scans for outpatients in local communities. Given a computed tomography (CT) image with specific target regions of interest, the proposed system carries out the autonomous follow-up scan in three steps: (i) initial robot contact to surface, (ii) coordinate map** between CT image and robot, and (iii) ta… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  12. arXiv:2405.03123  [pdf, other

    math.OC eess.SY

    Revealing Decision Conservativeness Through Inverse Distributionally Robust Optimization

    Authors: Qi Li, Zhirui Liang, Andrey Bernstein, Yury Dvorkin

    Abstract: This paper introduces Inverse Distributionally Robust Optimization (I-DRO) as a method to infer the conservativeness level of a decision-maker, represented by the size of a Wasserstein metric-based ambiguity set, from the optimal decisions made using Forward Distributionally Robust Optimization (F-DRO). By leveraging the Karush-Kuhn-Tucker (KKT) conditions of the convex F-DRO model, we formulate I… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  13. I$^3$Net: Inter-Intra-slice Interpolation Network for Medical Slice Synthesis

    Authors: Haofei Song, Xintian Mao, **g Yu, Qingli Li, Yan Wang

    Abstract: Medical imaging is limited by acquisition time and scanning equipment. CT and MR volumes, reconstructed with thicker slices, are anisotropic with high in-plane resolution and low through-plane resolution. We reveal an intriguing phenomenon that due to the mentioned nature of data, performing slice-wise interpolation from the axial view can yield greater benefits than performing super-resolution fr… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  14. Ergodic Spectral Efficiency Analysis of Intelligent Omni-Surface Aided Systems Suffering From Imperfect CSI and Hardware Impairments

    Authors: Qingchao Li, Mohammed El-Hajjar, Lajos Hanzo

    Abstract: In contrast to the conventional reconfigurable intelligent surfaces (RIS), intelligent omni-surfaces (IOS) are capable of full-space coverage of smart radio environments by simultaneously transmitting and reflecting the incident signals. In this paper, we investigate the ergodic spectral efficiency of IOS-aided systems for transmission over random channel links, while considering both realistic im… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: IEEE Transactions on Communications, 2024

  15. Energy-Efficient Reconfigurable Holographic Surfaces Operating in the Presence of Realistic Hardware Impairments

    Authors: Qingchao Li, Mohammed El-Hajjar, Yanshi Sun, Ibrahim Hemadeh, Arman Shojaeifard, Lajos Hanzo

    Abstract: Reconfigurable holographic surfaces (RHSs) constitute a promising technique of supporting energy-efficient communications. In this paper, we formulate the energy efficiency maximization problem of the switch-controlled RHS-aided beamforming architecture by alternately optimizing the holographic beamformer at the RHS, the digital beamformer, the total transmit power and the power sharing ratio of e… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: IEEE Transactions on Communications, 2024

  16. Achievable Rate Analysis of Intelligent Omni-Surface Assisted NOMA Holographic MIMO Systems

    Authors: Qingchao Li, Mohammed El-Hajjar, Yanshi Sun, Ibrahim Hemadeh, Yingming Tsai, Arman Shojaeifard, Lajos Hanzo

    Abstract: An intelligent omni-surface (IOS) assisted holographic multiple-input and multiple-output architecture is conceived for $360^\circ$ full-space coverage at a low energy consumption. The theoretical ergodic rate lower bound of our non-orthogonal multiple access (NOMA) scheme is derived based on the moment matching approximation method, while considering the signal distortion at transceivers imposed… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 6 pages, 3 figures. IEEE Transactions on Vehicular Technology, 2024

  17. arXiv:2405.00075  [pdf, ps, other

    eess.IV

    Charting the Path Forward: CT Image Quality Assessment -- An In-Depth Review

    Authors: Siyi Xun, Qiaoyu Li, Xiaohong Liu, Guangtao Zhai, Mingxiang Wu, Tao Tan

    Abstract: Computed Tomography (CT) is a frequently utilized imaging technology that is employed in the clinical diagnosis of many disorders. However, clinical diagnosis, data storage, and management are posed huge challenges by a huge volume of non-homogeneous CT data in terms of imaging quality. As a result, the quality assessment of CT images is a crucial problem that demands consideration. The history, a… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  18. arXiv:2404.08566  [pdf, other

    eess.SP cs.LG

    Mitigating Receiver Impact on Radio Frequency Fingerprint Identification via Domain Adaptation

    Authors: Liu Yang, Qiang Li, Xiaoyang Ren, Yi Fang, Shafei Wang

    Abstract: Radio Frequency Fingerprint Identification (RFFI), which exploits non-ideal hardware-induced unique distortion resident in the transmit signals to identify an emitter, is emerging as a means to enhance the security of communication systems. Recently, machine learning has achieved great success in develo** state-of-the-art RFFI models. However, few works consider cross-receiver RFFI problems, whe… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Internet of Things Journal

  19. arXiv:2404.01082  [pdf, other

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Li** Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  20. arXiv:2403.16402  [pdf, other

    eess.SY

    A Distributionally Robust Model Predictive Control for Static and Dynamic Uncertainties in Smart Grids

    Authors: Qi Li, Ye Shi, Yuning Jiang, Yuanming Shi, Haoyu Wang, H. Vincent Poor

    Abstract: The integration of various power sources, including renewables and electric vehicles, into smart grids is expanding, introducing uncertainties that can result in issues like voltage imbalances, load fluctuations, and power losses. These challenges negatively impact the reliability and stability of online scheduling in smart grids. Existing research often addresses uncertainties affecting current s… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  21. arXiv:2403.15803  [pdf, other

    eess.IV cs.CV

    Innovative Quantitative Analysis for Disease Progression Assessment in Familial Cerebral Cavernous Malformations

    Authors: Ruige Zong, Tao Wang, Chunwang Li, Xinlin Zhang, Yuanbin Chen, Longxuan Zhao, Qixuan Li, Qinquan Gao, Dezhi Kang, Fuxin Lin, Tong Tong

    Abstract: Familial cerebral cavernous malformation (FCCM) is a hereditary disorder characterized by abnormal vascular structures within the central nervous system. The FCCM lesions are often numerous and intricate, making quantitative analysis of the lesions a labor-intensive task. Consequently, clinicians face challenges in quantitatively assessing the severity of lesions and determining whether lesions ha… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  22. arXiv:2403.11953  [pdf, other

    eess.IV cs.CV

    Advancing COVID-19 Detection in 3D CT Scans

    Authors: Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

    Abstract: To make a more accurate diagnosis of COVID-19, we propose a straightforward yet effective model. Firstly, we analyse the characteristics of 3D CT scans and remove the non-lung parts, facilitating the model to focus on lesion-related areas and reducing computational cost. We use ResNeSt50 as the strong feature extractor, initializing it with pretrained weights which have COVID-19-specific prior kno… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  23. arXiv:2403.11498  [pdf, other

    eess.IV cs.CV

    Domain Adaptation Using Pseudo Labels for COVID-19 Detection

    Authors: Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

    Abstract: In response to the need for rapid and accurate COVID-19 diagnosis during the global pandemic, we present a two-stage framework that leverages pseudo labels for domain adaptation to enhance the detection of COVID-19 from CT scans. By utilizing annotated data from one domain and non-annotated data from another, the model overcomes the challenge of data scarcity and variability, common in emergent he… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  24. arXiv:2403.10323  [pdf, ps, other

    eess.SP

    Joint Optimization for Achieving Covertness in MIMO Over-the-Air Computation Networks

    Authors: Junteng Yao, Tuo Wu, Ming **, Cunhua Pan, Quanzhong Li, **hong Yuan

    Abstract: This paper investigates covert data transmission within a multiple-input multiple-output (MIMO) over-the-air computation (AirComp) network, where sensors transmit data to the access point (AP) while guaranteeing covertness to the warden (Willie). Simultaneously, the AP introduces artificial noise (AN) to confuse Willie, meeting the covert requirement. We address the challenge of minimizing mean-sq… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  25. arXiv:2403.10009  [pdf

    eess.IV cs.CV

    Cardiac Magnetic Resonance 2D+T Short- and Long-axis Segmentation via Spatio-temporal SAM Adaptation

    Authors: Zhennong Chen, Sekeun Kim, Hui Ren, Quanzheng Li, Xiang Li

    Abstract: Accurate 2D+T myocardium segmentation in cine cardiac magnetic resonance (CMR) scans is essential to analyze LV motion throughout the cardiac cycle comprehensively. The Segment Anything Model (SAM), known for its accurate segmentation and zero-shot generalization, has not yet been tailored for CMR 2D+T segmentation. We therefore introduce CMR2D+T-SAM, a novel approach to adapt SAM for CMR 2D+T seg… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures

  26. arXiv:2403.06940  [pdf, other

    eess.IV cs.LG q-bio.QM

    Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction

    Authors: Qing Xiao, Siyeop Yoon, Hui Ren, Matthew Tivnan, Lichao Sun, Quanzheng Li, Tianming Liu, Yu Zhang, Xiang Li

    Abstract: Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals, with changes in cortical thickness (CTh) closely linked to its progression. Accurately forecasting CTh trajectories can significantly enhance early diagnosis and intervention strategies, providing timely care. However, the longitudinal data essential for these studies often suffe… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  27. arXiv:2403.06069  [pdf, other

    eess.IV cs.CV cs.LG

    Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising

    Authors: Yuang Wang, Siyeop Yoon, Pengfei **, Matthew Tivnan, Zhennong Chen, Rui Hu, Li Zhang, Zhiqiang Chen, Quanzheng Li, Dufan Wu

    Abstract: Conditional diffusion models have gained recognition for their effectiveness in image restoration tasks, yet their iterative denoising process, starting from Gaussian noise, often leads to slow inference speeds. As a promising alternative, the Image-to-Image Schrödinger Bridge (I2SB) initializes the generative process from corrupted images and integrates training techniques from conditional diffus… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  28. arXiv:2402.17268  [pdf, other

    eess.SY

    Reinforcement Learning Based Robust Volt/Var Control in Active Distribution Networks With Imprecisely Known Delay

    Authors: Hong Cheng, Huan Luo, Zhi Liu, Wei Sun, Weitao Li, Qiyue Li

    Abstract: Active distribution networks (ADNs) incorporating massive photovoltaic (PV) devices encounter challenges of rapid voltage fluctuations and potential violations. Due to the fluctuation and intermittency of PV generation, the state gap, arising from time-inconsistent states and exacerbated by imprecisely known system delays, significantly impacts the accuracy of voltage control. This paper addresses… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  29. arXiv:2402.15944  [pdf, other

    cs.IT eess.SP

    On A Class of Greedy Sparse Recovery Algorithms -- A High Dimensional Approach

    Authors: Gang Li, Qiuwei Li, Shuang Li, Wu Angela Li

    Abstract: Sparse signal recovery deals with finding the sparest solution of an under-determined linear system $x = Qs$. In this paper, we propose a novel greedy approach to addressing the challenges from such a problem. Such an approach is based on a characterization of solutions to the system, which allows us to work on the sparse recovery in the $s$-space directly with a given measure. With $l_2$-based me… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  30. arXiv:2401.11058  [pdf, ps, other

    cs.IT eess.SP

    Low Complexity Turbo SIC-MMSE Detection for Orthogonal Time Frequency Space Modulation

    Authors: Qi Li, **hong Yuan, Min Qiu, Shuangyang Li, Yixuan Xie

    Abstract: Recently, orthogonal time frequency space (OTFS) modulation has garnered considerable attention due to its robustness against doubly-selective wireless channels. In this paper, we propose a low-complexity iterative successive interference cancellation based minimum mean squared error (SIC-MMSE) detection algorithm for zero-padded OTFS (ZP-OTFS) modulation. In the proposed algorithm, signals are de… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 15 pages, 12 figures, accepted by IEEE Transactions on Communications

  31. arXiv:2401.08992  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR

    Authors: Junwen Bai, Bo Li, Qiujia Li, Tara N. Sainath, Trevor Strohman

    Abstract: The end-to-end ASR model is often desired in the streaming multilingual scenario since it is easier to deploy and can benefit from pre-trained speech models such as powerful foundation models. Meanwhile, the heterogeneous nature and imbalanced data abundance of different languages may cause performance degradation, leading to asynchronous peak performance for different languages during training, e… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  32. arXiv:2401.05617  [pdf, other

    physics.optics eess.SP math.OC

    Estimating the time-evolving refractivity of a turbulent medium using optical beam measurements: a data assimilation approach

    Authors: Anjali Nair, Qin Li, Samuel N. Stechmann

    Abstract: In applications such as free-space optical communication, a signal is often recovered after propagation through a turbulent medium. In this setting, it is common to assume that limited information is known about the turbulent medium, such as a space- and time-averaged statistic (e.g., root-mean-square), but without information about the state of the spatial variations. It could be helpful to gain… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  33. arXiv:2312.16383  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Frame-level emotional state alignment method for speech emotion recognition

    Authors: Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, **long Xue, Yichen Han, Ya Li

    Abstract: Speech emotion recognition (SER) systems aim to recognize human emotional state during human-computer interaction. Most existing SER systems are trained based on utterance-level labels. However, not all frames in an audio have affective states consistent with utterance-level label, which makes it difficult for the model to distinguish the true emotion of the audio and perform poorly. To address th… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  34. arXiv:2312.14705  [pdf, other

    eess.IV cs.CV cs.LG

    SCUNet++: Swin-UNet and CNN Bottleneck Hybrid Architecture with Multi-Fusion Dense Skip Connection for Pulmonary Embolism CT Image Segmentation

    Authors: Yifei Chen, Binfeng Zou, Zhaoxin Guo, Yiyu Huang, Yifan Huang, Feiwei Qin, Qinhai Li, Changmiao Wang

    Abstract: Pulmonary embolism (PE) is a prevalent lung disease that can lead to right ventricular hypertrophy and failure in severe cases, ranking second in severity only to myocardial infarction and sudden death. Pulmonary artery CT angiography (CTPA) is a widely used diagnostic method for PE. However, PE detection presents challenges in clinical practice due to limitations in imaging technology. CTPA can p… ▽ More

    Submitted 2 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 10 pages, 7 figures, accept WACV2024

    Journal ref: WACV 2024

  35. arXiv:2312.11460  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response

    Authors: Junfeng Long, Zirui Wang, Quanyi Li, Jiawei Gao, Liu Cao, Jiangmiao Pang

    Abstract: Robust locomotion control depends on accurate state estimations. However, the sensors of most legged robots can only provide partial and noisy observations, making the estimation particularly challenging, especially for external states like terrain frictions and elevation maps. Inspired by the classical Internal Model Control principle, we consider these external states as disturbances and introdu… ▽ More

    Submitted 1 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Use 1 hour to train a quadruped robot capable of traversing any terrain under any disturbances in the open world, Project Page: https://github.com/OpenRobotLab/HIMLoco

  36. arXiv:2312.10495  [pdf, other

    math.OC eess.SY

    Computing Optimal Joint Chance Constrained Control Policies

    Authors: Niklas Schmid, Marta Fochesato, Sarah H. Q. Li, Tobias Sutter, John Lygeros

    Abstract: We consider the problem of optimally controlling stochastic, Markovian systems subject to joint chance constraints over a finite-time horizon. For such problems, standard Dynamic Programming is inapplicable due to the time correlation of the joint chance constraints, which calls for non-Markovian, and possibly stochastic, policies. Hence, despite the popularity of this problem, solution approaches… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  37. arXiv:2312.08144  [pdf, other

    eess.SP

    Privacy-Preserving Distributed Optimisation using Stochastic PDMM

    Authors: Sebastian O. Jordan, Qiongxiu Li, Richard Heusdens

    Abstract: Privacy-preserving distributed processing has received considerable attention recently. The main purpose of these algorithms is to solve certain signal processing tasks over a network in a decentralised fashion without revealing private/secret data to the outside world. Because of the iterative nature of these distributed algorithms, computationally complex approaches such as (homomorphic) encrypt… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  38. arXiv:2312.07941  [pdf, ps, other

    cs.IT eess.SP

    An efficient algorithm for multiuser sum-rate maximization of large-scale active RIS-aided MIMO system

    Authors: Qian Zhang, Mingjie Shao, Qiang Li, Ju Liu

    Abstract: Active reconfigurable intelligent surface (RIS) is a new RIS architecture that can reflect and amplify communication signals. It can provide enhanced performance gain compared to the conventional passive RIS systems that can only reflect the signals. On the other hand, the design problem of active RIS-aided systems is more challenging than the passive RIS-aided systems and its efficient algorithms… ▽ More

    Submitted 11 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024

  39. arXiv:2312.05256  [pdf, other

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, **gyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang , et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor… ▽ More

    Submitted 10 November, 2023; originally announced December 2023.

  40. arXiv:2311.12274  [pdf, other

    eess.SY

    Micro Energy-Water-Hydrogen Nexus: Data-driven Real-time Optimal Operation

    Authors: Mostafa Goodarzi, Qifeng Li

    Abstract: This paper extends a new concept of energy-water-hydrogen (EWH) nexus, which was recently developed as a solution for reducing carbon emissions from the generation side of power systems, to the distribution side. Under the concept of distribution-level EWH (micro EWH) nexus, renewable energy sources (RES) are utilized to meet the energy needs of a small community. To avoid the uncertainty caused b… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.17395

  41. arXiv:2311.11111  [pdf, other

    eess.SY

    Economic Viability of the Energy-Water-Hydrogen Nexus for Power System Decarbonization

    Authors: Mostafa Goodarzi, Qifeng Li

    Abstract: This paper aims to evaluate the economic viability of the energy-water-hydrogen (EWH) nexus as a new solution for reducing carbon emissions from power systems. The urgency around climate change emphasizes the pressing need to mitigate carbon emissions, especially from the electricity sector, which accounts for a significant portion of total emissions in the US. In response, incorporating more rene… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, 2 tables, conference

  42. arXiv:2311.04537  [pdf, other

    eess.SP cs.IT cs.LG

    Deep Learning Assisted Multiuser MIMO Load Modulated Systems for Enhanced Downlink mmWave Communications

    Authors: Ercong Yu, **le Zhu, Qiang Li, Zilong Liu, Hongyang Chen, Shlomo Shamai, H. Vincent Poor

    Abstract: This paper is focused on multiuser load modulation arrays (MU-LMAs) which are attractive due to their low system complexity and reduced cost for millimeter wave (mmWave) multi-input multi-output (MIMO) systems. The existing precoding algorithm for downlink MU-LMA relies on a sub-array structured (SAS) transmitter which may suffer from decreased degrees of freedom and complex system configuration.… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 14 pages, Journal, accepted by IEEE TWC

  43. Long-term Dependency for 3D Reconstruction of Freehand Ultrasound Without External Tracker

    Authors: Qi Li, Ziyi Shen, Qian Li, Dean C. Barratt, Thomas Dowrick, Matthew J. Clarkson, Tom Vercauteren, Yipeng Hu

    Abstract: Objective: Reconstructing freehand ultrasound in 3D without any external tracker has been a long-standing challenge in ultrasound-assisted procedures. We aim to define new ways of parameterising long-term dependencies, and evaluate the performance. Methods: First, long-term dependency is encoded by transformation positions within a frame sequence. This is achieved by combining a sequence model wit… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to IEEE Transactions on Biomedical Engineering (TBME, 2023)

  44. One-Bit Channel Estimation for IRS-aided Millimeter-Wave Massive MU-MISO System

    Authors: Silei Wang, Qiang Li, **gran Lin

    Abstract: Recently, intelligent reflecting surface (IRS)-assisted communication has gained considerable attention due to its advantage in extending the coverage and compensating the path loss with low-cost passive metasurface. This paper considers the uplink channel estimation for IRS-aided multiuser massive MISO communications with one-bit ADCs at the base station (BS). The use of one-bit ADC is impelled b… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted by IEEE Trans. Signal Process

  45. arXiv:2309.13539  [pdf, other

    eess.IV

    MediViSTA-SAM: Zero-shot Medical Video Analysis with Spatio-temporal SAM Adaptation for Echocardiography

    Authors: Sekeun Kim, Kyungsang Kim, Jiang Hu, Cheng Chen, Zhiliang Lyu, Ren Hui, Sunghwan Kim, Zhengliang Liu, Aoxiao Zhong, Xiang Li, Tianming Liu, Quanzheng Li

    Abstract: The Segmentation Anything Model (SAM) has gained significant attention for its robust generalization capabilities across diverse downstream tasks. However, the performance of SAM is noticeably diminished in medical images due to the substantial disparity between natural and medical image domain. In this paper, we present a zero-shot generalization model specifically designed for echocardiography a… ▽ More

    Submitted 6 April, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

  46. arXiv:2309.12963  [pdf, ps, other

    eess.AS cs.SD

    Massive End-to-end Models for Short Search Queries

    Authors: Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara Sainath, Pedro Moreno Mengibar

    Abstract: In this work, we investigate two popular end-to-end automatic speech recognition (ASR) models, namely Connectionist Temporal Classification (CTC) and RNN-Transducer (RNN-T), for offline recognition of voice search queries, with up to 2B model parameters. The encoders of our models use the neural architecture of Google's universal speech model (USM), with additional funnel pooling layers to signifi… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  47. arXiv:2309.06787  [pdf, other

    cs.SD eess.AS

    DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation

    Authors: Zhichao Wu, Qiulin Li, Sixing Liu, Qun Yang

    Abstract: In the Text-to-speech(TTS) task, the latent diffusion model has excellent fidelity and generalization, but its expensive resource consumption and slow inference speed have always been a challenging. This paper proposes Discrete Diffusion Model with Contrastive Learning for Text-to-Speech Generation(DCTTS). The following contributions are made by DCTTS: 1) The TTS diffusion model based on discrete… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 5 pages, submitted to ICASSP

  48. arXiv:2309.04730  [pdf, other

    eess.SP cs.DC eess.SY

    Integrated Robotics Networks with Co-optimization of Drone Placement and Air-Ground Communications

    Authors: Menghao Hu, Tong Zhang, Shuai Wang, Guoliang Li, Yingyang Chen, Qiang Li, Gaojie Chen

    Abstract: Terrestrial robots, i.e., unmanned ground vehicles (UGVs), and aerial robots, i.e., unmanned aerial vehicles (UAVs), operate in separate spaces. To exploit their complementary features (e.g., fields of views, communication links, computing capabilities), a promising paradigm termed integrated robotics network emerges, which provides communications for cooperative UAVs-UGVs applications. However, h… ▽ More

    Submitted 3 December, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted by VTC2023-Fall, 5 pages, 4 figures

  49. arXiv:2309.01774  [pdf, other

    eess.SP

    Variational Tracking and Redetection for Closely-spaced Objects in Heavy Clutter: Supplementary Materials

    Authors: Runze Gan, Qing Li, Simon Godsill

    Abstract: The non-homogeneous Poisson process (NHPP) is a widely used measurement model that allows for an object to generate multiple measurements over time. However, it can be difficult to efficiently and reliably track multiple objects under this NHPP model in scenarios with a high density of closely-spaced objects and heavy clutter. Therefore, based on the general coordinate ascent variational filtering… ▽ More

    Submitted 23 April, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Supplementary Materials, including Appendices C-F, begin on page 25, with pages 1-24 constituting the main article. Key updates from the first arXiv version include: added comparisons of the sum-product-algorithm-based tracker, included Appendix C for association update derivation, and made minor adjustments throughout the article to enhance presentation

  50. arXiv:2309.00807  [pdf, other

    eess.SP cs.NI

    Consensus-based Distributed Variational Multi-object Tracker in Multi-Sensor Network

    Authors: Qing Li, Runze Gan, Simon Godsill

    Abstract: The growing need for accurate and reliable tracking systems has driven significant progress in sensor fusion and object tracking techniques. In this paper, we design two variational Bayesian trackers that effectively track multiple targets in cluttered environments within a sensor network. We first present a centralised sensor fusion scheme, which involves transmitting sensor data to a fusion cent… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.