Skip to main content

Showing 1–50 of 133 results for author: Shen, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.13268  [pdf, other

    eess.AS cs.SD

    CEC: A Noisy Label Detection Method for Speaker Recognition

    Authors: Yao Shen, Yingying Gao, Yaqian Hao, Chenguang Hu, Fulin Zhang, Junlan Feng, Shilei Zhang

    Abstract: Noisy labels are inevitable, even in well-annotated datasets. The detection of noisy labels is of significant importance to enhance the robustness of speaker recognition models. In this paper, we propose a novel noisy label detection approach based on two new statistical metrics: Continuous Inconsistent Counting (CIC) and Total Inconsistent Counting (TIC). These metrics are calculated through Cros… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: interspeech 2024

  2. arXiv:2406.07374  [pdf, other

    eess.SP

    Movable-Antenna Array Empowered ISAC Systems for Low-Altitude Economy

    Authors: Ziming Kuang, Wenchao Liu, Chunjie Wang, Zhenzhen **, **ke Ren, Xuhui Zhang, Yanyan Shen

    Abstract: This paper investigates a movable-antenna (MA) array empowered integrated sensing and communications (ISAC) over low-altitude platform (LAP) system to support low-altitude economy (LAE) applications. In the considered system, an unmanned aerial vehicle (UAV) is dispatched to hover in the air, working as the UAV-enabled LAP (ULAP) to provide information transmission and sensing simultaneously for L… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2406.03391  [pdf, other

    eess.SP

    Joint Association, Beamforming, and Resource Allocation for Multi-IRS Enabled MU-MISO Systems With RSMA

    Authors: Chunjie Wang, Xuhui Zhang, Huijun Xing, Liang Xue, Shuqiang Wang, Yanyan Shen, Bo Yang, ** Guan

    Abstract: Intelligent reflecting surface (IRS) and rate-splitting multiple access (RSMA) technologies are at the forefront of enhancing spectrum and energy efficiency in the next generation multi-antenna communication systems. This paper explores a RSMA system with multiple IRSs, and proposes two purpose-driven scheduling schemes, i.e., the exhaustive IRS-aided (EIA) and opportunistic IRS-aided (OIA) scheme… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2405.20746  [pdf, ps, other

    eess.SP

    UAV-Enabled Wireless Networks with Movable-Antenna Array: Flexible Beamforming and Trajectory Design

    Authors: Wenchao Liu, Xuhui Zhang, Huijun Xing, **ke Ren, Yanyan Shen, Shuguang Cui

    Abstract: Recently, movable antenna (MA) array becomes a promising technology for improving the communication quality in wireless communication systems. In this letter, an unmanned aerial vehicle (UAV) enabled multi-user multi-input-single-output system enhanced by the MA array is investigated. To enhance the throughput capacity, we aim to maximize the achievable data rate by jointly optimizing the transmit… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  5. arXiv:2405.19373  [pdf, other

    eess.SP cs.LG

    Multi-modal Mood Reader: Pre-trained Model Empowers Cross-Subject Emotion Recognition

    Authors: Yihang Dong, Xuhang Chen, Yanyan Shen, Michael Kwok-Po Ng, Tao Qian, Shuqiang Wang

    Abstract: Emotion recognition based on Electroencephalography (EEG) has gained significant attention and diversified development in fields such as neural signal processing and affective computing. However, the unique brain anatomy of individuals leads to non-negligible natural differences in EEG signals across subjects, posing challenges for cross-subject emotion recognition. While recent studies have attem… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by International Conference on Neural Computing for Advanced Applications, 2024

  6. arXiv:2405.09663  [pdf

    eess.SP

    Design and Implementation of mmWave Surface Wave Enabled Fluid Antennas and Experimental Results for Fluid Antenna Multiple Access

    Authors: Yuanjun Shen, Boyi Tang, Shuai Gao, Kin-Fai Tong, Hang Wong, Kai-Kit Wong, Yangyang Zhang

    Abstract: While multiple-input multiple-output (MIMO) technologies continue to advance, concerns arise as to how MIMO can remain scalable if more users are to be accommodated with an increasing number of antennas at the base station (BS) in the upcoming sixth generation (6G). Recently, the concept of fluid antenna system (FAS) has emerged, which promotes position flexibility to enable transmitter channel st… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Transactions on Antennas and Propagation

  7. arXiv:2405.09446  [pdf, other

    eess.IV

    M$^4$oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts

    Authors: Yufeng Jiang, Yiqing Shen

    Abstract: Medical imaging data is inherently heterogeneous across different modalities and clinical centers, posing unique challenges for develo** generalizable foundation models. Conventional entails training distinct models per dataset or using a shared encoder with modality-specific decoders. However, these approaches incur heavy computational overheads and suffer from poor scalability. To address thes… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  8. arXiv:2405.06342  [pdf, other

    cs.CV eess.IV

    Compression-Realized Deep Structural Network for Video Quality Enhancement

    Authors: Hanchi Sun, Xiaohong Liu, Xinyang Jiang, Yifei Shen, Dongsheng Li, Xiongkuo Min, Guangtao Zhai

    Abstract: This paper focuses on the task of quality enhancement for compressed videos. Although deep network-based video restorers achieve impressive progress, most of the existing methods lack a structured design to optimally leverage the priors within compression codecs. Since the quality degradation of the video is primarily induced by the compression algorithm, a new paradigm is urgently needed for a mo… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  9. arXiv:2405.01816  [pdf, other

    eess.SP

    The Integrated Sensing and Communication Revolution for 6G: Vision, Techniques, and Applications

    Authors: Nuria González-Prelcic, Musa Furkan Keskin, Ossi Kaltiokallio, Mikko Valkama, Davide Dardari, Xiao Shen, Yuan Shen, Murat Bayraktar, Henk Wymeersch

    Abstract: Future wireless networks will integrate sensing, learning and communication to provide new services beyond communication and to become more resilient. Sensors at the network infrastructure, sensors on the user equipment, and the sensing capability of the communication signal itself provide a new source of data that connects the physical and radio frequency environments. A wireless network that har… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  10. arXiv:2403.17565  [pdf, other

    cs.RO eess.SY

    Aerial Robots Carrying Flexible Cables: Dynamic Shape Optimal Control via Spectral Method Model

    Authors: Yaolei Shen, Chiara Gabellieri, Antonio Franchi

    Abstract: In this work, we present a model-based optimal boundary control design for an aerial robotic system composed of a quadrotor carrying a flexible cable. The whole system is modeled by partial differential equations (PDEs) combined with boundary conditions described by ordinary differential equations (ODEs). The proper orthogonal decomposition (POD) method is adopted to project the original infinite-… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  11. arXiv:2403.09827  [pdf, other

    eess.IV cs.CV

    FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images

    Authors: Yiqing Shen, **gxing Li, Xinyuan Shao, Blanca Inigo Romillo, Ankush **dal, David Dreizin, Mathias Unberath

    Abstract: Segment anything models (SAMs) are gaining attention for their zero-shot generalization capability in segmenting objects of unseen classes and in unseen domains when properly prompted. Interactivity is a key strength of SAMs, allowing users to iteratively provide prompts that specify objects of interest to refine outputs. However, to realize the interactive use of SAMs for 3D medical imaging tasks… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  12. arXiv:2403.08479  [pdf, other

    eess.IV cs.CV physics.med-ph

    MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction

    Authors: Linjie Fu, Xia Li, Xiuding Cai, Yingkai Wang, Xueyao Wang, Yali Shen, Yu Yao

    Abstract: Radiation therapy is crucial in cancer treatment. Experienced experts typically iteratively generate high-quality dose distribution maps, forming the basis for excellent radiation therapy plans. Therefore, automated prediction of dose distribution maps is significant in expediting the treatment process and providing a better starting point for develo** radiation therapy plans. With the remarkabl… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  13. arXiv:2403.03055  [pdf, other

    cs.MA cs.LG cs.RO eess.SY

    Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range

    Authors: Yuzi Yan, Yuan Shen

    Abstract: This paper proposes a scalable distributed policy gradient method and proves its convergence to near-optimal solution in multi-agent linear quadratic networked systems. The agents engage within a specified network under local communication constraints, implying that each agent can only exchange information with a limited number of neighboring agents. On the underlying graph of the network, each ag… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 14 pages, 6 figures

  14. arXiv:2402.18070  [pdf, other

    cs.AR eess.SP

    A Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband Processing

    Authors: Limin Jiang, Yi Shi, Haiqin Hu, Qingyu Deng, Siyi Xu, Yintao Liu, Feng Yuan, Si Wang, Yihao Shen, Fangfang Ye, Shan Cao, Zhiyuan Jiang

    Abstract: Wireless baseband processing (WBP) is a key element of wireless communications, with a series of signal processing modules to improve data throughput and counter channel fading. Conventional hardware solutions, such as digital signal processors (DSPs) and more recently, graphic processing units (GPUs), provide various degrees of parallelism, yet they both fail to take into account the cyclical and… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 7 figures, conference

  15. arXiv:2401.00740  [pdf, other

    eess.IV cs.CV

    Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution

    Authors: Zeke Zexi Hu, Xiaoming Chen, Vera Yuk Ying Chung, Yiran Shen

    Abstract: The effective extraction of spatial-angular features plays a crucial role in light field image super-resolution (LFSR) tasks, and the introduction of convolution and Transformers leads to significant improvement in this area. Nevertheless, due to the large 4D data volume of light field images, many existing methods opted to decompose the data into a number of lower-dimensional subspaces and perfor… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  16. arXiv:2312.15538  [pdf, other

    eess.SP

    Exploiting Multipath Information for Integrated Localization and Sensing via PHD Filtering

    Authors: Yinuo Du, Hanying Zhao, Yang Liu, Xinlei Yu, Yuan Shen

    Abstract: Accurate localization and perception are pivotal for enhancing the safety and reliability of vehicles. However, current localization methods suffer from reduced accuracy when the line-of-sight (LOS) path is obstructed, or a combination of reflections and scatterings is present. In this paper, we present an integrated localization and sensing method that delivers superior performance in complex env… ▽ More

    Submitted 13 June, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Comments: This work has been submitted to the IEEE Transactions on Vehicular Technology

  17. arXiv:2312.09022  [pdf, other

    eess.IV cs.CV q-bio.NC

    BDHT: Generative AI Enables Causality Analysis for Mild Cognitive Impairment

    Authors: Qiankun Zuo, Ling Chen, Yanyan Shen, Michael Kwok-Po Ng, Baiying Lei, Shuqiang Wang

    Abstract: Effective connectivity estimation plays a crucial role in understanding the interactions and information flow between different brain regions. However, the functional time series used for estimating effective connectivity is derived from certain software, which may lead to large computing errors because of different parameter settings and degrade the ability to model complex causal relationships b… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 13pages, 14 figures

  18. arXiv:2312.06187  [pdf, other

    eess.IV cs.CV

    SP-DiffDose: A Conditional Diffusion Model for Radiation Dose Prediction Based on Multi-Scale Fusion of Anatomical Structures, Guided by SwinTransformer and Projector

    Authors: Linjie Fu, Xia Li, Xiuding Cai, Yingkai Wang, Xueyao Wang, Yu Yao, Yali Shen

    Abstract: Radiation therapy serves as an effective and standard method for cancer treatment. Excellent radiation therapy plans always rely on high-quality dose distribution maps obtained through repeated trial and error by experienced experts. However, due to individual differences and complex clinical situations, even seasoned expert teams may need help to achieve the best treatment plan every time quickly… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  19. arXiv:2312.01464  [pdf, other

    physics.med-ph cs.CV eess.IV physics.comp-ph

    CT Reconstruction using Diffusion Posterior Sampling conditioned on a Nonlinear Measurement Model

    Authors: Shudong Li, Xiao Jiang, Matthew Tivnan, Grace J. Gang, Yuan Shen, J. Webster Stayman

    Abstract: Diffusion models have been demonstrated as powerful deep learning tools for image generation in CT reconstruction and restoration. Recently, diffusion posterior sampling, where a score-based diffusion prior is combined with a likelihood model, has been used to produce high quality CT images given low-quality measurements. This technique is attractive since it permits a one-time, unsupervised train… ▽ More

    Submitted 11 June, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: 24 pages, 12 figures, 1 table, submitted to SPIE Journal of Medical Imaging. Updated with more realistic phantom data, Poisson likelihood, and additional evaluations including hallucination evaluation, performance under multiple noise levels, inference time evaluation, and etc. Changes in authorship is based on unanimous agreement to acknowledge the adding authors' contributions in this work

    ACM Class: J.3; I.4.4; I.4.5

  20. arXiv:2311.03217  [pdf, other

    eess.IV cs.CV cs.LG

    Leveraging Transformers to Improve Breast Cancer Classification and Risk Assessment with Multi-modal and Longitudinal Data

    Authors: Yiqiu Shen, Jungkyu Park, Frank Yeung, Eliana Goldberg, Laura Heacock, Farah Shamout, Krzysztof J. Geras

    Abstract: Breast cancer screening, primarily conducted through mammography, is often supplemented with ultrasound for women with dense breast tissue. However, existing deep learning models analyze each modality independently, missing opportunities to integrate information across imaging modalities and time. In this study, we present Multi-modal Transformer (MMT), a neural network that utilizes mammography a… ▽ More

    Submitted 15 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: ML4H 2023 Findings Track

  21. arXiv:2310.15801  [pdf, other

    cs.IT cs.AR eess.SP

    A Generalized Adjusted Min-Sum Decoder for 5G LDPC Codes: Algorithm and Implementation

    Authors: Yuqing Ren, Hassan Harb, Yifei Shen, Alexios Balatsoukas-Stimming, Andreas Burg

    Abstract: 5G New Radio (NR) has stringent demands on both performance and complexity for the design of low-density parity-check (LDPC) decoding algorithms and corresponding VLSI implementations. Furthermore, decoders must fully support the wide range of all 5G NR blocklengths and code rates, which is a significant challenge. In this paper, we present a high-performance and low-complexity LDPC decoder, tailo… ▽ More

    Submitted 17 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 14 pages, 15 figures, accepted by IEEE Transactions on Circuits and Systems I: Regular Paper

  22. arXiv:2310.14432  [pdf, other

    cs.LG eess.SP

    Fairness-aware Optimal Graph Filter Design

    Authors: O. Deniz Kose, Yanning Shen, Gonzalo Mateos

    Abstract: Graphs are mathematical tools that can be used to represent complex real-world interconnected systems, such as financial markets and social networks. Hence, machine learning (ML) over graphs has attracted significant attention recently. However, it has been demonstrated that ML over graphs amplifies the already existing bias towards certain under-represented groups in various decision-making probl… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 12 pages, 3 figures, 9 tables. arXiv admin note: text overlap with arXiv:2303.11459

  23. arXiv:2309.14550  [pdf, other

    eess.IV cs.CV

    MEMO: Dataset and Methods for Robust Multimodal Retinal Image Registration with Large or Small Vessel Density Differences

    Authors: Chiao-Yi Wang, Faranguisse Kakhi Sadrieh, Yi-Ting Shen, Shih-En Chen, Sarah Kim, Victoria Chen, Achyut Raghavendra, Dongyi Wang, Osamah Saeedi, Yang Tao

    Abstract: The measurement of retinal blood flow (RBF) in capillaries can provide a powerful biomarker for the early diagnosis and treatment of ocular diseases. However, no single modality can determine capillary flowrates with high precision. Combining erythrocyte-mediated angiography (EMA) with optical coherence tomography angiography (OCTA) has the potential to achieve this goal, as EMA can measure the ab… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE JBHI

  24. arXiv:2309.02638  [pdf

    physics.med-ph eess.IV physics.optics

    Review of photoacoustic imaging plus X

    Authors: Daohuai Jiang, Luyao Zhu, Shangqing Tong, Yuting Shen, Feng Gao, Fei Gao

    Abstract: Photoacoustic imaging (PAI) is a novel modality in biomedical imaging technology that combines the rich optical contrast with the deep penetration of ultrasound. To date, PAI technology has found applications in various biomedical fields. In this review, we present an overview of the emerging research frontiers on PAI plus other advanced technologies, named as PAI plus X, which includes but not li… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  25. arXiv:2308.03354  [pdf, other

    eess.IV cs.CV physics.med-ph

    Energy-Guided Diffusion Model for CBCT-to-CT Synthesis

    Authors: Linjie Fu, Xia Li, Xiuding Cai, Dong Miao, Yu Yao, Yali Shen

    Abstract: Cone Beam CT (CBCT) plays a crucial role in Adaptive Radiation Therapy (ART) by accurately providing radiation treatment when organ anatomy changes occur. However, CBCT images suffer from scatter noise and artifacts, making relying solely on CBCT for precise dose calculation and accurate tissue localization challenging. Therefore, there is a need to improve CBCT image quality and Hounsfield Unit (… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  26. arXiv:2308.00543  [pdf, other

    cs.IT eess.SP

    On the Performance Tradeoff of an ISAC System with Finite Blocklength

    Authors: Xiao Shen, Na Zhao, Yuan Shen

    Abstract: Integrated sensing and communication (ISAC) has been proposed as a promising paradigm in the future wireless networks, where the spectral and hardware resources are shared to provide a considerable performance gain. It is essential to understand how sensing and communication (S\&C) influences each other to guide the practical algorithm and system design in ISAC. In this paper, we investigate the p… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted by ICC 2023

  27. arXiv:2307.14262  [pdf, other

    eess.IV cs.CV

    Artifact Restoration in Histology Images with Diffusion Probabilistic Models

    Authors: Zhenqi He, Junjun He, ** Ye, Yiqing Shen

    Abstract: Histological whole slide images (WSIs) can be usually compromised by artifacts, such as tissue folding and bubbles, which will increase the examination difficulty for both pathologists and Computer-Aided Diagnosis (CAD) systems. Existing approaches to restoring artifact images are confined to Generative Adversarial Networks (GANs), where the restoration process is formulated as an image-to-image t… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI2023

  28. arXiv:2307.13945  [pdf, ps, other

    eess.SY cs.AI

    Learning-based Control for PMSM Using Distributed Gaussian Processes with Optimal Aggregation Strategy

    Authors: Zhenxiao Yin, Xiaobing Dai, Zewen Yang, Yang Shen, Georges Hattab, Hang Zhao

    Abstract: The growing demand for accurate control in varying and unknown environments has sparked a corresponding increase in the requirements for power supply components, including permanent magnet synchronous motors (PMSMs). To infer the unknown part of the system, machine learning techniques are widely employed, especially Gaussian process regression (GPR) due to its flexibility of continuous system mode… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  29. arXiv:2307.13353  [pdf

    eess.SP

    Gradient-based adaptive wavelet de-noising method for photoacoustic imaging in vivo

    Authors: Xinke Li, Peng Ge, Yuting Shen, Feng Gao, Fei Gao

    Abstract: Photoacoustic imaging (PAI) has been applied to many biomedical applications over the past decades. However, the received PA signal usually suffers from poor signal-to-noise ratio (SNR). Conventional solution of employing higher-power laser, or doing long-time signal averaging, may raise the system cost, time consumption, and tissue damage. Another strategy is de-noising algorithm design. In this… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  30. arXiv:2307.11868   

    eess.SY

    Dead-time Compensation Method for Bus-clam** Modulated Voltage Source Inverter

    Authors: Reza Asrar Ghaderloo, Yidi Shen, Chanaka Singhabahu, Rakesh Resalayyan, Alireza Khaligh

    Abstract: Bus-clam** Pulse Width Modulation (PWM) is an effective method to reduce the switching loss in a three-phase voltage source inverter (VSI). In bus-clam** PWM scheme, the phase legs are switched using high frequency PWM signals for two-third of the line cycle, while for the remaining duration of cycle, the pole voltage is clamped to either positive or negative rail of the DC bus. In PWM operati… ▽ More

    Submitted 28 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: I will make some major changes on the paper. I prefer to withdraw it for now

  31. arXiv:2307.08051  [pdf, other

    eess.IV cs.CV

    TransNuSeg: A Lightweight Multi-Task Transformer for Nuclei Segmentation

    Authors: Zhenqi He, Mathias Unberath, **g Ke, Yiqing Shen

    Abstract: Nuclei appear small in size, yet, in real clinical practice, the global spatial information and correlation of the color or brightness contrast between nuclei and background, have been considered a crucial component for accurate nuclei segmentation. However, the field of automatic nuclei segmentation is dominated by Convolutional Neural Networks (CNNs), meanwhile, the potential of the recently pre… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Early accepted by MICCAI2023

  32. arXiv:2307.02779  [pdf, other

    cs.IT cs.LG cs.NI eess.SP

    Large Language Models Empowered Autonomous Edge AI for Connected Intelligence

    Authors: Yifei Shen, Jiawei Shao, Xinjie Zhang, Zehong Lin, Hao Pan, Dongsheng Li, Jun Zhang, Khaled B. Letaief

    Abstract: The evolution of wireless networks gravitates towards connected intelligence, a concept that envisions seamless interconnectivity among humans, objects, and intelligence in a hyper-connected cyber-physical world. Edge artificial intelligence (Edge AI) is a promising solution to achieve connected intelligence by delivering high-quality, low-latency, and privacy-preserving AI services at the network… ▽ More

    Submitted 25 December, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: IEEE Communication Magazine

  33. arXiv:2306.12559  [pdf, other

    cs.CV cs.SD eess.AS

    Exploring the Role of Audio in Video Captioning

    Authors: Yuhan Shen, Linjie Yang, Longyin Wen, Haichao Yu, Ehsan Elhamifar, Heng Wang

    Abstract: Recent focus in video captioning has been on designing architectures that can consume both video and text modalities, and using large-scale video datasets with text transcripts for pre-training, such as HowTo100M. Though these approaches have achieved significant improvement, the audio modality is often ignored in video captioning. In this work, we present an audio-visual framework, which aims to… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  34. arXiv:2306.00714  [pdf, other

    cs.CV cs.LG eess.IV

    Dissecting Arbitrary-scale Super-resolution Capability from Pre-trained Diffusion Generative Models

    Authors: Ruibin Li, Qihua Zhou, Song Guo, Jie Zhang, **gcai Guo, Xinyang Jiang, Yifei Shen, Zhenhua Han

    Abstract: Diffusion-based Generative Models (DGMs) have achieved unparalleled performance in synthesizing high-quality visual content, opening up the opportunity to improve image super-resolution (SR) tasks. Recent solutions for these tasks often train architecture-specific DGMs from scratch, or require iterative fine-tuning and distillation on pre-trained DGMs, both of which take considerable time and hard… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  35. arXiv:2305.18208  [pdf, other

    eess.SP cs.AI cs.LG stat.AP

    A Semi-Supervised Learning Approach for Ranging Error Mitigation Based on UWB Waveform

    Authors: Yuxiao Li, Santiago Mazuelas, Yuan Shen

    Abstract: Localization systems based on ultra-wide band (UWB) measurements can have unsatisfactory performance in harsh environments due to the presence of non-line-of-sight (NLOS) errors. Learning-based methods for error mitigation have shown great performance improvement via directly exploiting the wideband waveform instead of handcrafted features. However, these methods require data samples fully labeled… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 5 pages, 3 figures, Published in: MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM)

    Journal ref: MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM), San Diego, CA, USA, 2021, pp. 533-537

  36. arXiv:2305.18206  [pdf, other

    eess.SP cs.AI cs.LG stat.AP

    Deep Generative Model for Simultaneous Range Error Mitigation and Environment Identification

    Authors: Yuxiao Li, Santiago Mazuelas, Yuan Shen

    Abstract: Received waveforms contain rich information for both range information and environment semantics. However, its full potential is hard to exploit under multipath and non-line-of-sight conditions. This paper proposes a deep generative model (DGM) for simultaneous range error mitigation and environment identification. In particular, we present a Bayesian model for the generative process of the receiv… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 6 pages, 5 figures, Published in: 2021 IEEE Global Communications Conference (GLOBECOM)

    Journal ref: 2021 IEEE Global Communications Conference (GLOBECOM), Madrid, Spain, 2021, pp. 1-6

  37. arXiv:2305.14927  [pdf

    physics.optics eess.SP

    Scalable wavelength-multiplexing photonic reservoir computing

    Authors: Rui-Qian Li, Yi-Wei Shen, Bao-De Lin, **gyi Yu, Xuming He, Cheng Wang

    Abstract: Photonic reservoir computing (PRC) is a special hardware recurrent neural network, which is featured with fast training speed and low training cost. This work shows a wavelength-multiplexing PRC architecture, taking advantage of the numerous longitudinal modes in a Fabry-Perot semiconductor laser. These modes construct connected physical neurons in parallel, while an optical feedback loop provides… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  38. A Deep Learning Approach for Generating Soft Range Information from RF Data

    Authors: Yuxiao Li, Santiago Mazuelas, Yuan Shen

    Abstract: Radio frequency (RF)-based techniques are widely adopted for indoor localization despite the challenges in extracting sufficient information from measurements. Soft range information (SRI) offers a promising alternative for highly accurate localization that gives all probable range values rather than a single estimate of distance. We propose a deep learning approach to generate accurate SRI from R… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Published in: 2021 IEEE Globecom Workshops (GC Wkshps)

    Journal ref: 021 IEEE Globecom Workshops (GC Wkshps), Madrid, Spain, 2021, pp. 1-5

  39. arXiv:2304.13090  [pdf, ps, other

    cs.LG cs.CR eess.SY

    Model Extraction Attacks Against Reinforcement Learning Based Controllers

    Authors: Momina Sajid, Yanning Shen, Yasser Shoukry

    Abstract: We introduce the problem of model-extraction attacks in cyber-physical systems in which an attacker attempts to estimate (or extract) the feedback controller of the system. Extracting (or estimating) the controller provides an unmatched edge to attackers since it allows them to predict the future control actions of the system and plan their attack accordingly. Hence, it is important to understand… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 8 pages, 8 figures

  40. arXiv:2304.03416  [pdf, other

    eess.SP cs.LG cs.SD eess.AS

    To Wake-up or Not to Wake-up: Reducing Keyword False Alarm by Successive Refinement

    Authors: Yashas Malur Saidutta, Rakshith Sharma Srinivasa, Ching-Hua Lee, Chouchang Yang, Yilin Shen, Hongxia **

    Abstract: Keyword spotting systems continuously process audio streams to detect keywords. One of the most challenging tasks in designing such systems is to reduce False Alarm (FA) which happens when the system falsely registers a keyword despite the keyword not being uttered. In this paper, we propose a simple yet elegant solution to this problem that follows from the law of total probability. We show that… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted for publication in ICASSP 2023

  41. arXiv:2303.11459  [pdf, other

    cs.LG cs.SI eess.SP

    Fairness-Aware Graph Filter Design

    Authors: O. Deniz Kose, Yanning Shen, Gonzalo Mateos

    Abstract: Graphs are mathematical tools that can be used to represent complex real-world systems, such as financial markets and social networks. Hence, machine learning (ML) over graphs has attracted significant attention recently. However, it has been demonstrated that ML over graphs amplifies the already existing bias towards certain under-represented groups in various decision-making problems due to the… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 6 pages, 1 figure, 2 tables

  42. arXiv:2302.10193  [pdf

    physics.med-ph eess.IV

    Non-line-of-sight photoacoustic imaging

    Authors: Yuting Shen, Xiaohua Feng, Fei Gao

    Abstract: Photoacoustic imaging is a promising imaging technique for human brain due to its high sensitivity and functional imaging ability. However, the skull would cause strong attenuation and distortion to the photoacoustic signals, which makes non-invasive transcranial imaging difficult. In this work, the temporal bone is selected as an imaging window to minimize the influence of the skull. Moreover, no… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

  43. arXiv:2302.01186  [pdf, other

    cs.LG eess.SP math.OC stat.ML

    The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing

    Authors: Xingyu Xu, Yandi Shen, Yuejie Chi, Cong Ma

    Abstract: We propose $\textsf{ScaledGD($λ$)}$, a preconditioned gradient descent method to tackle the low-rank matrix sensing problem when the true rank is unknown, and when the matrix is possibly ill-conditioned. Using overparametrized factor representations, $\textsf{ScaledGD($λ$)}$ starts from a small random initialization, and proceeds by gradient descent with a specific form of damped preconditioning t… ▽ More

    Submitted 6 November, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: New analysis in the noisy and the approximately low-rank settings

  44. arXiv:2302.00301  [pdf, other

    cs.IT eess.SP

    Covert Communication in Hybrid Microwave/mmWave A2G Systems with Transmission Mode Selection

    Authors: Wenhao Zhang, Ji He, Yulong Shen, Xiaohong Jiang

    Abstract: This paper investigates the covert communication in an air-to-ground (A2G) system, where a UAV (Alice) can adopt the omnidirectional microwave (OM) or directional mmWave (DM) transmission mode to transmit covert data to a ground user (Bob) while suffering from the detection of an adversary (Willie). For both the OM and DM modes, we first conduct theoretical analysis to reveal the inherent relation… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  45. arXiv:2301.09080  [pdf, other

    cs.MM cs.SD eess.AS

    Dance2MIDI: Dance-driven multi-instruments music generation

    Authors: Bo Han, Yuheng Li, Yixuan Shen, Yi Ren, Feilin Han

    Abstract: Dance-driven music generation aims to generate musical pieces conditioned on dance videos. Previous works focus on monophonic or raw audio generation, while the multi-instruments scenario is under-explored. The challenges associated with the dance-driven multi-instrument music (MIDI) generation are twofold: 1) no publicly available multi-instruments MIDI and video paired dataset and 2) the weak co… ▽ More

    Submitted 27 February, 2024; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: has been accepted by Computational Visual Media Journal

  46. arXiv:2212.03752  [pdf, other

    cs.CV eess.IV

    GLeaD: Improving GANs with A Generator-Leading Task

    Authors: Qingyan Bai, Ceyuan Yang, Yinghao Xu, Xihui Liu, Yujiu Yang, Yujun Shen

    Abstract: Generative adversarial network (GAN) is formulated as a two-player game between a generator (G) and a discriminator (D), where D is asked to differentiate whether an image comes from real data or is produced by G. Under such a formulation, D plays as the rule maker and hence tends to dominate the competition. Towards a fairer game in GANs, we propose a new paradigm for adversarial training, which… ▽ More

    Submitted 6 June, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: CVPR2023. Project page: https://ezioby.github.io/glead/ Code: https://github.com/EzioBy/glead/

  47. arXiv:2212.00465  [pdf, other

    cs.CV eess.IV

    FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning

    Authors: Yulei Qin, Xingyu Chen, Chao Chen, Yunhang Shen, Bo Ren, Yun Gu, Jie Yang, Chunhua Shen

    Abstract: Recently, webly supervised learning (WSL) has been studied to leverage numerous and accessible data from the Internet. Most existing methods focus on learning noise-robust models from web images while neglecting the performance drop caused by the differences between web domain and real-world domain. However, only by tackling the performance gap above can we fully exploit the practical value of web… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 7 pages, 5 figures, 5 tables. Accepted in AAAI 2023

  48. Optimal Allocation of Virtual Inertia and Droop Control for Renewable Energy in Stochastic Look-Ahead Power Dispatch

    Authors: Yukang Shen, Wenchuan Wu, Shumin Sun, Bin Wang

    Abstract: To stabilize the frequency of the renewable energy sources (RESs) dominated power system, frequency supports are required by RESs through virtual inertia emulation or droop control in the newly published grid codes. Since the long-term RES prediction involves significant errors, we need online configure the frequency control parameters of RESs in a rolling manner to improve the operation economics… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: IEEE Transactions on Sustainable Energy,2023

  49. An Adaptive and Robust Deep Learning Framework for THz Ultra-Massive MIMO Channel Estimation

    Authors: Wentao Yu, Yifei Shen, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Khaled B. Letaief

    Abstract: Terahertz ultra-massive MIMO (THz UM-MIMO) is envisioned as one of the key enablers of 6G wireless networks, for which channel estimation is highly challenging. Traditional analytical estimation methods are no longer effective, as the enlarged array aperture and the small wavelength result in a mixture of far-field and near-field paths, constituting a hybrid-field channel. Deep learning (DL)-based… ▽ More

    Submitted 8 June, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 15 pages, 11 figures, 5 tables, accepted by IEEE Journal of Selected Topics in Signal Processing (JSTSP)

  50. arXiv:2210.11684  [pdf, other

    eess.SY

    Change Point Detection Approach for Online Control of Unknown Time Varying Dynamical Systems

    Authors: Deepan Muthirayan, Ruijie Du, Yanning Shen, Pramod P. Khargonekar

    Abstract: We propose a novel change point detection approach for online learning control with full information feedback (state, disturbance, and cost feedback) for unknown time-varying dynamical systems. We show that our algorithm can achieve a sub-linear regret with respect to the class of Disturbance Action Control (DAC) policies, which are a widely studied class of policies for online control of dynamica… ▽ More

    Submitted 24 March, 2023; v1 submitted 20 October, 2022; originally announced October 2022.