
Showing 1–18 of 18 results for author: Cao, R

Searching in archive eess.
  1. arXiv:2405.03126  [pdf]

    eess.IV eess.SP

    Infrared Polarization Imaging-based Non-destructive Thermography Inspection

    Authors: Xianyu Wu, Bin Zhou, Peng Lin, Rong** Cao, Feng Huang

    Abstract: Infrared pulse thermography non-destructive testing (NDT) method is developed based on the difference in the infrared radiation intensity emitted by defective and non-defective areas of an object. However, when the radiation intensity of the defective target is similar to that of the non-defective area of the object, the detection results are poor. To address this issue, this study investigated th…

    Submitted 5 May, 2024; originally announced May 2024.

  2. arXiv:2405.02660  [pdf, other]

    cs.IT eess.SP

    AFDM Channel Estimation in Multi-Scale Multi-Lag Channels

    Authors: Rongyou Cao, Yuheng Zhong, Jiangbin Lyu, Deqing Wang, Liqun Fu

    Abstract: Affine Frequency Division Multiplexing (AFDM) is a brand new chirp-based multi-carrier (MC) waveform for high mobility communications, with promising advantages over Orthogonal Frequency Division Multiplexing (OFDM) and other MC waveforms. Existing AFDM research focuses on wireless communication at high carrier frequency (CF), which typically considers only Doppler frequency shift (DFS) as a resul…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 6 pages, 6 figures. Investigate AFDM under underwater multi-scale multi-lag channels. Derive the new input-output formula with the impact of Doppler time scaling. Propose two new channel estimation methods to tackle different levels of Doppler factors. Perform diversity analysis based on CFR overlap probability (COP) and mutual incoherent property (MIP)

  3. arXiv:2404.01298  [pdf, other]

    cs.CV eess.IV

    Noise2Image: Noise-Enabled Static Scene Recovery for Event Cameras

    Authors: Ruiming Cao, Dekel Galor, Amit Kohli, Jacob L Yates, Laura Waller

    Abstract: Event cameras capture changes of intensity over time as a stream of 'events' and generally cannot measure intensity itself; hence, they are only used for imaging dynamic scenes. However, fluctuations due to random photon arrival inevitably trigger noise events, even for static scenes. While previous efforts have been focused on filtering out these undesirable noise events to improve signal quality…

    Submitted 1 April, 2024; originally announced April 2024.

  4. DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking

    Authors: Yichuan Li, Junkai Zhao, Yixiao Li, Zheng Wu, Rui Cao, Masayoshi Tomizuka, Yunhui Liu

    Abstract: Efficiency and reliability are critical in robotic bin-picking as they directly impact the productivity of automated industrial processes. However, traditional approaches, demanding static objects and fixed collisions, lead to deployment limitations, operational inefficiencies, and process unreliability. This paper introduces a Dynamic Bin-Picking Framework (DBPF) that challenges traditional stati…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 8 pages, 5 figures. This paper has been accepted by IEEE RA-L on 2024-03-24. See the supplementary video at youtube: https://youtu.be/n5af2VsKhkg

  5. arXiv:2312.13683  [pdf, other]

    eess.SP cs.IT

    Joint Channel Estimation and Cooperative Localization for Near-Field Ultra-Massive MIMO

    Authors: Ruoxiao Cao, Hengtao He, Xianghao Yu, Shenghui Song, Kaibin Huang, Jun Zhang, Yi Gong, Khaled B. Letaief

    Abstract: The next-generation (6G) wireless networks are expected to provide not only seamless and high data-rate communications, but also ubiquitous sensing services. By providing vast spatial degrees of freedom (DoFs), ultra-massive multiple-input multiple-output (UM-MIMO) technology is a key enabler for both sensing and communications in 6G. However, the adoption of UM-MIMO leads to a shift from the far…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Submitted to JSAC

  6. arXiv:2312.11201  [pdf, other]

    eess.AS cs.SD eess.SP

    A Refining Underlying Information Framework for Monaural Speech Enhancement

    Authors: Rui Cao, Tianrui Wang, Meng Ge, Longbiao Wang, Jianwu Dang

    Abstract: Supervised speech enhancement has gained significantly from recent advancements in neural networks, especially due to their ability to non-linearly fit the diverse representations of target speech, such as waveform or spectrum. However, these direct-fitting solutions continue to face challenges with degraded speech and residual noise in hearing evaluations. By bridging the speech enhancement and t…

    Submitted 24 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 5 pages

  7. arXiv:2309.00755  [pdf]

    physics.optics eess.IV

    High-resolution, large field-of-view label-free imaging via aberration-corrected, closed-form complex field reconstruction

    Authors: Ruizhi Cao, Cheng Shen, Changhuei Yang

    Abstract: Computational imaging methods empower modern microscopy with the ability of producing high-resolution, large field-of-view, aberration-free images. One of the dominant computational label-free imaging methods, Fourier ptychographic microscopy (FPM), effectively increases the spatial-bandwidth product of conventional microscopy by using multiple tilted illuminations to achieve high-throughput imagi…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 13 pages, 5 figures

  8. arXiv:2306.14471  [pdf]

    physics.med-ph eess.IV physics.ins-det physics.optics

    Single-shot 3D photoacoustic computed tomography with a densely packed array for transcranial functional imaging

    Authors: Rui Cao, Yilin Luo, **hua Xu, Xiaofei Luo, Ku Geng, Yousuf Aborahama, Manxiu Cui, Samuel Davis, Shuai Na, Xin Tong, Cindy Liu, Karteek Sastry, Konstantin Maslov, Peng Hu, Yide Zhang, Li Lin, Yang Zhang, Lihong V. Wang

    Abstract: Photoacoustic computed tomography (PACT) is emerging as a new technique for functional brain imaging, primarily due to its capabilities in label-free hemodynamic imaging. Despite its potential, the transcranial application of PACT has encountered hurdles, such as acoustic attenuations and distortions by the skull and limited light penetration through the skull. To overcome these challenges, we hav…

    Submitted 26 June, 2023; originally announced June 2023.

  9. arXiv:2306.02625  [pdf, other]

    cs.SD eess.AS

    Rethinking the visual cues in audio-visual speaker extraction

    Authors: Junjie Li, Meng Ge, Zexu Pan, Rui Cao, Longbiao Wang, Jianwu Dang, Shiliang Zhang

    Abstract: The Audio-Visual Speaker Extraction (AVSE) algorithm employs parallel video recording to leverage two visual cues, namely speaker identity and synchronization, to enhance performance compared to audio-only algorithms. However, the visual front-end in AVSE is often derived from a pre-trained model or end-to-end trained, making it unclear which visual cue contributes more to the speaker extraction p…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted in Interspeech 2023

  10. arXiv:2303.10691  [pdf, other]

    eess.SP

    Multi-Channel Attentive Feature Fusion for Radio Frequency Fingerprinting

    Authors: Yuan Zeng, Yi Gong, Jiawei Liu, Shangao Lin, Zidong Han, Ruoxiao Cao, Kaibin Huang, Khaled Ben Letaief

    Abstract: Radio frequency fingerprinting (RFF) is a promising device authentication technique for securing the Internet of things. It exploits the intrinsic and unique hardware impairments of the transmitters for RF device identification. In real-world communication systems, hardware impairments across transmitters are subtle, which are difficult to model explicitly. Recently, due to the superior performanc…

    Submitted 23 June, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

  11. arXiv:2212.06596  [pdf, other]

    cs.IT eess.SP

    Broadband Digital Over-the-Air Computation for Wireless Federated Edge Learning

    Authors: Lizhao You, Xinbo Zhao, Rui Cao, Yulin Shao, Liqun Fu

    Abstract: This paper presents the first orthogonal frequency-division multiplexing (OFDM)-based digital over-the-air computation (AirComp) system for wireless federated edge learning, where multiple edge devices transmit model data simultaneously using non-orthogonal OFDM subcarriers, and the edge server aggregates data directly from the superimposed signal. Existing analog AirComp systems often assume perfe…

    Submitted 5 July, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: 20 pages. arXiv admin note: text overlap with arXiv:2111.10508

  12. arXiv:2209.11112  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

    Authors: Sherif Abdulatif, Ruizhe Cao, Bin Yang

    Abstract: In this work, we further develop the conformer-based metric generative adversarial network (CMGAN) model for speech enhancement (SE) in the time-frequency (TF) domain. This paper builds on our previous work but takes a more in-depth look by conducting extensive ablation studies on model inputs and architectural design choices. We rigorously tested the generalization ability of the model to unseen…

    Submitted 3 May, 2024; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 17 pages, 11 figures, and 6 tables. arXiv admin note: text overlap with arXiv:2203.15149

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 2477-2493, 2024

  13. arXiv:2206.01397  [pdf, other]

    physics.optics cs.CV cs.GR eess.IV eess.SP

    Dynamic Structured Illumination Microscopy with a Neural Space-time Model

    Authors: Ruiming Cao, Fanglin Linda Liu, Li-Hao Yeh, Laura Waller

    Abstract: Structured illumination microscopy (SIM) reconstructs a super-resolved image from multiple raw images captured with different illumination patterns; hence, acquisition speed is limited, making it unsuitable for dynamic scenes. We propose a new method, Speckle Flow SIM, that uses static patterned illumination with moving samples and models the sample motion during data capture in order to reconstru…

    Submitted 28 July, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

  14. arXiv:2203.15149  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    CMGAN: Conformer-based Metric GAN for Speech Enhancement

    Authors: Ruizhe Cao, Sherif Abdulatif, Bin Yang

    Abstract: Recently, convolution-augmented transformer (Conformer) has achieved promising performance in automatic speech recognition (ASR) and time-domain speech enhancement (SE), as it can capture both local and global dependencies in the speech signal. In this paper, we propose a conformer-based metric generative adversarial network (CMGAN) for SE in the time-frequency (TF) domain. In the generator, we ut…

    Submitted 3 March, 2024; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: 5 pages, 1 figure, 2 tables, published in INTERSPEECH 2022

    Journal ref: Proceedings of INTERSPEECH, 2022, pp. 936--940

  15. arXiv:2203.02118  [pdf, other]

    cs.RO eess.SY

    OmniWheg: An Omnidirectional Wheel-Leg Transformable Robot

    Authors: Ruixiang Cao, Jun Gu, Chen Yu, Andre Rosendo

    Abstract: This paper presents the design, analysis, and performance evaluation of an omnidirectional transformable wheel-leg robot called OmniWheg. We design a novel mechanism consisting of a separable omni-wheel and 4-bar linkages, allowing the robot to transform between omni-wheeled and legged modes smoothly. In wheeled mode, the robot can move in all directions and efficiently adjust the relative positio…

    Submitted 25 July, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 6 pages, 10 figures, IROS

  16. arXiv:2105.00594  [pdf, other]

    cs.LG eess.SP

    An End-to-End and Accurate PPG-based Respiratory Rate Estimation Approach Using Cycle Generative Adversarial Networks

    Authors: Seyed Amir Hossein Aqajari, Rui Cao, Amir Hosein Afandizadeh Zargari, Amir M. Rahmani

    Abstract: Respiratory rate (RR) is a clinical sign representing ventilation. An abnormal change in RR is often the first sign of health deterioration as the body attempts to maintain oxygen delivery to its tissues. There has been a growing interest in remotely monitoring of RR in everyday settings which has made photoplethysmography (PPG) monitoring wearable devices an attractive choice. PPG signals are use…

    Submitted 30 July, 2021; v1 submitted 2 May, 2021; originally announced May 2021.

  17. arXiv:2010.03965  [pdf, other]

    eess.IV cs.CV

    High Definition image classification in Geoscience using Machine Learning

    Authors: Yajun An, Zachary Golden, Tarka Wilcox, Renzhi Cao

    Abstract: High Definition (HD) digital photos taken with drones are widely used in the study of Geoscience. However, blurry images are often taken in collected data, and it takes a lot of time and effort to distinguish clear images from blurry ones. In this work, we apply Machine learning techniques, such as Support Vector Machine (SVM) and Neural Network (NN) to classify HD images in Geoscience as clear an…

    Submitted 25 September, 2020; originally announced October 2020.

    Comments: 8 pages, 14 figures

  18. arXiv:1910.02185  [pdf, other]

    eess.IV cs.LG q-bio.QM

    Prostate cancer inference via weakly-supervised learning using a large collection of negative MRI

    Authors: Ruiming Cao, Xinran Zhong, Fabien Scalzo, Steven Raman, Kyung hyun Sung

    Abstract: Recent advances in medical imaging techniques have led to significant improvements in the management of prostate cancer (PCa). In particular, multi-parametric MRI (mp-MRI) continues to gain clinical acceptance as the preferred imaging technique for non-invasive detection and grading of PCa. However, the machine learning-based diagnosis systems for PCa are often constrained by the limited access to…

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 6 pages, 5 figures, 2019 International Conference on Computer Vision - Visual Recognition for Medical Images Workshop