Showing 1–30 of 30 results for author: Cao, C

Searching in archive eess.
  1. arXiv:2405.15438  [pdf, other]

    cs.CV cs.LG eess.IV

    Comparing remote sensing-based forest biomass mapping approaches using new forest inventory plots in contrasting forests in northeastern and southwestern China

    Authors: Wenquan Dong, Edward T. A. Mitchard, Yuwei Chen, Man Chen, Congfeng Cao, Peilun Hu, Cong Xu, Steven Hancock

    Abstract: Large-scale high spatial resolution aboveground biomass (AGB) maps play a crucial role in determining forest carbon stocks and how they are changing, which is instrumental in understanding the global carbon cycle, and implementing policy to mitigate climate change. The advent of the new space-borne LiDAR sensor, NASA's GEDI instrument, provides unparalleled possibilities for the accurate and unbia…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2312.05256  [pdf, other]

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, Jingyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang, et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor…

    Submitted 10 November, 2023; originally announced December 2023.

  3. arXiv:2311.03074  [pdf, other]

    eess.IV cs.CV

    A Two-Stage Generative Model with CycleGAN and Joint Diffusion for MRI-based Brain Tumor Detection

    Authors: Wenxin Wang, Zhuo-Xu Cui, Guanxun Cheng, Chentao Cao, Xi Xu, Ziwei Liu, Haifeng Wang, Yulong Qi, Dong Liang, Yanjie Zhu

    Abstract: Accurate detection and segmentation of brain tumors is critical for medical diagnosis. However, current supervised learning methods require extensively annotated images and the state-of-the-art generative models used in unsupervised methods often have limitations in covering the whole data distribution. In this paper, we propose a novel framework Two-Stage Generative Model (TSGM) that combines Cyc…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 11 pages, 9 figures, 3 tables

  4. arXiv:2309.14372  [pdf, other]

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Human Transcription Quality Improvement

    Authors: Jian Gao, Hanbo Sun, Cheng Cao, Zheng Du

    Abstract: High quality transcription data is crucial for training automatic speech recognition (ASR) systems. However, the existing industry-level data collection pipelines are expensive to researchers, while the quality of crowdsourced transcription is low. In this paper, we propose a reliable method to collect speech transcriptions. We introduce two mechanisms to improve transcription quality: confidence…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures, 5 tables, INTERSPEECH 2023

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: INTERSPEECH 2023

  5. arXiv:2309.10089  [pdf, other]

    eess.AS cs.AI cs.CL cs.HC cs.LG cs.SD

    HTEC: Human Transcription Error Correction

    Authors: Hanbo Sun, Jian Gao, Xiaomin Wu, Anjie Fang, Cheng Cao, Zheng Du

    Abstract: High-quality human transcription is essential for training and improving Automatic Speech Recognition (ASR) models. Recent study~\cite{libricrowd} has found that every 1% worse transcription Word Error Rate (WER) increases approximately 2% ASR WER by using the transcriptions to train ASR models. Transcription errors are inevitable for even highly-trained annotators. However, few studies have explo…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 13 pages, 4 figures, 11 tables, AMLC 2023

    MSC Class: 68T50 ACM Class: I.2.7

  6. arXiv:2306.01974  [pdf, other]

    cs.SD eess.AS

    BEDRF: Bidirectional Edge Diffraction Response Function for Interactive Sound Propagation

    Authors: Chunxiao Cao, Zili An, Zhong Ren, Dinesh Manocha, Kun Zhou

    Abstract: We introduce bidirectional edge diffraction response function (BEDRF), a new approach to model wave diffraction around edges with path tracing. The diffraction part of the wave is expressed as an integration on path space, and the wave-edge interaction is expressed using only the localized information around points on the edge similar to a bidirectional scattering distribution function (BSDF) for…

    Submitted 2 June, 2023; originally announced June 2023.

  7. arXiv:2305.02509  [pdf, other]

    eess.IV cs.CV cs.LG

    Meta-Learning Enabled Score-Based Generative Model for 1.5T-Like Image Reconstruction from 0.5T MRI

    Authors: Zhuo-Xu Cui, Congcong Liu, Chentao Cao, Yuanyuan Liu, Jing Cheng, Qingyong Zhu, Yanjie Zhu, Haifeng Wang, Dong Liang

    Abstract: Magnetic resonance imaging (MRI) is known to have reduced signal-to-noise ratios (SNR) at lower field strengths, leading to signal degradation when producing a low-field MRI image from a high-field one. Therefore, reconstructing a high-field-like image from a low-field MRI is a complex problem due to the ill-posed nature of the task. Additionally, obtaining paired low-field and high-field MR image…

    Submitted 3 May, 2023; originally announced May 2023.

  8. arXiv:2303.07327  [pdf, other]

    cs.CV eess.IV

    Unsupervised HDR Image and Video Tone Mapping via Contrastive Learning

    Authors: Cong Cao, Huanjing Yue, Xin Liu, Jingyu Yang

    Abstract: Capturing high dynamic range (HDR) images (videos) is attractive because it can reveal the details in both dark and bright regions. Since the mainstream screens only support low dynamic range (LDR) content, tone mapping algorithm is required to compress the dynamic range of HDR images (videos). Although image tone mapping has been widely explored, video tone mapping is lagging behind, especially f…

    Submitted 26 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  9. arXiv:2212.11274  [pdf, other]

    eess.IV cs.CV

    SPIRiT-Diffusion: SPIRiT-driven Score-Based Generative Modeling for Vessel Wall imaging

    Authors: Chentao Cao, Zhuo-Xu Cui, Jing Cheng, Sen Jia, Hairong Zheng, Dong Liang, Yanjie Zhu

    Abstract: Diffusion model is the most advanced method in image generation and has been successfully applied to MRI reconstruction. However, the existing methods do not consider the characteristics of multi-coil acquisition of MRI data. Therefore, we give a new diffusion model, called SPIRiT-Diffusion, based on the SPIRiT iterative reconstruction algorithm. Specifically, SPIRiT-Diffusion characterizes the pr…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: submitted to ISMRM

  10. arXiv:2212.04736  [pdf, other]

    eess.IV

    FPGA-Based In-Vivo Calcium Image Decoding for Closed-Loop Feedback Applications

    Authors: Zhe Chen, Garrett J. Blair, Chengdi Cao, Jim Zhou, Daniel Aharoni, Peyman Golshani, Hugh T. Blair, Jason Cong

    Abstract: Miniaturized calcium imaging is an emerging neural recording technique that has been widely used for monitoring neural activity on a large scale at a specific brain region of rats or mice. Most existing calcium-image analysis pipelines operate offline. This results in long processing latency, making it difficult to realize closed-loop feedback stimulation for brain research. In recent work, we hav…

    Submitted 16 April, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 11 pages, 15 figures

  11. arXiv:2211.05309  [pdf]

    eess.SY

    Generic Cryo-CMOS Device Modeling and EDA-Compatible Platform for Reliable Cryogenic IC Design

    Authors: Zhidong Tang, Zewei Wang, Yumeng Yuan, Chang He, Xin Luo, Ao Guo, Renhe Chen, Yongqi Hu, Longfei Yang, Chengwei Cao, Linlin Liu, Liujiang Yu, Ganbing Shang, Yongfeng Cao, Shoumian Chen, Yuhang Zhao, Shaojian Hu, Xufeng Kou

    Abstract: This paper outlines the establishment of a generic cryogenic CMOS database in which key electrical parameters and transfer characteristics of the MOSFETs are quantified as functions of device size, temperature/frequency responses. Meanwhile, comprehensive device statistical study is conducted to evaluate the influence of variation and mismatch effects at low temperatures. Furthermore, by incorpora…

    Submitted 9 February, 2024; v1 submitted 9 November, 2022; originally announced November 2022.

  12. arXiv:2210.14321  [pdf, other]

    eess.AS cs.AI cs.MM cs.SD eess.SP

    Artificial ASMR: A Cyber-Psychological Approach

    Authors: Zexin Fang, Bin Han, C. Clark Cao, Hans D. Schotten

    Abstract: The popularity of Autonomous Sensory Meridian Response (ASMR) has skyrocketed over the past decade, but scientific studies on what exactly triggered ASMR effect remain few and immature, one most commonly acknowledged trigger is that ASMR clips typically provide rich semantic information. With our attention caught by the common acoustic patterns in ASMR audios, we investigate the correlation between…

    Submitted 5 July, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted by IEEE MLSP 2023

  13. arXiv:2209.00835  [pdf, other]

    eess.IV cs.CV

    Self-Score: Self-Supervised Learning on Score-Based Models for MRI Reconstruction

    Authors: Zhuo-Xu Cui, Chentao Cao, Shaonan Liu, Qingyong Zhu, Jing Cheng, Haifeng Wang, Yanjie Zhu, Dong Liang

    Abstract: Recently, score-based diffusion models have shown satisfactory performance in MRI reconstruction. Most of these methods require a large amount of fully sampled MRI data as a training set, which, sometimes, is difficult to acquire in practice. This paper proposes a fully-sampled-data-free score-based diffusion model for MRI reconstruction, which learns the fully sampled MR image prior in a self-sup…

    Submitted 2 September, 2022; originally announced September 2022.

  14. arXiv:2208.05481  [pdf, other]

    eess.IV cs.CV cs.LG

    High-Frequency Space Diffusion Models for Accelerated MRI

    Authors: Chentao Cao, Zhuo-Xu Cui, Yue Wang, Shaonan Liu, Taijin Chen, Hairong Zheng, Dong Liang, Yanjie Zhu

    Abstract: Diffusion models with continuous stochastic differential equations (SDEs) have shown superior performances in image generation. It can serve as a deep generative prior to solving the inverse problem in magnetic resonance (MR) reconstruction. However, low-frequency regions of $k$-space data are typically fully sampled in fast MR imaging, while existing diffusion models are performed throughout the…

    Submitted 20 January, 2024; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: accepted for IEEE TMI

  15. arXiv:2208.02466  [pdf, other]

    cs.IT eess.SP

    Linear MIMO Precoders Design for Finite Alphabet Inputs via Model-Free Training

    Authors: Chen Cao, Biqian Feng, Yongpeng Wu, Derrick Wing Kwan Ng, Wenjun Zhang

    Abstract: This paper investigates a novel method for designing linear precoders with finite alphabet inputs based on autoencoders (AE) without the knowledge of the channel model. By model-free training of the autoencoder in a multiple-input multiple-output (MIMO) system, the proposed method can effectively solve the optimization problem to design the precoders that maximize the mutual information between th…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: Accepted by GLOBECOM 2022

  16. arXiv:2207.08117  [pdf, other]

    eess.IV

    Accelerating Magnetic Resonance T1ρ Mapping Using Simultaneously Spatial Patch-based and Parametric Group-based Low-rank Tensors (SMART)

    Authors: Yuanyuan Liu, Dong Liang, Zhuo-Xu Cui, Yuxin Yang, Chentao Cao, Qingyong Zhu, Jing Cheng, Caiyun Shi, Haifeng Wang, Yanjie Zhu

    Abstract: Quantitative magnetic resonance (MR) T1ρ mapping is a promising approach for characterizing intrinsic tissue-dependent information. However, long scan time significantly hinders its widespread applications. Recently, low-rank tensor has been employed and demonstrated good performance in accelerating MR T1ρ mapping. In this study, we propose a novel method that uses spatial patch-based an…

    Submitted 28 January, 2023; v1 submitted 17 July, 2022; originally announced July 2022.

    Comments: 22 pages, 19 figures

  17. arXiv:2205.04073  [pdf, other]

    eess.IV cs.CV cs.LG

    PS-Net: Learned Partially Separable Model for Dynamic MR Imaging

    Authors: Chentao Cao, Zhuo-Xu Cui, Qingyong Zhu, Congcong Liu, Dong Liang, Yanjie Zhu

    Abstract: Deep learning methods driven by the low-rank regularization have achieved attractive performance in dynamic magnetic resonance (MR) imaging. However, most of these methods represent low-rank prior by hand-crafted nuclear norm, which cannot accurately approximate the low-rank prior over the entire dataset through a fixed regularization parameter. In this paper, we propose a learned low-rank method…

    Submitted 9 August, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: journal

  18. arXiv:2203.04659  [pdf, other]

    eess.IV cs.IT eess.SP physics.optics

    Single-pixel imaging based on weight sort of the Hadamard basis

    Authors: Wen-Kai Yu, Chong Cao, Ying Yang, Ning Wei, Shuo-Fei Wang, Chen-Xi Zhu

    Abstract: Single-pixel imaging (SPI) is very popular in subsampling applications, but the random measurement matrices it typically uses will lead to measurement blindness as well as difficulties in calculation and storage, and will also limit the further reduction in sampling rate. The deterministic Hadamard basis has become an alternative choice due to its orthogonality and structural characteristics. Ther…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 19 pages, 13 figures

  19. arXiv:2202.01582  [pdf, other]

    cs.SD cs.GR eess.AS

    A Psychoacoustic Quality Criterion for Path-Traced Sound Propagation

    Authors: Chunxiao Cao, Zili An, Zhong Ren, Dinesh Manocha, Kun Zhou

    Abstract: In developing virtual acoustic environments, it is important to understand the relationship between the computation cost and the perceptual significance of the resultant numerical error. In this paper, we propose a quality criterion that evaluates the error significance of path-tracing-based sound propagation simulators. We present an analytical formula that estimates the error signal power spectr…

    Submitted 8 October, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: 12 pages, 10 figures. To be published in IEEE TVCG

  20. arXiv:2111.10633  [pdf, other]

    cs.CV eess.IV

    Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression

    Authors: Jianqiang Wang, Dandan Ding, Zhu Li, Xiaoxing Feng, Chuntong Cao, Zhan Ma

    Abstract: This study develops a unified Point Cloud Geometry (PCG) compression method through the processing of multiscale sparse tensor-based voxelized PCG. We call this compression method SparsePCGC. The proposed SparsePCGC is a low complexity solution because it only performs the convolutions on sparsely-distributed Most-Probable Positively-Occupied Voxels (MP-POV). The multiscale representation also all…

    Submitted 21 October, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: 17 pages, 15 figures

  21. arXiv:2111.06707  [pdf, other]

    eess.IV cs.CV

    Transformer-based Image Compression

    Authors: Ming Lu, Peiyao Guo, Huiqing Shi, Chuntong Cao, Zhan Ma

    Abstract: A Transformer-based Image Compression (TIC) approach is developed which reuses the canonical variational autoencoder (VAE) architecture with paired main and hyper encoder-decoders. Both main and hyper encoders are comprised of a sequence of neural transformation units (NTUs) to analyse and aggregate important information for more compact representation of input image, while the decoders mirror the…

    Submitted 12 November, 2021; originally announced November 2021.

  22. arXiv:2106.15054  [pdf]

    eess.SY

    Time-Domain Doppler Biomotion Detections Immune to Unavoidable DC Offsets

    Authors: Qinyi Lv, Lingtong Min, Congqi Cao, Shigang Zhou, Deyun Zhou, Chengkai Zhu, Yun Li, Zhongbo Zhu, Xiaojun Li, Lixin Ran

    Abstract: In the past decades, continuous Doppler radar sensor-based bio-signal detections have attracted many research interests. A typical example is the Doppler heartbeat detection. While significant progresses have been achieved, reliable, time-domain accurate demodulation of bio-signals in the presence of unavoidable DC offsets remains a technical challenge. Aiming to overcome this difficulty, we propo…

    Submitted 29 October, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted by IEEE Transactions on Instrumentation & Measurement

  23. arXiv:2106.02514  [pdf, other]

    cs.CV eess.IV

    The Image Local Autoregressive Transformer

    Authors: Chenjie Cao, Yuxin Hong, Xiang Li, Chengrong Wang, Chengming Xu, XiangYang Xue, Yanwei Fu

    Abstract: Recently, AutoRegressive (AR) models for the whole image generation empowered by transformers have achieved comparable or even better performance to Generative Adversarial Networks (GANs). Unfortunately, directly applying such AR models to edit/change local image regions, may suffer from the problems of missing global information, slow inference speed, and information leakage of local guidance. To…

    Submitted 18 October, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Accepted by NeurIPS2021

  24. arXiv:2103.15876  [pdf, other]

    cs.CV eess.IV

    High-fidelity Face Tracking for AR/VR via Deep Lighting Adaptation

    Authors: Lele Chen, Chen Cao, Fernando De la Torre, Jason Saragih, Chenliang Xu, Yaser Sheikh

    Abstract: 3D video avatars can empower virtual communications by providing compression, privacy, entertainment, and a sense of presence in AR/VR. Best 3D photo-realistic AR/VR avatars driven by video, that can minimize uncanny effects, rely on person-specific models. However, existing person-specific photo-realistic 3D models are not robust to lighting, hence their results typically miss subtle facial behav…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: The paper is accepted to CVPR 2021

  25. arXiv:2010.04600  [pdf, other]

    eess.SY

    Robust Adaptive Control of Linear Parameter-Varying Systems with Unmatched Uncertainties

    Authors: Pan Zhao, Steven Snyder, Naira Hovakimyan, Chengyu Cao

    Abstract: In controlling systems with large operating envelopes, it is often necessary to adjust the desired dynamics according to operating conditions. This paper presents a robust adaptive control architecture for linear parameter-varying (LPV) systems that allows for the desired dynamics to be systematically scheduled, while being able to handle a broad class of uncertainties, both matched and unmatched,…

    Submitted 21 September, 2023; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: 28 pages, 9 figures

  26. arXiv:2008.00901  [pdf, other]

    eess.IV cs.CV

    Automated Segmentation of Brain Gray Matter Nuclei on Quantitative Susceptibility Mapping Using Deep Convolutional Neural Network

    Authors: Chao Chai, Pengchong Qiao, Bin Zhao, Huiying Wang, Guohua Liu, Hong Wu, E Mark Haacke, Wen Shen, Chen Cao, Xinchen Ye, Zhiyang Liu, Shuang Xia

    Abstract: Abnormal iron accumulation in the brain subcortical nuclei has been reported to be correlated to various neurodegenerative diseases, which can be measured through the magnetic susceptibility from the quantitative susceptibility mapping (QSM). To quantitively measure the magnetic susceptibility, the nuclei should be accurately segmented, which is a tedious task for clinicians. In this paper, we pro…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: submitted to IEEE Transactions on Medical Imaging

  27. arXiv:2004.00152  [pdf, other]

    eess.SY

    L1-Adaptive MPPI Architecture for Robust and Agile Control of Multirotors

    Authors: Jintasit Pravitra, Kasey A. Ackerman, Chengyu Cao, Naira Hovakimyan, Evangelos A. Theodorou

    Abstract: This paper presents a multirotor control architecture, where Model Predictive Path Integral Control (MPPI) and L1 adaptive control are combined to achieve both fast model predictive trajectory planning and robust trajectory tracking. MPPI provides a framework to solve nonlinear MPC with complex cost functions in real-time. However, it often lacks robustness, especially when the simulated dynamics…

    Submitted 31 March, 2020; originally announced April 2020.

    Comments: Submitted to 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems

  28. arXiv:2003.14013  [pdf, other]

    eess.IV cs.CV cs.LG

    Supervised Raw Video Denoising with a Benchmark Dataset on Dynamic Scenes

    Authors: Huanjing Yue, Cong Cao, Lei Liao, Ronghe Chu, Jingyu Yang

    Abstract: In recent years, the supervised learning strategy for real noisy image denoising has been emerging and has achieved promising results. In contrast, realistic noise removal for raw noisy videos is rarely studied due to the lack of noisy-clean pairs for dynamic scenes. Clean video frames for dynamic scenes cannot be captured with a long-exposure shutter or averaging multi-shots as was done for stati…

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: CVPR2020 accepted paper

  29. arXiv:1908.03735  [pdf]

    eess.IV cs.CV cs.LG

    Automatic acute ischemic stroke lesion segmentation using semi-supervised learning

    Authors: Bin Zhao, Shuxue Ding, Hong Wu, Guohua Liu, Chen Cao, Song Jin, Zhiyang Liu

    Abstract: Ischemic stroke is a common disease in the elderly population, which can cause long-term disability and even death. However, the time window for treatment of ischemic stroke in its acute stage is very short. To fast localize and quantitively evaluate the acute ischemic stroke (AIS) lesions, many deep-learning-based lesion segmentation methods have been proposed in the literature, where a deep conv…

    Submitted 20 September, 2020; v1 submitted 10 August, 2019; originally announced August 2019.

  30. arXiv:1809.01859  [pdf, ps, other]

    cs.IT cs.LG eess.SP stat.ML

    Deep Learning-Based Decoding for Constrained Sequence Codes

    Authors: Congzhe Cao, Duanshun Li, Ivan Fair

    Abstract: Constrained sequence codes have been widely used in modern communication and data storage systems. Sequences encoded with constrained sequence codes satisfy constraints imposed by the physical channel, hence enabling efficient and reliable transmission of coded symbols. Traditional encoding and decoding of constrained sequence codes rely on table look-up, which is prone to errors that occur during…

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: 7 pages, 6 figures, accepted by IEEE Global Communications Conference Workshop - Machine learning for communications