Showing 1–17 of 17 results for author: Gong, K

Searching in archive eess.
  1. arXiv:2405.14802  [pdf, other]

    eess.IV cs.CV

    Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation

    Authors: Hongxu Jiang, Muhammad Imran, Linhai Ma, Teng Zhang, Yuyin Zhou, Muxuan Liang, Kuang Gong, Wei Shao

    Abstract: Denoising diffusion probabilistic models (DDPMs) have achieved unprecedented success in computer vision. However, they remain underutilized in medical imaging, a field crucial for disease diagnosis and treatment planning. This is primarily due to the high computational cost associated with (1) the use of a large number of time steps (e.g., 1,000) in diffusion processes and (2) the increased dimensio…

    Submitted 23 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2401.17593  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model

    Authors: Yafei Dong, Kuang Gong

    Abstract: Head and neck (H&N) cancers are among the most prevalent types of cancer worldwide, and [18F]F-FDG PET/CT is widely used for H&N cancer management. Recently, the diffusion model has demonstrated remarkable performance in various image-generation tasks. In this work, we proposed a 3D diffusion model to accurately perform H&N tumor segmentation from 3D PET and CT volumes. The 3D diffusion model was…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 28 pages, 5 figures

  3. arXiv:2306.11984  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models

    Authors: Se-In Jang, Cristina Lois, Emma Thibault, J. Alex Becker, Yafei Dong, Marc D. Normandin, Julie C. Price, Keith A. Johnson, Georges El Fakhri, Kuang Gong

    Abstract: In this work, we developed a novel text-guided image synthesis technique which could generate realistic tau PET images from textual descriptions and the subject's MR image. The generated tau PET images have the potential to be used in examining relations between different measures and also increasing the public availability of tau PET datasets. The method was based on latent diffusion models. Both…

    Submitted 20 June, 2023; originally announced June 2023.

  4. arXiv:2302.03861  [pdf]

    eess.IV cs.CV

    SwinCross: Cross-modal Swin Transformer for Head-and-Neck Tumor Segmentation in PET/CT Images

    Authors: Gary Y. Li, Junyu Chen, Se-In Jang, Kuang Gong, Quanzheng Li

    Abstract: Radiotherapy (RT) combined with cetuximab is the standard treatment for patients with inoperable head and neck cancers. Segmentation of head and neck (H&N) tumors is a prerequisite for radiotherapy planning but a time-consuming process. In recent years, deep convolutional neural networks have become the de facto standard for automated image segmentation. However, due to the expensive computational…

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 figures. Med Phys. 2023

  5. arXiv:2212.10724  [pdf]

    eess.IV cs.CV

    Investigation of Network Architecture for Multimodal Head-and-Neck Tumor Segmentation

    Authors: Ye Li, Junyu Chen, Se-In Jang, Kuang Gong, Quanzheng Li

    Abstract: Inspired by the recent success of Transformers for Natural Language Processing and Vision Transformers for Computer Vision, many researchers in the medical imaging community have flocked to Transformer-based networks for various mainstream medical tasks such as classification, segmentation, and estimation. In this study, we analyze two recently published Transformer-based network architectures fo…

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted for oral presentation by IEEE Medical Imaging Conference 2022

  6. arXiv:2209.06167  [pdf, other]

    eess.IV cs.CV physics.med-ph

    PET image denoising based on denoising diffusion probabilistic models

    Authors: Kuang Gong, Keith A. Johnson, Georges El Fakhri, Quanzheng Li, Tinsu Pan

    Abstract: Due to various physical degradation factors and limited counts received, PET image quality needs further improvements. The denoising diffusion probabilistic models (DDPM) are distribution learning-based models, which try to transform a normal distribution into a specific data distribution based on iterative refinements. In this work, we proposed and evaluated different DDPM-based methods for PET i…

    Submitted 14 September, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: 8 figures

  7. arXiv:2209.03300  [pdf, ps, other]

    eess.IV cs.CV

    Spach Transformer: Spatial and Channel-wise Transformer Based on Local and Global Self-attentions for PET Image Denoising

    Authors: Se-In Jang, Tinsu Pan, Ye Li, Pedram Heidari, Junyu Chen, Quanzheng Li, Kuang Gong

    Abstract: Positron emission tomography (PET) is widely used in clinics and research due to its quantitative merits and high sensitivity, but suffers from low signal-to-noise ratio (SNR). Recently, convolutional neural networks (CNNs) have been widely used to improve PET image quality. Though successful and efficient in local feature extraction, CNNs cannot capture long-range dependencies well due to its limit…

    Submitted 10 December, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 15 pages

  8. arXiv:2203.08034  [pdf]

    eess.IV cs.CV cs.LG physics.med-ph

    A Noise-level-aware Framework for PET Image Denoising

    Authors: Ye Li, Jianan Cui, Junyu Chen, Guodong Zeng, Scott Wollenweber, Floris Jansen, Se-In Jang, Kyungsang Kim, Kuang Gong, Quanzheng Li

    Abstract: In PET, the amount of relative (signal-dependent) noise present in different body regions can be significantly different and is inherently related to the number of counts present in that region. The number of counts in a region depends, in principle and among other factors, on the total administered activity, scanner sensitivity, image acquisition duration, radiopharmaceutical tracer uptake in the…

    Submitted 15 March, 2022; originally announced March 2022.

  9. arXiv:2201.01443  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image Reconstruction

    Authors: Siqi Li, Kuang Gong, Ramsey D. Badawi, Edward J. Kim, Jinyi Qi, Guobao Wang

    Abstract: Image reconstruction of low-count positron emission tomography (PET) data is challenging. Kernel methods address the challenge by incorporating image prior information in the forward model of iterative PET image reconstruction. The kernelized expectation-maximization (KEM) algorithm has been developed and demonstrated to be effective and easy to implement. A common approach for a further improveme…

    Submitted 24 October, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2110.01174

  10. arXiv:2109.09161  [pdf, other]

    cs.CL eess.AS

    Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition

    Authors: Guolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang, Liang Lin

    Abstract: Unifying acoustic and linguistic representation learning has become increasingly crucial to transfer the knowledge learned on the abundance of high-resource language data for low-resource speech recognition. Existing approaches simply cascade pre-trained acoustic and language models to learn the transfer from speech to text. However, how to solve the representation discrepancy of speech and text i…

    Submitted 9 October, 2021; v1 submitted 19 September, 2021; originally announced September 2021.

  11. arXiv:2106.10359  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Direct Reconstruction of Linear Parametric Images from Dynamic PET Using Nonlocal Deep Image Prior

    Authors: Kuang Gong, Ciprian Catana, Jinyi Qi, Quanzheng Li

    Abstract: Direct reconstruction methods have been developed to estimate parametric images directly from the measured PET sinograms by combining the PET imaging model and tracer kinetics in an integrated framework. Due to limited counts received, the signal-to-noise ratio (SNR) and resolution of parametric images produced by direct reconstruction frameworks are still limited. Recently, supervised deep learning me…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: 10 pages, 10 figures

  12. arXiv:2012.11896  [pdf, other]

    cs.CL cs.SD eess.AS

    Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition

    Authors: Yubei Xiao, Ke Gong, Pan Zhou, Guolin Zheng, Xiaodan Liang, Liang Lin

    Abstract: Low-resource automatic speech recognition (ASR) is challenging, as the low-resource target language data is insufficient to train an ASR model well. To solve this issue, meta-learning formulates ASR for each source language into many small ASR tasks and meta-learns a model initialization on all tasks from different source languages to achieve fast adaptation on unseen target languages. However, for different s…

    Submitted 12 April, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: accepted in AAAI2021

  13. arXiv:2009.06129  [pdf, other]

    eess.IV cs.LG physics.med-ph

    Super Resolution of Arterial Spin Labeling MR Imaging Using Unsupervised Multi-Scale Generative Adversarial Network

    Authors: Jianan Cui, Kuang Gong, Paul Han, Huafeng Liu, Quanzheng Li

    Abstract: Arterial spin labeling (ASL) magnetic resonance imaging (MRI) is a powerful imaging technology that can measure cerebral blood flow (CBF) quantitatively. However, since only a small portion of blood is labeled compared to the whole tissue volume, conventional ASL suffers from low signal-to-noise ratio (SNR), poor spatial resolution, and long acquisition time. In this paper, we proposed a super-res…

    Submitted 13 September, 2020; originally announced September 2020.

    Comments: Accepted to 2020 MICCAI MLMI workshop

  14. arXiv:2009.05901  [pdf]

    physics.med-ph cs.LG eess.IV

    Clinically Translatable Direct Patlak Reconstruction from Dynamic PET with Motion Correction Using Convolutional Neural Network

    Authors: Nuobei Xie, Kuang Gong, Ning Guo, Zhixing Qin, Jianan Cui, Zhifang Wu, Huafeng Liu, Quanzheng Li

    Abstract: The Patlak model is widely used in 18F-FDG dynamic positron emission tomography (PET) imaging, where the estimated parametric images reveal important biochemical and physiological information. Because of better noise modeling and more information extracted from the raw sinogram, direct Patlak reconstruction has gained popularity over the indirect approach, which utilizes reconstructed dynamic PET images alone.…

    Submitted 12 September, 2020; originally announced September 2020.

    Comments: Accepted to MICCAI 2020

  15. arXiv:2004.06272  [pdf, other]

    cs.CV cs.LG eess.IV

    Bidirectional Graph Reasoning Network for Panoptic Segmentation

    Authors: Yangxin Wu, Gengwei Zhang, Yiming Gao, Xiajun Deng, Ke Gong, Xiaodan Liang, Liang Lin

    Abstract: Recent research on panoptic segmentation resorts to a single end-to-end network to combine the tasks of instance segmentation and semantic segmentation. However, prior models only unified the two related tasks at the architectural level via a multi-branch scheme or revealed the underlying correlation between them by unidirectional feature fusion, which disregards the explicit semantic and co-occu…

    Submitted 13 April, 2020; originally announced April 2020.

    Comments: CVPR2020

  16. arXiv:1912.07180  [pdf]

    physics.med-ph cs.LG eess.IV

    Penalized-likelihood PET Image Reconstruction Using 3D Structural Convolutional Sparse Coding

    Authors: Nuobei Xie, Kuang Gong, Ning Guo, Zhixin Qin, Zhifang Wu, Huafeng Liu, Quanzheng Li

    Abstract: Positron emission tomography (PET) is widely used for clinical diagnosis. As PET suffers from low resolution and high noise, numerous efforts try to incorporate anatomical priors into PET image reconstruction, especially with the development of hybrid PET/CT and PET/MRI systems. In this work, we proposed a novel 3D structural convolutional sparse coding (CSC) concept for penalized-likelihood PET i…

    Submitted 15 December, 2019; originally announced December 2019.

    Comments: 11 pages, 12 figures

  17. arXiv:1906.03639  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Consensus Neural Network for Medical Imaging Denoising with Only Noisy Training Samples

    Authors: Dufan Wu, Kuang Gong, Kyungsang Kim, Quanzheng Li

    Abstract: Deep neural networks have been proven efficient for medical image denoising. Current training methods require both noisy and clean images. However, clean images cannot be acquired for many practical medical applications due to naturally noisy signals, such as dynamic imaging, spectral computed tomography, arterial spin labeling magnetic resonance imaging, etc. In this paper, we proposed a training m…

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 9 pages, 2 figures, accepted by MICCAI 2019