Skip to main content

Showing 1–14 of 14 results for author: Zou, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.16942  [pdf, other

    eess.IV cs.AI cs.CV

    Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images

    Authors: Yuanyuan Peng, Aidi Lin, Meng Wang, Tian Lin, Ke Zou, Yinglin Cheng, Tingkun Shi, Xulong Liao, Lixia Feng, Zhen Liang, Xinjian Chen, Huazhu Fu, Haoyu Chen

    Abstract: Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real-world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RE… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: All codes are available at https://github.com/yuanyuanpeng0129/FMUE

  2. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, **ming Guo, Xiaolin Chen, **gcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.08835  [pdf, other

    cs.SD eess.AS

    A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed

    Authors: Ziyang Zhuang, Chenfeng Miao, Kun Zou, Shuai Gong, Ming Fang, Tao Wei, Zijian Li, Wei Hu, Shaojun Wang, **g Xiao

    Abstract: Non-autoregressive (NAR) automatic speech recognition (ASR) models predict tokens independently and simultaneously, bringing high inference speed. However, there is still a gap in the accuracy of the NAR models compared to the autoregressive (AR) models. To further narrow the gap between the NAR and AR models, we propose a single-step NAR ASR architecture with high accuracy and inference speed, ca… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2405.18167  [pdf, other

    eess.IV cs.CV

    Confidence-aware multi-modality learning for eye disease screening

    Authors: Ke Zou, Tian Lin, Zongbo Han, Meng Wang, Xuedong Yuan, Haoyu Chen, Changqing Zhang, Xiao**g Shen, Huazhu Fu

    Abstract: Multi-modal ophthalmic image classification plays a key role in diagnosing eye diseases, as it integrates information from different sources to complement their respective performances. However, recent improvements have mainly focused on accuracy, often neglecting the importance of confidence and robustness in predictions for diverse modalities. In this study, we propose a novel multi-modality evi… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 27 pages, 7 figures, 9 tables

  5. arXiv:2405.16102  [pdf, other

    eess.IV cs.CV

    Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation

    Authors: Hongye Zeng, Ke Zou, Zhihao Chen, Rui Zheng, Huazhu Fu

    Abstract: Source-Free Unsupervised Domain Adaptation (SFUDA) has recently become a focus in the medical image domain adaptation, as it only utilizes the source model and does not require annotated target data. However, current SFUDA approaches cannot tackle the complex segmentation task across different MRI sequences, such as the vestibular schwannoma segmentation. To address this problem, we proposed Relia… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Early accepted by MICCAI 2024

  6. arXiv:2402.11211  [pdf, other

    eess.IV cs.CV

    Training-free image style alignment for self-adapting domain shift on handheld ultrasound devices

    Authors: Hongye Zeng, Ke Zou, Zhihao Chen, Yuchong Gao, Hongbo Chen, Haibin Zhang, Kang Zhou, Meng Wang, Rick Siow Mong Goh, Yong Liu, Chang Jiang, Rui Zheng, Huazhu Fu

    Abstract: Handheld ultrasound devices face usage limitations due to user inexperience and cannot benefit from supervised deep learning without extensive expert annotations. Moreover, the models trained on standard ultrasound device data are constrained by training data distribution and perform poorly when directly applied to handheld device data. In this study, we propose the Training-free Image Style Align… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  7. arXiv:2310.12111  [pdf, other

    eess.AS cs.AI

    DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification

    Authors: Yuanyuan Wang, Yang Zhang, Zhiyong Wu, Zhihan Yang, Tao Wei, Kun Zou, Helen Meng

    Abstract: Data augmentation is vital to the generalization ability and robustness of deep neural networks (DNNs) models. Existing augmentation methods for speaker verification manipulate the raw signal, which are time-consuming and the augmented samples lack diversity. In this paper, we present a novel difficulty-aware semantic augmentation (DASA) approach for speaker verification, which can generate divers… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted by ICASSP 2023

  8. Federated Uncertainty-Aware Aggregation for Fundus Diabetic Retinopathy Staging

    Authors: Meng Wang, Lianyu Wang, Xinxing Xu, Ke Zou, Yiming Qian, Rick Siow Mong Goh, Yong Liu, Huazhu Fu

    Abstract: Deep learning models have shown promising performance in the field of diabetic retinopathy (DR) staging. However, collaboratively training a DR staging model across multiple institutions remains a challenge due to non-iid data, client reliability, and confidence evaluation of the prediction. To address these issues, we propose a novel federated uncertainty-aware aggregation paradigm (FedUAA), whic… ▽ More

    Submitted 22 July, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Report number: 978-3-031-43894-3

    Journal ref: Medical Image Computing and Computer Assisted Intervention(MICCAI 2023)

  9. arXiv:2303.09790  [pdf, other

    eess.IV cs.CV

    Reliable Multimodality Eye Disease Screening via Mixture of Student's t Distributions

    Authors: Ke Zou, Tian Lin, Xuedong Yuan, Haoyu Chen, Xiao**g Shen, Meng Wang, Huazhu Fu

    Abstract: Multimodality eye disease screening is crucial in ophthalmology as it integrates information from diverse sources to complement their respective performances. However, the existing methods are weak in assessing the reliability of each unimodality, and directly fusing an unreliable modality may cause screening errors. To address this issue, we introduce a novel multimodality evidential fusion pipel… ▽ More

    Submitted 29 August, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: MICCAI 2023 (Early accept):11 pages, 4 figures

  10. arXiv:2302.08119  [pdf, other

    eess.IV cs.CV

    A Review of Uncertainty Estimation and its Application in Medical Imaging

    Authors: Ke Zou, Zhihao Chen, Xuedong Yuan, Xiao**g Shen, Meng Wang, Huazhu Fu

    Abstract: The use of AI systems in healthcare for the early screening of diseases is of great clinical importance. Deep learning has shown great promise in medical imaging, but the reliability and trustworthiness of AI systems limit their deployment in real clinical scenes, where patient safety is at stake. Uncertainty estimation plays a pivotal role in producing a confidence evaluation along with the predi… ▽ More

    Submitted 15 May, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: 11 pages, 3 figures, 3 tables

  11. arXiv:2301.00349  [pdf, other

    eess.IV cs.CV

    Towards Reliable Medical Image Segmentation by utilizing Evidential Calibrated Uncertainty

    Authors: Ke Zou, Yidi Chen, Ling Huang, Xuedong Yuan, Xiao**g Shen, Meng Wang, Rick Siow Mong Goh, Yong Liu, Huazhu Fu

    Abstract: Medical image segmentation is critical for disease diagnosis and treatment assessment. However, concerns regarding the reliability of segmentation regions persist among clinicians, mainly attributed to the absence of confidence assessment, robustness, and calibration to accuracy. To address this, we introduce DEviS, an easily implementable foundational model that seamlessly integrates into various… ▽ More

    Submitted 13 April, 2024; v1 submitted 1 January, 2023; originally announced January 2023.

    Comments: 34 pages, 11 figures

  12. arXiv:2212.00330   

    eess.IV cs.CV

    Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images

    Authors: Meng Wang, Kai Yu, Chun-Mei Feng, Ke Zou, Yanyu Xu, Qingquan Meng, Rick Siow Mong Goh, Yong Liu, Huazhu Fu

    Abstract: Focusing on the complicated pathological features, such as blurred boundaries, severe scale differences between symptoms, background noise interference, etc., in the task of retinal edema lesions joint segmentation from OCT images and enabling the segmentation results more reliable. In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network, which can provide accur… ▽ More

    Submitted 1 January, 2024; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Improving algorithm

  13. arXiv:2206.09309  [pdf, other

    eess.IV cs.CV

    TBraTS: Trusted Brain Tumor Segmentation

    Authors: Ke Zou, Xuedong Yuan, Xiao**g Shen, Meng Wang, Huazhu Fu

    Abstract: Despite recent improvements in the accuracy of brain tumor segmentation, the results still exhibit low levels of confidence and robustness. Uncertainty estimation is one effective way to change this situation, as it provides a measure of confidence in the segmentation results. In this paper, we propose a trusted brain tumor segmentation network which can generate robust segmentation results and re… ▽ More

    Submitted 28 July, 2022; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: 11 pages, 4 figures, Accepted by MICCAI 2022

  14. arXiv:2101.09967  [pdf

    physics.optics eess.SP

    Turbulence-Resilient Coherent Free-Space Optical Communications using Automatic Power-Efficient Pilot-Assisted Optoelectronic Beam Mixing of Many Modes

    Authors: Runzhou Zhang, Nanzhe Hu, Huibin Zhou, Kaiheng Zou, Xinzhou Su, Yiyu Zhou, Haoqian Song, Kai Pang, Hao Song, Amir Minoofar, Zhe Zhao, Cong Liu, Karapet Manukyan, Ahmed Almaiman, Brittany Lynn, Robert W. Boyd, Moshe Tur, Alan E. Willner

    Abstract: Atmospheric turbulence generally limits free-space optical (FSO) communications, and this problem is severely exacerbated when implementing highly sensitive and spectrally efficient coherent detection. Specifically, turbulence induces power coupling from the transmitted Gaussian mode to higher-order Laguerre-Gaussian (LG) modes, resulting in a significant decrease of the power that mixes with a si… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.