Skip to main content

Showing 1–50 of 97 results for author: Bai, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01027  [pdf, other

    cs.CV

    Blind Inversion using Latent Diffusion Priors

    Authors: Weimin Bai, Siyi Chen, Wenzheng Chen, He Sun

    Abstract: Diffusion models have emerged as powerful tools for solving inverse problems due to their exceptional ability to model complex prior distributions. However, existing methods predominantly assume known forward operators (i.e., non-blind), limiting their applicability in practical settings where acquiring such operators is costly. Additionally, many current approaches rely on pixel-space diffusion m… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.01014  [pdf, other

    cs.CV

    An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations

    Authors: Weimin Bai, Yifei Wang, Wenzheng Chen, He Sun

    Abstract: Diffusion models excel in solving imaging inverse problems due to their ability to model complex image priors. However, their reliance on large, clean datasets for training limits their practical use where clean data is scarce. In this paper, we propose EMDiffusion, an expectation-maximization (EM) approach to train diffusion models from corrupted observations. Our method alternates between recons… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Ya**g Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, **g Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  4. arXiv:2405.10246  [pdf, other

    eess.IV cs.CV

    A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts

    Authors: Xinru Zhang, Ni Ou, Berke Doga Basaran, Marco Visentin, Mengyun Qiao, Renyang Gu, Cheng Ouyang, Yaou Liu, Paul M. Matthew, Chuyang Ye, Wenjia Bai

    Abstract: Brain lesion segmentation plays an essential role in neurological research and diagnosis. As brain lesions can be caused by various pathological alterations, different types of brain lesions tend to manifest with different characteristics on different imaging modalities. Due to this complexity, brain lesion segmentation methods are often developed in a task-specific manner. A specific segmentation… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: The work has been early accepted by MICCAI 2024

  5. arXiv:2404.13388  [pdf

    eess.IV cs.CV cs.LG

    Diagnosis of Multiple Fundus Disorders Amidst a Scarcity of Medical Experts Via Self-supervised Machine Learning

    Authors: Yong Liu, Mengtian Kang, Shuo Gao, Chi Zhang, Ying Liu, Shiming Li, Yue Qi, Arokia Nathan, Wenjun Xu, Chenyu Tang, Edoardo Occhipinti, Mayinuer Yusufu, Ningli Wang, Weiling Bai, Luigi Occhipinti

    Abstract: Fundus diseases are major causes of visual impairment and blindness worldwide, especially in underdeveloped regions, where the shortage of ophthalmologists hinders timely diagnosis. AI-assisted fundus image analysis has several advantages, such as high accuracy, reduced workload, and improved accessibility, but it requires a large amount of expert-annotated data to build reliable models. To addres… ▽ More

    Submitted 23 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  6. arXiv:2404.13386  [pdf

    eess.IV cs.CV cs.LG

    SSVT: Self-Supervised Vision Transformer For Eye Disease Diagnosis Based On Fundus Images

    Authors: Jiaqi Wang, Mengtian Kang, Yong Liu, Chi Zhang, Ying Liu, Shiming Li, Yue Qi, Wenjun Xu, Chenyu Tang, Edoardo Occhipinti, Mayinuer Yusufu, Ningli Wang, Weiling Bai, Shuo Gao, Luigi G. Occhipinti

    Abstract: Machine learning-based fundus image diagnosis technologies trigger worldwide interest owing to their benefits such as reducing medical resource power and providing objective evaluation results. However, current methods are commonly based on supervised methods, bringing in a heavy workload to biomedical staff and hence suffering in expanding effective databases. To address this issue, in this artic… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: ISBI 2024

  7. arXiv:2403.17353  [pdf, other

    cs.RO cs.LG

    Multi-Objective Trajectory Planning with Dual-Encoder

    Authors: Beibei Zhang, Tian Xiang, Chentao Mao, Yuhua Zheng, Shuai Li, Haoyi Niu, Xiangming Xi, Wenyuan Bai, Feng Gao

    Abstract: Time-jerk optimal trajectory planning is crucial in advancing robotic arms' performance in dynamic tasks. Traditional methods rely on solving complex nonlinear programming problems, bringing significant delays in generating optimized trajectories. In this paper, we propose a two-stage approach to accelerate time-jerk optimal trajectory planning. Firstly, we introduce a dual-encoder based transform… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 6 pages, 7 figures, conference

  8. arXiv:2403.08808  [pdf, other

    cs.RO cs.AI

    A Bionic Data-driven Approach for Long-distance Underwater Navigation with Anomaly Resistance

    Authors: Songnan Yang, Xiaohui Zhang, Shiliang Zhang, Xuehui Ma, Wenqi Bai, Yushuai Li, Tingwen Huang

    Abstract: Various animals exhibit accurate navigation using environment cues. The Earth's magnetic field has been proved a reliable information source in long-distance fauna migration. Inspired by animal navigation, this work proposes a bionic and data-driven approach for long-distance underwater navigation. The proposed approach uses measured geomagnetic data for the navigation, and requires no GPS systems… ▽ More

    Submitted 6 February, 2024; originally announced March 2024.

  9. arXiv:2403.06659  [pdf, other

    eess.SP cs.AI cs.LG

    Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement

    Authors: Che Liu, Zhongwei Wan, Cheng Ouyang, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: Electrocardiograms (ECGs) are non-invasive diagnostic tools crucial for detecting cardiac arrhythmic diseases in clinical practice. While ECG Self-supervised Learning (eSSL) methods show promise in representation learning from unannotated ECG data, they often overlook the clinical knowledge that can be found in reports. This oversight and the requirement for annotated samples for downstream tasks… ▽ More

    Submitted 2 July, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted by ICML2024

  10. arXiv:2312.01529  [pdf, other

    cs.CV cs.CL cs.LG eess.IV

    T3D: Towards 3D Medical Image Understanding through Vision-Language Pre-training

    Authors: Che Liu, Cheng Ouyang, Yinda Chen, Cesar César Quilodrán-Casas, Lei Ma, Jie Fu, Yike Guo, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: Expert annotation of 3D medical image for downstream analysis is resource-intensive, posing challenges in clinical applications. Visual self-supervised learning (vSSL), though effective for learning visual invariance, neglects the incorporation of domain knowledge from medicine. To incorporate medical knowledge into visual representation learning, vision-language pre-training (VLP) has shown promi… ▽ More

    Submitted 5 December, 2023; v1 submitted 3 December, 2023; originally announced December 2023.

  11. arXiv:2312.01522  [pdf, other

    cs.CV cs.LG

    G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training

    Authors: Che Liu, Cheng Ouyang, Sibo Cheng, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: Recently, medical vision-language pre-training (VLP) has reached substantial progress to learn global visual representation from medical images and their paired radiology reports. However, medical imaging tasks in real world usually require finer granularity in visual features. These tasks include visual localization tasks (e.g., semantic segmentation, object detection) and visual grounding task.… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  12. arXiv:2310.07644  [pdf, other

    cs.AI cs.CL cs.LG

    Rethinking the BERT-like Pretraining for DNA Sequences

    Authors: Chaoqi Liang, Weiqiang Bai, Lifeng Qiao, Yuchen Ren, Jianle Sun, Peng Ye, Hongliang Yan, Xinzhu Ma, Wangmeng Zuo, Wanli Ouyang

    Abstract: With the success of large-scale pretraining in NLP, there is an increasing trend of applying it to the domain of life sciences. In particular, pretraining methods based on DNA sequences have garnered growing attention due to their potential to capture generic information about genes. However, existing pretraining methods for DNA sequences largely rely on direct adoptions of BERT pretraining from N… ▽ More

    Submitted 11 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  13. arXiv:2310.07355  [pdf, other

    cs.CV cs.LG

    IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-training

    Authors: Che Liu, Sibo Cheng, Miao**g Shi, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: In the field of medical Vision-Language Pre-training (VLP), significant efforts have been devoted to deriving text and image features from both clinical reports and associated medical images. However, most existing methods may have overlooked the opportunity in leveraging the inherent hierarchical structure of clinical reports, which are generally split into `findings' for descriptive content and… ▽ More

    Submitted 1 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Under Review

  14. arXiv:2310.07027  [pdf, other

    cs.CV cs.LG

    Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images

    Authors: Che Liu, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: Medical Vision-Language Pre-training (VLP) learns representations jointly from medical images and paired radiology reports. It typically requires large-scale paired image-text datasets to achieve effective pre-training for both the image encoder and text encoder. The advent of text-guided generative models raises a compelling question: Can VLP be implemented solely with synthetic images generated… ▽ More

    Submitted 30 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted by CVPR 2024 Workshop Data Curation and Augmentation in Enhancing Medical Imaging Applications

  15. arXiv:2309.14306  [pdf, other

    eess.IV cs.CV

    DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning

    Authors: Qingjie Meng, Wenjia Bai, Declan P O'Regan, and Daniel Rueckert

    Abstract: 3D motion estimation from cine cardiac magnetic resonance (CMR) images is important for the assessment of cardiac function and the diagnosis of cardiovascular diseases. Current state-of-the art methods focus on estimating dense pixel-/voxel-wise motion fields in image space, which ignores the fact that motion estimation is only relevant and useful within the anatomical objects of interest, e.g., t… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  16. arXiv:2309.10836  [pdf, other

    cs.CV

    CMRxRecon: An open cardiac MRI dataset for the competition of accelerated image reconstruction

    Authors: Chengyan Wang, Jun Lyu, Shuo Wang, Chen Qin, Kunyuan Guo, Xinyu Zhang, Xiaotong Yu, Yan Li, Fanwen Wang, Jianhua **, Zhang Shi, Ziqiang Xu, Yapeng Tian, Sha Hua, Zhensen Chen, Meng Liu, Mengting Sun, Xutong Kuang, Kang Wang, Haoran Wang, Hao Li, Yinghua Chu, Guang Yang, Wenjia Bai, Xiahai Zhuang , et al. (3 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (CMR) has emerged as a valuable diagnostic tool for cardiac diseases. However, a limitation of CMR is its slow imaging speed, which causes patient discomfort and introduces artifacts in the images. There has been growing interest in deep learning-based CMR imaging algorithms that can reconstruct high-quality images from highly under-sampled k-space data. However,… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 14 pages, 8 figures

  17. arXiv:2308.09026  [pdf, ps, other

    eess.IV cs.CV cs.LG

    LesionMix: A Lesion-Level Data Augmentation Method for Medical Image Segmentation

    Authors: Berke Doga Basaran, Weitong Zhang, Mengyun Qiao, Bernhard Kainz, Paul M. Matthews, Wenjia Bai

    Abstract: Data augmentation has become a de facto component of deep learning-based medical image segmentation methods. Most data augmentation techniques used in medical imaging focus on spatial and intensity transformations to improve the diversity of training images. They are often designed at the image level, augmenting the full image, and do not pay attention to specific abnormalities within the image. H… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 13 pages, 5 figures, 4 tables, MICCAI DALI Workshop 2023

  18. arXiv:2308.08465  [pdf, other

    eess.IV cs.CV cs.LG

    Hierarchical Uncertainty Estimation for Medical Image Segmentation Networks

    Authors: Xinyu Bai, Wenjia Bai

    Abstract: Learning a medical image segmentation model is an inherently ambiguous task, as uncertainties exist in both images (noise) and manual annotations (human errors and bias) used for model training. To build a trustworthy image segmentation model, it is important to not just evaluate its performance but also estimate the uncertainty of the model prediction. Most state-of-the-art image segmentation net… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures

  19. arXiv:2307.08347  [pdf, other

    cs.CV cs.AI cs.LG

    M-FLAG: Medical Vision-Language Pre-training with Frozen Language Models and Latent Space Geometry Optimization

    Authors: Che Liu, Sibo Cheng, Chen Chen, Mengyun Qiao, Weitong Zhang, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: Medical vision-language models enable co-learning and integrating features from medical imaging and clinical text. However, these models are not easy to train and the latent representation space can be complex. Here we propose a novel way for pre-training and regularising medical vision-language models. The proposed method, named Medical vision-language pre-training with Frozen language models and… ▽ More

    Submitted 19 July, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI 2023

  20. arXiv:2306.16738  [pdf, other

    cs.LG cs.CR cs.GT

    Towards Optimal Randomized Strategies in Adversarial Example Game

    Authors: Jiahao Xie, Chao Zhang, Weijie Liu, Wensong Bai, Hui Qian

    Abstract: The vulnerability of deep neural network models to adversarial example attacks is a practical challenge in many artificial intelligence applications. A recent line of work shows that the use of randomization in adversarial training is the key to find optimal strategies against adversarial example attacks. However, in a fully randomized setting where both the defender and the attacker can use rando… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: Extended version of paper https://doi.org/10.1609/aaai.v37i9.26247 which appeared in AAAI 2023

  21. arXiv:2306.06637  [pdf, other

    cs.LG

    PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm

    Authors: Wensong Bai, Chao Zhang, Yichao Fu, Lingwei Peng, Hui Qian, Bin Dai

    Abstract: In this paper, we propose the first fully push-forward-based Distributional Reinforcement Learning algorithm, called Push-forward-based Actor-Critic EncourageR (PACER). Specifically, PACER establishes a stochastic utility value policy gradient theorem and simultaneously leverages the push-forward operator in the construction of both the actor and the critic. Moreover, based on maximum mean discrep… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

  22. arXiv:2301.13098  [pdf, other

    eess.IV cs.CV cs.LG

    CHeart: A Conditional Spatio-Temporal Generative Model for Cardiac Anatomy

    Authors: Mengyun Qiao, Shuo Wang, Huaqi Qiu, Antonio de Marvao, Declan P. O'Regan, Daniel Rueckert, Wenjia Bai

    Abstract: Two key questions in cardiac image analysis are to assess the anatomy and motion of the heart from images; and to understand how they are associated with non-imaging clinical factors such as gender, age and diseases. While the first question can often be addressed by image segmentation and motion tracking algorithms, our capability to model and to answer the second question is still limited. In th… ▽ More

    Submitted 30 November, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE Transactions on Medical Imaging

  23. arXiv:2210.06385  [pdf, other

    eess.IV cs.CV physics.med-ph

    The Extreme Cardiac MRI Analysis Challenge under Respiratory Motion (CMRxMotion)

    Authors: Shuo Wang, Chen Qin, Chengyan Wang, Kang Wang, Haoran Wang, Chen Chen, Cheng Ouyang, Xutong Kuang, Chengliang Dai, Yuanhan Mo, Zhang Shi, Chenchen Dai, Xinrong Chen, He Wang, Wenjia Bai

    Abstract: The quality of cardiac magnetic resonance (CMR) imaging is susceptible to respiratory motion artifacts. The model robustness of automated segmentation techniques in face of real-world respiratory motion artifacts is unclear. This manuscript describes the design of extreme cardiac MRI analysis challenge under respiratory motion (CMRxMotion Challenge). The challenge aims to establish a public benchm… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Summary of CMRxMotion Challenge Design

  24. arXiv:2210.05740  [pdf, other

    cs.LG cs.AI math.OC

    Stochastic Constrained DRO with a Complexity Independent of Sample Size

    Authors: Qi Qi, Jiameng Lyu, Kung sik Chan, Er Wei Bai, Tianbao Yang

    Abstract: Distributionally Robust Optimization (DRO), as a popular method to train robust models against distribution shift between training and test sets, has received tremendous attention in recent years. In this paper, we propose and analyze stochastic algorithms that apply to both non-convex and convex losses for solving Kullback Leibler divergence constrained DRO problem. Compared with existing methods… ▽ More

    Submitted 16 August, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: 37 pages, 16 figures

    Journal ref: Transactions on Machine Learning Research, 2023

  25. arXiv:2209.02004  [pdf, other

    eess.IV cs.CV cs.LG

    Mesh-based 3D Motion Tracking in Cardiac MRI using Deep Learning

    Authors: Qingjie Meng, Wenjia Bai, Tianrui Liu, Declan P O'Regan, Daniel Rueckert

    Abstract: 3D motion estimation from cine cardiac magnetic resonance (CMR) images is important for the assessment of cardiac function and diagnosis of cardiovascular diseases. Most of the previous methods focus on estimating pixel-/voxel-wise motion fields in the full image space, which ignore the fact that motion estimation is mainly relevant and useful within the object of interest, e.g., the heart. In thi… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

  26. arXiv:2208.13146  [pdf, other

    eess.IV cs.CV cs.LG

    Generative Modelling of the Ageing Heart with Cross-Sectional Imaging and Clinical Data

    Authors: Mengyun Qiao, Berke Doga Basaran, Huaqi Qiu, Shuo Wang, Yi Guo, Yuanyuan Wang, Paul M. Matthews, Daniel Rueckert, Wenjia Bai

    Abstract: Cardiovascular disease, the leading cause of death globally, is an age-related disease. Understanding the morphological and functional changes of the heart during ageing is a key scientific question, the answer to which will help us define important risk factors of cardiovascular disease and monitor disease progression. In this work, we propose a novel conditional generative model to describe the… ▽ More

    Submitted 10 October, 2022; v1 submitted 28 August, 2022; originally announced August 2022.

  27. CROLoss: Towards a Customizable Loss for Retrieval Models in Recommender Systems

    Authors: Yongxiang Tang, Wentao Bai, Guilin Li, Xialong Liu, Yu Zhang

    Abstract: In large-scale recommender systems, retrieving top N relevant candidates accurately with resource constrain is crucial. To evaluate the performance of such retrieval models, Recall@N, the frequency of positive samples being retrieved in the top N ranking, is widely used. However, most of the conventional loss functions for retrieval models such as softmax cross-entropy and pairwise comparison meth… ▽ More

    Submitted 9 November, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: 9 pages, 5 figures. Accepted by by CIKM 2022

  28. arXiv:2208.02870  [pdf, other

    cs.CV

    Improved post-hoc probability calibration for out-of-domain MRI segmentation

    Authors: Cheng Ouyang, Shuo Wang, Chen Chen, Zeju Li, Wenjia Bai, Bernhard Kainz, Daniel Rueckert

    Abstract: Probability calibration for deep models is highly desirable in safety-critical applications such as medical imaging. It makes output probabilities of deep networks interpretable, by aligning prediction probability with the actual accuracy in test data. In image segmentation, well-calibrated probabilities allow radiologists to identify regions where model-predicted segmentations are unreliable. The… ▽ More

    Submitted 14 September, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: Accepted for UNSURE workshop at MICCAI 2022

  29. arXiv:2208.02135  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Subject-Specific Lesion Generation and Pseudo-Healthy Synthesis for Multiple Sclerosis Brain Images

    Authors: Berke Doga Basaran, Mengyun Qiao, Paul M. Matthews, Wenjia Bai

    Abstract: Understanding the intensity characteristics of brain lesions is key for defining image-based biomarkers in neurological studies and for predicting disease burden and outcome. In this work, we present a novel foreground-based generative method for modelling the local lesion characteristics that can both generate synthetic lesions on healthy images and synthesize subject-specific pseudo-healthy imag… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 13 pages, 6 figures, 2022 MICCAI SASHIMI (Simulation and Synthesis in Medical Imaging) Workshop paper

  30. arXiv:2208.00034  [pdf, other

    eess.IV cs.CV cs.LG

    MulViMotion: Shape-aware 3D Myocardial Motion Tracking from Multi-View Cardiac MRI

    Authors: Qingjie Meng, Chen Qin, Wenjia Bai, Tianrui Liu, Antonio de Marvao, Declan P O'Regan, Daniel Rueckert

    Abstract: Recovering the 3D motion of the heart from cine cardiac magnetic resonance (CMR) imaging enables the assessment of regional myocardial function and is important for understanding and analyzing cardiovascular disease. However, 3D cardiac motion estimation is challenging because the acquired cine CMR images are usually 2D slices which limit the accurate estimation of through-plane motion. To address… ▽ More

    Submitted 29 July, 2022; originally announced August 2022.

  31. arXiv:2207.06799  [pdf, other

    cs.CV

    MMOTU: A Multi-Modality Ovarian Tumor Ultrasound Image Dataset for Unsupervised Cross-Domain Semantic Segmentation

    Authors: Qi Zhao, Shuchang Lyu, Wenpei Bai, Linghan Cai, Binghao Liu, Guangliang Cheng, Mei**g Wu, Xiubo Sang, Min Yang, Lijiang Chen

    Abstract: Ovarian cancer is one of the most harmful gynecological diseases. Detecting ovarian tumors in early stage with computer-aided techniques can efficiently decrease the mortality rate. With the improvement of medical treatment standard, ultrasound images are widely applied in clinical treatment. However, recent notable methods mainly focus on single-modality ultrasound ovarian tumor segmentation or r… ▽ More

    Submitted 30 November, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: code: https://github.com/cv516Buaa/MMOTU_DS2Net paper:18 pages, 12 figures, 11 tables, 16 formulas

  32. arXiv:2206.12970  [pdf, ps, other

    cs.CR

    Cost-Asymmetric Memory Hard Password Hashing

    Authors: Wenjie Bai, Jeremiah Blocki, Mohammad Hassan Ameri

    Abstract: In the past decade, billions of user passwords have been exposed to the dangerous threat of offline password cracking attacks. An offline attacker who has stolen the cryptographic hash of a user's password can check as many password guesses as s/he likes limited only by the resources that s/he is willing to invest to crack the password. Pepper and key-stretching are two techniques that have been p… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

  33. arXiv:2206.03830  [pdf, other

    eess.IV cs.CV

    Generative Myocardial Motion Tracking via Latent Space Exploration with Biomechanics-informed Prior

    Authors: Chen Qin, Shuo Wang, Chen Chen, Wenjia Bai, Daniel Rueckert

    Abstract: Myocardial motion and deformation are rich descriptors that characterize cardiac function. Image registration, as the most commonly used technique for myocardial motion tracking, is an ill-posed inverse problem which often requires prior assumptions on the solution space. In contrast to most existing approaches which impose explicit generic regularization such as smoothness, in this work we propos… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Under review

  34. arXiv:2206.01737  [pdf, other

    eess.IV cs.CV q-bio.QM

    MaxStyle: Adversarial Style Composition for Robust Medical Image Segmentation

    Authors: Chen Chen, Zeju Li, Cheng Ouyang, Matt Sinclair, Wenjia Bai, Daniel Rueckert

    Abstract: Convolutional neural networks (CNNs) have achieved remarkable segmentation accuracy on benchmark datasets where training and test sets are from the same domain, yet their performance can degrade significantly on unseen domains, which hinders the deployment of CNNs in many clinical scenarios. Most existing works improve model out-of-domain (OOD) robustness by collecting multi-domain datasets for tr… ▽ More

    Submitted 19 June, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: Early accepted by MICCAI 2022 (Camera-ready version)

  35. arXiv:2206.01014  [pdf, other

    cs.CV cs.AI

    Suggestive Annotation of Brain MR Images with Gradient-guided Sampling

    Authors: Chengliang Dai, Shuo Wang, Yuanhan Mo, Elsa Angelini, Yike Guo, Wenjia Bai

    Abstract: Machine learning has been widely adopted for medical image analysis in recent years given its promising performance in image segmentation and classification tasks. The success of machine learning, in particular supervised learning, depends on the availability of manually annotated datasets. For medical imaging applications, such annotated datasets are not easy to acquire, it takes a substantial am… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: Manuscript accepted by MedIA

  36. arXiv:2205.15941  [pdf, other

    eess.IV cs.CV

    Memory-efficient Segmentation of High-resolution Volumetric MicroCT Images

    Authors: Yuan Wang, Laura Blackie, Irene Miguel-Aliaga, Wenjia Bai

    Abstract: In recent years, 3D convolutional neural networks have become the dominant approach for volumetric medical image segmentation. However, compared to their 2D counterparts, 3D networks introduce substantially more training parameters and higher requirement for the GPU memory. This has become a major limiting factor for designing and training 3D networks for high-resolution volumetric images. In this… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: The paper is accepted to MIDL 2022. The codes are available at https://github.com/Virgil3706/Memory-efficient-U-net

  37. arXiv:2112.10074  [pdf, other

    eess.IV cs.CV cs.LG

    QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results

    Authors: Raghav Mehta, Angelos Filos, Ujjwal Baid, Chiharu Sako, Richard McKinley, Michael Rebsamen, Katrin Datwyler, Raphael Meier, Piotr Radojewski, Gowtham Krishnan Murugesan, Sahil Nalawade, Chandan Ganesh, Ben Wagner, Fang F. Yu, Baowei Fei, Ananth J. Madhuranthakam, Joseph A. Maldjian, Laura Daza, Catalina Gomez, Pablo Arbelaez, Chengliang Dai, Shuo Wang, Hadrien Reynaud, Yuan-han Mo, Elsa Angelini , et al. (67 additional authors not shown)

    Abstract: Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying… ▽ More

    Submitted 23 August, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA): https://www.melba-journal.org/papers/2022:026.html

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  38. arXiv:2111.12525  [pdf, other

    cs.CV

    Causality-inspired Single-source Domain Generalization for Medical Image Segmentation

    Authors: Cheng Ouyang, Chen Chen, Surui Li, Zeju Li, Chen Qin, Wenjia Bai, Daniel Rueckert

    Abstract: Deep learning models usually suffer from domain shift issues, where models trained on one source domain do not generalize well to other unseen domains. In this work, we investigate the single-source domain generalization problem: training a deep network that is robust to unseen domains, under the condition that training data is only available from one source domain, which is common in medical imag… ▽ More

    Submitted 21 April, 2023; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: This is an early, non-peer-reviewed version. For the final peer-reviewed full version that has been substantially revised, please find: https://ieeexplore.ieee.org/document/9961940. Please find the code at https://github.com/cheng-01037/Causality-Medical-Image-Domain-Generalization

  39. DeepMCAT: Large-Scale Deep Clustering for Medical Image Categorization

    Authors: Turkay Kart, Wenjia Bai, Ben Glocker, Daniel Rueckert

    Abstract: In recent years, the research landscape of machine learning in medical imaging has changed drastically from supervised to semi-, weakly- or unsupervised methods. This is mainly due to the fact that ground-truth labels are time-consuming and expensive to obtain manually. Generating labels from patient metadata might be feasible but it suffers from user-originated errors which introduce biases. In t… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: Accepted for the DALI workshop at MICCAI 2021 (full oral)

  40. arXiv:2109.13324  [pdf, other

    cs.RO

    Multiple-Pilot Collaboration for Advanced Remote Intervention using Reinforcement Learning

    Authors: Ziwei Wang, Weibang Bai, Zhang Chen, Bo Xiao, Bin Liang, Eric M. Yeatman

    Abstract: The traditional master-slave teleoperation relies on human expertise without correction mechanisms, resulting in excessive physical and mental workloads. To address these issues, a co-pilot-in-the-loop control framework is investigated for cooperative teleoperation. A deep deterministic policy gradient(DDPG) based agent is realised to effectively restore the master operators' intents without prior… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 6 pages, 7 figures, accepted by IECON 2021

  41. arXiv:2108.12007  [pdf, other

    cs.RO

    Dual-arm Coordinated Manipulation for Object Twisting with Human Intelligence

    Authors: Weibang Bai, Ningshan Zhang, Baoru Huang, Ziwei Wang, Francesco Cursi, Ya-Yen Tsai, Bo Xiao, Eric Yeatman

    Abstract: Robotic dual-arm twisting is a common but very challenging task in both industrial production and daily services, as it often requires dexterous collaboration, a large scale of end-effector rotating, and good adaptivity for object manipulation. Meanwhile, safety and efficiency are preliminary concerns for robotic dual-arm coordinated manipulation. Thus, the normally adopted fully automated task ex… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted by IEEE SMC 2021, 7 pages, 7 figures

  42. arXiv:2108.03429  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    Enhancing MR Image Segmentation with Realistic Adversarial Data Augmentation

    Authors: Chen Chen, Chen Qin, Cheng Ouyang, Zeju Li, Shuo Wang, Huaqi Qiu, Liang Chen, Giacomo Tarroni, Wenjia Bai, Daniel Rueckert

    Abstract: The success of neural networks on medical image segmentation tasks typically relies on large labeled datasets for model training. However, acquiring and manually labeling a large medical image set is resource-intensive, expensive, and sometimes impractical due to data sharing and privacy issues. To address this challenge, we propose AdvChain, a generic adversarial data augmentation framework, aimi… ▽ More

    Submitted 19 June, 2022; v1 submitted 7 August, 2021; originally announced August 2021.

    Comments: Under review

  43. arXiv:2107.07975  [pdf, other

    eess.IV cs.CV

    Joint Semi-supervised 3D Super-Resolution and Segmentation with Mixed Adversarial Gaussian Domain Adaptation

    Authors: Nicolo Savioli, Antonio de Marvao, Wenjia Bai, Shuo Wang, Stuart A. Cook, Calvin W. L. Chin, Daniel Rueckert, Declan P. O'Regan

    Abstract: Optimising the analysis of cardiac structure and function requires accurate 3D representations of shape and motion. However, techniques such as cardiac magnetic resonance imaging are conventionally limited to acquiring contiguous cross-sectional slices with low through-plane resolution and potential inter-slice spatial misalignment. Super-resolution in medical imaging aims to increase the resoluti… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  44. arXiv:2107.05625  [pdf, other

    cs.RO

    Kinematic Parameter Optimization of a Miniaturized Surgical Instrument Based on Dexterous Workspace Determination

    Authors: Xin Zhi, Weibang Bai, Eric M. Yeatman

    Abstract: Miniaturized instruments are highly needed for robot assisted medical healthcare and treatment, especially for less invasive surgery as it empowers more flexible access to restricted anatomic intervention. But the robotic design is more challenging due to the contradictory needs of miniaturization and the capability of manipulating with a large dexterous workspace. Thus, kinematic parameter optimi… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: IEEE ICARM 2021, Best Paper Award Finalist, 7 pages, 10 figures

  45. arXiv:2107.03887  [pdf, other

    eess.IV cs.CV

    Joint Motion Correction and Super Resolution for Cardiac Segmentation via Latent Optimisation

    Authors: Shuo Wang, Chen Qin, Nicolo Savioli, Chen Chen, Declan O'Regan, Stuart Cook, Yike Guo, Daniel Rueckert, Wenjia Bai

    Abstract: In cardiac magnetic resonance (CMR) imaging, a 3D high-resolution segmentation of the heart is essential for detailed description of its anatomical structures. However, due to the limit of acquisition duration and respiratory/cardiac motion, stacks of multi-slice 2D images are acquired in clinical routine. The segmentation of these images provides a low-resolution representation of cardiac anatomy… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: The paper is early accepted to MICCAI 2021. The codes are available at https://github.com/shuowang26/SRHeart

  46. arXiv:2107.01079  [pdf, other

    cs.CV cs.AI cs.LG q-bio.QM

    Cooperative Training and Latent Space Data Augmentation for Robust Medical Image Segmentation

    Authors: Chen Chen, Kerstin Hammernik, Cheng Ouyang, Chen Qin, Wenjia Bai, Daniel Rueckert

    Abstract: Deep learning-based segmentation methods are vulnerable to unforeseen data distribution shifts during deployment, e.g. change of image appearances or contrasts caused by different scanners, unexpected imaging artifacts etc. In this paper, we present a cooperative framework for training image segmentation models and a latent space augmentation method for generating hard examples. Both contributions… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: MICCAI 2021

  47. arXiv:2104.07195  [pdf, other

    cs.CR cs.AI cs.LG

    Discover the Hidden Attack Path in Multi-domain Cyberspace Based on Reinforcement Learning

    Authors: Lei Zhang, Wei Bai, Wei Li, Shiming Xia, Qibin Zheng

    Abstract: In this work, we present a learning-based approach to analysis cyberspace security configuration. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of agents as attackers, our method becomes better at discovering hidden attack paths for previously methods, especially in multi-domain cyberspace. T… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: 12 pages, 2 figures, 3 tables. arXiv admin note: substantial text overlap with arXiv:2007.04614

    MSC Class: 68T01 ACM Class: I.2.0

  48. arXiv:2103.02750  [pdf

    cs.RO

    Multiple-Channel Real Time Filtering for a Myoelectric Prosthetic Hand-Arm Robot System

    Authors: Weibang Bai, Yinlai Jiang, Hiroshi Yokoi

    Abstract: On the base of the developed master-slave prosthetic hand-arm robot system, which is controlled mainly based on signals obtained from bending sensors fixed on the data glove, the first idea deduced was to develop and add a multi-dimensional filter into the original control system to make the control signals cleaner and more stable at real time. By going further, a second new idea was also proposed… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 14 pages, 12 figures

  49. arXiv:2101.10374  [pdf, ps, other

    cs.CR cs.GT

    DAHash: Distribution Aware Tuning of Password Hashing Costs

    Authors: Wenjie Bai, Jeremiah Blocki

    Abstract: An attacker who breaks into an authentication server and steals all of the cryptographic password hashes is able to mount an offline-brute force attack against each user's password. Offline brute-force attacks against passwords are increasingly commonplace and the danger is amplified by the well documented human tendency to select low-entropy password and/or reuse these passwords across multiple a… ▽ More

    Submitted 30 January, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: 25 pages, 15 figures, Financial Crypto 2021

  50. arXiv:2012.15616  [pdf, other

    cs.AI cs.LG

    Quantitative Evaluations on Saliency Methods: An Experimental Study

    Authors: Xiao-Hui Li, Yuhan Shi, Haoyang Li, Wei Bai, Yuanwei Song, Caleb Chen Cao, Lei Chen

    Abstract: It has been long debated that eXplainable AI (XAI) is an important topic, but it lacks rigorous definition and fair metrics. In this paper, we briefly summarize the status quo of the metrics, along with an exhaustive experimental study based on them, including faithfulness, localization, false-positives, sensitivity check, and stability. With the experimental results, we conclude that among all th… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 14 pages, 16 figures