Showing 1–50 of 77 results for author: Ni, D

Searching in archive cs.
  1. arXiv:2407.00678  [pdf, other]

    eess.IV cs.CV

    A Review of Image Processing Methods in Prostate Ultrasound

    Authors: Haiqiao Wang, Hong Wu, Zhuoyuan Wang, Peiyan Yue, Dong Ni, Pheng-Ann Heng, Yi Wang

    Abstract: Prostate cancer (PCa) poses a significant threat to men's health, with early diagnosis being crucial for improving prognosis and reducing mortality rates. Transrectal ultrasound (TRUS) plays a vital role in the diagnosis and image-guided intervention of PCa. To provide physicians with more accurate and efficient computer-assisted diagnosis and interventions, many image processing algorithms in T…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.14098  [pdf, ps, other]

    cs.CV

    HeartBeat: Towards Controllable Echocardiography Video Synthesis with Multimodal Conditions-Guided Diffusion Models

    Authors: Xinrui Zhou, Yuhao Huang, Wufeng Xue, Haoran Dou, Jun Cheng, Han Zhou, Dong Ni

    Abstract: Echocardiography (ECHO) video is widely used for cardiac examination. In clinical practice, this procedure relies heavily on operator experience, which requires years of training and possibly the assistance of deep learning-based systems for enhanced accuracy and efficiency. However, it is challenging since acquiring sufficient customized data (e.g., abnormal cases) for novice training and deep model development…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by MICCAI 2024

  3. arXiv:2406.01154  [pdf, other]

    cs.CV

    UniUSNet: A Promptable Framework for Universal Ultrasound Disease Prediction and Tissue Segmentation

    Authors: Zehui Lin, Zhuoneng Zhang, Xindi Hu, Zhifan Gao, Xin Yang, Yue Sun, Dong Ni, Tao Tan

    Abstract: Ultrasound is a widely used imaging modality in clinical practice due to its low cost, portability, and safety. Current research in general AI for healthcare focuses on large language models and general segmentation models, with insufficient attention to solutions addressing both disease prediction and tissue segmentation. In this study, we propose a novel universal framework for ultrasound, namel…

    Submitted 20 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2403.16526  [pdf, other]

    cs.CV

    ModeTv2: GPU-accelerated Motion Decomposition Transformer for Pairwise Optimization in Medical Image Registration

    Authors: Haiqiao Wang, Zhuoyuan Wang, Dong Ni, Yi Wang

    Abstract: Deformable image registration plays a crucial role in medical imaging, aiding in disease diagnosis and image-guided interventions. Traditional iterative methods are slow, while deep learning (DL) accelerates solutions but faces usability and precision challenges. This study introduces a pyramid network with the enhanced motion decomposition Transformer (ModeTv2) operator, showcasing superior pairw…

    Submitted 25 March, 2024; originally announced March 2024.

  5. arXiv:2403.01412  [pdf, other]

    cs.CV eess.IV eess.SP

    LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition

    Authors: Lingfeng Liu, Dong Ni, Hangjie Yuan

    Abstract: Bandwidth constraints during signal acquisition frequently impede real-time detection applications. Hyperspectral data is a notable example, whose vast volume compromises real-time hyperspectral detection. To tackle this hurdle, we introduce a novel approach leveraging pre-acquisition modulation to reduce the acquisition volume. This modulation process is governed by a deep learning model, utilizi…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted to ICLR 2024

  6. arXiv:2402.11497  [pdf, other]

    cs.CV

    Thyroid ultrasound diagnosis improvement via multi-view self-supervised learning and two-stage pre-training

    Authors: Jian Wang, Xin Yang, Xiaohong Jia, Wufeng Xue, Rusi Chen, Yanlin Chen, Xiliang Zhu, Lian Liu, Yan Cao, Jianqiao Zhou, Dong Ni, Ning Gu

    Abstract: Thyroid nodule classification and segmentation in ultrasound images are crucial for computer-aided diagnosis; however, they face limitations owing to insufficient labeled data. In this study, we proposed a multi-view contrastive self-supervised method to improve thyroid nodule classification and segmentation performance with limited manual labels. Our method aligns the transverse and longitudinal…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: The article has been accepted by the journal of Computers in Biology and Medicine

  7. arXiv:2402.07452  [pdf, other]

    cs.CV cs.AI cs.LG

    TriAug: Out-of-Distribution Detection for Imbalanced Breast Lesion in Ultrasound

    Authors: Yinyu Ye, Shijing Chen, Dong Ni, Ruobing Huang

    Abstract: Different diseases, such as histological subtypes of breast lesions, have severely varying incidence rates. Even when trained with a substantial amount of in-distribution (ID) data, models often encounter out-of-distribution (OOD) samples belonging to unseen classes in clinical reality. To address this, we propose a novel framework built upon a long-tailed OOD detection task for breast ultrasound images.…

    Submitted 26 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  8. One-Stop Automated Diagnostic System for Carpal Tunnel Syndrome in Ultrasound Images Using Deep Learning

    Authors: Jiayu Peng, Jiajun Zeng, Manlin Lai, Ruobing Huang, Dong Ni, Zhenzhou Li

    Abstract: Objective: Ultrasound (US) examination has unique advantages in diagnosing carpal tunnel syndrome (CTS), while identifying the median nerve (MN) and diagnosing CTS depends heavily on the expertise of examiners. To alleviate this problem, we aimed to develop a one-stop automated CTS diagnosis system (OSA-CTSD) and evaluate its effectiveness as a computer-aided diagnostic tool. Methods: We combined r…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by Ultrasound in Medicine & Biology

    Journal ref: Ultrasound in Medicine & Biology, Volume 50, Issue 2, February 2024, Pages 304-314

  9. arXiv:2402.04921  [pdf, other]

    eess.IV cs.CV

    Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound

    Authors: Jiajun Zeng, Dong Ni, Ruobing Huang

    Abstract: Breast lesion segmentation from breast ultrasound (BUS) videos could assist in early diagnosis and treatment. Existing video object segmentation (VOS) methods usually require dense annotation, which is often inaccessible for medical datasets. Furthermore, they suffer from accumulative errors and a lack of explicit space-time awareness. In this work, we propose a novel two-shot training paradigm fo…

    Submitted 3 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures, 2 tables, accepted by ISBI 2024

    ACM Class: I.4.6

  10. arXiv:2312.12490  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    InstructVideo: Instructing Video Diffusion Models with Human Feedback

    Authors: Hangjie Yuan, Shiwei Zhang, Xiang Wang, Yujie Wei, Tao Feng, Yining Pan, Yingya Zhang, Ziwei Liu, Samuel Albanie, Dong Ni

    Abstract: Diffusion models have emerged as the de facto paradigm for video generation. However, their reliance on web-scale data of varied quality often yields results that are visually unappealing and misaligned with the textual prompts. To tackle this problem, we propose InstructVideo to instruct text-to-video diffusion models with human feedback by reward fine-tuning. InstructVideo has two key ingredient…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Project page: https://instructvideo.github.io/

  11. arXiv:2310.20271  [pdf, other]

    cs.CV

    From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation

    Authors: Ruxue Wen, Hangjie Yuan, Dong Ni, Wenbo Xiao, Yaoyao Wu

    Abstract: In medical image segmentation, domain generalization poses a significant challenge due to domain shifts caused by variations in data acquisition devices and other factors. These shifts are particularly pronounced in the most common scenario, which involves only single-source domain data due to privacy concerns. To address this, we draw inspiration from the self-supervised learning paradigm that ef…

    Submitted 2 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted to WACV 2024

  12. arXiv:2310.19293  [pdf, other]

    eess.IV cs.CV

    FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound

    Authors: Chaoyu Chen, Xin Yang, Yuhao Huang, Wenlong Shi, Yan Cao, Mingyuan Luo, Xindi Hu, Lei Zhue, Lequan Yu, Kejuan Yue, Yuanji Zhang, Yi Xiong, Dong Ni, Weijun Huang

    Abstract: Fetal pose estimation in 3D ultrasound (US) involves identifying a set of associated fetal anatomical landmarks. Its primary objective is to provide comprehensive information about the fetus through landmark connections, thus benefiting various critical applications, such as biometric measurements, plane localization, and fetal movement monitoring. However, accurately estimating the 3D fetal pose…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 16 pages, 11 figures, accepted by Medical Image Analysis (2023)

  13. arXiv:2309.17264  [pdf, other]

    cs.CV cs.AI cs.LG

    A Foundation Model for General Moving Object Segmentation in Medical Images

    Authors: Zhongnuo Yan, Tong Han, Yuhao Huang, Lian Liu, Han Zhou, Jiongquan Chen, Wenlong Shi, Yan Cao, Xin Yang, Dong Ni

    Abstract: Medical image segmentation aims to delineate the anatomical or pathological structures of interest, playing a crucial role in clinical diagnosis. A substantial amount of high-quality annotated data is crucial for constructing high-precision deep segmentation models. However, medical annotation is highly cumbersome and time-consuming, especially for medical videos or 3D volumes, due to the huge lab…

    Submitted 27 February, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 5 pages, 7 figures, 3 tables. This paper has been accepted by ISBI 2024

  14. arXiv:2308.13790  [pdf, other]

    eess.IV cs.CV

    FFPN: Fourier Feature Pyramid Network for Ultrasound Image Segmentation

    Authors: Chaoyu Chen, Xin Yang, Rusi Chen, Junxuan Yu, Liwei Du, Jian Wang, Xindi Hu, Yan Cao, Yingying Liu, Dong Ni

    Abstract: Ultrasound (US) image segmentation is an active research area that requires real-time and highly accurate analysis in many scenarios. The detect-to-segment (DTS) frameworks have been recently proposed to balance accuracy and efficiency. However, existing approaches may suffer from inadequate contour encoding or fail to effectively leverage the encoded results. In this paper, we introduce a novel F…

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 10 pages, 5 figures, Accepted by MLMI 2023

  15. arXiv:2308.13746  [pdf, other]

    cs.CV cs.AI cs.LG

    PE-MED: Prompt Enhancement for Interactive Medical Image Segmentation

    Authors: Ao Chang, Xing Tao, Xin Yang, Yuhao Huang, Xinrui Zhou, Jiajun Zeng, Ruobing Huang, Dong Ni

    Abstract: Interactive medical image segmentation refers to the accurate segmentation of the target of interest through interaction (e.g., click) between the user and the image. It has been widely studied in recent years as it is less dependent on abundant annotated data and more flexible than fully automated segmentation. However, current studies have not fully explored user-provided prompt information (e.g…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted by MICCAI MLMI 2023

  16. arXiv:2308.11635  [pdf, other]

    eess.SP cs.HC cs.LG

    Semi-Supervised Dual-Stream Self-Attentive Adversarial Graph Contrastive Learning for Cross-Subject EEG-based Emotion Recognition

    Authors: Weishan Ye, Zhiguo Zhang, Min Zhang, Fei Teng, Li Zhang, Linling Li, Gan Huang, Jianhong Wang, Dong Ni, Zhen Liang

    Abstract: Electroencephalography (EEG) is an objective tool for emotion recognition with promising applications. However, the scarcity of labeled data remains a major challenge in this field, limiting the widespread use of EEG-based emotion recognition. In this paper, a semi-supervised Dual-stream Self-Attentive Adversarial Graph Contrastive learning framework (termed as DS-AGC) is proposed to tackle the ch…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2304.06496

  17. arXiv:2308.09351  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    RLIPv2: Fast Scaling of Relational Language-Image Pre-training

    Authors: Hangjie Yuan, Shiwei Zhang, Xiang Wang, Samuel Albanie, Yining Pan, Tao Feng, Jianwen Jiang, Dong Ni, Yingya Zhang, Deli Zhao

    Abstract: Relational Language-Image Pre-training (RLIP) aims to align vision representations with relational texts, thereby advancing the capability of relational reasoning in computer vision tasks. However, hindered by the slow convergence of RLIPv1 architecture and the limited availability of existing scene graph data, scaling RLIPv1 is challenging. In this paper, we propose RLIPv2, a fast converging mode…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023. Code and models: https://github.com/JacobYuan7/RLIPv2

  18. arXiv:2308.08269  [pdf, other]

    eess.IV cs.CV

    OnUVS: Online Feature Decoupling Framework for High-Fidelity Ultrasound Video Synthesis

    Authors: Han Zhou, Dong Ni, Ao Chang, Xinrui Zhou, Rusi Chen, Yanlin Chen, Lian Liu, Jiamin Liang, Yuhao Huang, Tong Han, Zhe Liu, Deng-Ping Fan, Xin Yang

    Abstract: Ultrasound (US) imaging is indispensable in clinical practice. To diagnose certain diseases, sonographers must observe corresponding dynamic anatomic structures to gather comprehensive information. However, the limited availability of specific US video cases causes teaching difficulties in identifying corresponding diseases, which potentially impacts the detection rate of such cases. The synthesis…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 14 pages, 13 figures and 6 tables

  19. arXiv:2307.07807  [pdf, other]

    eess.IV cs.CV

    MUVF-YOLOX: A Multi-modal Ultrasound Video Fusion Network for Renal Tumor Diagnosis

    Authors: Junyu Li, Han Huang, Dong Ni, Wufeng Xue, Dongmei Zhu, Jun Cheng

    Abstract: Early diagnosis of renal cancer can greatly improve the survival rate of patients. Contrast-enhanced ultrasound (CEUS) is a cost-effective and non-invasive imaging technique and has become more and more frequently used for renal tumor diagnosis. However, the classification of benign and malignant renal tumors can still be very challenging due to the highly heterogeneous appearance of cancer and im…

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: MICCAI 2023

  20. arXiv:2306.16197  [pdf, other]

    cs.CV eess.IV

    Multi-IMU with Online Self-Consistency for Freehand 3D Ultrasound Reconstruction

    Authors: Mingyuan Luo, Xin Yang, Zhongnuo Yan, Junyu Li, Yuanji Zhang, Jiongquan Chen, Xindi Hu, Jikuan Qian, Jun Cheng, Dong Ni

    Abstract: Ultrasound (US) imaging is a popular tool in clinical diagnosis, offering safety, repeatability, and real-time capabilities. Freehand 3D US is a technique that provides a deeper understanding of scanned regions without increasing complexity. However, estimating elevation displacement and accumulation error remains challenging, making it difficult to infer the relative position using images alone.…

    Submitted 18 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted by MICCAI-2023

  21. arXiv:2306.14687  [pdf, other]

    eess.IV cs.CV

    GSMorph: Gradient Surgery for cine-MRI Cardiac Deformable Registration

    Authors: Haoran Dou, Ning Bi, Luyi Han, Yuhao Huang, Ritse Mann, Xin Yang, Dong Ni, Nishant Ravikumar, Alejandro F. Frangi, Yunzhi Huang

    Abstract: Deep learning-based deformable registration methods have been widely investigated in diverse medical applications. Learning-based deformable registration relies on weighted objective functions trading off registration accuracy and smoothness of the deformation field. Therefore, they inevitably require tuning the hyperparameter for optimal registration performance. Tuning the hyperparameters is hig…

    Submitted 20 July, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at MICCAI 2023

  22. ModeT: Learning Deformable Image Registration via Motion Decomposition Transformer

    Authors: Haiqiao Wang, Dong Ni, Yi Wang

    Abstract: The Transformer structures have been widely used in computer vision and have recently made an impact in the area of medical image registration. However, the use of Transformer in most registration networks is straightforward. These networks often merely use the attention mechanism to boost the feature learning as the segmentation networks do, but do not sufficiently design to be adapted for the re…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Early accepted by MICCAI 2023

  23. arXiv:2306.03497  [pdf, other]

    cs.CV

    Instructive Feature Enhancement for Dichotomous Medical Image Segmentation

    Authors: Lian Liu, Han Zhou, Jiongquan Chen, Sijing Liu, Wenlong Shi, Dong Ni, Deng-Ping Fan, Xin Yang

    Abstract: Deep neural networks have been widely applied in dichotomous medical image segmentation (DMIS) of many anatomical structures in several modalities, achieving promising performance. However, existing networks tend to struggle with task-specific, heavy and complex designs to improve accuracy. They provide little guidance on which feature channels would be more beneficial for segmentation, and that…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted by MICCAI 2023

  24. arXiv:2306.02548  [pdf, other]

    cs.CV

    Inflated 3D Convolution-Transformer for Weakly-supervised Carotid Stenosis Grading with Ultrasound Videos

    Authors: Xinrui Zhou, Yuhao Huang, Wufeng Xue, Xin Yang, Yuxin Zou, Qilong Ying, Yuanji Zhang, Jia Liu, Jie Ren, Dong Ni

    Abstract: Localization of the narrowest position of the vessel and corresponding vessel and remnant vessel delineation in carotid ultrasound (US) are essential for carotid stenosis grading (CSG) in clinical practice. However, the pipeline is time-consuming and tough due to the ambiguous boundaries of plaque and temporal variation. To automatize this procedure, a large number of manual delineations are usual…

    Submitted 12 June, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted by MICCAI 2023

  25. arXiv:2306.02544  [pdf, other]

    cs.CV

    Fourier Test-time Adaptation with Multi-level Consistency for Robust Classification

    Authors: Yuhao Huang, Xin Yang, Xiaoqiong Huang, Xinrui Zhou, Haozhe Chi, Haoran Dou, Xindi Hu, Jian Wang, Xuedong Deng, Dong Ni

    Abstract: Deep classifiers may encounter significant performance degradation when processing unseen testing data from varying centers, vendors, and protocols. Ensuring the robustness of deep models against these domain shifts is crucial for their widespread clinical application. In this study, we propose a novel approach called Fourier Test-time Adaptation (FTTA), which employs a dual-adaptation design to i…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted by MICCAI 2023

  26. arXiv:2304.14660  [pdf, other]

    eess.IV cs.CV cs.LG

    Segment Anything Model for Medical Images?

    Authors: Yuhao Huang, Xin Yang, Lian Liu, Han Zhou, Ao Chang, Xinrui Zhou, Rusi Chen, Junxuan Yu, Jiongquan Chen, Chaoyu Chen, Sijing Liu, Haozhe Chi, Xindi Hu, Kejuan Yue, Lei Li, Vicente Grau, Deng-Ping Fan, Fajin Dong, Dong Ni

    Abstract: The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging because of the complex modalities, fine anatomical structures, uncertain and complex object boundaries, and wide-range object scales. To fully validate SAM's perfo…

    Submitted 17 January, 2024; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: Accepted by Medical Image Analysis. 23 pages, 18 figures, 8 tables

  27. arXiv:2304.07036  [pdf, other]

    eess.IV cs.CV cs.LG

    Hierarchical Agent-based Reinforcement Learning Framework for Automated Quality Assessment of Fetal Ultrasound Video

    Authors: Sijing Liu, Qilong Ying, Shuangchi He, Xin Yang, Dong Ni, Ruobing Huang

    Abstract: Ultrasound is the primary modality to examine fetal growth during pregnancy, while the image quality could be affected by various factors. Quality assessment is essential for controlling the quality of ultrasound images to guarantee both the perceptual and diagnostic values. Existing automated approaches often require heavy structural annotations and the predictions may not necessarily be consiste…

    Submitted 14 April, 2023; originally announced April 2023.

  28. arXiv:2303.11531  [pdf]

    cs.RO

    A reproducible approach to merging behavior analysis based on High Definition Map

    Authors: Yang Li, Yang Liu, Daiheng Ni, Ang Ji, Linbo Li, Yajie Zou

    Abstract: Existing research on merging behavior generally prioritizes the application of various algorithms, but often overlooks the fine-grained processing and analysis of trajectories. This leads to the neglect of surrounding vehicle matching, opaque indicator definitions, and a reproducibility crisis. To address these gaps, this paper presents a reproducible approach to merging behavior analysis. Speci…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 19 pages, 21 figures

  29. arXiv:2210.10956  [pdf, other]

    cs.CV

    Non-Iterative Scribble-Supervised Learning with Pacing Pseudo-Masks for Medical Image Segmentation

    Authors: Zefan Yang, Di Lin, Dong Ni, Yi Wang

    Abstract: Scribble-supervised medical image segmentation tackles the limitation of sparse masks. Conventional approaches alternate between labeling pseudo-masks and optimizing network parameters. However, such an iterative two-stage paradigm is unwieldy and could be trapped in poor local optima since the networks undesirably regress to the erroneous pseudo-masks. To address these issues, we propose a non-iter…

    Submitted 28 September, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 12 pages, 8 figures

  30. arXiv:2209.01814  [pdf, other]

    cs.CV

    RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection

    Authors: Hangjie Yuan, Jianwen Jiang, Samuel Albanie, Tao Feng, Ziyuan Huang, Dong Ni, Mingqian Tang

    Abstract: The task of Human-Object Interaction (HOI) detection targets fine-grained visual parsing of humans interacting with their environment, enabling a broad range of applications. Prior work has demonstrated the benefits of effective architecture design and integration of relevant cues for more accurate HOI detection. However, the design of an appropriate pre-training strategy for this task remains und…

    Submitted 16 November, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022 as a Spotlight paper

  31. arXiv:2208.09881  [pdf, other]

    cs.CV

    Masked Video Modeling with Correlation-aware Contrastive Learning for Breast Cancer Diagnosis in Ultrasound

    Authors: Zehui Lin, Ruobing Huang, Dong Ni, Jiayi Wu, Baoming Luo

    Abstract: Breast cancer is one of the leading causes of cancer deaths in women. As the primary output of breast screening, breast ultrasound (US) video contains exclusive dynamic information for cancer diagnosis. However, training models for video analysis is non-trivial as it requires a voluminous dataset which is also expensive to annotate. Furthermore, the diagnosis of breast lesion faces unique challeng…

    Submitted 9 September, 2022; v1 submitted 21 August, 2022; originally announced August 2022.

    Comments: Accepted by MICCAI-REMIA oral 2022

  32. arXiv:2208.00386  [pdf, other]

    cs.RO cs.AI cs.CV

    Robotic Dough Shaping

    Authors: Jan Ondras, Di Ni, Xi Deng, Zeqi Gu, Henry Zheng, Tapomayukh Bhattacharjee

    Abstract: Robotic manipulation of deformable objects has gained great attention due to its wide applications including medical surgery, home assistance, and automatic food preparation. The ability to deform soft objects remains a great challenge for robots due to difficulties in defining the problem mathematically. In this paper, we address the problem of shaping a piece of dough-like deformable material into a…

    Submitted 5 October, 2022; v1 submitted 31 July, 2022; originally announced August 2022.

    Comments: To be published in International Conference on Control, Automation and Systems (ICCAS), 2022

  33. arXiv:2207.00496  [pdf, other]

    cs.CV cs.AI cs.LG

    Personalized Diagnostic Tool for Thyroid Cancer Classification using Multi-view Ultrasound

    Authors: Han Huang, Yijie Dong, Xiaohong Jia, Jianqiao Zhou, Dong Ni, Jun Cheng, Ruobing Huang

    Abstract: Over the past decades, the incidence of thyroid cancer has been increasing globally. Accurate and early diagnosis allows timely treatment and helps to avoid over-diagnosis. Clinically, a nodule is commonly evaluated from both transverse and longitudinal views using thyroid ultrasound. However, the appearance of the thyroid gland and lesions can vary dramatically across individuals. Identifying key…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  34. arXiv:2207.00476  [pdf, other]

    cs.CV cs.AI cs.LG

    Online Reflective Learning for Robust Medical Image Segmentation

    Authors: Yuhao Huang, Xin Yang, Xiaoqiong Huang, Jiamin Liang, Xinrui Zhou, Cheng Chen, Haoran Dou, Xindi Hu, Yan Cao, Dong Ni

    Abstract: Deep segmentation models often face failure risks when the testing image presents unseen distributions. Improving model robustness against these risks is crucial for the large-scale clinical application of deep models. In this study, inspired by the human learning cycle, we propose a novel online reflective learning framework (RefSeg) to improve segmentation robustness. Based on the reflection-on-…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  35. arXiv:2207.00475  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Agent with Tangent-based Formulation and Anatomical Perception for Standard Plane Localization in 3D Ultrasound

    Authors: Yuxin Zou, Haoran Dou, Yuhao Huang, Xin Yang, Jikuan Qian, Chaojiong Zhen, Xiaodan Ji, Nishant Ravikumar, Guoqiang Chen, Weijun Huang, Alejandro F. Frangi, Dong Ni

    Abstract: Standard plane (SP) localization is essential in routine clinical ultrasound (US) diagnosis. Compared to 2D US, 3D US can acquire multiple view planes in one scan and provide complete anatomy with the addition of the coronal plane. However, manually navigating SPs in 3D US is laborious and biased due to the orientation variability and huge search space. In this study, we introduce a novel reinforcemen…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  36. arXiv:2207.00474  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling

    Authors: Jiamin Liang, Xin Yang, Yuhao Huang, Kai Liu, Xinrui Zhou, Xindi Hu, Zehui Lin, Huanjia Luo, Yuanji Zhang, Yi Xiong, Dong Ni

    Abstract: Ultrasound (US) is widely used for its advantages of real-time imaging, freedom from radiation, and portability. In clinical practice, analysis and diagnosis often rely on US sequences rather than a single image to obtain dynamic anatomical information. This is challenging for novices to learn because practicing with adequate videos from patients is clinically impractical. In this paper, we propose a novel…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  37. arXiv:2207.00347  [pdf, other]

    cs.CV

    Fine-grained Correlation Loss for Regression

    Authors: Chaoyu Chen, Xin Yang, Ruobing Huang, Xindi Hu, Yankai Huang, Xiduo Lu, Xinrui Zhou, Mingyuan Luo, Yinyu Ye, Xue Shuang, Juzheng Miao, Yi Xiong, Dong Ni

    Abstract: Regression learning is classic and fundamental for medical image analysis. It provides the continuous mapping for many critical applications, such as attribute estimation, object detection, segmentation and non-rigid registration. However, previous studies mainly took case-wise criteria, such as the mean square error, as the optimization objectives. They ignored the very important population-wi…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  38. arXiv:2207.00177  [pdf, other]

    cs.CV eess.IV

    Deep Motion Network for Freehand 3D Ultrasound Reconstruction

    Authors: Mingyuan Luo, Xin Yang, Hongzhang Wang, Liwei Du, Dong Ni

    Abstract: Freehand 3D ultrasound (US) has important clinical value due to its low cost and unrestricted field of view. Recently, deep learning algorithms have removed its dependence on bulky and expensive external positioning devices. However, improving reconstruction accuracy is still hampered by difficult elevational displacement estimation and large cumulative drift. In this context, we propose a novel de…

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: Early accepted by MICCAI-2022

  39. arXiv:2204.06929  [pdf, other]

    eess.IV cs.CV cs.LG

    Sketch guided and progressive growing GAN for realistic and editable ultrasound image synthesis

    Authors: Jiamin Liang, Xin Yang, Yuhao Huang, Haoming Li, Shuangchi He, Xindi Hu, Zejian Chen, Wufeng Xue, Jun Cheng, Dong Ni

    Abstract: Ultrasound (US) imaging is widely used for anatomical structure inspection in clinical diagnosis. The training of new sonographers and deep learning based algorithms for US image analysis usually requires a large amount of data. However, obtaining and labeling large-scale US imaging data are not easy tasks, especially for diseases with low incidence. Realistic US image synthesis can alleviate this…

    Submitted 25 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by Medical Image Analysis (13 figures, 4 tables)

  40. arXiv:2204.06697  [pdf, other]

    cs.CV cs.AI cs.LG

    HASA: Hybrid Architecture Search with Aggregation Strategy for Echinococcosis Classification and Ovary Segmentation in Ultrasound Images

    Authors: Jikuan Qian, Rui Li, Xin Yang, Yuhao Huang, Mingyuan Luo, Zehui Lin, Wenhui Hong, Ruobing Huang, Haining Fan, Dong Ni, Jun Cheng

    Abstract: Different from handcrafted features, deep neural networks can automatically learn task-specific features from data. Due to this data-driven nature, they have achieved remarkable success in various areas. However, manual design and selection of suitable network architectures are time-consuming and require substantial effort of human experts. To address this problem, researchers have proposed neural…

    Submitted 20 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: 17 pages, 11 figures. Accepted by Expert Systems and Applications, 2022

  41. arXiv:2202.00259  [pdf, other]

    cs.CV

    Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics

    Authors: Hangjie Yuan, Mang Wang, Dong Ni, Liangpeng Xu

    Abstract: Human-Object Interaction (HOI) detection is an essential task to understand human-centric images from a fine-grained perspective. Although end-to-end HOI detection models thrive, their paradigm of parallel human/object detection and verb class prediction loses two-stage methods' merit: object-guided hierarchy. The object in one HOI triplet gives direct clues to the verb to be predicted. In this pa…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Accepted to AAAI2022

  42. arXiv:2201.05344  [pdf, other]

    eess.IV cs.CV cs.LG

    AWSnet: An Auto-weighted Supervision Attention Network for Myocardial Scar and Edema Segmentation in Multi-sequence Cardiac Magnetic Resonance Images

    Authors: Kai-Ni Wang, Xin Yang, Juzheng Miao, Lei Li, Jing Yao, Ping Zhou, Wufeng Xue, Guang-Quan Zhou, Xiahai Zhuang, Dong Ni

    Abstract: Multi-sequence cardiac magnetic resonance (CMR) provides essential pathology information (scar and edema) to diagnose myocardial infarction. However, automatic pathology segmentation can be challenging due to the difficulty of effectively exploring the underlying information from the multi-sequence CMR data. This paper aims to tackle the scar and edema segmentation from multi-sequence CMR with a n…

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: 19 pages, 10 figures, accepted by Medical Image Analysis

  43. arXiv:2201.00317  [pdf, other]

    eess.IV cs.CV

    Recurrent Feature Propagation and Edge Skip-Connections for Automatic Abdominal Organ Segmentation

    Authors: Zefan Yang, Di Lin, Dong Ni, Yi Wang

    Abstract: Automatic segmentation of abdominal organs in computed tomography (CT) images can support radiation therapy and image-guided surgery workflows. Developing such automatic solutions remains challenging mainly owing to complex organ interactions and blurry boundaries in CT images. To address these issues, we focus on effective spatial context modeling and explicit edge segmentation priors. Accordi…

    Submitted 19 May, 2023; v1 submitted 2 January, 2022; originally announced January 2022.

  44. arXiv:2112.11177  [pdf, other]

    cs.CV

    Generalizable Cross-modality Medical Image Segmentation via Style Augmentation and Dual Normalization

    Authors: Ziqi Zhou, Lei Qi, Xin Yang, Dong Ni, Yinghuan Shi

    Abstract: For medical image segmentation, imagine if a model was only trained using MR images in source domain, how about its performance to directly segment CT images in target domain? This setting, namely generalizable cross-modality segmentation, owning its clinical potential, is much more challenging than other related settings, e.g., domain adaptation. To achieve this goal, we in this paper propose a n…

    Submitted 28 March, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: Accepted by CVPR 2022

  45. Image-Guided Navigation of a Robotic Ultrasound Probe for Autonomous Spinal Sonography Using a Shadow-aware Dual-Agent Framework

    Authors: Keyu Li, Yangxin Xu, Jian Wang, Dong Ni, Li Liu, Max Q. -H. Meng

    Abstract: Ultrasound (US) imaging is commonly used to assist in the diagnosis and interventions of spine diseases, while the standardized US acquisitions performed by manually operating the probe require substantial experience and training of sonographers. In this work, we propose a novel dual-agent framework that integrates a reinforcement learning (RL) agent and a deep learning (DL) agent to jointly deter…

    Submitted 10 November, 2021; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted by IEEE Transactions on Medical Robotics and Bionics. Copyright may be transferred without notice, after which this version may no longer be accessible

    Journal ref: IEEE Transactions on Medical Robotics and Bionics (2021)

  46. arXiv:2109.06080  [pdf]

    cs.RO

    Pareto-optimal lane-changing motion planning in mixed traffic

    Authors: Yang Li, Linbo Li, Daiheng Ni

    Abstract: This paper applies the pareto-optimal concept to LC (lane-changing) motion planning in the presence of mixed traffic including manual and autonomous vehicles. Firstly, a multiobjective optimization problem is presented, in which the comfort, efficiency and safety of the LC vehicle and the surrounding vehicles are jointly modelled. Thereafter, the pareto-optimal solutions are obtained through emplo…

    Submitted 4 April, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:2108.05711

  47. arXiv:2108.11743  [pdf, other]

    cs.CV

    Spatio-Temporal Dynamic Inference Network for Group Activity Recognition

    Authors: Hangjie Yuan, Dong Ni, Mang Wang

    Abstract: Group activity recognition aims to understand the activity performed by a group of people. In order to solve it, modeling complex spatio-temporal interactions is the key. Previous methods are limited in reasoning on a predefined graph, which ignores the inherent person-specific interaction context. Moreover, they adopt inference schemes that are computationally expensive and easily result in the o…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV2021

  48. arXiv:2108.05711  [pdf]

    cs.RO

    Hierarchical automatic lane-changing motion planning: from self-optimum to local-optimum

    Authors: Yang Li, Linbo Li, Daiheng Ni, Wenxuang Wang

    Abstract: In order to minimize the impact of lane change (LC) maneuver on surrounding traffic environment, a hierarchical automatic LC algorithm that could realize local optimum has been proposed. This algorithm consists of a tactical layer planner and an operational layer controller. The former generates a local-optimum trajectory. The comfort, efficiency, and safety of the LC vehicle and its surrounding v…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: 24 pages, 16 figures

  49. arXiv:2108.05710  [pdf]

    cs.RO cs.PF eess.SY

    Exploration of lane-changing duration for heavy vehicles and passenger cars: a survival analysis approach

    Authors: Yang Li, Linbo Li, Daiheng Ni

    Abstract: Lane-changing (LC) behavior describes the lateral movement of the vehicle from the current-lane to the target-lane while proceeding forward. Among the many research directions, LC duration (LCD) measures the total time it takes for a vehicle to travel from the current lane to the target lane, which is an indispensable indicator to characterize the LC behavior. Although existing research has made s…

    Submitted 8 August, 2021; originally announced August 2021.

  50. arXiv:2108.05055  [pdf, other]

    cs.CV

    Statistical Dependency Guided Contrastive Learning for Multiple Labeling in Prenatal Ultrasound

    Authors: Shuangchi He, Zehui Lin, Xin Yang, Chaoyu Chen, Jian Wang, Xue Shuang, Ziwei Deng, Qin Liu, Yan Cao, Xiduo Lu, Ruobing Huang, Nishant Ravikumar, Alejandro Frangi, Yuanji Zhang, Yi Xiong, Dong Ni

    Abstract: Standard plane recognition plays an important role in prenatal ultrasound (US) screening. Automatically recognizing the standard plane along with the corresponding anatomical structures in US image can not only facilitate US image interpretation but also improve diagnostic efficiency. In this study, we build a novel multi-label learning (MLL) scheme to identify multiple standard planes and corresp…

    Submitted 20 May, 2022; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: Accepted by MICCAI-MLMI 2021